Abstract
One hundred encoding proteins were sampled from SARS-coronavirus (SARS-CoV, NC_004718) and six other coronaviruses, and 23 variables were selected from 172 candidate variables by stepwise multiple regression (SMR). The resulting multiple linear regression (MLR) model gave good results, with a quantitative modelling correlation coefficient R² = 0.645 and a cross-validation correlation coefficient R²_CV = 0.375. After removing 4 outliers, the quantitative modelling and cross-validation correlation coefficients were R² = 0.743 and R²_CV = 0.543, respectively.
Keywords: SARS-CoV, coronavirus, multiple linear regression (MLR), stepwise multiple regression (SMR), encoding protein, identification
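The two statistics quoted in the abstract can be illustrated with a minimal sketch. It assumes ordinary least squares for the MLR fit and leave-one-out cross-validation for R²_CV; the abstract does not state which cross-validation scheme was used. The data are random placeholders with the same shape as the study (100 samples, 23 variables), not the protein descriptors, and the function names `fit_mlr`, `predict_mlr`, `r_squared`, and `loo_r_squared` are hypothetical.

```python
import numpy as np

def fit_mlr(X, y):
    """Fit y = b0 + X @ b by ordinary least squares; returns [b0, b1, ..., bp]."""
    A = np.column_stack([np.ones(len(X)), X])  # prepend an intercept column
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef

def predict_mlr(coef, X):
    """Predict responses from fitted coefficients."""
    return coef[0] + X @ coef[1:]

def r_squared(y, y_hat):
    """1 - (residual sum of squares) / (total sum of squares about the mean of y)."""
    ss_res = np.sum((y - y_hat) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return 1.0 - ss_res / ss_tot

def loo_r_squared(X, y):
    """Leave-one-out R^2 (assumed scheme): each sample is predicted by a model
    fitted on all the other samples, then scored against the full-data mean."""
    n = len(y)
    preds = np.empty(n)
    for i in range(n):
        keep = np.arange(n) != i
        coef = fit_mlr(X[keep], y[keep])
        preds[i] = predict_mlr(coef, X[i:i + 1])[0]
    return r_squared(y, preds)

# Synthetic placeholder data: 100 samples, 23 variables (NOT the study's data).
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 23))
y = X @ rng.normal(size=23) + rng.normal(scale=4.0, size=100)

r2 = r_squared(y, predict_mlr(fit_mlr(X, y), X))
r2_cv = loo_r_squared(X, y)
print(f"R2 = {r2:.3f}, R2_CV = {r2_cv:.3f}")
```

Under this definition the cross-validated value cannot exceed the fitted value, because each left-out residual is at least as large in magnitude as the corresponding in-sample residual. That is consistent with R²_CV being lower than R² in both pairs reported above.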