Abstract
A novel subtype of influenza A virus, 09H1N1, has spread rapidly across the world. Evolutionary analyses have revealed that 09H1N1 is a triple reassortant of segments from swine, avian and human influenza viruses. In this study, we investigated the factors shaping the codon usage bias of 09H1N1 and carried out a cluster analysis of 60 strains of influenza A virus from different subtypes based on their codon usage bias. We found that the preferentially used codons of 09H1N1 are more often A-ended or U-ended, and that the intra-genomic codon usage bias of 09H1N1 is quite low. Base composition constraint, dinucleotide biases and translational selection are the main factors influencing the codon usage bias of 09H1N1. At the genome level, we found that the codon usage bias of 09H1N1 is similar to that of H1N1 (A/swine/Kansas/77778/2007H1N1), H9N2 from Asia, H1N2 from Asia and North America, and H3N2 from North America. Our results provide insight into the processes governing the evolution of 09H1N1 and the regulation of its gene expression.
Key words: 09H1N1, Correspondence analysis, Codon usage bias
Footnotes
These authors contributed equally to this work.
Contributor Information
Xiao-chen Bo, Phone: +86-10-66931422, Email: boxc@bmi.ac.cn.
Sheng-qi Wang, Phone: +86-10-66932211, Email: sqwang@bmi.ac.cn.