Genetics. 2004 Jun;167(2):949–958. doi: 10.1534/genetics.102.010959

Detecting selection in noncoding regions of nucleotide sequences.

Wendy S W Wong, Rasmus Nielsen
PMCID: PMC1470900  PMID: 15238543

Abstract

We present a maximum-likelihood method for examining selection pressure and detecting positive selection in noncoding regions using multiple aligned DNA sequences. The rate of substitution in noncoding regions, relative to the rate of synonymous substitution in coding regions, is modeled by a parameter zeta. When a site in a noncoding region is evolving neutrally, zeta = 1; zeta > 1 indicates the action of positive selection, and zeta < 1 suggests negative selection. Using a combined model for the evolution of noncoding and coding regions, we develop two likelihood-ratio tests for the detection of selection in noncoding regions. Analyses of both simulated and real viral data are presented. Using the new method, we show that positive selection in viruses acts primarily in protein-coding regions and is rare or absent in noncoding regions.
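
The interpretation of zeta and the general shape of a likelihood-ratio test can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the two tests, so the code assumes a generic comparison of a null model (zeta fixed at 1) against an alternative (zeta estimated freely) and assumes a chi-square null distribution with one degree of freedom. The function names and the log-likelihood values in the usage lines are hypothetical.

```python
import math


def classify_zeta(zeta, tol=1e-9):
    """Interpret zeta, the noncoding substitution rate relative to the
    synonymous rate in coding regions, as described in the abstract."""
    if abs(zeta - 1.0) <= tol:
        return "neutral"
    return "positive selection" if zeta > 1.0 else "negative selection"


def likelihood_ratio_test(lnl_null, lnl_alt):
    """Generic likelihood-ratio test for nested models.

    lnl_null: maximized log-likelihood under the null (e.g. zeta = 1).
    lnl_alt:  maximized log-likelihood under the alternative (zeta free).

    ASSUMPTION: the statistic is compared with a chi-square distribution
    with 1 degree of freedom; the paper's tests may use a different null.
    """
    # Clamp at zero to guard against small negative values from
    # imperfect numerical optimization.
    stat = max(0.0, 2.0 * (lnl_alt - lnl_null))
    # For 1 degree of freedom, P(X > x) = erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value


# Hypothetical log-likelihoods, for illustration only.
stat, p_value = likelihood_ratio_test(lnl_null=-1000.0, lnl_alt=-998.0)
print(classify_zeta(2.5), stat, p_value)
```

A significant test alone indicates only that zeta differs from 1; the direction of the estimate (above or below 1) is what distinguishes positive from negative selection in this sketch.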

Articles from Genetics are provided here courtesy of Oxford University Press
