This themed issue centered on the evolution and structure of proteins and proteomes is comprised of seven published manuscripts. These works highlight recent developments in efforts to model protein evolution as well as findings from empirical computational genomes approaches. Two key inter-related themes emerge from the synthesis of these works: the crucial role of epistasis and the effect of mutation-selection-drift balance on protein stability.
Epistasis is suggested to have a significant impact on how proteins evolve [1,2,3,4,5], and several works within this themed issue explore the connection between epistasis and protein evolution. Posfai et al. [6] analytically demonstrate the necessity of epistasis in maintaining the stability of protein structures during evolution. Concrete examples of the significance of epistasis are also illustrated [7,8]. Knops et al. [7] describe how epistatic interactions in a Hepatitis C Virus protein can enable the evolution of drug resistance, while Andreou et al. [8] find that changes outside of the binding pocket of polysaccharide deacetylase in Bacillus species may contribute to differing substrate specificity. The need to account for epistatic interaction in evolutionary modeling is also pointed out in a review by Teufel et al. [9]. When modeling protein evolution with mutation-selection models, break points reflecting shifts in sets of appropriate amino acids can be used to maintain the mathematical approximation of site-independence while modeling a site-interdependent (epistatic) process.
The second theme that emerges from this collection of works is the role of mutation–selection–drift balance on protein stability, a concept popularized in a classic paper [10]. There are many sequence combinations that contribute to a folded state, and selection rarely acts on the precise energetic contributions of single amino acids, but rather on the cumulative effect in creating a properly folded protein (see reference [11] for further discussion of this point). This is what underlies epistasis. Ahrens et al. [12] look at the interplay of different structural features over evolution in maintaining folded states, while Mesa-Torres et al. [13] present examples in which disease arises due to the emergence of conformational instability with mutation that is not properly compensated. Further, Bányai et al. [14] demonstrate a correlation between rates of amino acid substitution, domain evolution, and morphological evolution in Cephalochordates by correcting mis-annotations that led to the inference of a higher rate of domain evolution in this lineage.
The link between epistasis and the site-specific substitution process emerged in the 1960s among other early work [15]. The combination of mutation at individual sites and selection at a higher level of organization reflecting fold and function gives rise to a clock-like rate of change. While this has subsequently been linked to the neutral theory, negative selection can also give rise to clock-like rates if substitutions are not rare and occur with a regular process (albeit made more complicated by epistasis). It is this understanding and its link to functioning under different selective regimes (including positive directional selection) that remain at the forefront of the field.
The exploration of epistasis within proteins and the effect of mutation–selection–drift balance on protein stability by the works compiled in this special edition are representative of the current state of research. Considering that these themes emerged organically from this compilation suggests that these areas of study pose many open questions. Each of these manuscripts represents a further step in developing this overarching understanding of the processes that govern molecular evolution at the protein level.
Acknowledgments
A.I.T. acknowledges support from National Institutes of Health Grant R01 GM088344 and National Science Foundation Cooperative agreement no. DBI-0939454 (BEACON Center). D.A.L. acknowledges support from National Science Foundation Grant DBI-1515704.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- 1.Breen M.S., Kemena C., Vlasov P.K., Notredame C., Kondrashov F.A. Epistasis as the primary factor in molecular evolution. Nature. 2012;490:535. doi: 10.1038/nature11510. [DOI] [PubMed] [Google Scholar]
- 2.Lunzer M., Golding G.B., Dean A.M. Pervasive cryptic epistasis in molecular evolution. PLoS Genet. 2010;6:e1001162. doi: 10.1371/annotation/d618ce28-5010-47df-a44e-148ecdc0fef6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Pollock D.D., Thiltgen G., Goldstein R.A. Amino acid coevolution induces an evolutionary stokes shift. Proc. Nat. Acad. Sci. USA. 2012;109:E1352–E1359. doi: 10.1073/pnas.1120084109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Starr T.N., Thornton J.W. Epistasis in protein evolution. Prot. Sci. 2016;25:1204–1218. doi: 10.1002/pro.2897. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Wang Z.O., Pollock D.D. Context dependence and coevolution among amino acid residues in proteins. Methods Enzymol. 2005;395:779–790. doi: 10.1016/S0076-6879(05)95040-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Posfai A., Zhou J., Plotkin J.B., Kinney J.B., McCandlish D.M. Selection for protein stability enriches for epistatic interactions. Genes. 2018;9:423. doi: 10.3390/genes9090423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Knops E., Sierra S., Kalaghatgi P., Heger E., Kaiser R., Kalinina O. Epistatic interactions in NS5A of hepatitis C virus suggest drug resistance mechanisms. Genes. 2018;9:343. doi: 10.3390/genes9070343. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Andreou A., Giastas P., Christoforides E., Eliopoulos E. Structural and evolutionary insights within the polysaccharide deacetylase gene family of Bacillus anthracis and Bacillus cereus. Genes. 2018;9:386. doi: 10.3390/genes9080386. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Teufel A., Ritchie A., Wilke C., Liberles D. Using the mutationselection framework to characterize selection on protein sequences. Genes. 2018;9:409. doi: 10.3390/genes9080409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Taverna D.M., Goldstein R.A. Why are proteins marginally stable? Proteins. 2002;46:105–109. doi: 10.1002/prot.10016. [DOI] [PubMed] [Google Scholar]
- 11.Chi P.B., Liberles D.A. Selection on protein structure, interaction, and sequence. Prot. Sci. 2016;25:1168–1178. doi: 10.1002/pro.2886. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Ahrens J.B., Rahaman J., Siltberg-Liberles J. Analysis of structural and functional factors reveals general trends in eukaryote protein evolution. Genes. 2018;9:553. doi: 10.3390/genes9110553. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Mesa-Torres N., Betancor-Fernandez I., Oppici E., Cellini B., Salido E., Pey A. Evolutionary divergent suppressor mutations in conformational diseases. Genes. 2018;9:352. doi: 10.3390/genes9070352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Bányai L., Kerekes K., Trexler M., Patthy L. Morphological stasis and proteome innovation in cephalochordates. Genes. 2018;9:353. doi: 10.3390/genes9070353. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Zuckerkandl E., Pauling L. Evolutionary divergence and convergence in protein. In: Bryson V., Vogel H., editors. Evolving Genes and Proteins. Academic Press; New York, NY, USA: 1965. pp. 97–166. [Google Scholar]