Skip to main content
F1000Research logoLink to F1000Research
. 2016 Feb 19;5:195. [Version 1] doi: 10.12688/f1000research.8008.1

Tripartite genome of all species

MengPing Long 1,2,a,#, TaoBo Hu 1,3,4,b,#
PMCID: PMC4911623  PMID: 27366319

Abstract

Neutral theory has dominated the molecular evolution field for more than half a century, but it has been severely challenged by the recently emerged Maximum Genetic Diversity (MGD) theory. However, based on our recent work of tripartite human genome architecture, we found that MGD theory may have overlooked the regulatory but variable genomic regions that increase with species complexity. Here we propose a new molecular evolution theory named Increasing Functional Variation (IFV) hypothesis. According to the IFV hypothesis, the genome of all species is divided into three regions that are ‘functional and invariable’, ‘functional and variable’ and ‘non-functional and variable’. While the ‘non-functional and variable’ region decreases as species become more complex, the other two regions increase.

Keywords: Increasing Functional Variation hypothesis, Maximum Genetic Diversity theory, Neutral theory, Evolution, Genome architecture, Gene regulation

Introduction

The structure and function of the genome have been a major question that all researchers want to solve. The current popular view of the genomic structure is represented by the neutral theory. The neutral theory states that the majority of the genome is variable and neutral 1. The variable property of these genomic regions would not change as the complexity of species increases ( Figure 1).

Figure 1. Comparison of IFV, MGD, and neutral theory.

Figure 1.

While the neutral theory and MGD theory analyze genome structure as bipartite, the IFV hypothesis adds an additional region which is the variable and functional gene regulatory region. As species complexity increases, the variable region of the genome would stay as variable according to neutral theory. While in MGD theory, as species complexity increases there would be less variable region. Unlike MGD theory, IFV hypothesis states that the functional variable region which contains gene regulatory elements would also increase with species complexity.

While in recent years, another theory called Maximum Genetic Diversity (MGD) provided unprecedented insights into the genome structure 25. The MGD theory originated from blasting some conserved proteins such as cytochrome C and hemoglobin of different species. By computing the changeable sites of each species 3, Huang found that more complex species have less changeable sites in certain regions of the genome. Thus, MGD theory states that as the complexity of species increases, the genome would have more invariable regions and less variable regions ( Figure 1).

IFV hypothesis

Here we proposed the Increasing Functional Variation (IFV) hypothesis inspired by both the MGD theory 2 and our recent work on human genome architecture 6. Recently, based on co-localization of various genomic features we divided the human genome into three parts, referred to as gene enriched (Genic) zones, gene regulatory elements enriched (Proximal) zones and non-functional features enriched (Distal) zones 6. We regard the Genic zones as mainly functional and invariable, and the Distal zones as mainly non-functional and variable. The Proximal zones that compose 31% of human genome contain the majority of gene regulatory elements including transcriptional factor binding sites (TFBSs) and are at the same time enriched with conserved indels. These features make Proximal zones functional and variable. It has been proven that as the complexity of species increase, there would be more gene regulatory region in the genome. Based on these two points, we propose that as the complexity of species increases, this variable part of the genome which contains functional regulatory elements would also increase. We call it the Increasing Functional Variation (IFV) hypothesis. Besides the variable gene regulatory region, the other part of the genome can be divided into two parts, the functional and invariable region and the non-functional and variable region. The alteration of these two parts with species complexity can be explained by MGD theory ( Figure 1). What the MGD theory lacks and IVF hypothesis complements is the existence of the variable and functional gene regulatory region in the genome. And according to the IFV hypothesis, as species complexity increases, the variable part of the genome would not simply decrease as stated by MGD theory. The differences between IVF hypothesis and MGD theory have been illustrated in Table 1.

Table 1. Comparison of IFV hypothesis and MGD theory.

IFV MGD
Genome architecture Tripartite Bipartite
Types of variable
region
Two One
Alteration of variable
region as species
complexity increases
Functional variable region
increases while
non-functional variable
region decreases
Decreases

Conclusions

MGD theory has refuted the idea stated by the neutral theory that the majority of the genome is neutral and variable among all species. Instead, it proved that the variable region of the genome would decrease as species become more complex.

However, MGD theory has its own limitation as pointed out by Ho shortly after the publication of MGD. As Ho has mentioned in her book 7, more complex species have more sequence diversity, which is needed for precise regulation of local somatic expression. Ho also stated that although MGD theory solved the paradoxes in molecular evolution, the diversity of complex species at somatic level can’t be explained by it. Our recent study 6 on human genome architecture discovered not only variable but also functional regions of the human genome. In an attempt to provide a more comprehensive view of genome structure and molecular evolution, we developed the IFV hypothesis based on our discovery of the variable property of the gene regulatory region.

Why would we develop this tripartite model of genome architecture across all species? As the ancient Chinese philosopher Lao Tzu stated in Tao Te Ching thousands of years ago that “ three engenders the myriad things”, which means “three” is the root of all things. If the truth of the universe is universal, we believe that the consistency between our tripartite genome architecture of all species and Lao Tzu’s philosophical thinking is not a coincidence.

Funding Statement

MPL & TBH are recipients of MD-PhD scholarship from Hong Kong University of Science and Technology. This work was supported by the grant to TBH from Central South University, China (2282013bks118).

I confirm that the funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

[version 1; referees: 1 approved

References

  • 1. Nei M, Suzuki Y, Nozawa M: The neutral theory of molecular evolution in the genomic era. Annu Rev Genomics Hum Genet. 2010;11:265–89. 10.1146/annurev-genom-082908-150129 [DOI] [PubMed] [Google Scholar]
  • 2. Hu T, Long M, Yuan D, et al. : The genetic equidistance result: misreading by the molecular clock and neutral theory and reinterpretation nearly half of a century later. Sci China Life Sci. 2013;56(3):254–61. 10.1007/s11427-013-4452-x [DOI] [PubMed] [Google Scholar]
  • 3. Huang S: The Overlap Feature of the Genetic Equidistance Result—A Fundamental Biological Phenomenon Overlooked for Nearly Half of a Century. Biological Theory. 2010;5(1):40–52. Reference Source [Google Scholar]
  • 4. Huang S: Inverse relationship between genetic diversity and epigenetic complexity.Preprint available at Nature Proceedings.2009. 10.1038/npre.2009.1751.2 [DOI] [Google Scholar]
  • 5. Huang S: The Genetic Equidistance Result of Molecular Evolution is Independent of Mutation Rates. J Comput Sci Syst Biol. 2008;1:92–102. 10.4172/jcsb.1000009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Ng SK, Hu T, Long X, et al. : Feature co-localization landscape of the human genome. Sci Rep. 2016;6: 20650. 10.1038/srep20650 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Ho MW: Development and Evolution Revisited. In: Hood KE, Halpern CT, Greenberg G, Lerner RM, editors. Handbook of Developmental Science, Behavior, and Genetics Blackwell Publishing Ltd;2010. 10.1002/9781444327632.ch4 [DOI] [Google Scholar]
F1000Res. 2016 Jun 16. doi: 10.5256/f1000research.8617.r14116

Referee response for version 1

Alfredo Pulvirenti 1

The paper, in its current version, presents several major limitations. 

The authors should provide more details about the theories.

Furthermore, the rationale behind their IFV hypothesis should be given in a deeper way. Examples supporting the conclusions are strongly needed. The authors should also highlight the limitations and the opportunities of this hypothesis.

I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

F1000Res. 2016 May 3. doi: 10.5256/f1000research.8617.r13461

Referee response for version 1

Rahul Banerjee 1

The manuscript by Long and Hu is of considerable interest in the field of evolution of genomes. However the authors could consider the following points to improve the quality of the manuscript :

  1. Is there any numerical measure of species complexity ?

  2. On page 2, column 2, line 6

    'Huang found that more complex species have less changeable sites in certain regions of the genomes.'

    It would be nice if some more details could be included with regard to these sites.

  3. The authors should be more elaborate as to why they considered the Genic zones to be mainly functional and invariable and the Distal zones as mainly non-functional and variable, and the Proximal zones functional and variable.

  4. The note about Lao Tzu was also quite attractive and added some philosophical charm to the scientific note.

I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

F1000Res. 2016 Mar 29. doi: 10.5256/f1000research.8617.r12943

Referee response for version 1

Shaillay Dogra 1

It would be nicer for the readers if there was more explanation and discussion in the text on the various different theories, what they imply, their pros and cons etc. Currently, it requires some background reading on these topics for the not so well-informed reader.

A detailed discussion of the basis of the IVF hypothesis, the basis of the proposals, its criticism etc. is suggested for a greater understanding of the author's ideas.

I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.


Articles from F1000Research are provided here courtesy of F1000 Research Ltd

RESOURCES