Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2008 Dec 17;105(51):20049–20050. doi: 10.1073/pnas.0811118106

The eocyte hypothesis and the origin of eukaryotic cells

John M Archibald 1,1
PMCID: PMC2629348  PMID: 19091952

In the June 1984 issue of PNAS, James Lake and colleagues (1) published a provocative article in which they proposed that eukaryotes (animals, fungi, plants, and protists) evolved from a specific group of thermophilic prokaryotes, the “eocyte” archaebacteria (1). Few questions capture the imagination of biologists like the origin of eukaryotic (nucleus-containing) cells such as our own, and as additional support accumulated (e.g., refs. 24) Lake's eocyte hypothesis garnered considerable attention. The idea that eukaryotes could have arisen from within an already diversified archaebacterial lineage was eventually overshadowed by Woese's (5) “three-domains” view of life in which archaebacteria (including eocytes) represent a natural (i.e., monophyletic) group to the exclusion of eukaryotes and eubacteria (5). In this issue of PNAS, Cox et al. (6) revisit the possibility of an archaebacterial origin for eukaryotes by using expanded molecular sequence datasets and ultra-modern phylogenetic approaches. Their analyses are a model of rigor and rekindle interesting and important ideas about the prokaryotic antecedents of eukaryotic cells.

Evolving Views on the Tree of Life

Next to life itself, the origin of complex cells is one of the most fundamental, and intractable, problems in evolutionary biology. Progress in this area relies heavily on an understanding of the relationships between present-day organisms, yet despite tremendous advances over the last half-century scientists remain firmly divided on how to best classify cellular life. Many adhere to the textbook concept of 2 basic types of cells, prokaryotes and eukaryotes, as championed by Stanier and van Niel (7). Others posit that at its deepest level life is not a dichotomy but a trichotomy comprised of cells belonging to the domains Bacteria, Archaea, and Eukarya, each monophyletic and sufficiently distinct from one another to warrant equal status (5, 8). The conceptual and practical challenges associated with establishing a genealogy-based classification scheme for microbes have been fiercely debated for decades (see ref. 9 for recent review), and the literature is rich in philosophy and rhetoric.

The genomics revolution of the 1990s brought tremendous optimism to the field of microbial systematics: if enough genomes from diverse organisms could be sequenced and compared, definitive answers to questions about evolutionary relationships within and between eubacteria, archaebacteria, and eukaryotes would surely emerge. More specifically, it should be possible to discern how eukaryotes evolved from prokaryotes (if indeed that is what happened), and perhaps even who among modern-day prokaryotic lineages is our closest ancestor. Unfortunately, with the sequences of hundreds of eubacterial, archaebacterial, and eukaryotic genomes has come the realization that the number of universally distributed genes suitable for global phylogenetic analysis is frustratingly small (10). Lateral (or horizontal) gene transfer has shown itself to be a pervasive force in the evolution of both prokaryotic and eukaryotic genomes, and even if a “core” set of genes can be identified (and there is much debate on this issue), how confident are we that the phylogenetic signal in these genes reflects the vertical history of cells? How meaningful are sequence alignment-independent, gene content-based approaches to resolving the “tree of life” (11)? To what extent is a “net of life” a more accurate and useful metaphor for describing the full spectrum of life on Earth (10, 1214)?

Cox et al. revisit the possibility of an archaebacterial origin for eukaryotes.

These vexing questions aside, it is important to consider what analyses of universally distributed genes have, and have not, revealed about the relationships between prokaryotes and eukaryotes. The advent of rRNA gene sequencing led to a revolution in microbiology with the discovery of the third domain of life, the archaebacteria (15), but rRNA phylogenies do not in and of themselves unambiguously support the monophyly of the group. Some analyses in fact place the crenarchaeotes [Lake's eocytes (1)] as the sister group of eukaryotes to the exclusion of the other main archaebacterial line, the euryarchaeotes (e.g., refs. 2 and 4). Similarly, although analyses of anciently diverged paralogs such as genes encoding translation elongation factors often [but not always (16)] place the root of the tree of life on the branch leading to eubacteria, they are equivocal as to whether archaebacteria are monophyletic or whether crenarchaeotes are adjacent to eukaryotes (e.g., refs. 17 and 18). A striking 11-aa insertion in the elongation factor 1α (EF-1α) protein identified by Rivera and Lake (3) supports the eocyte tree, as does the phylogeny of EF-1α itself (e.g., refs. 3 and 17).

New Support for an Old Idea

Cox et al. (6) perform a critical examination of the evidence for and against the eocyte hypothesis and the 3-domains tree: they revisit the phylogeny of large subunit (LSU) and small subunit (SSU) rRNA and carry out a comprehensive analysis of a set of 51 genes/proteins that includes components of the core replication, transcription, and translation machinery, the so-called “informational” genes (19). Using sophisticated methodologies that accommodate among-site compositional heterogeneity in DNA and protein sequences (20) and lineage-specific compositional changes over time (21), Cox et al. show that a combined 40-taxon LSU–SSU rRNA dataset produces topologies consistent with the eocyte tree, not the 3-domains tree. Thirty-nine of the 51 core proteins analyzed were found to be heterogeneous in amino acid composition, and remarkably, only a single protein (the largest subunit of RNA polymerase I) generated a statistically robust topology in which archaebacteria were monophyletic, i.e., crenarchaeotes and euryarchaeotes were each other's closest relatives. The remaining trees provided no strong evidence for or against either hypothesis, although of the 35 trees in which eukaryotes emerge from within a paraphyletic archaebacteria, 8 show the eocyte topology (6). Consistent with the results of Yutin et al. (22), it would appear that individual proteins analyzed in isolation do not paint a consistent picture as to whether the 3-domains tree should be favored over the eocyte tree or vice versa.

In search of increased resolution, Cox et al. (6) performed a battery of additional phylogenetic analyses on a concatenated set of 45 proteins culled from the original 51-protein dataset by eliminating alignments containing multiple eukaryotic paralogs. Using this supermatrix of 5,521 amino acid sites, traditional methods such as maximum parsimony resulted in the 3-domains topology, but a strongly-supported eocyte tree was obtained when maximum-likelihood and Bayesian analyses were performed, even under a “composition homogeneous” model. Use of a reduced amino acid alphabet to minimize the impact of saturation and/or compositional heterogeneity resulted in a robust crenarchaeote–eukaryote relationship. In sum, their analyses “… provide support for the eocyte tree, rather than the 3-domains tree” (6).

What are we to make of this conclusion? The gulf between prokaryotes and eukaryotes is obviously enormous, and the thorn in the side of the eocyte hypothesis has always been how to rationalize crenarchaeote–eukaryote monophyly with the apparent unity of archaebacteria from the perspective of molecular and cell biology. One of the most commonly raised objections is the fact that archaebacteria possess glycerol-ether membrane lipids, unlike the glycerol-ester lipids found in most eubacteria and eukaryotes. If, as the eocyte hypothesis predicts, the eukaryotic nucleocytoplasm evolved from within archaebacteria, eukaryotes would have had to replace their ancestral lipid biosynthesis pathway with a eubacterial-type system. At first glance this might seem problematic but it is not so difficult to imagine given that the “operational” genes of eukaryotes are primarily eubacterial in origin, not archaebacterial (23).

Conversely, comparative genomic studies have revealed that most “informational” genes in archaebacteria (e.g., those involved in DNA replication, transcription, and translation) are related to eukaryotic homologs (23, 24). Although it is often assumed that archaebacteria are monophyletic, there is in fact still much to learn about the biology of crenarchaeotes relative to euryarchaeotes. Indeed, the recent discovery of a novel cell division machinery in diverse crenarchaeal genera, unrelated to the FtsZ-based system found in euryarchaeotes and with proteins homologous to components of the eukaryote-specific endosomal protein sorting apparatus (25, 26), raises the possibility that a variety of molecular phylogeny-independent characters will eventually be brought to bear on the validity of the eocyte hypothesis.

The extent to which a small fraction of the genomes of living organisms can be used to trace the history of cellular lineages dating back >1 billion years will no doubt continue to be debated for years to come. Regardless, an important message to be taken from the analysis of Cox et al. (6) is just how sensitive the results of molecular phylogenies involving anciently diverged sequences can be to model misspecification and compositional heterogeneity. Beyond serving as a general impetus for reexamination of any number of tough phylogenetic problems, the results of Cox et al. serve to put the eocyte hypothesis firmly back “on the radar” of those struggling to understand the earliest events in the diversification of cellular lineages.

Footnotes

The author declares no conflict of interest.

See companion article on page 20356.

References

  • 1.Lake JA, Henderson E, Oakes M, Clark MW. Eocytes: A new ribosome structure indicates a kingdom with a close relationship to eukaryotes. Proc Natl Acad Sci USA. 1984;81:3786–3790. doi: 10.1073/pnas.81.12.3786. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Lake JA. Origin of the eukaryotic nucleus determined by rate-invariant analysis of rRNA sequences. Nature. 1988;331:184–186. doi: 10.1038/331184a0. [DOI] [PubMed] [Google Scholar]
  • 3.Rivera MC, Lake JA. Evidence that eukaryotes and eocyte prokaryotes are immediate relatives. Science. 1992;257:74–76. doi: 10.1126/science.1621096. [DOI] [PubMed] [Google Scholar]
  • 4.Tourasse NJ, Gouy M. Accounting for evolutionary rate variation among sequence sites consistently changes universal phylogenies deduced from rRNA and protein-coding genes. Mol Phylogenet Evol. 1999;13:159–168. doi: 10.1006/mpev.1999.0675. [DOI] [PubMed] [Google Scholar]
  • 5.Woese CR, Kandler O, Wheelis ML. Toward a natural system of organisms: Proposal for the domains Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci USA. 1990;87:4576–4579. doi: 10.1073/pnas.87.12.4576. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Cox CJ, Foster PG, Hirt RP, Harris SR, Embley TM. The archaebacterial origin of eukaryotes. Proc Natl Acad Sci USA. 2008;105:20356–20361. doi: 10.1073/pnas.0810647105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Stanier RA, van Niel CB. The concept of a bacterium. Arch Microbiol. 1962;42:17–35. doi: 10.1007/BF00425185. [DOI] [PubMed] [Google Scholar]
  • 8.Pace NR. Time for a change. Nature. 2006;441:289. doi: 10.1038/441289a. [DOI] [PubMed] [Google Scholar]
  • 9.Sapp J. Two faces of the prokaryote concept. Int Microbiol. 2006;9:163–172. [PubMed] [Google Scholar]
  • 10.Dagan T, Martin W. The tree of one percent. Genome Biol. 2006;7:118. doi: 10.1186/gb-2006-7-10-118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Wolf YI, Rogozin IB, Grishin NV, Koonin EV. Genome trees and the tree of life. Trends Genet. 2002;18:472–479. doi: 10.1016/s0168-9525(02)02744-0. [DOI] [PubMed] [Google Scholar]
  • 12.Doolittle WF. Phylogenetic classification and the universal tree. Science. 1999;284:2124–2129. doi: 10.1126/science.284.5423.2124. [DOI] [PubMed] [Google Scholar]
  • 13.Kurland CG, Canback B, Berg OG. Horizontal gene transfer: A critical view. Proc Natl Acad Sci USA. 2003;100:9658–9662. doi: 10.1073/pnas.1632870100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.McInerney JO, Cotton JA, Pisani D. The prokaryotic tree of life: past, present. .. and future? Trends Ecol Evol. 2008;23:276–281. doi: 10.1016/j.tree.2008.01.008. [DOI] [PubMed] [Google Scholar]
  • 15.Woese CR. Bacterial evolution. Microbiol Rev. 1987;51:221–271. doi: 10.1128/mr.51.2.221-271.1987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Philippe H, Forterre P. The rooting of the universal tree of life is not reliable. J Mol Evol. 1999;49:509–523. doi: 10.1007/pl00006573. [DOI] [PubMed] [Google Scholar]
  • 17.Baldauf SL, Palmer JD, Doolittle WF. The root of the universal tree and the origin of eukaryotes based on elongation factor phylogeny. Proc Natl Acad Sci USA. 1996;93:7749–7754. doi: 10.1073/pnas.93.15.7749. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Gribaldo S, Cammarano P. The root of the uni-versal tree of life inferred from anciently duplicated genes encoding components of the protein-targeting machinery. J Mol Evol. 1998;47:508–516. doi: 10.1007/pl00006407. [DOI] [PubMed] [Google Scholar]
  • 19.Jain R, Rivera MC, Lake JA. Horizontal gene transfer among genomes: The complexity hypothesis. Proc Natl Acad Sci USA. 1999;96:3801–3806. doi: 10.1073/pnas.96.7.3801. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Lartillot N, Philippe H. A Bayesian mixture model for across-site heterogeneities in the amino acid replacement process. Mol Biol Evol. 2004;21:1095–1109. doi: 10.1093/molbev/msh112. [DOI] [PubMed] [Google Scholar]
  • 21.Foster PG. Modeling compositional heterogeneity. Syst Biol. 2004;53:485–495. doi: 10.1080/10635150490445779. [DOI] [PubMed] [Google Scholar]
  • 22.Yutin N, Makarova KS, Mekhedov SL, Wolf YI, Koonin EV. The deep archaeal roots of eukaryotes. Mol Biol Evol. 2008;25:1619–1630. doi: 10.1093/molbev/msn108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Esser C, et al. A genome phylogeny for mitochondria among α-proteobacteria and a predominantly eubacterial ancestry of yeast nuclear genes. Mol Biol Evol. 2004;21:1643–1660. doi: 10.1093/molbev/msh160. [DOI] [PubMed] [Google Scholar]
  • 24.Rivera MC, Jain R, Moore JE, Lake JA. Genomic evidence for two functionally distinct gene classes. Proc Natl Acad Sci USA. 1998;95:6239–6244. doi: 10.1073/pnas.95.11.6239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Lindas A-C, Karlsson EA, Lindgren MT, Ettema TJG, Bernander R. A unique cell division machinery in the Archaea. Proc Natl Acad Sci USA. 2008;105:18942–18946. doi: 10.1073/pnas.0809467105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Samson RY, Obita T, Freund SM, Williams RL, Bell SD. A role for the ESCRT system in cell division in Archaea. Science. 2008 doi: 10.1126/science.1165322. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES