Version Changes
Revised. Amendments from Version 1
In response to reviewer comments, the results and discussion have been expanded to include a short section on Emc8 and Emc9 and a supplementary phylogenetic tree (Figure S1). The analysis demonstrates that Emc8 and Emc9 are vertebrate-specific paralogues resulting from a duplication of an ancestral protein in the lineage leading to vertebrates (Phylogenetic methods appear in the figure legend to Fig S1 - Associated references have also been added). These results suggest that vertebrate Emc8 and Emc9 should be renamed Emc8a and Emc8b, respectively. Figure 1 has been modified accordingly.
Abstract
The recently discovered endoplasmic reticulum (ER) membrane protein complex (EMC) has been implicated in ER-associated degradation (ERAD), lipid transport and tethering between the ER and mitochondrial outer membranes, and assembly of multipass ER-membrane proteins. The EMC has been studied in both animals and fungi but its presence outside the Opisthokont clade (animals + fungi + related protists) has not been demonstrated. Here, using homology-searching algorithms, I show that the EMC is truly an ancient and conserved protein complex, present in every major eukaryotic lineage. Very few organisms have completely lost the EMC, and most, even over 2 billion years of eukaryote evolution, have retained a majority of the complex members. I identify Sop4 and YDR056C in Saccharomyces cerevisiae as Emc7 and Emc10, respectively, subunits previously thought to be specific to animals. This study demonstrates that the EMC was present in the last eukaryote common ancestor (LECA) and is an extremely important component of eukaryotic cells even though its primary function remains elusive.
Keywords: Evolutionary cell biology, ER membrane protein complex (EMC), Membrane contact sites (MCS), ERMES, ER-mitochondria contact sites
Introduction
Recent studies suggest that the EMC (Endoplasmic Reticulum Membrane Complex) is a multifunctional, multi-subunit protein complex. In Homo sapiens, the EMC comprises ten subunits, Emc1-10, whereas in Saccharomyces cerevisiae the complex comprises only Emc1-6 ( Jonikas et al., 2009). The EMC has been implicated in several cellular processes. It has been implicated in ERAD (ER-associated degradation) ( Christianson et al., 2012; Jonikas et al., 2009; Richard et al., 2013) but the molecular mechanism for how EMC triggers ERAD has remained elusive. Emc6 contains a Rab5 interacting domain and has been shown to interact with Rab5A in humans during autophagosome formation ( Li et al., 2013). It has also been shown that the EMC is an ER-mitochondria tether in S. cerevisiae that interacts with the outer membrane protein Tom5 of the TOM (Translocase of the Mitochondrial Outer Membrane) complex ( Lahiri et al., 2014). Most recently, the EMC has been implicated in the proper assembly of multi-pass transmembrane (TM) proteins ( Satoh et al., 2015). These recent findings suggest that EMC involvement in ERAD may be due to secondary effects, as cells devoid of EMC components may result in either disruption of ER-mitochondria tethering, or the misfolding of multipass membrane proteins. Thus, the primary function of the EMC is still open for debate.
The ER-mitochondria encounter structure (ERMES), also involved in ER-mitochondria tethering, is a multifunctional protein complex implicated in both lipid transfer and mitochondrial outer membrane protein assembly ( AhYoung et al., 2015; Kornmann et al., 2009; Meisinger et al., 2006, Meisinger et al., 2007; Wideman et al., 2013; Wideman et al., 2010). However, ERMES as an ER-mitochondria tether is limited to a subset of eukaryote taxa ( Wideman et al., 2013), suggesting that a universal ER-mitochondria tethering complex might exist. Lahiri et al. (2014) state in their title that the EMC is a conserved protein complex. However, by stating that a protein is conserved, cell biologists and biochemists often simply mean that the protein is present in S. cerevisiae (fungi) and animals. Since the clade comprising animals and fungi only accounts for one fifth of the diversity of eukaryotes ( Adl et al., 2012), more work is necessary in order to support the claim made by Lahiri et al. Thus, I was prompted to investigate the taxonomic distribution of the EMC in order to (1) determine if it really is a conserved protein complex and (2) if it could possibly represent the pan-eukaryotic ER-mitochondria tether.
Methods
Sequences of experimentally validated EMC components (see Table S1 for accession numbers) from H. sapiens and S. cerevisiae were used as queries in BLAST ( Altschul et al., 1997) and pHMMer ( Finn et al., 2011) searches into the predicted proteomes of 70 organisms spanning the diversity of eukaryotes. Retrieved sequences were considered orthologous if they retrieved the original H. sapiens or S. cerevisiae EMC sequences as top hits when used as reciprocal BLAST or pHMMer queries into H. sapiens or S. cerevisiae predicted proteomes and did not retrieve any other closely related sequences (except in the case of Emc8 and Emc9, see below). In cases in which EMC components could not be identified in this manner, transcriptomes and genomes were searched using bioinformatically validated sequences from the previous step that were retrieved from closely related species. Genomes were downloaded from public repositories and genome project websites. See Table S1 for retrieved sequences.
Results and discussion
The EMC is an ancient and highly conserved protein complex
Using homology-searching algorithms EMC candidate proteins were identified in the vast majority of sequenced genomes representing the complete diversity of eukaryotes ( Figure 1). Emc8 and Emc9 are homologues but only a single homologue could be detected in most genomes. A subset of opisthokont Emc8/9 sequences were subjected to a phylogenetic analysis demonstrating that vertebrate Emc8 and Emc9 are the result of a vertebrate-specific duplication of Emc8 ( Figure S1; see legend for methods). Based on this knowledge, I suggest that the vertebrate Emc8 and Emc9 be renamed Emc8a and Emc8b, respectively.
Figure 1. Coulson plot showing distribution of EMC components across eukaryotes.

Coloured pies indicate presence of a particular subunit. Plot was generated using the Coulson plot generator ( Field et al., 2013). Asterisks indicate presence of orthologue in a different member of the genus but absent in the indicated species (see Table S1). Abbreviations: Vertebrates: Hsap, Homo sapiens; Mdom, Monodelphis domesticus; Drer, Danio rerio; Xtro, Xenopus tropicalis; Ggal, Gallus gallus; Mmus, Mus musculus; Invertebrates: Cele, Caenorhabditis elegans; Dmel, Drosophila melanogaster; Bflo, Branchiostoma floridae; Nvec, Nematostella vectensis; Tadh, Trichoplax adhaerens; Unicellular Holozoa: Mbre, Monosiga brevicollis; Cowc, Capsaspora owczarzaki; Sarc, Sphaeroforma arctica; Sros, Salpingoeca rosetta; Fungi: Spom, Schizosaccharomyces pombe; Scer, Saccharomyces cerevisiae; Ncra, Neurospora crassa; Cneo, Cryptococcus neoformans; Umay, Ustilago maydis; Bden, Batrachochytrium dendrobatidis; Ncer, Nosema ceranae; Ecun, Encephalitozoon cuniculi; Pir, Piromyces sp.; Spun, Spizellomyces punctatus; Rirr, Rhizophagus irregularis; Crev, Coemansia reversa; Ccor, Conidiobolus coronatus; Cang, Catenaria anguillulae; Rall, Rozella allomyces; Apusozoa: Ttra, Thecamonas trahens; Fonticulids: Fonticula alba; Amoebozoa: Acas, Acanthamoeba castellanii; Ddis, Dictyostelium discoideum; Ehis, Entamoeba histolytica; Ppal, Polysphondylium pallidum; Excavata: Ngru, Naegleria gruberi; Gint, Giardia intesinalis; Tvag, Trichomonas vaginalis; Bsal, Bodo saltans; Lmaj, Leishmania major; Tbru, Trypanosoma brucei; Stramenopiles: Bhom, Blastocystis hominis; Alim, Aurantiochytrium limacinum; Aana, Aureococcus anophagefferens; Tpse, Thalassiosira pseudonana; Ptri, Phaeodactylum tricornutum; Psoj, Phytophthora sojae; Esil, Ectocarpus siliculosus; Ngad, Nannochloropsis gaditana; Alveolates: Ptet, Paramecium tetraurelia; Tthe, Tetrahymena thermophila; Otri, Oxytricha trifallax; Tpar, Theileria parva; Smin, Symbiodinium minutum; Tgon, Toxoplasma gondii; Cpar, Cryptosporidium parvum; Pfal, Plasmodium falciparum; Rhizaria: Bnat, Bigelowiella natans; Rfil, Reticulomyxa filosa; Archaeplastida: Crei, Chlamydomonas reinhardtii; Cmer, Cyanidioschyzon merolae; Cyp, Cyanophora paradoxa; Atha, Arabidopsis thaliana; Ppat, Physcomitrella patens; Otau, Ostreococcus tauri; Gsul, Galdieria sulphuraria; Mpus, Micromonas pusilla; Ccri, Chondrus crispus; CCTH: Ehux, Emiliania huxleyi; Gthe, Guillardia theta.
A complete EMC (Emc1-8, 10) was found in at least one representative from each major lineage including animals, fungi, excavates, amoebozoa, green algae, plants, stramenopiles, alveolates, rhizaria, and haptophytes ( Figure 1). The relative sequence conservation of EMC components across diverse taxa suggests that the EMC has an ancient and critical role in cellular function.
Yeast Sop4 and YDR056C are Emc7 and Emc10, respectively
Although previous reports suggest S. cerevisiae EMC comprises only six subunits, I identified Sop4 and YDR056C as orthologues of Emc7 and Emc10, respectively. Supporting this, Jonikas et al. (2009), the original discoverers of the EMC, show by co-immunoprecipitation analyses that Sop4 and YDR056C are interacting partners of FLAG-tagged Emc3. This experiment not only confirms my bioinformatic classification but also puts into perspective a previous study on Sop4’s role in membrane protein quality control ( Luo et al., 2002). Furthermore, tracing the evolutionary history of the EMC in fungi demonstrates that Emc8 was lost only in Ascomycetes and a few basally diverging fungi whereas most fungi retain Emc8 (as well as Emc7 and 10).
The EMC has been independently lost in several lineages
Although the EMC was identified in representative taxa from every major eukaryote supergroup, I was unable to identify even a single EMC member in the genomes of the microsporidians Nosema ceranae and Encephalitozoon cuniculi, the metamonad Giardia intestinalis, the stramenopile Blastocystis hominis, the alveolate Theileria parva, and the red alga Cyanidioschyzon merolae ( Figure 1 and Figure 2). Trichomonas vaginalis, another metamonad retains only a rather divergent Emc2, that passed the test for orthology, but only weakly, suggesting that this protein is under relaxed selection, perhaps repurposed, or in the process of being lost. All other genomes from the remaining 65 species investigated contained clear representatives of EMC homologues ( Figure 1).
Figure 2. Evolutionary history of the EMC.

EMC 1-8 and 10 evolved prior to the divergence of the major eukaryote lineages. Green and red dashes represent gains and losses of EMC components, respectively. Coloured pies are schematic representations of which EMC components were present at different points over the course of evolution.
These disparate organisms that lack the EMC prompted the question: What cellular or biochemical features tie these diverse organisms together? The microsporidians, metamonads and B. hominis all contain reduced anaerobic mitochondria-related organelles (MROs) and also lack the EMC. However, the amoebozoan Entamoeba histolytica retains Emc1-4, 7 and 10, the apicomplexan Cryptosporidium parvum retains Emc1-4, and 8, and the fungus Piromyces sp. retains Emc1-4, 6, 7, and 10, but all three organisms also contain extremely reduced MROs. T. parva and C. merolae contain relatively normal mitochondria but completely lack the EMC. Thus, it seems that further insight into the cell biology of these organisms is required to understand why only these few species from unrelated lineages have lost the EMC. At this point, of the proposed functions of the EMC, its involvement in multipass membrane protein assembly is the best candidate for generalization to other eukaryotes. It explains the connection to ERAD as a secondary effect of misassembled multipass proteins and explains why an organism with extremely reduced mitochondria ( E. histolytica) might retain the EMC. Finally, although EMC involvement as an ER-mitochondria tether is attractive, the distribution of the only known MOM-localized interactor of EMC (Tom5) has not been identified in organisms other than animals and fungi ( Maćasev et al., 2004). Thus, until an ancient interaction partner is identified, the role of EMC as an ancient tether remains speculative.
Conclusions
Since the vast majority of species from each major branch of eukaryotes retain the EMC it can be inferred that it was present in the last eukaryote common ancestor (LECA). Since the sequences of most of the identified EMC homologues are very similar, it can be inferred that its function has likely been retained in most eukaryote lineages. Thus, the EMC is a generalizable eukaryotic feature as is its function—whatever it might be.
Data availability
The data referenced by this article are under copyright with the following copyright statement: Copyright: © 2015 Wideman JG
All sequence data are freely available in online databases (NCBI, JGI, or independent genome sequencing project websites).
Acknowledgements
I thank Joel Dacks for server access, computational time, and inspiration.
Funding Statement
JGW is the recipient of the European Molecular Biology Organization (EMBO) Long-term Fellowship (ALTF 761-2014).
I confirm that the funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
[version 2; referees: 2 approved
Supplementary materials
Supplementary Table S1Protein sequences retrieved in this study.
.
Supplementary Figure S1. Phylogenetic analysis of opisthokont Emc8/9 proteins.
Proteins were aligned using MUSCLE ( Edgar, 2004) and manually adjusted as needed using Mesquite ( http://mesquiteproject.org). Phylogenetic tree reconstructions were carried out using MrBayes v3.2.2 ( Ronquist et al., 2003) for Bayesian analysis. Maximum likelihood bootstrap values were obtained using RaxML ( Stamatakis, 2006) with 100 pseudoreplicates using the LG model ( Le & Gascuel, 2008). Support values: MrBayes/RAxML. Only support values >0.90/50 are shown.
References
- Adl SM, Simpson AGB, Lane CE, et al. : The revised classification of eukaryotes. J Eukaryot Microbiol. 2012;59(5):429–93. 10.1111/j.1550-7408.2012.00644.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- AhYoung AP, Jiang J, Zhang J, et al. : Conserved SMP domains of the ERMES complex bind phospholipids and mediate tether assembly. Proc Natl Acad Sci U S A. 2015;112(25):E3179–88. 10.1073/pnas.1422363112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Altschul SF, Madden TL, Schäffer AA, et al. : Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25(17):3389–402. 10.1093/nar/25.17.3389 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Christianson JC, Olzmann JA, Shaler TA, et al. : Defining human ERAD networks through an integrative mapping strategy. Nat Cell Biol. 2012;14(1):93–105. 10.1038/ncb2383 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Edgar RC: MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics. 2004;5:113. 10.1186/1471-2105-5-113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Field HI, Coulson RMR, Field MC: An automated graphics tool for comparative genomics: the Coulson plot generator. BMC Bioinformatics. 2013;14:141. 10.1186/1471-2105-14-141 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Finn RD, Clements J, Eddy SR: HMMER web server: interactive sequence similarity searching. Nucleic Acids Res. 2011;39(Web Server issue):W29–37. 10.1093/nar/gkr367 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jonikas M, Collins S, Denic V, et al. : Comprehensive characterization of genes required for protein folding in the endoplasmic reticulum. Science. 2009;323(5922):1693–1697. 10.1126/science.1167983 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kornmann B, Currie E, Collins SR, et al. : An ER-mitochondria tethering complex revealed by a synthetic biology screen. Science. 2009;325(5939):477–81. 10.1126/science.1175088 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lahiri S, Chao JT, Tavassoli S, et al. : A Conserved Endoplasmic Reticulum Membrane Protein Complex (EMC) Facilitates Phospholipid Transfer from the ER to Mitochondria. PLoS Biol. 2014;12(10):e1001969. 10.1371/journal.pbio.1001969 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Le SQ, Gascuel O: An improved general amino acid replacement matrix. Mol Biol Evol. 2008;25(7):1307–1320. 10.1093/molbev/msn067 [DOI] [PubMed] [Google Scholar]
- Li Y, Zhao Y, Hu J, et al. : A novel ER-localized transmembrane protein, EMC6, interacts with RAB5A and regulates cell autophagy. Autophagy. 2013;9(2):150–63. 10.4161/auto.22742 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luo W, Gong X, Chang A: An ER membrane protein, Sop4, facilitates ER export of the yeast plasma membrane [H +]ATPase, Pma1. Traffic. 2002;3(10):730–9. 10.1034/j.1600-0854.2002.31005.x [DOI] [PubMed] [Google Scholar]
- Maćasev D, Whelan J, Newbigin E, et al. : Tom22’, an 8-kDa trans-site receptor in plants and protozoans, is a conserved feature of the TOM complex that appeared early in the evolution of eukaryotes. Mol Biol Evol. 2004;21(8):1557–64. 10.1093/molbev/msh166 [DOI] [PubMed] [Google Scholar]
- Meisinger C, Pfannschmidt S, Rissler M, et al. : The morphology proteins Mdm12/Mmm1 function in the major beta-barrel assembly pathway of mitochondria. EMBO J. 2007;26(9):2229–39. 10.1038/sj.emboj.7601673 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meisinger C, Wiedemann N, Rissler M, et al. : Mitochondrial protein sorting: differentiation of beta-barrel assembly by Tom7-mediated segregation of Mdm10. J Biol Chem. 2006;281(32):22819–26. 10.1074/jbc.M602679200 [DOI] [PubMed] [Google Scholar]
- Richard M, Boulin T, Robert VJP, et al. : Biosynthesis of ionotropic acetylcholine receptors requires the evolutionarily conserved ER membrane complex. Proc Natl Acad Sci U S A. 2013;110(11):E1055–63. 10.1073/pnas.1216154110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003;19(12):1572–1574. 10.1093/bioinformatics/btg180 [DOI] [PubMed] [Google Scholar]
- Satoh T, Ohba A, Liu Z, et al. : dPob/EMC is essential for biosynthesis of rhodopsin and other multi-pass membrane proteins in Drosophila photoreceptors. Elife. 2015;4:e06306. 10.7554/eLife.06306 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stamatakis A: RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22(21):2688–2690. 10.1093/bioinformatics/btl446 [DOI] [PubMed] [Google Scholar]
- Wideman JG, Gawryluk RMR, Gray MW, et al. : The ancient and widespread nature of the ER-mitochondria encounter structure. Mol Biol Evol. 2013;30(9):2044–9. 10.1093/molbev/mst120 [DOI] [PubMed] [Google Scholar]
- Wideman JG, Go NE, Klein A, et al. : Roles of the Mdm10, Tom7, Mdm12, and Mmm1 proteins in the assembly of mitochondrial outer membrane proteins in Neurospora crassa. Mol Biol Cell. 2010;21(10):1725–36. 10.1091/mbc.E09-10-0844 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wideman JG, Lackey SWK, Srayko MA, et al. : Analysis of mutations in Neurospora crassa ERMES components reveals specific functions related to β-barrel protein assembly and maintenance of mitochondrial morphology. PLoS One. 2013;8(8):e71837. 10.1371/journal.pone.0071837 [DOI] [PMC free article] [PubMed] [Google Scholar]

