Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2024 May 29:2024.05.26.595818. [Version 1] doi: 10.1101/2024.05.26.595818

Ancient eukaryotic protein interactions illuminate modern genetic traits and disorders

Rachael M Cox 1, Ophelia Papoulas 1, Shirlee Shril 2, Chanjae Lee 1, Tynan Gardner 1, Anna M Battenhouse 1, Muyoung Lee 1, Kevin Drew 3, Claire D McWhite 4, David Yang 1, Janelle C Leggere 1, Dannie Durand 5, Friedhelm Hildebrandt 2, John B Wallingford 1, Edward M Marcotte 1
PMCID: PMC11160598  PMID: 38853926

SUMMARY

All eukaryotes share a common ancestor from roughly 1.5 – 1.8 billion years ago, a single-celled, swimming microbe known as LECA, the Last Eukaryotic Common Ancestor. Nearly half of the genes in modern eukaryotes were present in LECA, and many current genetic diseases and traits stem from these ancient molecular systems. To better understand these systems, we compared genes across modern organisms and identified a core set of 10,092 shared protein-coding gene families likely present in LECA, a quarter of which are uncharacterized. We then integrated >26,000 mass spectrometry proteomics analyses from 31 species to infer how these proteins interact in higher-order complexes. The resulting interactome describes the biochemical organization of LECA, revealing both known and new assemblies. We analyzed these ancient protein interactions to find new human gene-disease relationships for bone density and congenital birth defects, demonstrating the value of ancestral protein interactions for guiding functional genetics today.

INTRODUCTION

The last eukaryotic common ancestor (LECA), existing approximately 1.5 to 1.8 billion years ago, was the unicellular ancestor of all extant eukaryotes [1,2]. The molecular makeup of this basal eukaryote may offer insights into the ancient genetic innovations that gave rise to the vast eukaryotic cellular complexity we observe today. As such, understanding the core genetic toolkit of LECA has been a long-standing goal in genetics and evolutionary biology.

Previous phylogenomic analyses are in strong agreement that LECA was highly complex and almost certainly contained most hallmarks of the eukaryotic cell. Synthesis of these studies suggests LECA had at least one nucleus [3,4] with linear chromosomes and centromeres [5]; an interconnected endomembrane system comprised of an endoplasmic reticulum [6], Golgi apparatus [7], vesicle trafficking system [8], and nuclear envelope [9]; a dynamic actin- and tubulin-based cytoskeleton [10] including pseudopodia [11], centrioles [12], and at least one cilium [13]; distinct degradative vesicles such as lysosomes [14] and peroxisomes [15]; and mitochondria capable of both aerobic and anaerobic respiration [2,16].

While there are many partial descriptions of the genetic content of LECA, systematic reconstructions of LECA genes are sparse [1720]. Moreover, there is as yet no integrated picture of LECA’s proteome or how these proteins interact in higher-order assemblies. Such interactome data would be useful because nearly every significant cellular process appears to rely on assemblies of proteins working together [21], often organized into extended networks. Thus, an interactome for LECA would provide a richer portrait of its genetics and biochemistry than is possible from genomic reconstructions alone. Moreover, since proteins are the primary drivers of molecular phenotype [22], protein interaction networks are valuable tools for inferring protein functions and uncovering genotype-to-phenotype relationships of medical and agricultural interest.

Previous efforts to map eukaryotic protein interactomes systematically have primarily concentrated on opisthokonts (i.e., animals and fungi) [2329] with the exception of a handful of studies in plants (e.g. [3032]) and protists [33,34]. However, comparing protein networks across more divergent species could provide many new insights not only into evolutionary conservation, but also the divergence of specific biological processes. For example, conserved cellular machinery can be used in different organism-specific contexts to produce distinct organism-specific phenotypes (e.g. the “phenolog hypothesis” [35]).

Here, we sought to reconstruct the LECA protein interaction network and to use this network to illuminate the etiology of human genetic disease. We first derived a conservative estimate of the LECA protein-coding gene set. We then integrated ~26,000 mass spectrometry experiments across 31 eukaryotes spanning ~1.8 billion years of evolution to generate a draft map of protein interactions present in LECA and widely conserved across the eukaryotic tree of life. We then used directed analysis of frogs, mice, and humans to demonstrate that these ancient interactions are enriched for human disease-linked proteins. Thus, the LECA interactome serves as both a guide for better understanding the deep origins of eukaryotic systems and as a framework for identifying new disease associations for human proteins.

RESULTS AND DISCUSSION

Inferring the gene content of LECA, the last eukaryotic common ancestor

We first determined a set of 10,092 groups of orthologous genes tracing back to the last eukaryotic common ancestor using Dollo parsimony [36] on reference proteomes from 156 species (see Methods and supporting Zenodo data repository). These were defined using the eggNOG algorithm [37] and are referred to hereafter as LECA OrthoGroups (OGs) (Table S1). Within these, we recover genes consistent with the suite of eukaryotic features inferred from ancestral trait reconstructions. We mapped these LECA OGs to known functional annotations [37]. Figure 1 highlights LECA OG functions associated with the nucleus, endoplasmic reticulum, Golgi apparatus, endosomes, digestive vesicles, transport vesicles, secretory vesicles, mitochondria, cilium and an extensive cytoskeleton likely capable of cell projection.

Figure 1. Inferred subcellular organization in LECA, the last eukaryotic common ancestor, based on its estimated gene content.

Figure 1.

Cell illustration adapted from multiple graphics sourced from SwissBioPics [41].

A substantial proportion of functionally annotated LECA OGs are dedicated to DNA replication/repair, transcription, translation, and RNA processing (~25%), underscoring eukaryote-specific innovations related to the segregation of transcription and translation within the nucleus and cytoplasm, respectively [38]. Examples of these innovations are the spliceosome, nuclear envelope, nuclear pore complex, and transport factors such as karyopherins regulated by the Ran-GTP system [9,39]. Another large group of genes (~30%) reflects the expansion and specialization of proteins and pathways related to the compartmentalization of energy production, energy conversion, and metabolism [40]. For example, we recover all V-ATPase subunits, a conserved protein complex responsible for the acidification of lysosomes and peroxisomes. Additionally, we traced mitochondrial-specific genes such as TIM and TOM translocases to LECA, in addition to a large suite of mitochondrial carrier family (MLF) proteins and generalist solute carrier (SLC) family proteins. Continuous MLF- and SLC-mediated transport of myriad metabolites across the mitochondrial membrane enables multiple modes of compartmentalized energy conversion [41], suggesting that LECA had already evolved sophisticated discriminatory pathways necessary for precise partitioning of varied substrates.

LECA also possessed a rich system of endomembranes, with ~15% of LECA OGs associated with membrane biogenesis, intracellular trafficking, and signal transduction. These include endosomal coat proteins such as clathrin, adaptin, and the COPI and COPII vesicle coat complexes which facilitate vesicle budding from the Golgi apparatus and endoplasmic reticulum, respectively [8]. Proteins related to signal transduction underwent a massive expansion at the root of eukaryotes, particularly within GTPases, such as the Ras, Ran, Rho, Rab, Arf, and dynamin superfamilies [42], highlighting the complexity and diversity of early signaling pathways and their relationships with endomembranes. Surprisingly, using eggNOG functional annotations we only found ~300 LECA OGs (~3%) associated with the cytoskeleton and cell motility (i.e., a cilium) while previous studies reported ~500 OGs [43,44].

Finally, the eggNOG algorithm identified 2,387 LECA OGs (~25%) of unknown function (Figure 1, bottom right), the majority of which are absent from the human genome. The high proportion of uncharacterized genes and low count of ciliary genes prompted us to attempt to assign function, or at a minimum, subcellular location, to LECA OGs by using annotations for specific extant proteins in these families present in the UniProt database [45] (see Methods and Zenodo repository). This allowed us to recover ~200 LECA OGs associated with cytoskeletal and cell motility pathways that were previously classified as “function unknown” by the eggNOG algorithm, bringing the total up to the ~500 expected OGs based on previous literature [43,44]. In sum, we assign tentative UniProt annotations and subcellular compartments to 1,066 of 2,387 LECA OGs originally categorized as “function unknown” by eggNOG.

Overall, slightly less than half of human genes (n = 9,908) map to 4,777 unique LECA OGs, and these differences can be instructive. For example, the human genome appears to have lost a large number of genes associated with carbohydrate metabolism, amino acid metabolism, and membrane biogenesis, including those for isocitrate lyase (ICL) and malate synthase, enzymes of the glyoxylate cycle. Glyoxylate is a highly reactive aldehyde and glyoxylate cycles have been demonstrated in nearly every other branch of life outside of mammals [46,47]. Because of malate synthase loss, human cells must rely on other enzymes to neutralize glyoxylate: (alanine-glyoxylate aminotransferase (AGT) in the peroxisome or glyoxylate reductase (GRHPR) in the cytosol). Defects in either human enzyme produce primary hyperoxaluria, i.e., renal disease caused by the failure to detoxify glyoxylate [48]. This example highlights how comparative evolution informs mechanisms of human disease.

Mapping the LECA protein interactome

We next sought to determine the higher-order organization of the LECA proteome using co-fractionation mass spectrometry (CFMS). CFMS is a high-throughput method for measuring protein-protein interactions (PPIs) en masse from virtually any species or cell line without recombinant tags or affinity reagents (Figure 2) [26,49]. CFMS reduces the complexity of a native protein lysate by gently separating protein complexes based on properties such as size, charge, or hydrophobicity, and utilizes the premise that stably interacting proteins will generally co-elute together irrespective of the native separation method used. While results from a single CFMS experiment are insufficient to draw reliable conclusions about specific protein-protein interactions, integrating observations from orthogonal separations from multiple species, cell types, and tissues confers strong statistical power for inferring conserved interactions [27,29,31].

Figure 2. Overview of experimental and computational methods.

Figure 2.

(A) Schematic representation of a co-fractionation mass spectrometry experiment. (B) Proteomics data used to construct the LECA interactome included eukaryotes spanning ~1.8 billion years of evolution. Tree structure is based on [26]. Branch lengths are not drawn to scale. (C) Schematic overview of the approach for computing protein-protein interaction (PPI) features based on CFMS (1) and APMS (2) datasets, scoring conserved PPIs based on these features (3), and clustering scored PPIs into complexes (4).

To detect conserved protein interactions, we integrated raw CFMS data from more than 10,000 individual biochemical fractions [26,27,31,33,34] across 31 diverse eukaryotic species spanning ~1.8 billion years of evolution (Figure 2B). In addition to external data sets, we generated new CFMS data for four minimally characterized and phylogenetically diverse unicellular eukaryotes: Brachionus rotundiformis (Amorphea, rotifer), Euglena gracilis (Excavata, algae), Phaeodactylum tricornutum (TSAR, diatom), and Tetrahymena thermophila (TSAR, ciliate). Because LECA was ciliated, we expanded our coverage of ciliary proteomes by collecting CFMS data from Sus scrofa (Amorphea, pig) tracheal tissue and Xenopus laevis (Amorphea, frog) sperm. Experimental details concerning cell types, tissues, developmental stages, and fractionation procedures for each separation can be found either in the Methods section or in the PRIDE database (accessions in Table S2).

In all, we measured 379,758,411 peptides that were uniquely assigned to 259,732 orthologous groups (or unique proteins not mapping to orthogroups) across the 31 species, 149 separations, and 10,491 fractions (Figure 3A). We further augmented our CFMS data with ~15,000 mass spectrometry proteomics experiments [28] that included affinity purification mass spectrometry (APMS) [5052], proximity labeling [53,54], and RNA-pulldown data [55]. In total, we incorporated data from 26,297 mass spectrometry experiments. We then filtered this data such that we only retained LECA OGs that were strongly observed, in that the sum total of peptide spectral matches (PSMs) across all 149 fractionations was greater than or equal to 150 PSMs. This resulted in elution profiles for 5,989 well-measured LECA OGs, encompassing approximately 60% of the estimated LECA gene set.

Figure 3. Determining the LECA protein interactome.

Figure 3.

Co-elution matrix and results of the protein interaction machine learning pipeline. (A) Heat map of the filtered elution matrix for 5,989 strongly observed LECA OGs across 10,481 CFMS mass spectrometry fractions (left) and a blow-up of elution vectors for the COPI, 20S proteasome, and eukaryotic initiation factor 3 complexes for a select subset of species (right). (B) Precision-recall performance of three classifiers trained with increasingly larger sets of ranked features. (C) Precision-recall curves for the reconstruction of known protein complexes defined by a walktrap algorithm, where pairwise PPI scores from each classifier are used as input. Points are labeled with the total number of protein clusters (complexes) constructed at each point in the hierarchy. (D) The likelihood that PPIs in our network are present in externally defined protein-protein or mRNA coexpression networks as a function of our model’s PPI score. As PPI scores increase, our model becomes increasingly likely to agree with external studies.

While many complex subunits have co-elution profiles with visually detectable correlation (as for the members of the COPI vesicle coat complex, 20S proteasome, and eukaryotic initiation factor 3 in Figure 3A, right), a computational framework is required for systematic identification and to properly control for false positives (Figure 2C). To this end, we employed a supervised machine learning pipeline trained on the data we observed for known protein complexes. We assembled a set of 1,499 known complexes from two databases that record PPIs for a variety of eukaryotic species [5658] (see Methods). We then assessed a number of interaction prediction models generated by three different classification algorithms: extremely randomized trees (“ExtraTreesClassifier”), linear support vector (“LinearSVC”), and stochastic gradient descent (“SGDClassifier”).

We constructed models for each with a variable number of features derived from the mass spectrometry datasets, ranking features on a classifier-by-classifier basis, and evaluated performance by measuring the precision and recall of known complexes withheld from the training data (Figure 3B).

The precision-recall of our initial models were 5–38% recall at 90% precision, similar to previous large-scale protein interaction maps papers [28,29,31]. However, after implementation of a custom data stratification approach (see Methods), we observed improvements to performance, with models ranging from 40–90% recall at 90% precision. We suspect that this large jump in performance stemmed from a combination of the large volume of high quality mass spectrometry data being integrated and the data stratification approach, which we found significantly reduced overfitting (see Methods).

Using each classifier and its best performing feature set, we identified PPIs with a 10% false discovery rate (FDR) threshold (see Zenodo repository) and clustered the proteins into complexes using an unsupervised community detection “walktrap” algorithm [59], weighting the interactions by their confidence scores. The walktrap procedure determined the “optimal” number of subcommunities to range between 100–400, differing significantly by classifier. These large clusters capture the general organization of eukaryotic cells (e.g., partitions include a large spliceosomal cluster, a large chromosomal maintenance cluster, clusters broadly associated with cilia, etc). In order to obtain more granular protein complexes, we further divided these clusters into increasingly smaller communities to define a hierarchy of protein interactions (Table S3).

As a positive control, we noted that this approach successfully delineated known complexes. For example, a large spliceosome cluster was demarcated into LSM, PrP19 complex, and U4/U6 × U5 tri-snRNP complexes with increased granularity (Table S3). For each classifier, we quantified the performance of the walktrap procedure by computing precision and recall for each cluster at each level of the hierarchy (Figure 3C) and observed a clear tradeoff for increasingly fine-grained subcommunities to show increased precision but decreased recall. Overall, the support vector classifier (Figure 3C, red) netted the highest quality protein complexes and was chosen as our final model. The resulting final LECA complexome consists of the highest confidence 109,466 pairwise interactions between 3,193 unique OGs, hierarchically assembled into 199 (less granular) to 2,014 (more granular) protein complexes, which are portrayed schematically in Figure 4.

Figure 4. Visualizing hierarchical clustering of protein complexes for a subset of the conserved eukaryotic interactome.

Figure 4.

The circles of the smallest diameter correspond to individual proteins, where their colors correspond to whether the proteins within each cluster are characterized to interact with each other in the literature (red), whether a novel protein is interacting with a known complex (blue), or whether all the associations within a cluster are uncharacterized (yellow).

We sought to assess how well our final protein interaction model agreed with independent studies that defined interactions using orthogonal approaches. We observe that protein pairs within our highest scoring threshold (≤10% FDR) are significantly more likely than random chance to agree with yeast 2-hybrid (Y2H) [60], mRNA co-expression [61], and cross-linking mass spectrometry (XLMS) [62] interactions (Figure 3D), and performed comparably with a previous interaction map in plants [31].

Finally, current phylogenomic studies hypothesize myriad protein assemblies at the root of eukaryotes [5,8,9,18,6365]. With an experimentally determined set of conserved LECA PPIs in hand, in the next sections, we examined the extent to which our interactome both recapitulates previously hypothesized and discovers new LECA complexes. We focused on ancient protein assemblies related to intracellular trafficking and cell projection because these are highly relevant to modern human disease [6668] and highlight below multiple examples where we uncovered previously undescribed interactions.

Deep conservation and loss of vesicle tethering complexes

One hallmark of eukaryotic cells is their system of intracellular trafficking by cargo-laden vesicles that bud from one compartment and fuse to another, supported by coat proteins (e.g. clathrin, COPI, and COPII), membrane-anchored SNARE proteins to facilitate membrane fusion [69], and tethering factors ensure target specificity [70]. Many compartment-specific tethering modules are thought to have been present in LECA (as reviewed in [63]), including the ER-associated TRAPP-I, TRAPP-II, and TRAPP-III complexes, the Golgi-associated retrograde protein (GARP), the conserved oligomeric Golgi (COG) complexes, the endosome-associated recycling protein (EARP), and the endolysosomal homotypic fusion and protein sorting (HOPS) complexes. However, the extent to which particular protein interactions are conserved remains an open area of research. In our LECA interactome, we recover all of these core tethering assemblies, along with some unexpected members (Figure 5A).

Figure 5. Notable LECA systems related to vesicle tethering and cell projection.

Figure 5.

(A) Node colors for each vesicle tethering complex correspond to their primary subcellular localization: endoplasmic reticulum (light green), Golgi apparatus (dark green), digestive vesicles (orange), or endosomes (yellow). (B) Dark and light blue nodes depict core and peripheral cell projection components. In both (A) and (B), edges between proteins are colored by the number of eukaryotic supergroups in which the interaction is observed: red for interactions observed in all supergroups considered, orange for interactions observed in three of the four eukaryotic supergroups, and yellow for interactions only observed in half of the supergroups. The four supergroups considered are Amorphea, Excavata, TSAR and Archaeplastida (see Figure 2).

The GARP and EARP protein complexes are closely related and share three subunits (VPS51, VPS52, VPS53) [71]. Localization is conferred by additional subunits: VPS50 for endosomes or VPS54 for the Golgi apparatus [71]. We observe strongly conserved interactions between all of these subunits, in addition to EIPR1 (EARP and GARP complex-interacting protein 1) (Figure 5A, top right). The interaction of EIPR1 with the GARP/EARP complexes was only recently discovered, first in high-throughput screens of human proteins [50,52] and then confirmed in targeted study in human neuroglioma cells [72]. While EIPR1 is speculated to be widely conserved, we find that its interaction with GARP/EARP is indeed ancient and likely traces back to LECA.

In modern eukaryotes the HOPS complex shares four of its six subunits (VPS11, VPS16, VPS18, VPS33) with the related CORVET complex, while the remaining two subunits (VPS39 and VPS41) are unique to HOPS. In yeast, the CORVET subunits direct the fusion of early and recycling endosomes while HOPS directs the fusion of late endosomes, lysosomes, and autophagosomes [73]. Interestingly, we observe conserved interactions between VPS8 (previously thought to be CORVET-specific) and the VPS16, VPS18, VPS39 and VPS41 subunits of the HOPS complex (Figure 5A, top left), raising the possibility that a single HOPS-like complex in LECA may have governed the endolysosomal vesicle fusion pathway, with subsequent lineage-specific duplication and specialization of subunits for different compartments.

Analogously, the ER/Golgi-associated TRAPP complex is thought to be composed of five core subunits (TRAPPC1–5) with additional subunits in distinct TRAPP-I (TRAPPC6), TRAPP-II (TRAPPC9, TRAPPC10, TRAPPC13), and TRAPP-III (TRAPPC8) complexes. However, the number and identity of proteins in TRAPP-I/II/III vary significantly by species [74]. In our ancient interactome, we observe strong interactions between each member of the core C1-C5 complex, conserved across all sampled eukaryotic supergroups. Unexpectedly, we find pan-eukaryotic evidence for TRAPPC12 in this core complex, previously thought to be metazoan-specific [75,76]. The remaining interactions are differentially lost in specific eukaryotic lineages, with TRAPPC10 absent in all five sampled TSAR species. Our data, combined with conflicting literature on the exact composition of TRAPP-I/II/III, thus suggests an ancient and flexible core complex where subunits differentially specialize along different eukaryotic branches.

The eight-subunit COG assembly governs retrograde intra-Golgi trafficking and comprises two heterotrimeric subcomplexes (COG2–4 and COG5–7) linked by a COG1-COG8 heterodimer [77]. Our LECA interactome recapitulates this assembly and includes interactions with TMF1 and the ubiquitin ligase complex RNF20-RNF40 (Figure 5A, bottom right). TMF1-COG interactions have only been previously observed for metazoan COG2 and COG6 [78], but our data show confident TMF1-COG interactions spanning Amorphea and Archaeplastida, with TMF1 lost in Excavata and TSAR. RNF20-RNF40 is generally described in nuclear roles like histone ubiquitination, transcription regulation, and DNA damage repair [79,80], but has been also linked to the Golgi-associated adapter protein WAC [81], involved in Golgi membrane fusion [82,83]. We see strong conservation of RNF20-RNF40 interactions with COG across Amorphea, TSAR, and Archaeplastida, suggesting that the nuclear-repurposing of this ubiquitin ligase complex could be a recent mammalian innovation.

Thus, the LECA interactome reveals the conservation and specialization of eukaryotic vesicle tethering complexes. We identified unexpected ancient interactions, such as those involving EIPR1 with GARP/EARP, TMF1 and RNF20-RNF40 with COG, and TRAPPC12 with the TRAPP complex, and additionally saw evidence for flexible and lineage-specific adaptations. Given this utility for examining evolutionary conservation and diversification of LECA-associated complexes, we next applied it to shed light on a central question of LECA evolution that is in dispute.

Primordial origins of cell projection and phagocytosis

While phagocytosis is a trait widely observed across diverse groups of eukaryotes, it is debated whether LECA had the ability to recognize and engulf large particles. Contention stems from arguments concerning how the first eukaryotic common ancestor (FECA) acquired the alpha-proteobacterial precursor of the mitochondrion, i.e., whether FECA was akin to a phagocytosing archaeon or a more “simple” prokaryote that existed in protracted syntrophy with an alpha-proteobacterium [8486]. Existing phylogenomic investigations into the origins of phagocytosis are conflicting; at least three independent studies conclude phagocytosis probably evolved independently in multiple eukaryotic lineages [8789], while others argue that the trait was present in LECA and the absence of phagocytosis in certain eukaryotic groups is due to secondary loss as they adapted to new niches [90,91]. Recent investigations into Asgard archaea, the sister group to eukaryotes, reveal a dynamic actin-based cytoskeleton composed of F-actin assemblies, actin-related proteins (Arps), and actin-binding proteins such as profilins and gelsolins capable of modulating eukaryotic actin [9294], suggesting that FECA may have had phagocytic capacity.

Within our LECA gene set, we find an extensive complement of LECA OGs generally associated both with cell projections (pseudopodia, lamellipodia, filopodia) and with phagocytosis (phagocytic cups and phagosomes) (Figures 1, 5B). Specifically, LECA appears to have had Rho GTPases such as RAC/CDC42, formins, coronins, cofilins, gelsolins, and proteins associated the ARP2/3, ENA/VASP, WASP, SCAR/WAVE, and PI3K complexes. For example, in the LECA interactome, we recovered all seven subunits of the ARP2/3 complex, responsible for the cytoskeletal rearrangement required for cell projection, clustering with other related protein complexes involved in cell protrusion in addition to a number of proteins that are critical for phagocytosis in extant eukaryotic cells (Figure 5B). The core ARP2/3 complex consists of the proteins ARP2, ARP3, ARPC1, ARPC2, ARPC3, ARPC4, and ARPC5. Interestingly, in our data, ARPC5 is the most peripherally associated component in the cluster and has been completely lost in all species sampled within Excavata and TSAR.

Combined with previous evidence [9597], the presence of these genes in LECA suggests that the ancestral eukaryotic cell was almost certainly capable of pseudopod formation and projection-based motility despite the lack of UniProt annotations in species outside Amorphea (Figure S1). However, because cell projections and phagocytosis share underlying molecular machinery, it is less clear if the presence of these systems necessarily imply a phagocytosing LECA, and more evidence is required to conclude that phagocytosis is an ancestral trait present at the root of eukaryotes. To address this question, we therefore explored the conservation of additional interactions that might shed light on this issue.

Among peripheral interactors of the ARP2/3 complex, we observe CAPZA and CAPZB forming the heterodimeric F-actin capping complex, an essential regulator of actin nucleation that restricts elongation [98], as well as formins and coronins known to promote elongation [95,99]. We also find interactions with WDR1, a promoter of cofilin-mediated actin severing [100] that assists both actin polymerization and depolymerization [101]. Research strongly implicates these systems in Amorphean phagocytosis: Coronins are strongly enriched in phagocytic cups and defects result in impaired phagocytosis in both Dictyostelium and mammalian cells [102104]. Furthermore, we observe protein interaction evidence in at least two major eukaryotic supergroups consistent with the reported roles of AAK1 in receptor mediated endocytosis [105] and unconventional myosin in phagocytosis [106109]. Taken together, our results strongly support a phagocytosing LECA.

The LECA interactome reveals a ciliary mechanism for EFHC2-associated renal failure

The conservation of protein interactions over billions of years of evolution implies that they are strongly constrained and that their malfunction is likely to be pathogenic. Thus, studying genetic variation through the lens of conserved protein interactions should clarify mechanisms of genetic disease development, tolerance, and resilience. Consequently, we expect the LECA protein interactions to offer direct insights into human disease genetics, and by similar logic, to genotype-phenotype relationships of other modern eukaryotes.

Slightly less than half of human genes date back to LECA. Where these conserved genes have been characterized, they have been shown to be responsible for a large and diverse subset of major human diseases, spanning developmental disorders, cancers, chronic respiratory diseases, neurodegenerative conditions, and motor disorders (Figure 6A). For example, of the ~100 human genes known to be associated with deafness, nearly ¾ were present in LECA (Figure 6A). While some human diseases are “new”, evolutionarily-speaking, such as deficiency of the animal-specific pituitary hormone, many other diseases, such as ciliary dyskinesia, arise nearly entirely from genes in LECA OGs. In order to test the utility of our data for illuminating the biology of extant species, we next asked if the LECA interactome could be leveraged to predict human disease mechanisms and novel gene-disease relationships.

Figure 6. LECA protein interactions suggest mechanisms of genetic disease, as for end stage renal disease gene EFHC2, identified by whole exome sequencing and confirmed to have a ciliary etiology.

Figure 6.

(A) Causal genes for human diseases are frequently ancient, as shown by plotting gene-disease relationships obtained from OMIM, with each point representing a unique disease group with an associated number of genes (x-axis) and age, determined as the percentage of genes in LECA OGs (y-axis). (B) Pedigree of the index family A4237. Squares represent males, circles females, black shading the affected proband individual A4237–22 included in whole-exome sequencing (WES), and white shading the unaffected parents and siblings. (C) Summary of the phenotype and recessive disease-causing R133H EFHC2 variant identified by WES. (D) Location of Arginine 133 in relation to EFHC2 exon/intron (black/white) structure and DM10 protein domains (purple), and its deep evolutionary conservation. (E) EFHC2-containing ciliary complex uncovered in the LECA interactome. (F) Localization of GFP-EFHC2 to axonemes in Xenopus motile cilia. (G) Introduction of the R133H mutation results in loss of ciliary localization of GFP-EFHC2, confirmed by co-labeling with membrane-RFP. Scale bar = 10 μm

Approximately 500 LECA OGs are related to cilia, and among the most common ciliopathies are diseases of the kidney [110]. We identified a male infant with microcephaly, seizures, polycystic kidney disease, and end-stage renal failure, and whole exome sequencing and pedigree analyses revealed a significant hemizygous, X-linked G>A variant in EFHC2 (rs34729789, 11:44148852:G:A) (Figures 6BD, S2). Essentially nothing is known of the function of EFHC2, though it and its paralog EFHC1 encode proteins thought to be microtubule inner proteins (MIPs) that function specifically in motile cilia; loss of their orthologues in Chlamydomonas or Tetrahymena leads to defective ciliary axonemes and/or ciliary beating, but do not disrupt ciliogenesis [111,112]. Because motile cilia are not present in mammalian kidneys, the link between EFHC2 and this patient’s disease was surprising. We therefore examined our LECA interactome for insights.

We first noted that the patient’s missense variant altered an arginine residue at position 133 to histidine, and this residue is conserved across Archaeplastida, Excavata, TSAR, and Amorphea (Figure 6D). Moreover, EFHC2 was closely and exclusively linked in our LECA interactome to other proteins involved in cilia motility (Figure 6E). We therefore examined the protein’s localization in Xenopus multiciliated cells, and found it to be very strongly localized to ciliary axonemes. By contrast, the disease-associated R133H variant failed to localize to cilia (Figure 6F, G), suggesting that defective ciliary localization of this protein contributed to ciliopathic kidney disease in the affected child.

This result then prompted us to ask if other proteins in the cluster, which are also thought to function specifically in motile cilia, might also be implicated in kidney disease. Indeed, previous genomic analyses link both PACRG [113] and TPPP [114] to chronic kidney disease, with PACRG specifically linked to end-stage renal disease [113]. This combination of clinical data, the LECA interactome, and specific hypothesis testing in a vertebrate model organism thus links EFHC2 ciliary function to an end-stage renal disease for which the molecular etiology was previously unknown and underscores the power of our comparative evolution strategy.

Network propagation for systematic ranking of gene-disease relationships

We next sought to score potential disease-causative proteins systematically within our conserved interactome on a disease-by-disease basis. To this end, we used cross-validated network guilt-by-association [115] to predict novel gene-disease pairs for 109 unique diseases based on clinically-validated genotype-to-phenotype relationships sourced from the OMIM database [116] (Figure 7A; see Methods). We measured the power of our approach as the areas under receiver operating characteristic curves (AUROC), and compared the predictive performance of known disease-associated gene sets versus that of random gene sets. As expected, random predictions have AUROC scores distributed around 0.5 (Figure 7A, yellow). In striking contrast, LECA-interactome predictions were skewed to higher scores (Figure 7A, blue), and using a conservative AUROC threshold of 0.7, we made strong new disease candidate predictions for almost one-third of the diseases considered (~35 Mendelian disorders). Below, we discuss the prediction and validation in animals of two such novel protein associations.

Figure 7. Guilt-by-association in the LECA interactome identifies ATP6V1A as causative for osteopetrosis and GLG1 for short-rib thoracic dysplasia (SRTD).

Figure 7.

(A) Guilt-by-association in the LECA PPI network correctly associates genes to human diseases for roughly a third of the 109 diseases tested, measured as the areas under receiver operating characteristic curves (AUROCs) of leave-one-out cross-validated predictions of known disease genes (light blue) versus random associations (yellow). (B) PPI network of genes clinically linked to osteopetrosis (black half-discs; 3 additional genes lie outside this cluster), the highest-ranking new candidates (purple), and their interactions with other V-ATPase subunits that were not indicated for osteopetrosis (orange). (C) For the top-scoring gene ATP6V1A, the bone mineral content is plotted for knockout (KO) mice with a heterozygous exon deletion in ATP6V1A (n=8 for each sex, n=16 total) compared to healthy control mice (female n=834, male n=780). Null mice show significantly increased bone density, consistent with the clinical manifestation of osteopetrosis. (D) The PPI network of genes clinically linked to SRTD (black half-discs) implicates GLG1 (yellow) and suggests a ciliary role, based on interactions with intraflagellar trafficking IFT-A (blue) and IFT-B (purple) complexes, cytoplasmic dyneins and dynactins (green), and other interactors (gray). (E) Morpholino knockdown (KD) of GLG1 significantly reduced the number of cilia in X. laevis multi-ciliated cells (Bonferroni adjusted t-test p < 10−16, n = 60 control cells, 79 knockdown cells, and 76 rescue cells, 9 embryos per condition over 3 injection replicates) compared to uninjected control animals; rescue by co-injection with a non-targeted GLG1 allele confirmed specificity. (F) In control Xenopus multi-ciliated cells, IFT56-GFP and IFT80-GFP, two subunits of IFT-B, are distributed as particles along the ciliary axonemes. However, MO knockdown of GLG1 leads to the accumulation of IFT-B proteins in the proximal region of axonemes. Scale bar = 10 μm. (G) This effect is quantified for IFT80-GFP for 3 cilia per cell for all cells analyzed in panel (E).

Identification and validation of ATP6V1A as a novel candidate for osteopetrosis

Our LECA network propagation approach implicated several vacuolar-type H+-ATPase (V-ATPase) proteins in the molecular etiology of osteopetrosis (AUROC ~0.8), a disorder in which bones grow abnormally and become overly dense [117] (Figure 7B). Given the key role of V-ATPases in regulating bone homeostasis by acidifying the space between osteoclasts and bone to help dissolution of bone hydroxyapatite [118], one might assume the disruption of many V-ATPase subunits would result in increased bone density. However, only three subunits have so far been implicated in osteopetrosis in humans [116,119] or mice [120]. The remaining V-ATPase subunits instead display a remarkably broad spectrum of disease associations, including cutis laxa (loose skin) and renal tubular acidosis [121], neurodegenerative disease, deafness [122], Zimmermann-Laband syndrome [123], and even osteoporosis (bone loss) [124], highlighting the need to elucidate the discrete molecular functions of specific V-ATPase subunits.

Our LECA network propagation approach gratifyingly made precise predictions, linking three specific subunits (ATP6V1A, ATP6V1B, ATP6V0D) to osteopetrosis (Figure 7B). To confirm these predictions we examined heterozygous CRISPR-Cas9 knockouts of ATP6V1A (performed by the KOMP2 high-throughput mouse phenotyping site at the Baylor College of Medicine, see Methods) and found these mice showed significantly increased bone mineral content (Figure 7C). The effect size was much stronger in female mice (p = 0.00003) than in male mice (p = 0.00813), echoing previous observations of sexual dimorphism in the body composition of mammals [125,126]. Despite the obvious lack of bones in the single celled last eukaryotic ancestor, then, our examination of the underlying protein interaction network of that organism nonetheless identified a specific mammalian phenotype with one specific subunit from among a large repertoire of closely related genes.

Ancient interactions suggest new candidate genes for a lethal human ciliopathy

Our highest scoring disease association (AUROC ~0.98) involved short-rib thoracic dysplasia (SRTD), a severe human ciliopathy characterized by skeletal abnormalities including dysplasia of the axial skeleton that in many cases lethally impairs respiratory function [127]. The disease is strongly associated with proteins involved in Intraflagellar Transport (IFT), the system which moves cargoes into and out of cilia, and this was reflected in our LECA interactome (Figure 7D). Our highest scoring non-IFT protein prediction, however, was the Golgi protein GLG1 [128]. This was an interesting candidate because mouse mutants of GLG1 display defects in rib development similar to SRTD [129], yet the protein has never been implicated in any aspect of ciliary biology.

We therefore explored the function of GLG1 in Xenopus multiciliated cells, and found it predominantly localized to the Golgi in MCCs, as expected. We observed no apparent localization at basal bodies or in cilia (not shown). Nonetheless, knockdown of GLG1 resulted in a significant loss of cilia from MCCs, an effect that was specific since it could be rescued by expression of GLG1-FLAG (Figure 7E). To ask if this defect in ciliogenesis was related to IFT, we performed live imaging of GFP fusions to two components of the IFT complex. In normal cells, both markers labeled small punctae in axonemes of Xenopus MCCs, consistent with previous imaging of IFT in these cells [130,131]. By contrast, GLG1 knockdown cells displayed large accumulations of IFT proteins within axonemes (Figure 7F, 7G) that resemble those seen previously after disruption of IFT [130,131].

Thus, analysis of the LECA interactome made a single, specific prediction of ciliary function for just one among the large array of Golgi-resident proteins, and that prediction was validated by experiments in Xenopus. These data provided new insights into the still obscure link between IFT and the Golgi [132,133] and, moreover, identified a plausible candidate gene for SRTD.

Conclusions

Studies of ancient protein-protein interactions and their conservation across species offer valuable insights for exploring the genetic underpinnings of contemporary genetic traits and diseases in modern species. In this work, we took an integrated approach to reconstruct the macromolecular assemblies of ancient proteins that, until now, have only been sparsely described. We defined a core set of likely LECA orthogroups, finding that slightly fewer than half of human genes can be traced back to this set. We integrated those data with more than 26,000 mass spectrometry proteomics experiments, capturing hundreds of millions of unique peptide measurements for hundreds of thousands of unique proteins in species sampled from across the tree of eukaryotes. Using these data, we reconstructed a high-quality conserved LECA protein interactome. This interaction network has formed the core of eukaryotic biology for nearly two billion years, and the dataset reveals new insights into both known protein complexes and novel assemblies.

Consistent with our central premise that the most highly-conserved protein assemblies will tend to be most critical for proper cell and organism function, the LECA interactome successfully predicts mechanisms of human disease and novel gene-disease relationships. We specifically presented evidence for a ciliary mechanism in human EFHC2-associated renal failure, identified the V-ATPase subunit ATP6V1A in the etiology of mammalian osteopetrosis, and demonstrated a role for the Golgi protein GLG1 in trafficking IFT-A proteins into cilia as a molecular mechanism for short-rib thoracic dysplasia. Given the intrinsic richness of these datasets, we expect this approach should similarly extend to traits and diseases in most other eukaryotic species, while providing insights into the specific molecular mechanisms involved due to being anchored in deeply conserved ancient protein activities.

Supplementary Material

Supplement 1

ACKNOWLEDGEMENTS

The authors gratefully acknowledge the generous support of the Tetrahymena stock center (Cornell University & Washington University in St. Louis), the Texas Advanced Computing Center at The University of Texas at Austin for providing high-performance computing resources, the Knockout Mouse Program (KOMP2) at the Baylor College of Medicine for Atp6v1a mice data, Johann Eberhart for rotifer cultures, the University of Texas UTEX Culture Collection of Algae for algal samples, and Angel Syrett and Elinor Marcotte for species illustrations. Research was funded by grants from the National Institute of General Medical Sciences (R35GM122480 to E.M.M. and F31GM143881 to R.M.C), National Institute of Diabetes and Digestive and Kidney Diseases (R01DK068306 to F.H.), National Institute of Child Health and Human Development (R00HD092613 to K.D. and R01HD085901 to J.B.W. and E.M.M.), Army Research Office (W911NF-12–1-0390 to E.M.M.), and Welch Foundation (F-1515 to E.M.M.).

Footnotes

DECLARATION OF INTERESTS

The authors declare no competing interests.

SUPPLEMENTAL INFORMATION

Detailed Supplemental Methods are provided, including 6 supplemental figures and 5 supplemental tables. All raw and interpreted mass spectrometry data were deposited to the ProteomeXchange via the PRIDE partner repository with the identifiers provided in Table S2. All project data files are deposited at Zenodo (https://doi.org/10.5281/zenodo.11267529). Custom R, Python, Bash and Perl scripts used for all analyses and figure generation are available at https://github.com/marcottelab/leca-proteomics.

REFERENCES

  • [1].Betts HC, Puttick MN, Clark JW, Williams TA, Donoghue PCJ, Pisani D. Integrated genomic and fossil evidence illuminates life’s early evolution and eukaryote origin. Nat Ecol Evol 2018;2:1556–62. 10.1038/s41559-018-0644-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [2].Brocks JJ, Nettersheim BJ, Adam P, Schaeffer P, Jarrett AJM, Güneli N, et al. Lost world of complex life and the late rise of the eukaryotic crown. Nature 2023;618:767–73. 10.1038/s41586-023-06170-w. [DOI] [PubMed] [Google Scholar]
  • [3].Skejo J, Garg SG, Gould SB, Hendriksen M, Tria FDK, Bremer N, et al. Evidence for a Syncytial Origin of Eukaryotes from Ancestral State Reconstruction. Genome Biol Evol 2021;13:evab096. 10.1093/gbe/evab096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [4].Bremer N, Tria FDK, Skejo J, Martin WF. The Ancestral Mitotic State: Closed Orthomitosis With Intranuclear Spindles in the Syncytial Last Eukaryotic Common Ancestor. Genome Biol Evol 2023;15:evad016. 10.1093/gbe/evad016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [5].Tromer EC, van Hooff JJE, Kops GJPL, Snel B. Mosaic origin of the eukaryotic kinetochore. Proc Natl Acad Sci 2019;116:12873–82. 10.1073/pnas.1821945116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [6].Kontou A, Herman EK, Field MC, Dacks JB, Koumandou VL. Evolution of factors shaping the endoplasmic reticulum. Traffic 2022;23:462–73. 10.1111/tra.12863. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [7].Klute MJ, Melançon P, Dacks JB. Evolution and Diversity of the Golgi. Cold Spring Harb Perspect Biol 2011;3:a007849. 10.1101/cshperspect.a007849. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [8].Field MC, Dacks JB. First and last ancestors: reconstructing evolution of the endomembrane system with ESCRTs, vesicle coat proteins, and nuclear pore complexes. Curr Opin Cell Biol 2009;21:4–13. 10.1016/j.ceb.2008.12.004. [DOI] [PubMed] [Google Scholar]
  • [9].Mans B, Anantharaman V, Aravind L, Koonin EV. Comparative Genomics, Evolution and Origins of the Nuclear Envelope and Nuclear Pore Complex. Cell Cycle 2004;3:1625–50. 10.4161/cc.3.12.1316. [DOI] [PubMed] [Google Scholar]
  • [10].Wickstead B, Gull K. The evolution of the cytoskeleton. J Cell Biol 2011;194:513–25. 10.1083/jcb.201102065. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [11].Richards TA, Cavalier-Smith T. Myosin domain evolution and the primary divergence of eukaryotes. Nature 2005;436:1113–8. 10.1038/nature03949. [DOI] [PubMed] [Google Scholar]
  • [12].Hodges ME, Scheumann N, Wickstead B, Langdale JA, Gull K. Reconstructing the evolutionary history of the centriole from protein components. J Cell Sci 2010;123:1407–13. 10.1242/jcs.064873. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [13].Mitchell DR. Evolution of Cilia. Cold Spring Harb Perspect Biol 2017;9:a028290. 10.1101/cshperspect.a028290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [14].Dacks JB, Field MC. Evolution of the eukaryotic membrane-trafficking system: origin, tempo and mode. J Cell Sci 2007;120:2977–85. 10.1242/jcs.013250. [DOI] [PubMed] [Google Scholar]
  • [15].Gabaldón T. Evolution of the Peroxisomal Proteome. In: del Río LA, Schrader M, editors. Proteomics Peroxisomes Identifying Nov. Funct. Regul. Netw., Singapore: Springer; 2018, p. 221–33. 10.1007/978-981-13-2233-4_9. [DOI] [Google Scholar]
  • [16].Müller M, Mentel M, van Hellemond JJ, Henze K, Woehle C, Gould SB, et al. Biochemistry and Evolution of Anaerobic Energy Metabolism in Eukaryotes. Microbiol Mol Biol Rev 2012;76:444–95. 10.1128/mmbr.05024-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [17].Koonin EV. The Incredible Expanding Ancestor of Eukaryotes. Cell 2010;140:606–8. 10.1016/j.cell.2010.02.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [18].Koumandou VL, Wickstead B, Ginger ML, van der Giezen M, Dacks JB, Field MC. Molecular paleontology and complexity in the last eukaryotic common ancestor. Crit Rev Biochem Mol Biol 2013;48:373–96. 10.3109/10409238.2013.821444. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [19].Deutekom ES, Snel B, van Dam TJP. Benchmarking orthology methods using phylogenetic patterns defined at the base of Eukaryotes. Brief Bioinform 2021;22:bbaa206. 10.1093/bib/bbaa206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [20].O’Malley MA, Leger MM, Wideman JG, Ruiz-Trillo I. Concepts of the last eukaryotic common ancestor. Nat Ecol Evol 2019;3:338–44. 10.1038/s41559-019-0796-3. [DOI] [PubMed] [Google Scholar]
  • [21].Hartwell LH, Hopfield JJ, Leibler S, Murray AW. From molecular to modular cell biology. Nature 1999;402:C47–52. 10.1038/35011540. [DOI] [PubMed] [Google Scholar]
  • [22].Jimenez-Sanchez G, Childs B, Valle D. Human disease genes. Nature 2001;409:853–5. 10.1038/35057050. [DOI] [PubMed] [Google Scholar]
  • [23].Li S, Armstrong CM, Bertin N, Ge H, Milstein S, Boxem M, et al. A map of the interactome network of the metazoan C. elegans. Science 2004;303:540–3. 10.1126/science.1091403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [24].Krogan NJ, Cagney G, Yu H, Zhong G, Guo X, Ignatchenko A, et al. Global landscape of protein complexes in the yeast Saccharomyces cerevisiae. Nature 2006;440:637–43. 10.1038/nature04670. [DOI] [PubMed] [Google Scholar]
  • [25].Guruharsha KG, Rual J-F, Zhai B, Mintseris J, Vaidya P, Vaidya N, et al. A protein complex network of Drosophila melanogaster. Cell 2011;147:690–703. 10.1016/j.cell.2011.08.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [26].Havugimana PC, Hart GT, Nepusz T, Yang H, Turinsky AL, Li Z, et al. A census of human soluble protein complexes. Cell 2012;150:1068–81. 10.1016/j.cell.2012.08.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [27].Wan C, Borgeson B, Phanse S, Tu F, Drew K, Clark G, et al. Panorama of ancient metazoan macromolecular complexes. Nature 2015;525:339–44. 10.1038/nature14877. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [28].Drew K, Wallingford JB, Marcotte EM. hu.MAP 2.0: integration of over 15,000 proteomic experiments builds a global compendium of human multiprotein assemblies. Mol Syst Biol 2021;17:e10016. 10.15252/msb.202010016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [29].Sae-Lee W, McCafferty CL, Verbeke EJ, Havugimana PC, Papoulas O, McWhite CD, et al. The protein organization of a red blood cell. Cell Rep 2022;40:111103. 10.1016/j.celrep.2022.111103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [30].Bassel GW, Gaudinier A, Brady SM, Hennig L, Rhee SY, De Smet I. Systems Analysis of Plant Functional, Transcriptional, Physical Interaction, and Metabolic Networks. Plant Cell 2012;24:3859–75. 10.1105/tpc.112.100776. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [31].McWhite CD, Papoulas O, Drew K, Cox RM, June V, Dong OX, et al. A Pan-plant Protein Complex Map Reveals Deep Conservation and Novel Assemblies. Cell 2020;181:460–474.e14. 10.1016/j.cell.2020.02.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [32].Pan J, Li L-P, You Z-H, Yu C-Q, Ren Z-H, Guan Y-J. Prediction of Protein–Protein Interactions in Arabidopsis, Maize, and Rice by Combining Deep Neural Network With Discrete Hilbert Transform. Front Genet 2021;12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [33].Crozier TWM, Tinti M, Larance M, Lamond AI, Ferguson MAJ. Prediction of Protein Complexes in Trypanosoma brucei by Protein Correlation Profiling Mass Spectrometry and Machine Learning. Mol Cell Proteomics 2017;16:2254–67. 10.1074/mcp.O117.068122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [34].Hillier C, Pardo M, Yu L, Bushell E, Sanderson T, Metcalf T, et al. Landscape of the Plasmodium Interactome Reveals Both Conserved and Species-Specific Functionality. Cell Rep 2019;28:1635–1647.e5. 10.1016/j.celrep.2019.07.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [35].McGary KL, Park TJ, Woods JO, Cha HJ, Wallingford JB, Marcotte EM. Systematic discovery of nonobvious human disease models through orthologous phenotypes. Proc Natl Acad Sci 2010;107:6544–9. 10.1073/pnas.0910200107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [36].Farris JS. Phylogenetic Analysis Under Dollo’s Law. Syst Biol 1977;26:77–88. 10.1093/sysbio/26.1.77. [DOI] [Google Scholar]
  • [37].Huerta-Cepas J, Szklarczyk D, Heller D, Hernández-Plaza A, Forslund SK, Cook H, et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res 2019;47:D309–14. 10.1093/nar/gky1085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [38].Martin W, Koonin EV. Introns and the origin of nucleus–cytosol compartmentalization. Nature 2006;440:41–5. 10.1038/nature04531. [DOI] [PubMed] [Google Scholar]
  • [39].Alber F, Dokudovskaya S, Veenhoff LM, Zhang W, Kipper J, Devos D, et al. The molecular architecture of the nuclear pore complex. Nature 2007;450:695–701. 10.1038/nature06405. [DOI] [PubMed] [Google Scholar]
  • [40].Gabaldón T, Pittis AA. Origin and evolution of metabolic sub-cellular compartmentalization in eukaryotes. Biochimie 2015;119:262–8. 10.1016/j.biochi.2015.03.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [41].Ferrada E, Superti-Furga G. A structure and evolutionary-based classification of solute carriers. iScience 2022;25:105096. 10.1016/j.isci.2022.105096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [42].Leipe DD, Wolf YI, Koonin EV, Aravind L. Classification and evolution of P-loop GTPases and related ATPases. J Mol Biol 2002;317:41–72. 10.1006/jmbi.2001.5378. [DOI] [PubMed] [Google Scholar]
  • [43].Li JB, Gerdes JM, Haycraft CJ, Fan Y, Teslovich TM, May-Simera H, et al. Comparative Genomics Identifies a Flagellar and Basal Body Proteome that Includes the BBS5 Human Disease Gene. Cell 2004;117:541–52. 10.1016/S0092-8674(04)00450-7. [DOI] [PubMed] [Google Scholar]
  • [44].Sigg MA, Menchen T, Lee C, Johnson J, Jungnickel MK, Choksi SP, et al. Evolutionary Proteomics Uncovers Ancient Associations of Cilia with Signaling Pathways. Dev Cell 2017;43:744–762.e11. 10.1016/j.devcel.2017.11.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [45].Consortium UniProt. UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res 2019;47:D506–15. 10.1093/nar/gky1049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [46].Kondrashov FA, Koonin EV, Morgunov IG, Finogenova TV, Kondrashova MN. Evolution of glyoxylate cycle enzymes in Metazoa: evidence of multiple horizontal transfer events and pseudogene formation. Biol Direct 2006;1:31. 10.1186/1745-6150-1-31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [47].Dunn MF, Ramírez-Trujillo JA, Hernández-Lucas I. Major roles of isocitrate lyase and malate synthase in bacterial and fungal pathogenesis. Microbiology 2009;155:3166–75. 10.1099/mic.0.030858-0. [DOI] [PubMed] [Google Scholar]
  • [48].Salido E, Pey AL, Rodriguez R, Lorenzo V. Primary hyperoxalurias: Disorders of glyoxylate detoxification. Biochim Biophys Acta BBA - Mol Basis Dis 2012;1822:1453–64. 10.1016/j.bbadis.2012.03.004. [DOI] [PubMed] [Google Scholar]
  • [49].Kristensen AR, Gsponer J, Foster LJ. A high-throughput approach for measuring temporal changes in the interactome. Nat Methods 2012;9:907–9. 10.1038/nmeth.2131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [50].Hein MY, Hubner NC, Poser I, Cox J, Nagaraj N, Toyoda Y, et al. A human interactome in three quantitative dimensions organized by stoichiometries and abundances. Cell 2015;163:712–23. 10.1016/j.cell.2015.09.053. [DOI] [PubMed] [Google Scholar]
  • [51].Boldt K, van Reeuwijk J, Lu Q, Koutroumpas K, Nguyen T- MT, Texier Y, et al. An organelle-specific protein landscape identifies novel diseases and molecular mechanisms. Nat Commun 2016;7:11491. 10.1038/ncomms11491. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [52].Huttlin EL, Bruckner RJ, Paulo JA, Cannon JR, Ting L, Baltier K, et al. Architecture of the human interactome defines protein communities and disease networks. Nature 2017;545:505–9. 10.1038/nature22366. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [53].Gupta GD, Coyaud É, Gonçalves J, Mojarad BA, Liu Y, Wu Q, et al. A Dynamic Protein Interaction Landscape of the Human Centrosome-Cilium Interface. Cell 2015;163:1484–99. 10.1016/j.cell.2015.10.065. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [54].Youn J-Y, Dunham WH, Hong SJ, Knight JDR, Bashkurov M, Chen GI, et al. High-Density Proximity Mapping Reveals the Subcellular Organization of mRNA-Associated Granules and Bodies. Mol Cell 2018;69:517–532.e11. 10.1016/j.molcel.2017.12.020. [DOI] [PubMed] [Google Scholar]
  • [55].Treiber T, Treiber N, Plessmann U, Harlander S, Daiß J-L, Eichner N, et al. A Compendium of RNA-Binding Proteins that Regulate MicroRNA Biogenesis. Mol Cell 2017;66:270–284.e13. 10.1016/j.molcel.2017.03.014. [DOI] [PubMed] [Google Scholar]
  • [56].Giurgiu M, Reinhard J, Brauner B, Dunger-Kaltenbach I, Fobo G, Frishman G, et al. CORUM: the comprehensive resource of mammalian protein complexes—2019. Nucleic Acids Res 2019;47:D559–63. 10.1093/nar/gky973. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [57].Meldal BHM, Forner-Martinez O, Costanzo MC, Dana J, Demeter J, Dumousseau M, et al. The complex portal - an encyclopaedia of macromolecular complexes. Nucleic Acids Res 2015;43:D479–84. 10.1093/nar/gku975. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [58].Meldal BHM, Bye-A-Jee H, Gajdoš L, Hammerová Z, Horáčková A, Melicher F, et al. Complex Portal 2018: extended content and enhanced visualization tools for macromolecular complexes. Nucleic Acids Res 2019;47:D550–8. 10.1093/nar/gky1001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [59].Pons P, Latapy M. Computing communities in large networks using random walks (long version) 2005. 10.48550/arXiv.physics/0512106. [DOI] [Google Scholar]
  • [60].Luck K, Kim D-K, Lambourne L, Spirohn K, Begg BE, Bian W, et al. A reference map of the human binary protein interactome. Nature 2020;580:402–8. 10.1038/s41586-020-2188-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [61].Kim CY, Baek S, Cha J, Yang S, Kim E, Marcotte EM, et al. HumanNet v3: an improved database of human gene networks for disease research. Nucleic Acids Res 2022;50:D632–9. 10.1093/nar/gkab1048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [62].Bartolec TK, Vázquez-Campos X, Norman A, Luong C, Johnson M, Payne RJ, et al. Cross-linking mass spectrometry discovers, evaluates, and corroborates structures and protein–protein interactions in the human cell. Proc Natl Acad Sci 2023;120:e2219418120. 10.1073/pnas.2219418120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [63].Koumandou VL, Dacks JB, Coulson RM, Field MC. Control systems for membrane fusion in the ancestral eukaryote; evolution of tethering complexes and SM proteins. BMC Evol Biol 2007;7:29. 10.1186/1471-2148-7-29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [64].Carvalho-Santos Z, Azimzadeh J, Pereira-Leal JoséB, Bettencourt-Dias M. Tracing the origins of centrioles, cilia, and flagella. J Cell Biol 2011;194:165–75. 10.1083/jcb.201011152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [65].Mani J, Meisinger C, Schneider A. Peeping at TOMs—Diverse Entry Gates to Mitochondria Provide Insights into the Evolution of Eukaryotes. Mol Biol Evol 2016;33:337–51. 10.1093/molbev/msv219. [DOI] [PubMed] [Google Scholar]
  • [66].D’Souza Z, Taher FS, Lupashin VV. Golgi inCOGnito: From vesicle tethering to human disease. Biochim Biophys Acta BBA - Gen Subj 2020;1864:129694. 10.1016/j.bbagen.2020.129694. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [67].Morava É, Guillard M, Lefeber DJ, Wevers RA. Autosomal recessive cutis laxa syndrome revisited. Eur J Hum Genet 2009;17:1099–110. 10.1038/ejhg.2009.22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [68].Makhoul C, Gosavi P, Gleeson PA. Golgi Dynamics: The Morphology of the Mammalian Golgi Apparatus in Health and Disease. Front Cell Dev Biol 2019;7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [69].Bonifacino JS, Glick BS. The Mechanisms of Vesicle Budding and Fusion. Cell 2004;116:153–66. 10.1016/S0092-8674(03)01079-1. [DOI] [PubMed] [Google Scholar]
  • [70].Gillingham AK, Munro S. Transport carrier tethering – how vesicles are captured by organelles. Curr Opin Cell Biol 2019;59:140–6. 10.1016/j.ceb.2019.04.010. [DOI] [PubMed] [Google Scholar]
  • [71].Schindler C, Chen Y, Pu J, Guo X, Bonifacino JS. EARP is a multisubunit tethering complex involved in endocytic recycling. Nat Cell Biol 2015;17:639–50. 10.1038/ncb3129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [72].Gershlick DC, Schindler C, Chen Y, Bonifacino JS. TSSC1 is novel component of the endosomal retrieval machinery. Mol Biol Cell 2016;27:2867–78. 10.1091/mbc.E16-04-0209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [73].Balderhaar HJ kleine Ungermann C. CORVET and HOPS tethering complexes – coordinators of endosome and lysosome fusion. J Cell Sci 2013;126:1307–16. 10.1242/jcs.107805. [DOI] [PubMed] [Google Scholar]
  • [74].Kim JJ, Lipatova Z, Segev N. TRAPP Complexes in Secretion and Autophagy. Front Cell Dev Biol 2016;4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [75].Wendler F, Gillingham AK, Sinka R, Rosa-Ferreira C, Gordon DE, Franch-Marro X, et al. A genome-wide RNA interference screen identifies two novel components of the metazoan secretory pathway. EMBO J 2010;29:304–14. 10.1038/emboj.2009.350. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [76].Scrivens PJ, Noueihed B, Shahrzad N, Hul S, Brunet S, Sacher M. C4orf41 and TTC-15 are mammalian TRAPP components with a role at an early stage in ER-to-Golgi trafficking. Mol Biol Cell 2011;22:2083–93. 10.1091/mbc.e10-11-0873. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [77].Ungar D, Oka T, Vasile E, Krieger M, Hughson FM. Subunit Architecture of the Conserved Oligomeric Golgi Complex. J Biol Chem 2005;280:32729–35. 10.1074/jbc.M504590200. [DOI] [PubMed] [Google Scholar]
  • [78].Miller VJ, Sharma P, Kudlyk TA, Frost L, Rofe AP, Watson IJ, et al. Molecular Insights into Vesicle Tethering at the Golgi by the Conserved Oligomeric Golgi (COG) Complex and the Golgin TATA Element Modulatory Factor (TMF). J Biol Chem 2013;288:4229–40. 10.1074/jbc.M112.426767. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [79].Sethi G, Shanmugam MK, Arfuso F, Kumar AP. Role of RNF20 in cancer development and progression – a comprehensive review. Biosci Rep 2018;38:BSR20171287. 10.1042/BSR20171287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [80].Fu J, Liao L, Balaji KS, Wei C, Kim J, Peng J. Epigenetic modification and a role for the E3 ligase RNF40 in cancer development and metastasis. Oncogene 2021;40:465–74. 10.1038/s41388-020-01556-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [81].Zhang F, Yu X. WAC, a Functional Partner of RNF20/40, Regulates Histone H2B Ubiquitination and Gene Transcription. Mol Cell 2011;41:384–97. 10.1016/j.molcel.2011.01.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [82].Totsukawa G, Kaneko Y, Uchiyama K, Toh H, Tamura K, Kondo H. VCIP135 deubiquitinase and its binding protein, WAC, in p97ATPase-mediated membrane fusion. EMBO J 2011;30:3581–93. 10.1038/emboj.2011.260. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [83].Joachim J, Jefferies HBJ, Razi M, Frith D, Snijders AP, Chakravarty P, et al. Activation of ULK Kinase and Autophagy by GABARAP Trafficking from the Centrosome Is Regulated by WAC and GM130. Mol Cell 2015;60:899–913. 10.1016/j.molcel.2015.11.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [84].Sagan L. On the origin of mitosing cells. J Theor Biol 1967;14:225–IN6. 10.1016/0022-5193(67)90079-3. [DOI] [PubMed] [Google Scholar]
  • [85].Martin W, Müller M. The hydrogen hypothesis for the first eukaryote. Nature 1998;392:37–41. 10.1038/32096. [DOI] [PubMed] [Google Scholar]
  • [86].Mills DB, Boyle RA, Daines SJ, Sperling EA, Pisani D, Donoghue PCJ, et al. Eukaryogenesis and oxygen in Earth history. Nat Ecol Evol 2022;6:520–32. 10.1038/s41559-022-01733-y. [DOI] [PubMed] [Google Scholar]
  • [87].Yutin N, Wolf MY, Wolf YI, Koonin EV. The origins of phagocytosis and eukaryogenesis. Biol Direct 2009;4:9. 10.1186/1745-6150-4-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [88].Martin WF, Tielens AGM, Mentel M, Garg SG, Gould SB. The Physiology of Phagocytosis in the Context of Mitochondrial Origin. Microbiol Mol Biol Rev 2017;81:10.1128/mmbr.00008–17. 10.1128/mmbr.00008-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [89].Bremer N, Tria FDK, Skejo J, Garg SG, Martin WF. Ancestral State Reconstructions Trace Mitochondria But Not Phagocytosis to the Last Eukaryotic Common Ancestor. Genome Biol Evol 2022;14:evac079. 10.1093/gbe/evac079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [90].Maruyama S, Kim E. A Modern Descendant of Early Green Algal Phagotrophs. Curr Biol 2013;23:1081–4. 10.1016/j.cub.2013.04.063. [DOI] [PubMed] [Google Scholar]
  • [91].Burns JA, Pittis AA, Kim E. Gene-based predictive models of trophic modes suggest Asgard archaea are not phagocytotic. Nat Ecol Evol 2018;2:697–704. 10.1038/s41559-018-0477-7. [DOI] [PubMed] [Google Scholar]
  • [92].Akıl C, Robinson RC. Genomes of Asgard archaea encode profilins that regulate actin. Nature 2018;562:439–43. 10.1038/s41586-018-0548-6. [DOI] [PubMed] [Google Scholar]
  • [93].Akıl C, Tran LT, Orhant-Prioux M, Baskaran Y, Manser E, Blanchoin L, et al. Insights into the evolution of regulated actin dynamics via characterization of primitive gelsolin/cofilin proteins from Asgard archaea. Proc Natl Acad Sci 2020;117:19904–13. 10.1073/pnas.2009167117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [94].Rodrigues-Oliveira T, Wollweber F, Ponce-Toledo RI, Xu J, Rittmann SK- MR, Klingl A, et al. Actin cytoskeleton and complex cell architecture in an Asgard archaeon. Nature 2023;613:332–9. 10.1038/s41586-022-05550-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [95].Pollard TD. Regulation of Actin Filament Assembly by Arp2/3 Complex and Formins. Annu Rev Biophys Biomol Struct 2007;36:451–77. 10.1146/annurev.biophys.35.040405.101936. [DOI] [PubMed] [Google Scholar]
  • [96].Davidson AJ, Amato C, Thomason PA, Insall RH. WASP family proteins and formins compete in pseudopod- and bleb-based migration. J Cell Biol 2018;217:701–14. 10.1083/jcb.201705160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [97].Damiano-Guercio J, Kurzawa L, Mueller J, Dimchev G, Schaks M, Nemethova M, et al. Loss of Ena/VASP interferes with lamellipodium architecture, motility and integrin-dependent adhesion. eLife 2020;9:e55351. 10.7554/eLife.55351. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [98].Edwards M, Zwolak A, Schafer DA, Sept D, Dominguez R, Cooper JA. Capping protein regulators fine-tune actin assembly dynamics. Nat Rev Mol Cell Biol 2014;15:677–89. 10.1038/nrm3869. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [99].Gandhi M, Goode BL. Coronin: The Double-Edged Sword of Actin Dynamics. Madame Curie Biosci. Database Internet, Landes Bioscience; 2013. [Google Scholar]
  • [100].Bolger-Munro M, Choi K, Cheung F, Liu YT, Dang-Lawson M, Deretic N, et al. The Wdr1-LIMK-Cofilin Axis Controls B Cell Antigen Receptor-Induced Actin Remodeling and Signaling at the Immune Synapse. Front Cell Dev Biol 2021;9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [101].Pavlov D, Muhlrad A, Cooper J, Wear M, Reisler E. Actin Filament Severing by Cofilin. J Mol Biol 2007;365:1350–8. 10.1016/j.jmb.2006.10.102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [102].Maniak M, Rauchenberger R, Albrecht R, Murphy J, Gerisch G. Coronin involved in phagocytosis: Dynamics of particle-induced relocalization visualized by a green fluorescent protein tag. Cell 1995;83:915–24. 10.1016/0092-8674(95)90207-4. [DOI] [PubMed] [Google Scholar]
  • [103].Yan M, Collins RF, Grinstein S, Trimble WS. Coronin-1 Function Is Required for Phagosome Formation. Mol Biol Cell 2005;16:3077–87. 10.1091/mbc.e04-11-0989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [104].Yan M, Di Ciano-Oliveira C, Grinstein S, Trimble WS. Coronin Function Is Required for Chemotaxis and Phagocytosis in Human Neutrophils1. J Immunol 2007;178:5769–78. 10.4049/jimmunol.178.9.5769. [DOI] [PubMed] [Google Scholar]
  • [105].Toshima J, Toshima JY, Martin AC, Drubin DG. Phosphoregulation of Arp2/3-dependent actin assembly during receptor-mediated endocytosis. Nat Cell Biol 2005;7:246–54. 10.1038/ncb1229. [DOI] [PubMed] [Google Scholar]
  • [106].Yu B, Egbejimi A, Dharmat R, Xu P, Zhao Z, Long B, et al. Phagocytosed photoreceptor outer segments activate mTORC1 in the retinal pigment epithelium. Sci Signal 2018;11:eaag3315. 10.1126/scisignal.aag3315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [107].Dürrwang U, Fujita-Becker S, Erent M, Kull FJ, Tsiavaliaris G, Geeves MA, et al. Dictyostelium myosin-IE is a fast molecular motor involved in phagocytosis. J Cell Sci 2006;119:550–8. 10.1242/jcs.02774. [DOI] [PubMed] [Google Scholar]
  • [108].Barger SR, Reilly NS, Shutova MS, Li Q, Maiuri P, Heddleston JM, et al. Membrane-cytoskeletal crosstalk mediated by myosin-I regulates adhesion turnover during phagocytosis. Nat Commun 2019;10:1249. 10.1038/s41467-019-09104-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [109].Fili N, Toseland CP. Unconventional Myosins: How Regulation Meets Function. Int J Mol Sci 2020;21:67. 10.3390/ijms21010067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [110].Bergmann C, Guay-Woodford LM, Harris PC, Horie S, Peters DJM, Torres VE. Polycystic kidney disease. Nat Rev Dis Primer 2018;4:1–24. 10.1038/s41572-018-0047-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [111].Ikeda K, Brown JA, Yagi T, Norrander JM, Hirono M, Eccleston E, et al. Rib72, a Conserved Protein Associated with the Ribbon Compartment of Flagellar A-microtubules and Potentially Involved in the Linkage between Outer Doublet Microtubules. J Biol Chem 2003;278:7725–34. 10.1074/jbc.M210751200. [DOI] [PubMed] [Google Scholar]
  • [112].Stoddard D, Zhao Y, Bayless BA, Gui L, Louka P, Dave D, et al. Tetrahymena RIB72A and RIB72B are microtubule inner proteins in the ciliary doublet microtubules. Mol Biol Cell 2018;29:2566–77. 10.1091/mbc.E18-06-0405. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [113].Swan EJ, Salem RM, Sandholm N, Tarnow L, Rossing P, Lajer M, et al. Genetic risk factors affecting mitochondrial function are associated with kidney disease in people with Type 1 diabetes. Diabet Med 2015;32:1104–9. 10.1111/dme.12763. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [114].Han M, Moon S, Lee S, Kim K, An WJ, Ryu H, et al. Novel Genetic Variants Associated with Chronic Kidney Disease Progression. J Am Soc Nephrol 2023;34:857. 10.1681/ASN.0000000000000066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [115].Wang PI, Marcotte EM. It’s the machine that matters: Predicting gene function and phenotype from protein networks. J Proteomics 2010;73:2277–89. 10.1016/j.jprot.2010.07.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [116].Amberger JS, Bocchini CA, Schiettecatte F, Scott AF, Hamosh A. OMIM.org: Online Mendelian Inheritance in Man (OMIM), an online catalog of human genes and genetic disorders. Nucleic Acids Res 2015;43:D789–98. 10.1093/nar/gku1205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [117].Tolar J, Teitelbaum SL, Orchard PJ. Osteopetrosis. N Engl J Med 2004;351:2839–49. 10.1056/NEJMra040952. [DOI] [PubMed] [Google Scholar]
  • [118].Qin A, Cheng TS, Pavlos NJ, Lin Z, Dai KR, Zheng MH. V-ATPases in osteoclasts: Structure, function and potential inhibitors of bone resorption. Int J Biochem Cell Biol 2012;44:1422–35. 10.1016/j.biocel.2012.05.014. [DOI] [PubMed] [Google Scholar]
  • [119].Duan X, Yang S, Zhang L, Yang T. V-ATPases and osteoclasts: ambiguous future of V-ATPases inhibitors in osteoporosis. Theranostics 2018;8:5379–99. 10.7150/thno.28391. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [120].Nishi T, Kawasaki-Nishi S, Forgac M. Expression and function of the mouse V-ATPase d subunit isoforms. J Biol Chem 2003;278:46396–402. 10.1074/jbc.M303924200. [DOI] [PubMed] [Google Scholar]
  • [121].Esmail S, Kartner N, Yao Y, Kim JW, Reithmeier RAF, Manolson MF. Molecular mechanisms of cutis laxa– and distal renal tubular acidosis–causing mutations in V-ATPase a subunits, ATP6V0A2 and ATP6V0A4. J Biol Chem 2018;293:2787–800. 10.1074/jbc.M117.818872. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [122].Yuan Y, Zhang J, Chang Q, Zeng J, Xin F, Wang J, et al. De novo mutation in ATP6V1B2 impairs lysosome acidification and causes dominant deafness-onychodystrophy syndrome. Cell Res 2014;24:1370–3. 10.1038/cr.2014.77. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [123].Kortüm F, Caputo V, Bauer CK, Stella L, Ciolfi A, Alawi M, et al. Mutations in KCNH1 and ATP6V1B2 cause Zimmermann-Laband syndrome. Nat Genet 2015;47:661–7. 10.1038/ng.3282. [DOI] [PubMed] [Google Scholar]
  • [124].Duan X, Liu J, Zheng X, Wang Z, Zhang Y, Hao Y, et al. Deficiency of ATP6V1H Causes Bone Loss by Inhibiting Bone Resorption and Bone Formation through the TGF-β1 Pathway. Theranostics 2016;6:2183–95. 10.7150/thno.17140. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [125].Öz OK, Zerwekh JE, Fisher C, Graves K, Nanu L, Millsaps R, et al. Bone Has a Sexually Dimorphic Response to Aromatase Deficiency. J Bone Miner Res 2000;15:507–14. 10.1359/jbmr.2000.15.3.507. [DOI] [PubMed] [Google Scholar]
  • [126].Wells JCK. Sexual dimorphism of body composition. Best Pract Res Clin Endocrinol Metab 2007;21:415–30. 10.1016/j.beem.2007.04.007. [DOI] [PubMed] [Google Scholar]
  • [127].Lai B, Jiang H, Gao Y, Zhou X. Skeletal ciliopathy: pathogenesis and related signaling pathways. Mol Cell Biochem 2023. 10.1007/s11010-023-04765-5. [DOI] [PubMed] [Google Scholar]
  • [128].Steegmaier M, Borges E, Berger J, Schwarz H, Vestweber D. The E-selectin-ligand ESL-1 is located in the Golgi as well as on microvilli on the cell surface. J Cell Sci 1997;110:687–94. 10.1242/jcs.110.6.687. [DOI] [PubMed] [Google Scholar]
  • [129].Yang T, Mendoza-Londono R, Lu H, Tao J, Li K, Keller B, et al. E-selectin ligand–1 regulates growth plate homeostasis in mice by inhibiting the intracellular processing and secretion of mature TGF-β. J Clin Invest 2010;120:2474–85. 10.1172/JCI42150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [130].Brooks ER, Wallingford JB. Control of vertebrate intraflagellar transport by the planar cell polarity effector Fuz. J Cell Biol 2012;198:37–45. 10.1083/jcb.201204072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [131].Toriyama M, Lee C, Taylor SP, Duran I, Cohn DH, Bruel A-L, et al. The ciliopathy-associated CPLANE proteins direct basal body recruitment of intraflagellar transport machinery. Nat Genet 2016;48:648–56. 10.1038/ng.3558. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [132].Follit JA, Tuft RA, Fogarty KE, Pazour GJ. The intraflagellar transport protein IFT20 is associated with the Golgi complex and is required for cilia assembly. Mol Biol Cell 2006;17:3781–92. 10.1091/mbc.e06-02-0133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [133].Quidwai T, Wang J, Hall EA, Petriman NA, Leng W, Kiesel P, et al. A WDR35-dependent coat protein complex transports ciliary membrane cargo vesicles to cilia. eLife 2021;10:e69786. 10.7554/eLife.69786. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [134].Boeckmann B, Marcet-Houben M, Rees JA, Forslund K, Huerta-Cepas J, Muffato M, et al. Quest for Orthologs Entails Quest for Tree of Life: In Search of the Gene Stream. Genome Biol Evol 2015;7:1988–99. 10.1093/gbe/evv121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [135].Altenhoff AM, Garrayo-Ventas J, Cosentino S, Emms D, Glover NM, Hernández-Plaza A, et al. The Quest for Orthologs benchmark service and consensus calls in 2020. Nucleic Acids Res 2020;48:W538–45. 10.1093/nar/gkaa308. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [136].Cantalapiedra CP, Hernández-Plaza A, Letunic I, Bork P, Huerta-Cepas J. eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale. Mol Biol Evol 2021;38:5825–9. 10.1093/molbev/msab293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [137].Csűös M. Count: evolutionary analysis of phylogenetic profiles with parsimony and likelihood. Bioinformatics 2010;26:1910–2. 10.1093/bioinformatics/btq315. [DOI] [PubMed] [Google Scholar]
  • [138].Boeckmann B, Dylus D, Moretti S, Altenhoff A, Train C-M, Kriventseva E, et al. Taxon sampling unequally affects individual nodes in a phylogenetic tree: consequences for model gene tree construction in SwissTree 2017:181966. 10.1101/181966. [DOI] [Google Scholar]
  • [139].Altenhoff AM, Dessimoz C. Inferring orthology and paralogy. Methods Mol Biol Clifton NJ 2012;855:259–79. 10.1007/978-1-61779-582-4_9. [DOI] [PubMed] [Google Scholar]
  • [140].Altenhoff AM, Boeckmann B, Capella-Gutierrez S, Dalquen DA, DeLuca T, Forslund K, et al. Standardized benchmarking in the quest for orthologs. Nat Methods 2016;13:425–30. 10.1038/nmeth.3830. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [141].Nevers Y, Jones TEM, Jyothi D, Yates B, Ferret M, Portell-Silva L, et al. The Quest for Orthologs orthology benchmark service in 2022. Nucleic Acids Res 2022;50:W623–32. 10.1093/nar/gkac330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [142].Daugherty MD, Malik HS. Rules of Engagement: Molecular Insights from Host-Virus Arms Races. Annu Rev Genet 2012;46:677–700. 10.1146/annurev-genet-110711-155522. [DOI] [PubMed] [Google Scholar]
  • [143].Gilad Y, Oshlack A, Smyth GK, Speed TP, White KP. Expression profiling in primates reveals a rapid evolution of human transcription factors. Nature 2006;440:242–5. 10.1038/nature04559. [DOI] [PubMed] [Google Scholar]
  • [144].de Mendoza A, Sebé-Pedrós A. Origin and evolution of eukaryotic transcription factors. Curr Opin Genet Dev 2019;58–59:25–32. 10.1016/j.gde.2019.07.010. [DOI] [PubMed] [Google Scholar]
  • [145].Sloan DB, Alverson AJ, Chuckalovcak JP, Wu M, McCauley DE, Palmer JD, et al. Rapid Evolution of Enormous, Multichromosomal Genomes in Flowering Plant Mitochondria with Exceptionally High Mutation Rates. PLOS Biol 2012;10:e1001241. 10.1371/journal.pbio.1001241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [146].Luo A, Ho SYW. The molecular clock and evolutionary timescales. Biochem Soc Trans 2018;46:1183–90. 10.1042/BST20180186. [DOI] [PubMed] [Google Scholar]
  • [147].Gould SJ. Dollo on Dollo’s law: Irreversibility and the status of evolutionary laws. J Hist Biol 1970;3:189–212. 10.1007/BF00137351. [DOI] [PubMed] [Google Scholar]
  • [148].Rogozin IB, Wolf YI, Babenko VN, Koonin EV. Dollo parsimony and the reconstruction of genome evolution. In: Albert VA, editor. Parsimony Phylogeny Genomics, Oxford University Press; 2006, p. 0. 10.1093/acprof:oso/9780199297306.003.0011. [DOI] [Google Scholar]
  • [149].Goldberg EE, Igić B. ON PHYLOGENETIC TESTS OF IRREVERSIBLE EVOLUTION. Evolution 2008;62:2727–41. 10.1111/j.1558-5646.2008.00505.x. [DOI] [PubMed] [Google Scholar]
  • [150].Etten JV, Bhattacharya D. Horizontal Gene Transfer in Eukaryotes: Not if, but How Much? Trends Genet 2020;36:915–25. 10.1016/j.tig.2020.08.006. [DOI] [PubMed] [Google Scholar]
  • [151].Martin W, Borst P. Secondary loss of chloroplasts in trypanosomes. Proc Natl Acad Sci 2003;100:765–7. 10.1073/pnas.0437776100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [152].Hannaert V, Saavedra E, Duffieux F, Szikora J-P, Rigden DJ, Michels PAM, et al. Plant-like traits associated with metabolism of Trypanosoma parasites. Proc Natl Acad Sci U S A 2003;100:1067–71. 10.1073/pnas.0335769100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [153].Maruyama S, Matsuzaki M, Misawa K, Nozaki H. Cyanobacterial contribution to the genomes of the plastid-lacking protists. BMC Evol Biol 2009;9:197. 10.1186/1471-2148-9-197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [154].Matsuzaki M, Misumi O, Shin-i T, Maruyama S, Takahara M, Miyagishima S, et al. Genome sequence of the ultrasmall unicellular red alga Cyanidioschyzon merolae 10D. Nature 2004;428:653–7. 10.1038/nature02398. [DOI] [PubMed] [Google Scholar]
  • [155].Reyes-Prieto A, Bhattacharya D. Phylogeny of Calvin cycle enzymes supports Plantae monophyly. Mol Phylogenet Evol 2007;45:384–91. 10.1016/j.ympev.2007.02.026. [DOI] [PubMed] [Google Scholar]
  • [156].Bhattacharya D, Yoon HS, Hackett JD. Photosynthetic eukaryotes unite: endosymbiosis connects the dots. BioEssays 2004;26:50–60. 10.1002/bies.10376. [DOI] [PubMed] [Google Scholar]
  • [157].Burki F, Roger AJ, Brown MW, Simpson AGB. The New Tree of Eukaryotes. Trends Ecol Evol 2020;35:43–55. 10.1016/j.tree.2019.08.008. [DOI] [PubMed] [Google Scholar]
  • [158].Hampl V, Hug L, Leigh JW, Dacks JB, Lang BF, Simpson AGB, et al. Phylogenomic analyses support the monophyly of Excavata and resolve relationships among eukaryotic “supergroups.” Proc Natl Acad Sci 2009;106:3859–64. 10.1073/pnas.0807880106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [159].Parfrey LW, Grant J, Tekle YI, Lasek-Nesselquist E, Morrison HG, Sogin ML, et al. Broadly Sampled Multigene Analyses Yield a Well-Resolved Eukaryotic Tree of Life. Syst Biol 2010;59:518–33. 10.1093/sysbio/syq037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [160].Raychowdhury MK, McLaughlin M, Ramos AJ, Montalbetti N, Bouley R, Ausiello DA, et al. Characterization of Single Channel Currents from Primary Cilia of Renal Epithelial Cells. J Biol Chem 2005;280:34718–22. 10.1074/jbc.M507793200. [DOI] [PubMed] [Google Scholar]
  • [161].McCafferty CL, Papoulas O, Jordan MA, Hoogerbrugge G, Nichols C, Pigino G, et al. Integrative modeling reveals the molecular architecture of the intraflagellar transport A (IFT-A) complex. eLife 2022;11:e81977. 10.7554/eLife.81977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [162].Dentler WL. Chapter 3 Isolation of Cilia from Tetrahymena thermophila. Methods Cell Biol., vol. 47, Elsevier; 1995, p. 13–5. 10.1016/S0091-679X(08)60784-0. [DOI] [PubMed] [Google Scholar]
  • [163].Mallam AL, Marcotte EM. Systems-wide Studies Uncover Commander, a Multiprotein Complex Essential to Human Development. Cell Syst 2017;4:483–94. 10.1016/j.cels.2017.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [164].Drew K, Lee C, Cox RM, Dang V, Devitt CC, McWhite CD, et al. A systematic, label-free method for identifying RNA-associated proteins in vivo provides insights into vertebrate ciliary beating machinery. Dev Biol 2020;467:108–17. 10.1016/j.ydbio.2020.08.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [165].McWhite CD, Papoulas O, Drew K, Dang V, Leggere JC, Sae-Lee W, et al. Co-fractionation/mass spectrometry to identify protein complexes. STAR Protoc 2021;2:100370. 10.1016/j.xpro.2021.100370. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [166].Kwon T, Choi H, Vogel C, Nesvizhskii AI, Marcotte EM. MSblender: A Probabilistic Approach for Integrating Peptide Identifications from Multiple Database Search Engines. J Proteome Res 2011;10:2949–58. 10.1021/pr2002116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [167].Olson RS, Bartley N, Urbanowicz RJ, Moore JH. Evaluation of a Tree-based Pipeline Optimization Tool for Automating Data Science. Proc. Genet. Evol. Comput. Conf. 2016, Denver Colorado USA: ACM; 2016, p. 485–92. 10.1145/2908812.2908918. [DOI] [Google Scholar]
  • [168].Newman MEJ, Girvan M. Finding and evaluating community structure in networks. Phys Rev E 2004;69:026113. 10.1103/PhysRevE.69.026113. [DOI] [PubMed] [Google Scholar]
  • [169].Groza T, Gomez FL, Mashhadi HH, Muñoz-Fuentes V, Gunes O, Wilson R, et al. The International Mouse Phenotyping Consortium: comprehensive knockout phenotyping underpinning the study of human disease. Nucleic Acids Res 2023;51:D1038–45. 10.1093/nar/gkac972. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [170].Birling M-C, Yoshiki A, Adams DJ, Ayabe S, Beaudet AL, Bottomley J, et al. A resource of targeted mutant mouse lines for 5,061 genes. Nat Genet 2021;53:416–9. 10.1038/s41588-021-00825-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [171].Kurbatova N, Mason JC, Morgan H, Meehan TF, Karp NA. PhenStat: A Tool Kit for Standardized Analysis of High Throughput Phenotypic Data. PLOS ONE 2015;10:e0131274. 10.1371/journal.pone.0131274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [172].Park TJ, Gray RS, Sato A, Habas R, Wallingford JB. Subcellular Localization and Signaling Properties of Dishevelled in Developing Vertebrate Embryos. Curr Biol 2005;15:1039–44. 10.1016/j.cub.2005.04.062. [DOI] [PubMed] [Google Scholar]
  • [173].Huizar RL, Lee C, Boulgakov AA, Horani A, Tu F, Marcotte EM, et al. A liquid-like organelle at the root of motile ciliopathy. eLife 2018;7:e38497. 10.7554/eLife.38497. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [174].Lee C, Cox RM, Papoulas O, Horani A, Drew K, Devitt CC, et al. Functional partitioning of a liquid-like organelle during assembly of axonemal dyneins. eLife 2020;9:e58662. 10.7554/eLife.58662. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [175].Fisher M, James-Zorn C, Ponferrada V, Bell AJ, Sundararaj N, Segerdell E, et al. Xenbase: key features and resources of the Xenopus model organism knowledgebase. Genetics 2023;224:iyad018. 10.1093/genetics/iyad018. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1

Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES