Abstract
Ubiquitin ligases (E3s) are basic components of the eukaryotic ubiquitination system. In this work, the emergence and diversification of fungal HECT ubiquitin ligases is described. Phylogenetic and structural data indicate that six HECT subfamilies (RSP5, TOM1, UFD4, HUL4, HUL4A and HUL5) existed in the common ancestor of all fungi. These six subfamilies have evolved very conservatively, with only occasional losses and duplications in particular fungal lineages. However, an early, drastic reduction in the number of HECT genes occurred in microsporidians, in parallel to the reduction of their genomes. A significant correlation between the total number of genes and the number of HECT-encoding genes present in fungi has been observed. However, transitions from unicellularity to multicellularity or vice versa apparently had no effect on the evolution of this family. Likely orthologs or co-orthologs of all fungal HECT genes have been detected in animals. Four genes are deduced to be present in the common ancestor of fungi, animals and plants. Protein-protein interactions detected in both the yeast Saccharomyces cerevisiae and humans suggest that some ancient functions of HECT proteins have been conserved since the animals/fungi split.
Introduction
Protein ubiquitination is involved in the control of multiple essential functions in all eukaryotic species1–3. Given its importance, there is a significant interest in understanding the evolution of the ubiquitination system, from its early origin4–6 to its complex patterns of diversification in eukaryotic phyla, in which the ubiquitination machinery typically involves hundreds of proteins7. Among them, the most diverse are ubiquitin ligases (E3s), the enzymes that transfer ubiquitin to target proteins. Ubiquitin ligases have been classified into different classes according to structural and functional features: Ring Finger-containing E3s (including U-box ubiquitin ligases), Cullin-containing E3 complexes, RBR ubiquitin ligases and HECT E3s8–10. In addition of many works analyzing the ubiquitin ligases of particular species, detailed studies of the evolution of several E3 gene families in the animal and plant kingdoms have been performed11–24. However, similar studies are lacking in other groups, most particularly in fungi. Only a few studies focused on the evolution of E3 families have included fungal species, most often the model yeasts Saccharomyces cerevisiae and Schizosaccharomyces pombe7,11–13,18,24,25. This is quite surprising, because fungi have several interesting features that may provide a decisive contribution to the understanding of the evolution of ubiquitination. First, general studies indicated that fungi have a particularly simple ubiquitination machinery7, a fact that remains unexplained. Second, very complete evolutionary studies can be performed, given the large number and diversity of fungal genomes that have been sequenced, which include unicellular and multicellular species26,27, related taxa in which significant gene amplifications and reductions have been detected28,29 and lineages in which whole genome duplications occurred30,31. Also, the fact that fungi are quite close relatives of animals may improve our understanding of the emergence and diversification of the ubiquitination system in this last group.
Here, a comprehensive analysis of the diversity of the HECT ubiquitin ligase protein family in fungi is performed in order to answer the following questions: (1) When the current diversity of fungal HECT genes emerged. In this study, microsporidia and cryptomycota will be considered as fungi, although whether they are better classified as the closest relatives of fungi sensu stricto remains an unsolved taxonomic problem, which depends on how the kingdom is defined32; (2) Whether amplifications or reductions of the HECT family have occurred, associated to general changes in genome complexity; (3) Whether the transitions from unicellularity and multicellularity (or vice versa) have had any effect on the sets of HECT genes; (4) How fungal HECT ubiquitin ligases relate to those present in animals and whether conservation of the functions of orthologous genes in both phyla exist; and, (5) Using comparative data from animals, plants and fungi, to establish the HECT genes already present in the ancestor of the three kingdoms. A general view of the evolution of these ubiquitin ligases in fungi, as well as insights on their early evolution in all eukaryotes, are thus obtained.
Results
Global diversity of fungal HECT ubiquitin ligases is low
As indicated in the Methods section, 5455 hits were detected in sequence databases when they were queried with the Saccharomyces cerevisiae HECT family proteins and a final dataset of 2899 HECT domains was obtained after eliminating duplicated and partial sequences. These 2899 sequences were aligned (Supplementary File 1) and a prospective maximum-likelihood phylogenetic analysis performed. It determined the presence of five well-supported groups, each of them including one of the five genes present in S. cerevisiae (Fig. 1). Only three sequences were excluded from these groups, being one of them, supposedly from the fly parasite Entomophthora muscae (GENB01017218.1), a contaminant derived from the host fly. The other two were true divergent sequences. These results suggest that the diversity of HECT proteins in fungi is low, similar to the diversity detected in plants23 but much lower than that observed in animals20.
A detailed analysis of 39 model species refines the classification of fungal HECTs
The results just described must be considered incomplete for two reasons. First, these searches may have unearthed only a fraction of the fungal HECT proteins. If some of them are extremely divergent, they may be so different from the S. cerevisiae sequences used to query the databases as to remain undetectable. Second, some significant sequences could have been detected but only partially retrieved and therefore subsequently eliminated. In order to avoid these problems, specific searches were performed for 38 species that comprehend the main fungal taxa (Table 1). As expected, several sequences that had been eliminated as partial were detected, correctly reconstructed and added to the dataset. In addition, very interestingly, a few additional, highly divergent sequences were also found.
Table 1.
Species | Taxonomic group | RSP5 | TOM1 | HUL5 | UFD4 | HUL4 | HUL4A | HUL4-like | Other | No. HECT genes | Total No. genes |
---|---|---|---|---|---|---|---|---|---|---|---|
Aspergillus niger | Ascomycota, pezizomycotina, eurotiomycetes | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 6 | 10785 |
Penicillium chrysogenum | Ascomycota, pezizomycotina, eurotiomycetes | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 6 | 11460 |
Coccidiodes immitis | Ascomycota, pezizomycotina, eurotiomycetes | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 6 | 9905 |
Pyrenophora tritici-repentis | Ascomycota, pezizomycotina, dothiomycetes | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 5 | 12300 |
Mycosphaerella graminicola | Ascomycota, pezizomycotina, dothiomycetes | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 6 | 10964 |
Chaetomium globosum | Ascomycota, pezizomycotina, sordariomycetes | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 5 | 11232 |
Fusarium verticilloides | Ascomycota, pezizomycotina, sordariomycetes | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 6 | 16012 |
Sclerotinia sclerotiorum | Ascomycota, pezizomycotina, leotiomycetes | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 6 | 14637 |
Tuber melanoporum | Ascomycota, pezizomycotina, pezyzomycetes | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 6 | 7496 |
Saccharomyces cerevisiae | Ascomycota, saccharomycotina | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 5 | 5949 |
Kluyveromyces lactis | Ascomycota, saccharomycotina | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 5 | 5379 |
Candida albicans | Ascomycota, saccharomycotina | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 5 | 6263 |
Yarrowia lipolytica | Ascomycota, saccharomycotina | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 5 | 7086 |
Schizosaccharomyces pombe | Ascomycota, taphrinomycotina | 3 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 7 | 6953 |
Pneumocystis carinii | Ascomycota, taphrinomycotina | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 4 | 3695 |
Laccaria bicolor | Basidiomycota, agaricomycotina | 2 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 4 | 18264 |
Cryptococcus neoformans var. grubii | Basidiomycota, agaricomycotina | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 4 | 6617 |
Puccinia graminis | Basidiomycota, pucciniomycotina | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 5 | 16309 |
Ustilago maydis | Basidiomycota, ustilaginomycotina | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 5 | 6910 |
Mortierella alpina | Mucoromycotina | 1 | 1 | 2 | 1 | 1 | 2 | 0 | 0 | 8 | 9909 |
Rhizopus oryzae | Mucoromycotina | 2 | 2 | 2 | 1 | 1 | 2 | 0 | 0 | 10 | 14134 |
Phycomyces blakesleeanus | Mucoromycotina | 2 | 1 | 2 | 1 | 1 | 1 | 0 | 0 | 8 | 16850 |
Entomophthora muscae | Zoopagomycota | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 5 | ? |
Piromyces finnis | Neocallimastigomycota | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 4 | 10978 |
Pecoramyces ruminatium | Neocallimastigomycota | 7 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 10 | 16347 |
Anaeromyces robustus | Neocallimastigomycota | 2 | 1 | 1 | 2 | 0 | 1 | 0 | 0 | 7 | 13083 |
Spizellomyces punctatus | Euchytrids, spizellomycetales | 1 | 1 | 2 | 1 | 1 | 1 | 0 | 0 | 7 | 9169 |
Allomyces macrogynus | Blastocladiomycota | 3 | 1 | 1 | 2 | 1 | 2 | 0 | 0 | 10 | 19282 |
Encephalitozoon cuniculi | Microsporidia | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 2 | 2029 |
Ordospora colligata | Microsporidia | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 2 | 1879 |
Nosema ceranae | Microsporidia | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 2 | 4 | 2678 |
Hamiltosporidium tvaerminnensis | Microsporidia | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 4 | 5 | 2174 |
Nematocida parisii | Microsporidia | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 3 | 2724 |
Mitosporidium daphniae | Microsporidia | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 3 | 6 | 3330 |
Rozella allomycis | Cryptomycota | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 6 | 6350 |
Fonticula alba | Cristidiscoidea, Fonticulida | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 2 | 5 | 6457 |
Total number of genes: | 45 | 36 | 31 | 31 | 19 | 22 | 5 | 14 | 203 |
The final dataset for the 38 species contained 198 sequences. Five HECTs detected in the slime mold Fonticula alba were added to this dataset. Fonticula belongs to the Cristidiscoidea33, considered the sister group of the fungi sensu stricto/microsporidia/cryptomycota evolutionary lineage32,34 and it is thus a useful outgroup. Figure 2 shows a summary of the maximum likelihood (ML), neighbor-joining (NJ) and maximum parsimony (MP) trees obtained from the alignment of the 203 sequences (Supplementary File 2). It is obvious that Figs 1 and 2 are almost identical, except that Fig. 2 includes a few additional, highly divergent sequences, all but two coming either from Fonticula or from early-diverging fungal species, such as microsporidia or Rozella, and that the HUL4 group detected in our original analyses (Fig. 1) appears now subdivided into three subgroups. The name HUL4 was kept for the one that contains the S. cerevisiae HUL4 gene, while the other two were named HUL4A and HUL4-like. The latter includes just five microsporidian sequences. The differences between Figs 1 and 2 are caused by the presence of two highly divergent sequences (those from Ustilago and Rozella in Fig. 2), which were absent in the original dataset. The branch corresponding to the original HUL4 group plus these two additional sequences still had a very significant support in ML analyses (bootstrap = 99%) but it was not recovered either in NJ or MP analyses. Additional trees (Fig. 3) were generated for the sensu stricto fungal lineages, i. e. eliminating the sequences from microsporidia and Rozella, As expected, all the remnant sequences but the atypical one from Ustilago just mentioned, were included in the groups already observed in Fig. 2.
Patterns of HECT genes diversification in fungi
For the 39 model species analyzed, the distribution of genes among the groups indicated in Figs 2 and 3 are detailed in Table 1. The number of HECT genes in these species ranges from 2 to 10 (average: 5.2). Most fungi contain one or a few genes of the RSP5, TOM1, HUL4, HUL4A, HUL5 and UFD4 groups. The most prominent exception regards microsporidians, which lack several or even all of those genes. On the other hand, only a large amplification has been observed: seven RSP5 sequences were detected in Pecoramyces ruminatium (Table 1). Data in that Table can be summarized in an evolutionary context. Figure 4 shows the most parsimonious explanation for the pattern of gene presence/absence detailed in Table 1. Information in that figure can be used to obtain a classification of HECT sequences into subfamilies, as follows: (1) At least three groups, RSP5, TOM1 and UFD4, emerged before the fungi/Fonticula split and other three (HUL4, HUL4A and HUL5) were also present in the last common ancestor of all fungi. These results indicate that HUL4 and HUL4A sequences are better classified into two different subfamilies, as was already suggested by the trees summarized in Figs 2 and 3; (2) Microsporidian HUL4-like genes may be highly modified HUL4 genes, given that the supposed appearance of these HUL4-like genes coincides in time (i. e. occurs in the same branch of the tree) with the apparent loss of HUL4; and, (3) Identical logic can be applied to the two peculiar HUL4-related sequences detected in Ustilago and Rozella (Fig. 2), which can also be interpreted as extremely divergent HUL4 genes. Thus, only six HECT subfamilies must be defined: RSP5, TOM1, UFD4, HUL4, HUL4A and HUL5, with just a few sequences of Fonticula and microsporidia (isolated from these subfamilies; see Fig. 2) not being assignable to any of them.
Figure 4 indicates that some losses and duplications have occurred in particular lineages. The genes most frequently lost are HUL4 and HUL4A and the most commonly duplicated gene is RSP5, which is moreover the only one for which 3 or more paralogs are detected in some species (Table 1). As already indicated, an obvious early reduction of the number of canonical HECT genes is detected in microsporidians, which may be related to the know genome simplifications occurred in this group of parasites29,35. It is also clear that whole genome duplications (blue ellipses in Fig. 3) may have contributed to the emergence of new HECT genes in some species (as in Allomyces macrogynus), but not in others (e. g. Saccharomyces cerevisiae). A significant correlation between the number of HECT genes and the total number of genes in a given species (Table 1) was observed. The Spearmann rank order correlation coefficient is 0.576 (p = 0.0003) when all species are considered (except Entomophthora muscae, for which, to our knowledge, no estimation of the number of genes is available). This correlation is significant even when microsporidians are excluded (Spearmann coefficient = 0.428, p = 0.02), indicating that the general trends of genomic reductions/amplifications have had an impact on the conservation or loss of HECT genes. On the contrary, the transitions to multicellularity or back from multicellularity to unicellularity giving rise to yeasts such as S. cerevisiae or S. pombe27 did not have a significant impact on the evolution of the HECT family. It is particularly significant that the emergence of all fungal HECT subfamilies occurred before multicellular fungi arose27.
Comparative analysis of animal and fungal HECT genes
A comparison between the sets of HECT genes in metazoa and fungi may provide additional insights on their early evolution. Figures 5 and 6 respectively show the maximum likelihood (ML) and neighbor-joining (NJ) trees corresponding to an alignment (Supplementary File 3) of the 203 sequences described above, plus a dataset already used in previous studies20,23,36, which includes HECT sequences of animals and the choanoflagellate Monosiga brevicollis as an outgroup. Both trees are largely congruent and show that all fungal genes have close animal relatives. Thus, fungal RSP5 genes appear in a well-supported branch together with the nine animal genes of the NEDD4 subfamily (see ref.20 for details). Similarly, TOM1 appears together with animal HUWE1, in both trees with significant statistical support (see Figs 5 and 6). Also, HUL5 is related to the UBE3B/3C subfamily genes. However, in this case which of the animal genes is the most similar to HUL5 is unclear, because both trees provide different answers. Finally, the fungi HUL4 and HUL4A genes appear close in the trees to the animal UBE3A, HECTD2, HECTX and Small HERC subfamilies (Figs 5 and 6). Again, the exact relationships of the two fungal genes with the animal genes are unclear, all branches having low bootstrap support in one or both trees. Moreover, the HUL4A sequences appear divided into two or three groups, depending on the particular analysis performed.
Animal HECT proteins often have additional, subfamily-specific protein domains20. Because the acquisition of domains is a rare event, the presence of the same domain in proteins of different lineages, such as animals and fungi, may be used as an additional kind of evidence to establish orthology relationships. A total of 1096 predicted full-length proteins out of the 2899 sequences in our original dataset (characterized by having accession numbers starting with “XP_” or “NP_” in Supplementary File 1) were retrieved and searched for domains present in the Pfam database (see Methods). Out of 226 RSP5 sequences analyzed, 207 had the structure [C2 domain − 3 × WW domain − HECT domain] (see Fig. 3 for a scheme; from now on, protein structures are indicated in brackets, “3×” indicates three identical domains in tandem). This structure is very similar to the one most typical of animal NEDD4 subfamily proteins, [C2 − 4 × WW − HECT], thus confirming the close relationship among RSP5 and NEDD4 subfamily genes detected in the phylogenetic trees (Figs 5 and 6). Most of the 19 remnant RSP5 sequences had related but simpler structures ([3 × WW − H], [2 × WW − H], etc. where H refers to the HECT domain). These exceptions may be in some cases real, but most likely correspond to incorrectly annotated proteins (which typically are either artificially truncated proteins or very long proteins produced by the spurious fusion of two adjacent genes). The [C2 − N × WW − HECT] structure is also present in the choanoflagellate Monosiga brevicollis20 but not in plant HECTs23.
Similarly, 204 full-length TOM1 sequences were obtained. For 134 of them, the structure [DUF908 – DUF913 – DUF4414 – HECT] (DUF: “Domain of Unknown Function”) was obtained, 37 had the structure [DUF908 – DUF913 – UBA – DUF4414 – HECT] (examples of these two main structures are shown in Fig. 3) and the other 33 corresponded either to potentially truncated proteins lacking some of those domains or to proteins with two DUF908 or DUF913 domains instead of a single one. These results support that fungal TOM1 genes and animal or choanoflagellate HUWE1 genes20 are orthologs, as suggested by the phylogenetic trees (Figs 5 and 6), because HUWE1 proteins have a very similar domain composition (e. g. [DUF908 – DUF913 – UBA – WWE - DUF4414 – HECT] in human HUWE1). In addition, plant Subfamily V proteins, which also appear together with animal HUWE1 genes in phylogenetic trees23 also have a [DUF908 – DUF913 – UBA – DUF4414 – HECT] domain composition. Thus, it can be concluded that fungal TOM1, animal/choanoflagellate HUWE1 and plant Subfamily V genes all derive from an ancient gene, present in the common ancestor of the three kingdoms.
A total of 200 putative full-length HUL5 sequences were analyzed. In Pfam searches, 189 contained just the HECT domain, 5 had a [IQ – HECT] structure and 6 had other structures. The finding of some sequences with IQ domains was very interesting because that domain is present in both animal UBE3B and UBE3C proteins20, the closest relatives of HUL5 (Figs 5 and 6), and also in the very similar Subfamily II and Subfamily III proteins in plants23. However, the fact that apparently only a few sequences contained that domain was puzzling. It was observed that InterProScan analyses (see Methods), which use domain definitions of different structural databases, detected additional IQ domains in HUL5 proteins. This occurred when the Prosite definition of that domain, instead of the Pfam definition, was used. When all the HUL5 proteins were scanned for IQ domains in Prosite, it turned out that those domains were detected in 116 of them (see example in Fig. 3). In conclusion, these results together with the findings described before23 indicate the presence of an ancient gene, progenitor of fungal HUL5, animal UBE3B and UBE3C and plant Subfamily II and III genes, in the common ancestor of the three kingdoms.
A set of 258 HUL4, HUL4A and HUL4-like full-length proteins was obtained. 170 were found to contain only a HECT domain, 84 had an [AZUL – HECT] structure (Fig. 3) and only 4 had other structures. The AZUL domain is typical of UBE3A proteins and was here found only in HUL4A sequences. These [AZUL – H] structures have never been detected in plants23, but, very significantly, Pfam searches showed that they can be found in some proteins in alveolates (Tetrahymena thermophila, Oxytricha trifallax) or parabasalians (Trichomonas vaginalis), which suggests a very ancient origin. Neither HUL4 proteins nor animal HECTD2 proteins, which appear as very close in Figs 5 and 6, have any detectable domains besides, of course, the HECT domain. The same is true for animal HECTX proteins, which are also similar to HUL4 and HUL4A proteins (Figs 5 and 6). Therefore, structures do not provide in these cases additional information to assign orthology relationships. Interestingly, plants also have genes, those of Subfamily IV, which encode proteins very similar to HECTD2, UBE3A and HECTX23. These proteins also lack additional domains. Thus, although the exact orthology relationships cannot be determined in this case, it is reasonable to hypothesize that a gene encoding proteins with [H] or [AZUL – H] structures was present in the ancestor of plants, animals and fungi from which all these modern genes descend, as already suggested before23.
Finally, 208 full-length UFD4 sequences were obtained. All but four had the simplest possible structure, containing only the HECT domain. The four exceptions corresponded to annotation errors that put together two adjacent genes, In both ML and NJ analyses, UFD4 appear close to animal TRIP12 and HECTD1, in a strongly supported branch (Figs 5 and 6). Because the animal proteins contain additional domains, structural data cannot be used in this case to confirm orthology20. However, significantly, plants also have genes (Subfamily I) that encode proteins very similar to TRIP12 and HECTD123. Thus, it is again a reasonable hypothesis that these genes in fungi, animals and plants are co-orthologs, all deriving from a single common ancestral gene, as also proposed before23.
Assuming that all these results are correct, the conclusions derived from the patterns of presence/absence of genes in fungal species that were summarized in Fig. 4 should be slightly modified. Adding the new information, it is most likely that HUL4A and HUL5 are older than it was deduced using only fungal data. The comparisons with animal HECTs strongly suggest that these genes were present in the common ancestor of fungi and Fonticula alba and subsequently lost in the lineage that gave rise to the latter species.
Conservation of HECT protein-protein interactions in fungi and animals
Indirect evidence for functional conservation may be inferred from comparative protein-protein interaction data. All well-supported interacting partners characterized so far for Saccharomyces cerevisiae HECT proteins (found in at least two independent experiments) were downloaded from BioGRID (see Methods). A total number of 101 partners, namely 85 interactors with RSP5, 12 with TOM1 and 4 with UFD4, were obtained, while no interactions were recovered for either HUL4 or HUL5. In 21 cases, the human orthologs of those interactors were also found to interact with the corresponding human HECT proteins (Table 2). These are therefore ancient protein-protein interactions, conserved in both S. cerevisiae and H. sapiens. It is significant that these potentially conserved interactions involved RSP5 in yeasts and the orthologous proteins of the human NEDD4 subfamily in seventeen of these 21 cases.
Table 2.
S. cerevisiae HECTs | S. cerevisiae interacting proteins | Homo sapiens HECTs | H. sapiens interacting proteins | Function of HECT-interacting proteins in yeast | References |
---|---|---|---|---|---|
RSP5 | UBC1 | NEDD4L, ITCH, SMURF1, WWP2 | UBE2K | Ubiquitin-conjugating enzyme | 69 |
RSP5 | UBC4, UBC5 | NEDD4, NEDD4L, HECW1, HECW2, ITCH, SMURF1, SMURF2, WWP1, WWP2 | UBE2D1, UBE2D2, UBE2D3, UBE2E1, UBE2E3 | Ubiquitin-conjugating enzymes | 69 |
RSP5 | UBC6 | NEDD4L, HECW1, ITCH, WWP2 | UBE2J2 | Ubiquitin-conjugating enzyme | 69 |
RSP5 | UBI4 | HECW1 | UBC | Polyubiquitin | 70 |
RSP5 | RPN10 | NEDD4, NEDD4L, HUWE1, SMURF1 | PSMD4 | 26S proteasome regulatory subunit | 71 |
RSP5 | RPO21 | NEDD4, WWP1, WWP2, ITCH, SMURF1 | POLR2A | DNA-directed RNA polymerase II subunit | 72 |
RSP5 | RPB2 | NEDD4, HECW2, ITCH, WWP1 | POLR2B/RPB2 | DNA-directed RNA polymerase II subunit | 72 |
RSP5 | RPB3 | NEDD4, ITCH, WWP1, WWP2 | POLR2C | DNA-directed RNA polymerase II subunit | 72 |
RSP5 | RPB5 | NEDD4, ITCH, WWP2 | POLR2E | DNA-directed RNA polymerases I, II, and III subunit | 72 |
RSP5 | RPB7 | WWP1 | POLR2G | DNA-directed RNA polymerase II subunit | 72 |
RSP5 | SEC7 | ITCH | ARFGEF2 | Vesicular transport protein, guanine exchange factor | 73 |
RSP5 | HSE1 | ITCH | STAM1, STAM2 | Vacuolar protein-sorting machinery | 74 |
RSP5 | HSP82 | WWP1 | HSP90AA1 | ATP-dependent molecular chaperone | 75 |
RSP5 | LHP1 | SMURF1 | SSB | tRNA maturation control | 76 |
RSP5 | NFI1, SIZ1 | SMURF2 | PIAS3 | Sumo conjugation | 77 |
TOM1 | MLC1 | HUWE1 | MYL6B | Myosin light chain | 78 |
TOM1 | HSP82, HSC82 | HUWE1 | HSP90AA1, HSP90AB1, HSP90B1 | ATP-dependent molecular chaperones | 75 |
UFD4 | UBC4 | TRIP12 | UBE2D1, UBE2D4 | Ubiquitin-conjugating enzyme | 69 |
Discussion
The results described here for the fungal HECT genes, together with those obtained before for animal and plant HECTs20,23 provide a general view of the patterns of evolution of this family. It has been concluded that the common ancestor of all fungi had six different HECT genes: RSP5, TOM1, UFD4, HUL4, HUL4A and HUL5 (Fig. 4). These six genes were already present before the split between the microsporidian and Rozella lineages and the lineage that gave rise to all fungi sensu stricto (Fig. 4), so that conclusion holds no matter how the kingdom fungi is defined32. Unusual genes that may not be orthologous to any of those six have been found only either in very early diverging species (microsporidians, Rozella) or, in a single case, in the basidiomycete Ustilago maydis (Table 1). However, as already indicated, some of these uncommon genes probably are highly divergent members of the HUL4 subfamily.
The six ancestral genes have been conserved in most fungal species; only occasional, independent losses or duplications are observed in particular lineages (Fig. 4). The exception to this rule are microsporidians, in which severe genome reductions occurred associated to their intracellular lifestyle29,35, which are reflected in multiple losses of HECT genes (Table 1). It is interesting that, after a general reduction early in microsporidian evolution, lineage-specific expansions of multiple gene families, similar to those deduced to have occurred for microsporidian HECTs (see arrows in Fig. 3), have been described37. As indicated above, the number of HECT genes correlates with the total number of genes in the fungal genomes analyzed, so whole-genome duplications, genome amplifications and genome reductions may all have impinged on HECT gene numbers, as occurs in other protein families (e. g. ref.38). On the contrary, the transitions from unicellularity to multicellularity or vice versa does not seem to have had a significant impact on the number of HECT genes present. Not only multicellular species, such as Laccaria bicolor, Tuber melanosporum, etc., do not have more HECT genes than the unicellular ancestor of all fungi, but also yeasts as S. pombe have actually increased that number respect to its multicellular ancestors (Fig. 3)39,40. This situation is very different from what it was found in animals, in which multiple new subfamilies emerged in multicellular organisms after the split of animals from unicellular choanoflagellates20, but agrees well with the pattern observed in plants23. It can be concluded that changes in the number or diversity of HECT genes is not necessarily required for the transition from unicellular to multicellular organisms (or vice versa) to occur.
In fungi, RSP5 is the most conserved gene (Table 1, Fig. 4). Only some microsporidians lack an RSP5 ortholog, and a single event must be postulated to explain that fact (Fig. 4). In S. cerevisiae, RSP5 is the only HECT gene whose deletions cause lethality and RSP5 protein perform fundamental roles in multiple cellular process41,42. This central metabolic position of RSP5, if general in all fungi, may explain why losses of this gene are so uncommon (Fig. 4). On the contrary, the genes that are most commonly lost are HUL4A and HUL4. The roles of the first are totally unknown. For HUL4, the functional information is very limited. In S. cerevisiae, it may have a specific role in sporulation43,44.
Protein-protein interaction data provide clues about long-range functional conservation. Seven of the 21 cases described in Table 2 involve proteins that belong to the ubiquitination or proteasome machinery, being ubiquitin-conjugating (E2) enzymes, the most frequently characterized conserved partners of HECTs. All these interactions are easily explained by conservation of the fundamental ubiquitination machinery since the common ancestor of animals and fungi. In addition, some interactions already studied in detail from a functional point of view and that correspond to known functions of these proteins (e. g. RSP5 and NEDD4 - RNA polymerase II45,46; RSP5/ITCH - SEC. 7/ARFGEF247; RSP5/SMURF2 - SIZ1/PIAS3)48,49 were also found. For some of the other (e. g. RSP5-HSE1)50, the functional connection was known in yeasts but not, to our knowledge, in humans or other mammals. These results all support similar roles for fungal RSP5 and animal NEDD4 proteins, emerged before the two lineages split. Data in Table 2 also suggest a potential functional redundancy or collaboration of NEDD4 subfamily members regarding ubiquitination of RNA polymerase subunits. The only work that tackled this issue detected that only NEDD4, but not WWP1, WWP2 or SMURF1, was able to ubiquitinate RNA Pol II in vivo and NEDD4 and SMURF1, but not WWP1 or WWP2, were able to perform that reaction in an in vitro assay46. Additional research seems advisable, given that Table 2 results suggest that as many as six NEDD4 subfamily proteins may interact and potentially ubiquitinate RNA Pol II units.
As described in detail in the Results section, it has been deduced that the common ancestor of plants, animals and fungi already had four genes from which all observed today in those three kingdoms are derived. For TOM1 and HUL5, both high sequence similarity and similar protein domains have been detected in plant and animal orthologs (Figs 5 and 6; ref.23). For three other fungal genes, UFD4, HUL4 and HUL4A, it can also be proposed that orthologs/co-orthologs in animals and plants exist, all of them originating from two ancestral genes (see above). A caveat of this hypothesis is that it is mostly based on sequence similarity. Only the presence of an AZUL domain in some fungal HUL4A and in animal UBE3A proteins offers additional support for the corresponding genes being orthologous. Finally, RSP5 is clearly the fungal ortholog of the genes of the animal NEDD4 subfamily, a fact supported by sequence similarity, structural data and also functional data (Table 2). In plants, no RSP5/NEDD4 genes are present23.
It is interesting to compare these conclusions with those obtained in a previous work in which a “genomic survey”, i. e. a limited sampling of HECT sequences of all eukaryotes, was performed25. Those authors suggested that at least six genes were present in the last eukaryotic common ancestor (which they assumed was the same as the common ancestor of plants, animals and fungi, although this is not totally clear)51,52 and no less than nine in the ancestor of all fungi. It is simple to point out some basic mistakes in that study: (1) Extremely shallow and erroneous species sampling. We already showed in a previous work23 that the conclusions of those authors regarding plant species were incorrect, because they totally missed one of the subfamilies of plant HECTs, as well as an additional angiosperm-specific HECT lineage, due to poor species sampling. Regarding fungi, they considered only eight species, and, moreover, they failed to include any early-diverging species (microsporidia, cryptomycota) or any fungi-related protozoan lineage. Thus, they totally missed the HUL4-like genes and the highly divergent Fonticula-specific and microsporidian-specific genes characterized in this study; (2) Improper use of statistical support in phylogenetic trees. It can be observed by simply inspecting their figures that many of the critical branches from which they derived their main conclusions lack significant support. For example, two groups of sequences that they called “classes” and that they interpreted as indicating the existence of two corresponding genes present in the ancestor of all eukaryotes, had bootstrap support in ML analyses as low as 6% and 8%, respectively. The same occurs for about half of the groups that they call “subfamilies”, supposedly groups of orthologs in different eukaryotic taxa. On top of this, support was deemed to be sufficient or not haphazardly. Thus, a “class” was defined with as low as 60% Bayesian support and 8% ML support but the orthologous group HUL5/UBE3C/Plant subfamilies II + III, which they detected with a higher support (respectively, 70% and 11%) was not deemed significant, which led them to erroneously conclude that genes that not only have very similar sequences, but also all encode complex [DUF908 – DUF913 – (UBA) – DUF4414 – HECT] proteins, emerged three times independently; and, (3) No attempt to establish the true phylogenetic range of the genes analyzed. For example, they concluded that there are seven subfamilies of HECT genes in fungi instead of six, because they confused a HUL5 duplicate which is mucoromycota-specific and absent in all other fungi (see Fig. 3) with an ancient UBE3B ortholog.
It is easy to appreciate that combining a very shallow species sampling with lack of rigour when assessing the statistical support of phylogenetic trees leads to a systematic bias in the interpretation of the data. This occurs because any sequence found in just one model species, or a few closely related ones, will be assumed to already exist when the taxon that includes that/those species emerged (as an example, the mucoromycota-specific HUL5 genes just mentioned were implicitly assumed to be already present in the ancestor of all fungi). Of course, the alternative explanation, i. e. that the duplication is very recent, simply cannot be tested if species sampling is too superficial. Now, if a mistaken interpretation of recent duplicates as ancient genes is mixed together with a permissive acceptance of minimal statistic support for tree branches, then: (1) paralogous genes emerged in different groups will be often confused as orthologs; (2) all the genes merged together in a false orthology group will be assumed to be much older than actually are; and, (3) if the error involves distantly-related species, that “gene” will show an evolutionarily patchy distribution, with many independent losses having to be hypothesized to explain its presence in such distant relatives. Such a systematic bias explains why it was concluded by those authors that so many HECT genes were present early in eukaryotic evolution, leading them to necessarily compensate, in order to fit the data, by hypothesizing abundant losses of HECT genes, with a considerable number of them being lost many times independently. This extreme dispensability of multiple HECT family genes is not only biologically implausible, but also not supported at all by our three specific, complete studies in animals, plants and fungi (refs20,23 and this work). The (obvious) corollary is that analyzing the available information in full and in depth is advisable if the goal is to truly understand the evolution of a complex gene family.
Methods
Because the HECT domain is long (about 350 amino acids) and evolutionarily well conserved, it is easy to generate comprehensive databases of HECT ubiquitin ligases in any given lineage. Fungal HECT sequences were obtained using TblastN searches against the nr, wgs, est, tsa, gss, and htgs databases of the National Center for Biotechnology Information (NCBI; http://www.ncbi.nlm.nih.gov/), which were queried with the sequences of the HECT domains of Saccharomyces cerevisiae proteins. From 5455 positive hits, a final dataset of 2899 HECT domain sequences was obtained once duplicated and truncated sequences were eliminated. This dataset was aligned using ClustalX 2.153 and manually corrected using GeneDoc 2.754. The final alignment can be found in Supplementary File 1. Additional species-specific searches for 38 fungi and the protozoan Fonticula alba (see Table 1) were performed screening the same databases, but this time using as query sequences members of the five main groups detected when preliminary phylogenetic analyses were performed (Fig. 1) that belonged to species that were distantly related to S. cerevisiae according to fungi phylogenies (microsporidians, Allomyces, Piromyces, etc; Fig. 4). These searches quickly become saturated, i. e. the same sequences were repeatedly detected, indicating that no further HECT sequences were present in the databases. All these potential HECT domain sequences were examined in detail, combining protein alignments and searches in the Pfam database of protein domains (http://pfam.xfam.org/) to determine whether they indeed corresponded to bona fide HECT domain sequences. To obtain the full-length domains, it was sometimes necessary (especially when small exons, difficult to detect in TblastN searches, existed), to retrieve, examine and conceptually translate the corresponding nucleotide sequences. The final dataset for the 39 selected species (203 sequences) was aligned following the same methods described above. That alignment can be found in Supplementary File 2.
Phylogenetic trees were obtained with the programs IQ-TREE 1.5.5 (for Maximum Likelihood, ML, analyses)55, Mega 7.0.26 (Neighbor-Joining, NJ)56 and PAUP* 4.0 beta 10 (Maximum Parsimony, MP)57. The best model for ML analyses was determined using ModelFinder58. In our hands, single IQ-TREE analyses with default parameters (i. e. 1000 ultrafast bootstrap replicates59, 100 iterations of the nearest-neighbor interchange algorithm, perturbation strength = 0.5) generated clearly erroneous trees, corresponding to local optima. Therefore, and following the recommendations described in ref.55, searches were replicated 10 times with each of two different perturbation strengths (0.5 and 0.8) and the number of unsuccessful iterations to stop the tree search was increased to 500 in all cases. For each ML analysis, 1000 ultrafast bootstrap replicates were obtained. For NJ analyses, 1000 bootstrap replicates were also obtained. For MP analyses, which requires longer computation times, the tree bisection-recognition (TBR) search was used and 100 bootstrap replicates were generated. For the combined analysis of animal and fungal HECT, in addition of the ML and NJ trees shown in Figs 4 and 5, a maximum parsimony analysis was performed. However, due to limitations of the MP heuristic searches when so many sequences are analyzed, it did not provide significant support for the internal branches of the tree (not shown).
Most protein domain searches were implemented using the Pfam batch search (http://pfam.xfam.org/search#tabview=tab1). Only in particular cases, indicated in the text, the InterProScan searches (https://www.ebi.ac.uk/interpro/search/sequence-search) or the ScanProsite batch search at Prosite (https://prosite.expasy.org/scanprosite/) were used. The Pfam and InterPro (https://www.ebi.ac.uk/interpro/) databases were searched for particular combinations of domains. The total number of genes for each species (right column in Table 1) were obtained from the corresponding genome pages at the NCBI (https://www.ncbi.nlm.nih.gov/genome/) except for Mortierella alpina60, Rhizopus oryzae61, Pecoramyces ruminatium62 and Hamiltosporidium tvaerminnensis63. Correlation analyses were performed using SigmaPlot 13 (http://www.sigmaplot.co.uk). Finally, protein-protein interaction data for both Saccharomyces cerevisiae and Homo sapiens (Table 2) were downloaded from BioGRID (https://thebiogrid.org/) in October 2017. The S. cerevisiae proteins that were found to interact with HECT ubiquitin ligases in at least two experiments were compared using TBlastP with all H. sapiens proteins to detect potential orthologs. When significant sequence similarity was detected (minimal Expect value < e−10), the literature was scanned for studies indicating that those were true orthologous genes (“References” in Table 2).
Electronic supplementary material
Competing Interests
The author declares no competing interests.
Footnotes
Electronic supplementary material
Supplementary information accompanies this paper at 10.1038/s41598-018-24914-x.
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Glickman MH, Ciechanover A. The ubiquitin-proteasome proteolytic pathway: destruction for the sake of construction. Physiol Rev. 2002;82:373–428. doi: 10.1152/physrev.00027.2001. [DOI] [PubMed] [Google Scholar]
- 2.Kerscher O, Felberbaum R, Hochstrasser M. Modification of proteins by ubiquitin and ubiquitin-like proteins. Annu Rev Cell Dev Biol. 2006;22:159–180. doi: 10.1146/annurev.cellbio.22.010605.093503. [DOI] [PubMed] [Google Scholar]
- 3.Kulathu Y, Komander D. Atypical ubiquitination – the unexplored world of polyubiquitin beyond Lys48 and Lys63 linkages. Nat Rev Mol Cell Biol. 2012;13:508–523. doi: 10.1038/nrm3394. [DOI] [PubMed] [Google Scholar]
- 4.Nunoura T, et al. Insights into the evolution of Archaea and eukaryotic protein modifier systems revealed by the genome of a novel archaeal group. Nucl Acids Res. 2011;39:3204–3223. doi: 10.1093/nar/gkq1228. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Spang A, et al. Complex archaea that bridge the gap between prokaryotes and eukaryotes. Nature. 2015;521:173–179. doi: 10.1038/nature14447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Zaremba-Niedzwiedzka K, et al. Asgard archaea illuminate the origin of eukaryotic cellular complexity. Nature. 2017;541:353–358. doi: 10.1038/nature21031. [DOI] [PubMed] [Google Scholar]
- 7.Hutchins AP, Liu S, Diez D, Miranda-Saavedra D. The repertoires of ubiquitinating and deubiquitinating enzymes in eukaryotic genomes. Mol Biol Evol. 2013;30:1172–1187. doi: 10.1093/molbev/mst022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Petroski MD, Deshaies RJ. Function and regulation of cullin-ring ubiquitin ligases. Nat Rev Mol Cell Biol. 2005;6:9–20. doi: 10.1038/nrm1547. [DOI] [PubMed] [Google Scholar]
- 9.Deshaies RJ, Joazeiro CAP. RING domain E3 ubiquitin ligases. Annu Rev Biochem. 2009;78:399–434. doi: 10.1146/annurev.biochem.78.101807.093809. [DOI] [PubMed] [Google Scholar]
- 10.Zheng N, Shabek N. Ubiquitin ligases: Structure, function and regulation. Annu Rev Biochem. 2017;86:14.1–14.29. doi: 10.1146/annurev-biochem-060815-014922. [DOI] [PubMed] [Google Scholar]
- 11.Marín I, Ferrús A. Comparative genomics of the RBR family, including the Parkinson’s disease-related gene Parkin and the genes of the Ariadne subfamily. Mol Biol Evol. 2002;19:2039–2050. doi: 10.1093/oxfordjournals.molbev.a004029. [DOI] [PubMed] [Google Scholar]
- 12.Kong H, et al. Highly heterogeneous rates of evolution in the SKP1 gene family in plants and animals: Functional and evolutionary implications. Mol Biol Evol. 2004;21:117–128. doi: 10.1093/molbev/msh001. [DOI] [PubMed] [Google Scholar]
- 13.Lucas JI, Arnau V, Marín I. Comparative genomics and protein domain graph analyses link ubiquitination and RNA metabolism. J Mol Biol. 2006;357:9–17. doi: 10.1016/j.jmb.2005.12.068. [DOI] [PubMed] [Google Scholar]
- 14.Sardiello M, Cairo S, Fontanella B, Ballabio A, Meroni G. Genomic analysis of the TRIM family reveals two groups of genes with distinct evolutionary properties. BMC Evol Biol. 2008;8:225. doi: 10.1186/1471-2148-8-225. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Aguilar-Hernández V, Aguilar-Henonin L, Guzmán P. Diversity in the architecture of ATLs, a family of plant ubiquitin-ligases, leads to recognition and targeting of substrates in different cellular environments. Plos One. 2011;6:e23934. doi: 10.1371/journal.pone.0023934. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Flynn, M., Saha, O. & Young, P. Molecular evolution of the LNX gene family. BMC Evol Biol11, 235 (2011). [DOI] [PMC free article] [PubMed]
- 17.Hua Z, Zou C, Shiu SH, Vierstra RD. Phylogenetic comparison of F-Box (FBX) gene superfamily within the plant kingdom reveals divergent evolutionary histories indicative of genomic drift. Plos One. 2011;6:e16219. doi: 10.1371/journal.pone.0016219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Marín I. Diversification of the cullin family. BMC Evol Biol. 2009;9:267. doi: 10.1186/1471-2148-9-267. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Marín, I. RBR ubiquitin ligases: Diversification and streamlining in animal lineages. J. Mol. Evol. 69, 54–64 (2009). [DOI] [PubMed]
- 20.Marín I. Animal HECT ubiquitin ligases: evolution and functional implications. BMC Evol Biol. 2010;10:56. doi: 10.1186/1471-2148-10-56. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Marín I. Diversification and specialization of plant RBR ubiquitin ligases. Plos One. 2010;5:e11579. doi: 10.1371/journal.pone.0011579. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Marín I. Ancient origin of animal U-box ubiquitin ligases. BMC Evol Biol. 2010;10:331. doi: 10.1186/1471-2148-10-331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Marín I. Evolution of plant HECT ubiquitin ligases. Plos One. 2013;8:e68536. doi: 10.1371/journal.pone.0068536. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Vlasschaert C, Cook D, Xia X, Gray DA. The evolution and functional diversification of the deubiquitinating enzyme superfamily. Genome Biol Evol. 2017;9:558–573. doi: 10.1093/gbe/evx020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Grau-Bové X, Sebé-Pedrós A, Ruiz-Trillo I. A genomic survey of HECT ubiquitin ligases in eukaryotes reveals independent expansions of the HECT system in several lineages. Genome Biol Evol. 2013;5:833847. doi: 10.1093/gbe/evt052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Knoll AH. The multiple origins of complex multicellularity. Annu Rev Earth Planet Sci. 2011;39:217–239. doi: 10.1146/annurev.earth.031208.100209. [DOI] [Google Scholar]
- 27.Nagy, L. G. et al. Six key traits in fungi: Their evolutionary origins and genetic bases. Microbiol. Spectrum5, FUNK-0036-2016 (2017). [DOI] [PubMed]
- 28.Gladieux P, et al. Fungal evolutionary genomics provides insight into the mechanisms of adaptive divergence in eukaryotes. Mol Ecol. 2014;23:753–773. doi: 10.1111/mec.12631. [DOI] [PubMed] [Google Scholar]
- 29.Corradi N. Microsporidia: eukaryotic intracellular parasites shaped by gene loss and horizontal transfers. Annu Rev Microbiol. 2015;69:167–183. doi: 10.1146/annurev-micro-091014-104136. [DOI] [PubMed] [Google Scholar]
- 30.Albertin W, Marullo P. Polyploidy in fungi: evolution after whole-genome duplication. Proc R Soc B. 2012;279:2497–2509. doi: 10.1098/rspb.2012.0434. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Campbell MA, Ganley ARD, Gabaldón T, Cox MP. The case of the missing ancient fungal polyploids. Am Natur. 2016;188:602–614. doi: 10.1086/688763. [DOI] [PubMed] [Google Scholar]
- 32.Richards, T. A., Leonard, G. & Wideman, J. G. What defines the “kingdom” fungi? Microbiol. Spectrum5, FUNK-0044-2017 (2017). [DOI] [PubMed]
- 33.Ruggiero MA, et al. A higher level classification of all living organisms. Plos One. 2015;10:e0119248. doi: 10.1371/journal.pone.0119248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Brown MW, Spiegel FW, Silberman JD. Phylogeny of the “forgotten” cellular slime mold, Fonticula alba, reveals a key evolutionary branch within opisthokonta. Mol Biol Evol. 2009;26:2699–2709. doi: 10.1093/molbev/msp185. [DOI] [PubMed] [Google Scholar]
- 35.Berbee ML, James TY, Strullu-Derrien C. Early diverging fungi: diversity and impact at the dawn of terrestrial life. Annu Rev MicroBiol. 2017;71:41–60. doi: 10.1146/annurev-micro-030117-020324. [DOI] [PubMed] [Google Scholar]
- 36.Henderson JM, et al. Identification of HECT E3 ubiquitin ligase family genes involved in stem cell regulation and regeneration in planarians. Dev Biol. 2015;404:21–34. doi: 10.1016/j.ydbio.2015.04.021. [DOI] [PubMed] [Google Scholar]
- 37.Nakjang S, et al. Reduction and expansion in microsporidian genome evolution: New insights from comparative genomics. Genome Biol Evol. 2013;5:2285–2303. doi: 10.1093/gbe/evt184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Muszewska A, et al. Fungal lifestyle reflected in serine protease repertoire. Sci Rep. 2017;7:9147. doi: 10.1038/s41598-017-09644-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Dujon B. Yeast evolutionary genetics. Nat Rev Genet. 2010;11:512–524. doi: 10.1038/nrg2811. [DOI] [PubMed] [Google Scholar]
- 40.Nagy LG, et al. Latent homology and convergent regulatory evolution underlies the repeated emergence of yeasts. Nat Comm. 2014;5:4471. doi: 10.1038/ncomms5471. [DOI] [PubMed] [Google Scholar]
- 41.Finley D, Ulrich HD, Sommer T, Kaiser P. The ubiquitin-proteasome system of Saccharomyces cerevisiae. Genetics. 2012;192:319–360. doi: 10.1534/genetics.112.140467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Rotin D, Kumar S. Physiological functions of the HECT family of ubiquitin ligases. Nat Rev Mol Cell Biol. 2009;10:398–409. doi: 10.1038/nrm2690. [DOI] [PubMed] [Google Scholar]
- 43.Friedlander G, et al. Modulation of the transcription regulatory program in yeast cells committed to sporulation. Genome Biol. 2006;7:R20. doi: 10.1186/gb-2006-7-3-r20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Lam C, et al. A visual screen of protein localization during sporulation identifies new components of prospore membrane-associated complexes in budding yeast. Eukaryot Cell. 2014;13:383–391. doi: 10.1128/EC.00333-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Beaudenon SL, Huacani MR, Wang G, McDonnell DP, Huibregtse JM. Rsp5 ubiquitin-protein ligase mediates DNA damage-induced degradation of the large subunit of RNA polymerase II in Saccharomyces cerevisiae. Mol Cell Biol. 1999;19:6972–6979. doi: 10.1128/MCB.19.10.6972. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Anindya R, Aygün O, Svejstrup JQ. Damage-induced ubiquitylation of human RNA polymerase II by the ubiquitin ligase Nedd4, but not Cockayne Syndrome proteins or BRCA1. Mol Cell. 2007;28:386–397. doi: 10.1016/j.molcel.2007.10.008. [DOI] [PubMed] [Google Scholar]
- 47.Dehring DAK, Adler AS, Hosseini A, Hicke L. A C-terminal sequence in the guanine nucleotide exchange factor Sec7 mediaates Golgi association and interaction with the Rsp5 ubiquitin ligase. J Biol Chem. 2008;283:34188–34196. doi: 10.1074/jbc.M806023200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Novoselova TV, Rose RS, Marks HM, Sullivan JA. SUMOylation regulates the homologous to E6-AP carboxyl terminus (HECT) ubiquitin ligase Rsp5p. J Biol Chem. 2013;288:10308–10317. doi: 10.1074/jbc.M112.424234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Chandhoke AS, et al. The ubiquitin ligase Smurf2 suppresses TGFβ-induced epithelial-mesenchymal transition in a sumoylation-regulated manner. Cell Death Differ. 2016;23:876–888. doi: 10.1038/cdd.2015.152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Ren Jl, Kee Y, Huibregtse JM, Piper RC. Hse1, a component of the yeast Hrs-STAM ubiquitin-sorting complex, associates with ubiquitin peptidases and a ligase to control sorting efficiency into multivesicular bodies. Mol Biol Cell. 2007;18:324–335. doi: 10.1091/mbc.E06-06-0557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.He D, et al. An alternative root for the eukaryote tree of life. Curr Biol. 2014;24:465–470. doi: 10.1016/j.cub.2014.01.036. [DOI] [PubMed] [Google Scholar]
- 52.Derelle R, et al. Bacterial proteins pinpoint a single eukaryotic root. Proc Natl Acad Sci USA. 2015;112:693–699. doi: 10.1073/pnas.1420657112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Larkin MA, et al. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23:2947–2948. doi: 10.1093/bioinformatics/btm404. [DOI] [PubMed] [Google Scholar]
- 54.Nicholas, K. B. & Nicholas, jr. H. B. GeneDoc: a tool for editing and annotating multiple sequence alignments. Distributed by the author (1997).
- 55.Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2014;32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Kumar S, Stecher G, Tamura K. MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33:1870–1874. doi: 10.1093/molbev/msw054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Swofford, D. L. PAUP*. Phylogenetic analysis using parsimony (*and other methods). Version4. Sinauer Associates, Sunderland, Massachusetts (2003).
- 58.Kalyaanamoothly S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nature Methods. 2017;14:587–589. doi: 10.1038/nmeth.4285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Minh BQ, Nguyen MAT, von Haeseler A. Ultrafast approximation for phylogenetic bootstrap. Mol Biol Evol. 2013;30:1188–1195. doi: 10.1093/molbev/mst024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Etienne KA, et al. Draft Genome Sequence of Mortierella alpina Isolate CDC-B6842. Genome Announc. 2014;2:e01180–13. doi: 10.1128/genomeA.01180-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Ma, et al. Genomic analysis of the basal lineage fungus Rhizopus oryzae reveals a whole-genome duplication. Plos Genet. 2009;5:e1000549. doi: 10.1371/journal.pgen.1000549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Youssef, et al. The genome of the anaerobic fungus Orpinomyces sp. Strain C1A reveals the unique evolutionary history of a remarkable plant biomass degrader. Appl Environ MicroBiol. 2013;79:4620–4634. doi: 10.1128/AEM.00821-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Corradi N, Haag KL, Pombert JF, Ebert D, Keeling PJ. Draft genome sequence of the Daphnia pathogen Octosporea bayeri: insights into the gene content of a large microsporidian genome and a model for host-parasite interactions. Genome Biol. 2009;10:R106. doi: 10.1186/gb-2009-10-10-r106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Corrochano LM, et al. Expansion of signal transduction pathways in fungi by extensive genome duplication. Curr Biol. 2016;26:1577–1584. doi: 10.1016/j.cub.2016.04.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Grossart HP, Wurzbacher C, James TY, Kagami M. Discovery of dark matter fungi in aquatic ecosystems demands a reappraisal of the phylogeny and ecology of zoosporic fungi. Fung Ecol. 2016;19:28–38. doi: 10.1016/j.funeco.2015.06.004. [DOI] [Google Scholar]
- 66.Vossbrinck, C. R., Debrunner-Vossbrinck, B. A. & Weiss, L. M. Phylogeny of the Microsporidia. In Mycrosporidia: Pathogens of Opportunity (eds Weiss, L. M. & Becnel, J. J.) 203–220 (John Wiley and Sons, Inc. 2014).
- 67.Pombert JF, Haag KL, Beidas S, Ebert D, Keeling PJ. The Ordospora colligata genome: evolution of extreme reduction in microsporidia and host-to-parasite horizontal gene transfer. mBio. 2015;6:e02400–14. doi: 10.1128/mBio.02400-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Ren R, et al. Phylogenetic resolution of deep eukaryotic and fungal relationships using highly conserved low-copy nuclear genes. Genome Biol Evol. 2016;11:2683–2701. doi: 10.1093/gbe/evw196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Michelle C, Vourc’h P, Mignon L, Andres CR. What was the set of ubiquitin and ubiquitin-like conjugating enzymes in the eukaryote common ancestor? J Mol Evol. 2009;68:616–628. doi: 10.1007/s00239-009-9225-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Catic A, Ploegh HL. Ubiquitin – conserved protein or selfish gene? Trends Biochem Sci. 2005;30:600–604. doi: 10.1016/j.tibs.2005.09.002. [DOI] [PubMed] [Google Scholar]
- 71.Finley D, et al. Unified nomenclature for subunits of the Saccharomyces cerevisiae proteasome regulatory particle. Trends Biochem Sci. 1998;23:244–245. doi: 10.1016/S0968-0004(98)01222-5. [DOI] [PubMed] [Google Scholar]
- 72.Werner F, Grohmann D. Evolution of multisubunit RNA polymerases in the three domains of life. Nat Rev Microbiol. 2011;9:85–98. doi: 10.1038/nrmicro2507. [DOI] [PubMed] [Google Scholar]
- 73.Bui QT, Golinelli-Cohen MP, Jackson CL. Large Arf1 guanine nucleotide exchange factors: evolution, domain structure, and roles in membrane trafficking and human disease. Mol Genet Genomics. 2009;282:329–350. doi: 10.1007/s00438-009-0473-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Kanazawa C, et al. Effects of deficiencies of STAMs and Hrs, mammalian class E Vps proteins, on receptor downregulation. Biochem Biophys Res Comm. 2003;309:848–856. doi: 10.1016/j.bbrc.2003.08.078. [DOI] [PubMed] [Google Scholar]
- 75.Johnson JL. Evolution and function of diverse Hsp90 homologs and cochaperone proteins. Biochim Biophys Acta. 2012;1823:607–613. doi: 10.1016/j.bbamcr.2011.09.020. [DOI] [PubMed] [Google Scholar]
- 76.Bousquet-Antonelli C, Deragon JM. A comprehensive analysis of the La-motif protein superfamily. RNA. 2009;15:750–764. doi: 10.1261/rna.1478709. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Schmidt D, Müller S. PIAS/SUMO: new partners in transcriptional regulation. Cell Mol Life Sci. 2003;60:2561–2574. doi: 10.1007/s00018-003-3129-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Stevens RC, Davis TN. Mlc1p is a light chain for the unconventional myosin Myo2p in Saccharomyces cerevisiae. J Cell Sci. 1998;142:711–722. doi: 10.1083/jcb.142.3.711. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.