Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2018 Mar 5;115(12):3108–3113. doi: 10.1073/pnas.1713725115

RNAi is a critical determinant of centromere evolution in closely related fungi

Vikas Yadav a, Sheng Sun b, R Blake Billmyre b, Bhagya C Thimmappa a, Terrance Shea c, Robert Lintner c, Guus Bakkeren d, Christina A Cuomo c, Joseph Heitman b, Kaustuv Sanyal a,1
PMCID: PMC5866544  PMID: 29507212

Significance

The “centromere paradox” refers to rapidly evolving and highly diverse centromere DNA sequences even in closely related eukaryotes. However, factors contributing to this rapid divergence are largely unknown. Here, we identified large regional, LTR retrotransposon-rich centromeres in a group of human fungal pathogens belonging to the Cryptococcus species complex. We provide evidence that loss-of-functional RNAi machinery and possibly cytosine DNA methylation trigger instability of the genome by activation of centromeric retrotransposons presumably suppressed by RNAi. We propose that RNAi, together with cytosine DNA methylation, serves as a critical determinant that maintains repetitive transposon-rich centromere structures. This study explores the direct link between RNAi and centromere structure evolution.

Keywords: retrotransposons, Cryptococcus, Ustilago, experimental evolution, DNA methylation

Abstract

The centromere DNA locus on a eukaryotic chromosome facilitates faithful chromosome segregation. Despite performing such a conserved function, centromere DNA sequence as well as the organization of sequence elements is rapidly evolving in all forms of eukaryotes. The driving force that facilitates centromere evolution remains an enigma. Here, we studied the evolution of centromeres in closely related species in the fungal phylum of Basidiomycota. Using ChIP-seq analysis of conserved inner kinetochore proteins, we identified centromeres in three closely related Cryptococcus species: two of which are RNAi-proficient, while the other lost functional RNAi. We find that the centromeres in the RNAi-deficient species are significantly shorter than those of the two RNAi-proficient species. While centromeres are LTR retrotransposon-rich in all cases, the RNAi-deficient species lost all full-length retroelements from its centromeres. In addition, centromeres in RNAi-proficient species are associated with a significantly higher level of cytosine DNA modifications compared with those of RNAi-deficient species. Furthermore, when an RNAi-proficient Cryptococcus species and its RNAi-deficient mutants were passaged under similar conditions, the centromere length was found to be occasionally shortened in RNAi mutants. In silico analysis of predicted centromeres in a group of closely related Ustilago species, also belonging to the Basidiomycota, were found to have undergone a similar transition in the centromere length in an RNAi-dependent fashion. Based on the correlation found in two independent basidiomycetous species complexes, we present evidence suggesting that the loss of RNAi and cytosine DNA methylation triggered transposon attrition, which resulted in shortening of centromere length during evolution.


The centromere is a specialized DNA locus that is required for assembly of a multiprotein complex, the kinetochore, which drives faithful chromosome segregation in eukaryotes. Centromeres can be classified as point centromeres of short DNA sequences of <400 bp (e.g., Saccharomyces cerevisiae), and regional centromeres which are long, and can range between a few kilobases (kb) (e.g., Schizosaccharomyces pombe) to several megabases (Mb) (e.g., humans and plants) (1, 2). The repetitive DNA present in regional centromeres (core and pericentromeres) consists of either arrays of satellite DNA or transposons or both. Transposons, despite being present in lower copy numbers than the satellites, are proposed to play a major role in regional centromere evolution (1, 3, 4). Transposon domestication at the centromere can also give rise to functional domains or repeats, including satellite DNA repeats in a centromere. The dh/dg repeats in the fission yeast or α-satellite repeats in humans are proposed to be the result of such domestication events (4). The presence of CENP-B, a centromere DNA binding protein that shows a high level of similarity with DNA transposons in humans and its homologs in other organisms, provides another line of evidence toward the role of transposons in the evolution of centromere structure and function (5, 6). Notably, a large number of transposons present in centromeres are RNA transposons or retrotransposons, which propagate through RNA intermediates in the “copy–paste” mode. These elements are different from DNA transposons, which excise themselves from the original site and move to a new target site in the genome, and thus propagate in the “cut–paste” mode (7). Retrotransposons, owing to their mode of propagation, have been proposed as architects of the regional centromeres (4). These elements can produce multiple copies of themselves and integrate into the same centromere locus or centromeres of other chromosomes (3, 4, 8, 9).

A retroelement must be transcribed to generate its RNA intermediates required for transposition. A low level of transcription from the centromere in a range of organisms, including budding yeast, fission yeast, mouse, humans, and maize, has been reported (10, 11). Studies on centromere function in various organisms suggest that neither a complete absence of transcription nor a high level of transcription at the centromere supports assembly of the multisubunit kinetochore complex on the centromere DNA (10, 12). Long noncoding centromeric transcript RNAs are shown to be required for loading of critical kinetochore proteins such as CENP-C (13, 14). In fission yeast, transcripts from dh/dg repeat regions of the centromere are targets of the RNA interference (RNAi) machinery and are required for heterochromatin formation (15, 16). Lack of proper heterochromatinization at dh/dg repeats causes abnormal centromere function leading to chromosome missegregation, although RNAi does not directly affect CENP-A loading in this organism (16). These results highlight the importance of transcription of centromeres. However, the level of transcription at the centromere must be regulated at an optimal level for proper kinetochore assembly and genome stability.

By regulating transcription from retrotransposons, present at the centromere or elsewhere in the genome, RNAi contributes to genome defense (17, 18). The functional RNAi pathway involves key proteins, including Dicer (Dcr), Argonaute (Ago), and RNA-dependent RNA polymerase (Rdp). The RNAi machinery targets double-stranded RNA being generated from retroelements/repeats leading to its degradation and in some cases promoting heterochromatinization through repressive histone modifications (such as H3K9 dimethylation) and/or DNA methylation (15, 19). Thus, by controlling expression of transposons at centromeres, RNAi could play an important regulatory role in the structural evolution of centromeres. However, except in S. pombe, no specific study has been conducted to explore a possible link between RNAi and centromere structure–function evolution (9).

The presence/absence of retrotransposons or repetitive DNA sequences can also contribute to rapid divergence of centromeres in a species complex. For example, centromeres in three related species of the Schizosaccharomyces clade, S. pombe, Schizosaccharomyces japonicus, and Schizosaccharomyces octosporus, exhibit divergence in the centromeric architecture (9). S. japonicus centromeres contain mostly transposons, whereas S. pombe and S. octosporus contain repeat-rich centromeres, and have lost most of the active transposons (9). Another group of fungi, the Candida species complex in the CTG clade, have been studied extensively with respect to centromere structure and function. Candida albicans, Candida dubliniensis, and Candida lusitaniae harbor repeatless centromeres where centromere DNA sequences are unique and different in each species (2022). On the other hand, Candida tropicalis contains highly homogenized repeat-associated fission yeast-like centromeres (23). Both of these genera, Schizosaccharomyces and Candida, belong to the phylum Ascomycota of the fungal kingdom.

In this study, we identified centromeres in three closely related Cryptococcus species in the fungal phylum Basidiomycota. The three species, Cryptococcus neoformans (type strain H99), Cryptococcus deneoformans (reference isolate JEC21), and Cryptococcus deuterogattii (outbreak isolate R265), are haploid and diverged from a common ancestor as recently as 34 Mya (2426). All three species are human pathogens and two of these primarily infect immunocompromised individuals, including HIV/AIDS patients, to cause cryptococcal meningitis, a leading cause of death in these patients. While C. neoformans and C. deneoformans are the most studied species in this complex, a recent rise in cases of drug resistance and infections in otherwise healthy individuals have attracted attention to the lesser explored species C. deuterogattii (27, 28). Unlike C. neoformans and C. deneoformans, C. deuterogattii lost key components of the functional RNAi machinery (29). Here, we show that centromeres in all three Cryptococcus species are regional centromeres featuring retroelements or their remnants. Notably, the RNAi-deficient C. deuterogattii species harbors significantly shorter centromeres compared with the RNAi-proficient species C. neoformans or C. deneoformans. We also predicted centromere locations in the Ustilago species complex, a group of plant pathogens in the Basidiomycota, and observed a similar correlation between the loss of RNAi and shortening of putative centromeres. Overall, this study provides mechanistic insights into the structural evolution of centromeres in an RNAi-dependent manner.

Results and Discussion

Cryptococcus Species Harbor Large Regional Centromeres.

We previously reported the probable structure of centromeres in C. neoformans as long ORF-free regions that are rich in retrotransposable elements (25). In addition, centromeres in C. deneoformans were predicted as a single transposon-rich locus on each of the 14 chromosomes (26). In this study, we sought to identify the centromeres experimentally in three pathogenic species: C. neoformans, C. deneoformans, and C. deuterogattii, of the Cryptococcus species complex, including verifying centromere locations predicted earlier in C. neoformans and C. deneoformans.

To achieve this goal, two of the evolutionarily conserved inner kinetochore proteins, CENP-A and CENP-C, were identified in each of the three species (SI Appendix, Fig. S1 A and B). Both of these proteins were tagged with mCherry and showed localization patterns as reported previously (Fig. 1A and SI Appendix, Fig. S1C) (30). To identify functional centromeres in C. neoformans, we performed CENP-A and CENP-C chromatin immunoprecipitation (ChIP) followed by deep sequencing (ChIP-seq). ChIP-seq analysis revealed overlapping binding of both proteins at a single locus on each of the 14 chromosomes of C. neoformans (SI Appendix, Fig. S2 and Table S1). The binding regions of both proteins were largely overlapping and these regions spanned across the ORF-free, poorly transcribed regions on most chromosomes (Fig. 1B and SI Appendix, Fig. S3). The minor variations observed between the binding sites of these two proteins could be due to different sequencing approaches employed. While no specific patterns could be detected in CENP-A/CENP-C binding across these regions, occasional dips in the binding pattern of both proteins were observed (SI Appendix, SI Materials and Methods). The length of the CENP-A– and CENP-C–bound regions varied from 20 kb to 40 kb in C. neoformans unlike centromeres of many other fungi where the CENP-A–bound region remains nearly constant across chromosomes (2). A similar CENP-A binding pattern across the entire stretch of the centromere was found in Neurospora crassa (31). It is important to note here that centromeres in C. neoformans were not completely assembled, as they contain a few sequence gaps in the current assembly. We attempted to close these gaps by PacBio as well as Sanger sequencing (SI Appendix, SI Materials and Methods). This resulted in a significant improvement of the genome assembly as we could close the sequence gaps in 11 of 14 centromeres, leaving only CEN3, CEN11, and CEN14 with some sequence gaps. While this is the best assembly obtained thus far for the C. neoformans genome, the presence of additional breaks/gaps cannot be excluded.

Fig. 1.

Fig. 1.

The Cryptococcus species complex has large regional-type centromeres. (A) The subcellular localization patterns of a conserved kinetochore protein CENP-C at various cell cycle stages (interphase, prometaphase, and anaphase) in C. neoformans, C. deneoformans, and C. deuterogattii. (Scale bar, 5 µm.) (B) Overlapping binding of CENP-A (red) and CENP-C (green) on each of the 14 chromosomes of C. neoformans as revealed by ChIP-seq analysis of these proteins. An 80-kb DNA sequence harboring the centromere is shown for each of the 14 chromosomes. (C) CENP-C (mCherry)-ChIP-qPCR analysis confirmed enrichment of CENP-C on the predicted centromeres in C. deneoformans. Fold enrichment was normalized to a non-CEN region, the level of which is marked by the dotted line in the graph. Error bars represent SEM.

The C. neoformans genome shares a high level of gene synteny with that of C. deneoformans (SI Appendix, Fig. S4A and Table S2) (25, 32). By performing synteny analysis across centromere flanking regions of C. neoformans, we were able to predict putative centromeres in C. deneoformans. The predicted regions were large, ORF-free, poorly transcribed, and map to loci previously predicted as centromeres (SI Appendix, Figs. S3 and S4B and Table S1) (26). CENP-C ChIP-qPCR confirmed the authenticity of each of these regions as functional centromeres in all 14 chromosomes of C. deneoformans (Fig. 1C). Compared with the noncentromeric locus, we obtained significant enrichment of centromeric CENP-C binding using two pairs of primers located distantly on each of the 14 centromeres.

We performed PacBio and Nanopore sequencing for the C. deuterogattii genome and generated a complete, chromosome-level de novo genome assembly, improving on the previous assembly of 26 scaffolds (Fig. 2A and SI Appendix, SI Materials and Methods). A comparison of the C. deuterogattii genome with that of C. neoformans revealed a number of rearrangements between the two species (SI Appendix, Fig. S4C and Table S2). It was previously predicted that the C. neoformans and C. deuterogattii genomes have undergone an arm exchange involving chromosomes 1 and 2 (25). The new long-read assembly provides compelling evidence supporting this rearrangement. Next, we performed CENP-C ChIP-seq in C. deuterogattii and analyzed the data using the chromosomal assembly and obtained 14 binding peaks, one on each chromosome (Fig. 2B and SI Appendix, Fig. S4D and Table S1). Similar to C. neoformans and C. deneoformans, all 14 peaks mapped to ORF-free poorly transcribed regions in the C. deuterogattii genome (SI Appendix, Fig. S3).

Fig. 2.

Fig. 2.

ChIP sequencing and PacBio sequencing map centromeres on each of the 14 chromosomes in C. deuterogattii. (A) Map showing the 14 chromosomes in C. deuterogattii with the telomeres and centromeres marked. The centromeres are marked to scale, while the telomeres are marked as 10-kb regions for visualization purposes. (B) CENP-C (mCherry)-ChIP-seq identified 14 binding regions among 14 chromosomes in C. deuterogattii’s latest assembly. A 30-kb region spanning the CENP-C–bound region is shown for each chromosome.

Sequence analysis of the CENP-A/CENP-C–bound ORF-free centromere regions in all three species revealed the presence of retrotransposons Tcn1–Tcn6 (Fig. 3A) (33). While Tcn1–Tcn5 belong to the Ty3-gypsy family of retroelements, Tcn6 belongs to the Ty1-copia family of retroelements. The retroelements have a high degree of conservation (70% or more identity) across the three species (Dataset S1). However, they differ in their sequence from the centromeric retroelements of a closely related nonpathogenic Cryptococcus species, Cryptococcus amylolentus (SI Appendix, Fig. S5A and Dataset S1) (34). It is important to note that Tcn1–Tcn6 elements are not found in the C. amylolentus genome, indicating that these elements are specific to the pathogenic Cryptococcus species. Based on these results, we conclude that centromeres in pathogenic Cryptococcus species are large regional-type, and enriched in the same retroelements. These results suggest that neither the location nor the sequence elements of the centromeres have diverged significantly among these closely related fungal species. However, further analysis revealed that the centromeres in C. deuterogattii are significantly shorter, with an average length of around 14 kb compared with those of C. neoformans or C. deneoformans, which have an average length of 44 kb and 62 kb, respectively (Fig. 3B). Consequently, the CENP-C–bound regions were also found to be shortened in C. deuterogattii (5–15 kb) compared with C. neoformans (20–40 kb). Thus, we conclude that the Cryptococcus species have CENP-A/CENP-C–rich regional centromeres of varying lengths, but the centromere DNA sequence elements are highly conserved.

Fig. 3.

Fig. 3.

Centromeres are enriched with retrotransposons in the Cryptococcus species complex. (A) The presence of various retrotransposons across the centromeres in C. neoformans, C. deneoformans, and C. deuterogattii. The numbers refer to centromere-flanking ORFs which are preceded by “CNAG_” for C. neoformans, “CN” for C. deneoformans, and “CNBG_” for C. deuterogattii. The diagrams are drawn to scale. (B) The length of each centromere of the respective species was plotted. Each dot represents one centromere, and the horizontal red line depicts the mean centromere length of the corresponding species. (C) Distribution of retrotransposons, Tcn1–Tcn6, in centromeres and across the genome in C. neoformans, C. deneoformans, and C. deuterogattii. (D) A bar diagram showing the distribution of full-length versus truncated Tcn elements at the centromeres in all three species. (E) Comparison of retrotransposon elements present in centromeres of RNAi-proficient (C. neoformans and C. deneoformans) and RNAi-deficient (C. deuterogattii) species. INT, integrase; LTR, long terminal repeat; RH, RNaseH; RT, reverse transcriptase; Unique, unique DNA sequence in each retroelement.

Loss of the RNAi Machinery and Shortening of Centromeres Are Correlated in C. deuterogattii.

Ago, Dcr, and Rdp—the key proteins of the RNAi machinery—are all present in C. neoformans and C. deneoformans but all are absent in C. deuterogattii (29). We previously demonstrated that RNAi suppresses transcription and transposition of retrotransposons, including the Tcn elements in the RNAi-proficient species C. neoformans and C. deneoformans (35, 36). Real-time PCR analysis of two of the Tcn elements confirmed that the expression levels of these elements are higher in C. deuterogattii compared with C. neoformans (SI Appendix, Fig. S5B). To further investigate alterations in the centromere length observed, we performed a detailed analysis of the retrotransposons, Tcn1–Tcn6, in all three species. These retroelements are specifically enriched and mostly restricted (>95%) to centromeric regions in all three species (Fig. 3C and SI Appendix, Figs. S2 and S4). The most striking observation is that the centromeres in C. neoformans and C. deneoformans harbor a significant proportion of full-length retroelements (20–30%), whereas the C. deuterogattii centromeres are completely devoid of such full-length elements (Fig. 3D). Instead, retroelements present in C. deuterogattii centromeres are only remnants of transposons (Fig. 3E). They lack one or more of the essential domains (LTRs, reverse transcriptase, integrase) required for transposition activity, rendering them nonfunctional for further transposition. Thus, this analysis reveals that loss of RNAi in C. deuterogattii is correlated with the loss of full-length retrotransposons and an overall reduction in the length of the centromeres.

By comparison with C. neoformans, the genome of C. deuterogattii seems to have lost additional genes besides key enzymes in the RNAi pathway (24, 29, 37). These additional lost genes include genes involved in protein processing, protein degradation, and the mitochondrial oxidation pathway. In addition, we found that the sole DNA methyltransferase encoding gene, DNMT5, is truncated in C. deuterogattii (SI Appendix, Fig. S6A). A recent study reported that putative centromeres of C. neoformans are methylated at the DNA level (38). Combining the available bisulfite sequencing results with our ChIP-seq results revealed that the centromere DNA sequence of C. neoformans is indeed extensively methylated (SI Appendix, Fig. S2). Incidentally, due to the presence of a truncated form of DNMT5, cytosine DNA methylation at centromeres is found to be absent in C. deuterogattii (SI Appendix, Fig. S6 B and C). The lack of centromere DNA methylation in C. deuterogattii was further supported by base modification analysis based on PacBio single-molecule real-time (SMRT) sequencing (Fig. 4A and SI Appendix, Fig. S6D). While C. neoformans centromeres harbor extensive base modifications, little or no methylation is associated with C. deuterogattii centromeres. Previous reports suggest that the DNA methylation suppresses recombination as well as regulates retrotransposition in various organisms (39, 40).

Fig. 4.

Fig. 4.

RNAi as a key determinant of longer centromeres in closely related fungi. (A) Base modification analysis based on SMRT PacBio sequencing revealed a high level of methylation in C. neoformans CEN14 (depicted as gray-shaded region) but not in C. deuterogattii CEN14 (see SI Appendix, Fig. S6D for remaining centromeres). (B) Comparison of the predicted centromere length in U. maydis, U. bromivora, and U. hordei. Only 20 predicted centromeres are plotted for U. hordei, while all 23 are shown for both U. maydis and U. bromivora. Each dot represents one centromere, and the horizontal line depicts the mean value. (C) Graph showing the correlation between centromere (ORF-free region) lengths and status of RNAi among the fungal species. The star in each of the boxes represents the mean value, the boxes depict the range from the 25th percentile to 75th percentile values, and the terminal vertical lines mark the range of centromere lengths. Species lacking any one of the three key proteins (Ago, Dcr, or Rdp) of the RNAi machinery were considered to harbor incomplete RNAi machinery for this analysis (SI Appendix, Table S4). (D) Possible sequence of events that might have occurred due to loss of RNAi machinery and/or DNA methylation in an RNAi-proficient strain (C. neoformans or C. deneoformans) that led to a genome with truncated retrotransposons that are unable to transpose in an RNAi-deficient strain (C. deuterogattii). (E) PacBio sequencing results of strains passaged for 1,000 doublings revealed reduction in CEN7 length in the rdp1Δ mutant but not in C. neoformans wild-type or the ago1Δ strains.

To strengthen support for our hypothesis on the roles of RNAi in centromere evolution, we also identified centromeres in another basidiomycete species complex, the Ustilago species complex. Similar to the Cryptococcus species complex, the Ustilago species complex harbors three species of which Ustilago maydis lost all three major proteins of the RNAi machinery along with the DNA methyltransferase while Ustilago hordei and Ustilago bromivora harbor the complete machinery (41, 42). In Cryptococcus, centromeres are present in large ORF-free and poorly transcribed regions of the genome. The centromeres in three Cryptococcus species are also syntenic with each other. Keeping in mind these centromeric features, we predicted centromeres in all three species of the Ustilago complex by in silico analysis and found that the RNAi-deficient species U. maydis possesses shorter centromeres (average length 12 kb) compared with both of the RNAi-proficient species U. hordei (average length 36 kb) and U. bromivora (average length 27 kb) (Fig. 4B and SI Appendix, Fig. S7 and Table S3). The centromere locations identified in U. maydis matched well with the ones predicted previously (43). Earlier studies have reported that U. maydis harbors fewer transposons than U. hordei and U. bromivora (41, 42). These observations further confirmed the correlation between loss of RNAi and a reduction in centromere length. We further extended our analysis to fungal species harboring regional centromeres, and in which RNAi status is known, and found that species with all components of the RNAi machinery harbor longer centromeres than the species that lack one or more of the RNAi components (Fig. 4C and SI Appendix, Table S4).

Rather than a single optimized DNA sequence, centromeres are among the most rapidly changing loci in the genome despite having a conserved and essential function, a phenomenon termed the “centromere paradox” (44). A number of studies verified that centromeres are evolving rapidly, even among very closely related species of plants and animals (3, 8, 45). This process of rapid evolution of centromeres is well studied in fungal species, especially among the members of the Ascomycota (9, 23, 46). The nature of centromeres in another significant fungal phylum, the Basidiomycota, was unknown. In this report, we identified centromeres in three pathogenic species of the Cryptococcus species complex. Using multiple sequencing and analysis methods, we significantly improved the genome assembly for C. neoformans and obtained a full genome-wide, chromosome-level assembly for C. deuterogattii. The three Cryptococcus species also provided an opportunity to test the role of RNAi in centromere evolution. We discovered that centromeres in the RNAi-deficient species are shorter compared with the RNAi-proficient species. Analysis of the centromere DNA sequence revealed that the RNAi-deficient species possesses fewer and truncated retrotransposons compared with the RNAi-proficient species. Taken together, our study reveals that centromeres evolve rapidly among closely related species in the Basidiomycota phylum of fungi.

The majority of the siRNA in C. neoformans maps to centromeric retrotransposons, the level of which drops in RNAi-defective mutants of C. neoformans (36, 47). The key proteins required for a functional RNAi machinery were found to be absent in C. deuterogattii (29). We propose that loss of RNAi in this species might have led to amplification of retroelements, which in turn would have integrated into the centromere-proximal sites, leading to a transient elongation of the centromeres. Consequently, the presence of retroelements in close vicinity to each other might have enhanced the rate of recombination between these elements. Recombination would cause shortening of these regions and might render retrotransposons inactive. Further, the absence of RNAi along with cytosine DNA methylation can contribute to enhancing the rate of recombination between these elements (39, 40, 48). It is notable that both transposition and recombination can be damaging to the genome if they are associated with loss of essential genes. Thus, cells that have no essential genes inactivated after transposition and recombination would survive. Because the partial loss of a centromere may not affect its function (49, 50), the probability of such events being tolerated at the centromere locus may be higher. Thus, RNAi-deficient strains with a stabilized genome will be likely to have shorter centromeres with truncated retroelements, similar to those of C. deuterogattii or U. maydis, while the intermediate population during evolution might have possessed centromeres of varying length (Fig. 4D).

We attempted to test our hypothesis by performing experimental evolution experiments. C. neoformans and its derived RNAi mutants were grown for 1,000 doublings under standard laboratory conditions (SI Appendix, Fig. S8 and SI Materials and Methods). The passaged strains did not show any obvious growth defects as measured by their generation time (SI Appendix, Table S5). PacBio sequencing of wild-type and mutant strains (both passaged and unpassaged) revealed two rearrangements in the centromeric regions of RNAi mutants compared with the wild-type grown under similar conditions (SI Appendix, Table S6). CEN7 of rdp1Δ mutant exhibited a reduction in length in the 1,000 doubling passaged strains compared with the 1,000 doubling wild-type and unpassaged strains (Fig. 4E). In addition, both unpassaged and passaged RNAi mutants carried a shorter CEN2 compared with the wild type (SI Appendix, Fig. S8 and Table S6). These results suggest that centromeres are prone to structural alterations in the absence of RNAi. Overall, our study provides evidence of RNAi in maintaining the structure of retrotransposon-rich centromeres in fungi. Experimental evolution in RNAi as well as cytosine DNA methylation mutants under conditions that favor retrotransposon expression, such as during the sexual cycle, will be tested in the future to gain further insight into this process.

Transposons play a major role in shaping the evolution of genomes, including the centromere, in multiple ways (18, 51). It has been proposed that the centromeric repeats present in fission yeast, maize, and the α-satellite repeats in human centromeres evolved from transposable elements (4). Transposons also have been shown to play an active role in centromere evolution among closely related species in plants (3). Based on a study in the Schizosaccharomyces group, it was proposed that loss of transposons observed in S. pombe centromeres occurred as a result of recombination between LTR elements that remained present in the S. japonicus genome (9). The loss of retrotransposons was also correlated with a shift in transposon regulation from RNAi to the CENP-B homolog in S. pombe, Cbp1 (5). In this study, we show structural changes in centromeres mediated by retrotransposons. Centromeres in the RNAi-proficient species C. neoformans and C. deneoformans harbor full-length retrotransposable elements, whereas RNAi-deficient C. deuterogattii centromeres contain only footprints of these elements. We propose that the truncation of retrotransposons could have occurred due to recombination among retroelements rendering them inactive. One such example is reported in angiosperms where retrotransposon truncation via illegitimate recombination resulted in genome size reduction (52, 53). Loss of full-length retroelements can prove to be advantageous for C. deuterogattii as a pathogen. Host conditions might induce retrotransposition and damage a pathogen’s genome despite suppression of active transposition by RNAi in an RNAi-proficient species (54, 55). While loss of RNAi in such situations can be lethal due to unregulated transposition, truncation and inactivation of full-length retroelements can provide selective advantages. The C. deuterogattii genome is 1.3 Mb smaller compared with C. neoformans and centromere shortening accounts for approximately one-third of this reduction (total centromere lengths being >630 kb in C. neoformans versus 203 kb in C. deuterogattii). A relatively faster generation time (SI Appendix, Table S5) of C. deuterogattii compared with C. neoformans or C. deneoformans may be due to the smaller genome of the former. Thus, we speculate that loss of full-length retroelements triggered by the loss of RNAi in C. deuterogattii could provide a pathogenic advantage over its related RNAi-proficient species in addition to other factors (27). There is as yet no established direct correlation between pathogenesis and the loss of RNAi along with full-length retrotransposon. The Cryptococcus species complex might prove to be a good model system with which to address such questions.

Materials and Methods

The strains and primers used in this study are listed in SI Appendix, Tables S7 and S8, respectively. Cryptococcus was grown in YPD (1% yeast extract, 2% peptone, and 2% dextrose) media at 30 °C. Cryptococcus cells were transformed using biolistics. Transformants were selected on YPD agar media containing 200 µg/mL of G418 (Sigma-Aldrich) or hygromycin (Invitrogen). Details of all of the experimental procedures and sequence analysis are given in SI Appendix, SI Materials and Methods. All of the sequencing data (including ChIP-seq and PacBio sequencing) have been deposited under NCBI BioProject accession no. PRJNA395628 and the Nanopore data under PRJNA13691. The reference number for each study is provided in SI Appendix, Table S9.

Supplementary Material

Supplementary File
Supplementary File

Acknowledgments

We thank Genotypic Technology Private Limited (Bangalore, India) for C. neoformans CENP-A and CENP-C ChIP-seq library preparation and raw sequence data generation reported in this publication; the Duke University core sequencing facility; the University of North Carolina core sequencing facility; and the Broad Technology Labs. V.Y. is a Senior Research Fellow, supported by the Council of Scientific and Industrial Research, Government of India. K.S. is a Tata Innovation Fellow and is also supported by intramural funding from the Jawaharlal Nehru Centre for Advanced Scientific Research. S.S., R.B.B., and J.H. are supported by NIH/National Institute of Allergy and Infectious Diseases (NIAID) R37 Merit Award AI39115-20 and R01 award AI50113-13. C.A.C. is supported by NIH/NIAID Grant U19AI110818 to the Broad Institute.

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

Data deposition: The data reported in this paper (including ChIP-seq and PacBio sequencing) have been deposited under NCBI BioProject [accession no. PRJNA395628 (sequences, including ChIP-seq and PacBio sequencing) and PRJNA13691 (Nanopore data)].

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1713725115/-/DCSupplemental.

References

  • 1.Brown JD, O’Neill RJ. The Evolution of Centromeric DNA Sequences. eLS. John Wiley & Sons; Hoboken, NJ: 2014. [Google Scholar]
  • 2.Roy B, Sanyal K. Diversity in requirement of genetic and epigenetic factors for centromere function in fungi. Eukaryot Cell. 2011;10:1384–1395. doi: 10.1128/EC.05165-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Gao D, Jiang N, Wing RA, Jiang J, Jackson SA. Transposons play an important role in the evolution and diversification of centromeres among closely related species. Front Plant Sci. 2015;6:216. doi: 10.3389/fpls.2015.00216. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Wong LH, Choo KH. Evolutionary dynamics of transposable elements at the centromere. Trends Genet. 2004;20:611–616. doi: 10.1016/j.tig.2004.09.011. [DOI] [PubMed] [Google Scholar]
  • 5.Cam HP, Noma K, Ebina H, Levin HL, Grewal SI. Host genome surveillance for retrotransposons by transposon-derived proteins. Nature. 2008;451:431–436. doi: 10.1038/nature06499. [DOI] [PubMed] [Google Scholar]
  • 6.Mateo L, González J. Pogo-like transposases have been repeatedly domesticated into CENP-B-related proteins. Genome Biol Evol. 2014;6:2008–2016. doi: 10.1093/gbe/evu153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Goodier JL, Kazazian HH., Jr Retrotransposons revisited: The restraint and rehabilitation of parasites. Cell. 2008;135:23–35. doi: 10.1016/j.cell.2008.09.022. [DOI] [PubMed] [Google Scholar]
  • 8.Comai L, Maheshwari S, Marimuthu MPA. Plant centromeres. Curr Opin Plant Biol. 2017;36:158–167. doi: 10.1016/j.pbi.2017.03.003. [DOI] [PubMed] [Google Scholar]
  • 9.Rhind N, et al. Comparative functional genomics of the fission yeasts. Science. 2011;332:930–936. doi: 10.1126/science.1203357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Chan FL, Wong LH. Transcription in the maintenance of centromere chromatin identity. Nucleic Acids Res. 2012;40:11178–11188. doi: 10.1093/nar/gks921. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hall LE, Mitchell SE, O’Neill RJ. Pericentric and centromeric transcription: A perfect balance required. Chromosome Res. 2012;20:535–546. doi: 10.1007/s10577-012-9297-9. [DOI] [PubMed] [Google Scholar]
  • 12.Pezer Z, Ugarković D. Role of non-coding RNA and heterochromatin in aneuploidy and cancer. Semin Cancer Biol. 2008;18:123–130. doi: 10.1016/j.semcancer.2008.01.003. [DOI] [PubMed] [Google Scholar]
  • 13.Du Y, Topp CN, Dawe RK. DNA binding of centromere protein C (CENPC) is stabilized by single-stranded RNA. PLoS Genet. 2010;6:e1000835. doi: 10.1371/journal.pgen.1000835. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Scott KC. Transcription and ncRNAs: At the cent(rome)re of kinetochore assembly and maintenance. Chromosome Res. 2013;21:643–651. doi: 10.1007/s10577-013-9387-3. [DOI] [PubMed] [Google Scholar]
  • 15.Volpe T, Martienssen RA. RNA interference and heterochromatin assembly. Cold Spring Harb Perspect Biol. 2011;3:a003731. doi: 10.1101/cshperspect.a003731. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Pidoux AL, Allshire RC. The role of heterochromatin in centromere function. Philos Trans R Soc Lond B Biol Sci. 2005;360:569–579. doi: 10.1098/rstb.2004.1611. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Dumesic PA, Madhani HD. Recognizing the enemy within: Licensing RNA-guided genome defense. Trends Biochem Sci. 2014;39:25–34. doi: 10.1016/j.tibs.2013.10.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Fedoroff NV. Presidential address. Transposable elements, epigenetics, and genome evolution. Science. 2012;338:758–767. doi: 10.1126/science.338.6108.758. [DOI] [PubMed] [Google Scholar]
  • 19.Matzke MA, Mosher RA. RNA-directed DNA methylation: An epigenetic pathway of increasing complexity. Nat Rev Genet. 2014;15:394–408. doi: 10.1038/nrg3683. [DOI] [PubMed] [Google Scholar]
  • 20.Padmanabhan S, Thakur J, Siddharthan R, Sanyal K. Rapid evolution of Cse4p-rich centromeric DNA sequences in closely related pathogenic yeasts, Candida albicans and Candida dubliniensis. Proc Natl Acad Sci USA. 2008;105:19797–19802. doi: 10.1073/pnas.0809770105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Kapoor S, Zhu L, Froyd C, Liu T, Rusche LN. Regional centromeres in the yeast Candida lusitaniae lack pericentromeric heterochromatin. Proc Natl Acad Sci USA. 2015;112:12139–12144. doi: 10.1073/pnas.1508749112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Sanyal K, Baum M, Carbon J. Centromeric DNA sequences in the pathogenic yeast Candida albicans are all different and unique. Proc Natl Acad Sci USA. 2004;101:11374–11379. doi: 10.1073/pnas.0404318101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Chatterjee G, et al. Repeat-associated fission yeast-like regional centromeres in the ascomycetous budding yeast Candida tropicalis. PLoS Genet. 2016;12:e1005839. doi: 10.1371/journal.pgen.1005839. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.D’Souza CA, et al. Genome variation in Cryptococcus gattii, an emerging pathogen of immunocompetent hosts. mBio. 2011;2:e00342-10. doi: 10.1128/mBio.00342-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Janbon G, et al. Analysis of the genome and transcriptome of Cryptococcus neoformans var. grubii reveals complex RNA expression and microevolution leading to virulence attenuation. PLoS Genet. 2014;10:e1004261. doi: 10.1371/journal.pgen.1004261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Loftus BJ, et al. The genome of the basidiomycetous yeast and human pathogen Cryptococcus neoformans. Science. 2005;307:1321–1324. doi: 10.1126/science.1103773. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Bielska E, May RC. What makes Cryptococcus gattii a pathogen? FEMS Yeast Res. 2016;16:fov106. doi: 10.1093/femsyr/fov106. [DOI] [PubMed] [Google Scholar]
  • 28.Dixit A, Carroll SF, Qureshi ST. Cryptococcus gattii: An emerging cause of fungal disease in North America. Interdiscip Perspect Infect Dis. 2009;2009:840452. doi: 10.1155/2009/840452. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Feretzaki M, Billmyre RB, Clancey SA, Wang X, Heitman J. Gene network polymorphism illuminates loss and retention of novel RNAi silencing components in the Cryptococcus pathogenic species complex. PLoS Genet. 2016;12:e1005868. doi: 10.1371/journal.pgen.1005868. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Kozubowski L, et al. Ordered kinetochore assembly in the human-pathogenic basidiomycetous yeast Cryptococcus neoformans. mBio. 2013;4:e00614-e13. doi: 10.1128/mBio.00614-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Smith KM, Phatale PA, Sullivan CM, Pomraning KR, Freitag M. Heterochromatin is required for normal distribution of Neurospora crassa CenH3. Mol Cell Biol. 2011;31:2528–2542. doi: 10.1128/MCB.01285-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Sun S, Xu J. Chromosomal rearrangements between serotype A and D strains in Cryptococcus neoformans. PLoS One. 2009;4:e5524. doi: 10.1371/journal.pone.0005524. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Goodwin TJ, Poulter RT. The diversity of retrotransposons in the yeast Cryptococcus neoformans. Yeast. 2001;18:865–880. doi: 10.1002/yea.733. [DOI] [PubMed] [Google Scholar]
  • 34.Sun S, et al. Fungal genome and mating system transitions facilitated by chromosomal translocations involving intercentromeric recombination. PLoS Biol. 2017;15:e2002527. doi: 10.1371/journal.pbio.2002527. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Janbon G, et al. Characterizing the role of RNA silencing components in Cryptococcus neoformans. Fungal Genet Biol. 2010;47:1070–1080. doi: 10.1016/j.fgb.2010.10.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Wang X, et al. Sex-induced silencing defends the genome of Cryptococcus neoformans via RNAi. Genes Dev. 2010;24:2566–2582. doi: 10.1101/gad.1970910. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Farrer RA, et al. Genome evolution and innovation across the four major lineages of Cryptococcus gattii. mBio. 2015;6:e00868-15. doi: 10.1128/mBio.00868-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Huff JT, Zilberman D. Dnmt1-independent CG methylation contributes to nucleosome positioning in diverse eukaryotes. Cell. 2014;156:1286–1297. doi: 10.1016/j.cell.2014.01.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Maloisel L, Rossignol JL. Suppression of crossing-over by DNA methylation in Ascobolus. Genes Dev. 1998;12:1381–1389. doi: 10.1101/gad.12.9.1381. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Mirouze M, et al. Loss of DNA methylation affects the recombination landscape in Arabidopsis. Proc Natl Acad Sci USA. 2012;109:5880–5885. doi: 10.1073/pnas.1120841109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Laurie JD, et al. Genome comparison of barley and maize smut fungi reveals targeted loss of RNA silencing components and species-specific presence of transposable elements. Plant Cell. 2012;24:1733–1745. doi: 10.1105/tpc.112.097261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Rabe F, et al. A complete toolset for the study of Ustilago bromivora and Brachypodium sp. as a fungal-temperate grass pathosystem. eLife. 2016;5:e20522. doi: 10.7554/eLife.20522. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Kämper J, et al. Insights from the genome of the biotrophic fungal plant pathogen Ustilago maydis. Nature. 2006;444:97–101. doi: 10.1038/nature05248. [DOI] [PubMed] [Google Scholar]
  • 44.Henikoff S, Ahmad K, Malik HS. The centromere paradox: Stable inheritance with rapidly evolving DNA. Science. 2001;293:1098–1102. doi: 10.1126/science.1062939. [DOI] [PubMed] [Google Scholar]
  • 45.Melters DP, et al. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution. Genome Biol. 2013;14:R10. doi: 10.1186/gb-2013-14-1-r10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Bensasson D, Zarowiecki M, Burt A, Koufopanou V. Rapid evolution of yeast centromeres in the absence of drive. Genetics. 2008;178:2161–2167. doi: 10.1534/genetics.107.083980. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Dumesic PA, et al. Stalled spliceosomes are a signal for RNAi-mediated genome defense. Cell. 2013;152:957–968. doi: 10.1016/j.cell.2013.01.046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Ellermeier C, et al. RNAi and heterochromatin repress centromeric meiotic recombination. Proc Natl Acad Sci USA. 2010;107:8701–8705. doi: 10.1073/pnas.0914160107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Mravinac B, et al. Histone modifications within the human X centromere region. PLoS One. 2009;4:e6602. doi: 10.1371/journal.pone.0006602. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Steiner NC, Clarke L. A novel epigenetic effect can alter centromere function in fission yeast. Cell. 1994;79:865–874. doi: 10.1016/0092-8674(94)90075-2. [DOI] [PubMed] [Google Scholar]
  • 51.Chuong EB, Elde NC, Feschotte C. Regulatory activities of transposable elements: From conflicts to benefits. Nat Rev Genet. 2017;18:71–86. doi: 10.1038/nrg.2016.139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Devos KM, Brown JK, Bennetzen JL. Genome size reduction through illegitimate recombination counteracts genome expansion in Arabidopsis. Genome Res. 2002;12:1075–1079. doi: 10.1101/gr.132102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Vitte C, Panaud O. LTR retrotransposons and flowering plant genome size: Emergence of the increase/decrease model. Cytogenet Genome Res. 2005;110:91–107. doi: 10.1159/000084941. [DOI] [PubMed] [Google Scholar]
  • 54.Horváth V, Merenciano M, González J. Revisiting the relationship between transposable elements and the eukaryotic stress response. Trends Genet. 2017;33:832–841. doi: 10.1016/j.tig.2017.08.007. [DOI] [PubMed] [Google Scholar]
  • 55.Nicolás FE, Torres-Martínez S, Ruiz-Vázquez RM. Loss and retention of RNA interference in fungi and parasites. PLoS Pathog. 2013;9:e1003089. doi: 10.1371/journal.ppat.1003089. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File
Supplementary File

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES