Skip to main content
eLife logoLink to eLife
. 2020 Apr 20;9:e56026. doi: 10.7554/eLife.56026

Centromere deletion in Cryptococcus deuterogattii leads to neocentromere formation and chromosome fusions

Klaas Schotanus 1, Joseph Heitman 1,
Editors: Wolf-Dietrich Heyer2, Kevin Struhl3
PMCID: PMC7188483  PMID: 32310085

Abstract

The human fungal pathogen Cryptococcus deuterogattii is RNAi-deficient and lacks active transposons in its genome. C. deuterogattii has regional centromeres that contain only transposon relics. To investigate the impact of centromere loss on the C. deuterogattii genome, either centromere 9 or 10 was deleted. Deletion of either centromere resulted in neocentromere formation and interestingly, the genes covered by these neocentromeres maintained wild-type expression levels. In contrast to cen9∆ mutants, cen10∆ mutant strains exhibited growth defects and were aneuploid for chromosome 10. At an elevated growth temperature (37°C), the cen10∆ chromosome was found to have undergone fusion with another native chromosome in some isolates and this fusion restored wild-type growth. Following chromosomal fusion, the neocentromere was inactivated, and the native centromere of the fused chromosome served as the active centromere. The neocentromere formation and chromosomal fusion events observed in this study in C. deuterogattii may be similar to events that triggered genomic changes within the Cryptococcus/Kwoniella species complex and may contribute to speciation throughout the eukaryotic domain.

Research organism: Other

Introduction

Eukaryotic organisms have linear chromosomes with specialized regions: telomeres that cap the ends, origins of replication, and centromeres that are critical for chromosome segregation. During cell division, the centromere binds to a specialized protein complex known as the kinetochore (Cheeseman, 2014). Most centromeres are regional, sequence-independent, and defined by the replacement of the canonical histone H3 by the histone homolog CENP-A (CenH3 or Cse4) (Henikoff and Furuyama, 2010). In humans, centromeres contain higher-order α-satellite DNA arrays that span 0.1 to 4.8 Mb (McNulty and Sullivan, 2018), which is in contrast to most fungal centromeres, which contain transposable elements and repetitive sequences (Friedman and Freitag, 2017). Fungal regional centromeres range from the small centromeres of Candida albicans, (the CENP-A enriched regions range from 2 to 3.3 kb and are located in 4 to 18 kb open-reading frame ORF-free regions), to the large regional centromeres described in Neurospora crassa, (which range from 174 to 287 kb and consist mainly of truncated transposable elements) (Sanyal et al., 2004; Smith et al., 2011). Similar to mice, some fungi have pericentric regions (Guenatri et al., 2004). The most prominent examples are the centromeres of Schizosaccharomyces pombe, which have a CENP-A-enriched region comprised of a central core flanked by heterochromatic pericentric regions divided into outer and inner repeats (Ishii et al., 2008; Rhind et al., 2011). Budding yeast have sequence-dependent centromeres, which are short and have a conserved organization with three centromere DNA elements (consensus DNA elements (CDEs) I-III) (Kobayashi et al., 2015). However, the budding yeast Naumovozyma castellii has unique consensus DNA elements that differ from those of other budding yeast species (Kobayashi et al., 2015).

Infrequently, centromeres can be spontaneously inactivated, resulting in neocentromere formation (i.e., evolutionary new centromeres) (Ventura et al., 2007). Neocentromere formation can occur either while the native centromeric sequence is still present on the chromosome or when the native centromere has been mutated or deleted (e.g., from chromosomal rearrangements or γ irradiation damage Burrack and Berman, 2012; Tolomeo et al., 2017; Ventura et al., 2007). In addition, several studies have described neocentromere formation after deletion of native centromeres by molecular genetic engineering in fungi, chickens, and Drosophila (Alkan et al., 2007; Ishii et al., 2008; Ketel et al., 2009; Shang et al., 2013). In some organisms, the formation of neocentromeres can be deleterious, leading to disease, cancer, or infertility (Burrack and Berman, 2012; Garsed et al., 2014; Nergadze et al., 2018; Scott and Sullivan, 2014; Warburton, 2004). For example, human neocentromeres are often identified in liposarcomas (Garsed et al., 2014). However, neocentromere formation also can be beneficial, leading to speciation (Ventura et al., 2007).

Fungal neocentromeres are well described in the diploid yeast C. albicans and the haploid fission yeast S. pombe (Ishii et al., 2008; Ketel et al., 2009; Thakur and Sanyal, 2013). Deletion of C. albicans native centromere 5 or 7 has been shown to induce neocentromere formation and does not result in chromosome loss (Ketel et al., 2009; Thakur and Sanyal, 2013). In these cases, neocentromeres conferred chromosomal stability similar to the native centromere (Ketel et al., 2009; Mishra et al., 2007). Deletion of a native centromere in S. pombe led to either neocentromere formation or chromosome fusion (Ishii et al., 2008; Ohno et al., 2016). S. pombe neocentromeres formed in telomere-proximal regions near heterochromatin, and neocentromere organization featured a CENP-A-enriched core domain and heterochromatin at the subtelomeric (distal) side. Interestingly, neocentromere formation occurred at the same regions in both wild-type and heterochromatin-deficient strains, suggesting that heterochromatin is dispensable for neocentromere formation in S. pombe, although the rate of survival by chromosome fusion was significantly increased in heterochromatin-deficient mutants (Ishii et al., 2008). Deletion of kinetochore proteins (mhf1∆ and mhf2∆) led to a shift of CENP-A binding, resulting in a CENP-A-enriched region directly adjacent to the native centromere (Lu and He, 2019).

In some cases, neocentromeres span genes that are silenced, such as the neocentromeres in C. albicans. However the mechanisms that mediate silencing of neocentromeric genes are unknown in C. albicans, as proteins that are necessary for heterochromatin formation and gene silencing in other species (HP1, Clr4, and DNA methyltransferase) are absent in C. albicans (Ketel et al., 2009). Neocentromeres of S. pombe can also span genes. These genes are upregulated during nitrogen starvation and expressed at low levels during stationary growth in wild-type cells, but are silenced under all conditions tested when spanned by neocentromeres. In addition to neocentromeric genes, genes located within native centromeres have been identified in other fungi as well as rice and chicken (Nagaki et al., 2004; Schotanus et al., 2015; Shang et al., 2013).

Recently, the centromeres of the human pathogenic fungus Cryptococcus deuterogattii were characterized and compared to those of the closely related species Cryptococcus neoformans (centromeres ranging from 27 to 64 kb), revealing dramatically smaller centromeres in C. deuterogattii (ranging from 8.7 to 21 kb) (Janbon et al., 2014; Yadav et al., 2018). C. deuterogattii is responsible for an ongoing cryptococcosis outbreak in the Pacific Northwest regions of Canada and the United States (Fraser et al., 2005). In contrast to the sister species C. neoformans, C. deuterogattii commonly infects immunocompetent patients (Fraser et al., 2005). C. deuterogattii is a haploid basidiomycetous fungus with 14 chromosomes (D'Souza et al., 2011; Farrer et al., 2015; Yadav et al., 2018). The dramatic reduction in centromere size in C. deuterogattii may be attributable to loss of the RNAi pathway (Farrer et al., 2015; Yadav et al., 2018). The centromeres of C. deuterogattii consist of truncated transposable elements, and active transposable elements are missing throughout the genome (Yadav et al., 2018). This is in stark contrast to C. neoformans, which has active transposable elements in centromeric regions (Dumesic et al., 2015; Janbon et al., 2014; Yadav et al., 2018).

Neocentromeres are frequently formed near genomic repeats, yet C. deuterogattii lacks active transposons that might seed neocentromere formation. Thus, C. deuterogattii is a unique organism in which to study centromere structure and function. To elucidate centromeric organization, the native centromeres of chromosomes 10 and 9 were deleted, leading to characterization of the first neocentromeres in the Basidiomycota phylum of the fungal kingdom.

Results

Deletion of centromere 9 and 10 results in neocentromere formation

To determine if neocentromere formation occurs in the C. deuterogattii reference strain R265, either centromere 9 or 10 was deleted. Centromere 9 (CEN9) was deleted by CRISPR-Cas9-mediated transformation. Two guide RNAs flanking the centromere were used and CEN9 was replaced with a NAT dominant drug-resistance gene by homologous recombination. Biolistic transformation was used to replace centromere 10 (CEN10) with either the NAT or NEO dominant drug-resistance gene via homologous recombination. Viable transformants with the correct integration and deletion were obtained and confirmed by 5’ junction, 3’ junction, loss of deleted regions, and spanning PCRs as well as Southern blot analysis for cen10∆ (Figure 1—figure supplement 1, Figure 1—figure supplement 2). Multiple independent cen9∆ and cen10∆ deletion mutants (cen9-A to -F and cen10-A to -G) were obtained from independent transformations. Pulsed-field gel electrophoresis (PFGE) confirmed that cenΔ mutants had a wild-type karyotype and that chromosome 9 and 10 remained linear, because a circular chromosome would not have entered the gel (Figure 1—figure supplement 3).

The formation of neocentromeres on chromosome 10 in C. deuterogattii was infrequent. A total of 99 independent biolistic transformations resulted in only seven confirmed cen10∆ mutants (7/21 total candidate transformants, 33% homologous integration), suggesting that CEN10 deletion is lethal in most circumstances. In comparison, deletion of nonessential genes by homologous recombination in the C. deuterogattii R265 strain typically results in ~100 colonies with a high success rate (~80–90% homologous integration). We estimate that the likelihood of deleting a centromere and recovering a viable colony is at least 1000-fold lower than would be expected from the deletion of a non-essential gene. The deletion of CEN9 was more efficient as this was mediated by CRISPR-Cas9 cleavage with two guide RNAs and a repair allele.

Chromatin immunoprecipitation of mCherry-CENP-A followed by high-throughput sequencing (ChIP-seq) for six cen9∆ (-A to -F) and seven cen10∆ mutants (-A to -G) was performed (Figure 1). Prior to the ChIP-seq experiment, all of the centromere deletion mutants were streak purified from single colonies. The sequence reads were mapped to a complete whole-genome assembly, followed by the normalization of the reads by subtraction of the input from the ChIPed sample (Yadav et al., 2018). To quantify the ChIP-seq data, the CENP-A-enriched regions were compared with the centromeres previously identified based on CENP-C enrichment. Both the CENP-A- and CENP-C-enriched peaks were congruent for all of the native centromeres (Yadav et al., 2018). This analysis identified 13 of the 14 native centromeres (CEN1-8, CEN11-14 and depending on the centromere mutant either CEN9 or CEN10), indicating that, as expected, the native centromere of chromosome 9 or 10 was missing in all of the cen9∆ and cen10∆ deletion mutants respectively (Figure 1). Instead, neocentromeres were observed.

Figure 1. Centromere deletion leads to neocentromere formation.

For each panel, the chromosome coordinates are indicated. Genes (CDS) are shown in blue arrows and the truncated transposable elements, located in the native centromere (CEN9 or CEN10), are colored according to their class (Tcn4 in orange and Tcn6 in green). Previously generated RNA-sequencing obtained from wild-type cells was re-mapped and shown in green. In each panel, the wild-type CENP-A content is shown. In the wild type, CENP-A is only enriched at the native centromeres. For each cenΔ mutant, the neocentromeric region is shown by enrichment of CENP-A and the fold enrichment is indicated on the right of each ChIP-seq track. (A) Schematic full overview of chromosome 9, the indentation represents the native centromere 9 position. The light grey area points to the zoomed-in chromosomal region shown with the detailed view of the native centromere (CEN9) and the location of the cen9∆ mutant neocentromeres. Neocentromeres of cen9-B, cen9-C and cen9-E formed at the same chromosomal location. The dark gray line, below the transposable elements, indicates the deleted region in the cen9∆ mutants (B) Detailed view of the neocentromere of cen10-B and the secondary CENP-A peak of cen10-A and cen10-C. (C) Overview of the chromosomal 10 region spanning 100 to 410 kb. cen10-A and cen10-C have two regions enriched with CENP-A (primary and secondary). (D) Schematic full overview of the full chromosome 10, the indentation represents the chromosomal location of the native centromere (CEN10). The light grey areas point to the zoomed-in chromosomal regions shown in panel C and below. The neocentromere of cen10-E is indicated with an arrow. Lower panel, detailed view of the native centromere (CEN10) and the neocentromeres formed in cen10-A, cen10∆-C, cen10-D, cen10-F and cen10-G mutants. The dark gray line, below the transposable elements, indicates the deleted region in the cen10∆ mutants (E) Detailed view of the telocentric neocentromere of cen10-E.

Figure 1.

Figure 1—figure supplement 1. Confirmation of centromere 9 and 10 deletion by PCR.

Figure 1—figure supplement 1.

(A) The native centromeric regions of chromosome 9 in the wild type and cen9∆ mutants are shown. Genes flanking the centromeric region in the WT strain are indicated with an arrow and gene ID. The centromere was replaced by a nourseothricin (NAT) drug-resistance gene cassette, indicated with a rectangle (labelled NAT). Black lines indicate the length of the PCR product used to confirm the centromere 9 deletion. Black arrows indicate primers. (B) PCR confirmation of the centromere 9 deletion in cen9∆ mutants ran on an ethidium bromide-stained gel. The WT (W) and no-template control (B) are included as controls. For the spanning PCR, both primers are located outside of the transformed product and for the junction PCRs, one primer is outside of the transformed product and one primer is located inside the nourseothricin (NAT) drug-resistance gene cassette. (C) Centromere 10 region of the wild type (WT) and cen10∆ mutants are shown. Genes and nourseothricin (NAT) drug-resistance gene cassette, primers, and the length of the PCR product are labeled as in A. (D) PCR confirmation of the centromere 10 deletion in cen10∆ mutants ran on an ethidium bromide-stained gel. The WT and no-template control (B) are included as controls.
Figure 1—figure supplement 2. Centromere 10 is deleted in cen10∆ isolates.

Figure 1—figure supplement 2.

(A) The region corresponding to the native centromere 10 of the wild type and cen10∆ mutants is shown. Upstream and downstream flanking regions, used for homologous recombination, are indicated with rectangles (labeled ‘5’ region’ or ‘3’ region’). The nourseothricin drug resistance gene cassette is indicated with a rectangle, (labeled NAT). Grey lines indicate XbaI restriction sites. PCR products of the 5’ and 3’ regions were used as Southern blot probes, and the expected restriction pattern is indicated above. (B) Southern blot analysis for the wild-type and cen10∆ mutant strains is presented. Left panel shows the Southern blot, right panel shows the ethidium bromide-stained gel prior to Southern blotting. Both the wild type and cen10∆ mutants have the expected restriction pattern which is shown in panel A.
Figure 1—figure supplement 3. cen9∆ and cen10∆ mutants have a wild-type karyotype.

Figure 1—figure supplement 3.

PFGE analysis with the wild type and cen9∆ and cen10∆ mutants. Cells were isolated from growing at 30°C. S. cerevisiae chromosomes serve as size markers and are indicated with ‘M’ on both sides of the ethidium bromide stained gel (Chromosome sizes shown on the left side). The chromosome sizes of the reference strain R265 are shown on the right side. In several cases, chromosomes co-migrate as indicated. (A) PFGE analysis of cen10∆ mutants. Order of samples from left to right are wild type and cen10∆ mutants (-A to -G). (B) Southern blot analysis of the PFGE from panel A. A region of CEN10 served as the probe. On chromosome 10, the probe only has homology with the centromere. In the wild-type strain, the probe hybridized to chromosome 10 (arrow). cen10∆ mutants lack centromere 10 and thus do not show hybridization of the probe. The probe has homology to centromeres of other chromosomes, resulting in cross-hybridization. (C) PFGE analysis of cen9∆ mutants. Order of samples from left to right are cen9∆ mutants (-A to -D) and wild type.
Figure 1—figure supplement 4. ChIP-qPCR with additional kinetochore proteins.

Figure 1—figure supplement 4.

To confirm the binding of the kinetochore to the CENP-A-enriched regions, two additional kinetochore proteins were tagged with GFP and ChIP-qPCR analysis was performed. cen9∆ mutants were transformed with a construct expressing CENP-C-GFP and cen10∆ mutants were transformed with a construct expressing Mis12-GFP. As a control, the wild type was transformed with a construct expressing CENP-C-GFP or Mis12-GFP. For each, ChIP-qPCR data are shown for 1) the internal positive control (CEN6), 2) primer pair(s) specific for the neocentromere(s) and 3) enrichment compared to actin (set to 1). Error bars show standard deviation. (A) qPCR results of the ChIPs with cen10∆ mutants transformed with Mis12-GFP. cen10-A and cen10-C have two CENP-A-enriched regions (primary peak and secondary peak) and this is indicated in the figure. (B) qPCR results of ChIPs with cen9∆ mutants transformed with CENP-C-GFP. (C) qPCR results of the ChIP with the wild-type strain transformed with Mis12-GFP. This panel serves as a control for the ChIP-qPCRs performed in panel A. As the neocentromere of cen10-B and the secondary peak of cen10-A and cen10-C are formed in the same chromosomal region, the qPCR reaction for this chromosomal region in the wild type is only shown once. Similarly, the primary peak of cen10-A and cen10-C formed in the same chromosomal location, and this region is only shown once in the wild type. (D) qPCR results of the ChIP with the wild-type strain transformed with CENP-C-GFP. These qPCRs serve as a control for the ChIP-qPCRs performed in panel B. Three mutants have neocentromeres formed at the same chromosomal location (cen9-B, cen9-C and cen9-E) and the CENP-C enrichment of this region in the wild type is only shown once.
Figure 1—figure supplement 5. cen10∆ mutants have elongated cell morphology.

Figure 1—figure supplement 5.

Cell morphology of >1000 cells each for the wild-type strain and five cen10∆ mutant strains was analyzed, counted, and plotted as a percentage of total cell number. L = large, S = small, Scale bar = 10 μm. (A) Percentage of cells with elongated cell morphology. Formation of abnormal cell morphology is rare. The cen10∆-C and cen10-F mutants have an increased number of elongated cells. cen10∆-D, cen10∆-E, and cen10∆-G mutants had <0.5% elongated cell morphology. (B) Representative view of wild-type and cen10∆-F mutant cells. Several cen10∆-F mutants had enlarged cell shapes and formed elongated cell clusters. (C) Representative view of wild-type colonies and a population of mixed colony sizes of cen10∆ mutants. Shown here is cen10∆-A. Large colonies have a size to similar to wild type.

Except for the neocentromere of isolate cen10-E, the neocentromeres formed in close proximity to the native centromere (CEN9 and CEN10). Almost all neocentromeres were shorter than the native centromere, with the exception of cen10-G which was larger than native centromere 10 (Table 1).

Table 1. Genes located inside neocentromeres.

The chromosomal locations, sizes, and GC content (%) for the native centromere and cen∆ mutants are shown. For the neocentromeres, gene ID, predicted function, and the amount of CENP-A coverage are indicated.

Chr coor (bp) Size (kb) Size compared
to native
centromere (%)
GC% Genes spanned
by neocentromere
Gene ID % covered by
Neocentromere
Exons inside
neocentromere
Native centromere 9 Chr9:755,771–762,621 6.84 - 43.6 - - - -
cen9-A Chr9:785,352–789,247 3.87 56.6 46.1 Escrt-II complex subunit (VPS25) CNBG_5690 100
Iron regulator 1 CNBG_9614 14.6 Last exon
cen9-B Chr9:775,164–780,756 4.41 64.5 46.6 Xylosylphosphotransferase
(XPT1)
CNBG_5687 6.9
Transglycosylase SLT domain-containing protein CNBG_9613 100
Glutamate synthase (NADPH/NADH) CNBG_5689 33.7
cen9-C Chr9:775,164–780,756 4.41 64.5 46.6 Xylosylphosphotransferase
(XPT1)
CNBG_5687 6.9
Transglycosylase SLT domain-containing protein CNBG_9613 100
Glutamate synthase (NADPH/NADH) CNBG_5689 33.7
cen9-D Chr9:750,902–755,294 4.37 63.9 41.9 Hypothetical protein CNBG_5684 92.8
Derlin-2/3 CNBG_5685 100
cen9-E Chr9:775,164–780,756 5.56 81.3 50 Xylosylphosphotransferase
 (XPT1)
CNBG_5687 6.9
Transglycosylase SLT domain-containing protein CNBG_9613 100 Last exon
Glutamate synthase (NADPH/NADH) CNBG_5689 33.7 Last exon
cen9-F Chr9:771,614–775,469 3.83 56.0 51.5 Xylosylphosphotransferase (XPT1) CNBG_5687 100
Native centromere 10 Chr10:362,876–369,657 6.77 - 42.6 - - - -
cen10-A Chr10:115,954–120,422 4.46 65.9 46.9 CENPC/MIF2 CNBG_4461 88.3 1, 2, 3, 4 (only5th is outside)
Hypothetical protein CNBG_4462 100
Chr10:391,090–393,946 2.85 42.1 48.9 Serine/threonine-protein phosphatase 2A activator 2(RRD2) CNBG_9459 10.6 Last exon (5th)
Hypothetical protein CNBG_4366 100
Hypothetical protein CNBG_4365 23.4 Last exon (3th)
cen10-B Chr10:115,954–120,422 4.46 65.9 46.9 CENPC/MIF2 CNBG_4461 88.3 1, 2, 3, 4 (only 5th is outside)
Hypothetical protein CNBG_4462 100
cen10-C Chr10:115,954–120,422 4.46 65.9 46.9 CENPC/MIF2 CNBG_4461 88.3 1, 2, 3, 4 (only 5th is outside)
Hypothetical protein CNBG_4462 100
Chr10:391,090–393,946 2.85 42.1 48.9 Serine/threonine-protein phosphatase 2A activator 2(RRD2) CNBG_9459 10.6 Last exon (5th)
Hypothetical protein CNBG_4366 100
Hypothetical protein CNBG_4365 23.4 Last exon (3th)
cen10-D Chr10:352,648–355,154 2.51 37.1 48 Ser/Thr protein kinase CNBG_4379 88.4
cen10-E Chr10:1–4,385 4.38 64.7 53.2 Hypothetical protein CNBG_10450 100
Hypothetical protein CNBG_4495 100
cen10-F Chr10:342,517–345,159 2.64 39.0 45.5 Hypothetical protein CNBG_4383 18.6 Last two exons
Hypothetical protein CNBG_10075 100
Hexokinase (HXK1) CNBG_4382 15.3 Last three exons
cen10-G Chr10:378,389–386,366 7.97 117.7 46.5 High osmolarity signaling protein (SHO1) CNBG_4373 100
Hypothetical protein CNBG_4372 100
Hypothetical protein CNBG_4371 100
Hypothetical protein CNBG_4370 100

In three of the independent cen9∆ mutants (cen9-B, -C and -E), neocentromeres formed at the same chromosomal location (Figure 1A). Interestingly, two independent cen10∆ mutants (cen10-A and cen10-C) contained two CENP-A-enriched regions on chromosome 10, with a primary peak and a smaller secondary peak with reduced levels of CENP-A (1.3- to 1.75-fold lower) compared to the primary CENP-A peak (Figure 1C). The chromosomal location of the secondary peak was similar to the neocentromere of cen10-B (which had only one neocentromere) (Figure 1B).

The two CENP-A-enriched regions suggest four possible models: 1) aneuploidy in which cells harbor two chromosomes, 2) a dicentric chromosome with two neocentromeres (neodicentric), 3) instability between two different neocentromere states (neocentromere switching), 4) or only one CENP-A-enriched region functions as a centromere and the second CENP-A-enriched region is not bound by the kinetochore (Figure 1).

The neocentromeres were located in unique, nonrepetitive sequences and were not flanked by repetitive regions. The GC content of neocentromeres is similar to the overall GC content of chromosome 9 and 10, whereas the native centromere has a lower GC content (Table 1). Comparing the reference genome with de novo genome assemblies of cen10-A, cen10-B, and cen10-E confirmed that transposable elements did not enter these genomic regions during neocentromere formation (Supplementary file 1). Instead of spanning repeats and transposable elements like the native centromeres, neocentromeres span genes.

All of the neocentromeres of chromosome 9 formed in a region within 26 kb of the chromosomal location of the native centromere 9 (Figure 1A). Interestingly, the neocentromeres of the independent cen9-B, cen9-C, and cen9-E mutants all formed at the same chromosomal location and had the same length (4.41 kb); these neocentromeres spanned three genes. One gene was completely covered by CENP-A and this gene encodes a transglycosylase SLT domain-containing protein. The two other genes (a gene encoding a xylosylphosphotransferase and a gene encoding glutamate synthase (NADPH/NADH)) were partially covered with CENP-A. Mutant cen9-A had a 3.87 kb long neocentromere located 26 kb 3’ to the native centromere and spanned two genes. The first gene was completely spanned by CENP-A and encodes an ESCRT-II complex subunit (Vps25) protein. The second gene was only partially covered and encodes an iron regulator protein. The neocentromere of cen9-D was located directly to the left of the native centromere and was 4.37 kb in length. This neocentromere spanned two genes, coding for a hypothetical protein (92% covered by CENP-A) and a gene encoding for Derlin-2/3 that was completely covered by CENP-A. Lastly, the neocentromere of mutant cen9-F was 3.83 kb in length and spanned one gene (encoding a xylosylphosphotransferase, XPT1), which was completely covered by CENP-A. This neocentromere was located 12 kb away (3’) from the native centromere 9.

Like the neocentromeres of cen9∆ mutants, the neocentromeres of cen10∆ mutants also spanned genes and interestingly, the kinetochore protein CENP-C was located inside the neocentromere of cen10-B and in the secondary peak of cen10-A and -C (Table 1, Figure 1B). The neocentromere in cen10-B spanned 4.46 kb, was located 242 kb away from the 3’ region of the native CEN10, and was located 115 kb from the telomere (Figure 1B). In addition to the gene encoding CENP-C, the CENP-A-enriched region spanned a hypothetical protein (Table 1). The primary CENP-A-enriched region of cen10-A and cen10-C spanned a gene encoding a serine/threonine-protein phosphatase 2A activator 2 (RRD2) and a hypothetical protein (Figure 1D). This neocentromere spanned 2.85 kb and was located closer to the native CEN10 (21 kb from the native centromere) than the neocentromere of cen10∆-B and the secondary CENP-A peak of cen10-A and cen10-C. The neocentromere of cen10-D was the smallest neocentromere (2.5 kb) and partially (88.4%) spanned a gene encoding a Ser/Thr protein kinase and formed 7.4 kb from the location of the native CEN10 (Figure 1D). The neocentromere of cen10-E spanned two hypothetical proteins, was 4.38 kb in length and was located directly adjacent to the right telomere (Figure 1E). Mutant cen10-F had a neocentromere of 2.64 kb, which spanned one hypothetical gene completely and two genes (hypothetical and a hexokinase (HXK1)) partially; the neocentromere formed at a chromosomal location 20 kb 5’ of the native centromere (Figure 1D). The neocentromere of cen10-G was the largest neocentromere with a CENP-A-enriched region of 7.97 kb, and was in fact larger than the native CEN10. This neocentromere spanned four genes, including a gene coding for a high osmolarity protein (Sho1) and three genes coding for hypothetical proteins (Figure 1D).

To test if the kinetochore was binding to the CENP-A-enriched regions of chromosomes 9 and 10, and to validate if the neocentromeres were fully functional as centromeres, two additional kinetochore proteins were epitope-tagged with GFP. cen9∆ mutants were transformed with an overlap PCR product expressing CENPC-GFP. As the neocentromeres of three cen10∆ mutants spanned the gene encoding CENP-C, all cen10∆ mutants were transformed with an overlap PCR product, expressing Mis12-GFP. In addition to the cen9∆ and cen10∆ mutants, the wild type was transformed with constructs expressing Mis12-GFP and CENP-C-GFP, and these served as controls. ChIP-qPCRs for cen9∆ mutants, cen10∆ mutants, and wild-type strains with Mis12-GFP or CENP-C-GFP were performed (Figure 1—figure supplement 4). Because Mis12 is an outer kinetochore protein, the formaldehyde cross-linking was extended to 45 min (15 min was used for CENP-A and CENP-C) for this protein. For all qPCR analyses, the native centromere 6 CEN6) was used as an internal control and for each neocentromere specific primer pairs were designed. For cen9∆ and cen10∆ mutants a similar level of Mis12 or CENP-C enrichment at the neocentromeres and (CEN6) was observed. This suggested that the CENP-A-enriched regions of chromosome 9 of the cen9∆ mutants and chromosome 10 of cen10∆ mutants identified by ChIP-seq were functional centromeres and indeed neocentromeres (Figure 1—figure supplement 4).

Previously generated RNA sequence data were remapped to the C. deuterogattii reference strain R265 and analyzed to determine if the regions where neocentromeres formed in the cenΔ mutants were transcribed in the wild type (Figure 1Supplementary file 2Schneider et al., 2012). In the wild-type strain, all genes spanned by neocentromeres in the cenΔ mutants were expressed (Figure 1). However, the expression levels of the neocentromeric genes were lower than their neighboring genes. For example, the expression level of the gene (CNBG_5686) flanking native centromere 9 was three times higher than the genes spanned by neocentromeres in cen9∆ mutants (Figure 1A). The same trend was observed in the cen10∆ mutants. Here, the expression level of the gene (CNBG_4365) 3’ flanking the neocentromere (primary CENP-A peak) of cen10-A and cen10-C was more than six times higher than the genes spanned by the neocentromere (Figure 1D). Also, the neocentromere of cen10-D is flanked by genes whose expression was two times higher than the genes spanned by the neocentromere (Figure 1D). This suggests that neocentromeres are formed in chromosomal regions with lower gene expression in C. deuterogattii. The majority of the genes (24/28) flanking native centromeres are transcribed in the direction towards the native centromere. All of the neocentromeres observed span one or more genes and most of the flanking genes are transcribed in the direction away from the neocentromere.

The expression levels of the neocentromeric genes in cen9∆ and cen10∆ mutants were assayed by qPCR (Figure 2). The neocentromeric genes of chromosome 9 were normalized to actin. To compensate for the ploidy levels of chromosome 10 in cen10∆ mutants, a housekeeping gene located on chromosome 10 was used to normalize the expression of genes spanned by neocentromeres located on chromosome 10. The expression levels of the CENP-A-associated neocentromeric genes were all found to be similar to the wild-type strain (Figure 2).

Figure 2. Expression of neocentromeric genes.

Figure 2.

Expression of the neocentromeric genes was assessed by qPCR for all cenΔ mutants and expression is shown as Log2ΔΔCt. For cen10-A, cen10-B and cen10-C, two genes were selected from each neocentromeric region, all other cen∆ mutants are represented by one gene spanned by CENP-A. cen10-B has only one CENP-A-enriched region, and in this case, the genes located within primary peak of cen10-A and cen10-C served as controls. The qPCRs of cen10∆ mutants are normalized with a housekeeping gene located on chromosome 10. The qPCRs of cen9∆ mutants are normalized with actin. Error bars show standard deviation.

Neocentromere formation can reduce fitness

 cen10∆ mutants were noted to grow more slowly than wild type. To investigate this, the growth of cen10∆ and wild-type strains was measured during the course of a 22-hour cell growth experiment (Figure 3A). The majority of cen10∆ mutants exhibited slower growth rates compared to the wild-type parental strain R265. Six of seven cen10∆ mutants exhibited significant fitness defects compared to the wild-type strain, with doubling times ranging from 101 to 111 min compared to 81 min for the wild type (Figure 3B). In contrast, one mutant, which has a telocentric neocentromere (cen10-E), grew similarly to the wild type and had a similar doubling time (84 min for the mutant vs 81 min for the wild-type strain). Compared to the wild type, cen10∆ mutants with increased doubling times produced smaller colonies during growth on non-selective media (Figure 1—figure supplement 5C).

Figure 3. cen10∆ mutant strains have reduced fitness compared to the wild-type strain.

(A) Six out of seven cen10∆ mutants had a longer doubling time and slower growth than the wild-type strain. In contrast cen10-E grows similarly to the wild type. Error bars show standard deviation. (B) Doubling times and fold change compared to wild type are shown. (C) Competition assays with the wild type and cen9∆ and cen10∆ mutant strains. Mixed cultures (1:1) were grown overnight and plated with and without selection agents. After four days, colonies were counted and the percentage of cenΔ mutants (black) and wild type (grey) in each culture was plotted. As a control (C) a wild-type strain with a NAT marker was mixed with the wild type.

Figure 3.

Figure 3—figure supplement 1. cen10∆ mutants with chromosomal fusion have a wild-type growth rate.

Figure 3—figure supplement 1.

The doubling times of large and small colonies derived from cen10∆ mutants at 30°C were determined. Large colonies had a growth rate similar to the wild-type strain, while small colonies and the initial cen10∆ mutants had a similar growth rate with one another but that was slower than the wild-type strain and the derived large colonies. (A) Growth curves for cen10-A-derived isolates and the wild-type strain are shown. (B) Growth curves for cen10∆-B-derived isolates and the wild-type strain are shown.

To compare fitness, a competition assay was performed with 1:1 mixtures of wild-type and cen9∆ or cen10∆ mutants grown in liquid YPD medium (Figure 3C). With no growth defect, the expectation was that the wild-type strain and centromere deletion mutants would grow at the same growth rate, resulting in a 1:1 ratio. In fact, fewer cen10∆ cells were found in the population after growth in competition with the wild-type strain, and this observation is consistent with the slower doubling time of cen10∆ mutants resulting in reduced fitness compared to wild type (Figure 3). Compared to the wild-type cells, there were fewer cen9∆ mutant cells in the population. However, the number was closer to a 1:1 ratio (Figure 3C). The ratio of the cen9∆ mutants in the population was similar to the ratio of the cen10-E mutant, which had a wild-type growth rate. Due to this observation, we hypothesize that the growth rate of the cen9∆ mutants is similar to wild type.

cen10∆ isolates are aneuploid

Because deletion of a centromere could lead to defects in chromosome segregation, cenΔ mutants were assessed for aneuploidy (Figure 4). Overall, cen10∆ mutants exhibited a mixture of large and small colony sizes during growth on YPD medium at 37°C, while cen9∆ mutants exhibited a uniform, wild-type like, colony size (Figure 1—figure supplement 5).

Figure 4. cen10∆ mutants are aneuploid.

The whole genomes of small and large colonies derived from four cen10∆ mutants were sequenced and read coverage (corresponding to ploidy levels) was plotted. Small colonies of cen10∆ mutants were partially aneuploid for chromosome 10, while the large colonies are euploid. (A) Genome-wide read depth coverage for small and large colonies. On the right, the fold coverage for the highest ploidy level is indicated for each sample. For example, chromosome 10 of cen10∆-B-S1 had an aneuploidy level of 1.35-fold compared to the wild-type strain. Chromosome 4 had a small region with increased read depth due to the ribosomal rDNA gene cluster and was excluded from the analysis. Chromosome 8 of cen10-E was duplicated. In addition, cen10∆-E-S3 had an additional duplicated region of 162 kb of chromosome 5 that spans the sequence of native centromere 5. (B) Detailed view of read depth of chromosome 10. As in panel A, read depth is indicated on the right. The native centromeric location is shown by a black square. Due to the deletion of centromere 10, the location of the native centromere lacks sequence reads for each sample.

Figure 4.

Figure 4—figure supplement 1. Deletion within subtelomeric regions in chromosome fusion isolates.

Figure 4—figure supplement 1.

Sequence reads were mapped to the reference R265 genome. Regions with sequence coverage are shown in blue, and those without sequence coverage are shown in white. Subtelomeric regions of the large cen10∆ colonies have lost sequence coverage, whereas cen10∆ small colonies have wild-type sequence coverage. Telomeric loss was not observed for the large colonies of cen10Δ∆-E. For all panels, genes (CDS) are shown on the top (light blue). (A) Detailed view of the 3’ subtelomeric region of chromosome 10. Large colonies of cen10-A and cen10-B lost sequences corresponding to a region of 6.5 and ~8 kb. (B) Detailed view of the 3’ subtelomeric region of chromosome 4. Large colonies of cen10-A have lost sequences corresponding to a 12 kb region. (C) Detailed view of the 3’ subtelomeric region of chromosome 7. Large colonies of cen10-B lost sequences corresponding to an ~18.5 kb region.

Aneuploidy in C. neoformans often leads to a similar mixed colony size phenotype as that observed in the cen10∆ mutants (Sun et al., 2014). To exacerbate the aneuploidy-associated slow growth phenotype, four cen10∆ mutants were grown at elevated temperature (37°C), causing these isolates to produce smaller, growth-impaired and larger, growth-improved colonies (Figure 3—figure supplement 1). Three small and two large colonies were selected from each isolate and whole-genome analysis was performed based on Illumina sequencing. Sequences were mapped to the reference R265 genome, revealing that the small colonies were indeed aneuploid (Figure 4A). The small colonies of cen10-B and cen10-C had ploidy levels for chromosome 10 in the range of 1.25- to 1.36-fold higher compared to the other 13 chromosomes, which suggested that only a proportion of the cells (25% to 36%), were aneuploid (Figure 4B). The remainder of the genome was euploid. Chromosome 10 of the small colonies derived from isolate cen10-A and cen10-E exhibited ploidy levels ranging from 1.1- to 1.14-fold, reflecting less aneuploidy. Importantly, for all of the large colonies derived from isolates cen10-A, cen10-B, cen10-C, and cen10-E the fold coverage of chromosome 10 was restored to the wild-type euploid level (1.0 fold compared to wild type). The ploidy levels of chromosome 10 were 1-fold for all of the large colonies compared to wild type, indicating that the ploidy level of chromosome 10 of the large colonies was restored to euploid.

cen10∆ chromosome is rescued by chromosome fusion

Based on whole-genome sequencing and PFGE analysis, fusion of cen10∆ chromosome 10 to other chromosomes was a common event in the large colonies (Figure 5, Figure 6). Whole-genome sequence analysis revealed that sequences corresponding to the 3’ subtelomeric region of chromosome 10 (including one gene) were absent in the sequences obtained from all of the large colonies analyzed (Figure 4—figure supplement 1A). In addition, the large colonies of cen10-A were missing sequences for two genes in the 5’ subtelomeric region of chromosome 4 (Figure 4—figure supplement 1B). Large colonies of cen10-B were missing 18.5 kb at the 5’ subtelomere of chromosome 7 (including eight genes) (Figure 4—figure supplement 1C). The large colonies of cen10-E lacked a small part of one gene in the 3’ subtelomeric region of chromosome 1. In total, of the 14 subtelomeric genes that were lost in these three chromosome-fusion isolates, ten encoded hypothetical proteins and four encoded proteins with predicted functions. BlastN analysis in the de novo genome assemblies of the large colonies confirmed that the subtelomeric regions were not located on minichromosomes or inserted in other chromosomes. Seven genes have homologs in C. neoformans and are present in C. neoformans deletion libraries (Liu et al., 2008Supplementary file 3). This observation suggested that either subtelomeric deletions occurred, or that chromosomal fusions led to the loss of subtelomeric regions. Notably, sequences from the small colonies spanned the entire genome with no evidence of these subtelomeric deletions (Figure 4—figure supplement 1F).

Figure 5. cen10∆ mutants undergo chromosome fusion leading to improved fitness at 37°C.

Chromosomal fusions were studied in detail for three cen10∆ mutants restored to wild-type growth levels at 37°C (large colonies). After chromosome fusion, the fused chromosomes of cen10-A-L and cen10∆-B-L lost the gene CNBG_6141, which is located in the 3’ subtelomeric region of chromosome 10. Genes present in the fused chromosome are depicted in green, and genes lost after chromosome fusion are indicated in red. Gray highlights indicate regions present in both the parental and fused chromosomes. Each fusion occurred in a unique nonrepetitive region. (A) cen10∆-A-L1, the fusion occurred between chromosome 10 and chromosome 4. (B) In cen10∆-B-L1, chromosomal fusion occurred between chromosomes 10 and 7. (C) cen10∆-E-L1 chromosomal fusion occurred between chromosomes 10 and 1.

Figure 5.

Figure 5—figure supplement 1. Chromosome fusion in large cen10∆ colonies.

Figure 5—figure supplement 1.

Sequence reads were mapped back to the de novo cen10∆ genome assemblies to verify the quality of the de novo cen10∆ mutant genome assemblies. For each panel the full length of the scaffold, which includes the fusion point, and a 100 bp detailed region is shown. Sequences homologous to chromosome 10 are depicted in blue and sequence reads mapped back to the de novo genome assemblies are shown in grey. (A) The de novo genome assembly of cen10∆-A-L1 contains a scaffold (~211 kb) of a fused chromosome and this scaffold consists of an 81.97 kb region of chromosome 10 (blue) and a 129.38 kb region of chromosome 4 (green). The sequence coverage of this scaffold is 104 fold. The chromosome fusion occurred with a 2 bp overlap between chromosome 4 and 10. (B) The de novo genome assembly of cen10∆-B-L1 contains a scaffold (~886 kb) and consists of an 80.46 kb region of chromosome 10 (blue) and an 806.09 kb region of chromosome 7 (red). The fusion point lacks overlap between chromosomes 7 and 10. The sequence coverage of this scaffold is 104 fold. (C) The de novo genome assembly of cen10∆-E-L1 consists of a scaffold (~585 kb) of a fused chromosome and this scaffold consists of an 83.59 kb region of chromosome 10 (blue) and 501.37 kb of chromosome 1 (pink). The sequence coverage of this scaffold is 140 fold. The chromosome fusion occurred with a 6 bp overlap between chromosome 1 and 10. (D) PCR confirmation of the chromosome fusion occurring in the large colonies derived from cen10∆-B, cen10∆-C, and cen10∆-A. For each chromosomal fusion, a PCR spanning the fusion was performed. For each chromosome fusion, PCRs were performed for: 1) a large colony derived from the cen10∆ mutant, 2) the original cen10∆ mutant, 3) a small colony derived from growth at 37°C, and 4) the wild type.

Figure 6. Chromosome fusion results in neocentromere inactivation and karyotype reduction.

Figure 6.

(A) Neocentromeres are inactive after chromosomal fusion. For each neocentromere two qPCR primer pairs located in genes spanned by the neocentromere in cen10-A and cen10∆-B mutant were used in a ChIP-qPCR experiment. Analyzed is the CENP-A enrichment of 1) a cen10∆ mutant, 2) a large colony derived from the cen10∆ mutant, and 3) the wild-type strain. Centromere 6 (CEN6) was included as a positive control, and actin was included as a negative control. Data are shown for cen10∆-A, cen10∆-A-L1, cen10∆-B, cen10∆-B-L2, and wild type. For cen10∆-A and cen10∆-A-L1 mutants, the chromosomal regions investigated are indicated according to the primary and secondary CENP-A peaks of the cen10∆-A mutant. The cen10∆-B mutant has only one CENP-A-enriched region which co-localized with the secondary CENP-A peak of cen10∆-A and this region is labeled with neocen in cen10∆-B and cen10∆-B-L1. Error bars show standard deviation. (B) PFGE analysis shows that the band corresponding to chromosome 10 was lost in the large colonies and instead larger bands appear due to the fusion of chromosome 10 with other chromosomes. cen10∆ deletion mutants and small colonies derived from 37°C show a wild-type karyotype. Chromosome 10 of the large colonies was fused to chromosome 13, 10, or 1, respectively. Due to limitations of PFGE conditions, the chromosome 10–chromosome 1 fusion did not separate from chromosomes 2, 3, and 4. The positions of the fused chromosomes are indicated with arrows.

We hypothesized that the subtelomeric gene loss was due to chromosomal fusion and tested this hypothesis with de novo genome assemblies and PFGE (Figure 5 and Figure 6). Based on de novo genome assemblies for the large colonies of cen10-A, cen10∆-B, and cen10∆-E, chromosome 10 fused with chromosome 4, 7, or 1, respectively (Figure 5 and Supplementary file 3). In the large colony of cen10∆-A (cen10∆-A-L1), the fusion occurred between chromosome 10 and chromosome 4 (Figure 5A). Chromosomal fusion led to the loss of the CNBG_10211 gene (on chromosome 4), and the fusion junction was within the CNBG_6174 gene of chromosome 4. For cen10∆-B-L1, the chromosomal fusion occurred between chromosomes 10 and 7 (Figure 5B). Seven genes of chromosome 7 were lost in the fused chromosome. The chromosome fusion junction was intergenic on chromosome 7. cen10∆-E-L1 was due to a chromosomal fusion between chromosomes 10 and 1 (Figure 5C). The fusion was intragenic for both chromosomes. The fusion point occurred in CNBG_6141 on chromosome 10 and CNBG_10308 on chromosome 1.

Because all of the large cen10∆ colonies had chromosome 10 fusions, we examined the fusion location on chromosome 10 in detail. The fusions occurred 1.7, 0.3, and 3.6 kb from the chromosome 10 gene CNBG_6142, respectively (Figure 5—figure supplement 1). The fusion occurred in unique DNA sequences and was not flanked by repetitive regions. The overlapping region between chromosome 10 and the fused chromosome was at most 6 bp, suggesting that these fusions occurred via microhomology-mediated end joining (MMEJ) (also known as alternative nonhomologous end-joining [Alt-NHEJ]).

Chromosome fusion may result in loss of the neocentromeres, and the kinetochore may bind to the native centromere of the fused chromosome and function as the active centromere. This hypothesis was tested by performing ChIP-qPCR for CENP-A binding (Figure 6A). For each neocentromere (either of the cen10-A or cen10-B mutant), CENP-A enrichment was tested with four primer sets located in the neocentromere. The CENP-A enrichment for these four locations was tested in 1) the initial cen10∆ mutant, 2) a large colony derived from a specific cen10∆ mutant and 3) wild type (Figure 6A). As expected, the ChIP-qPCR analysis showed CENP-A enrichment for the neocentromeres of the initial cen10-A and cen10-B mutants. The neocentromeric regions of cen10-A and cen10-B were not enriched with CENP-A in the wild-type strain, showing there was no occupancy by CENP-A prior to neocentromere formation. For all analyzed cen10∆ chromosome 10 fusion isolates, the neocentromeres were not CENP-A-associated, and were similar to the wild-type background levels. Therefore, the neocentromeres were no longer active in the chromosome fusion strains (Figure 6A). This suggests that the native centromere of the fusion partner of chromosome 10 (i.e., chromosome 1, 4, or 7) was the active centromere of the Chr10-Chr1, Chr10-Chr4, and Chr10-Chr7 fusions.

In addition to cen10-A, -B, and -E mutants, whole-genome sequencing was performed for two large colonies of cen10∆-C. Although it was not possible to identify the chromosome fusion based on whole-genome sequencing data for either of the large colonies of the cen10-C mutant, PFGE analysis showed that cen10∆-C-L1 had a fusion between chromosomes 10 and 13 (Figure 6b). cen10∆-C-L2 had read coverage of 1.99-fold for a region of ~200 kb of chromosome 10 (Figure 4B). The rest of chromosome 10 was euploid, suggesting that the ~200 kb region was duplicated and was either a single chromosome or fused to another chromosome in this isolate. PFGE analysis suggested that this fragment was duplicated on chromosome 10, resulting in a larger chromosome (Figure 6B). In contrast to the other fused chromosomes, this chromosomal fragment did not fuse to a chromosome with a native centromere, and the fact that the mutant still exhibited a fitness defect was consistent with this interpretation. The larger chromosome was euploid, suggesting that the unstable neocentromere(s), rather than causing aneuploidy, resulted in a fitness cost in this isolate.

Discussion

Composition of neocentromeres in C. deuterogattii

The native centromeres of C. deuterogattii are found in repetitive regions and are flanked by, but do not contain, protein-encoding genes (Yadav et al., 2018). By contrast, neocentromeres of C. deuterogattii span genes, lack repetitive elements, and like the native centromeres, are flanked by genes. In general (with one exception), the neocentromeres of C. deuterogattii are significantly shorter than the native centromeres, whereas most neocentromeres in other species have similar lengths as the native centromeres.

Native centromeres of S. pombe have a central core that is enriched with CENP-A and flanked by repetitive pericentric regions (Ishii et al., 2008). While neocentromere formation in S. pombe favors repeats in the pericentric regions, neocentromere formation is possible without the repetitive pericentric regions (Ishii et al., 2008). The majority of the neocentromeres in C. albicans and chickens are formed close to native centromeres due to seeding of CENP-A that is located near the native centromere (the so-called CENP-A cloud) (Ketel et al., 2009; Shang et al., 2013). Our results and the earlier reports discussed, suggest that the chromosomal location of the native centromere is the main determinant of neocentromere formation. One exception was the neocentromere of cen10-E, which directly flanked the left telomere. Interestingly, this was the only cen10∆ mutant that had a growth rate similar to wild type.

Several C. deuterogattii neocentromeres formed in the same location; however, there is no apparent consensus between the different regions occupied by different neocentromeres. A similar trend has been observed in neocentromere formation in C. albicans (Ketel et al., 2009). Evolutionary new centromeres (ECNs) in the largest crucifer tribe Arabideae originated several times independently and are located in the same chromosomal location (Mandáková et al., 2020). Our results suggest that neocentromeres form by mechanisms that do not rely on nearby transposable elements/repeats to initiate de novo centromere assembly.

Neocentromeric genes are expressed

Neocentromeres induced in several species can span genes, resulting in silencing or reduced gene expression. For example, all genes within five independent neocentromeres in C. albicans that spanned nine genes were suppressed (Burrack et al., 2016). In S. pombe, neocentromeres span genes that are only expressed in response to nitrogen starvation in the wild-type strain, and neocentromere formation silences these genes during nitrogen starvation (Ishii et al., 2008). The native centromere 8 of rice contains an approximately 750 kb CENP-A-enriched region with four genes that are expressed in both leaf and root tissues of three closely related species (Fan et al., 2011; Nagaki et al., 2004). Neocentromeres of rice span genes that are expressed at similar levels as in the wild type (Zhang et al., 2013). Chicken neocentromeres have been induced on chromosome Z or 5 (Shang et al., 2013). Chromosome Z neocentromeres span eight genes, but in wild-type cells only MAMDC2 is expressed during normal growth. The other seven genes were either not expressed at any detectable level in all tested developmental stages or were only expressed during early embryonic stages (Shang et al., 2013). When a neocentromere formed, expression of the MAMDC2-encoding gene was reduced 20- to 100-fold. Chromosome 5 of chickens is diploid, and neocentromeres on this chromosome span genes that are expressed. The hypothesis behind this phenomenon is that one allele functions as a centromere, while the other allele codes for the genes.

Because the cen10∆ mutants of C. deuterogattii were aneuploid, the expression of genes spanned by chromosome 10 neocentromeres was normalized to expression levels of a housekeeping gene located on chromosome 10. The expression of genes enriched for CENP-A chromatin is similar to that of wild type, and if the allelic hypothesis, like in chickens, were valid, the expectation would be a 60% reduction in expression levels. The genes spanned by neocentromeres of cen9∆ mutants are also expressed at wild-type levels. As the cen9∆ mutants have uniform, wild-type colony sizes, the ploidy levels of these mutants were not tested and we hypothesize that these mutants are haploid/euploid.

Genes contained in regions in which C. deuterogattii neocentromeres formed in cenΔ mutants were actively expressed in the wild-type strain, and this is similar to human neocentromeres that can form in regions with or without gene expression (Alonso et al., 2010; Marshall et al., 2008). However, we have identified chromosomal regions that lack gene expression on chromosomes 9 and 10, although these regions were not close to the native centromere.

Of the C. deuterogattii genes spanned by the neocentromere region, one encodes the kinetochore component CENP-C. Several independent biolistic transformations were performed to delete the gene encoding CENP-C, but all attempts were unsuccessful. This suggests that CENPC is an essential gene. In fission yeast, deletion of the gene encoding the CENP-C homolog Cnp3 was lethal at 36°C, but mutants were still viable at 30°C (Suma et al., 2018). However, CENP-A was mislocalized in the cnp3∆ mutants. Another gene partially located inside a C. deuterogattii neocentromere encodes the serine/threonine-protein phosphatase 2A activator 2 (RRD2). The RRD2 homolog is not essential in S. cerevisiae (Higgs and Peterson, 2005).

Compared with other haploid fungi, the neocentromeric genes of C. deuterogattii are similar to the native centromeric genes of the haploid plant pathogenic fungus Zymoseptoria tritici. Z. tritici has short regional centromeres with an average size of 10.3 kb, and 18 out of 21 native centromeres have a total of 39 expressed genes (Schotanus et al., 2015).

cen10∆ mutants with two CENP-A-enriched regions

The appearance of two CENP-A-enriched regions of C. deuterogattii cen10∆ mutants could be explained in a few ways. First, neocentromere formation could lead to a dicentric chromosome 10 in which the centromeres may differ in functional capacity. Dicentric chromosomes are not by definition unstable, for example the dominant-negative mutation of the mammalian telomere protein TRF2 results in chromosome fusions, leading to the formation of dicentric chromosomes (Stimpson et al., 2010). The formation of dicentric chromosomes occurred in 97% of the fused mammalian chromosomes, which were stable for at least 180 cell divisions (Stimpson et al., 2010). Several microscopic studies showed that chromosomes with two regions of centromere-protein enrichment are stable (Higgins et al., 2005; Stimpson et al., 2012; Stimpson et al., 2010; Sullivan and Willard, 1998). This suggests that a dineocentric chromosome 10 could be stable in the population. Second, the two CENP-A-enriched peaks could be the result of a mixed population and either due to an unstable primary neocentromere and/or aneuploidy. The primary neocentromere could be associated with the majority of the cells, whereas the secondary CENP-A peak would be only found in a small number of cells (and the primary neocentromere is lost in these isolates). This is reflected by lower CENP-A enrichment for the secondary peak, and the hypothesis of putative dicentrics is due to a mixture of alleles in the population. Third, the neocentromeres could be unstable, which could lead to the formation of two CENP-A-enriched regions with centromere function switching between the regions. However, our data would argue against this latter model. Prior to the ChIP-seq analysis of the cen10∆ mutants, colonies were isolated by streak purification (eight times), suggesting that the presence of two distinct CENP-A peaks occurs continuously.

cen10∆ mutants are partially aneuploid

Neocentromere formation in chickens results in a low number of aneuploid cells (Shang et al., 2013). Based on whole-genome sequencing of a population of cells, the C. deuterogattii cen10∆ isolates are partially aneuploid for chromosome 10. For fully aneuploid isolates, the coverage of Illumina reads is expected to be 2-fold; the cen10∆ isolates with two CENP-A peaks showed aneuploidy levels up to 1.28-fold or were even euploid. This suggests that, like the chicken neocentromeric isolates, only a small number of cells in a population of C. deuterogattii cen10∆ isolates are aneuploid.

cen10∆ mutants have reduced fitness

In C. albicans, deletion of centromere 5 results in neocentromere formation, and these isolates have fitness similar to the wild-type strain (Ketel et al., 2009). Similar results were reported for neocentromeres in chicken and S. pombe, in which strains with neocentromeres or chromosome fusion have a growth rate similar to the wild-type strain (Ishii et al., 2008; Shang et al., 2013).

If centromere deletions occurred in nature, we hypothesize that the wild type would outcompete all of the cen∆ isolates. The virulence of the cen∆ mutants was not assayed. Based on reduced fitness of the cen∆ mutants, we hypothesize that pathogenicity of the cen∆ mutants would be lower than the wild type. However, when chromosome fusion occurs the growth rate is restored to a near wild-type level and we hypothesize that the isolates with 13 chromosomes could have virulence similar to the wild type. Several genes were lost due to the fusion events in the cen∆ mutants; to our knowledge these lost genes have not been associated with pathogenicity of C. deuterogattii.

Neocentromere stains exhibit impaired growth and chromosome fusion restores wild-type growth at elevated temperatures

Deletion of a centromere in S. pombe leads to either neocentromere formation or chromosome fusion due to a noncanonical homologous recombination pathway (Ishii et al., 2008; Ohno et al., 2016). This is in contrast to neocentromere formation in C. deuterogattii, which results in 100% neocentromere formation. Based on PFGE analysis, the karyotype of the cenΔ isolates is wild type at 30°C, but chromosome fusion can occur at 37°C within the cen10∆ mutants and lead to improved growth at 30°C.

The location of the cen10∆ neocentromere had no apparent influence on the ability to undergo chromosome fusion as we have shown with a telocentric neocentromere, a dicentric mutant, and a neocentromere located 118 kb away from the telomere.

The fused chromosomes have no or only short homology at the breakpoints that is insufficient for homologous recombination, suggesting that the chromosome fusions arise via MMEJ. Future experiments to test this hypothesis could involve deleting genes involved in the MMEJ pathway, such as CDC9 and DNL4 (Sinha et al., 2016).

A prominent chromosome fusion occurred during the speciation of humans. Compared to other great apes, humans have a reduced karyotype, which is due to the fusion of two ancestral chromosomes that resulted in chromosome 2 in modern humans, Denisovans, and Neanderthals (Miga, 2017). Human chromosome2 still harbors signatures of telomeric repeats at the fusion point (interstitial telomeric sequences [ITS]), suggesting that this chromosome is derived from a telomere-telomere fusion. By synteny analysis, the inactive centromere of chimpanzee chromosome 2b can be identified on human chromosome 2, and there are relics of α satellite DNA at this now extinct centromere (Miga, 2017). Moreover, a dominant-negative mutation of the human telomeric protein TRF2 leads to telomere-telomere fusions, mainly between acrocentric chromosomes (Stimpson et al., 2010; van Steensel et al., 1998). In the fungal species Malassezia, chromosome breakage followed by chromosome fusion has led to speciation (Sankaranarayanan et al., 2020). The short regional centromeres (3–5 kb) are fragile and this led most likely to chromosome reduction. By contrast in C. deuterogattii, the chromosomes involved in chromosomal fusion of the cen10∆ mutants were all metacentric, and fusion occurred in non-telomeric sequences.

Another example of telomeric fusions is the presence of ITS regions in several genomes. In budding yeast, the experimental introduction of an ITS into an intron of the URA3 gene resulted in four classes of chromosome rearrangements, including: 1) inversion, 2) gene conversion, 3) mini-chromosome formation due to deletion or duplication, and 4) mini-chromosome formation due to translocation (Aksenova et al., 2013). Based on our de novo genome assemblies of the C. deuterogattii large-colony cen10∆ mutants, chromosome fusions occurred with no signs of chromosome rearrangements. Thus, these chromosome fusions did not produce ITS regions, which would otherwise destabilize the genome.

Conclusions

Our work shows that, like in other model systems, neocentromeres can be induced in C. deuterogattii. However, C. deuterogattii neocentromeres have several unique characteristics, such as spanning genes whose expression is unaffected by centromere assembly. In some instances, deletion of CEN10 led to chromosome fusion, resulting in enhanced fitness and leading to inactivation of the neocentromere. Presumably, deletion of other centromeres could be carried out, leading to a C. deuterogattii strain with only one or a few chromosomes, as was recently reported in S. cerevisiae (Luo et al., 2018; Shao et al., 2018).

Materials and methods

Key Resources Table Template and Guidelines.

Key resources table.

Reagent type
(species) or resource
Designation Source or reference Identifiers Additional
information
Genetic reagent
Cryptococcus deuterogattii
R265 This study R265 expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10-A This study R265 centromere 10 deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-B This study R265 centromere 10 deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-C This study R265 centromere 10 deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-D This study R265 centromere 10 deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-E This study R265 centromere 10 deletion mutant with expressingmCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-F This study R265 centromere 10 deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-G This study R265 centromere 10 deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-A-S1 This study Small colony derived from R265 centromere 10A deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-A-L1 This study Large colony derived from R265 centromere 10A deletion mutant with expressingmCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-B-S1 This study Small colony derived from R265 centromere 10B deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-B-S2 This paper Small colony derived from R265 centromere 10B deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-B-S3 This study Small colony derived from R265 centromere 10B deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-B-L1 This study Large colony derived from R265 centromere 10B deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-B-L2 This study Large colony derived from R265 centromere 10B deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-C-S1 This study Small colony derived from R265 centromere 10C deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-C-S2 This study Small colony derived from R265 centromere 10C deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-C-S3 This study Small colony derived from R265 centromere 10C deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-C-L1 This study Large colony derived from R265 centromere 10C deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-C-L2 This study Large colony derived from R265 centromere 10C deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-E-S1 This study Small colony derived from R265 centromere 10E deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-E-S2 This study Small colony derived from R265 centromere 10E deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-E-S3 This study Small colony derived from R265 centromere 10E deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-E-L1 This study Large colony derived from R265 centromere 10E deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-E-L2 This study Large colony derived from R265 centromere 10E deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen9△-A This study R265 centromere 9 deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen9△-B This study R265 centromere 9 deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen9△-C This study R265 centromere 9 deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen9△-D This study R265 centromere 9 deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen9△-E This study R265 centromere 9 deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen9-F This study R265 centromere 9 deletion mutant with expressing mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
R265 MIS12 This study R265 expressing GFP-MIS12 and mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10-A MIS12 This study R265 Centromere 10 mutant with expressing GFP-MIS12 and mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-B MIS12 This study R265 Centromere 10 mutant with expressing GFP-MIS12 and mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-C MIS12 This study R265 Centromere 10 mutant with expressing GFP-MIS12 and mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-D MIS12 This study R265 Centromere 10 mutant with expressing GFP-MIS12 and mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen10△-E MIS12 This study R265 Centromere 10 mutant with expressing GFP-MIS12 and mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
R265 CENPC This study R265 with expressing GFP-CENPC and mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen9-A CENPC This study R265 Centromere 9 mutant with expressing GFP-CENPC and mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen9△-B CENPC This study R265 Centromere 9 mutant with expressing GFP-CENPC andmCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen9△-C CENPC This study R265 Centromere 9 mutant with expressing GFP-CENPC andmCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen9△-D CENPC This study R265 Centromere 9 mutant with expressing GFP-CENPC and mCherry-CENPA
Genetic reagent
Cryptococcus deuterogattii
cen9△-E CENPC This study R265 Centromere 9 mutant with expressing GFP-CENPC and mCherry-CENPA
Antibody Anti-mCherry antibody(Rabbit polyclonal) Abcam Cat. no.
ab183628
ChIP (1/5000)
Antibody Anti-GFP (Rabbit polyclonal) antibody Abcam Cat. no.
ab290
ChIP (5 µg for 1 µg of chromatin)
Other Dynabeads Protein A for Immunoprecipitation Invitrogen Cat. no.
10001D
ChIP (20 µl per500 µl fraction)
Software, algorithm Bowtie2 Langmead, 2010
Software, algorithm Spades Bankevich et al., 2012
Software, algorithm IGV Thorvaldsdóttir et al., 2013
Software, algorithm HISAT2 Pertea et al., 2016
Sequence-based reagent List of primers used in this study Sigma In Supplementary file 4

Strains, primers, and culture conditions

Primers are listed in Supplementary file 4. Strains used in this study are listed in Supplementary file 5. All strains were stored in glycerol at −80°C, inoculated on solid YPD (yeast extract, peptone, and dextrose) media, and grown for two days at 30°C. Liquid YPD cultures were inoculated from single colonies of solid media and grown, while shaking, at 30°C overnight.

Genetic manipulations

DNA sequences (1 to 1.5 kb) of the CEN10-flanking regions were PCR-amplified with Phusion High-Fidelity DNA Polymerase (NEB, Ipswich MA, USA). Flanking regions were fused on both sides of either the NEO or NAT dominant selectable marker via overlap PCR, conferring G418 or nourseothricin resistance, respectively. Deletion of C. deuterogattii CEN10 was achieved through homologous recombination via biolistic introduction of an overlap-PCR product as previously described (Billmyre et al., 2017; Davidson et al., 2002). Deletion of CEN9 was performed by CRISPR-CAS9 mediated transformation with two guide RNAs flanking CEN9 and homologous recombination was mediated by the introduction of an overlap PCR product as previously described (Fan and Lin, 2018). Transformants were selected on YPD medium containing G418 (200 μg/mL) or nourseothricin (100 μg/mL).

Subsequently, the 5’ junction, 3’ junction, and spanning PCR and Southern blot analyses were performed to confirm the correct replacement of CEN10 by the appropriate drug resistance marker. To identify centromeres, the gene CNBG_0491, which encodes CENP-A, was N-terminally fused to the gene encoding the fluorescent mCherry protein by overlap PCR, and C. deuterogattii strains were biolistically transformed as previously described (Billmyre et al., 2017). A subset of cen9∆ mutants were biolistically transformed with an overlap PCR product containing CENPC C-terminally fused with GFP. As three cen10∆ mutants have a neocentromere that spans the gene encoding CENPC, a subset of cen10∆ mutants were transformed instead with an overlap PCR product containing MIS12 C-terminally fused with GFP. Both PCR products encoding CENPC-GFP and MIS12-GFP were randomly integrated in the genome and confirmed by a PCR spanning either CENPC-GFP or MIS12-GFP.

Growth and competition assays

Three replicate cultures for seven independent cen10∆ deletion mutants and the wild-type strain were grown in liquid YPD at 30°C overnight. Cells were diluted to an OD600 of 0.01 and grown in 50 mL YPD at 30°C. The OD600 of the triplicate cultures was measured every two hours with a SmartSpec 3000 (BioRad) until stationary phase was reached (T = 22 hr).

For competition assays, three independent replicate cultures (cen9∆, cen10∆, control, and wild type) were grown overnight in 8 mL YPD. Subsequently, the cell density of the cultures was determined using a hemocytometer. For each independent cenΔ deletion mutant, 500,000 cells were co-cultured in a 1:1 ratio with wild-type cells. After 24 hr, the cultures were inoculated on 1) a YPD plate to determine the total colony-forming units (CFUs) and 2) a YPD plate containing G418 or nourseothricin to calculate the proportion of cen10∆ mutant CFUs compared to the wild-type CFUs. Plates were incubated at 30°C and the colonies were counted after 4 days. The cell morphology of >1000 cells of the wild type and of five cen10∆ mutant strains was analyzed, and the number of elongated cells was quantified (Figure 1—figure supplement 5).

Whole-genome sequencing, read mapping for aneuploidy/RNA-seq, and de novo genome assemblies

Genomic DNA was isolated using the CTAB protocol and sent to the Duke University Sequencing and Genomic Technologies Shared Resource facility for library preparation and Illumina sequencing. Sequencing was performed with a HiSeq 4000 sequencer, and 150 bp paired-end reads were generated. The resulting DNA sequence reads were trimmed, quality-filtered, and subsequently mapped with Bowtie2 to a complete PacBio, Nanopore-based, Illumina Pilon-corrected, whole-genome assembly of the C. deuterogattii R265 reference genome (version R265_fin_nuclear). Reads were visualized with IGV (Langmead, 2010; Quinlan and Hall, 2010; Thorvaldsdóttir et al., 2013; Yadav et al., 2018). Previously generated RNA sequencing reads (NCBI, SRA: SRR5209627) were remapped to the C. deuterogattii R265 reference genome by HISAT2 according to the default settings (Schneider et al., 2012; Pertea et al., 2016).

Genomes were de novo assembled with Spades using the default conditions (Bankevich et al., 2012). Genome assemblies were confirmed with PCRs using primers flanking the chromosome fusions and the PCR products obtained span the chromosomal fusions (Figure 5—figure supplement 1). The read coverage at chromosome fusions was analyzed and compared to the average read coverage of the contig (Figure 5—figure supplement 1).

Chromatin immunoprecipitation (ChIP) followed by high-throughput sequencing or qPCR

ChIP analyses were performed as previously described with minor modifications (Schotanus et al., 2015; Soyer et al., 2015). In short, 500 mL YPD cultures (1000 ml YPD for Mis12 ChIPs) were grown overnight at 30°C, after which 37% formaldehyde was added to a final concentration of 0.5% for crosslinking. The cultures were then incubated for 15 min, formaldehyde was quenched with 2.5 M glycine (1/20 vol), and cells were washed with cold PBS. The crosslinking time of Mis12-GFP tagged isolates was extended to 45 min. Cells were resuspended in chromatin buffer (50 mM HEPES-NaOH, pH 7.5; 20 mM NaCl; 1 mM Na-EDTA, pH 8.0; 1% [v/v] Triton X-100; 0.1% [w/v] sodium deoxycholate [DOC]) containing protease inhibitors (cOmplet Tablets, mini EDTA-free EASYpack, Roche), followed by homogenization by bead beating with a miniBead beater (BioSpec products) using 18 cycles of 1.5 min on and 1.5 min off. The supernatant containing chromatin was sheared by sonication (24 cycles of 15 s on, 15 s off, burst at high level) (Bioruptor UCD-200, Diagenode). Chromatin was isolated by centrifugation, and the supernatant was divided into a sample fraction and a sonication control. The sample fraction was precleared with protein-A beads (1 to 3 hr) and subsequently divided into two aliquots. One tube served as the input control, and a mCherry or GFP antibody (MCherry: ab183628, Abcam, GFP: ab290, Abcam) was added to the remaining half of the sample. The samples were incubated overnight at 4°C and then processed according to a previously published protocol (Soyer et al., 2015). After completing the ChIP experiment, the samples were analyzed by ChIP-qPCR or sent to the Duke University Sequencing and Genomic Technologies Shared Resource facility for library preparation and Illumina sequencing. Samples cen10-A, cen10-B and cen10-C were sequenced with a HiSeq 2500 sequencer, and single reads of 50 bp were obtained. All other ChIP-seq samples were sequenced with a NovaSeq 600 sequencer and 50 bp PE reads were obtained. For each centromere mutant and the wild type, a ChIPed and input sample were sequenced. Reads were mapped to the reference genome, similar to the whole-genome sequencing reads. To analyze the ChIP-seq data the ChIPed sample was normalized with the input sample and visualized with the IGV viewer.

RNA isolation and qPCRs

Cells were grown in an overnight culture of 25 mL YPD at 30°C. RNA was isolated with TRIzol LS (Thermo Fisher Scientific) according to the manufacturer’s instructions. Subsequently, cDNA was synthesized with the SuperScriptFirst-Strand Synthesis System (Thermo Fisher Scientific) according to the manufacturer’s instructions. qPCRs were performed in triplicate with Brilliant III Ultra-Fast SYBR Green qPCR Master Mix (Agilent Technologies) on an ABI 1900HT qPCR machine.

Pulsed-field gel electrophoresis (PFGE)

Isolation of whole chromosomes and conditions for PFGE and chromoblot analysis were performed as previously described (Findley et al., 2012).

Acknowledgements

We thank Tom Petes, Beth Sullivan, Kaustuv Sanyal, Sue Jinks-Robertson, Vikas Yadav, Shelby Priest, Shelly Clancey, and Inge van der Kloet for comments on the manuscript. We would like to thank all of the members of the Heitman and Sanyal labs who contribute to the bi-weekly Skype meeting. These studies were supported by NIH/NIAID grants R01 AI050113-15 and R37 MERIT award AI039115-22 to JH. JH is co-director and fellow of the CIFAR program Fungal Kingdom: Threats and Opportunities.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Joseph Heitman, Email: heitm001@duke.edu.

Wolf-Dietrich Heyer, University of California, Davis, United States.

Kevin Struhl, Harvard Medical School, United States.

Funding Information

This paper was supported by the following grants:

  • NIH AI050113-15 to Joseph Heitman.

  • NIH AI039115-22 to Joseph Heitman.

  • CIFAR Fungal Kingdom: Threats and Opportunities to Joseph Heitman.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Data curation, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing - original draft, Project administration, Writing - review and editing.

Conceptualization, Resources, Supervision, Funding acquisition, Project administration, Writing - review and editing.

Additional files

Supplementary file 1. Neocentromeres are not enriched with transposable elements.

To exclude the possibility that transposable elements were deposited into the neocentromeres, BlastN searches in a database with de novo genome assemblies of cen10-A, cen10-B, cen10∆-C, and cen10∆-E were performed. As input the homologous wild-type sequence of the chromosomal location of the neocentromeres was used. All neocentromeres of the tested cen10∆ mutants have the same length as the homologous sequence in the wild -type.

elife-56026-supp1.xlsx (10.8KB, xlsx)
Supplementary file 2. Neocentromeric regions are expressed in the wild-type strain.

Expression levels of genes where neocentromeres formed in the cenΔ mutants were analyzed in the R265 wild-type strain. Previously generated RNA sequencing data were remapped to the R265 reference genome, and the expression levels (FPKM) were analyzed for the native genes in each region wherein a neocentromere was formed in the cenΔ mutants. Expression analysis of several housekeeping genes was included for control purposes, and the median RNA expression level of all genes located on chromosomes 9 and 10 are listed. Expression levels of genes located in the subtelomeric regions were also analyzed.

elife-56026-supp2.xlsx (13.2KB, xlsx)
Supplementary file 3. Genes located in subtelomeric regions of chromosomes 1, 4, 7, and 10.

We have indicated whether each gene located in the subtelomeric regions was lost in the chromosomal fusion of the large-colony cen10∆ mutants. For each gene in the subtelomeric region of chromosome 1, 4, 7, and 10, several characteristics, such as the chromosomal location, putative function, and the presence of putative C. neoformans homologs in existing mutant libraries, are indicated. There is no correlation between the loss of genes with or without predicted function.

elife-56026-supp3.xlsx (12.2KB, xlsx)
Supplementary file 4. Primers used in this study.

For each primer, the lab identifier, purpose, and sequence are shown.

elife-56026-supp4.xlsx (17.1KB, xlsx)
Supplementary file 5. Strains used in this study.

For each strain used in this study, the lab strain identifier, description, and parental strain are indicated.

elife-56026-supp5.xlsx (11.1KB, xlsx)
Transparent reporting form

Data availability

ChIP and whole-genome sequencing reads and de novo genome assemblies were deposited under NCBI BioProject Accession ID: PRJNA511460.

The following dataset was generated:

Schotanus K. 2020. Neocentromere formation in Cryptococcus deuterogattii. NCBI BioProject. PRJNA511460

The following previously published dataset was used:

Universidade Federal do Rio Grande do Sul 2017. Cryptococcus gattii R265 mRNA RNA-Seq. NCBI Sequence Read Archive. SRR5209627

References

  1. Aksenova AY, Greenwell PW, Dominska M, Shishkin AA, Kim JC, Petes TD, Mirkin SM. Genome rearrangements caused by interstitial telomeric sequences in yeast. PNAS. 2013;110:19866–19871. doi: 10.1073/pnas.1319313110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alkan C, Ventura M, Archidiacono N, Rocchi M, Sahinalp SC, Eichler EE. Organization and evolution of primate centromeric DNA from whole-genome shotgun sequence data. PLOS Computational Biology. 2007;3:e181. doi: 10.1371/journal.pcbi.0030181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Alonso A, Hasson D, Cheung F, Warburton PE. A paucity of heterochromatin at functional human neocentromeres. Epigenetics & Chromatin. 2010;3:6. doi: 10.1186/1756-8935-3-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. Journal of Computational Biology. 2012;19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Billmyre RB, Clancey SA, Heitman J. Natural mismatch repair mutations mediate phenotypic diversity and drug resistance in Cryptococcus deuterogattii. eLife. 2017;6:e28802. doi: 10.7554/eLife.28802. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Burrack LS, Hutton HF, Matter KJ, Clancey SA, Liachko I, Plemmons AE, Saha A, Power EA, Turman B, Thevandavakkam MA, Ay F, Dunham MJ, Berman J. Neocentromeres provide chromosome segregation accuracy and centromere clustering to multiple loci along a Candida albicans chromosome. PLOS Genetics. 2016;12:e1006317. doi: 10.1371/journal.pgen.1006317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Burrack LS, Berman J. Neocentromeres and Epigenetically inherited features of centromeres. Chromosome Research. 2012;20:607–619. doi: 10.1007/s10577-012-9296-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Cheeseman IM. The kinetochore. Cold Spring Harbor Perspectives in Biology. 2014;6:a015826. doi: 10.1101/cshperspect.a015826. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. D'Souza CA, Kronstad JW, Taylor G, Warren R, Yuen M, Hu G, Jung WH, Sham A, Kidd SE, Tangen K, Lee N, Zeilmaker T, Sawkins J, McVicker G, Shah S, Gnerre S, Griggs A, Zeng Q, Bartlett K, Li W, Wang X, Heitman J, Stajich JE, Fraser JA, Meyer W, Carter D, Schein J, Krzywinski M, Kwon-Chung KJ, Varma A, Wang J, Brunham R, Fyfe M, Ouellette BF, Siddiqui A, Marra M, Jones S, Holt R, Birren BW, Galagan JE, Cuomo CA. Genome variation in Cryptococcus gattii, an emerging pathogen of immunocompetent hosts. mBio. 2011;2:e00342. doi: 10.1128/mBio.00342-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Davidson RC, Blankenship JR, Kraus PR, de Jesus Berrios M, Hull CM, D'Souza C, Wang P, Heitman J. A PCR-based strategy to generate integrative targeting alleles with large regions of homology. Microbiology. 2002;148:2607–2615. doi: 10.1099/00221287-148-8-2607. [DOI] [PubMed] [Google Scholar]
  11. Dumesic PA, Homer CM, Moresco JJ, Pack LR, Shanle EK, Coyle SM, Strahl BD, Fujimori DG, Yates JR, Madhani HD. Product binding enforces the genomic specificity of a yeast polycomb repressive complex. Cell. 2015;160:204–218. doi: 10.1016/j.cell.2014.11.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Fan C, Walling JG, Zhang J, Hirsch CD, Jiang J, Wing RA. Conservation and purifying selection of transcribed genes located in a rice centromere. The Plant Cell. 2011;23:2821–2830. doi: 10.1105/tpc.111.085605. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Fan Y, Lin X. Multiple applications of a transient CRISPR-Cas9 coupled with electroporation (TRACE) system in the Cryptococcus neoformans species complex. Genetics. 2018;208:1357–1372. doi: 10.1534/genetics.117.300656. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Farrer RA, Desjardins CA, Sakthikumar S, Gujja S, Saif S, Zeng Q, Chen Y, Voelz K, Heitman J, May RC, Fisher MC, Cuomo CA. Genome evolution and innovation across the four major lineages of Cryptococcus gattii. mBio. 2015;6:e00868. doi: 10.1128/mBio.00868-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Findley K, Sun S, Fraser JA, Hsueh YP, Averette AF, Li W, Dietrich FS, Heitman J. Discovery of a modified tetrapolar sexual cycle in Cryptococcus amylolentus and the evolution of MAT in the Cryptococcus species complex. PLOS Genetics. 2012;8:e1002528. doi: 10.1371/journal.pgen.1002528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Fraser JA, Giles SS, Wenink EC, Geunes-Boyer SG, Wright JR, Diezmann S, Allen A, Stajich JE, Dietrich FS, Perfect JR, Heitman J. Same-sex mating and the origin of the vancouver island Cryptococcus gattii outbreak. Nature. 2005;437:1360–1364. doi: 10.1038/nature04220. [DOI] [PubMed] [Google Scholar]
  17. Friedman S, Freitag M. Centromeres and Kinetochores. Springer International Publishing; 2017. Centrochromatin of fungi; pp. 85–109. [DOI] [Google Scholar]
  18. Garsed DW, Marshall OJ, Corbin VD, Hsu A, Di Stefano L, Schröder J, Li J, Feng ZP, Kim BW, Kowarsky M, Lansdell B, Brookwell R, Myklebost O, Meza-Zepeda L, Holloway AJ, Pedeutour F, Choo KH, Damore MA, Deans AJ, Papenfuss AT, Thomas DM. The architecture and evolution of cancer neochromosomes. Cancer Cell. 2014;26:653–667. doi: 10.1016/j.ccell.2014.09.010. [DOI] [PubMed] [Google Scholar]
  19. Guenatri M, Bailly D, Maison C, Almouzni G. Mouse centric and pericentric satellite repeats form distinct functional heterochromatin. Journal of Cell Biology. 2004;166:493–505. doi: 10.1083/jcb.200403109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Henikoff S, Furuyama T. Epigenetic inheritance of centromeres. Cold Spring Harbor Symposia on Quantitative Biology. 2010;75:51–60. doi: 10.1101/sqb.2010.75.001. [DOI] [PubMed] [Google Scholar]
  21. Higgins AW, Gustashaw KM, Willard HF. Engineered human dicentric chromosomes show centromere plasticity. Chromosome Research. 2005;13:745–762. doi: 10.1007/s10577-005-1009-2. [DOI] [PubMed] [Google Scholar]
  22. Higgs HN, Peterson KJ. Phylogenetic analysis of the formin homology 2 domain. Molecular Biology of the Cell. 2005;16:1–13. doi: 10.1091/mbc.e04-07-0565. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Ishii K, Ogiyama Y, Chikashige Y, Soejima S, Masuda F, Kakuma T, Hiraoka Y, Takahashi K. Heterochromatin integrity affects chromosome reorganization after centromere dysfunction. Science. 2008;321:1088–1091. doi: 10.1126/science.1158699. [DOI] [PubMed] [Google Scholar]
  24. Janbon G, Ormerod KL, Paulet D, Byrnes EJ, Yadav V, Chatterjee G, Mullapudi N, Hon CC, Billmyre RB, Brunel F, Bahn YS, Chen W, Chen Y, Chow EW, Coppée JY, Floyd-Averette A, Gaillardin C, Gerik KJ, Goldberg J, Gonzalez-Hilarion S, Gujja S, Hamlin JL, Hsueh YP, Ianiri G, Jones S, Kodira CD, Kozubowski L, Lam W, Marra M, Mesner LD, Mieczkowski PA, Moyrand F, Nielsen K, Proux C, Rossignol T, Schein JE, Sun S, Wollschlaeger C, Wood IA, Zeng Q, Neuvéglise C, Newlon CS, Perfect JR, Lodge JK, Idnurm A, Stajich JE, Kronstad JW, Sanyal K, Heitman J, Fraser JA, Cuomo CA, Dietrich FS. Analysis of the genome and transcriptome of Cryptococcus neoformans var. grubii reveals complex RNA expression and microevolution leading to virulence attenuation. PLOS Genetics. 2014;10:e1004261. doi: 10.1371/journal.pgen.1004261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Ketel C, Wang HS, McClellan M, Bouchonville K, Selmecki A, Lahav T, Gerami-Nejad M, Berman J. Neocentromeres form efficiently at multiple possible loci in Candida Albicans. PLOS Genetics. 2009;5:e1000400. doi: 10.1371/journal.pgen.1000400. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Kobayashi N, Suzuki Y, Schoenfeld LW, Müller CA, Nieduszynski C, Wolfe KH, Tanaka TU. Discovery of an unconventional centromere in budding yeast redefines evolution of point centromeres. Current Biology. 2015;25:2026–2033. doi: 10.1016/j.cub.2015.06.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Langmead B. Aligning short sequencing reads with bowtie. Current Protocols in Bioinformatics. 2010;32:11. doi: 10.1002/0471250953.bi1107s32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Liu OW, Chun CD, Chow ED, Chen C, Madhani HD, Noble SM. Systematic genetic analysis of virulence in the human fungal pathogen cryptococcus neoformans. Cell. 2008;135:174–188. doi: 10.1016/j.cell.2008.07.046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Lu M, He X. Centromere repositioning causes inversion of meiosis and generates a reproductive barrier. PNAS. 2019;116:21580–21591. doi: 10.1073/pnas.1911745116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Luo J, Sun X, Cormack BP, Boeke JD. Karyotype engineering by chromosome fusion leads to reproductive isolation in yeast. Nature. 2018;560:392–396. doi: 10.1038/s41586-018-0374-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Mandáková T, Hloušková P, Koch MA, Lysak MA. Genome evolution in Arabideae was marked by frequent centromere repositioning. The Plant Cell. 2020;32:650–665. doi: 10.1105/tpc.19.00557. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Marshall OJ, Chueh AC, Wong LH, Choo KH. Neocentromeres: new insights into centromere structure, disease development, and karyotype evolution. The American Journal of Human Genetics. 2008;82:261–282. doi: 10.1016/j.ajhg.2007.11.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. McNulty SM, Sullivan BA. Alpha satellite DNA biology: finding function in the recesses of the genome. Chromosome Research. 2018;26:115–138. doi: 10.1007/s10577-018-9582-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Miga KH. Chromosome-specific centromere sequences provide an estimate of the ancestral chromosome 2 fusion event in hominin genomes. Journal of Heredity. 2017;108:45–52. doi: 10.1093/jhered/esw039. [DOI] [PubMed] [Google Scholar]
  35. Mishra PK, Baum M, Carbon J. Centromere size and position in Candida albicans are evolutionarily conserved independent of DNA sequence heterogeneity. Molecular Genetics and Genomics. 2007;278:455–465. doi: 10.1007/s00438-007-0263-8. [DOI] [PubMed] [Google Scholar]
  36. Nagaki K, Cheng Z, Ouyang S, Talbert PB, Kim M, Jones KM, Henikoff S, Buell CR, Jiang J. Sequencing of a rice centromere uncovers active genes. Nature Genetics. 2004;36:138–145. doi: 10.1038/ng1289. [DOI] [PubMed] [Google Scholar]
  37. Nergadze SG, Piras FM, Gamba R, Corbo M, Cerutti F, McCarter JGW, Cappelletti E, Gozzo F, Harman RM, Antczak DF, Miller D, Scharfe M, Pavesi G, Raimondi E, Sullivan KF, Giulotto E. Birth, evolution, and transmission of satellite-free mammalian centromeric domains. Genome Research. 2018;28:789–799. doi: 10.1101/gr.231159.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Ohno Y, Ogiyama Y, Kubota Y, Kubo T, Ishii K. Acentric chromosome ends are prone to fusion with functional chromosome ends through a homology-directed rearrangement. Nucleic Acids Research. 2016;44:232–244. doi: 10.1093/nar/gkv997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Pertea M, Kim D, Pertea GM, Leek JT, Salzberg SL. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nature Protocols. 2016;11:1650–1667. doi: 10.1038/nprot.2016.095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Rhind N, Chen Z, Yassour M, Thompson DA, Haas BJ, Habib N, Wapinski I, Roy S, Lin MF, Heiman DI, Young SK, Furuya K, Guo Y, Pidoux A, Chen HM, Robbertse B, Goldberg JM, Aoki K, Bayne EH, Berlin AM, Desjardins CA, Dobbs E, Dukaj L, Fan L, FitzGerald MG, French C, Gujja S, Hansen K, Keifenheim D, Levin JZ, Mosher RA, Müller CA, Pfiffner J, Priest M, Russ C, Smialowska A, Swoboda P, Sykes SM, Vaughn M, Vengrova S, Yoder R, Zeng Q, Allshire R, Baulcombe D, Birren BW, Brown W, Ekwall K, Kellis M, Leatherwood J, Levin H, Margalit H, Martienssen R, Nieduszynski CA, Spatafora JW, Friedman N, Dalgaard JZ, Baumann P, Niki H, Regev A, Nusbaum C. Comparative functional genomics of the fission yeasts. Science. 2011;332:930–936. doi: 10.1126/science.1203357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Sankaranarayanan SR, Ianiri G, Coelho MA, Reza MH, Thimmappa BC, Ganguly P, Vadnala RN, Sun S, Siddharthan R, Tellgren-Roth C, Dawson TL, Heitman J, Sanyal K. Loss of centromere function drives karyotype evolution in closely related Malassezia species. eLife. 2020;9:e53944. doi: 10.7554/eLife.53944. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Sanyal K, Baum M, Carbon J. Centromeric DNA sequences in the pathogenic yeast Candida albicans are all different and unique. PNAS. 2004;101:11374–11379. doi: 10.1073/pnas.0404318101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Schneider RO, Fogaça NS, Kmetzsch L, Schrank A, Vainstein MH, Staats CC. Zap1 regulates zinc homeostasis and modulates virulence in cryptococcus gattii. PLOS ONE. 2012;7:e43773. doi: 10.1371/journal.pone.0043773. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Schotanus K, Soyer JL, Connolly LR, Grandaubert J, Happel P, Smith KM, Freitag M, Stukenbrock EH. Histone modifications rather than the novel regional centromeres of Zymoseptoria tritici distinguish core and accessory chromosomes. Epigenetics & Chromatin. 2015;8:41. doi: 10.1186/s13072-015-0033-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Scott KC, Sullivan BA. Neocentromeres: a place for everything and everything in its place. Trends in Genetics. 2014;30:66–74. doi: 10.1016/j.tig.2013.11.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Shang WH, Hori T, Martins NM, Toyoda A, Misu S, Monma N, Hiratani I, Maeshima K, Ikeo K, Fujiyama A, Kimura H, Earnshaw WC, Fukagawa T. Chromosome engineering allows the efficient isolation of vertebrate neocentromeres. Developmental Cell. 2013;24:635–648. doi: 10.1016/j.devcel.2013.02.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Shao Y, Lu N, Wu Z, Cai C, Wang S, Zhang LL, Zhou F, Xiao S, Liu L, Zeng X, Zheng H, Yang C, Zhao Z, Zhao G, Zhou JQ, Xue X, Qin Z. Creating a functional single-chromosome yeast. Nature. 2018;560:331–335. doi: 10.1038/s41586-018-0382-x. [DOI] [PubMed] [Google Scholar]
  49. Sinha S, Villarreal D, Shim EY, Lee SE. Risky business: microhomology-mediated end joining. Mutation Research/Fundamental and Molecular Mechanisms of Mutagenesis. 2016;788:17–24. doi: 10.1016/j.mrfmmm.2015.12.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Smith KM, Phatale PA, Sullivan CM, Pomraning KR, Freitag M. Heterochromatin is required for normal distribution of Neurospora crassa CenH3. Molecular and Cellular Biology. 2011;31:2528–2542. doi: 10.1128/MCB.01285-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Soyer JL, Möller M, Schotanus K, Connolly LR, Galazka JM, Freitag M, Stukenbrock EH. Chromatin analyses of Zymoseptoria tritici: methods for chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) Fungal Genetics and Biology. 2015;79:63–70. doi: 10.1016/j.fgb.2015.03.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Stimpson KM, Song IY, Jauch A, Holtgreve-Grez H, Hayden KE, Bridger JM, Sullivan BA. Telomere disruption results in non-random formation of de novo dicentric chromosomes involving acrocentric human chromosomes. PLOS Genetics. 2010;6:e1001061. doi: 10.1371/journal.pgen.1001061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Stimpson KM, Matheny JE, Sullivan BA. Dicentric chromosomes: unique models to study centromere function and inactivation. Chromosome Research. 2012;20:595–605. doi: 10.1007/s10577-012-9302-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Sullivan BA, Willard HF. Stable dicentric X chromosomes with two functional centromeres. Nature Genetics. 1998;20:227–228. doi: 10.1038/3024. [DOI] [PubMed] [Google Scholar]
  55. Suma M, Kitagawa T, Nakase Y, Nakazawa N, Yanagida M, Matsumoto T. Fission yeast CENP-C (Cnp3) plays a role in restricting the site of CENP-A accumulation. G3: Genes|Genomes|Genetics. 2018;8:2723–2733. doi: 10.1534/g3.118.200486. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Sun S, Billmyre RB, Mieczkowski PA, Heitman J. Unisexual reproduction drives meiotic recombination and phenotypic and karyotypic plasticity in Cryptococcus neoformans. PLOS Genetics. 2014;10:e1004849. doi: 10.1371/journal.pgen.1004849. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Thakur J, Sanyal K. Efficient neocentromere formation is suppressed by gene conversion to maintain centromere function at native physical chromosomal loci in Candida albicans. Genome Research. 2013;23:638–652. doi: 10.1101/gr.141614.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Thorvaldsdóttir H, Robinson JT, Mesirov JP. Integrative genomics viewer (IGV): high-performance genomics data visualization and exploration. Briefings in Bioinformatics. 2013;14:178–192. doi: 10.1093/bib/bbs017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Tolomeo D, Capozzi O, Stanyon RR, Archidiacono N, D'Addabbo P, Catacchio CR, Purgato S, Perini G, Schempp W, Huddleston J, Malig M, Eichler EE, Rocchi M. Epigenetic origin of evolutionary novel centromeres. Scientific Reports. 2017;7:41980. doi: 10.1038/srep41980. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. van Steensel B, Smogorzewska A, de Lange T. TRF2 protects human telomeres from end-to-end fusions. Cell. 1998;92:401–413. doi: 10.1016/S0092-8674(00)80932-0. [DOI] [PubMed] [Google Scholar]
  61. Ventura M, Antonacci F, Cardone MF, Stanyon R, D'Addabbo P, Cellamare A, Sprague LJ, Eichler EE, Archidiacono N, Rocchi M. Evolutionary formation of new centromeres in macaque. Science. 2007;316:243–246. doi: 10.1126/science.1140615. [DOI] [PubMed] [Google Scholar]
  62. Warburton PE. Chromosomal dynamics of human neocentromere formation. Chromosome Research. 2004;12:617–626. doi: 10.1023/B:CHRO.0000036585.44138.4b. [DOI] [PubMed] [Google Scholar]
  63. Yadav V, Sun S, Billmyre RB, Thimmappa BC, Shea T, Lintner R, Bakkeren G, Cuomo CA, Heitman J, Sanyal K. RNAi is a critical determinant of centromere evolution in closely related fungi. PNAS. 2018;115:3108–3113. doi: 10.1073/pnas.1713725115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Zhang B, Lv Z, Pang J, Liu Y, Guo X, Fu S, Li J, Dong Q, Wu HJ, Gao Z, Wang XJ, Han F. Formation of a functional maize centromere after loss of centromeric sequences and gain of ectopic sequences. The Plant Cell. 2013;25:1979–1989. doi: 10.1105/tpc.113.110015. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision letter

Editor: Wolf-Dietrich Heyer1
Reviewed by: Wolf-Dietrich Heyer2

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

In this manuscript the authors analyze the consequences of centromere deletions on chromosomes 9 and 10 in the fungal pathogen Cryptococcus deuterogattii. The authors identify the formation on neocentromeres at two different locations on chromosome 10. Elevated growth temperature selects for chromosome fusions of chromosome 10 with another chromosome involving inactivation of the neocentromere. The chromosomes 9 with neocentromeres do not display this behavior. Notably, the neocentromeric genes are constitutively expressed. The events described in this work present a mechanism that could explain the changes in genome organization within the Cryptococcus pathogenic species complex.

Decision letter after peer review:

[Editors’ note: the authors submitted for reconsideration following the decision after peer review. What follows is the decision letter after the first round of review.]

Thank you for submitting your work entitled "Centromere deletion in Cryptococcus deuterogattii leads to neocentromere formation and chromosome fusions" for consideration by eLife. Your article has been reviewed by three peer reviewers, including Wolf-Dietrich Heyer as the Reviewing Editor and Reviewer #1, and the evaluation has been overseen by a Senior Editor.

Our decision has been reached after consultation between the reviewers. Based on these discussions and the individual reviews below, we regret to inform you that your work will not be considered further for publication in eLife.

As you can see from the reviews, there was a span of opinions, which converged at the end of our discussion. We all agreed that the manuscript is interesting and that the process of neocentromere formation is an important topic. We appreciate the amount of work required to adequately analyze and describe these events, but the analysis offers little new insights into the mechanism of neocentromere formation beyond the description. As eLife wants to limit the required revisions to take no more than 2 months, we could not see how the major points listed in the reviews could be addressed in that time span. For this reason, we decided on a rejection. However, if you elect to address the comments in a future submission to eLife, I would be happy to serve as a reviewing editor again.

Reviewer #1:

This is a well written manuscript analyzing the consequences of centromere 10 deletion in the fungal pathogen Cryptococcus deuterogattii. The authors identify the formation on neocentromeres at two different locations on chromosome 10. Elevated growth temperature selects for chromosome fusions of chromosome 10 with another chromosome involving inactivation of the neocentromere. The events described in this work present a mechanism that could explain the changes in genome organization within the Cryptococcus pathogenic species complex.

The evidence presented supports the conclusions, and my main comment is that the analysis of the 2 isolates with 2 mapped neocentromeres remains inconclusive.

Essential revision:

1) The authors identify 2 isolates that contain 2 neocentromeres on chromosome 10 as defined by CENP-A ChIP. They present 4 models to explain the observation of having 2 CENP-A enrichment loci in a single chromosome: 1) aneuploidy, 2) a dicentric, 3) instability and neo-centromere switching, 4) only one CENPA locus being a functional centromere. The authors show that only a minority fraction of cells is aneuploid based on sequence read analysis. In the Discussion, they argue against a switching model, and discuss the possibility of the other two model. This is not very satisfying, and a ChIP experiment using a kinetochore component could be conducted to assess the functionality of both CENPA loci to assemble a kinetochore. There may be alternative routes to resolve this.

Reviewer #2:

Cryptococcus deuterogattii has regional centromeres and lacks active transposons. In this paper, authors deleted centromere 10 and examined chromosomes in surviving clones after centromere deletion (Cen10Δ mutants). This deletion caused neocentromere formation in two particular sites. Unlike neocentromere observed in other organisms, neocentromere formation in C. deuterogattii did not prevent expression of gene that cover neocentromeres. Cen10Δ mutants showed growth defects and aneuploidy. If growth temperature was increased, neocentromeres were inactivated and chromosome-fusion occurred. Finally authors speculated that these observations might occur under natural situation during speciation.

This is potentially interesting study, but work in this study is relatively descriptive. My major concern is unclearness how authors' observation in this study will be generalized. Authors performed various nice experiments and data are clear, however, I am not sure whether chromosomes with a neocentromere always get fusion with other chromosomes with growth fitness. In addition, it is also unclear whether cells with a neocentromere always show aneuploid. I agree that neocentromeres in this study are relatively weak compared with native centromeres, but I am not sure whether neocentromeres in C. deuterogattii are always weak or not. If authors show an evidence that observations in this study frequently occur or they can clearly explain about generality of this work, my mind will be changed.

Reviewer #3:

Centromeres vary in size and sequence between organisms. In most organisms studied, the centromere is marked by the presence of the histone H3 variant, CENPA. Accumulated evidence indicates that CENPA is the epigenetic mark that specifies centromere identity. Consistent with an epigenetic mode of centromere propagation in which sequence does not dominate, neocentromeres can arise on chromosomal regions which have no homology to centromere sequences. Many naturally-occurring neocentromeres have been reported and neocentromeres have been experimentally induced in several organisms, including fission yeast, Candida and chicken.

This manuscript describes the isolation and characterization of neocentromere strains in the human pathogen Cryptococcus deuterogattii, using an approach similar to previous studies in other species including Candida albicans. The authors delete the centromere of chromosome 10 and recover rare survivors which they go on to show by CENPA ChIP-seq have acquired either one or two neocentromeres on chromosome 10. They report that this has little impact on underlying gene expression. By whole genome sequencing the authors provide evidence that the strains have a degree of aneuploidy, and are growth impaired. By incubation of neocentromere strains at 37oC, small and large colonies are isolated and the authors show that the large colonies have undergone fusion of chromosome 10 with various other chromosomes. They also claim that the neocentromeres become inactivated upon chromosome fusion.

Although this manuscript is interesting, it has a preliminary feel. Whilst it is interesting that neocentromeres can form in Cryptococcus, which, like Candida, is a human pathogen, it is questionable whether the study provides significant advances in understanding the processes of neocentromere formation. The manuscript would also benefit from significant rewriting.

There is little discussion of potential relevance to the virulence / pathogenicity of Cryptococcus deuterogattii.

Several points need to be addressed before the manuscript could be considered for publication in eLife.

A more detailed description of C. deuterogattii centromeres is required – size, how many types of transposons are present? What is the level of sharing between centromeres? How much unique sequence is at each centromere? In particular a more-detailed diagram of cen10 is needed.

How many total transformants were obtained from the 99 transformations? Was it only the 7 mentioned? Or were there many NAT/NEO positive transformants that were incorrect? The approach used relies on correct integration and neocentromere formation to both occur almost in quick succession in order to recover the desired isolates. Did the authors consider/try a “two-step” approach such as that used in S. pombe (Ishii et al., 2008), in which a split marker gene is united via Cre-lox to delete the centromere?

Three strains are analysed by CENPA ChIP-seq and two neocentromeres are identified. Cen10Δ-1 has CENPA enrichment at both neocen1 and neocen2 sites, cen10Δ-2 has only neocen1, and cen10Δ-3 has both. What about the other neocen strains (cen10Δ-4,5,6,7)? Where is CENPA located in these other isolates, especially in cen10Δ-5?

Figure 1: cen10 and the precise region deleted must be indicated (either as a bar above the region or with shading that extends down Figure 1A). Relevant features, such as the transposon remnants at cen10, neocen1 (or A), neocen2 (or B), the gene names for the genes that are mentioned in the text and/or analysed in other figures must be labelled.

I assume that the reads mapping to the (deleted) cen10 region in cen10Δ-1,-2,-3 are due to CENPA ChIP-seq reads from copies of the transposons at other centromeres mapping to cen10 in the reference genome? This must be explained in the legend.

In part B the peak height at cen10Δ-2 neocen1 looks higher than those in cen10Δ-1, and -3. But in C – a zoom-in of that region – they all look similar. Also the “background”? across the chromosome arm in cen10Δ-1 appears higher than in the other isolates – what is real and what is due to differences in what is being displayed / scales?

The information associated with all ChIP-seq data (and similar) figures must be improved. The y axes are not labelled. What exactly is plotted? Has the data been normalized to input? Are the scales the same in all parts of the figure (the absence of any numbers on the y axes makes it impossible to interpret)? The legends must be more precise and informative.

What are the segregation properties of chr10 in the cen10Δ strains? Do neocen1 and/or neocen2 behave as centromeres? Ideally this should be done by integration of lacO arrays on chr10 and visualization via LacI-GFP. Integration of arrays near neocen1 and (separately) near cen10/neocen2 would enable predictions about the behavior of different loci to be tested. For instance, one would expect the neocen2 locus to be far from the spindle pole body in wild type (non-neocen) cells but close to it in cen10Δ-2/neocen cells. In addition, a single chr10 bearing two neocentromeres might exhibit segregation defects such as lagging/stretched chromosomes or chromosome breakage. Introduction of lacO arrays should be attempted, alternatively cen10Δ cells should at least be stained for CENPA and DAPI and cells with observable chromosome segregation defects quantified.

To assess whether either/both neocentromere is assembling a bona fide kinetochore ChIP-qPCR should be performed for a kinetochore protein (ideally an outer kinetochore protein). (Subsection “Deletion of centromere 10 results in neocentromere formation” paragraph three)

Differences in colony size are mentioned multiple times in the manuscript. It would be of interest to see examples of colonies of the various cen10Δ strains at different temperatures (in addition to the growth curves).

Figure 2: Has the qRT-PCR data presented in Figure 2 been normalized in any way to account for the ploidy difference between the wild-type strain and the cen10Δ strains? For instance, cen10Δ-2-S3 (which is presumably similar to cen10Δ-2) has a chr10 ploidy of 1.4 X wild-type. qPCR could be performed on genomic DNA (relative to a control euploid locus) and the qRT-PCR expression data normalized to it. If such normalization has been done already it is not described in the manuscript (and it should be).

Figure 4: Is it known where on chr10 the extra centromere-containing region of chromosome 5 is in cen10Δ-5-S3? It is intriguing that the neocen1 region appears to be duplicated in cen10Δ-L2, and, based on PFGE it is a duplication on chr10. What are the levels of CENPA on (duplicated?) neocen1? Is a double copy of neocen1 giving improved chromosome segregation?

It is interesting that cen10-Δ5 derivatives are euploid for chr10 but aneuploid for chr8. What might be the explanation for this observation? Are two copies of chr8 somehow beneficial?

“In contrast to the other fused chromosomes, this chromosomal fragment did not fuse to a chromosome with a native centromere, and thus the mutant still had a fitness defect.”, this statement is too strong – it's speculation. “thus” should be replaced by “consistent with”.

Subsection “cen10Δ isolates are aneuploid”: cen10Δ-5 strains are not euploid for chromosome 8.

Figure 4 legend – indicates that endogenous cen10 reads are absent due to its deletion. However, there are reads in the ChIP-seq in Figure 1. What is the reason for this difference?

The claim in the Abstract that the chr10 neocentromere is inactivated upon chromosome fusion must be supported by more substantial data. Figure 6 needs major improvement. It shows CENPA ChIP-qPCR for only wt, cen10Δ-1L and cen10Δ-2L, analyzing neocen1 and neocen2 regions. The CENPA ChIP-qPCR for positive control ie cen10-Δ1 and cen10-Δ2 with “active” neocentromeres (neocen1 and neocen2) is missing. Without that it is hard to interpret the data.

Which large colonies are analysed? Are they the same ones as in Figure 4, 5? There should be an indication of which strain they correspond to, or they should be given a new number.

CENPA ChIP-qPCR should be performed on three biological replicates and graphs show standard deviation.

CENPA ChIP-qPCR should be done for all original strains (and/or S strains) and fast-growing (L) derivatives. Ideally including qPCR for cen10 (as appropriate) and the centromere of the fusion chromosomes.

It is interesting that CENPA is apparently present on rrd2-1/neocen2 in wild type cells (in qPCR at least). There is no comment on this in the text. Could it be that wild-type cells do contain a small amount of CENPA at this location and, in the case of centromere deletion, this seeds further CENPA deposition to establish a neocentromere?

Figure 6B and subsection “cen10Δ chromosome is rescued by chromosome fusion”. The evidence that cen10Δ-3-L1 has a fusion between chr10 and chr13 should be explicitly described. Has PCR been done to confirm this?

All cen10Δ-5 strains analysed by PFGE are aneuploid for chromosome 8. In the figure legend there is the speculation that this occurred during or before transformation. Such speculative statements should not be placed in figure legends. I understand from the text and strain list that strain cen10Δ-5 (KS6) in the originally-derived and restreaked neocentromere strain and that the small and large colonies (KS21-25) are derived from that one original strain by growing at 37 and selecting small and large colonies. So the chr8 aneuploidy could have arisen in KS6 after transformation.

Figure 6B. Conclusions from data must be stated in the text, not the legend! Cen10Δ-5-L1 – from the data presented in the figure it can only be concluded that the chromosome 10 band is gone, it could have fused with another chromosome resulting in a fusion chromosome that migrates at the same size as other chromosomes. Southern would need to be performed with a chr10 probe to rigorously assess this. The sequencing data also confirms the fusions. But what is the data for 10-13 fusion in cen10Δ-L2?

Figure legends should only describe what was done and what is presented so that the reader can understand the data. Legends should not contain interpretation or discussion. All legends should be reviewed and modified accordingly. In addition, the figures themselves would be far easier for the reader to comprehend if the labelling of figures was improved.

[Editors’ note: further revisions were suggested prior to acceptance, as described below.]

Thank you for resubmitting your work entitled "Centromere deletion in Cryptococcus deuterogattii leads to neocentromere formation and chromosome fusions" for further consideration by eLife. Your revised article has been evaluated by Kevin Struhl (Senior Editor) and Wolf Heyer (Reviewing Editor) as well as two additional reviewers.

The manuscript has substantially improved and all reviewers recognized the strong efforts put into this revision. There are some remaining issues that need to be addressed before acceptance, as outlined below:

1) The remaining revisions do not require new experimentation.

2) The major point is the analysis of the ChIP seq experiments, which should change to fold enrichment over input on the y axis in ChIP-seq analyses.

Original comment:

In part B the peak height at cen10Δ-2 neocen1 looks higher than those in cen10Δ-1, and -3. But in C – a zoom-in of that region – they all look similar. Also, the “background”? across the chromosome arm in cen10Δ-1 appears higher than in the other isolates – what is real and what is due to differences in what is being displayed / scales?

Author response:

We thank the reviewers for noticing these issues, which are due to the "auto scale setting" feature of the IGV viewer. For the revised Figure 1, we set the read coverage to the maximum peak height, which reduced the background.

Reviewer comment:

OK, but it is more informative and best practice to indicate the fold enrichment over input on the y axis in ChIP-seq analyses.

Original comment:

The information associated with all ChIP-seq data (and similar) figures must be improved. The y axes are not labelled. What exactly is plotted? Has the data been normalized to input? Are the scales the same in all parts of the figure (the absence of any numbers on the y axes makes it impossible to interpret)? The legends must be more precise and informative.

Author resposne:

We have introduced "The read coverage (y-axis) shows the enrichment of CENP-A and

the x-axis shows the chromosome coordinates." All data shown are the result of the ChIP sample normalized with input. As expected, there is some variation in the number of reads of each ChIP analysis and this might be due to the ChIP, sequencing, library preparation, or experimental variation. For each sample, the reads are normalized to 1 within the sample.

Reviewer comment:

OK, but as above it is more informative and best practice to indicate the fold enrichment over input on the y axis in ChIP-seq analyses

3) The quality of the graphs, e.g. those presented in Figures 2 and 6 could be improved.

4) What is the strange effect on the ethidium-stained gels in Figure 1—figure supplement 1 and Figure 5—figure supplement 1?

5) The Discussion is quite lengthy and would benefit from being edited down. There are many of redundant descriptions. It is not necessary to summarize results there.

6) “C. deuterogattii is responsible for an ongoing outbreak in the Pacific Northwest regions of Canada and the United States” – outbreak of what?

7) In some places in the text it would be helpful to refer to specific parts of figures, e.g. Figure 3C.

8) Please indicate the precise region deleted in cen9Δ and cen10Δ in Figure 1. Discuss cen9Δ and cen10Δ in same order in text and figure.

eLife. 2020 Apr 20;9:e56026. doi: 10.7554/eLife.56026.sa2

Author response


[Editors’ note: the authors resubmitted a revised version of the paper for consideration. What follows is the authors’ response to the first round of review.]

Reviewer #1:

This is a well written manuscript analyzing the consequences of centromere 10 deletion in the fungal pathogen Cryptococcus deuterogattii. The authors identify the formation on neocentromeres at two different locations on chromosome 10. Elevated growth temperature selects for chromosome fusions of chromosome 10 with another chromosome involving inactivation of the neocentromere. The events described in this work present a mechanism that could explain the changes in genome organization within the Cryptococcus pathogenic species complex.

The evidence presented supports the conclusions, and my main comment is that the analysis of the 2 isolates with 2 mapped neocentromeres remains inconclusive.

Essential revision:

1) The authors identify 2 isolates that contain 2 neocentromeres on chromosome 10 as defined by CENP-A ChIP. They present 4 models to explain the observation of having 2 CENP-A enrichment loci in a single chromosome: 1) aneuploidy, 2) a dicentric, 3) instability and neo-centromere switching, 4) only one CENPA locus being a functional centromere. The authors show that only a minority fraction of cells is aneuploid based on sequence read analysis. In the Discussion, they argue against a switching model, and discuss the possibility of the other two model. This is not very satisfying, and a ChIP experiment using a kinetochore component could be conducted to assess the functionality of both CENPA loci to assemble a kinetochore. There may be alternative routes to resolve this.

Thank you for summarizing the findings, models, and sequencing. To confirm the CENP-A ChIP data, we have transformed a large subset of the centromere mutants with genes encoding either CENP-C or Mis12 epitope-tagged with GFP. As cen10Δ mutant-A, -B and -C span the gene encoding CENP-C we have transformed these isolates with Mis12-GFP. The cen9Δ mutants were transformed with CENP-C-GFP. Subsequently, we performed ChIP-qPCRs and this experiment showed that Mis12 and CENP-C co-localize with the CENP-A-enriched regions and confirmed that the neocentromeres are functional centromeres. The fact that we still see two enriched CENP-A regions in cenΔ-A and -C suggests that the observation is real; however, we cannot exclude the models above. Unfortunately, we cannot perform single-cell ChIPs and with the current experiment we are sequencing a population of cells. For example, we noticed when we streak purify cen10Δ cells, we obtain a population of colonies of mixed sizes (large and small). When we streak purify the small colonies, we again obtain a population of mixed size colonies, which is in contrast to streak purifying large colonies which only subsequently produce large colonies. As we show in the manuscript, the large colonies are the products s of chromosome fusion and the neocentromere is silenced.

In addition, we have identified the neocentromeres of cen10Δ-D to -G and have six mutants wherein centromere 9 is deleted. This allowed us to compare the neocentromeres on different chromosomes.

Reviewer #2:

Cryptococcus deuterogattii has regional centromeres and lacks active transposons. In this paper, authors deleted centromere 10 and examined chromosomes in surviving clones after centromere deletion (Cen10Δ mutants). This deletion caused neocentromere formation in two particular sites. Unlike neocentromere observed in other organisms, neocentromere formation in C. deuterogattii did not prevent expression of gene that cover neocentromeres. Cen10Δ mutants showed growth defects and aneuploidy. If growth temperature was increased, neocentromeres were inactivated and chromosome-fusion occurred. Finally authors speculated that these observations might occur under natural situation during speciation.

This is potentially interesting study, but work in this study is relatively descriptive. My major concern is unclearness how authors' observation in this study will be generalized. Authors performed various nice experiments and data are clear, however, I am not sure whether chromosomes with a neocentromere always get fusion with other chromosomes with growth fitness. In addition, it is also unclear whether cells with a neocentromere always show aneuploid. I agree that neocentromeres in this study are relatively weak compared with native centromeres, but I am not sure whether neocentromeres in C. deuterogattii are always weak or not. If authors show an evidence that observations in this study frequently occur or they can clearly explain about generality of this work, my mind will be changed.

Thank you for your comments. To answer the question raised, we have deleted another centromere (CEN9) and have identified the neocentromeres of cen10Δ isolates -D to -G. Only cen10Δ mutants have reduced fitness and undergo chromosome fusion. Centromere 9 mutants also exhibit a slow-growth phenotype, but the fitness defect is not as pronounced as in the cen10Δ mutants and we did not observe a population of mixed colony sizes in cen9Δ mutants. In the revised manuscript evidence is presented that the cen10Δ neocentromeres may be less fit than the native centromere and at higher growth temperature chromosome fusion has occurred in isolates producing large colonies. This shows a parallel with previous studies on neocentromere formation in S. pombe. However, in S. pombe, upon centromere deletion, either neocentromere formation or chromosome fusion occurs. Based on de novogenome assemblies, three chromosome fusions were identified and confirmed by PFGE, indicating that chromosome fusion occurred in all of the large cen10Δ isolates analyzed. In addition, the neocentromeres of cen9Δ and cen10Δ span actively transcribed genes, which has not been shown before. In S. pombe and C. albicans the genes are silenced. In chickens, genes are expressed in neocentromeres; however, these chromosomes are diploid and the neocentromere could bind to one only chromosome and the gene on the other chromosome could still be expressed.

Reviewer #3:

Centromeres vary in size and sequence between organisms. In most organisms studied, the centromere is marked by the presence of the histone H3 variant, CENPA. Accumulated evidence indicates that CENPA is the epigenetic mark that specifies centromere identity. Consistent with an epigenetic mode of centromere propagation in which sequence does not dominate, neocentromeres can arise on chromosomal regions which have no homology to centromere sequences. Many naturally-occurring neocentromeres have been reported and neocentromeres have been experimentally induced in several organisms, including fission yeast, Candida and chicken.

This manuscript describes the isolation and characterization of neocentromere strains in the human pathogen Cryptococcus deuterogattii, using an approach similar to previous studies in other species including Candida albicans. The authors delete the centromere of chromosome 10 and recover rare survivors which they go on to show by CENPA ChIP-seq have acquired either one or two neocentromeres on chromosome 10. They report that this has little impact on underlying gene expression. By whole genome sequencing the authors provide evidence that the strains have a degree of aneuploidy, and are growth impaired. By incubation of neocentromere strains at 37oC, small and large colonies are isolated and the authors show that the large colonies have undergone fusion of chromosome 10 with various other chromosomes. They also claim that the neocentromeres become inactivated upon chromosome fusion.

Although this manuscript is of interest, it has a preliminary feel. Whilst it is interesting that neocentromeres can form in Cryptococcus, which, like Candida, is a human pathogen, it is questionable whether the study provides significant advances in understanding the processes of neocentromere formation. The manuscript would also benefit from significant rewriting.

There is little discussion of potential relevance to the virulence / pathogenicity of Cryptococcus deuterogattii.

Thank you for the comments and suggestions. We have included a section about possible effects on virulence in the Discussion. The competition assays showed that both cen9Δ and cen10Δ mutants grow slower than the wild type. If centromere deletions occurred in nature, we hypothesize that the wild type would outcompete cenΔ isolates. Based on the reduced fitness of the cenΔ mutants we hypothesize that cenΔ mutants would be less virulent than the wild type.

Several points need to be addressed before the manuscript could be considered for publication in eLife.

A more detailed description of C. deuterogattii centromeres is required – size, how many types of transposons are present? What is the level of sharing between centromeres? How much unique sequence is at each centromere? In particular a more-detailed diagram of cen10 is needed.

The centromeres of C. deuterogattii were described in detail in a recent publication of the lab (Yadav et al., 2018). In that article the centromere content was analyzed in great detail, including the transposable element composition of the centromeres.

In the Introduction we have added more information about the size of C. deuterogattii and C. neoformans centromeres. “Recently, the centromeres of the human pathogenic fungus Cryptococcus deuterogattii were characterized and compared to those of the closely related species Cryptococcus neoformans (centromeres ranging from 27 to 64 kb), revealing dramatically smaller centromeres in C. deuterogattii (ranging from 8.7 to 21 kb) (Janbon et al., 2014; Yadav et al., 2018).” In the updated figures of the revised manuscript, we have employed the same color scheme for Tcn1-6 as presented by Yadav et al., 2018.

How many total transformants were obtained from the 99 transformations? Was it only the 7 mentioned? Or were there many NAT/NEO positive transformants that were incorrect? The approach used relies on correct integration and neocentromere formation to both occur almost in quick succession in order to recover the desired isolates. Did the authors consider/try a “two-step” approach such as that used in S. pombe (Ishii et al., 2008), in which a split marker gene is united via Cre-lox to delete the centromere?

We only saw 14 false positives for all 99 biolistic transformations. For the false positives, the NAT/NEO cassette was integrated randomly in the genome and these transformants were not characterized further. Thus, a total of 21 transformants were obtained, of which 7 were bona fide cen10Δ mutants (a frequency of 33%). While deleting genes for other projects, we have observed that the homologous recombination transformation rate is high in C. deuterogattii.

Thank you for the suggestion to use the Cre-lox method for the deletion of centromeres. To our knowledge, this method has not been developed or applied for C. deuterogattii. Instead, we have applied a CRISPR-Cas9 based homologous recombination method to delete centromere 9. This resulted in the successful deletion of centromere 9. The efficiency of centromere 9 deletion was significantly improved; however, the false positive (NAT ectopic integration in the genome) rate was significantly higher as well. As we have used CRISPR-Cas9, we did not make any conclusions for the transformation rate to delete centromere 9.

Three strains are analysed by CENPA ChIP-seq and two neocentromeres are identified. Cen10Δ-1 has CENPA enrichment at both neocen1 and neocen2 sites, cen10Δ-2 has only neocen1, and cen10Δ-3 has both. What about the other neocen strains (cen10Δ-4,5,6,7)? Where is CENPA located in these other isolates, especially in cen10Δ-5?

We have performed ChIP-seq for all of the cen10Δ mutants (-A to -G) and included the neocentromeres of cen10Δ -D to -G in our analysis in the revised manuscript. In addition, we obtained six cen9Δ mutants and have performed ChIP-seq for all of these mutants as well. This allowed us to define the chromosomal position for all of the neocentromeres. The neocentromere of cen10Δ-E is located directly adjacent to the left telomere. In the manuscript we speculate that this might be a reason why the centromere deletion mutant does not exhibit a fitness defect compared to the other cen10Δ mutants.

Figure 1: cen10 and the precise region deleted must be indicated (either as a bar above the region or with shading that extends down Figure 1A). Relevant features, such as the transposon remnants at cen10, neocen1 (or A), neocen2 (or B), the gene names for the genes that are mentioned in the text and/or analysed in other figures must be labelled.

C. deuterogattii has only truncated transposable elements and these elements are only present in the native centromeres. We have indicated the truncated transposable elements present in the native centromere (CEN9 and CEN10) with colors and labeled Tcn4 (orange) and Tcn6 (green). The neocentromeres are formed in unique chromosomal locations that lack transposable elements and repeats. Also, the neocentromeres span genes and are flanked by genes. We have updated Figure 1 and have labeled all of the genes.

I assume that the reads mapping to the (deleted) cen10 region in cen10Δ-1,-2,-3 are due to CENPA ChIP-seq reads from copies of the transposons at other centromeres mapping to cen10 in the reference genome? This must be explained in the legend.

Yes, this is correct and we have added this information in the revised figure legend. For the same reason, we still see reads mapped back to the deleted native centromere in cen9Δ-D. This is absent in other cen9Δ mutants; one possibility is that this is due to the ChIP library preparation and some reads are more enriched than others.

In part B the peak height at cen10Δ-2 neocen1 looks higher than those in cen10Δ-1, and -3. But in C – a zoom-in of that region – they all look similar. Also the “background”? across the chromosome arm in cen10Δ-1 appears higher than in the other isolates – what is real and what is due to differences in what is being displayed / scales?

We thank the reviewers for noticing these issues, which are due to the “auto scale setting” feature of the IGV viewer. For the revised Figure 1, we set the read coverage to the maximum peak height, which reduced the background.

The information associated with all ChIP-seq data (and similar) figures must be improved. The y axes are not labelled. What exactly is plotted? Has the data been normalized to input? Are the scales the same in all parts of the figure (the absence of any numbers on the y axes makes it impossible to interpret)? The legends must be more precise and informative.

We have introduced “The read coverage (y-axis) shows the enrichment of CENP-A and the x-axis shows the chromosome coordinates.” All data shown are the result of the ChIP sample normalized with input. As expected, there is some variation in the number of reads of each ChIP analysis and this might be due to the ChIP, sequencing, library preparation, or experimental variation. For each sample, the reads are normalized to 1 within the sample.

What are the segregation properties of chr10 in the cen10Δ strains? Do neocen1 and/or neocen2 behave as centromeres? Ideally this should be done by integration of lacO arrays on chr10 and visualization via LacI-GFP. Integration of arrays near neocen1 and (separately) near cen10/neocen2 would enable predictions about the behavior of different loci to be tested. For instance, one would expect the neocen2 locus to be far from the spindle pole body in wild type (non-neocen) cells but close to it in cen10Δ-2/neocen cells. In addition, a single chr10 bearing two neocentromeres might exhibit segregation defects such as lagging/stretched chromosomes or chromosome breakage. Introduction of lacO arrays should be attempted, alternatively cen10Δ cells should at least be stained for CENPA and DAPI and cells with observable chromosome segregation defects quantified.

Thank you for the suggestion to test for segregation properties. We have tried several times to cross the cen10Δ mutants. However, even for the wild-type controls it is challenging to observe mating for C. deuterogattii. Unfortunately, due to this technical limitation at present the ability to test the segregation properties of the neocentromeres is limited. To verify if the neocentromeres are functional ChIP-qPCR with Mis12 or CENP-C were conducted, and enrichment for all earlier identified CENP-A enriched regions was observed. Studying the chromosomes of C. deuterogattii by microscopy has not as yet been accomplished or reported and is beyond the state of the art. The chromosomes are small and only one mCherry/GFP signal in the nucleus is visible.

To assess whether either/both neocentromere is assembling a bona fide kinetochore ChIP-qPCR should be performed for a kinetochore protein (ideally an outer kinetochore protein). (Subsection “Deletion of centromere 10 results in neocentromere formation” paragraph three)

We have transformed a large subset of neocentromere mutants with a gene encoding an additional epitope-tagged kinetochore protein. We have analyzed cen10Δ mutants with Mis12- GFP and cen9Δ mutants with CENP-C-GFP. Subsequently ChIP-qPCR was performed. The results obtained confirm the neocentromeric (based on CENP-A enrichment) location in cenΔ mutants.

Differences in colony size are mentioned multiple times in the manuscript. It would be of interest to see examples of colonies of the various cen10Δ strains at different temperatures (in addition to the growth curves).

We have included a supplementary figure with an example of a population of colonies with a mixed colony size for a centromere 10 deletion mutant.

Figure 2: Has the qRT-PCR data presented in Figure 2 been normalized in any way to account for the ploidy difference between the wild-type strain and the cen10Δ strains? For instance, cen10Δ-2-S3 (which is presumably similar to cen10Δ-2) has a chr10 ploidy of 1.4 X wild-type. qPCR could be performed on genomic DNA (relative to a control euploid locus) and the qRT-PCR expression data normalized to it. If such normalization has been done already it is not described in the manuscript (and it should be).

Thank you for this suggestion. For the revised Figure 2, we have normalized gene expression of the cen10Δ mutants with a housekeeping gene located on chromosome 10 and normalized to the wild-type. All cen9Δ mutants are normalized to the wild type and actin.

Figure 4: Is it known where on chr10 the extra centromere-containing region of chromosome 5 is in cen10Δ-5-S3? It is intriguing that the neocen1 region appears to be duplicated in cen10Δ-L2, and, based on PFGE it is a duplication on chr10. What are the levels of CENPA on (duplicated?) neocen1? Is a double copy of neocen1 giving improved chromosome segregation?

ChIP-seq has not been conducted with these isolates and the centromeric region of the fused chromosomes is not at present established

It is interesting that cen10-Δ5 derivatives are euploid for chr10 but aneuploid for chr8. What might be the explanation for this observation? Are two copies of chr8 somehow beneficial?

Thank you for the suggestion; we are not aware that aneuploidy for chromosome 8 can be beneficial. However, this is an interesting observation.

“In contrast to the other fused chromosomes, this chromosomal fragment did not fuse to a chromosome with a native centromere, and thus the mutant still had a fitness defect.”, this statement is too strong – it's speculation. “thus” should be replaced by “consistent with”.

Thank you for the suggestion, we have replaced “thus” with “consistent with”

Subsection “cen10Δ isolates are aneuploid”: cen10Δ-5 strains are not euploid for chromosome 8.

We have re-written the sentence.

Figure 4 legend indicates that endogenous cen10 reads are absent due to its deletion. However, there are reads in the ChIP-seq in Figure 1. What is the reason for this difference?

This is due to the presence of repeats in the remaining native centromeres. The native centromeres span truncated transposable elements and repeats. Due to the mapping with Bowtie2, the reads are mapped equally between repeats. All of the data shown is based on mapping the reads to the wild-type genome assembly. Due to duplicated regions (TEs/repeats), sequencing reads derived from regions other than centromere 9 or 10 are mapped back to the location of the native centromere in the centromere deletion mutants.

We have confirmed the centromere mutants based on Southern blot analysis and PCR confirmation, which both document the lack of the native centromere in the centromere 9 or 10 mutants.

The claim in the Abstract that the chr10 neocentromere is inactivated upon chromosome fusion must be supported by more substantial data. Figure 6 needs major improvement. It shows CENPA ChIP-qPCR for only wt, cen10Δ-1L and cen10Δ-2L, analyzing neocen1 and neocen2 regions. The CENPA ChIP-qPCR for positive control ie cen10-Δ1 and cen10-Δ2 with “active” neocentromeres (neocen1 and neocen2) is missing. Without that it is hard to interpret the data.

Which large colonies are analysed? Are they the same ones as in Figure 4, 5? There should be an indication of which strain they correspond to, or they should be given a new number.

CENPA ChIP-qPCR should be performed on three biological replicates and graphs show standard deviation.

Thank you for the comment. All qPCR results shown in the manuscript were performed in three replicates and the standard deviation for each measurement has been included in the revised manuscript. The controls suggested (cen10Δ-A and cen10Δ-B) have also now been included. For cen10Δ-A and cen10Δ-A-L1 we have indicated the two CENP-A-enriched regions. For cen10Δ-B and cen10Δ-B-L2 we have used the same primer set, although this cen10Δ mutant only has one CENP-A-enriched region. As control, the wild type is presented. The same strains in Figure 4 and 5 were used for this analysis and we have added this information in the figure legend of Figure 6 (cen10Δ-A-L1 and cen10Δ-B-L2).

CENPA ChIP-qPCR should be done for all original strains (and/or S strains) and fast-growing (L) derivatives. Ideally including qPCR for cen10 (as appropriate) and the centromere of the fusion chromosomes.

We now have included qPCR data for an additional kinetochore protein and this confirms the CENP-A sequencing shown in Figure 1. This is based on independent experiments and before the ChIPs were performed, we had to transform all of the original cenΔ mutants. After transformation, all of the transformants obtained were streak purified twice.

When we streak purify cen10Δ mutants for single colonies, we always observe a mixed population of colony sizes. When we streak purify small cen10Δ colonies we again always see a mixed population. Streak purification of large cen10Δ colonies instead results in only large colonies. The growth curves of Figure 3—figure supplement 1 shows that small colonies grow like the original mutants and, in contrast. the large colonies have wild-type fitness levels. All of the ChIP experiments are based on large volumes as we cannot conduct single cell ChIPs. As mentioned, we cannot perform ChIP-qPCRs for the native centromere 10, as this centromere has been deleted.

Based on all of these experiments and conclusions we hypothesize that all small colonies of the same cen10Δ have the same neocentromeric location and all large colonies have inactive neocentromeres.

It is interesting that CENPA is apparently present on rrd2-1/neocen2 in wild type cells (in qPCR at least). There is no comment on this in the text. Could it be that wild-type cells do contain a small amount of CENPA at this location and, in the case of centromere deletion, this seeds further CENPA deposition to establish a neocentromere?

The qPCRs shown in Figure 6 show a slight enrichment of CENP-A on the RRD2 gene. However, this was only in one region located in the gene, and the second region was not enriched. We now included CENP-A reads of the wild type and show this for all chromosomal location where neocentromeres are formed. Based on ChIP-seq we do not see any enrichment for CENP-A in the RRD2-2 region.

Figure 6B and subsection “cen10Δ chromosome is rescued by chromosome fusion”. The evidence that cen10Δ-3-L1 has a fusion between chr10 and chr13 should be explicitly described. Has PCR been done to confirm this?

Long-read sequencing data for this strain is not available and thus it is challenging to design a primer pair to validate the fusion junction. However, all of the chromosome fusions based on de novogenome assembly were confirmed by PCR and this information is now included as a panel in Figure 5—figure supplement 1. T. Beside the PCR confirmation, the chromosome fusions were confirmed by read coverage. In the de novoassembly, the reads show a continuous pattern and there are no read breaks.

All cen10Δ-5 strains analysed by PFGE are aneuploid for chromosome 8. In the figure legend there is the speculation that this occurred during or before transformation. Such speculative statements should not be placed in figure legends. I understand from the text and strain list that strain cen10Δ-5 (KS6) in the originally-derived and restreaked neocentromere strain and that the small and large colonies (KS21-25) are derived from that one original strain by growing at 37 and selecting small and large colonies. So the chr8 aneuploidy could have arisen in KS6 after transformation.

Thank you for this suggestion; we have removed this speculation. We can only speculate when the chr8 aneuploidy occurred in this mutant. The chromosome duplication might have occurred after the transformation event. As we see this chromosome duplication for all small and large colonies it suggests that all of the cells have the parental chr8 duplication.

Figure 6B. Conclusions from data must be stated in the text, not the legend! Cen10Δ-5-L1 – from the data presented in the figure it can only be concluded that the chromosome 10 band is gone, it could have fused with another chromosome resulting in a fusion chromosome that migrates at the same size as other chromosomes. Southern would need to be performed with a chr10 probe to rigorously assess this. The sequencing data also confirms the fusions. But what is the data for 10-13 fusion in cen10Δ-L2?

We did not include this strain in our sequencing experiment.

Figure legends should only describe what was done and what is presented so that the reader can understand the data. Legends should not contain interpretation or discussion. All legends should be reviewed and modified accordingly. In addition, the figures themselves would be far easier for the reader to comprehend if the labelling of figures was improved.

We have now updated all figure legends to enhance the labeling and presentation of the content.

[Editors’ note: what follows is the authors’ response to the second round of review.]

The manuscript has substantially improved and all reviewers recognized the strong efforts put into this revision. There are some remaining issues that need to be addressed before acceptance, as outlined below:

1) The remaining revisions do not require new experimentation.

2) The major point is the analysis of the ChIP seq experiments, which should change to fold enrichment over input on the y axis in ChIP-seq analyses.

Original comment:

In part B the peak height at cen10Δ-2 neocen1 looks higher than those in cen10Δ-1, and -3. But in C – a zoom-in of that region – they all look similar. Also, the “background”? across the chromosome arm in cen10Δ-1 appears higher than in the other isolates – what is real and what is due to differences in what is being displayed / scales?

Author response:

We thank the reviewers for noticing these issues, which are due to the "auto scale setting" feature of the IGV viewer. For the revised Figure 1, we set the read coverage to the maximum peak height, which reduced the background.

Reviewer comment:

OK, but it is more informative and best practice to indicate the fold enrichment over input on the y axis in ChIP-seq analyses.

Original comment:

The information associated with all ChIP-seq data (and similar) figures must be improved. The y axes are not labelled. What exactly is plotted? Has the data been normalized to input? Are the scales the same in all parts of the figure (the absence of any numbers on the y axes makes it impossible to interpret)? The legends must be more precise and informative.

Author resposne:

We have introduced "The read coverage (y-axis) shows the enrichment of CENP-A and

the x-axis shows the chromosome coordinates." All data shown are the result of the ChIP sample normalized with input. As expected, there is some variation in the number of reads of each ChIP analysis and this might be due to the ChIP, sequencing, library preparation, or experimental variation. For each sample, the reads are normalized to 1 within the sample.

Reviewer comment:

OK, but as above it is more informative and best practice to indicate the fold enrichment over input on the y axis in ChIP-seq analyses.

Thank you for this comment. We have now indicated the height of the CENP-A peaks on the right side of each panel in the revised version of Figure 1. The height of the ChIP-seq peak shows the fold enrichment of the CENP-A peak when subtracted from the input sample.

3) The quality of the graphs, e.g. those presented in Figures 2 and 6 could be improved.

Thank you for this suggestion. To improve the figure quality we have removed the pattern of the bar plots and we use now a solid grey fill for the revised versions. In addition, each bar plot now has a black border and the error-bars are have been re-colored to black.

4) What is the strange effect on the ethidium-stained gels in Figure 1—figure supplement 1 and Figure 5—figure supplement 1?

Thank you for this comment; to be honest we don’t have a good explanation. We might have exceeded the voltage limits while running the gels as the shape of the ladder bands show a V-shape. Another option could be improper mixture of the agarose gel; however, this seems less unlikely as each PCR series was run on independent agarose gels.

5) The Discussion is quite lengthy and would benefit from being edited down. There are many of redundant descriptions. It is not necessary to summarize results there.

Thank you for this comment. As suggested, we have removed redundant text and the revised Discussion is now ~1.2 pages shorter. In addition, two short paragraphs were moved to the revised Results section.

6) “C. deuterogattii is responsible for an ongoing outbreak in the Pacific Northwest regions of Canada and the United States” – outbreak of what?

The outbreak referred to is that of cryptococcosis in the Pacific Northwest. The sentence in the revised manuscript has been replaced by: “C. deuterogattii is responsible for an ongoing cryptococcosis outbreak in the Pacific Northwest regions of Canada and the United States (Fraser et al., 2005).”

7) In some places in the text it would be helpful to refer to specific parts of figures, e.g. Figure 3C.

Thank you for this comment. We have inserted several references to the figures in the text and this should make the presentation more readily understood by the reader.

8) Please indicate the precise region deleted in cen9Δ and cen10Δ in Figure 1. Discuss cen9Δ and cen10Δ in same order in text and figure.

The deleted region of chromosome 9 in Figure 1A and the deleted region of chromosome 10 in Figure 1D are now indicated with a grey line in the revised figure. In the revised manuscript, each paragraph starts now with the description of results of centromere 9 followed by centromere 10.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Schotanus K. 2020. Neocentromere formation in Cryptococcus deuterogattii. NCBI BioProject. PRJNA511460 [DOI] [PMC free article] [PubMed]
    2. Universidade Federal do Rio Grande do Sul 2017. Cryptococcus gattii R265 mRNA RNA-Seq. NCBI Sequence Read Archive. SRR5209627

    Supplementary Materials

    Supplementary file 1. Neocentromeres are not enriched with transposable elements.

    To exclude the possibility that transposable elements were deposited into the neocentromeres, BlastN searches in a database with de novo genome assemblies of cen10-A, cen10-B, cen10∆-C, and cen10∆-E were performed. As input the homologous wild-type sequence of the chromosomal location of the neocentromeres was used. All neocentromeres of the tested cen10∆ mutants have the same length as the homologous sequence in the wild -type.

    elife-56026-supp1.xlsx (10.8KB, xlsx)
    Supplementary file 2. Neocentromeric regions are expressed in the wild-type strain.

    Expression levels of genes where neocentromeres formed in the cenΔ mutants were analyzed in the R265 wild-type strain. Previously generated RNA sequencing data were remapped to the R265 reference genome, and the expression levels (FPKM) were analyzed for the native genes in each region wherein a neocentromere was formed in the cenΔ mutants. Expression analysis of several housekeeping genes was included for control purposes, and the median RNA expression level of all genes located on chromosomes 9 and 10 are listed. Expression levels of genes located in the subtelomeric regions were also analyzed.

    elife-56026-supp2.xlsx (13.2KB, xlsx)
    Supplementary file 3. Genes located in subtelomeric regions of chromosomes 1, 4, 7, and 10.

    We have indicated whether each gene located in the subtelomeric regions was lost in the chromosomal fusion of the large-colony cen10∆ mutants. For each gene in the subtelomeric region of chromosome 1, 4, 7, and 10, several characteristics, such as the chromosomal location, putative function, and the presence of putative C. neoformans homologs in existing mutant libraries, are indicated. There is no correlation between the loss of genes with or without predicted function.

    elife-56026-supp3.xlsx (12.2KB, xlsx)
    Supplementary file 4. Primers used in this study.

    For each primer, the lab identifier, purpose, and sequence are shown.

    elife-56026-supp4.xlsx (17.1KB, xlsx)
    Supplementary file 5. Strains used in this study.

    For each strain used in this study, the lab strain identifier, description, and parental strain are indicated.

    elife-56026-supp5.xlsx (11.1KB, xlsx)
    Transparent reporting form

    Data Availability Statement

    ChIP and whole-genome sequencing reads and de novo genome assemblies were deposited under NCBI BioProject Accession ID: PRJNA511460.

    The following dataset was generated:

    Schotanus K. 2020. Neocentromere formation in Cryptococcus deuterogattii. NCBI BioProject. PRJNA511460

    The following previously published dataset was used:

    Universidade Federal do Rio Grande do Sul 2017. Cryptococcus gattii R265 mRNA RNA-Seq. NCBI Sequence Read Archive. SRR5209627


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES