Skip to main content
Genome Biology and Evolution logoLink to Genome Biology and Evolution
. 2019 Jul 25;11(8):2312–2329. doi: 10.1093/gbe/evz165

Intraspecific Diversity of Fission Yeast Mitochondrial Genomes

Yu-Tian Tao 1,2, Fang Suo 1, Sergio Tusso 3,4, Yan-Kai Wang 1, Song Huang 1,5, Jochen B W Wolf 3,4, Li-Lin Du 1,5,
Editor: Kenneth Wolfe
PMCID: PMC6736045  PMID: 31364709

Abstract

The fission yeast Schizosaccharomyces pombe is an important model organism, but its natural diversity and evolutionary history remain under-studied. In particular, the population genomics of the S. pombe mitochondrial genome (mitogenome) has not been thoroughly investigated. Here, we assembled the complete circular-mapping mitogenomes of 192 S. pombe isolates de novo, and found that these mitogenomes belong to 69 nonidentical sequence types ranging from 17,618 to 26,910 bp in length. Using the assembled mitogenomes, we identified 20 errors in the reference mitogenome and discovered two previously unknown mitochondrial introns. Analyzing sequence diversity of these 69 types of mitogenomes revealed two highly distinct clades, with only three mitogenomes exhibiting signs of inter-clade recombination. This diversity pattern suggests that currently available S. pombe isolates descend from two long-separated ancestral lineages. This conclusion is corroborated by the diversity pattern of the recombination-repressed K-region located between donor mating-type loci mat2 and mat3 in the nuclear genome. We estimated that the two ancestral S. pombe lineages diverged about 31 million generations ago. These findings shed new light on the evolution of S. pombe and the data sets generated in this study will facilitate future research on genome evolution.

Keywords: population genomics, Schizosaccharomyces pombe, mitogenome, mitochondrial DNA, de novo assembly, evolutionary history

Introduction

The fission yeast Schizosaccharomyces pombe is a unicellular fungal species belonging to the Taphrinomycotina subphylum of the Ascomycota phylum (Liu et al. 2008). The first descriptions of this species in the 1890s reported it as a microorganism associated with fermented alcoholic drinks, including its presence in East African millet beer and in the fermenting sugarcane molasses for making a distilled liquor (Batavia Arrack) in Indonesia (Lindner 1893; Vorderman 1893; Eijkman 1894; Lodder and Kreger-Van Rij 1952; Barnett and Lichtenthaler 2001). Thereafter, S. pombe has been found in various human-associated environments throughout the world, but has never been isolated in truly wild settings (Brown et al. 2011; Jeffares et al. 2015; Jeffares 2018). In 1947, Urs Leupold, the founder of fission yeast genetics, selected an S. pombe isolate from French grape juice as the subject of his PhD research, and this strain (hereafter referred to as the Leupold strain) has essentially been the only strain used for modern S. pombe molecular biology studies (Osterwalder 1924; Leupold 1950; Hu et al. 2015). In 2002, the complete genome sequence of the Leupold strain was published, making S. pombe the sixth eukaryotic species with a sequenced genome (Wood et al. 2002). Today, S. pombe is recognized as one of the most prominent model organisms for understanding the molecular mechanisms of cellular processes (Hoffman et al. 2015; Hayles and Nurse 2018).

In recent years, the intraspecific genomic diversity of S. pombe has begun to be investigated (Brown et al. 2011; Rhind et al. 2011; Avelar et al. 2013; Clément-Ziza et al. 2014; Fawcett et al. 2014; Zanders et al. 2014; Hu et al. 2015; Jeffares et al. 2015, 2017). In particular, Jeffares et al. sequenced the genomes of 161 S. pombe isolates and comprehensively explored genomic variation within this species (Jeffares et al. 2015, 2017). However, the breadth and depth of knowledge on the natural diversity and evolutionary history of S. pombe remain limited, especially compared with the other model yeast species, Saccharomyces cerevisiae (Duan et al. 2018; Peter et al. 2018).

The mitochondrion originates from a bacterial endosymbiont and, after extensive reductive evolution, still retains a small genome (Lang et al. 1997). Compared with the nuclear genome, the smaller size, higher copy number, and lower level of recombination of the mitochondrial genome (mitogenome) have long made it an attractive subject for intraspecific comparative studies, shedding light on the evolution of many species including humans (Ingman et al. 2000). Recently, the population mitogenomic approach has begun to be applied to fungal species, leading to new insights about population structure and evolutionary dynamics (Jung et al. 2012; Freel et al. 2015; Wolters et al. 2015; Leducq et al. 2017).

The mitogenome of the Leupold strain of S. pombe was completely sequenced more than 10 years before its nuclear genome (Lang 1984; Lang et al. 1985, 1987; Trinkl et al. 1989). It contains 2 rRNA genes (rnl and rns), 8 protein-coding genes (atp6, atp8, atp9, cob, cox1, cox2, cox3, and rps3 which is also known as var1), a gene encoding the RNA component of mitochondrial RNaseP (rnpB), and 25 tRNA genes (Bullerwell et al. 2003; Schäfer 2003). No complete mitogenome sequences of any other S. pombe isolates have been reported thus far. Restriction fragment analysis and limited Sanger sequencing have indicated that presence–absence polymorphisms of mitochondrial introns are widespread among S. pombe isolates (Zimmer et al. 1984, 1987), but an accurate and thorough understanding of intraspecific mitogenomic variation of S. pombe is still lacking.

In this study, we used both published and newly generated genome sequencing data to perform de novo assembly of the mitogenomes in 199 S. pombe isolates. We successfully assembled the complete mitogenome sequences of 192 isolates. Analyzing these mitogenome sequences led to the discovery of reference mitogenome errors, new mitochondrial introns, and divergence patterns providing new insights into the evolutionary history of S. pombe. In particular, we show that S. pombe isolates descend from two long-separated ancient lineages. An independent study published after the initial submission of this article reaches the same conclusion through analyzing the nuclear genome diversity of S. pombe isolates (Tusso et al. 2019).

Materials and Methods

Previously Published Genome Sequencing Data of 161 JB Strains

The Illumina genome sequencing data of 161 S. pombe strains with names that begin with the initials JB were downloaded from European Nucleotide Archive (ENA) according to the ENA accession numbers given in Jeffares et al. (2015), and are listed in supplementary table S1, Supplementary Material online. For 11 sequencing runs that belong to the ENA Study accession number PRJEB6284, we noticed discrepancies between the read numbers reported in Jeffares et al. (2015) and the read numbers in the data downloaded from ENA. The authors of the prior study confirmed that strain name mix-ups had occurred during the submission of the sequencing data to the ENA by The Genome Analysis Centre (Jeffares D, personal communication). A list of these 11 sequencing runs with their correct corresponding strain names is provided as supplementary table S2, Supplementary Material online.

Genome Sequencing of 38 Strains from Culture Collections in China and United States

To explore intraspecific diversity beyond the previously analyzed strains, we acquired 38 S. pombe strains from four culture collections in China and one culture collection in the United States: 20 strains from CGMCC (China General Microbiological Culture Collection Center), 13 strains from CICC (China Center of Industrial Culture Collection), 1 strain from CICIM (Culture and Information Centre of Industrial Microorganisms of China Universities), 1 strain from CFCC (China Forest Culture Collection Center), and 3 strains from NRRL (United States Department of Agriculture Agricultural Research Service Culture Collection) (supplementary table S1, Supplementary Material online). Only 3 of these 38 strains have isolation information: CGMCC 2.1043 was isolated from fermented grains for making Moutai, a Chinese liquor; NRRL Y-11791 was from reconstituted lime juice (location unknown); NRRL Y-48646 was from a wine producing company (location unknown). Single-cell-derived clones of these strains were deposited into our laboratory strain collection (DY collection) and given strain names that begin with the initials DY (supplementary table S1, Supplementary Material online). Cells grown on YES solid media were used for genomic DNA preparation using the MasterPure Yeast DNA Purification Kit (Epicentre). The kit manufacturer's protocol was followed, with the exception of lysing the cells by glass bead beating in a FastPrep-24 homogenizer (MP Biomedicals) for 20 s at a speed setting of 6.4 m/s. The sequencing library for DY15505 was constructed using the NEBNext DNA Library Prep Master Mix (NEB). For the other 37 strains, tagmentation-based sequencing library preparation was performed using home-made Tn5 transposase (Picelli et al. 2014). Post-tagmentation gap filling and PCR amplification were performed using the KAPA HiFi HotStart PCR Kit (Kapa Biosystems) with the following cycling parameters: 3 min at 72 °C, 30 s at 95 °C, and then 11 cycles of 10 s at 95 °C, 30 s at 55 °C, and 30 s at 72 °C. AMPure XP beads (Beckman Coulter) were used to select PCR product in the size range of 400–700 bp. Paired-end sequencing was performed using Illumina HiSeq 2000 (2 × 96 read pairs), HiSeq 2500 (2 × 100 read pairs or 2 × 101 read pairs), or HiSeq X Ten sequencers (2 × 150 read pairs). Sequencing data for these 38 strains have been deposited at NCBI SRA under accession numbers SRR8698890–SRR8698927.

De Novo Assembly of the Mitogenomes

The genome sequencing data for the above mentioned 199 strains (161 JB strains and 38 DY strains) were cleaned by Trimmomatic version 0.32 with options LEADING:30, TRAILING:30, SLIDINGWINDOW:4:30, and MINLEN:80 (MINLEN:130 for 150-bp HiSeq X Ten reads) (Bolger et al. 2014). We found empirically that de novo assembly of the mitogenome requires data downsampling with 300,000 cleaned read pairs being a suitable downsampling target data size except for longer-read-length HiSeq X Ten-generated data, which required a lower downsampling target read number. Downsampling was performed using the software seqtk (https://github.com/lh3/seqtk, last accessed July 29, 2019). Three independent sets of randomly downsampled data were obtained using the seed numbers 100, 500, and 800. De novo assembly was performed using A5-miseq version 20150522 (Coil et al. 2015). From the A5-miseq output we selected the mtDNA-containing contigs based on length and sequence. Custom Perl scripts were used to trim overlapping sequences at the end of the mtDNA contigs and set the starting position of the circular-mapping mtDNA to that of the reference S. pombe mitogenome (accession number NC_001326.1). Full-length mitogenome assemblies were usually obtained from all three sets of downsampled data. Pilon version 1.21 was used to polish the assemblies (Walker et al. 2014). Polished assemblies were verified by mapping all cleaned reads of a strain to the corresponding assembly and manually examining the mapping results on a genome browser. No assembly errors were apparent during manual inspection, nor was any obvious heteroplasmy observed. In total, we obtained full-length mitogenome assemblies for 192 of the 199 strains (supplementary table S1, Supplementary Material online). Among these, there were 69 different types of mitogenome sequences which we designated as MT type 1–69 or, for brevity, MT1–MT69 (supplementary table S1, Supplementary Material online).

Validation of the Mitogenome Assemblies Using Third-Generation Sequencing

To validate the assemblies based on Illumina-generated short reads, we assembled mitogenomes using long read data from two types of third-generation sequencing technology: the RSII platform of Pacific Biosciences (PacBio) and MinIOn by Oxford Nanopore (Tusso et al. 2019) (data deposited in Bioproject PRJNA527756 of the sequencing read archive at the National Center for Biotechnology Information). PacBio data for 15 strains (JB4, JB22, JB760, JB842, JB853, JB858, JB872, JB873, JB900, JB918, JB934, JB939, JB1197, JB1205, and JB1206) and MinION data for 7 strains (JB22, JB760, JB858, JB873, JB934, JB1197, and JB1205) were used. De novo assembly was performed using Canu 1.5 (Koren et al. 2017) with default parameters followed by polishing using the package BridgeMapper of the SMRT Analysis Software v2.3.0. A second polishing was performed using short reads aligned with BWA 0.7.15 and Pilon 1.22 (Walker et al. 2014). Custom python scripts were used to identify and trim the resulting mitogenome assemblies. For comparison, long-read-based mitogenome assemblies were aligned to the corresponding short-read-based ones using MAFFT 7.407 (Katoh and Standley 2013). Because the mitogenome of JB842 was not assembled from short reads, PacBio-based assembly of JB842 was compared with MT47, the short-read-based assembly of JB851 and JB857, which share the same nuclear genome type with JB842. In all cases, the long-read-based assemblies were identical to the corresponding short-read-based assemblies.

Mitogenome Annotation

Protein-coding genes in the assembled mitogenomes were annotated using MFannot based on genetic code 4 (the only difference between genetic code 4 and the standard code is UGA being a tryptophan codon, not a stop codon) (Lang et al. 2007; Valach et al. 2014). MFannot was also used for predicting intronic regions and intron types (group I or group II intron). tRNA and rRNA annotations were transferred from the reference mitogenome using the software RATT (Otto et al. 2011). The EMBL-format reference annotation file required by RATT was generated from the GenBank format file using the software Artemis (Carver et al. 2012), which was also used to convert the RATT output from EMBL format to GenBank format. Results of software-based annotation were verified by manual inspection. The annotations of mt-tRNAArg(UCU) and mt-tRNAGlu(UUC) were revised according to the recently published S. pombe mitochondrial transcriptome analysis (the starting and ending positions of the former were shifted upstream for one nucleotide and six nucleotides, respectively, and the ending position of the latter was shifted downstream for one nucleotide) (Shang et al. 2018). The 69 types of mitogenomes (MT1–MT69) together with their annotations have been deposited at GenBank under accession numbers MK618072–MK618140. The lengths of these mitogenomes, the total lengths of different types of sequence features in these mitogenomes, and the intron presence–absence patterns are listed in supplementary table S3, Supplementary Material online.

Published sequences and annotations of the mitogenomes of the three other Schizosaccharomyces species were used for the analysis of genes in these three mitogenomes (accession numbers NC_004312.1 and AF275271.2 for S. octosporus, accession numbers NC_004332.1 and AF547983.1 for S. japonicus, and accession number MK457734 for S. cryophilus) (Bullerwell et al. 2003; Rhind et al. 2011).

De Novo Assembly of the Recombination-Repressed K-Region in the Nuclear Genome

Using the same set of Illumina genome sequencing data of 199 S. pombe strains, we performed targeted de novo assembly of the K-region. For this purpose, we employed the assembler software TASR version 1.6.2 in the de novo assembly mode (-i 1 mode) (https://github.com/warrenlr/TASR, last accessed July 29, 2019) (Warren and Holt 2011). Because the 4.3-kb centromere-repeat-like cenH element within the K-region cannot be assembled using short sequencing reads, we chose the 4.6-kb reference genome sequence of the mat2cenH interval (nucleotides 4,557–9,169 of GenBank accession number FP565355.1) and the 1.9-kb reference genome sequence of the cenHmat3 interval (nucleotides 13,496–15,416 of GenBank accession number FP565355.1) as the input sequences provided to TASR for read recruitment. Based on read mapping, 12 of the 199 strains lacked the K-region (supplementary table S1, Supplementary Material online). These 12 strains included JB22 (Leupold’s 972 strain), an h−S mating type strain in which the K-region is known to be absent (Beach and Klar 1984). For 150 (80%) of the remaining 187 strains, we were able to generate complete assemblies corresponding to the two target sequences (supplementary table S1, Supplementary Material online). The failure to fully assemble the sequences for the other 37 strains appeared to be mainly owing to insufficient sequencing depth, as 92% (110/119) of the strains with >40× average nuclear genome sequencing depth (based on cleaned reads) had fully assembled sequences, whereas only 59% (40/68) of the strains with <40× sequencing depth had fully assembled sequences (supplementary table S1, Supplementary Material online). We concatenated the fully assembled mat2cenH interval, 100 Ns (representing the unassembled cenH sequence), and the fully assembled cenH–mat3 interval together as the K-region sequence. Mapping reads to the assembled K-region sequences showed that, for strains that appear to have more than one copy of the K-region based on read depth (supplementary table S1, Supplementary Material online), no obvious sequence differences exist between K-region copies, except for one single-nucleotide inter-copy variation in DY29155 and DY29156. Among the K-region sequences of 150 strains, there are 29 nonidentical sequence types. We designated them K-region type 1–29 or, for brevity, K1–K29 (supplementary table S1, Supplementary Material online). In particular, we assigned K-region type 1 (K1) to the K-region sequence in JB50 (Leupold's 968 h90 strain), a strain that should have the same K-region sequence as the reference genome. However, K1 differed in 34 positions from the K-region sequence in the reference genome, including 26 nucleotide substitutions and 8 one-base indels. For all but one of these 34 positions, the reference genome alleles did not exist in any K-region types, whereas the alleles of K1 were shared by other K-region types. Thus, these differences are most likely due to reference sequence errors. The 29 types of K-region sequences (K1–K29) have been deposited at GenBank under accession numbers MK618141–MK618169.

Phylogenetic Tree Construction

We used two methods to construct phylogenetic trees based on gene sequences present in all 69 MT types. In the first method, we used the non-intronic nucleotide sequences of nine genes (rnl, rns, cox1, cox3, cob, atp6, atp8, atp9, and cox2) present in the mitogenomes of all four fission yeast species to construct a neighbor-joining tree based on the p-distance model in MEGA 7.0.18 (Kumar et al. 2016). Bootstrap analysis with 1,000 replicates was performed. In the second method, we employed MEGA to construct a maximum likelihood tree using the non-intronic nucleotide sequences of the above nine genes plus rps3 and rnpB. The model recommended by MEGA, TN93+G+I, was used. Bootstrap analysis with 1,000 bootstrap replicates was performed. For the construction of the phylogenetic trees of introns, a maximum likelihood tree of each intron was constructed with 100 bootstrap replicates using the model suggested by MEGA. For the construction of the phylogenetic trees of intron-encoded proteins (IEPs), we obtained protein sequences closely related to Schizosaccharomyces IEPs by BLASTP search of NCBI nonredundant (nr) database. Maximum likelihood trees were constructed with 100 bootstrap replicates using the model suggested by MEGA. A neighbor-joining tree and a maximum likelihood tree of the K-region were also constructed using MEGA.

ADMIXTURE Analysis and Heatmap Analysis

For the 69 MT types, nucleotide substitution variants were identified from the sequence alignment of intron-removed sequences. Bi-allelic single-nucleotide variants (SNVs) were merged into bi-allelic multi-nucleotide variants (MNVs) if two neighboring bi-allelic SNVs were less than 15 bp apart and shared the same allelic partition of the 69 MT types. We used custom Perl scripts and PLINK 1.07 to generate a binary PLINK BED format file, which was used as input for ADMIXTURE version 1.3.0 (Alexander et al. 2009). K values were varied from 2 to 8. For each K value, 10 replicate ADMIXTURE runs were performed using seeds from 1 to 10. Post-processing and visualization of ADMIXTURE results were carried out using the CLUMPAK web server (Kopelman et al. 2015). The major modes identified by CLUMPAK are presented. Results from K > 3 appear no longer informative and are not shown. SNVs and MNVs in non-intronic sequences were visualized in a heatmap by employing the R package ComplexHeatmap. For the 29 types of K-region sequences, bi-allelic SNVs and MNVs were identified in the same way, and were used for heatmap analysis.

Recombination Analysis

A gap-stripped and intron-free alignment of 69 MT types was used as input for the RDP4 program (Martin et al. 2015). Seven statistical methods implemented in RDP4, including RDP, GENECONV, BootScan, Maxchi, Chimaera, SiScan, and 3Seq, were used for the detection of recombination events.

Divergence Time Estimation

The S. pombe nuclear genome is mostly euchromatic, with heterochromatin only existing in centromeres, telomeres, mating-type region, and rDNA. Heterochromatic regions tend to have higher mutation rates (Polak et al. 2015; Sun et al. 2016). Because the K-region, being part of the mating-type region, is heterochromatic (Grewal and Klar 1997), it may have a mutation rate higher than the mutation rate of the nuclear genome as a whole. To obtain a calibrated mutation rate for the K-region, we chose two strains reflecting ancestral genetic divergence (Tusso et al. 2019): the pure Sp lineage strain JB869 (Sp ancestry proportion 0.98) harboring the K1 type K-region sequence and the pure Sk lineage strain JB758 (Sk ancestry proportion 0.98) harboring the K19 type K-region sequence. We performed the calibration by comparing the SNV differences between JB869 and JB758 in the K-region versus the SNV differences between these two strains in the feature-free regions of the nuclear genome, which lack any PomBase-annotated features including genes and other genomic features such as repeat regions and low-complexity regions (Lock et al. 2019). We chose the feature-free regions for comparison because the K-region lacks protein-coding genes and is probably under little selective constraint and because the feature-free regions are also subject to little purifying selection (Jeffares et al. 2015). The coordinates of the feature-free regions were extracted from the genome annotation files chromosome1.contig, chromosome2.contig, and chromosome3.contig downloaded from ftp://ftp.pombase.org/pombe/genome_sequence_and_features/artemis_files/ (last accessed July 29, 2019; last modification dates of these annotation files are all October 18, 2017). Using published variant data (Jeffares et al. 2015), we found that JB869 and JB758 differ by 70 SNVs in the 6.53-kb K-region and 12,321 SNVs in the 1.465-Mb feature-free regions. Thus, the SNV density of the K-region was 27% higher than that of the feature-free regions of the nuclear genome. Essentially the same difference in SNV density was obtained when making comparisons using other pure-lineage Sp and Sk strains (Tusso et al. 2019). We interpret this difference in SNV density as a 27% increase in the mutation rate of the K-region relative to the mutation rate of the remaining nuclear genome. Using 1.27 as the calibration parameter and 2.00 × 10−10 mutations per site per generation as the genome-wide mutation rate (Behringer and Hall 2015; Farlow et al. 2015), we calculated the mutation rate of the K-region to be 2.54 × 10−10 mutations per site per generation. BEAST version 2.4.7 and its associated programs were used for divergence time estimation, assuming a strict molecular clock (Bouckaert et al. 2014). All positions in the K-region were used. The site model was selected using bModelTest version 1.0.4. We compared two tree priors, the Yule model and the birth–death model, and chose the latter according to evaluations performed using Tracer version 1.6. Five independent runs were performed for 10 million generations each. We initiated runs on random starting trees, and sampled the trees every 10,000th generation. Effective sampling sizes were above 200 for all parameters. Results of the five runs were combined, with 10% removed as burn-in, using LogCombiner. Maximum clade credibility trees were summarized using TreeAnnotator, with posterior probability limit set to 0.5. Trees were visualized using FigTree.

Results

De Novo Assembly of the Complete Mitogenomes of 192 S. pombe Strains

A previous study generated Illumina-based genome sequencing data of 161 S. pombe isolates (JB strains) (Jeffares et al. 2015). Based on SNVs in the nuclear genome, Jeffares et al. showed that these JB strains have 57 types of nuclear genomes, with 129 strains falling into 25 “clonal clusters” each composed of multiple strains with near-identical nuclear genomes and 32 other strains each possessing a uniquely distinct nuclear genome. Jeffares et al. chose a set of 57 isolates, called “non-clonal strains,” to represent the 57 types of distinct nuclear genomes that differ from each other by no less than 1,900 SNVs (Jeffares et al. 2015). We used the publicly available Illumina sequencing data of these 161 JB strains (supplementary tables S1 and S2, Supplementary Material online) to perform de novo assembly of their mitogenomes. We were able to assemble the complete circular-mapping mitogenomes of 154 (96%) JB strains (supplementary table S1, Supplementary Material online). Validation using PacBio- and MinION-generated long read data of 15 diverse JB strains (Tusso et al. 2019) shows no difference between long-read-based mitogenome assemblies and Illumina-based mitogenome assemblies. The 154 assembled mitogenomes encompass 59 nonidentical sequence types, which we term MT types (supplementary table S1, Supplementary Material online). MT types may differ by as little as one nucleotide.

The 59 MT types present among the JB strains by and large correlated with the 57 previously defined nuclear genome types present among these strains (supplementary table S1, Supplementary Material online). All but two of the 57 non-clonal strains have fully assembled mitogenomes. These 55 mitogenomes fall into 53 MT types, with three non-clonal strains, JB1205, JB1206, and JB1207, sharing the same MT type. The two non-clonal strains without fully assembled mitogenomes, JB842 and JB874, belong to clonal clusters 18 and 24, respectively. Other strains belonging to these two clusters do have fully assembled mitogenomes, which indicate that these two clusters correspond to two additional MT types (MT47 and MT21). The remaining four MT types (MT25, MT39, MT52, and MT65) are each highly similar to the MT type of a non-clonal strain and thus represent intra-cluster variations, with differences being a single SNV (MT25 vs. MT26 for cluster 10 and MT52 vs. MT53 for cluster 2), a single one-nucleotide indel (MT65 vs. MT64 for cluster 23), or the presence–absence polymorphisms of mitochondrial introns (MT39 vs. MT40 for cluster 15). The fact that we have obtained full-length mitogenomes from JB strains representing all 57 nuclear genome types indicates that the mitogenome diversity among the JB strains has been comprehensively captured.

To explore intraspecific diversity beyond that of the JB strains, we obtained 38 additional S. pombe isolates (DY strains) from Chinese and US culture collections, performed genome sequencing on them, and assembled the full-length mitochondrial genomes for all of them (supplementary table S1, Supplementary Material online). The resulting 38 mitogenomes fall into 19 nonidentical types, including 9 MT types present among the JB strains, and 10 MT types not present among the JB strains. Thus, overall, we identified 69 MT types from the fully assembled mitogenomes of 192 S. pombe isolates. We performed gene annotation on these MT types. The annotated sequences of the 69 MT types have been deposited at GenBank (accession numbers MK618072–MK618140).

Identification of 20 Errors in the Reference S. pombe Mitogenome

The reference S. pombe mitogenome (accession numbers NC_001326.1 and X54421.1) was from a Leupold-background strain with the genotype h ade7-50 (Lang 1984; Zimmer et al. 1984). The reference S. pombe nuclear genome was derived from Leupold's 972 h−S strain (Wood et al. 2002), called JB22 in the JB strain set. JB22 is the non-clonal strain representing clonal cluster 1 of the JB strains (Jeffares et al. 2015, 2017). Our de novo assembly of the mitogenomes showed that the mitogenomes of all clonal cluster 1 JB strains are identical. We designate this type of mitogenome MT1 (accession number MK618072).

Despite both being from the Leupold strain background, MT1 differs from the reference S. pombe mitogenome in 20 positions, including 13 single-nucleotide substitutions, 1 double-nucleotide substitution, 5 single-nucleotide indels, and 1 three-nucleotide insertion (table 1). Thirteen of these 20 differences are located in protein-coding genes, and 11 of them alter amino acid sequences (table 1). A previous study has uncovered 16 of these 20 differences by mapping Illumina sequencing reads to the reference mitogenome, but did not ascertain whether the differences were due to reference errors or polymorphisms (Iben et al. 2011). Our mitogenome assemblies show for all of these 20 positions that the sequence of MT1 is identical to those of the other 68 MT types. Thus, these 20 differences are caused by reference errors, not by naturally existing polymorphisms.

Table 1.

Differences between the Reference S. pombe Mitogenome (NC_001326.1) and MT1

Reference Genome Position Reference Genome Sequence MT1 Sequence Type of Sequence Difference Affected Gene Amino Acid Difference Detected before by Iben et al. (2011)
1,788 C T Single-nucleotide substitution rnl N/A Yes
2,091 A G Single-nucleotide substitution rnl N/A Yes
2,364 C CC Insertion rnl N/A No
4,102 C G Single-nucleotide substitution rns N/A Yes
4,463 C A Single-nucleotide substitution rns N/A Yes
4,525 AC A Deletion rns N/A No
4,777 AA A Deletion Intergenic N/A No
6,906 A T Single-nucleotide substitution cox1-I2b S21C Yes
8,376 C CTAC Insertion cox1 An extra Y residue after Y400 No
8,529 G T Single-nucleotide substitution cox1 E451D Yes
10,309 TC CT Double-nucleotide substitution cob I46T Yes
11,218 T A Single-nucleotide substitution cob-I1 I121N Yes
11,644 A C Single-nucleotide substitution cob-I1 H263P Yes
13,196 C T Single-nucleotide substitution cob-I1 Synonymous Yes
13,751 C T Single-nucleotide substitution cob Synonymous Yes
15,081 TA T Deletion atp6 L109F and R111G Yes
15,088 G GA Insertion atp6 Yes
15,200 G T Single-nucleotide substitution atp6 V149F Yes
18,648 G A Single-nucleotide substitution cox2 V30I Yes
19,152 A G Single-nucleotide substitution cox2 S198G Yes

N/A, not applicable.

Discovery of Two New Mitochondrial Introns

Using restriction fragment analysis, a previous study of 26 S. pombe isolates estimated that the mitogenomes vary in length from 17.6 to 24.6 kb (Zimmer et al. 1987). We found here that among the 69 MT types, the mitogenome length varied between 17,618 and 26,910 bp. Length variation is almost entirely due to intron presence–absence polymorphisms (fig. 1A and supplementary table S3, Supplementary Material online; intron presence–absence polymorphisms are described in a later section).

Fig. 1.

Fig. 1.

—Length polymorphism among 69 types of S. pombe mitogenomes (MT types) and two newly discovered mitochondrial introns. (A) Length variation among 69 MT types ordered from the shortest to the longest. The sequence of each MT type is divided into five feature categories. The total length of each category is shown here in color and also listed in supplementary table S3, Supplementary Material online. (B) Diagram depicting the locations of the nine distinct S. pombe mitochondrial introns. At the top, an intron-less mitogenome is depicted, with the lengths of genes and intergenic sequences drawn to scale. At the bottom, cox1, cob, and cox2, the three genes harboring introns, are enlarged, and intron insertion positions are denoted by black vertical lines. Introns are shown as rectangles. The lengths of introns are not drawn to scale. (C) Comparison between the cox1-I1b intron in MT1 and the cox1-I1b′ intron in MT53. The DNA sequences of these two introns are divided into four segments: a segment coding for the LAGLIDADG domains, two segments upstream of the LAGLIDADG segment (one with 100% identity between cox1-I1b and cox1-I1b′ and the other with much lower identity), and a segment downstream of the LAGLIDADG segment. For each segment, percentage sequence identity (based on the length of sequence alignment) is shown. (D) The amino acid sequences of the two LAGLIDADG motifs in the IEPs of cox1-I1b and cox1-I1b′. In each motif, the 8th residue critical for endonuclease activity is denoted by an arrowhead. (E) Comparison between the cox1-I4 intron in MT11 and the cox1-aI1 intron in the reference Sa. cerevisiae mitogenome (NC_001224.1). The DNA sequences of these two introns are divided into three segments: a segment coding for the RT domain (reverse transcriptase domain), X domain (maturase domain), and En domain (endonuclease domain, formerly known as Zn domain), a segment upstream of the RT–X–En segment, and a segment downstream of the RT–X–En segment.

In S. pombe, there are seven previously known mitochondrial introns, which are called cox1-I1a, cox1-I1b, cox1-I2a, cox1-I2b, cox1-I3, cob-I1, and cox2-I1 (Schäfer 2003) (fig. 1B). Three of them, cox1-I1b (Schäfer et al. 1991), cox1-I2b (Lang 1984), and cob-I1 (Lang et al. 1985), are present in the Leupold strain. The other four introns, cox1-I1a (Schäfer and Wolf 1999), cox1-I2a (Trinkl and Wolf 1986), cox1-I3 (Trinkl and Wolf 1986), and cox2-I1 (Schäfer et al. 1998), are absent in the Leupold strain. In our de novo assembled mitogenomes, we identified two new introns, which we named cox1-I1b′ and cox1-I4, respectively (fig. 1B). cox1-I1b′ is located at the exact same position as cox1-I1b, and its presence is mutually exclusive with the presence of cox1-I1b. Both cox1-I1b′ and cox1-I1b are group I introns, and both encode proteins containing two LAGLIDADG endonuclease domains. The first 249 nucleotides of these two introns are identical, but the remaining portions are rather divergent, with the LAGLIDADG-domain-coding sequences exhibiting only 63% identity (fig. 1C). Despite this divergence, among the proteins in the NCBI nr database, the closest homolog of the IEP of cox1-I1b′ is the IEP of cox1-I1b, suggesting recently shared ancestry of these two proteins (supplementary fig. S1, Supplementary Material online).

The IEP of cox1-I1b has been shown to possess both homing endonuclease and intron maturase activities (Schäfer et al. 1994; Pellenz et al. 2002; Schäfer 2003). For LAGLIDADG proteins, the nuclease activity requires that the 8th residue in the namesake LAGLIDADG motif must be an acidic residue to allow coordination with metal ions essential for catalysis (Chevalier et al. 2004). The 8th residues of the two LAGLIDADG motifs in the IEP of cox1-I1b are acidic residues (fig. 1D). In contrast, the 8th residues of the two LAGLIDADG motifs in the IEP of cox1-I1b′ are nonacidic residues (fig. 1D), suggesting that this protein probably has lost the homing endonuclease activity and acts solely as a maturase. This kind of degeneration of endonuclease function is remarkably common among the Schizosaccharomyces group I intron IEPs, as half of them (7/14) have nonacidic residues at the 8th position of at least one LAGLIDADG motif (supplementary fig. S2, Supplementary Material online).

In S. pombe, all previously analyzed mitochondrial intron IEPs are thought to be translated as fusions with upstream exons, as the coding sequences of IEPs are always in-frame with 5′ exons (Schäfer 2003). We observed an exception to this rule in several cox1-I1b′ sequences. In three MT types (MT52, MT53, and MT66), the LAGLIDADG-domain-coding sequences in cox1-I1b′ are out-of-frame with 5′ exons due to a one-nucleotide insertion about 70 bp upstream of the LAGLIDADG domains (supplementary fig. S3, Supplementary Material online). This observation raises the possibility that S. pombe mitochondrial intron IEPs may not always be translated as in-frame extensions of the preceding exons.

The other intron newly identified in this study, cox1-I4, is located at a position downstream of all previously known cox1 introns in S. pombe (fig. 1B). It is a group II intron. Our phylogenetic analysis showed that the IEP of cox1-I4 does not share a close relationship with any of the other Schizosaccharomyces group II intron IEPs (supplementary fig. S4, Supplementary Material online). Instead, it is most closely related to the IEPs encoded by cox1-ai1 and cox1-ai2 introns in Sa.cerevisiae and cox1-ai1 introns in other species of the family Saccharomycetaceae (fig. 1E and supplementary fig. S4, Supplementary Material online). Thus, S. pombe cox1-I4 may have arisen through horizontal transfer from a Saccharomycetaceae species.

Phylogenetic Relationship of 69 MT Types Based on Non-Intronic Sequences

Using the non-intronic sequences of nine genes (cox1, cox3, cob, atp6, atp8, atp9, cox2, rnl, and rns), which are conserved among the four Schizosaccharomyces species, we constructed a neighbor-joining tree of the 69 MT types (fig. 2, left). In this tree, the 69 MT types mostly fall into two highly distinct clades, with the single exception being MT15, locating at an intermediate position between the two clades. The same tree topology was obtained when we constructed a maximum likelihood tree using the non-intronic sequences of the above nine genes plus rps3 and rnpB (supplementary fig. S5, Supplementary Material online). The smaller of the two clades contains MT1, the mitogenome in the Leupold strain background, from which the S. pombe reference genome was derived. Thus, we term this clade containing 14 MT types the REF clade. Accordingly, we term the other clade, which contains 54 MT types, the NONREF clade.

Fig. 2.

Fig. 2.

—Phylogenetic analysis and maximum-likelihood clustering analysis of the non-intronic sequences of 69 MT types. For the phylogenetic analysis (left), a neighbor-joining tree was constructed using a concatenated alignment of the non-intronic sequences of nine genes conserved across four Schizosaccharomyces species. The three non-pombe Schizosaccharomyces species (S. octosporus, S. cryophilus, and S. japonicus) were used as outgroup to root the tree. Outgroup branches are not drawn to scale. Bootstrap values higher than 70% are shown on the branches. Scale bar, 0.001 substitutions per site. For the maximum-likelihood clustering analysis using the ADMIXTURE program (right), the input was 222 segregating sites, which correspond to all bi-allelic SNV and MNV sites in non-intronic sequences (supplementary table S4, Supplementary Material online). The 69 MT types, except for MT15, are classified into two clades (REF and NONREF clades), with each clade further divided into two subclades (A and B subclades for the REF clade, and S and D subclades for the NONREF clade). Arrowheads denote the three MT types that exhibit signs of inter-clade recombination. Tree branches and MT type names are colored according to subclade affiliation. The name of a representative strain for each MT type is shown in parentheses. For the 59 MT types present among the JB strains, JB strains are chosen as representative strains, with preference given to the non-clonal strains. For each MT type, the continent(s) where associated strains have been isolated are indicated by colored circles to the right of the tree, with the continent having more associated strains placed to the left (see supplementary table S1, Supplementary Material online for more detailed information on strains).

The REF clade has a substantially lower within-clade diversity than the NONREF clade. Nonetheless, MT types in the REF clade can be clearly divided into two subclades, which we term REF-A and REF-B. Within the NONREF clade, the relatedness among the MT types is highly uneven, with 40 MT types (MT16–MT55) falling into a closely related monophyletic cluster, which we term NONREF-S subclade (S stands for similar). The large number of closely related MT types in the NONREF-S subclade may be partly due to nonrandom sampling of S. pombe isolates (see “Discussion” section). The other 14 MT types (MT56–MT69) in the NONREF clade are much more diverse, and we group them into a paraphyletic subclade, termed NONREF-D (D stands for diverse).

For the REF clade and the NONREF-D subclade, affiliated strains tend to share geographic origins (fig. 2, middle). MT types in the REF clade are mainly associated with strains collected from Europe, whereas MT types in the NONREF-D subclade are mainly associated with strains collected from Asia-Pacific. Among the 30 REF clade strains with known collection locations, 22 were collected from Mediterranean European countries including France, Spain, Italy, and Malta (supplementary table S1, Supplementary Material online), suggesting a possible Southern European origin of this MT clade. Among the eight NONREF-D strains with known collection locations, six were collected from Asian countries (supplementary table S1, Supplementary Material online), suggesting that this subclade of high diversity is mainly distributed in Asia. In contrast, the NONREF-S subclade, despite its low internal diversity, has the broadest geographic distribution, with associated strains coming from all continents where S. pombe has been isolated. One possible explanation is that the NONREF-S strains have been distributed around the world by human migration (see “Discussion” section).

To verify and complement the results obtained using phylogenetic tree construction, we identified from the alignment of non-intronic sequences a total of 222 bi-allelic SNVs and MNVs (supplementary table S4, Supplementary Material online), and performed maximum-likelihood clustering analysis using the ADMIXTURE program (fig. 2, right) (Alexander et al. 2009). In addition, we directly visualized these bi-allelic SNVs and MNVs in a heatmap (fig. 3). These analyses lent support to the clade and subclade division. For the ADMIXTURE analysis, when K = 2, the REF clade and the NONREF clade are clearly distinguished; when K = 3, the NONREF clade is further separated into two clusters, corresponding to the NONREF-S subclade and the NONREF-D subclade. MT15, the MT type situated between the REF clade and the NONREF clade in the phylogenetic trees, exhibits an inter-clade mosaic pattern for both K values, suggesting that it may be a recombination product between REF and NONREF mitogenomes. Interestingly, two other MT types, MT22 and MT23, also consistently exhibit inter-clade mosaic patterns in the ADMIXTURE results, albeit to a lesser degree than MT15.

Fig. 3.

Fig. 3.

—A two-color heatmap of the 222 bi-allelic SNV and MNV sites in non-intronic sequences of 69 MT types. These are the same 222 sites used in the ADMIXTURE analysis shown in figure 2. Each row in the heatmap represents an MT type (in the same order as in fig. 2), and each column represents a polymorphic site. MT1 alleles are colored in yellow and non-MT1 alleles are colored in green. Sites locating within the eight protein-coding genes and the three large RNA genes are indicated by bracketed lines above the heatmap. MT type names and names of the representative strains are colored according to subclade affiliation. Arrowheads denote the three MT types that exhibit signs of inter-clade recombination.

Inspecting the heatmap confirmed that MT15, MT22, and MT23 are the products of inter-clade recombination (fig. 3). MT15 appears to result from two inter-clade recombination events, with two stretches of its sequence resembling the REF-A subclade and the other two stretches resembling the NONREF-S subclade. The bulk of the sequences in MT22 and MT23 match those in other NONREF-S mitogenomes, with two small stretches of sequences in the rnl gene of MT22 and one small stretch of sequence spanning the rnpB gene in MT23 exhibiting REF clade patterns. We also performed statistical identification of recombination events using the program RDP4 (Martin et al. 2015) (supplementary table S5, Supplementary Material online). Consistent with the results of ADMIXTURE analysis and visual inspection of the heatmap, the only recombination events supported by all seven recombination detection methods employed by RDP4 are those associated with MT15, MT22, and MT23. Recombination events not associated with these three MT types have much weaker support and appear to be mostly intra-clade recombination events occurring between NONREF mitogenomes.

Together, the above analyses of the non-intronic sequences demonstrate that present-day S. pombe mitogenomes descend from two well-separated ancient lineages, with only rare mitogenome recombination having occurred between lineages. This runs counter the expectation from a previously published nuclear genome analysis concluding that S. pombe lacks strong population structure (Jeffares et al. 2015). The results are, however, consistent with an independent study published after the initial submission of this article revealing that S. pombe nuclear genomes also descend from two ancestral lineages (see a later section) (Tusso et al. 2019).

Presence–Absence Polymorphisms and Phylogeny of Mitochondrial Introns

There are 18 types of intron presence–absence patterns in the 69 MT types (fig. 4A and supplementary table S3, Supplementary Material online). For each of the eight intron insertion sites, introns are only present in some but not all MT types, indicating that intron gain and/or loss have happened at all sites. The four group I intron sites are occupied in appreciably higher proportions (93%, 70%, 87%, and 67% for cox1-I1b/I1b′, cox1-I2a, cox1-I2b, and cox1-I3, respectively) than the four group II intron sites (41%, 14%, 28%, 54% for cox1-I1a, cox1-I4, cob-I1, and cox2-I1, respectively).

Fig. 4.

Fig. 4.

—Intron presence–absence patterns and maximum likelihood trees of mitochondrial introns. (A) Intron presence–absence patterns at the 8 intron insertion sites in the 69 MT types. Gray squares indicate intron presence. The presence of cox1-I1b and cox1-I1b′ is respectively indicated by the bold letter b and the characters b′ inside a square. The phylogenetic tree on the left is the same as in figure 2 without the outgroup branches. (B) Maximum likelihood trees of the nine introns. The MT type from which the intron sequence originated is represented by the respective number in a colored square. The color of the square denotes the subclade affiliation of the MT type. The mosaic MT type, MT15, which partly resembles the REF-A subclade and partly resembles the NONREF-S subclade, is represented by a square colored half-and-half with the colors of those two subclades. Trees were rooted by midpoint rooting. Scale bar, 0.01 substitutions per site.

The REF clade and the NONREF clade show distinct intron presence–absence patterns, with 6 of 8 intron sites exhibiting statistically significant differences between the clades (supplementary fig. S6, Supplementary Material online). cox1-I1a, cox1-I2a, cox1-I3, and cox2-I1 are completely or almost completely absent in the REF clade, but are common or ubiquitous in the NONREF clade. In contrast, cob-I1 is present in 93% of MT types in the REF clade but is present in only 9% of the MT types in the NONREF clade. For the cox1-I1b/I1b′ site, cox1-I1b is present in all MT types in the REF clade, whereas cox1-I1b′ is present in 83% of the MT types in the NONREF clade. These opposing patterns suggest that the two ancient S. pombe mitogenome lineages evolved different intron contents after their divergence.

Intron presence–absence patterns also exhibit a correlation with the subclade division within the REF clade. The REF-A subclade and the REF-B subclade are perfectly distinguished by the presence–absence patterns of cox1-I2b and cox1-I4, with the REF-A MT types all having cox1-I2b but not cox1-I4, and the REF-B MT types all having cox1-I4 but not cox1-I2b.

Within the low-nucleotide-diversity NONREF-S subclade, two group II introns, cox1-I1a and cox2-I1, are respectively present in 45% and 67.5% of the 40 MT types, and their presence–absence patterns do not obviously correlate with the nucleotide-based phylogeny, indicating that these two introns may have undergone extensive gain and/or loss events during the evolution of NONREF-S mitogenomes. Remarkably, all 18 NONREF-S MT types containing cox1-I1a also contain cox2-I1 (P = 0.00008, Fisher’s exact test), suggesting that, for reason(s) unclear to us, the presence of cox1-I1a in this subclade may be dependent on the presence of cox2-I1.

We constructed maximum likelihood trees for each of the nine introns (fig. 4B). By and large, the phylogeny based on the sequences of a given intron mirrors the phylogeny of the MT types harboring that intron, and shows a clear distinction between the REF clade and the NONREF clade for the four introns with appreciable presence in both clades (cox1-I1b, cox1-I2b, cox1-I4, and cob-I1). Thus, during S. pombe evolution, mitochondrial introns have rarely crossed the boundary between the two clades, consistent with the low extent of inter-clade recombination described earlier. There are a few notable exceptions. MT7 is the only REF clade MT type containing cox1-I1a and cox2-I1, and these two introns in MT7 are respectively identical to those in the majority of the NONREF-S MT types, suggesting that they may originate from cross-clade transfer. MT16 is one of few NONREF MT types harboring cox1-I1b and cox1-I4, and these two introns in MT16 are respectively identical to those in the REF-B MT types but different from those in the other NONREF MT types, suggesting that they may also result from cross-clade transfer.

De Novo Assembly and Phylogenetic Analysis of the K-Region

As the above results indicate, contemporary S. pombe mitogenomes descend from two distinct ancient lineages. Next, we addressed the question whether S. pombe nuclear genomes also share a similar evolutionary history. Because a previously published nuclear genome analysis suggested that interbreeding between populations has occurred during the evolution of S. pombe (Jeffares et al. 2015), and such interbreeding (admixture) is expected to cause nuclear genome recombination that can interfere with phylogenetic inference (Posada and Crandall 2002), we reasoned that a nuclear genome region where recombination is repressed may be better suited for deducing the phylogenetic history of the S. pombe nuclear genome. Based on this rationale, we chose to analyze the K-region in the nuclear genome (Grewal and Klar 1997), which is situated between two donor mating-type loci mat2 and mat3, and is a known “cold spot” for both meiotic recombination and mitotic recombination (Egel 1984; Thon and Klar 1993). Using the genome sequencing data of the 199 S. pombe strains described above, we performed read mapping analysis and found that among these strains, 12 lack the K-region (supplementary table S1, Supplementary Material online). For 150 of the 187 K-region-containing strains, we obtained by de novo assembly the complete sequences of the two unique sections of the K-region, and found that these K-region sequences belong to 29 nonidentical sequence types, which we term K-region types (fig. 5A and supplementary table S1, Supplementary Material online). K-region types may differ by as little as one nucleotide.

Fig. 5.

Fig. 5.

—Phylogeny of the 29 K-region types and their relationship to MT types. (A) The part of chromosome II where the mating-type loci are located is depicted in a diagram at the top. The lengths of the L-region, the K-region, and the three mat genes are not drawn completely to scale. cenH is a centromere-repeat-like sequence that cannot be assembled from Illumina sequencing data. A neighbor-joining tree of the 29 K-region types (bottom left) was constructed. The tree was rooted by midpoint rooting. Bootstrap values higher than 70% are displayed on the tree. Scale bar, 0.002 substitutions per site. We also constructed a maximum likelihood tree which shows the same topology (supplementary fig. S7, Supplementary Material online). A heatmap (bottom middle) is used to visualize the 173 bi-allelic SNVs and MNVs in the K-region. K1 alleles are colored in yellow and non-K1 alleles are colored in green. MT types corresponding to each K-region type are displayed to the right of the heatmap. (B) If an MT type and its corresponding K-region type both belong to the low-diversity clade (group) or both belong to the high-diversity clade (group), the MT type is deemed “MT-K inter-clade non-mixed.” Otherwise, the MT type is deemed “MT-K inter-clade mixed.” For each MT type subclade, the numbers of MT types falling into these two categories are presented in a stacked bar chart. P values for between-subclade differences were calculated using Fisher’s exact test. Only P values <0.1 are shown.

Phylogenetic analysis of the 29 K-region types shows that, like the mitogenomes, K-region sequences fall into two highly distinct clades (fig. 5A, bottom left, and supplementary fig. S7, Supplementary Material online). Moreover, similar to the situation in mitogenomes, the smaller clade, consisting of 6 K-region types (K1–K6), has a low internal diversity, whereas the larger clade, consisting of 23 K-region types (K7–K29), has a substantially higher internal diversity. For clarity, we refer to the two K-region clades as “groups.” The K-region type found in the reference nuclear genome, which we denote K1, belongs to the low-diversity group. A heatmap analysis visualizing all bi-allelic SNVs and MNVs in the K-region sequences confirmed the deep divergence of the two groups and showed a lack of inter-group recombination (fig. 5A, bottom middle, and supplementary table S6, Supplementary Material online). These results indicate that present-day S. pombe nuclear genomes also descend from two long-separated lineages, which probably correspond to the two ancient lineages of the mitogenomes. Based on this idea, the low-diversity K-region group should correspond to the REF clade of the MT types, and the high-diversity K-region group should correspond to the NONREF clade of the MT types.

The 150 strains with assembled K-region sequences are associated with 60 of the 69 MT types (fig. 5A, bottom right, and supplementary table S1, Supplementary Material online). Any given MT type usually corresponds to only one of the 29 K-region type, except for MT53, whose associated strains have two types of K-regions (K7 and K8, differing by one single-nucleotide indel). The correlation between mitogenome clade affiliation and K-region group affiliation is only barely statistically significant (P = 0.042, Fisher’s exact test), with 58% (7/12) of REF clade MT types corresponding to K-region types in the low-diversity group, and 74% (35/47) of the NONREF clade MT types corresponding to K-region types in the high-diversity group. It is likely that interbreeding between populations has resulted in “MT-K inter-clade mixed” strains, in which the mitogenome and the K-region from different ancient lineages are brought together by hybridization.

We separately examined the extent of MT-K inter-clade mixing for each subclade of the MT types (fig. 5B). For the REF-A, REF-B, and NONREF-S subclades, 30% (3/10), 100% (2/2), and 35% (12/34) of the MT types are respectively MT-K inter-clade mixed. However, for the NONREF-D subclade, none of the 13 MT types are MT-K inter-clade mixed. Thus, strains harboring the NONREF-D mitogenomes appear to have historically undergone less cross-lineage interbreeding.

Tusso et al. (2019) have independently identified two ancestral S. pombe lineages by performing in-depth analysis of the nuclear genomes of the 161 JB strains. They named the two lineages Sp and Sk, respectively (Tusso et al. 2019). Overall, the Sp lineage corresponds to the REF mitogenome clade and the low-diversity K-region group defined in this study: all pure Sp lineage JB strains [Sp ancestry proportion >0.9, a criterion for pure-lineage strain used in Tusso et al. {2019}] fall into the REF mitogenome clade and the low-diversity K-region group (supplementary figs. S8 and S9, Supplementary Material online). Conversely, the Sk lineage corresponds to the NONREF mitogenome clade and the high-diversity K-region group. Consistent with the MT-K inter-clade mixing patterns described above, a large majority of JB strains with NONREF-D mitogenomes are pure Sk lineage strains (Sk ancestry proportion >0.9), whereas most JB strains with NONREF-S mitogenomes have mosaic nuclear genomes (Sp ancestry proportions falling between 0.2 and 0.9). The only pure-lineage JB strains with NONREF-S mitogenomes are pure Sk lineage JB strains belonging to “clonal cluster 2” defined in Jeffares et al. (2015) (i.e., strains with MT52 and MT53 mitogenomes) (supplementary fig. S8, Supplementary Material online). This clonal cluster includes the “non-clonal strain” JB864 (Jeffares et al. 2015), which corresponds to NCYC132, the strain used in the 1950s and 1960s by Murdoch Mitchison in his cell cycle research (Mitchison 1970), and the type strain of S. pombe, CBS356, which was originally isolated from arak mash and was received by CBS in 1922 from the Král collection, the world’s first culture collection established in 1890 (Vaughan-Martini and Martini 2011). Given the high relatedness of NONREF-S mitogenomes, it is plausible that they may have all originated in the relatively recent past from one pure-lineage ancestral strain whose only unadmixed descendants among currently available S. pombe isolates are clonal cluster 2 strains.

Estimation of the Divergence Time of the Two Ancient Lineages of S. pombe

To obtain an estimate of the divergence time of the two ancestral lineages, we employed the Bayesian evolutionary analysis software BEAST to perform divergence dating on the 29 K-region types (fig. 6). Mutation accumulation studies have estimated a nuclear mutation rate of 2.00 × 10−10 substitutions per site per generation in S. pombe (Behringer and Hall 2015; Farlow et al. 2015). Applying a calibration (see “Materials and Methods” section) we obtained a mutation rate estimation of 2.54 × 10−10 substitutions per site per generation for the K-region. Using this value as the mutation rate prior to perform BEAST analysis, we obtained a divergence time of 31.3 million generations for the two ancient lineages.

Fig. 6.

Fig. 6.

—Divergence time of the two ancient lineages of S. pombe based on the K-region sequences was estimated using Bayesian evolutionary analysis implemented in BEAST 2.

Discussion

In this study, we de novo assembled and annotated full-length mitogenomes that encompass the mitogenome diversity of previously sequenced 161 JB strains (Jeffares et al. 2015) and 38 additional isolates (DY strains, this study). This comprehensive data set allowed us to thoroughly examine the intraspecific mitogenome diversity existing among currently available S. pombe isolates and obtain new insights into the evolutionary history of this species.

Our analyses of the diversity patterns of mitogenome sequences and K-region sequences revealed that S. pombe isolates descend from two long-separated ancient lineages. The phenomenon of MT-K inter-clade mixing suggests that these two lineages have undergone admixture in recent historical time. The same conclusions have been reached in an independent study by Tusso et al. who analyzed admixture proportions of the nuclear genome and the effect of admixture on phenotypic variation and reproductive isolation (Tusso et al. 2019). They revealed that a majority of the currently available S. pombe isolates have mosaic nuclear genomes that resulted from recent admixture between the two ancestral lineages.

Unlike in animals, where mitogenomes are usually inherited uniparentally, in fungi, biparental transmission of mitogenomes is common and thus allows the recombination between parental mitogenomes (Xu and Li 2015). For budding yeast species belonging to the family Saccharomycetaceae, naturally occurring mitogenome recombination appears to be common (Wu and Hao 2014; Wu et al. 2015; Leducq et al. 2017; Peris et al. 2017). In contrast, we show here that, despite a high level of inter-lineage admixture existing among the S. pombe isolates, inter-lineage recombination of S. pombe mitogenomes has rarely happened. A likely explanation is that, unlike the budding yeasts, S. pombe is a haplontic species, growing vegetatively as haploids, and only forming diploids transiently during sexual reproduction. The formation of a zygotic S. pombe diploid cell is immediately followed by meiosis and sporulation, and as a result, mitogenomes from two parental haploid cells may rarely have a chance to mix and recombine before being partitioned into four separate haploid progeny spores.

Even though S. pombe isolates have mostly been collected by chance rather than through dedicated search for this species, there have been a few cases of isolating multiple S. pombe strains from one relatively small geographic region. In particular, Carlos Augusto Rosa and his colleagues have isolated S. pombe from cachaça distilleries in the southeastern Brazilian state of Minas Gerais (Pataro et al. 2000; Gomes et al. 2002), and from the frozen fruit pulps acquired in markets in the eastern Brazilian state of Sergipe (Trindade et al. 2002). These Brazilian strains correspond to ten MT types, with six MT types (MT45, MT47, MT49, MT50, MT51, and MT54) associated with the cachaça strains and four MT types (MT19, MT30, MT32, and MT37) associated with the fruit pulp strains. These ten MT types all fall into the homogeneous NONREF-S subclade and account for 25% of the MT types in this subclade, suggesting that nonrandom sampling partly contributes to the large size of this subclade.

In the 1960s, Tommaso Castelli deposited into the DBVPG culture collection 13 S. pombe strains isolated from grape must and wine from the Mediterranean islands of Sicily and Malta (DBVPG online catalog, http://www.dbvpg.unipg.it/index.php/en/database; last accessed July 29, 2019). The six Sicily strains share the same MT type (MT9, subclade REF-A), whereas the seven Malta strains are associated with three MT types (MT4 in subclade REF-A, and MT24 and MT29 in subclade NONREF-S). The fact that MT types in both clades are found in strains isolated from wine-related substrates within a localized area (the size of Malta is only 316 km2) suggests ongoing opportunities for inter-clade exchange.

Given that the Rosa strains and the Castelli strains, the only notable S. pombe isolates with restricted geographic origins, together account for only 30% of the MT types in the NONREF-S subclade, the large size of this low-diversity subclade requires explanation(s) in addition to geographic sampling bias. Also in need of explanation are the extraordinarily wide distribution and the high extent of admixture of the strains associated with this subclade. We speculate that an ancestral pure-lineage S. pombe strain harboring a NONREF-S mitogenome and a nuclear genome similar to that of JB864 may have by chance become associated with humans earlier than S. pombe strains harboring other types of mitogenomes and, as a result, gained a world-wide distribution through co-migration with humans. In turn, the spreading of this strain may have also led to its encountering and hybridizing with REF clade strains. An alternative and nonexclusive explanation is that the NONREF-S mitogenomes may provide selective advantages in human-related substrates where S. pombe has most often been found, including cultivated fruits (raw and processed), cultivated sugar cane (raw and processed), and fermented beverages. We note that several JB strains with NONREF-D mitogenomes (JB913, JB1205, and JB1206) have mosaic nuclear genomes (supplementary fig. S8, Supplementary Material online), suggesting that ancestral pure-lineage strains harboring NONREF-D mitogenomes have also contributed to inter-lineage admixture.

Based on our estimation, the two ancient lineages of S. pombe diverged about 31.3 million generations ago. To our knowledge, the shortest generation time (doubling time) reported for S. pombe under optimal laboratory growth conditions is approximately 2 h (Johnson 1968). At such a growth rate, S. pombe can go through 12 generations per day, or 4,383 generations per year, and a divergence time of 31.3 million generations corresponds to 7,141 years. However, it is highly unlikely that S. pombe can proliferate continuously at this high rate in the wild. Taking inevitable encounters with unfavorable growth conditions into consideration, previous studies have estimated that the average generation time of Sa. cerevisiae in the wild can be more than 10 times longer than the shortest generation time observed in the laboratory (Fay and Benavides 2005; Ruderfer et al. 2006). Applying the same rationale, if we assume that S. pombe may go through as few as 400 generations per year in the wild, 31.3 million generations correspond to as many as 78,250 years. We emphasize that this is not a precise estimation of the divergence date because of the uncertainty on how to convert time from generations to years. Nevertheless, the divergence time of the two ancient lineages of S. pombe may fall within the most recent glacial period (“ice age”), which occurred from approximately 110,000 to 12,000 years ago (van Ommen 2015). The expansion of ice sheets and permafrost during a glacial period can lead to vicariance, the splitting of a population through the formation of geographic barriers (Hewitt 2000; Neiva et al. 2018). We speculate that glacial vicariance may have resulted in the allopatric separation of an ancestral population of S. pombe into isolated subpopulations. One of these subpopulations may have survived the glacial period in a glacial refugium in southern Europe, and become the low-diversity lineage with the REF clade mitogenomes.

It is of note that the mainly Asian distribution of the high-diversity pure-lineage NONREF strains [corresponding to the pure-lineage Sk strains described in Tusso et al. {2019}] is reminiscent of the situation of the other model yeast species Sa.cerevisiae, whose highest intraspecific diversity exists in China (Wang et al. 2012). Based on this geographic pattern of diversity of Sa.cerevisiae and other lines of evidence, the whole Saccharomyces species complex is now believed to have originated in Asia (Duan et al. 2018; Peter et al. 2018). It is possible that Asia is also a centre of origin of S. pombe.

The intraspecific S. pombe divergence patterns observed in this study are consistent with the following speculative evolutionary scenario: During the last glacial period, an ancient population of S. pombe was separated into refugia; one subpopulation suffered a bottleneck and became a low-diversity lineage mainly distributed in Southern Europe, whereas another subpopulation became a higher-diversity lineage mainly distributed in Asia; after the glacial period ended, perhaps aided by human migration, these two long-separated lineages came into secondary contact and began to hybridize; human migration has also shaped the worldwide distribution of S. pombe, and in particular, has spread strains with the NONREF-S mitogenomes to all over the world.

Supplementary Material

Supplementary data are available at Genome Biology and Evolution online.

Supplementary Material

evz165_Supplementary_Data

Acknowledgments

We thank Wen Hu for generating the Illumina sequencing library of DY15505. We thank Yang Liu and Wei Jiang for contributing to the mitogenome analysis at the early stage of this work. This work was supported by the Ministry of Science and Technology of China and by the Beijing Municipal Government.

Author Contributions

Yu-Tian Tao performed the genome sequencing of the DY strains, analyzed the assembled mitogenomes and K-region sequences, and prepared the manuscript; Fang Suo performed the de novo assembly of mitogenomes and K-region sequences; Yan-Kai Wang and Song Huang provided the Tn5 transposase; after the initial submission of this article, Sergio Tusso and Jochen Wolf performed mitogenome assembly validation using third-generation sequencing data, contributed the admixture proportion data, and edited the revised manuscript; Li-Lin Du devised and coordinated the project and together with Yu-Tian Tao wrote the manuscript.

Data deposition: The 69 types of mitogenomes (MT1–MT69) together with their annotations have been deposited at GenBank under accession numbers MK618072–MK618140. The 29 types of K-region sequences (K1–K29) have been deposited at GenBank under accession numbers MK618141–MK618169. Illumina sequencing data of 38 DY strains have been deposited at NCBI SRA under accession numbers SRR8698890–SRR8698927.

Literature Cited

  1. Alexander DH, Novembre J, Lange K.. 2009. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19(9):1655–1664. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Avelar AT, Perfeito L, Gordo I, Ferreira MG.. 2013. Genome architecture is a selectable trait that can be maintained by antagonistic pleiotropy. Nat Commun. 4:2235. [DOI] [PubMed] [Google Scholar]
  3. Barnett JA, Lichtenthaler FW.. 2001. A history of research on yeasts 3: Emil Fischer, Eduard Buchner and their contemporaries, 1880–1900. Yeast 18:363–388. [DOI] [PubMed] [Google Scholar]
  4. Beach DH, Klar AJ.. 1984. Rearrangements of the transposable mating-type cassettes of fission yeast. EMBO J. 3(3):603–610. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Behringer MG, Hall DW.. 2015. Genome-wide estimates of mutation rates and spectrum in Schizosaccharomyces pombe indicate CpG sites are highly mutagenic despite the absence of DNA methylation. G3 (Bethesda) 6:149–160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bolger AM, Lohse M, Usadel B.. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30(15):2114–2120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bouckaert R, et al. 2014. BEAST 2: a software platform for Bayesian evolutionary analysis. PLoS Comput Biol. 10(4):e1003537. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Brown WRA, et al. 2011. A geographically diverse collection of Schizosaccharomyces pombe isolates shows limited phenotypic variation but extensive karyotypic diversity. G3 (Bethesda) 1:615–626. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bullerwell CE, Leigh J, Forget L, Lang BF.. 2003. A comparison of three fission yeast mitochondrial genomes. Nucleic Acids Res. 31(2):759–768. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Carver T, Harris SR, Berriman M, Parkhill J, McQuillan JA.. 2012. Artemis: an integrated platform for visualization and analysis of high-throughput sequence-based experimental data. Bioinformatics 28(4):464–469. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Chevalier B, et al. 2004. Metal-dependent DNA cleavage mechanism of the I-CreI LAGLIDADG homing endonuclease. Biochemistry 43(44):14015–14026. [DOI] [PubMed] [Google Scholar]
  12. Clément-Ziza M, et al. 2014. Natural genetic variation impacts expression levels of coding, non-coding, and antisense transcripts in fission yeast. Mol Syst Biol. 10:764. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Coil D, Jospin G, Darling AE.. 2015. A5-miseq: an updated pipeline to assemble microbial genomes from Illumina MiSeq data. Bioinformatics 31(4):587–589. [DOI] [PubMed] [Google Scholar]
  14. Duan S-F, et al. 2018. The origin and adaptive evolution of domesticated populations of yeast from Far East Asia. Nat Commun. 9(1):2690. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Egel R. 1984. Two tightly linked silent cassettes in the mating-type region of Schizosaccharomyces pombe. Curr Genet. 8(3):199–203. [DOI] [PubMed] [Google Scholar]
  16. Eijkman C. 1894. Mikrobiologisches über die Arrakfabrikation in Batavia. Centralblatt Bakteriologie Parasitenkunde 16:97–103. [Google Scholar]
  17. Farlow A, et al. 2015. The spontaneous mutation rate in the fission yeast Schizosaccharomyces pombe. Genetics 201(2):737–744. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Fawcett JA, et al. 2014. Population genomics of the fission yeast Schizosaccharomyces pombe. PLoS One 9(8):e104241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Fay JC, Benavides JA.. 2005. Evidence for domesticated and wild populations of Saccharomyces cerevisiae. PLoS Genet. 1(1):e5–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Freel KC, Friedrich A, Schacherer J.. 2015. Mitochondrial genome evolution in yeasts: an all-encompassing view. FEMS Yeast Res. 15(4):fov023. [DOI] [PubMed] [Google Scholar]
  21. Gomes FCO, et al. 2002. Physiological diversity and trehalose accumulation in Schizosaccharomyces pombe strains isolated from spontaneous fermentations during the production of the artisanal Brazilian cachaça. Can J Microbiol. 48(5):399–406. [DOI] [PubMed] [Google Scholar]
  22. Grewal SI, Klar AJ.. 1997. A recombinationally repressed region between mat2 and mat3 loci shares homology to centromeric repeats and regulates directionality of mating-type switching in fission yeast. Genetics 146(4):1221–1238. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Hayles J, Nurse P.. 2018. Introduction to fission yeast as a model system. Cold Spring Harb Protoc. 2018(5):pdb.top079749. [DOI] [PubMed] [Google Scholar]
  24. Hewitt G. 2000. The genetic legacy of the Quaternary ice ages. Nature 405(6789):907–913. [DOI] [PubMed] [Google Scholar]
  25. Hoffman CS, Wood V, Fantes PA.. 2015. An ancient yeast for young geneticists: a primer on the Schizosaccharomyces pombe model system. Genetics 201(2):403–423. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Hu W, Suo F, Du L-L.. 2015. Bulk segregant analysis reveals the genetic basis of a natural trait variation in fission yeast. Genome Biol Evol. 7(12):3496–3510. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Iben JR, et al. 2011. Comparative whole genome sequencing reveals phenotypic tRNA gene duplication in spontaneous Schizosaccharomyces pombe La mutants. Nucleic Acids Res. 39(11):4728–4742. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Ingman M, Kaessmann H, Pääbo S, Gyllensten U.. 2000. Mitochondrial genome variation and the origin of modern humans. Nature 408(6813):708–713. [DOI] [PubMed] [Google Scholar]
  29. Jeffares DC. 2018. The natural diversity and ecology of fission yeast. Yeast 35(3):253–260. [DOI] [PubMed] [Google Scholar]
  30. Jeffares DC, et al. 2015. The genomic and phenotypic diversity of Schizosaccharomyces pombe. Nat Genet. 47(3):235–241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Jeffares DC, et al. 2017. Transient structural variations have strong effects on quantitative traits and reproductive isolation in fission yeast. Nat Commun. 8(1):14061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Johnson BF. 1968. Morphometric analysis of yeast cells. II. Cell size of Schizosaccharomyces pombe during the growth cycle. Exp Cell Res. 49(1):59–68. [DOI] [PubMed] [Google Scholar]
  33. Jung PP, Friedrich A, Reisser C, Hou J, Schacherer J.. 2012. Mitochondrial genome evolution in a single protoploid yeast species. G3 (Bethesda) 2:1103–1111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Katoh K, Standley DM.. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 30(4):772–780. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Kopelman NM, Mayzel J, Jakobsson M, Rosenberg NA, Mayrose I.. 2015. CLUMPAK: a program for identifying clustering modes and packaging population structure inferences across K. Mol Ecol Resour. 15(5):1179–1191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Koren S, et al. 2017. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27(5):722–736. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Kumar S, Stecher G, Tamura K.. 2016. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 33(7):1870–1874. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Lang BF. 1984. The mitochondrial genome of the fission yeast Schizosaccharomyces pombe: highly homologous introns are inserted at the same position of the otherwise less conserved cox1 genes in Schizosaccharomyces pombe and Aspergillus nidulans. EMBO J. 3(9):2129–2136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Lang BF, Ahne F, Bonen L.. 1985. The mitochondrial genome of the fission yeast Schizosaccharomyces pombe. The cytochrome b gene has an intron closely related to the first two introns in the Saccharomyces cerevisiae cox1 gene. J Mol Biol. 184(3):353–366. [DOI] [PubMed] [Google Scholar]
  40. Lang BF, et al. 1997. An ancestral mitochondrial DNA resembling a eubacterial genome in miniature. Nature 387(6632):493–497. [DOI] [PubMed] [Google Scholar]
  41. Lang BF, Cedergren R, Gray MW.. 1987. The mitochondrial genome of the fission yeast, Schizosaccharomyces pombe. Sequence of the large-subunit ribosomal RNA gene, comparison of potential secondary structure in fungal mitochondrial large-subunit rRNAs and evolutionary considerations. Eur J Biochem. 169(3):527–537. [DOI] [PubMed] [Google Scholar]
  42. Lang BF, Laforest M-J, Burger G.. 2007. Mitochondrial introns: a critical view. Trends Genet. 23(3):119–125. [DOI] [PubMed] [Google Scholar]
  43. Leducq J-B, et al. 2017. Mitochondrial recombination and introgression during speciation by hybridization. Mol Biol Evol. 34(8):1947–1959. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Leupold U. 1950. Die Vererbung von Homothallie und Heterothallie bei Schizosaccharomyces pombe. Compt Rend Lab Carlsberg. 24:381–480. [Google Scholar]
  45. Lindner P. 1893. Schizosaccharomyces pombe n. sp., ein neuer Gährungserreger. Wochenschr Brauerei 10:1298–1300. [Google Scholar]
  46. Liu Y, et al. 2008. Phylogenomic analyses support the monophyly of Taphrinomycotina, including Schizosaccharomyces fission yeasts. Mol Biol Evol. 26(1):27–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Lock A, et al. 2019. PomBase 2018: user-driven reimplementation of the fission yeast database provides rapid and intuitive access to diverse, interconnected information. Nucleic Acids Res. 47(D1):D821–827. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Lodder J, Kreger-Van Rij NJW.. 1952. The yeasts: a taxonomic study, 1st edn. Amsterdam: North-Holland Publishing Company. p. 81–94.
  49. Martin DP, Murrell B, Golden M, Khoosal A, Muhire B.. 2015. RDP4: detection and analysis of recombination patterns in virus genomes. Virus Evol. 1(1):vev003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Mitchison JM. 1970. Physiological and cytological methods for Schizosaccharomyces pombe. In: Prescott DM, editor. Methods in Cell Physiology. Vol. 4. New York and London: Academic Press. [Google Scholar]
  51. Neiva J, et al. 2018. Glacial vicariance drives phylogeographic diversification in the amphi-boreal kelp Saccharina latissima. Sci Rep. 8(1):1112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Osterwalder A. 1924. Schizosaccharomyces liquefaciens n. sp., eine gegen freie schweflige Säure widerstandsfähige Gärhefe. Mitt Gebiete Lebensmittelunters Hyg. 15:5–28. [Google Scholar]
  53. Otto TD, Dillon GP, Degrave WS, Berriman M.. 2011. RATT: rapid annotation transfer tool. Nucleic Acids Res. 39(9):e57. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Pataro C, et al. 2000. Yeast communities and genetic polymorphism of Saccharomyces cerevisiae strains associated with artisanal fermentation in Brazil. J Appl Microbiol. 89(1):24–31. [DOI] [PubMed] [Google Scholar]
  55. Pellenz S, Harington A, Dujon B, Wolf K, Schäfer B.. 2002. Characterization of the I-Spom I endonuclease from fission yeast: insights into the evolution of a group I intron-encoded homing endonuclease. J Mol Evol. 55(3):302–313. [DOI] [PubMed] [Google Scholar]
  56. Peris D, et al. 2017. Mitochondrial introgression suggests extensive ancestral hybridization events among Saccharomyces species. Mol Phylogenet Evol. 108:49–60. [DOI] [PubMed] [Google Scholar]
  57. Peter J, et al. 2018. Genome evolution across 1, 011 Saccharomyces cerevisiae isolates. Nature 556(7701):339–344. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Picelli S, et al. 2014. Tn5 transposase and tagmentation procedures for massively scaled sequencing projects. Genome Res. 24(12):2033–2040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Polak P, et al. 2015. Cell-of-origin chromatin organization shapes the mutational landscape of cancer. Nature 518(7539):360–364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Posada D, Crandall KA.. 2002. The effect of recombination on the accuracy of phylogeny estimation. J Mol Evol. 54(3):396–402. [DOI] [PubMed] [Google Scholar]
  61. Rhind N, et al. 2011. Comparative functional genomics of the fission yeasts. Science 332(6032):930–936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Ruderfer DM, Pratt SC, Seidel HS, Kruglyak L.. 2006. Population genomic analysis of outcrossing and recombination in yeast. Nat Genet. 38(9):1077–1081. [DOI] [PubMed] [Google Scholar]
  63. Schäfer B. 2003. Genetic conservation versus variability in mitochondria: the architecture of the mitochondrial genome in the petite-negative yeast Schizosaccharomyces pombe. Curr Genet. 43(5):311–326. [DOI] [PubMed] [Google Scholar]
  64. Schäfer B, Kaulich K, Wolf K.. 1998. Mosaic structure of the cox2 gene in the petite negative yeast Schizosaccharomyces pombe: a group II intron is inserted at the same location as the otherwise unrelated group II introns in the mitochondria of higher plants. Gene 214(1–2):101–112. [DOI] [PubMed] [Google Scholar]
  65. Schäfer B, et al. 1991. The mitochondrial genome of fission yeast: inability of all introns to splice autocatalytically, and construction and characterization of an intronless genome. Mol Gen Genet. 225(1):158–167. [DOI] [PubMed] [Google Scholar]
  66. Schäfer B, et al. 1994. A mitochondrial group-I intron in fission yeast encodes a maturase and is mobile in crosses. Curr Genet. 25(4):336–341. [DOI] [PubMed] [Google Scholar]
  67. Schäfer B, Wolf K.. 1999. A novel group-II intron in the cox1 gene of the fission yeast Schizosaccharomyces pombe is inserted in the same codon as the mobile group-II intron aI2 in the Saccharomyces cerevisiae cox1 homologue. Curr Genet. 35(6):602–608. [DOI] [PubMed] [Google Scholar]
  68. Shang J, Yang Y, Wu L, Zou M, Huang Y.. 2018. The S. pombe mitochondrial transcriptome. RNA 24(9):1241–1254. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Sun L, et al. 2016. Preferential protection of genetic fidelity within open chromatin by the mismatch repair machinery. J Biol Chem. 291(34):17692–17705. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Thon G, Klar AJ.. 1993. Directionality of fission yeast mating-type interconversion is controlled by the location of the donor loci. Genetics 134(4):1045–1054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Trindade RC, Resende MA, Silva CM, Rosa CA.. 2002. Yeasts associated with fresh and frozen pulps of Brazilian tropical fruits. Syst Appl Microbiol. 25(2):294–300. [DOI] [PubMed] [Google Scholar]
  72. Trinkl H, Lang BF, Wolf K.. 1989. Nucleotide sequence of the gene encoding the small ribosomal RNA in the mitochondrial genome of the fission yeast Schizosaccharomyces pombe. Nucleic Acids Res. 17(16):6730.. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Trinkl H, Wolf K.. 1986. The mosaic cox1 gene in the mitochondrial genome of Schizosaccharomyces pombe: minimal structural requirements and evolution of group I introns. Gene 45(3):289–297. [DOI] [PubMed] [Google Scholar]
  74. Tusso S, et al. 2019. Ancestral admixture is the main determinant of global biodiversity in fission yeast. Mol Biol Evol. Epub ahead of print. doi: 10.1093/molbev/msz126 [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Valach M, Burger G, Gray MW, Lang BF.. 2014. Widespread occurrence of organelle genome-encoded 5S rRNAs including permuted molecules. Nucleic Acids Res. 42(22):13764–13777. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. van Ommen T. 2015. Palaeoclimate: northern push for the bipolar see-saw. Nature 520(7549):630–631. [DOI] [PubMed] [Google Scholar]
  77. Vaughan-Martini A, Martini A.. 2011. Schizosaccharomyces Lindner (1893) In: Kurtzman CP, Fell JW & Boekhout T, editors. The Yeasts, A Taxonomic Study, 5th edn. Amsterdam: Elsevier. p. 779–784. [Google Scholar]
  78. Vorderman A. 1893. Analecta op bromatologisch gebied. I. Geneeskg Tijdschr Ned Indië 33:343–397. [Google Scholar]
  79. Walker BJ, et al. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9(11):e112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Wang Q-M, Liu W-Q, Liti G, Wang S-A, Bai F-Y.. 2012. Surprisingly diverged populations of Saccharomyces cerevisiae in natural environments remote from human activity. Mol Ecol. 21(22):5404–5417. [DOI] [PubMed] [Google Scholar]
  81. Warren RL, Holt RA.. 2011. Targeted assembly of short sequence reads. PLoS One 6(5):e19816. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Wolters JF, Chiu K, Fiumera HL.. 2015. Population structure of mitochondrial genomes in Saccharomyces cerevisiae. BMC Genomics 16(1):451. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Wood V, et al. 2002. The genome sequence of Schizosaccharomyces pombe. Nature 415(6874):871–880. [DOI] [PubMed] [Google Scholar]
  84. Wu B, Buljic A, Hao W.. 2015. Extensive horizontal transfer and homologous recombination generate highly chimeric mitochondrial genomes in yeast. Mol Biol Evol. 32(10):2559–2570. [DOI] [PubMed] [Google Scholar]
  85. Wu B, Hao W.. 2014. Horizontal transfer and gene conversion as an important driving force in shaping the landscape of mitochondrial introns. G3 (Bethesda) 4:605–612. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Xu J, Li H.. 2015. Current perspectives on mitochondrial inheritance in fungi. Cell Health Cytoskeleton 7:143–154. [Google Scholar]
  87. Zanders SE, et al. 2014. Genome rearrangements and pervasive meiotic drive cause hybrid infertility in fission yeast. Elife 3:e02630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Zimmer M, Lückemann G, Lang BF, Wolf K.. 1984. The mitochondrial genome of the fission yeast Schizosaccharomyces pombe. 3. Gene mapping in strain EF1 (CBS 356) and analysis of hybrids between the strains EF1 and ade7-50h. Mol Gen Genet. 196(3):473–481. [DOI] [PubMed] [Google Scholar]
  89. Zimmer M, Welser F, Oraler G, Wolf K.. 1987. Distribution of mitochondrial introns in the species Schizosaccharomyces pombe and the origin of the group II intron in the gene encoding apocytochrome b. Curr Genet. 12(5):329–336. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

evz165_Supplementary_Data

Articles from Genome Biology and Evolution are provided here courtesy of Oxford University Press

RESOURCES