Skip to main content
PeerJ logoLink to PeerJ
. 2020 Jan 3;8:e8273. doi: 10.7717/peerj.8273

The inflated mitochondrial genomes of siphonous green algae reflect processes driving expansion of noncoding DNA and proliferation of introns

Sonja I Repetti 1,, Christopher J Jackson 1, Louise M Judd 2, Ryan R Wick 2, Kathryn E Holt 2, Heroen Verbruggen 1
Editor: Jürg Bähler
PMCID: PMC6944098  PMID: 31915577

Abstract

Within the siphonous green algal order Bryopsidales, the size and gene arrangement of chloroplast genomes has been examined extensively, while mitochondrial genomes have been mostly overlooked. The recently published mitochondrial genome of Caulerpa lentillifera is large with expanded noncoding DNA, but it remains unclear if this is characteristic of the entire order. Our study aims to evaluate the evolutionary forces shaping organelle genome dynamics in the Bryopsidales based on the C. lentillifera and Ostreobium quekettii mitochondrial genomes. In this study, the mitochondrial genome of O. quekettii was characterised using a combination of long and short read sequencing, and bioinformatic tools for annotation and sequence analyses. We compared the mitochondrial and chloroplast genomes of O. quekettii and C. lentillifera to examine hypotheses related to genome evolution. The O. quekettii mitochondrial genome is the largest green algal mitochondrial genome sequenced (241,739 bp), considerably larger than its chloroplast genome. As with the mtDNA of C. lentillifera, most of this excess size is from the expansion of intergenic DNA and proliferation of introns. Inflated mitochondrial genomes in the Bryopsidales suggest effective population size, recombination and/or mutation rate, influenced by nuclear-encoded proteins, differ between the genomes of mitochondria and chloroplasts, reducing the strength of selection to influence evolution of their mitochondrial genomes.

Keywords: Ostreobium, Bryopsidales, Siphonous, Chlorophyta, Genome evolution, Mitochondria, Natural selection, Genetic drift, Organelles

Introduction

Although they share many unifying features, the green algae (Chlorophyta) are a diverse group with genomes that vary considerably in structure and gene content (Yurina & Odintsova, 2016). Most work on Chlorophyta genomes to date has focused on organellar genomes, with in excess of 150 chloroplast and 70 mitochondrial genomes published on GenBank. These organelle genome data have been used for phylogenetic analyses to resolve lineage relationships within the Chlorophyta (Fučíková et al., 2014; Cremen et al., 2018) and to investigate organellar genome structural adaptation and unique features. As with many plastid-bearing taxa, more effort has been made to sequence and characterise chloroplast genomes rather than mitochondrial, as plastid genomes tend to be favoured for their use in phylogenetic studies due to high coding content, relatively conserved genome and simple inheritance patterns (Smith & Keeling, 2015; Fang et al., 2017). There are some notable exceptions in the Chlorophyta such as the fragmented hairpin plasmids seen in Cladophorales plastids (Del Cortona et al., 2017) and the large and highly repetitive Acetabularia acetabulum (Dasycladales) plastid genome (De Vries et al., 2013). Despite this, mitochondrial genomes tend to show greater abnormalities in structure than plastid genomes (Burger, Gray & Lang, 2003), and some of the most unusual genomic features have been observed in eukaryotic mitochondrial genomes (Smith & Keeling, 2015).

Comparative studies in green algal chloroplast and mitochondrial genomes have found that many instances of genome size variation are not due to gene gain or loss, but rather reduction or expansion of noncoding DNA including introns (Burger, Gray & Lang, 2003; Smith et al., 2013; Marcelino et al., 2016). In the Chlamydomonadales, Smith et al. (2013) observed an increase in chloroplast DNA from approximately 60% noncoding DNA in smaller-sized unicellular lineages to greater than 80% in multicellular Volvox carteri, while the number of genes differs by only two between the smallest and largest genomes. A similar pattern was observed for their mitochondrial DNA (Smith et al., 2013). According to Smith & Lee (2010), the large noncoding DNA content in V. carteri genomes could support the mutational-hazard hypothesis (MHH). The MHH proposes that excess DNA is more likely to accumulate in genomes with a low mutation rate and small effective population size (Ne) (Lynch, 2006; Smith, 2016) due to noncoding DNA imposing a fitness burden by increasing the potential for harmful mutations. Such excess DNA can be eradicated more effectively when there is a larger effective population size for natural selection to act upon through purifying selection (Lynch & Walsh, 2007). The MHH might explain the streamlining of organelle genomes within various lineages including prasinophytes, red algae, and some fungi, which have high estimated mutation rates (Smith, 2016). A large reduction in Ne is expected for multicellular organisms compared with unicellular, therefore purifying selection should be less efficient meaning there is less restriction on the accumulation of noncoding DNA. This may explain the difference in noncoding content between unicellular members of the Chlamydomonadales, such as Chlamydomonas, and colonial V. carteri (Smith et al., 2010). Various studies have supported and challenged the MHH (Smith, 2016), but on the whole emphasise the importance of mutation and genetic drift in shaping organelle genomes (Lynch, 2006; Smith, 2016).

It has also been suggested that selection can influence the mutation rate of genomes. The drift-barrier hypothesis (Lynch et al., 2016) proposes that selection acts to reduce the mutation rate with an overall limit set by genetic drift. In genomes with very high mutation rates, ‘antimutators’ such as DNA repair proteins are advantageous and decrease the mutation rate until the strength of selection is matched by that of genetic drift and mutation bias (Lynch et al., 2016). In contrast, in genomes with low mutation rates, antimutators will not be advantageous enough to be selected for, whilst having mild mutators will not be sufficiently disadvantageous to be selected against. Therefore, mutation rate will increase until selection is strong enough to prevent the genome evolving a higher mutation rate (Lynch et al., 2016). Krasovec et al. (2017) combined a review of literature with mutation rate estimates for four prasinophytes and found support for this theory, with mutation rate tending to decrease as Ne, therefore strength of selection, increases.

The effective population size of genomes influences the efficacy of selection relative to genetic drift (Ness et al., 2016). Within genomes, loci linked to sites under selection will exhibit locally reduced Ne due to hitchhiking, where mutations that are not necessarily optimal are fixed because they are located close to another gene undergoing selection (Smith & Haigh, 1974), background selection, when selection acts against linked deleterious alleles that are retained in a population by mutation (Charlesworth, 1994), and Hill–Robertson (HR) interference (Platt, Weber & Liberles, 2018 and references therein). Under the HR effect (Hill & Robertson, 1966), weak selection acting on linked sites reduces the overall effectiveness of selection due to linkage disequilibrium (Comeron, Williford & Kliman, 2008). Recombination breaks up such linkage and prevents local reduction in Ne due to selection acting upon linked sites (Cutter & Payseur, 2013; Platt, Weber & Liberles, 2018). It has been observed that genes located in regions of low recombination have a greater number of introns, and these introns have greater lengths (Comeron & Kreitman, 2000). It has been proposed that introns may be advantageous in reducing intragenic HR effect and are hence retained, as they might increase rates of recombination that lower intragenic HR effects (Comeron, Williford & Kliman, 2008). Comeron & Kreitman (2000) suggest that longer introns are advantageous in regions of low recombination, as they reduce the HR effect relative to a shorter intron by increasing the recombination rate between mutations in different exons, and that they will be maintained due to hitchhiking effects between favourable mutations and the longest intron variant.

The class Ulvophyceae within the Chlorophyta show high morphological diversity, including a range of cell configurations (Cocquyt et al., 2010; Fang et al., 2017), and are thus a useful model for examining the evolution of genome features under differing evolutionary constraints. The order Bryopsidales are a lineage of siphonous seaweeds with thali comprised of a single giant tubular cell containing cytoplasm with a large number of nuclei and other organelles free to move around the entire plant (Vroom & Smith, 2001, 2003; Verbruggen et al., 2009; Mine, Sekida & Okuda, 2015). Within the Bryopsidales, the size and gene arrangement of chloroplast genomes has been found to vary considerably while gene content was much less variable (Cremen et al., 2018). While appreciable short-read sequencing data have been generated for the Bryopsidales, mitochondrial genomes have not been found to assemble well from the sequencing libraries used to assemble chloroplast genomes, and the first bryopsidalean mitochondrial genome was published only very recently for the sea grape Caulerpa lentillifera (Zheng et al., 2018). This 209,034 bp circular-mapping genome is the largest Chlorophyta mitochondrial genome published to date, an order of magnitude larger than the average for Chlorophyta mitochondrial genomes and rich in intergenic sequences and introns, the causes of which have not been examined.

Unlike the genome expansion observed in the Chlamydomonadales, which appears to occur in both organellar compartments (Smith et al., 2013; Zhang et al., 2019), C. lentillifera’s mitochondrial genome is much larger than its chloroplast genome. This is unusual for green algae where mitochondrial genomes are generally smaller than chloroplast genomes (Leliaert et al., 2012). Furthermore, mitochondrial genome expansion is less frequent in green algae compared with plants (Mower, Sloan & Alverson, 2012). While Zheng et al. (2018) did not propose an evolutionary explanation for these findings, the inflated noncoding regions described are consistent with previous studies in plants showing that the variation in the sizes of their mitochondrial genomes is predominantly explained by changes in noncoding DNA content including repeats, introns, intergenic DNA and DNA of foreign origin (Yurina & Odintsova, 2016). Therefore, the mitochondrial genome of C. lentillifera provides an interesting case in the Chlorophyta to examine the evolution of organellar genomes and, in particular, the processes driving genome expansion. However, it remains undetermined whether a large mitochondrial genome is a feature only of Caulerpa, or if it is in fact characteristic of the order Bryopsidales.

In this paper, we focus on another member of the Bryopsidales that is currently of considerable ecological interest: the genus Ostreobium, an endolithic limestone-boring alga (Verbruggen & Tribollet, 2011). Ostreobium is found in a diverse range of calcium carbonate environments around the world, is abundant in the skeleton of many coral species, and it is one of the most common genera of endolithic autotrophs in coral reefs (Tribollet, 2008). Endolithic algae like Ostreobium have a number of observed and predicted roles in the coral skeleton (Ricci et al., 2019). While their boring can destabilise coral skeletons and impact on reef structure (Schlichter, Kampmann & Conrady, 1997), they may have a mutualistic relationship with corals, providing them with metabolites that could enable them to survive bleaching events (Schlichter, Kampmann & Conrady, 1997; Tribollet, 2008). Its endolithic lifestyle means that Ostreobium inhabits low light habitats with limited available photosynthetically active radiation (Wilhelm & Jakob, 2006; Magnusson, Fine & Kühl, 2007) and it has been shown to have far red-shifted absorption spectra, allowing it to use the long wavelengths of light available in the coral skeleton (Wilhelm & Jakob, 2006; Magnusson, Fine & Kühl, 2007; Tribollet, 2008).

Ostreobium species have consistently small chloroplast genomes relative to the median for Bryopsidales of 105 kb (Cremen et al., 2018), with the chloroplast genome of Ostreobium strain HV05042 the most compact found so far (80,584 bp) in the Ulvophyceae (Marcelino et al., 2016; Verbruggen et al., 2017). Marcelino et al. (2016) identified only three introns in the Ostreobium quekettii chloroplast genome, and there was an overall reduction in intergenic regions that resulted in its reduced size. They hypothesised that this might reflect evolution in response to resource constraints in its low light environment (Marcelino et al., 2016), but did not examine the mitochondrial genome for evidence of similar selection pressures.

The goal of this study is to evaluate the characteristics of mitochondrial genomes in the Bryopsidales, as well as to determine the evolutionary forces that have shaped organelle genome dynamics in the Bryopsidales. Our approach consists of the sequencing, assembly and annotation of the O. quekettii mitochondrial genome, as well as analysis of noncoding elements: repeats, introns and intron-encoded open reading frames (ORFs). We also compare the O. quekettii mitochondrial genome with that of C. lentillifera, interpreting results in the context of existing evolutionary genomics theory.

Materials and Methods

Culturing and DNA extraction

Ostreobium quekettii (SAG culture collection strain 6.99) was cultured in F/2 media on a 14H–8H light/dark cycle at ~19 °C. Total DNA was extracted using a modified cetyltrimethylammonium bromide method described in Cremen et al. (2016).

DNA sequencing

Total DNA was sequenced using Illumina sequencing technology (HiSeq 2000, 150 bp paired-end reads, ~40 GB data), at Novogene, Beijing. Reads were trimmed with Cutadapt v1.12 (Martin, 2011) using the parameters −e 0.1 −q 10 −O 1 −a AGATCGGAAGAGC.

Total DNA was also extracted as above and sequenced using nanopore sequencing technology (MinION; Oxford Nanopore Technologies, Oxford, UK), producing 1,889,814 reads and ~9.5 GB data. Reads were filtered using Filtlong (https://github.com/rrwick/Filtlong), removing reads less than 1,000 bp in length, with average quality of less than 60 (as defined by the Phred scores), or minimum quality over a sliding window of less than 40.

Genome assembly

The Illumina data were assembled using Spades v3.12.0 (Bankevich et al., 2012) with the—careful option. The Nanopore data were assembled using Canu 1.7.1 (Koren et al., 2017) with the parameters genomeSize = 300 m corOutCoverage = 10,000 corMhapSensitivity = high corMinCoverage = 0.

A hybrid (combined long-read and short-read) genome assembly was performed with MaSuRCA v3.2.7 (Zimin et al., 2013), using the Illumina reads as short-read input. For long-read input, contigs produced by Spades were treated as ‘pseudo–Nanopore’ reads and combined with Nanopore data, and this combined dataset was used.

Transcriptome

Total RNA was extracted using Plant RNA reagent (Thermofisher, Waltham, MA, USA). A strand-specific 100 bp paired-end library was constructed and sequenced using Illumina HiSeq 2500 (ENA study accession number PRJEB35267). Quality filtering of reads was performed using Trimmomatic 0.39 (Bolger, Lohse & Usadel, 2014) with the following settings: LEADING:3 TRAILING:3 SLIDINGWINDOW:4:20. A transcriptome was constructed using Trinity version 2.8.3 (Grabherr et al., 2011).

Identification and manual curation of mitochondrial genome

The C. lentillifera mitochondrial genome (GenBank accession KX761577.1) (Zheng et al., 2018) was used as the query in a BLASTn search against the long-read Canu assembly within Geneious version 11.1.2 (Kearse et al., 2012) with default settings. Only a single contig was identified as a likely candidate for the O. quekettii mitochondrial genome based on the results of BLASTx searches of sections of this candidate contig against the NCBI nr and nt databases.

This contig was used as the query in a BLASTn search against the O. quekettii Spades short-read assembly, within Geneious with default settings. Top hits were aligned with the long-read contig within Geneious. Scaffolds from the short-read assembly were used as a reference to manually correct the long-read contig within Geneious. In order to verify that the genome was circular-mapping, we searched for a short-read scaffold spanning across both ends of the long-read contig when aligned.

This manual result was compared with the mitochondrial contig identified, through blast searches against the NCBI nr and nt databases, in the MaSuRCA hybrid assembly by alignment in Geneious.

Genome annotation

The genome was initially annotated using MFannot (Beck & Lang, 2010), and DOGMA (Wyman, Jansen & Boore, 2004) with very relaxed settings (protein identity cut off 25%, RNA identity cut off 30%). Annotations of predicted protein coding genes were confirmed through extraction of ORFs and BLAST searches of these against the NCBI nr and nt databases, as well as alignment with transcripts that were recovered as hits from BLASTn searches against the O. quekettii transcriptome within Geneious, using default settings.

tRNAs were identified using tRNAscan-SE (Lowe & Chan, 2016), tRNAfinder (Kinouchi & Kurokawa, 2006), ARAGORN (Laslett & Canback, 2004) and tRNADB-CE’s BLASTN/Pattern Search (Abe et al., 2010). rRNAs were identified with RNAmmer (Lagesen et al., 2007) and RNAweasel (Lang, Laforest & Burger, 2007).

Repeats were identified using the tandem repeats database (Gelfand, Rodriguez & Benson, 2007), the RepeatFinder package in Geneious with a minimum repeat length of 50 bp (Pombert et al., 2004) and REPuter (Kurtz et al., 2001) with minimal repeat size setting of 12 bp (Smith & Lee, 2009). Forward, reverse, complement, and reverse complement repeats were all considered under REPuter.

A map of the genome was created with Circos (Krzywinski et al., 2009) and manually annotated in Inkscape 0.92 (www.inkscape.org). The sequence for the O. quekettii mitochondrial genome is available on GenBank (Accession number MN514984).

Open reading frames

Open reading frames were predicted using ORF finder in Geneious with a minimum length setting of 300 bp. These were used as queries in BLASTx searches against the NCBI nr and nt databases (e value = e−1), and the translated ORFs were used as queries in a batch sequence search against the Pfam database (Finn et al., 2017). Only ORFs that had BLAST results and identified Pfam domains were retained in the final genome annotation. In order to determine relationships between ORFs indicative of common origin, the annotated ORFs along with those from the chloroplast genome of O. quekettii (Marcelino et al., 2016) and the mitochondrial genome of C. lentillifera (Zheng et al., 2018) were clustered based on all-against-all BLAST+ similarities using CLANs (Frickey & Lupas, 2004). This was performed through the MPI Bioinformatics toolkit (Zimmermann et al., 2018), with the BLOSUM62 scoring matrix and extraction of BLAST HSPs up to e-values of 1e−4. The output from CLANs was annotated in Inkscape.

Introns

Intron boundaries were inferred by aligning predicted genes with their corresponding transcripts and intron-lacking homologues from other green algae. Intron class was predicted using RNAweasel, and Rfam sequence search (Griffiths-Jones et al., 2003).

In order to identify potential common origins of introns in the O. quekettii and C. lentillifera mitochondrial genomes as well as the O. quekettii plastid genome, we used a distance-based clustering technique. In contrast to the analysis of ORFs above, this analysis used DNA sequences and used introns that were not interrupted by ORFs, in the hope that this would represent a subset of the data that could be more easily aligned if they contained conserved regions. From the intronic DNA sequences, a distance matrix was constructed by comparing the introns from these genomes using Clustal Omega (Sievers et al., 2011) with the ‘—distmat-out’ and ‘—full’ flags. This distance matrix was used as the input to construct a neighbour joining (NJ) tree using Neighbor within the PHYLIP package (Felsenstein, 2005). We also constructed a distance matrix using a subset of these introns from only the O. quekettii chloroplast and mitochondrial genomes and this was used to construct a NJ tree. This NJ tree was further visualised and annotated in MEGA (Kumar et al., 2018). Clusters of introns identified in the NJ tree were aligned in Geneious using MAFFT to better evaluate similarity. Insertion sites of introns were also compared between the O. quekettii and C. lentillifera genomes by aligning gene sequences in Geneious using MAFFT.

Rates of evolution

To obtain an estimate of the relative rates of evolution in the O. quekettii mitochondrial and chloroplast genomes, we aligned all the protein coding and rRNA genes common between the O. quekettii and C. lentillifera mitochondrial and chloroplast genomes in Geneious using the default aligner. We generated estimates of base substitutions per site between sequences using the Jukes–Cantor model (Jukes & Cantor, 1969) in MEGA. We also attempted to obtain dN/dS ratios for gene alignments using PAL2NAL (Suyama, Torrents & Bork, 2006).

Mitochondrion-targeted nuclear genes

In the context of evaluating hypotheses around mitochondrial DNA repair machinery (see below), we searched for homologues of nuclear-encoded DNA repair proteins that are targeted to the mitochondrion in higher plants: MSH1, RECA proteins and OSB1. Arabidopsis sequences were used as queries in BLAST searches with default settings against the nuclear contigs in the O. quekettii hybrid assembly, as well as the O. quekettii transcriptome in Geneious. We searched for putative targeting signals to the mitochondrion with DeepLoc-1.0 using default settings (Almagro Armenteros et al., 2017).

Results

Ostreobium mitochondrial genome

The mitochondrial genome was captured by a single contig in the Canu assembly, but six separate contigs in the Spades assembly. The genome was captured by a single contig in the hybrid genome assembly that showed 99.97% pairwise identity with our manually corrected version of the genome. The combination of long- and short-read data improved this assembly and allowed us to capture the entire genome with a high level of accuracy.

The mitochondrial genome of O. quekettii SAG6.99 assembled into a 241,739 bp circular-mapping molecule (Fig. 1). All 64 codons are used (Table S1) and the 28 tRNAs encoded by the O. quekettii mtDNA (Table S2) appear to be sufficient to recognise all of these codons assuming the standard genetic code and maximum use of wobbling and superwobbling (Alkatib et al., 2012). Only 25% of the genome is coding DNA (Table 1). The genome encodes 3 rRNAs and 28 tRNAs (Tables 2 and 3), resembling other green algal mitochondrial genomes (Table S3). It also encodes 34 named protein-coding genes commonly found in green algae (Table 3; Table S3). This does include some genes that are less common in Chlorophyta mitochondrial genomes: nad10, which is absent from the mtDNA of sequenced land plants and many green algae (Mower, Sloan & Alverson, 2012), and tatC, a gene encoding a component of the inner membrane TAT translocase responsible for transporting folded proteins across the membrane in bacteria but whose function in mitochondria remains unclear (Carrie, Weißenberger & Soll, 2016; Petrů et al., 2018).

Figure 1. Mitochondrial genome map of Ostreobium quekettii SAG6.99.

Figure 1

The position of tRNAs are shown on the outer track (red lines). The first inner circle represents the position, size and the names of the protein-coding and rRNA genes. The introns are shown in the second inner circle and are colour coded according to intron types/subtypes: group I derived (very light blue), group IA (light blue), group IB (blue), group ID (dark blue), group II (orange), unknown (grey). The third inner circle represents the position and length (line thickness) of repeats.

Table 1. Summary of coding and noncoding content of the mtDNA of Ostreobium quekettii SAG6.99.

Percent of total noncoding DNA Percent of overall genome
Coding (rRNA, tRNA, ORFs, protein coding genes) 25%
Repeats 7% 5%
Intergenic DNA 61% 46%
Introns (including ORFs) 39%
Introns (excluding ORFs) 39% 29%
Total noncoding DNA (excluding ORFs) 75%
Total intronic and intergenic DNA (including ORFs) 85%

Table 2. Genes, introns and open reading frames present in mtDNA of Ostreobium quekettii SAG6.99. For further information on introns and ORFs, see Supplemental Material.

Number in genome
Protein coding genes 34
rRNA 3
tRNA 28
Genes containing introns 18
Introns 47
 Group I 14
 Group II 28
 Unclear 5
Introns containing ORFs 18
ORFs 20
 Intronic 20
 Intergenic 0

Table 3. Protein coding and ribosomal genes present in the mtDNA of Ostreobium quekettii SAG6.99.

Protein genes
Complex I (nad) nad1, nad2, nad3, nad4, nad4L, nad5, nad6, nad7, nad9, nad10
Complex III (cob) cob
Complex IV (cox) cox1, cox2, cox3
Complex V (atp) atp1, atp4, atp6, atp8, atp9
SSU ribosomal proteins (rps) rps2, rps3, rps4, rps7, rps10, rps11, rps12, rps13, rps14, rps19
LSU ribosomal proteins (rpl) rpl5, rpl6, rpl14, rpl16
Ribosomal RNAs rrn5, rrnS, rrnL
Putative Protein Transporter tatC

The overall GC content of the genome is 48.3% (Table 1), which is higher than the average for eukaryotic mitochondrial genomes (38%) (Smith & Lee, 2008) but similar to that of C. lentillifera (50.9%) (Zheng et al., 2018) and not extreme for green algae (Del Vasto et al., 2015). It does however vastly exceed the GC content of the O. quekettii chloroplast genome (31.9%) (Marcelino et al., 2016).

The O. quekettii mitochondrial genome contains 373 repeats, found in both introns and intergenic DNA, that represent 5% of the total genome (Fig. 1; Table 1), with a minimum length of 31 and maximum of 299 bp (mean 107.3 ± 61.7 SD). This is similar to the mitochondrial genome of C. lentillifera, where we estimated 349 repeats represent approximately 4% of the genome, with a minimum length of 25 bp and maximum of 163 bp (mean 68.3 ± 30.1 SD).

Introns

Most of the genome’s inflated size (75%) is from expanded intergenic and intronic regions rather than extra coding material. Eighteen of the 34 named protein-coding genes contain one or multiple intron(s) (Table 2; Table S4). Introns include both group I and group II introns, as well as five whose class could not be confidently determined (Table 2; Table S4). The Ostreobium mitochondrial genome contains 47 introns, compared with only 29 in the mtDNA of C. lentillifera (Zheng et al., 2018). Group II introns, which appear to have been abundant early in the evolution of Bryopsidales plastid genomes (Cremen et al., 2018) and are the dominant type in plant mtDNA but less common elsewhere (Lang, Laforest & Burger, 2007), are the dominant type in the O. quekettii mtDNA. This contrasts with the C. lentillifera mitochondrial genome, which also contains both types of introns but with group I introns predominating (Zheng et al., 2018).

Open reading frames-lacking introns from the O. quekettii and C. lentillifera mitochondrial genomes mostly form separate clusters in NJ trees of pairwise DNA distances (Fig. S1). Alignments of the few introns that do cluster together did not show convincing homology between C. lentillifera and O. quekettii mitochondrial genomes (alignments not shown), nor between introns in the O. quekettii mitochondrial and plastid genomes (Fig. S2). Furthermore, comparing insertion sites for introns between O. quekettii and C. lentillifera only revealed five sites where intron insertion sites of the same type of intron were within 3 bp of each other, and only a single site where they matched exactly (Table S4). The NJ tree did identify groups of similar group II introns within the O. quekettii mitochondrial genome, and these sequences did show convincing similarity at the DNA level in alignments, suggesting that these groups of introns proliferated within the Ostreobium lineage (Fig. S2).

Open reading frames with identifiable Pfam domains were only found in introns, not intergenic DNA (Table 2). Eighteen of the introns in the O. quekettii mitochondrial genome contain one or more ORFs. In a similarity network generated from all against all BLAST+ similarities, these ORFs appear to cluster together based on shared Pfam domains, which predict similar functions for these ORFs (Fig. 2). ORFs show homology to intron-encoded proteins that act as maturases and homing endonucleases, which enable splicing and promote intron mobility (Lambowitz & Belfort, 1993; Lambowitz & Zimmerly, 2011; Hausner, 2012). They contain a variety of domains, but mostly two types of domains with the amino-acid motif LAGLIDADG, which are common in both group I and group II introns (Hausner, 2012). Three ORFs contain double LAGLIDADG_1 domains, which in other lineages are in intron-encoded proteins with maturase activity that enable intron splicing (Lambowitz & Belfort, 1993). Four ORFs contain an RVT domain; Cremen et al. (2018) found evidence for the mobility of group II intron-encoded ORFs containing an RVT domain within Bryopsidales chloroplast genomes.

Figure 2. Similarity network generated from all against all BLAST+ similarities of ORFs encoded in the Caulerpa lentillifera mitochondrion, and Ostreobium quekettii mitochondrion and chloroplast.

Figure 2

Each node represents an ORF, and each edge (line) represents a significant HSP (high scoring segment pair), shaded according to p value. Generated using CLANS through the MPI Bioinformatics Toolkit (Scoring Matrix BLOSSUM62, extracting BLAST HSPs up to e-values of 1e−4, using p-values better than 1.0).

Only one ORF identified by Zheng et al. (2018) in C. lentillifera had a positive hit to the Pfam database: ORF932 in cox1 had a putative LAGLIDADG_1 domain. This sequence clustered with O. quekettii ORF sequences containing the same domain (Fig. 2).

None of the O. quekettii mitochondrial ORFs show similarity to the single ORF460 identified in the O. quekettii chloroplast, which has a putative intron splicing function (Cremen et al., 2018). Along with the results for the introns, this suggests there is little evidence for the transfer of ORFs or introns between organellar genomes in O. quekettii.

Testing evolutionary theories: rates of evolution and mitochondrion-targeted proteins

The inflated mitochondrial genome of O. quekettii prompted several hypotheses (see below) that could be examined using molecular data.

In order to gain insight into whether there are differential mutation rates between the plastid and mitochondrial genomes, that might help explain their differing sizes in both O. quekettii and C. lentillifera, we aligned genes from the two organellar genomes between the two species. Estimates of base substitutions per site using a simple Jukes–Cantor model showed a slightly higher (t = 2.94, p < 0.01) number of average base substitutions per site for the mitochondrial genomes (mean 0.456 ± 0.207 SE) compared with chloroplast genomes (mean 0.370 ± 0.016 SE) (Table S5). Genes showed signs of saturated divergence, with estimated dS values considerably greater than one (e.g. atp6 dS = 46.8711).

To identify potential homologues of nuclear-encoded DNA repair proteins that are targeted to the mitochondrion in higher plants and may play a role in organellar genome maintenance and size (see below), we performed blast searches of Arabidopsis sequences against the O. quekettii transcriptome and nuclear contigs. We identified a sequence in O. quekettii with strong similarity to the Arabidopsis DNA mismatch repair protein MSH1 and putative targeting to the mitochondrion (DeepLoc-1.0L: Mitochondrion 0.7151, Soluble 0.6145). Pfam and InterPro searches identified a putative specific DNA-binding GIY-YIG domain in this MSH1 homologue, as well as in potential homologues in other green algae. However, this domain lacks most of the key residues conserved among GIY-YIG family members (Garrison & Arrizabalaga, 2009). We also identified putative mitochondrion-targeted (DeepLoc-1.0L: Mitochondrion 0.9268 and 0.6334, Soluble 0.5699 and 0.6798) homologues of RECA proteins, which are also predicted to play a role in controlling mitochondrial genome maintenance in plants. Searches of the O. quekettii transcriptome revealed potential OSB1 homologues, predicted to be involved in homologous recombination-dependent repair, containing a central OB-fold domain but lacking a targeting signal to the mitochondrion or chloroplast (DeepLoc-1.0L: Nucleus 0.6485, Soluble 0.7227).

Discussion

The inflated mitochondrial genome of O. quekettii, the largest Chlorophyta mitochondrial genome sequenced thus far, even larger than that of C. lentillifera, lends support to large mitochondrial genomes being a characteristic of the order Bryopsidales. The O. quekettii mitochondrial genome encodes all genes commonly found in Chlorophyta mitochondrial genomes, including most ribosomal protein genes, which have been unevenly retained in plant mitochondria (Palmer et al., 2000; Mower, Sloan & Alverson, 2012), and are missing in some Chlorophyta lineages such as the Chlorophyceae and Pedinomonas (e.g. Fig. S3). Surprisingly, although ribosomal proteins have been identified in other Ulvophyceae mitochondrial genomes thus far, none were identified in the mitochondrial genome of C. lentillifera (Zheng et al., 2018). The large size of both Bryopsidales mitochondrial genomes relative to their plastids is not typical of the Chlorophyta, where chloroplast genomes tend to be either similar size or larger and contain more noncoding DNA than their mostly compact intron-poor mitochondrial counterparts (Leliaert et al., 2012). Instead, these genomes are more typical of land plants: bloated with introns and intergenic DNA (Leliaert et al., 2012; Mower, Sloan & Alverson, 2012). In fact, the Bryopsidales mitochondrial genomes are larger than those of many streptophytes, above the average range cited for non-vascular land plants but not quite as extreme as has been described in the Trachaeophytes (Mower, Sloan & Alverson, 2012). The GC content of the Bryopsidales mitochondrial genomes is also high, markedly higher than their chloroplasts. Differing nucleotide content between organelle genomes, however, is not unheard of in green algae, nor is drastic variation in organelle nucleotide content between even closely related lineages (Smith et al., 2011; Del Vasto et al., 2015).

There is evidence for at least proliferation of group II introns within the Ostreobium lineage, something that is not uncommon in green algal chloroplast genomes (Cremen et al., 2018). However, a lack of sequence similarity for ORFs identified between their genomes, alongside the fact that their introns are not readily alignable, suggests that either the introns and associated proteins arose independently between C. lentillifera and O. quekettii before proliferating within their respective lineages, or that sequences have diverged so much that similarities are no longer recognisable. This is not unlikely given the estimated 479 million years divergence time between these lineages (Verbruggen et al., 2009). Cremen et al. (2018) did find some homologous ORFs, with conserved protein domains, between Bryopsidales chloroplast genomes. The lack of similarity between Ostreobium mitochondrial ORFs and its single plastid ORF with a putative intron splicing function (Cremen et al., 2018), as well as between introns from the organellar genomes, indicates there has been no transfer of introns between organellar genomes.

The considerable difference in their sizes and gene density as well as a lack of evidence for ORF or intron transfer between organellar compartments indicates that evolutionary forces are acting upon plastid and mitochondrial genomes differently in the Bryopsidales. The MHH (Lynch, Koskella & Schaack, 2006) and drift-barrier (Lynch et al., 2016) hypotheses emphasise the importance of mutation rate in determining genome sizes. Molecular evolution rates of the Ostreobium chloroplast are slow compared with other Bryopsidales; Marcelino et al. (2016) propose that this is due to the low light habitat of Ostreobium, which might reduce cases of direct sunlight-induced mutation as well as drive the evolution of slow metabolic rates and generation times with fewer mutations accumulated per unit of time. Estimates in this study indicate a slightly higher substitution rate in Bryopsidales mitochondrial genomes compared with the chloroplasts, which appears to contradict the MHH that states genomes with a higher mutation rate should show signs of compaction, as they are under greater selective pressure to remove excess DNA that carries a mutational burden (Lynch, Koskella & Schaack, 2006). dS values estimated from gene alignments show signs of saturated divergence, making accurate calculation of dN/dS unrealistic (Yang & Bielawski, 2000; Hurst, 2002). This is unsurprising due to the considerable time since the divergence of these lineages. It is not uncommon for mutation rates to vary between lineages and even between genome compartments of the same lineages by a considerably larger margin than was estimated in this study, and it can be difficult to draw direct connexions between mutation rates and genome architecture (Smith, 2016). An example is the green alga Dunaliella salina, which contains inflated organelle genomes, both chloroplast and mitochondrial, but has order-of-magnitude differences in mutation rates between the two compartments, with substitution rates between two strains of D. salina 2–13 times greater in mtDNA than ptDNA (Del Vasto et al., 2015). Differing mutation rates alone appear insufficient to explain the genome dynamics of the two Bryopsidales organellar compartments.

The higher mutation rate in the Bryopsidales mitochondrial genomes may have driven selection for more efficient DNA repair proteins, as would be predicted by the drift-barrier hypothesis (Lynch et al., 2016). It has been proposed that two distinct strategies have evolved to protect organelle genomes from the negative effects of non-homologous recombination. In animal mtDNA, elevated rates of evolution cause noncoding elements to pose a high mutational burden, leading to selection against noncoding elements, avoiding the build-up of repeats and introns (Lynch, Koskella & Schaack, 2006; Galtier, 2011). Plants appear to have efficient recombination-mediated DNA repair of coding DNA, explaining their low measured mutation rate (Odahara et al., 2009; Davila et al., 2011; Christensen, 2014). The nuclear-encoded RECA3 and MSH1 genes in plants are hypothesised to control mitochondrial genome maintenance, by preventing replication of short repeats while allowing recombination-dependent replication of longer repeats (Shedge et al., 2007). We identified a putative mitochondrion-targeted MSH1 homologue in O. quekettii. MSH1 encodes a protein with six conserved domains (Kowalski et al., 1999) including domain VI, a GIY-YIG homing endonuclease, which is predicted to be responsible for specific DNA-binding and suppression of homologous recombination (Fukui et al., 2018) and is specific to only the plant form of the protein (Abdelnoor et al., 2006; Shedge et al., 2007). Domain VI is absent from nuclear localised homologues in plants (MSH2–MSH6) and from the yeast MSH1 protein (Abdelnoor et al., 2006). Although InterPro and Pfam predicted a GIY-YIG domain in the O. quekettii MSH1 homologue, the fact that it lacks most of the residues that are typically conserved in this domain leaves its function unresolved.

Ostreobium quekettii also appears to encode mitochondrially-targeted RecA proteins. RecA recombinases in Arabidopsis are involved in strand exchange and the joining of paired DNA ends during homologous recombination (Kühn & Gualberto, 2012; Gualberto et al., 2014). RECA1 is chloroplast targeted, RECA2 is dual targeted to the mitochondrion and chloroplast, and RECA3 is targeted to the mitochondrion (Shedge et al., 2007). RECA3 mutations in Arabidopsis result in mitochondrial rearrangements similar but not identical to MSH1 mutants (Shedge et al., 2007). Odahara et al. (2009) propose that these RecA proteins mediate homologous recombination, which is significant for suppressing short repeat-mediated genome rearrangements in plant mitochondria. They suggest that this genome stabilisation provided by RecA could allow the number of group II introns, the dominant form in the O. quekettii mitochondrion, to increase (Odahara et al., 2009). OSB1 is another putative component of homologous recombination-dependent repair in plant mitochondria, which is also likely involved in restricting mtDNA recombination (Kühn & Gualberto, 2012; Gualberto et al., 2014). Searches of the O. quekettii transcriptome revealed potential OSB1 homologues. However, these lack targeting signals to organelles that would provide evidence supporting their predicted function.

In contrast to the mitochondrial and chloroplast genomes of V. carteri (Smith & Lee, 2009), and those recently published for Haematococcus lacustris (Zhang et al., 2019), as well as the mitochondrial genomes of many land plants (Palmer et al., 2000; Mower, Sloan & Alverson, 2012), little of the expanded content of Bryopsidales mtDNA is repetitive DNA. However, most of the repeats in the O. quekettii mitochondrial genome are so-called ‘intermediate’ repeats (50–600 bp) (Kühn & Gualberto, 2012). Repeats of this length are associated with MSH1-induced recombination in Arabidopsis mitochondria, that can lead to accumulation of DNA as well as complex rearrangements (Gualberto et al., 2014). Along with error prone repair these processes might result in low numbers of alternative genome configurations, ‘mitotypes’ that could eventually increase to become the dominant form of mtDNA (Gualberto et al., 2014). This working hypothesis, based on molecular data, proposes that a recombination-associated repair process in the Bryopsidales mitochondrion has resulted in its inflated size, and encourages further study to resolve the role played by these putative mitochondrion-targeted sequences as well as to determine whether recombination is in fact occurring in Bryopsidales mitochondrial genomes at all.

It is interesting to consider whether the build-up of introns in Bryopsidales mitochondrial genomes plays a part in modifying recombination if it is in fact occurring. Introns may play a role in reducing intragenic Hill–Roberston effects, that tend to result in locally reduced Ne, by increasing recombination rate between regions of the genome (Comeron & Kreitman, 2000). Bottlenecking of genomes, such as during the production of gametes for sexual reproduction, can also result in smaller genome Ne (Neiman & Taylor, 2009), which would reduce selection and might contribute to the higher mutational load observed in Bryopsidales mitochondrial genomes. Sexual reproduction has not been observed in O. quekettii, however it has been described in other Bryopsidales (Morabito, Gargiulo & Genovese, 2010) and, to our knowledge, no strictly asexual lineages have been observed. Comparing relative copy numbers of organelle genomes between adult plants and gametes could provide insight into the extent of bottlenecking that occurs during reproduction in the Bryopsidales.

The mutation and recombination rates in organelles are at least in part under the control of nuclear-encoded maintenance pathways (Smith & Keeling, 2015), with organelle genomes usually lacking the genes necessary for their own DNA replication and repair (Sloan & Taylor, 2012). It has been proposed that the evolution of similar features in both plastid and mitochondrial genomes within a species, as seen in some of the Chlorophyta, is due to ‘leakage’ of nuclear-encoded proteins controlling these processes between organellar compartments, with proteins normally targeted to one organelle also becoming targeted to the other (Smith & Keeling, 2015). The differing configurations of the two organellar genomes in C. lentillifera and O. quekettii suggest that this has not occurred in the Bryopsidales. As more nuclear genomes for Bryopsidales become available (Arimoto et al., 2019), there will soon be the opportunity for a more thorough study of all three genomic compartments in the Bryopsidales, including an examination of organelle-targeted DNA maintenance machinery to further uncover the forces underpinning their divergent organelle genome sizes.

Conclusions

Marcelino et al. (2016) suggested that selection due to a low light environment resulted in the reduced chloroplast genome of O. quekettii, but it is unclear why this selection would not act against the expanded genome size in the mitochondrion. It would appear that effective population size, recombination and/or mutation rate, influenced by nuclear-encoded proteins, are different between the two genomes, leading to a reduction in the strength of selection to influence evolution of the mitochondrial genome. Ultimately, it is likely overly simplistic to attempt to find a single explanation to cover all mitochondrial genome expansion (Smith & Keeling, 2015), with the evolution of organellar genomes in the Bryopsidales and other lineages a combination of many forces and factors.

Supplemental Information

Supplemental Information 1. Neighbour joining tree constructed from Clustal Omega distance matrix of ORF-lacking introns in C. lentillifera mitochondrial genome and O. quekettii mitochondrial and plastid genomes.
DOI: 10.7717/peerj.8273/supp-1
Supplemental Information 2. Neighbour joining tree from Clustal Omega distance matrix of ORF-lacking introns in O. quekettii mitochondrial and chloroplast genomes, and MAFFT alignments corresponding to clusters.
DOI: 10.7717/peerj.8273/supp-2
Supplemental Information 3. Raw data: Fasta alignments of clusters of ORF-lacking introns from Figure S2.
DOI: 10.7717/peerj.8273/supp-3
Supplemental Information 4. Codon usage in the protein coding genes in mtDNA of O. quekettii SAG6.99.
DOI: 10.7717/peerj.8273/supp-4
Supplemental Information 5. tRNAs present in mtDNA of O. quekettii SAG6.99.
DOI: 10.7717/peerj.8273/supp-5
Supplemental Information 6. Comparison of protein coding and ribosomal RNA genes in the mitochondrial genomes of a selection of Chloroplastida including O. quekettii SAG6.99.
DOI: 10.7717/peerj.8273/supp-6
Supplemental Information 7. Introns and associated ORFs within mtDNA of O. quekettii SAG6.99.
DOI: 10.7717/peerj.8273/supp-7
Supplemental Information 8. Estimates of base substitutions per site between genes from O. quekettii and C. lentillifera using the Jukes–Cantor model conducted in MEGA.
DOI: 10.7717/peerj.8273/supp-8

Acknowledgments

This research was supported by use of the computational facilities of the University of Melbourne High Performance Computing platform and the Nectar Research Cloud. This paper benefitted from useful discussions with Joseph Bielawski about evolution, as well as helpful comments from Geoffrey McFadden and Patrick Buerger on an earlier draft. We appreciate the thoughtful comments on the manuscript made by Jiri Neustupa and an anonymous reviewer.

Funding Statement

Funding was provided by the Australian Research Council (DP150100705) and the University of Melbourne (Early Career Researcher grant to Heroen Verbruggen and Computational Biology Research Initiative seed funding to Heroen Verbruggen, Kathryn Holt and Linda Blackall). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Additional Information and Declarations

Competing Interests

The authors declare that they have no competing interests.

Author Contributions

Sonja I. Repetti conceived and designed the experiments, performed the experiments, analysed the data, prepared figures and/or tables, authored or reviewed drafts of the paper, approved the final draft.

Christopher J. Jackson conceived and designed the experiments, performed the experiments, analysed the data, authored or reviewed drafts of the paper, approved the final draft.

Louise M. Judd performed the experiments, authored or reviewed drafts of the paper, approved the final draft.

Ryan R. Wick performed the experiments, authored or reviewed drafts of the paper, approved the final draft.

Kathryn E. Holt conceived and designed the experiments, contributed reagents/materials/analysis tools, authored or reviewed drafts of the paper, approved the final draft.

Heroen Verbruggen conceived and designed the experiments, contributed reagents/materials/analysis tools, authored or reviewed drafts of the paper, approved the final draft.

DNA Deposition

The following information was supplied regarding the deposition of DNA sequences:

The O. quekettii mitochondrial genome sequence is available at GenBank: MN514984.

Data Availability

The following information was supplied regarding data availability:

MAFFT alignments of ORF-lacking intron clusters identified in Fig. S2 are available in a Supplemental File.

Raw RNA sequencing data is available at ENA: PRJEB35267.

References

  • Abdelnoor et al. (2006).Abdelnoor RV, Christensen AC, Mohammed S, Munoz-Castillo B, Moriyama H, Mackenzie SA. Mitochondrial genome dynamics in plants and animals: convergent gene fusions of a MutS homologue. Journal of Molecular Evolution. 2006;63(2):165–173. doi: 10.1007/s00239-005-0226-9. [DOI] [PubMed] [Google Scholar]
  • Abe et al. (2010).Abe T, Ikemura T, Sugahara J, Kanai A, Ohara Y, Uehara H, Kinouchi M, Kanaya S, Yamada Y, Muto A, Inokuchi H. tRNADB-CE 2011: tRNA gene database curated manually by experts. Nucleic Acids Research. 2010;39(Database):D210–D213. doi: 10.1093/nar/gkq1007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Alkatib et al. (2012).Alkatib S, Scharff LB, Rogalski M, Fleischmann TT, Matthes A, Seeger S, Schöttler MA, Ruf S, Bock R. The contributions of wobbling and superwobbling to the reading of the genetic code. PLOS Genetics. 2012;8(11):e1003076. doi: 10.1371/journal.pgen.1003076. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Almagro Armenteros et al. (2017).Almagro Armenteros JJ, Sønderby CK, Sønderby SK, Nielsen H, Winther O. DeepLoc: prediction of protein subcellular localization using deep learning. Bioinformatics. 2017;33(21):3387–3395. doi: 10.1093/bioinformatics/btx431. [DOI] [PubMed] [Google Scholar]
  • Arimoto et al. (2019).Arimoto A, Nishitsuji K, Higa Y, Arakaki N, Hisata K, Shinzato C, Satoh N, Shoguchi E. A siphonous macroalgal genome suggests convergent functions of homeobox genes in algae and land plants. DNA Research. 2019;26(2):183–192. doi: 10.1093/dnares/dsz002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Bankevich et al. (2012).Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. Journal of Computational Biology. 2012;19(5):455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Beck & Lang (2010).Beck N, Lang B. MFannot, organelle genome annotation websever. Canada: Université de Montréal QC; 2010. [Google Scholar]
  • Bolger, Lohse & Usadel (2014).Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Burger, Gray & Lang (2003).Burger G, Gray MW, Lang BF. Mitochondrial genomes: anything goes. Trends in Genetics. 2003;19(12):709–716. doi: 10.1016/j.tig.2003.10.012. [DOI] [PubMed] [Google Scholar]
  • Carrie, Weißenberger & Soll (2016).Carrie C, Weißenberger S, Soll J. Plant mitochondria contain the protein translocase subunits TatB and TatC. Journal of Cell Science. 2016;129(20):3935–3947. doi: 10.1242/jcs.190975. [DOI] [PubMed] [Google Scholar]
  • Charlesworth (1994).Charlesworth B. The effect of background selection against deleterious mutations on weakly selected, linked variants. Genetics Research. 1994;63(3):213–227. doi: 10.1017/S0016672300032365. [DOI] [PubMed] [Google Scholar]
  • Christensen (2014).Christensen AC. Genes and junk in plant mitochondria—repair mechanisms and selection. Genome Biology and Evolution. 2014;6(6):1448–1453. doi: 10.1093/gbe/evu115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Cocquyt et al. (2010).Cocquyt E, Verbruggen H, Leliaert F, De Clerck O. Evolution and cytological diversification of the green seaweeds (Ulvophyceae) Molecular Biology and Evolution. 2010;27(9):2052–2061. doi: 10.1093/molbev/msq091. [DOI] [PubMed] [Google Scholar]
  • Comeron & Kreitman (2000).Comeron JM, Kreitman M. The correlation between intron length and recombination in Drosophila: dynamic equilibrium between mutational and selective forces. Genetics. 2000;156:1175–1190. doi: 10.1093/genetics/156.3.1175. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Comeron, Williford & Kliman (2008).Comeron JM, Williford A, Kliman RM. The Hill–Robertson effect: evolutionary consequences of weak selection and linkage in finite populations. Heredity. 2008;100(1):19–31. doi: 10.1038/sj.hdy.6801059. [DOI] [PubMed] [Google Scholar]
  • Cremen et al. (2016).Cremen MCM, Huisman JM, Marcelino VR, Verbruggen H. Taxonomic revision of Halimeda (Bryopsidales, Chlorophyta) in south-western Australia. Australian Systematic Botany. 2016;29(1):41–54. doi: 10.1071/SB15043. [DOI] [Google Scholar]
  • Cremen et al. (2018).Cremen MCM, Leliaert F, Marcelino VR, Verbruggen H. Large diversity of nonstandard genes and dynamic evolution of chloroplast genomes in siphonous green algae (Bryopsidales, Chlorophyta) Genome Biology and Evolution. 2018;10(4):1048–1061. doi: 10.1093/gbe/evy063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Cutter & Payseur (2013).Cutter AD, Payseur BA. Genomic signatures of selection at linked sites: unifying the disparity among species. Nature Reviews Genetics. 2013;14(4):262–274. doi: 10.1038/nrg3425. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Davila et al. (2011).Davila JI, Arrieta-Montiel MP, Wamboldt Y, Cao J, Hagmann J, Shedge V, Xu Y-Z, Weigel D, Mackenzie SA. Double-strand break repair processes drive evolution of the mitochondrial genome in Arabidopsis. BMC Biology. 2011;9(1):64. doi: 10.1186/1741-7007-9-64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • De Vries et al. (2013).De Vries J, Habicht J, Woehle C, Huang C, Christa G, Wägele H, Nickelsen J, Martin WF, Gould SB. Is ftsH the key to plastid longevity in sacoglossan slugs? Genome Biology and Evolution. 2013;5(12):2540–2548. doi: 10.1093/gbe/evt205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Del Cortona et al. (2017).Del Cortona A, Leliaert F, Bogaert KA, Turmel M, Boedeker C, Janouškovec J, Lopez-Bautista JM, Verbruggen H, Vandepoele K, De Clerck O. The plastid genome in cladophorales green algae is encoded by hairpin chromosomes. Current Biology. 2017;27(24):3771–3782.e6. doi: 10.1016/j.cub.2017.11.004. [DOI] [PubMed] [Google Scholar]
  • Del Vasto et al. (2015).Del Vasto M, Figueroa-Martinez F, Featherston J, Gonzalez MA, Reyes-Prieto A, Durand PM, Smith DR. Massive and widespread organelle genomic expansion in the green algal genus Dunaliella. Genome Biology and Evolution. 2015;7(3):656–663. doi: 10.1093/gbe/evv027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Fang et al. (2017).Fang L, Leliaert F, Zhang ZH, Penny D, Zhong BJ. Evolution of the chlorophyta: insights from chloroplast phylogenomic analyses. Journal of Systematics and Evolution. 2017;55(4):322–332. doi: 10.1111/jse.12248. [DOI] [Google Scholar]
  • Felsenstein (2005).Felsenstein J. PHYLIP (Phylogeny inference package) Seattle: Department of Genome Sciences, University of Washington; 2005. [Google Scholar]
  • Finn et al. (2017).Finn RD, Attwood TK, Babbitt PC, Bateman A, Bork P, Bridge AJ, Chang H-Y, Dosztányi Z, El-Gebali S, Fraser M, Gough J, Haft D, Holliday GL, Huang H, Huang X, Letunic I, Lopez R, Lu S, Marchler-Bauer A, Mi H, Mistry J, Natale DA, Necci M, Nuka G, Orengo CA, Park Y, Pesseat S, Piovesan D, Potter SC, Rawlings ND, Redaschi N, Richardson L, Rivoire C, Sangrador-Vegas A, Sigrist C, Sillitoe I, Smithers B, Squizzato S, Sutton G, Thanki N, Thomas PD, Tosatto S C E, Wu CH, Xenarios I, Yeh L-S, Young S-Y, Mitchell AL. InterPro in 2017—beyond protein family and domain annotations. Nucleic Acids Research. 2017;45(D1):D190–D199. doi: 10.1093/nar/gkw1107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Frickey & Lupas (2004).Frickey T, Lupas A. CLANS: a Java application for visualizing protein families based on pairwise similarity. Bioinformatics. 2004;20(18):3702–3704. doi: 10.1093/bioinformatics/bth444. [DOI] [PubMed] [Google Scholar]
  • Fučíková et al. (2014).Fučíková K, Leliaert F, Cooper ED, Škaloud P, D’hondt S, De Clerck O, Gurgel CF, Lewis LA, Lewis PO, Lopez-Bautista JM. New phylogenetic hypotheses for the core chlorophyta based on chloroplast sequence data. Frontiers in Ecology and Evolution. 2014;2:63. [Google Scholar]
  • Fukui et al. (2018).Fukui K, Harada A, Wakamatsu T, Minobe A, Ohshita K, Ashiuchi M, Yano T. The GIY-YIG endonuclease domain of Arabidopsis MutS homolog 1 specifically binds to branched DNA structures. FEBS Letters. 2018;592(24):4066–4077. doi: 10.1002/1873-3468.13279. [DOI] [PubMed] [Google Scholar]
  • Galtier (2011).Galtier N. The intriguing evolutionary dynamics of plant mitochondrial DNA. BMC Biology. 2011;9(1):61. doi: 10.1186/1741-7007-9-61. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Garrison & Arrizabalaga (2009).Garrison EM, Arrizabalaga G. Disruption of a mitochondrial MutS DNA repair enzyme homologue confers drug resistance in the parasite Toxoplasma gondii. Molecular Microbiology. 2009;72(2):425–441. doi: 10.1111/j.1365-2958.2009.06655.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Gelfand, Rodriguez & Benson (2007).Gelfand Y, Rodriguez A, Benson G. TRDB—the tandem repeats database. Nucleic Acids Research. 2007;35(Database):D80–D87. doi: 10.1093/nar/gkl1013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Grabherr et al. (2011).Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, Chen Z, Mauceli E, Hacohen N, Gnirke A, Rhind N, Di Palma F, Birren BW, Nusbaum C, Lindblad-Toh K, Friedman N, Regev A. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nature Biotechnology. 2011;29(7):644–652. doi: 10.1038/nbt.1883. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Griffiths-Jones et al. (2003).Griffiths-Jones S, Bateman A, Marshall M, Khanna A, Eddy SR. Rfam: an RNA family database. Nucleic Acids Research. 2003;31(1):439–441. doi: 10.1093/nar/gkg006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Gualberto et al. (2014).Gualberto JM, Mileshina D, Wallet C, Niazi AK, Weber-Lotfi F, Dietrich A. The plant mitochondrial genome: dynamics and maintenance. Biochimie. 2014;100:107–120. doi: 10.1016/j.biochi.2013.09.016. [DOI] [PubMed] [Google Scholar]
  • Hausner (2012).Hausner G. Introns, mobile elements, and plasmids. In: Bullerwell CE, editor. Organelle Genetics. Berlin: Springer; 2012. pp. 329–357. [Google Scholar]
  • Hill & Robertson (1966).Hill WG, Robertson A. The effect of linkage on limits to artificial selection. Genetics Research. 1966;8(3):269–294. doi: 10.1017/S0016672300010156. [DOI] [PubMed] [Google Scholar]
  • Hurst (2002).Hurst LD. The Ka/Ks ratio: diagnosing the form of sequence evolution. Trends in Genetics. 2002;18(9):486–487. doi: 10.1016/S0168-9525(02)02722-1. [DOI] [PubMed] [Google Scholar]
  • Jukes & Cantor (1969).Jukes TH, Cantor CR. Evolution of protein molecules. Mammalian Protein Metabolism. 1969;3:132. [Google Scholar]
  • Kearse et al. (2012).Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, Thierer T, Ashton B, Meintjes P, Drummond A. Geneious basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012;28(12):1647–1649. doi: 10.1093/bioinformatics/bts199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Kinouchi & Kurokawa (2006).Kinouchi M, Kurokawa K. [Special issue: fact databases and freewares] tRNAfinder: a software system to find all tRNA genes in the DNA sequence based on the cloverleaf secondary structure. Journal of Computer Aided Chemistry. 2006;7:116–124. doi: 10.2751/jcac.7.116. [DOI] [Google Scholar]
  • Koren et al. (2017).Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Research. 2017;27(5):722–736. doi: 10.1101/gr.215087.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Kowalski et al. (1999).Kowalski JC, Belfort M, Stapleton MA, Holpert M, Dansereau JT, Pietrokovski S, Baxter SM, Derbyshire V. Configuration of the catalytic GIY-YIG domain of intron endonuclease I-TevI: coincidence of computational and molecular findings. Nucleic Acids Research. 1999;27(10):2115–2125. doi: 10.1093/nar/27.10.2115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Krasovec et al. (2017).Krasovec M, Eyre-Walker A, Sanchez-Ferandin S, Piganeau G. Spontaneous mutation rate in the smallest photosynthetic eukaryotes. Molecular Biology and Evolution. 2017;34(7):1770–1779. doi: 10.1093/molbev/msx119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Krzywinski et al. (2009).Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, Jones SJ, Marra MA. Circos: an information aesthetic for comparative genomics. Genome Research. 2009;19(9):1639–1645. doi: 10.1101/gr.092759.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Kühn & Gualberto (2012).Kühn K, Gualberto JM. Recombination in the stability, repair and evolution of the mitochondrial genome. In: Maréchal-Drouard L, editor. Advances in Botanical Research 63. Oxford: Academic Press; 2012. pp. 215–252. [Google Scholar]
  • Kumar et al. (2018).Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: molecular evolutionary genetics analysis across computing platforms. Molecular Biology and Evolution. 2018;35(6):1547–1549. doi: 10.1093/molbev/msy096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Kurtz et al. (2001).Kurtz S, Choudhuri JV, Ohlebusch E, Schleiermacher C, Stoye J, Giegerich R. REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Research. 2001;29(22):4633–4642. doi: 10.1093/nar/29.22.4633. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Lagesen et al. (2007).Lagesen K, Hallin P, Rødland EA, Stærfeldt H-H, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Research. 2007;35(9):3100–3108. doi: 10.1093/nar/gkm160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Lambowitz & Belfort (1993).Lambowitz AM, Belfort M. Introns as mobile genetic elements. Annual Review of Biochemistry. 1993;62(1):587–622. doi: 10.1146/annurev.bi.62.070193.003103. [DOI] [PubMed] [Google Scholar]
  • Lambowitz & Zimmerly (2011).Lambowitz AM, Zimmerly S. Group II introns: mobile ribozymes that invade DNA. Cold Spring Harbor Perspectives in Biology. 2011;3(8):a003616. doi: 10.1101/cshperspect.a003616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Lang, Laforest & Burger (2007).Lang BF, Laforest M-J, Burger G. Mitochondrial introns: a critical view. Trends in Genetics. 2007;23(3):119–125. doi: 10.1016/j.tig.2007.01.006. [DOI] [PubMed] [Google Scholar]
  • Laslett & Canback (2004).Laslett D, Canback B. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Research. 2004;32(1):11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Leliaert et al. (2012).Leliaert F, Smith DR, Moreau H, Herron MD, Verbruggen H, Delwiche CF, De Clerck O. Phylogeny and molecular evolution of the green algae. Critical Reviews in Plant Sciences. 2012;31(1):1–46. doi: 10.1080/07352689.2011.615705. [DOI] [Google Scholar]
  • Lowe & Chan (2016).Lowe TM, Chan PP. tRNAscan-SE On-line: integrating search and context for analysis of transfer RNA genes. Nucleic Acids Research. 2016;44(W1):W54–W57. doi: 10.1093/nar/gkw413. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Lynch (2006).Lynch M. Streamlining and simplification of microbial genome architecture. Annual Review of Microbiology. 2006;60(1):327–349. doi: 10.1146/annurev.micro.60.080805.142300. [DOI] [PubMed] [Google Scholar]
  • Lynch et al. (2016).Lynch M, Ackerman MS, Gout J-F, Long H, Sung W, Thomas WK, Foster PL. Genetic drift, selection and the evolution of the mutation rate. Nature Reviews Genetics. 2016;17(11):704–714. doi: 10.1038/nrg.2016.104. [DOI] [PubMed] [Google Scholar]
  • Lynch, Koskella & Schaack (2006).Lynch M, Koskella B, Schaack S. Mutation pressure and the evolution of organelle genomic architecture. Science. 2006;311(5768):1727–1730. doi: 10.1126/science.1118884. [DOI] [PubMed] [Google Scholar]
  • Lynch & Walsh (2007).Lynch M, Walsh B. The origins of genome architecture 98. Sunderland: Sinauer Associates; 2007. [Google Scholar]
  • Magnusson, Fine & Kühl (2007).Magnusson SH, Fine M, Kühl M. Light microclimate of endolithic phototrophs in the scleractinian corals Montipora monasteriata and Porites cylindrica. Marine Ecology Progress Series. 2007;332:119–128. doi: 10.3354/meps332119. [DOI] [Google Scholar]
  • Marcelino et al. (2016).Marcelino VR, Cremen MCM, Jackson CJ, Larkum AAW, Verbruggen H. Evolutionary dynamics of chloroplast genomes in low light: a case study of the endolithic green alga Ostreobium quekettii. Genome Biology and Evolution. 2016;8(9):2939–2951. doi: 10.1093/gbe/evw206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Martin (2011).Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal. 2011;17(1):10–12. doi: 10.14806/ej.17.1.200. [DOI] [Google Scholar]
  • Mine, Sekida & Okuda (2015).Mine I, Sekida S, Okuda K. Cell wall and cell growth characteristics of giant-celled algae. Phycological Research. 2015;63(2):77–84. doi: 10.1111/pre.12088. [DOI] [Google Scholar]
  • Morabito, Gargiulo & Genovese (2010).Morabito M, Gargiulo G, Genovese G. A review of life history pathways in Bryopsis species. Atti della Accademia Peloritana dei Pericolanti-Classe di Scienze Fisiche, Matematiche e Naturali. 2010;LXXXVIII(2):C1A1002005. doi: 10.1478/C1A1002005. [DOI] [Google Scholar]
  • Mower, Sloan & Alverson (2012).Mower JP, Sloan DB, Alverson AJ. Plant mitochondrial genome diversity: the genomics revolution. In: Wendel J, Greilhuber J, Dolezel J, Leitch IJ, editors. Plant Genome Diversity. Vol. 1. Wien: Springer; 2012. pp. 123–144. [Google Scholar]
  • Neiman & Taylor (2009).Neiman M, Taylor DR. The causes of mutation accumulation in mitochondrial genomes. Proceedings of the Royal Society B: Biological Sciences. 2009;276(1660):1201–1209. doi: 10.1098/rspb.2008.1758. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Ness et al. (2016).Ness RW, Kraemer SA, Colegrave N, Keightley PD. Direct estimate of the spontaneous mutation rate uncovers the effects of drift and recombination in the Chlamydomonas reinhardtii plastid genome. Molecular Biology and Evolution. 2016;33(3):800–808. doi: 10.1093/molbev/msv272. [DOI] [PubMed] [Google Scholar]
  • Odahara et al. (2009).Odahara M, Kuroiwa H, Kuroiwa T, Sekine Y. Suppression of repeat-mediated gross mitochondrial genome rearrangements by RecA in the moss Physcomitrella patens. Plant Cell. 2009;21(4):1182–1194. doi: 10.1105/tpc.108.064709. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Palmer et al. (2000).Palmer JD, Adams KL, Cho Y, Parkinson CL, Qiu Y-L, Song K. Dynamic evolution of plant mitochondrial genomes: mobile genes and introns and highly variable mutation rates. Proceedings of the National Academy of Sciences of the United States of America. 2000;97(13):6960–6966. doi: 10.1073/pnas.97.13.6960. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Petrů et al. (2018).Petrů M, Wideman J, Moore K, Alcock F, Palmer T, Doležal P. Evolution of mitochondrial TAT translocases illustrates the loss of bacterial protein transport machines in mitochondria. BMC Biology. 2018;16(1):141. doi: 10.1186/s12915-018-0607-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Platt, Weber & Liberles (2018).Platt A, Weber CC, Liberles DA. Protein evolution depends on multiple distinct population size parameters. BMC Evolutionary Biology. 2018;18(1):17. doi: 10.1186/s12862-017-1085-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Pombert et al. (2004).Pombert J-F, Otis C, Lemieux C, Turmel M. The complete mitochondrial DNA sequence of the green alga Pseudendoclonium akinetum (Ulvophyceae) highlights distinctive evolutionary trends in the chlorophyta and suggests a sister-group relationship between the ulvophyceae and chlorophyceae. Molecular Biology and Evolution. 2004;21(5):922–935. doi: 10.1093/molbev/msh099. [DOI] [PubMed] [Google Scholar]
  • Ricci et al. (2019).Ricci F, Marcelino VR, Blackall L, Kühl M, Medina M, Verbruggen H. Beneath the surface: community assembly and functions of the coral skeleton microbiome. Microbiome. 2019;7:159. doi: 10.1186/s40168-019-0762-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Schlichter, Kampmann & Conrady (1997).Schlichter D, Kampmann H, Conrady S. Trophic potential and photoecology of endolithic algae living within coral skeletons. Marine Ecology. 1997;18(4):299–317. doi: 10.1111/j.1439-0485.1997.tb00444.x. [DOI] [Google Scholar]
  • Shedge et al. (2007).Shedge V, Arrieta-Montiel M, Christensen AC, Mackenzie SA. Plant mitochondrial recombination surveillance requires unusual RecA and MutS homologs. Plant Cell. 2007;19(4):1251–1264. doi: 10.1105/tpc.106.048355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Sievers et al. (2011).Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, Lopez R, McWilliam H, Remmert M, Söding J, Thompson JD, Higgins DG. Fast, scalable generation of high‐quality protein multiple sequence alignments using clustal omega. Molecular Systems Biology. 2011;7(1):539. doi: 10.1038/msb.2011.75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Sloan & Taylor (2012).Sloan DB, Taylor DR. Evolutionary rate variation in organelle genomes: the role of mutational processes. In: Bullerwell CE, editor. Organelle Genetics. Berlin: Springer; 2012. pp. 123–146. [Google Scholar]
  • Smith (2016).Smith DR. The mutational hazard hypothesis of organelle genome evolution: 10 years on. Molecular Ecology. 2016;25(16):3769–3775. doi: 10.1111/mec.13742. [DOI] [PubMed] [Google Scholar]
  • Smith et al. (2011).Smith DR, Burki F, Yamada T, Grimwood J, Grigoriev IV, Van Etten JL, Keeling PJ. The GC-rich mitochondrial and plastid genomes of the green alga Coccomyxa give insight into the evolution of organelle DNA nucleotide landscape. PLOS ONE. 2011;6(8):e23624. doi: 10.1371/journal.pone.0023624. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Smith & Haigh (1974).Smith JM, Haigh J. The hitch-hiking effect of a favourable gene. Genetics Research. 1974;23(1):23–35. doi: 10.1017/S0016672300014634. [DOI] [PubMed] [Google Scholar]
  • Smith et al. (2013).Smith DR, Hamaji T, Olson BJSC, Durand PM, Ferris P, Michod RE, Featherston J, Nozaki H, Keeling PJ. Organelle genome complexity scales positively with organism size in volvocine green algae. Molecular Biology and Evolution. 2013;30(4):793–797. doi: 10.1093/molbev/mst002. [DOI] [PubMed] [Google Scholar]
  • Smith & Keeling (2015).Smith DR, Keeling PJ. Mitochondrial and plastid genome architecture: reoccurring themes, but significant differences at the extremes. Proceedings of the National Academy of Sciences of the United States of America. 2015;112(33):10177–10184. doi: 10.1073/pnas.1422049112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Smith & Lee (2008).Smith DR, Lee RW. Mitochondrial genome of the colorless green alga Polytomella capuana: a linear molecule with an unprecedented GC content. Molecular Biology and Evolution. 2008;25(3):487–496. doi: 10.1093/molbev/msm245. [DOI] [PubMed] [Google Scholar]
  • Smith & Lee (2009).Smith D, Lee RW. The mitochondrial and plastid genomes of Volvox carteri: bloated molecules rich in repetitive DNA. BMC Genomics. 2009;10:132. doi: 10.1186/1471-2164-10-132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Smith & Lee (2010).Smith DR, Lee RW. Low nucleotide diversity for the expanded organelle and nuclear genomes of Volvox carteri supports the mutational-hazard hypothesis. Molecular Biology and Evolution. 2010;27(10):2244–2256. doi: 10.1093/molbev/msq110. [DOI] [PubMed] [Google Scholar]
  • Smith et al. (2010).Smith DR, Lee RW, Cushman JC, Magnuson JK, Tran D, Polle JEW. The Dunaliella salina organelle genomes: large sequences, inflated with intronic and intergenic DNA. BMC Plant Biology. 2010;10(1):83. doi: 10.1186/1471-2229-10-83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Suyama, Torrents & Bork (2006).Suyama M, Torrents D, Bork P. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Research. 2006;34(Suppl. 2):W609–W612. doi: 10.1093/nar/gkl315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Tribollet (2008).Tribollet A. The boring microflora in modern coral reef ecosystems: a review of its roles. In: Wisshak M, Tapanila L, editors. Current Developments in Bioerosion. Berlin: Springer; 2008. pp. 67–94. [Google Scholar]
  • Verbruggen et al. (2009).Verbruggen H, Ashworth M, LoDuca ST, Vlaeminck C, Cocquyt E, Sauvage T, Zechman FW, Littler DS, Littler MM, Leliaert F. A multi-locus time-calibrated phylogeny of the siphonous green algae. Molecular Phylogenetics and Evolution. 2009;50(3):642–653. doi: 10.1016/j.ympev.2008.12.018. [DOI] [PubMed] [Google Scholar]
  • Verbruggen et al. (2017).Verbruggen H, Marcelino VR, Guiry MD, Cremen MCM, Jackson CJ. Phylogenetic position of the coral symbiont Ostreobium (Ulvophyceae) inferred from chloroplast genome data. Journal of Phycology. 2017;53(4):790–803. doi: 10.1111/jpy.12540. [DOI] [PubMed] [Google Scholar]
  • Verbruggen & Tribollet (2011).Verbruggen H, Tribollet A. Boring algae. Current Biology. 2011;21(21):R876–R877. doi: 10.1016/j.cub.2011.09.014. [DOI] [PubMed] [Google Scholar]
  • Vroom & Smith (2001).Vroom PS, Smith CM. The challenge of siphonous green algae. American Scientist. 2001;89(6):525–531. doi: 10.1511/2001.6.524. [DOI] [Google Scholar]
  • Vroom & Smith (2003).Vroom PS, Smith CM. Life without cells. Biologist. 2003;50:222–226. [Google Scholar]
  • Wilhelm & Jakob (2006).Wilhelm C, Jakob T. Uphill energy transfer from long-wavelength absorbing chlorophylls to PS II in Ostreobium sp. is functional in carbon assimilation. Photosynthesis Research. 2006;87(3):323. doi: 10.1007/s11120-005-9002-3. [DOI] [PubMed] [Google Scholar]
  • Wyman, Jansen & Boore (2004).Wyman SK, Jansen RK, Boore JL. Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 2004;20(17):3252–3255. doi: 10.1093/bioinformatics/bth352. [DOI] [PubMed] [Google Scholar]
  • Yang & Bielawski (2000).Yang Z, Bielawski JP. Statistical methods for detecting molecular adaptation. Trends in Ecology & Evolution. 2000;15(12):496–503. doi: 10.1016/S0169-5347(00)01994-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Yurina & Odintsova (2016).Yurina N, Odintsova M. Mitochondrial genome structure of photosynthetic eukaryotes. Biochemistry (Moscow) 2016;81(2):101–113. doi: 10.1134/S0006297916020048. [DOI] [PubMed] [Google Scholar]
  • Zhang et al. (2019).Zhang X, Bauman N, Brown R, Richardson TH, Akella S, Hann E, Morey R, Smith DR. The mitochondrial and chloroplast genomes of the green alga Haematococcus are made up of nearly identical repetitive sequences. Current Biology. 2019;29(15):R736–R737. doi: 10.1016/j.cub.2019.06.040. [DOI] [PubMed] [Google Scholar]
  • Zheng et al. (2018).Zheng F, Liu H, Jiang M, Xu Z, Wang Z, Wang C, Du F, Shen Z, Wang B. The complete mitochondrial genome of the Caulerpa lentillifera (Ulvophyceae, Chlorophyta): sequence, genome content, organization structure and phylogenetic consideration. Gene. 2018;673:225–238. doi: 10.1016/j.gene.2018.06.050. [DOI] [PubMed] [Google Scholar]
  • Zimin et al. (2013).Zimin AV, Marçais G, Puiu D, Roberts M, Salzberg SL, Yorke JA. The MaSuRCA genome assembler. Bioinformatics. 2013;29(21):2669–2677. doi: 10.1093/bioinformatics/btt476. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Zimmermann et al. (2018).Zimmermann L, Stephens A, Nam S-Z, Rau D, Kübler J, Lozajic M, Gabler F, Söding J, Lupas AN, Alva V. A completely Reimplemented MPI bioinformatics toolkit with a new HHpred server at its Core. Journal of Molecular Biology. 2018;430(15):2237–2243. doi: 10.1016/j.jmb.2017.12.007. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Information 1. Neighbour joining tree constructed from Clustal Omega distance matrix of ORF-lacking introns in C. lentillifera mitochondrial genome and O. quekettii mitochondrial and plastid genomes.
DOI: 10.7717/peerj.8273/supp-1
Supplemental Information 2. Neighbour joining tree from Clustal Omega distance matrix of ORF-lacking introns in O. quekettii mitochondrial and chloroplast genomes, and MAFFT alignments corresponding to clusters.
DOI: 10.7717/peerj.8273/supp-2
Supplemental Information 3. Raw data: Fasta alignments of clusters of ORF-lacking introns from Figure S2.
DOI: 10.7717/peerj.8273/supp-3
Supplemental Information 4. Codon usage in the protein coding genes in mtDNA of O. quekettii SAG6.99.
DOI: 10.7717/peerj.8273/supp-4
Supplemental Information 5. tRNAs present in mtDNA of O. quekettii SAG6.99.
DOI: 10.7717/peerj.8273/supp-5
Supplemental Information 6. Comparison of protein coding and ribosomal RNA genes in the mitochondrial genomes of a selection of Chloroplastida including O. quekettii SAG6.99.
DOI: 10.7717/peerj.8273/supp-6
Supplemental Information 7. Introns and associated ORFs within mtDNA of O. quekettii SAG6.99.
DOI: 10.7717/peerj.8273/supp-7
Supplemental Information 8. Estimates of base substitutions per site between genes from O. quekettii and C. lentillifera using the Jukes–Cantor model conducted in MEGA.
DOI: 10.7717/peerj.8273/supp-8

Data Availability Statement

The following information was supplied regarding data availability:

MAFFT alignments of ORF-lacking intron clusters identified in Fig. S2 are available in a Supplemental File.

Raw RNA sequencing data is available at ENA: PRJEB35267.


Articles from PeerJ are provided here courtesy of PeerJ, Inc

RESOURCES