Skip to main content
International Journal of Molecular Sciences logoLink to International Journal of Molecular Sciences
. 2021 Feb 11;22(4):1811. doi: 10.3390/ijms22041811

Nanopore Sequencing Resolves Elusive Long Tandem-Repeat Regions in Mitochondrial Genomes

Liina Kinkar 1, Robin B Gasser 1,*, Bonnie L Webster 2,3, David Rollinson 2,3, D Timothy J Littlewood 2,3, Bill CH Chang 1, Andreas J Stroehlein 1, Pasi K Korhonen 1, Neil D Young 1,*
Editor: Miroslav Plohl
PMCID: PMC7918261  PMID: 33670420

Abstract

Long non-coding, tandem-repetitive regions in mitochondrial (mt) genomes of many metazoans have been notoriously difficult to characterise accurately using conventional sequencing methods. Here, we show how the use of a third-generation (long-read) sequencing and informatic approach can overcome this problem. We employed Oxford Nanopore technology to sequence genomic DNAs from a pool of adult worms of the carcinogenic parasite, Schistosoma haematobium, and used an informatic workflow to define the complete mt non-coding region(s). Using long-read data of high coverage, we defined six dominant mt genomes of 33.4 kb to 22.6 kb. Although no variation was detected in the order or lengths of the protein-coding genes, there was marked length (18.5 kb to 7.6 kb) and structural variation in the non-coding region, raising questions about the evolution and function of what might be a control region that regulates mt transcription and/or replication. The discovery here of the largest tandem-repetitive, non-coding region (18.5 kb) in a metazoan organism also raises a question about the completeness of some of the mt genomes of animals reported to date, and stimulates further explorations using a Nanopore-informatic workflow.

Keywords: Schistosoma haematobium, mitochondrial (mt) genome, tandem-repetitive DNA, non-coding (control) region, Oxford Nanopore technology, informatics

1. Introduction

Mitochondrial (mt) genomes display marked diversity in size and sequence among eukaryotic lineages, ranging from 6 kb in Plasmodium falciparum (malaria parasite) to >11 Mb in Silene conica (catchfly plant) [1,2]. In contrast to plants, fungi and numerous protists, published evidence indicates that the mt genomes of most metazoans appear to be remarkably compact, with seemingly limited variation in size [3,4,5,6,7]. Most animal mt genomes, particularly those of bilaterians, are 15–20 kb in size and encode ~37 genes [3,4]. Non-coding regions are usually reported to be short, apart from a ‘control region’ which often contains tandem-repetitive elements, usually comprising no more than 1.5 kb of the mt genome [8]. However, some early studies [9,10,11] had suggested the presence of long tandem-repeat regions of many thousands of nucleotide bases in mt genomes, which have remained largely elusive due to the technological challenges associated with sequencing them.

Long stretches of repetitive DNA are notoriously difficult to sequence using conventional Sanger- and second-generation (short-read) sequencing methods [12]. Repetitive elements that extend beyond the usual read length capacity of these platforms (~1 kb for Sanger sequencing; 100–300 bp for second-generation methods) cannot be reliably assembled (e.g., [13]). Although long-range PCR can be used to amplify DNA regions of several kilobases, sequencing through repetitive regions often leads to erroneous and/or ambiguous sequence reads as a consequence of self-priming of randomly-amplified repeat-segments, chimeras and/or jumping PCR artefacts [14,15]. Third-generation, single-molecule sequencing platforms, such as ‘PacBio’ and ‘Oxford Nanopore’, which can achieve read lengths of 80 kb to >1 Mb [16,17], now enable repetitive and structurally complex DNA elements to be resolved with confidence. Nanopore sequencing is particularly useful, as there is no theoretical limit on maximum read length, suggesting that mt genomes can be sequenced outright, irrespective of structural complexities.

The recent discovery of long stretches of tandem-repetitive DNA elements (~4–7 kb) in the mt genomes of parasitic flatworms using advanced sequencing/informatic workflows [18,19,20,21] has emphasised the need to scrutinise published mt genomes of socio-economically important trematodes such as the carcinogenic human blood-fluke, Schistosoma haematobium. Although the protein-coding complement of the mt genome of this dioecious trematode had been thoroughly characterised [22], there was an indication that the non-coding region inferred (estimated at 390 bp) was incomplete, being suggestive that structurally complex, repetitive DNA in this region could not be assembled at the time. Here, we used a Nanopore-informatic workflow to tackle this problem. We demonstrate how this workflow resolved this previously-elusive DNA region in the mt genome of S. haematobium, and revealed unexpected and marked variability in the length, structure and number of tandem-repeats in this non-coding mt region within this species. This workflow should have broad applicability to elucidating complex non-coding regions in metazoan mt genomes.

2. Results

2.1. Sequence Data Sets and Mapping Results

A total of 26,402 long-reads contained sequence tracts that matched the 5′- and 3′-flanking sequences of the incomplete non-coding region of the published mt genome of S. haematobium (see [22]). An appraisal of these reads revealed a tandem-repetitive region containing two distinct types of units (designated ShR1 and ShR2). We identified 4098 long-reads containing numerous such units (Figure S1), and 1760 of these reads bridged the entire tandem-repetitive non-coding region (Figure 1 and Figure S1). Then, we examined the nature and extent of repeats within this region.

Figure 1.

Figure 1

Distribution of repeat unit frequencies and long-read lengths. Both panels represent Nanopore long-reads (n = 1760) that spanned the entirety of the tandem-repeat region.

2.2. Complete mt Genomes for S. haematobium, with a Marked Variation in the Number of Repeat Units within the Non-Coding Region

Scrutiny of the 1760 long-reads revealed marked variation in the number of repeat units (range: 12 to 83) within the non-coding region (Figure 1); respective mt genome lengths were estimated at 19.4 and 47.8 kb, and associated non-coding regions were between 4.4 kb and 32.8 kb. Of all long-reads, those with 47, 46, 41, 35, 29 or 20 tandem-repeat units had the highest frequencies (Figure 1; Table 1). These high-frequency reads allowed us to define six complete mt genomes of 33,418 bp to 22,567 bp at high (>550- to 150-times) read-coverage. These representative genomes each harboured a single tandem-repetitive non-coding region varying in size from 18,458 bp to 7607 bp (Table 1; Figure 2 and Figure S2). The accuracy and completeness of the six mt genome sequences were confirmed by read-mapping (Figure 2 and Figure S2), with average nucleotide coverages ranging from 553- to 157-times. An analysis of the 1760 long-reads (range: 8362 to 46,810 bp) revealed that the lengths of the commonest sequences corresponded to those of the representative mt genomes (Figure 1). A comparison of the six mt genomes with that published previously for S. haematobium (15,003 bp; GenBank accession no. DQ157222; [22]) revealed a high nucleotide sequence similarity of ≥99.84% (prior to polishing, ≥99.77%) for all 12 protein-coding genes, 22 tRNAs and two rRNAs, and the same gene order.

Table 1.

Features of the six representative mitochondrial (mt) genomes defined for Schistosoma haematobium and their respective tandem-repetitive, non-coding regions.

Features (cf. Figure 2, Panel b)
Mt genome length/size (bp) 33,418 32,880 30,902 28,515 26,127 22,567
Length of tandem-repeat region (bp) 18,458 17,920 15,942 13,555 11,167 7607
Number of repeat units in this non-coding region
Tandem-repeat region relative to mt genome length/size (%)
No. of long-reads representing the tandem-repeat region
47
55.2
511
46
54.5
151
41
51.6
155
35
47.5
179
29
42.7
170
20
33.7
249

Figure 2.

Figure 2

Complete mitochondrial (mt) genomes representing Schistosoma haematobium. (a) Schematic representation of the dominant, representative mt genome of S. haematobium (33,418 bp; inner circle), including the newly-identified tandem-repeat region (18,458 bp; units ShR1 and ShR2 in orange and purple, respectively). The 12 protein-coding genes, 2 rRNAs and 22 tRNAs (in shades of grey) are in accord with a published reference mt genome available in GenBank (accession no. DQ157222; [22]); short non-coding regions (<350 bp) are in black. The outer circle represents the coverage of long-reads (produced by Oxford Nanopore sequencing) across the genome. The graph shows the depth of nucleotides at each position (grey dots) and the smoothed average of depth across the genome (solid dark grey line). Circular axes represent every 100 reads mapped. Numbers on the outer circle represent positions on the genome in base pairs. (b) Schematic representation of all established mt genome lengths in S. haematobium supported by >150 Nanopore long-reads (cf. Figure 1). Numbers inside represent the length of the mt genome and the number of units in the tandem-repeat region. Sizes of the circles are proportional to genome lengths.

2.3. Structural Features of the Non-Coding Regions

The annotation of the non-coding region for all six representative mt genomes revealed that units ShR1 (386 to 397 bp) and ShR2 (408 to 420 bp) were repeated in an alternating pattern. However, the first and last units in each tandem-repeat region were incomplete copies of ShR1; such ‘imperfect’ units lacked either the 5′- or 3′-end of ShR1 and were 356 to 358 bp and 90 bp in size, respectively (Figure 3). Irrespective of this size variation, ShR1 units were very similar in sequence upon alignment (average nucleotide identity: >99%; excluding ‘imperfect’ units). The only notable difference occurred in a~70 bp TA-rich sequence tract and related to a 5′-ATAAT-3′ repeat-motif, which was either absent, or repeated once or twice (Figure 3). The ShR2 units in all six genomes were conserved in sequence (average nucleotide identity: 99.30%) upon alignment, with most nucleotide differences relating to single insertion/deletion events (indels) in homopolymers. TA-rich regions in ShR2 units were short and did not exceed 15 bp.

Figure 3.

Figure 3

Structure of the tandem-repeat regions in the mitochondrial (mt) genomes of Schistosoma haematobium. (a) A schematic representation of the units ShR1 (orange) and ShR2 (purple). Insertions of 5′-ATAAT-3′ motifs within a TA-rich region in ShR1s are indicated. Scale in base pairs is shown at the bottom. (b) Secondary structures predicted for ShR1 and ShR2 (units in which they occur are indicated on the left of the structure). (c) A schematic representation of the hierarchical cluster analysis of repeat unit patterns (cf. Figure S3). Orange and purple shapes correspond to units ShR1 and ShR2 in panels (a) and (b). Grey numbers at branch tips correspond to the number of units. Lineage 1 (top) and 2 (bottom) are shaded in grey. Dashed lines demarcate the end of common repeat unit patterns within lineages. Scale in base pairs is shown at the bottom.

Parts of units ShR1 and ShR2 were predicted to fold into secondary DNA structures, varying from simple hairpin-loops to complex multi-branched structures (Figure 3). Notably, a 65 bp-section at the 5′-ends of ShR1s (excluding the first ShR1 unit of each genome, being incomplete at the 5′-end) assumed a tRNA-like structure (Figure 3) predicted to code for a serine. This sequence was conserved among all such ShR1 units in all six genomes, with a single nucleotide difference detected in only four of a total of 155 units.

2.4. Tandem-Repeat Patterns in the Non-Coding Region

The hierarchical cluster analysis of the patterns of units ShR1 and ShR2 in the 1760 long-reads revealed six well-supported groups of reads (with approximately unbiased p values at the majority of nodes being ≥95), which corresponded to the non-coding regions of the six mt genomes (Figure 3 and Figure S3). Two dominant lineages of repeat patterns were identified: ‘Lineage 1′ comprising 41, 35, 29 or 20 units, and ‘Lineage 2′ with 47 or 46 units (Figure 3 and Figure S3).

Scrutiny of repeat patterns within each of the two lineages indicated variation in the arrangement of the two units near the 3′-ends of the six distinct non-coding regions (Figure 3). The first and last ‘imperfect’ ShR1 units were consistent in length and nucleotide composition—the first was incomplete at the 5′-end and included a 5′-ATAATATAAT-3′ insertion; the last was consistently a 90 bp tract at the 5′-end. In all tandem-repeat lengths within Lineage 1, the penultimate unit was represented by an ShR1 unit carrying a single 5′-ATAAT-3′ insertion.

3. Discussion

The definition of non-coding regions in mt genomes requires an accurate assessment of the lengths, length-frequencies and nucleotide compositions of repetitive sequences. Although this has been challenging to achieve, particularly for expansive tandem-repeat regions and when amount, quality and molecular weight of genomic DNA for analyses are limiting, recent studies using third-generation (long-read) sequencing methods [18,19,20,21,23] have shown considerable promise to overcome this challenge. Here, we have demonstrated the effectiveness of Oxford Nanopore sequencing technology to read through long and complex tandem-repeat regions in mt genomes within a pool of S. haematobium adults, and the utility of a practical informatic approach to reliably define consensus non-coding regions and to dissect the nature and extent of heterogeneity within them.

The substantial lengths of the tandem-repetitive regions discovered here were unexpected (Figure 1 and Figure 2). More than a third of each of the six representative mt genomes characterised was repetitive DNA, with the largest genome (33.4 kb) harbouring the longest tandem-repeat region (18.5 kb) recorded to date for a metazoan organism. Usually, non-coding regions exceeded the total length of all protein-coding genes (Table 1; Figure 2). Tandem-repeat regions of <7 kb were rare within the pool of worms sequenced, and we detected one of 32.8 kb with 83 units (Figure 1). Although the biological function of such lengthy tandem-repeat regions is presently unknown, we hypothesise that they contain elements that regulate replication and/or transcription in S. haematobium.

In the absence of evolutionary pressure on the copy-number of repeat units, a small mt genome would be expected to have an advantage over a larger variant by being replicated more rapidly [24], leading to an accumulation of small mt genomes in cells. However, there seems to be an active mechanism generating and favouring long mt genome sizes in S. haematobium, overriding a putative sized-based selective advantage of a compact genome. Some examples of a selective inheritance of large variants of mt genomes have been attributed to expanded origins of replication, which are thought to increase replication efficiency [25,26,27]. This might be the case for S. haematobium. This proposal is plausible because, in other animals (e.g., mammals, birds, fish and insects), repetitive elements, stretches of non-coding DNA, TA-rich regions and/or secondary structures, such as hairpin and clover-leaf conformations (Figure 3), are often present in genuine ‘control’ regions, known to contain regulatory signals for replication and/or transcription [28,29,30,31,32,33]. Such expanded, regulatory regions might be retained solely due to a ‘selfish’ advantage in transmission [26], or may provide a means of ‘fine-tuning’ cellular energy production and genome maintenance strategies to particular environmental conditions, as we have hypothesised previously for other flatworms [20,21].

The six distinct, predominant sizes of tandem-repeat regions discovered suggest that repeat units might be under ‘stabilizing selection’, such that numbers and/or patterns of these units are ‘optimised’ for effective and essential regulation of transcription and replication [34]. Further, in the absence of any selective forces, one would expect the sizes of tandem-repeat regions representing Lineages 1 and 2 (Figure 3) to be randomly distributed. However, the hierarchical cluster analysis conducted here indicated that this was not the case (Figure S3)—there was a clear distinction between individual genomes with 20, 29, 35 or 41 units (exclusive to Lineage 1) and those with 46 or 47 units (Lineage 2), suggesting that the latter lineage might be selectively maintained at a higher and narrower size range. These two lineages might have evolved through sex-specific selection, one being specific to males and the other to females. Sex-specificity would assume a biparental inheritance of mt genomes, which has been suggested for a closely-related species—S. mansoni (see [35]). This mode of inheritance might lead to some embryos retaining, and others eliminating male-transmitted mt DNA, resulting in sex-specific mt genomes populating the germline [36]. However, as a pool of worms was sequenced here, we could not definitively establish whether the variation identified in the tandem-repeat region is within or among individual worms, or within or among cells of particular tissues (heteroplasmy). Future work is warranted to obtain long-read data from individual worms (females and males) from distinct, geographically disparate populations of S. haematobium, to gain a better appreciation of the diversity in length and structure of the tandem-repeat regions in the mt genome of this species, and to attempt to establish the selection pressure(s) leading to such variation.

The diversity in the length and composition of repeat units in mt tandem-repetitive, non-coding regions in S. haematobium raises a question as to the molecular mechanisms underlying this variation in flatworms. Lineage-specific variation in repeat patterns indicates that mechanisms leading to unit expansions, contractions and/or rearrangements appear to conserve the order of these units at the 5′-ends of tandem-repeat regions, whereas modifications seem to occur at the 3′-ends (Figure 3). Although the mutational processes leading to this diversity are unknown in flatworms, slipped-strand mispairing, imprecise or pre-mature termination of replication and/or inter- or intra-molecular recombination events during genome replication and/or maintenance have been proposed [8,37,38,39,40,41]. Clearly, these aspects warrant investigation. Future work might utilise two-dimensional neutral agarose gel electrophoresis and electron microscopy techniques [42] to explore the mode(s) of replication in S. haematobium, and to investigate whether tandem-repeat regions contain replication origins.

The presence of expansive tandem-repetitive regions is likely a common feature of schistosome species, as repetitive sequence tracts of ≥4 kb have been identified in S. bovis (partial assembly of PacBio-based long-read data; [19]), and detected in S. mansoni, S. japonicum and S. mekongi (short-read data, or restriction fragment length polymorphism and Southern blotting results; [11,43,44,45]). In other metazoans, long mt tandem-repetitive regions (>3 kb) seem to be rare, yet have been indicated in some phylogenetically divergent animal lineages including nematodes [10], insects [9,46], birds [47] and amphibians [23]. The propensity of some animal species/lineages to generate and tolerate lengthy tandem-repeat regions, while others select for a relatively short length is puzzling, and the evolutionary drive underlying such distinct architectures remains to be systematically addressed. Such investigations have largely been hampered by the inability of conventional methods to sequence across complex repetitive, non-coding regions, which has led to gaps in many published mt genome assemblies [48,49,50,51,52,53]. Clearly, Nanopore technology, with its ability to sequence long, intact DNA strands, without the need for read-assembly, lends itself well to the decoding of the mt genomes of a wide range of taxa across the Tree of Life [54]. Such an effort could open up new areas of investigation into the function and evolution of non-coding regions in the mt genomes of eukaryotes.

4. Materials and Methods

4.1. Parasite Material

An Egyptian strain of S. haematobium, for which a draft nuclear genome has been characterised [55,56], was used here. This strain is maintained in the Biomedical Research Institute, Rockville, Maryland [57] in Bulinus truncatus (intermediate snail host) and Mesocricetus auratus (hamster; mammalian definitive host). Adult worms were prepared and stored as described previously [55].

4.2. Isolation of High Molecular Weight Genomic DNA, Library Construction and Sequencing

High-quality total genomic DNA was isolated from a pool of 50 male and female adult worm pairs of S. haematobium using the Circulomics Tissue Kit (Circulomics, Baltimore, MD, USA) and used to construct two rapid-sequencing (SQK-RAD004) and two ligation-sequencing genomic DNA libraries (SQK-LSK109), according to the manufacturer’s protocol (Oxford Nanopore Technologies, Oxford, UK). For one rapid-sequencing and one ligation-sequencing library, low molecular weight DNA was removed using the 10 kb-Short Read Eliminator (SRE) kit (Circulomics, Baltimore, MD, USA). Each library was sequenced (48 h) in a distinct flow cell (R9.4.1) using the MinION sequencer (Oxford Nanopore Technologies). Following sequencing, bases were ‘called’ from HDF5 files (FAST5 format) using the program Guppy v.3.1.5 (Oxford Nanopore Technologies) and stored in the FASTQ format [58].

4.3. Defining the mt Genomes

First, long-reads containing sequence tracts that matched perfectly those flanking the incomplete non-coding region (i.e., positions 4921 to 5420 at the 5′-end and 5465 to 5964 at the 3′-end) of the published mt genome of S. haematobium from Mali (GenBank accession no. DQ157222; [22]) were identified using the BLASTn tool [59]. These reads were then assembled using the program Canu v. 2.0 [60], and repeats identified using the program ‘repeat-match’ in the MUMmer package v. 3.23 [61]. A library of identified repeat units and published mt protein genes of S. haematobium (cf. DQ157222; [22]) was used to critically assess the completeness of the non-coding region and the frequency of such units in reads identified using the program RepeatMasker v. 4.0.5 (http://www.repeatmasker.org). As some sequence reads produced using Nanopore technology can contain random errors, only reads with high coverage (≥150-times) with the commonest repeat unit frequencies (±1 unit, with no overlap permitted) were used to define consensus sequences. Coding regions were then polished with available short-read data [55] using Pilon v. 1.23 [62]. Finally, long-read data were mapped to the defined mt genomes using Minimap2 [63] (alignment threshold: 70% of read length), and coverage of the genomes was determined using mpileup in the SAMtools package [64]. The frequency of repeat units in long-reads, read lengths and nucleotide coverages were plotted using the software package R [65]; circular plots were generated using the tool Circos v0.69-8 [66].

4.4. Annotation of the mt Genomes and Characterisation of the Tandem-Repeat Regions

The newly-defined mt genomes were compared with that published for S. haematobium (DQ157222; [22]), and tRNA, rRNA and protein-coding genes annotated accordingly. The open reading frame (ORF) of each protein-coding gene was verified using the program Geneious v.11.1.5 [67], employing the mt genetic code for echinoderms and flatworms ([68]; https://www.ncbi.nlm.nih.gov/Taxonomy/Utils/wprintgc.cgi#SG9). Secondary structures were predicted using the Vienna RNA Websuite [69] and drawn using the tool Forna [70]. Complete mt genome sequences were deposited in the GenBank database under the accession nos. MW067222MW067227; raw data are available in the Sequence Read Archive (SRA) under the accession no. PRJNA78265.

4.5. Hierarchical Cluster Analysis

The repeat units and their order in long-reads spanning the entirety of the non-coding region were established using the program RepeatMasker v. 4.0.5, employing a library of identified units. For each long-read, the units within this region were used to create a document-term matrix employing the textmineR v. 3.0.4 package in R (www.rtextminer.com). This matrix was then re-weighted using the TF-IDF method by multiplying the repeat term frequency (TF) by an inverse document frequency (IDF) using textmineR. Subsequently, the reweighted matrix was subjected to hierarchical clustering using pvclust v. 2.2-0 [71], and using a correlation distance measure, the Ward’s (ward.D) agglomerative clustering method and 10,000 bootstrap replicates. The final dendrogram plot was created using the ggplot2 package v. 3.3.2 in R.

Acknowledgments

Schistosoma haematobium material was kindly provided by Margaret Mentink-Kane of the NIH-NIAID Schistosomiasis Resource Center, Biomedical Resource Institute, Rockville, MD 20850, USA.

Supplementary Materials

The following are available online at https://www.mdpi.com/1422-0067/22/4/1811/s1, Figure S1: distribution of repeat unit frequencies in Nanopore long-reads, Figure S2: nucleotide coverage of Nanopore long-reads across mitochondrial (mt) genomes with 46, 41, 35, 29 and 20 repeat units, Figure S3: hierarchical cluster analysis of the patterns of repeat units in the mitochondrial (mt) genomes of Schistosoma haematobium.

Author Contributions

Conceptualization, L.K., R.B.G., A.J.S., P.K.K. and N.D.Y.; methodology, L.K., R.B.G., A.J.S., P.K.K. and N.D.Y.; software, L.K., R.B.G., A.J.S., P.K.K. and N.D.Y.; validation, L.K., R.B.G. and N.D.Y.; formal analysis, L.K., R.B.G. and N.D.Y.; investigation, L.K., R.B.G., B.L.W., D.R., D.T.J.L. and N.D.Y.; resources, R.B.G., B.L.W., D.R., D.T.J.L. and B.C.H.C.; data curation, L.K., R.B.G. and N.D.Y.; writing—original draft preparation, L.K., R.B.G. and N.D.Y.; writing—review and editing, B.L.W., D.R., D.T.J.L., A.J.S. and P.K.K.; visualization, L.K., R.B.G. and N.D.Y.; supervision, R.B.G. and N.D.Y.; project administration, R.B.G. and N.D.Y.; funding acquisition, R.B.G., B.C.H.C., P.K.K. and N.D.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This project was supported through grants LP180101334 (N.D.Y. and P.K.K.) and LP180101085 (R.B.G. and B.C.H.C.) from the Australian Research Council (ARC). P.K.K. held an Early Career Research Fellowship from the National Health and Medical Research Council (NHMRC) of Australia. We acknowledge the use of HPC-GPGPU Facility computer resources hosted at the University of Melbourne. This Facility is supported by ARC Linkage Infrastructure, Equipment and Facilities (LIEF) Grant LE170100200.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Complete mitochondrial genome sequences have been deposited in the GenBank database under the accession nos. MW067222MW067227; raw data are available in the Sequence Read Archive (SRA) under the accession no. PRJNA78265.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Footnotes

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Sharma I., Rawat D.S., Pasha S.T., Biswas S., Sharma Y.D. Complete nucleotide sequence of the 6 kb element and conserved cytochrome b gene sequences among Indian isolates of Plasmodium falciparum. Int. J. Parasitol. 2001;31:1107–1113. doi: 10.1016/S0020-7519(01)00218-1. [DOI] [PubMed] [Google Scholar]
  • 2.Sloan D.B., Alverson A.J., Chuckalovcak J.P., Wu M., McCauley D.E., Palmer J.D., Taylor D.R. Rapid evolution of enormous, multichromosomal genomes in flowering plant mitochondria with exceptionally high mutation rates. PLoS Biol. 2012;10:e1001241. doi: 10.1371/journal.pbio.1001241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Gissi C., Iannelli F., Pesole G. Evolution of the mitochondrial genome of Metazoa as exemplified by comparison of congeneric species. Heredity. 2008;101:301–320. doi: 10.1038/hdy.2008.62. [DOI] [PubMed] [Google Scholar]
  • 4.Lavrov D.V., Pett W. Animal mitochondrial DNA as we do not know it: Mt-genome organization and evolution in nonbilaterian lineages. Genome Biol. Evol. 2016;8:2896–2913. doi: 10.1093/gbe/evw195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Zíková A., Hampl V., Paris Z., Týč J., Lukeš J. Aerobic mitochondria of parasitic protists: Diverse genomes and complex functions. Mol. Biochem. Parasitol. 2016;209:46–57. doi: 10.1016/j.molbiopara.2016.02.007. [DOI] [PubMed] [Google Scholar]
  • 6.Sandor S., Zhang Y., Xu J. Fungal mitochondrial genomes and genetic polymorphisms. Appl. Microbiol. Biotechnol. 2018;102:9433–9448. doi: 10.1007/s00253-018-9350-5. [DOI] [PubMed] [Google Scholar]
  • 7.Kozik A., Rowan B.A., Lavelle D., Berke L., Schranz M.E., Michelmore R.W., Christensen A.C. The alternative reality of plant mitochondrial DNA: One ring does not rule them all. PLoS Genet. 2019;15:e1008373. doi: 10.1371/journal.pgen.1008373. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Lunt D.H., Whipple L.E., Hyman B.C. Mitochondrial DNA variable number tandem repeats (VNTRs): Utility and problems in molecular ecology. Mol. Ecol. 1998;7:1441–1455. doi: 10.1046/j.1365-294x.1998.00495.x. [DOI] [PubMed] [Google Scholar]
  • 9.Boyce T.M., Zwick M.E., Aquadro C.F. Mitochondrial DNA in the bark weevils: Size, structure and heteroplasmy. Genetics. 1989;123:825–836. doi: 10.1093/genetics/123.4.825. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Okimoto R., Chamberlin H.M., Macfarlane J.L., Wolstenholme D.R. Repeated sequence sets in mitochondrial DNA molecules of root knot nematodes (Meloidogyne): Nucleotide sequences, genome location and potential for host-race identification. Nucleic Acids Res. 1991;19:1619–1626. doi: 10.1093/nar/19.7.1619. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Després L., Imbert-Establet D., Monnerot M. Molecular characterization of mitochondrial DNA provides evidence for the recent introduction of Schistosoma mansoni into America. Mol. Biochem. Parasitol. 1993;60:221–229. doi: 10.1016/0166-6851(93)90133-I. [DOI] [PubMed] [Google Scholar]
  • 12.Tørresen O.K., Star B., Mier P., Andrade-Navarro M.A., Bateman A., Jarnot P., Gruca A., Grynberg M., Kajava A.V., Promponas V.J., et al. Tandem repeats lead to sequence assembly errors and impose multi-level challenges for genome and protein databases. Nucleic Acids Res. 2019;47:10994–11006. doi: 10.1093/nar/gkz841. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Monnens M., Thijs S., Briscoe A.G., Clark M., Frost E.J., Littlewood D.T.J., Sewell M., Smeets K., Artois T., Vanhove M.P.M. The first mitochondrial genomes of endosymbiotic rhabdocoels illustrate evolutionary relaxation of atp8 and genome plasticity in flatworms. Int. J. Biol. Macromol. 2020;162:454–469. doi: 10.1016/j.ijbiomac.2020.06.025. [DOI] [PubMed] [Google Scholar]
  • 14.Hu M., Jex A.R., Campbell B.E., Gasser R.B. Long PCR amplification of the entire mitochondrial genome from individual helminths for direct sequencing. Nat. Protoc. 2007;2:2339–2344. doi: 10.1038/nprot.2007.358. [DOI] [PubMed] [Google Scholar]
  • 15.Hommelsheim C.M., Frantzeskakis L., Huang M., Ülker B. PCR amplification of repetitive DNA: A limitation to genome editing technologies and many other applications. Sci. Rep. 2014;4:5052. doi: 10.1038/srep05052. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Van Dijk E.L., Jaszczyszyn Y., Naquin D., Thermes C. The third revolution in sequencing technology. Trends Genet. 2018;34:666–681. doi: 10.1016/j.tig.2018.05.008. [DOI] [PubMed] [Google Scholar]
  • 17.Kono N., Arakawa K. Nanopore sequencing: Review of potential applications in functional genomics. Dev. Growth Differ. 2019;61:316–326. doi: 10.1111/dgd.12608. [DOI] [PubMed] [Google Scholar]
  • 18.Oey H., Zakrzewski M., Narain K., Devi K.R., Agatsuma T., Nawaratna S., Gobert G.N., Jones M.K., Ragan M.A., McManus D.P., et al. Whole-genome sequence of the oriental lung fluke Paragonimus westermani. GigaScience. 2019;8:giy146. doi: 10.1093/gigascience/giy146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Oey H., Zakrzewski M., Gravermann K., Young N.D., Korhonen P.K., Gobert G.N., Nawaratna S., Hasan S., Martínez D.M., You H., et al. Whole-genome sequence of the bovine blood fluke Schistosoma bovis supports interspecific hybridization with S. haematobium. PLoS Pathog. 2019;15:e1007513. doi: 10.1371/journal.ppat.1007513. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Kinkar L., Korhonen P.K., Cai H., Gauci C.G., Lightowlers M.W., Saarma U., Jenkins D.J., Li J., Li J., Young N.D., et al. Long-read sequencing reveals a 4.4 kb tandem repeat region in the mitogenome of Echinococcus granulosus (sensu stricto) genotype G1. Parasit. Vectors. 2019;12:238. doi: 10.1186/s13071-019-3492-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Kinkar L., Young N.D., Sohn W.-M., Stroehlein A.J., Korhonen P.K., Gasser R.B. First record of a tandem-repeat region within the mitochondrial genome of Clonorchis sinensis using a long-read sequencing approach. PLoS Negl. Trop. Dis. 2020;14:e0008552. doi: 10.1371/journal.pntd.0008552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Littlewood D.T.J., Lockyer A.E., Webster B.L., Johnston D.A., Le T.H. The complete mitochondrial genomes of Schistosoma haematobium and Schistosoma spindale and the evolutionary history of mitochondrial genome changes among parasitic flatworms. Mol. Phylogenet. Evol. 2006;39:452–467. doi: 10.1016/j.ympev.2005.12.012. [DOI] [PubMed] [Google Scholar]
  • 23.Hemmi K., Kakehashi R., Kambayashi C., Du Preez L., Minter L., Furuno N., Kurabayashi A. Exceptional enlargement of the mitochondrial genome results from distinct causes in different rain frogs (Anura: Brevicipitidae: Breviceps) Int. J. Genom. 2020;2020:e6540343. doi: 10.1155/2020/6540343. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Diaz F., Bayona-Bafaluy M.P., Rana M., Mora M., Hao H., Moraes C.T. Human mitochondrial DNA with large deletions repopulates organelles faster than full-length genomes under relaxed copy number control. Nucleic Acids Res. 2002;30:4626–4633. doi: 10.1093/nar/gkf602. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Shao R., Barker S.C., Mitani H., Aoki Y., Fukunaga M. Evolution of duplicate control regions in the mitochondrial genomes of metazoa: A case study with Australasian Ixodes ticks. Mol. Biol. Evol. 2005;22:620–629. doi: 10.1093/molbev/msi047. [DOI] [PubMed] [Google Scholar]
  • 26.Havird J.C., Forsythe E.S., Williams A.M., Werren J.H., Dowling D.K., Sloan D.B. Selfish mitonuclear conflict. Curr. Biol. 2019;29:R496–R511. doi: 10.1016/j.cub.2019.03.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Van den Ameele J., Li A.Y.Z., Ma H., Chinnery P.F. Mitochondrial heteroplasmy beyond the oocyte bottleneck. Semin. Cell Dev. Biol. 2020;97:156–166. doi: 10.1016/j.semcdb.2019.10.001. [DOI] [PubMed] [Google Scholar]
  • 28.Lee W.-J., Conroy J., Howell W.H., Kocher T.D. Structure and evolution of teleost mitochondrial control regions. J. Mol. Evol. 1995;41:54–66. doi: 10.1007/BF00174041. [DOI] [PubMed] [Google Scholar]
  • 29.Zhang D.-X., Hewitt G.M. Insect mitochondrial control region: A review of its structure, evolution and usefulness in evolutionary studies. Biochem. Syst. Ecol. 1997;25:99–120. doi: 10.1016/S0305-1978(96)00042-7. [DOI] [Google Scholar]
  • 30.Boore J.L. Animal mitochondrial genomes. Nucleic Acids Res. 1999;27:1767–1780. doi: 10.1093/nar/27.8.1767. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Ruokonen M., Kvist L. Structure and evolution of the avian mitochondrial control region. Mol. Phylogenet. Evol. 2002;23:422–432. doi: 10.1016/S1055-7903(02)00021-0. [DOI] [PubMed] [Google Scholar]
  • 32.Pereira F., Soares P., Carneiro J., Pereira L., Richards M.B., Samuels D.C., Amorim A. Evidence for variable selective pressures at a large secondary structure of the human mitochondrial DNA control region. Mol. Biol. Evol. 2008;25:2759–2770. doi: 10.1093/molbev/msn225. [DOI] [PubMed] [Google Scholar]
  • 33.Falkenberg M. Mitochondrial DNA replication in mammalian cells: Overview of the pathway. Essays Biochem. 2018;62:287–296. doi: 10.1042/EBC20170100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Casane D., Dennebouy N., de Rochambeau H., Mounolou J.C., Monnerot M. Nonneutral evolution of tandem repeats in the mitochondrial DNA control region of lagomorphs. Mol. Biol. Evol. 1997;14:779–789. doi: 10.1093/oxfordjournals.molbev.a025818. [DOI] [PubMed] [Google Scholar]
  • 35.Jannotti-Passos L.K., Souza C.P., Parra J.C., Simpson A.J.G. Biparental mitochondrial DNA inheritance in the parasitic trematode Schistosoma mansoni. J. Parasitol. 2001;87:79–82. doi: 10.1645/0022-3395(2001)087[0079:BMDIIT]2.0.CO;2. [DOI] [PubMed] [Google Scholar]
  • 36.Guerra D., Plazzi F., Stewart D.T., Bogan A.E., Hoeh W.R., Breton S. Evolution of sex-dependent mtDNA transmission in freshwater mussels (Bivalvia: Unionida) Sci. Rep. 2017;7:1551. doi: 10.1038/s41598-017-01708-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Levinson G., Gutman G.A. Slipped-strand mispairing: A major mechanism for DNA sequence evolution. Mol. Biol. Evol. 1987;4:203–221. doi: 10.1093/oxfordjournals.molbev.a040442. [DOI] [PubMed] [Google Scholar]
  • 38.Lunt D.H., Hyman B.C. Animal mitochondrial DNA recombination. Nature. 1997;387:247. doi: 10.1038/387247a0. [DOI] [PubMed] [Google Scholar]
  • 39.Boore J.L., Brown W.M. Big trees from little genomes: Mitochondrial gene order as a phylogenetic tool. Curr. Opin. Genet. Dev. 1998;8:668–674. doi: 10.1016/S0959-437X(98)80035-X. [DOI] [PubMed] [Google Scholar]
  • 40.Pâques F., Leung W.-Y., Haber J.E. Expansions and contractions in a tandem repeat induced by double-strand break repair. Mol. Cell. Biol. 1998;18:2045–2054. doi: 10.1128/MCB.18.4.2045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Rokas A., Ladoukakis E., Zouros E. Animal mitochondrial DNA recombination revisited. Trends Ecol. Evol. 2003;18:411–417. doi: 10.1016/S0169-5347(03)00125-3. [DOI] [Google Scholar]
  • 42.Lewis S.C., Joers P., Willcox S., Griffith J.D., Jacobs H.T., Hyman B.C. A rolling circle replication mechanism produces multimeric lariats of mitochondrial DNA in Caenorhabditis elegans. PLoS Genet. 2015;11:e1004985. doi: 10.1371/journal.pgen.1004985. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Després L., Imbert-Establet D., Combes C., Bonhomme F., Monnerot M. Isolation and polymorphism in mitochondrial DNA from Schistosoma mansoni. Mol. Biochem. Parasitol. 1991;47:139–141. doi: 10.1016/0166-6851(91)90156-Z. [DOI] [PubMed] [Google Scholar]
  • 44.Le T.H., Humair P.F., Blair D., Agatsuma T., Littlewood D.T., McManus D.P. Mitochondrial gene content, arrangement and composition compared in African and Asian schistosomes. Mol. Biochem. Parasitol. 2001;117:61–71. doi: 10.1016/S0166-6851(01)00330-9. [DOI] [PubMed] [Google Scholar]
  • 45.Protasio A.V., Tsai I.J., Babbage A., Nichol S., Hunt M., Aslett M.A., Silva N.D., Velarde G.S., Anderson T.J.C., Clark R.C., et al. A systematically improved high quality genome and transcriptome of the human blood fluke Schistosoma mansoni. PLoS Negl. Trop. Dis. 2012;6:e1455. doi: 10.1371/journal.pntd.0001455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Lin Z.J., Wang X., Wang J., Tan Y., Tang X., Werren J.H., Zhang D., Wang X. Comparative analysis reveals the expansion of mitochondrial DNA control region containing unusually high G-C tandem repeat arrays in Nasonia vitripennis. Int. J. Biol. Macromol. 2021;166:1246–1257. doi: 10.1016/j.ijbiomac.2020.11.007. [DOI] [PubMed] [Google Scholar]
  • 47.Omote K., Nishida C., Dick M.H., Masuda R. Limited phylogenetic distribution of a long tandem-repeat cluster in the mitochondrial control region in Bubo (Aves, Strigidae) and cluster variation in Blakiston’s fish owl (Bubo blakistoni) Mol. Phylogenet. Evol. 2013;66:889–897. doi: 10.1016/j.ympev.2012.11.015. [DOI] [PubMed] [Google Scholar]
  • 48.Zhang P., Liang D., Mao R.-L., Hillis D.M., Wake D.B., Cannatella D.C. Efficient sequencing of Anuran mtDNAs and a mitogenomic exploration of the phylogeny and evolution of frogs. Mol. Biol. Evol. 2013;30:1899–1915. doi: 10.1093/molbev/mst091. [DOI] [PubMed] [Google Scholar]
  • 49.Xia Y., Zheng Y., Miura I., Wong P.B., Murphy R.W., Zeng X. The evolution of mitochondrial genomes in modern frogs (Neobatrachia): Nonadaptive evolution of mitochondrial genome reorganization. BMC Genom. 2014;15:691. doi: 10.1186/1471-2164-15-691. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Solà E., Álvarez-Presas M., Frías-López C., Littlewood D.T.J., Rozas J., Riutort M. Evolutionary analysis of mitogenomes from parasitic and free-living flatworms. PLoS ONE. 2015;10:e0120081. doi: 10.1371/journal.pone.0120081. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Chen Z.-T., Yu B., Du Y.-Z. The nearly complete mitochondrial genome of a snout weevil, Eucryptorrhynchus brandti (Coleoptera: Curculionidae) Mitochondrial DNA A DNA Map. Seq. Anal. 2016;27:2736–2737. doi: 10.3109/19401736.2015.1046171. [DOI] [PubMed] [Google Scholar]
  • 52.Prada C.F., Boore J.L. Gene annotation errors are common in the mammalian mitochondrial genomes database. BMC Genom. 2019;20:73. doi: 10.1186/s12864-019-5447-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Song N., Li X., Yin X., Li X., Yin S., Yang M. The mitochondrial genome of Apion squamigerum (Coleoptera, Curculionoidea, Brentidae) and the phylogenetic implications. PeerJ. 2020;8:e8386. doi: 10.7717/peerj.8386. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Hug L.A., Baker B.J., Anantharaman K., Brown C.T., Probst A.J., Castelle C.J., Butterfield C.N., Hernsdorf A.W., Amano Y., Ise K., et al. A new view of the tree of life. Nat. Microbiol. 2016;1:1–6. doi: 10.1038/nmicrobiol.2016.48. [DOI] [PubMed] [Google Scholar]
  • 55.Young N.D., Jex A.R., Li B., Liu S., Yang L., Xiong Z., Li Y., Cantacessi C., Hall R.S., Xu X., et al. Whole-genome sequence of Schistosoma haematobium. Nat. Genet. 2012;44:221–225. doi: 10.1038/ng.1065. [DOI] [PubMed] [Google Scholar]
  • 56.Stroehlein A.J., Korhonen P.K., Chong T.M., Lim Y.L., Chan K.G., Webster B., Rollinson D., Brindley P.J., Gasser R.B., Young N.D. High-quality Schistosoma haematobium genome achieved by single-molecule and long-range sequencing. GigaScience. 2019;8:giz108. doi: 10.1093/gigascience/giz108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Lewis F.A., Liang Y.-S., Raghavan N., Knight M. The NIH-NIAID schistosomiasis resource center. PLoS Negl. Trop. Dis. 2008;2:e267. doi: 10.1371/journal.pntd.0000267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Cock P.J.A., Fields C.J., Goto N., Heuer M.L., Rice P.M. The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants. Nucleic Acids Res. 2010;38:1767–1771. doi: 10.1093/nar/gkp1137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  • 60.Chin C.-S., Alexander D.H., Marks P., Klammer A.A., Drake J., Heiner C., Clum A., Copeland A., Huddleston J., Eichler E.E., et al. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods. 2013;10:563–569. doi: 10.1038/nmeth.2474. [DOI] [PubMed] [Google Scholar]
  • 61.Kurtz S., Phillippy A., Delcher A.L., Smoot M., Shumway M., Antonescu C., Salzberg S.L. Versatile and open software for comparing large genomes. Genome Biol. 2004;5:R12. doi: 10.1186/gb-2004-5-2-r12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Walker B.J., Abeel T., Shea T., Priest M., Abouelliel A., Sakthikumar S., Cuomo C.A., Zeng Q., Wortman J., Young S.K., et al. Pilon: An integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE. 2014;9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Li H. Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34:3094–3100. doi: 10.1093/bioinformatics/bty191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.R Core Team . R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing; Vienna, Austria: 2018. [(accessed on 23 September 2020)]. Available online: https://www.R-project.org. [Google Scholar]
  • 66.Krzywinski M., Schein J., Birol I., Connors J., Gascoyne R., Horsman D., Jones S.J., Marra M.A. Circos: An information aesthetic for comparative genomics. Genome Res. 2009;19:1639–1645. doi: 10.1101/gr.092759.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Kearse M., Moir R., Wilson A., Stones-Havas S., Cheung M., Sturrock S., Buxton S., Cooper A., Markowitz S., Duran C., et al. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012;28:1647–1649. doi: 10.1093/bioinformatics/bts199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Telford M.J., Herniou E.A., Russell R.B., Littlewood D.T.J. Changes in mitochondrial genetic codes as phylogenetic characters: Two examples from the flatworms. Proc. Natl. Acad. Sci. USA. 2000;97:11359–11364. doi: 10.1073/pnas.97.21.11359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Gruber A.R., Lorenz R., Bernhart S.H., Neuböck R., Hofacker I.L. The Vienna RNA websuite. Nucleic Acids Res. 2008;36:W70–W74. doi: 10.1093/nar/gkn188. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Kerpedjiev P., Hammer S., Hofacker I.L. Forna (force-directed RNA): Simple and effective online RNA secondary structure diagrams. Bioinformatics. 2015;31:3377–3379. doi: 10.1093/bioinformatics/btv372. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Suzuki R., Shimodaira H. Pvclust: An R package for assessing the uncertainty in hierarchical clustering. Bioinformatics. 2006;22:1540–1542. doi: 10.1093/bioinformatics/btl117. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

Complete mitochondrial genome sequences have been deposited in the GenBank database under the accession nos. MW067222MW067227; raw data are available in the Sequence Read Archive (SRA) under the accession no. PRJNA78265.


Articles from International Journal of Molecular Sciences are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES