Abstract
Long-read sequencing (LRS) has transformed life science research by introducing third-generation sequencing (TGS) platforms applicable across various research fields, including environmental sciences. In the past decade, LRS platforms have been utilized to extensively study algal systems by improving genomic approaches such as metabarcoding, chromosome-level genome and pangenome assemblies, as well as providing new insights into algae-associated microbiomes and host–symbiont interactions. This review aims to discuss recent advancements in LRS in algal research. To achieve this aim, a systematic review was conducted according to the PRISMA 2020 guidelines and across three electronic databases (Web of Science, Scopus, and Google Scholar), with additional citation searching for relevant studies in four key algal research areas: metabarcoding, genomics, pangenomics, and host–symbionts interactions. Following the inclusion and exclusion criteria, only 51 studies were selected for this review. Throughout the review, we summarize the challenges of short-read sequencing (SRS) and discuss how LRS platforms address these challenges in algal studies. Furthermore, we discuss the future of LRS and explore how artificial intelligence (AI) can advance research on algal biology and ecology.
Keywords: long-read sequencing, nanopore sequencing, metabarcoding, 16S/18S rRNA gene, pangenome, algal host–bacterial symbiosis, algal microbiome, ONT, PacBio
1. Introduction
During the past decade, algae have been studied primarily using “short-read” sequencing Illumina-based platforms relying on platform-specific library preparation protocols. However, due to short-read length (i.e., app. 100 to 300 bp sequences), Illumina-based platforms currently face challenges in PCR bias, high levels of sequence biodiversity, reduced coverage of GC-rich regions, and difficulties in resolving highly repetitive regions, paralogous regions of the genomes, structural variations, index hopping, and large segmental duplications [1,2,3,4,5,6]. These challenges combined can now be addressed with the advent of long-read sequencing (LRS) and third-generation sequencing (TGS) tools.
TGS technology refers to single-molecule sequencing technology which can accurately sequence long strands of nucleic acid without an intermediary and without previous retro transcription or amplification [7]. Two popular LRS platforms, Oxford Nanopore Technology (ONT) and Pacific Biosciences (PacBio), provide median length of reads from 5 to 20 kbp and throughputs from 15 to 50 Gbp [8], covering full-length 16S-, 18S rRNA genes and ITS (internal transcribed spacer) region for prokaryotic and eukaryotic species [9,10] (Figure 1). The length of LRS depends on the specific technology and instrument platform used. The longer reads offer several advantages over short-read sequencing (SRS) methods, resulting in fewer sequencing fragments with wider coverage, accurately, assembling genomes and minimizing errors in genome assembling, particularly in highly repetitive regions [11]. Larger variations in the sequence are difficult to detect with short reads; second-generation sequencing (SGS) is well suited for identifying single-nucleotide variations (SNVs) and small insertions and deletions (indels), but not larger variations in the sequences [12,13].
Figure 1.
The comparison of second-(Illumina) and third-generation (PacBio and ONT) sequencing workflows: (1) Environmental sample collection; (2) DNA extraction and purification; (3) Library preparation (Illumina and ONT use adapter-ligated libraries, while PacBio uses hairpin-ligated Single Molecule Real-Time (SMRTbell) library); (4) Sequencing methods (Illumina uses sequencing by synthesis with bridge clusters and addition of fluorescently labeled nucleotides; PacBio uses single-molecule sequencing on a Zero Mode Waveguide (ZMW) flow cell; ONT uses nanopore channels for single-molecule sequencing); (5) Raw reads generation and basecalling: (Illumina generates short reads (~100–300 bp) based on fluorescence; PacBio and ONT generate long reads (~5–20 kbp), with PacBio using fluorescence and ONT measuring current changes as a readout). The hybrid approach combines both methods, producing short and long reads where short reads polish long reads from mismatches/errors in the DNA sequence for high-quality genome assembly.
The criticisms towards LRS were due to its early, relatively low accuracy, which has significantly improved with advancements in different aspects of nanopore sequencing, including nanopore sensors and associated chemicals, speed, and the ability to handle larger sets of reads [14]. The error rate in ONT is related to its capability to control the speed of DNA molecules passing through the nanopore, whereas the PacBio error rate is affected by random errors [15,16]. The integration of phi29 DNA polymerase and introduction of new flow cells reduced the error rate in ONT [17]. Furthermore, PacBio implemented circular consensus sequencing, and PacBio’s Revio iteration has reached an accuracy level of 99.9%, closely comparable with SRS [18,19]. The notable drawback of LRS when compared to SRS is its relatively higher sequencing cost, which continues to be more expensive despite all advancements in technologies. The relatively low cost and portability of sequencing module from ONT (MinION) [20] (though not consumables) made use of the technology possible by individual laboratories and not by core facilities. It was particularly important for environmental scientists who traditionally rely on smaller grants. There is an increased interest in applying LRS in studying algal communities. Research groups have reported using these platforms for algal taxonomic profiling [21,22], characterizing species of harmful algal blooms (HABs) [23,24,25,26], analyzing algal-associated microbiomes [27,28,29,30], assembling algal genomes at the chromosome scale [31], studying algal pangenomes [32,33,34], and investigating algal host–symbiont interactions [35,36,37].
Currently, however, broad, comprehensive reviews on the use of SGS in algal systematics are available [38,39,40] or studies on certain aspects of LRS, such as metagenomics [7] or algal microbiomes [28]. To systematize current advances in LRS in studies of algal ecosystems, we focused on the following four critical areas of algal research: algal metabarcoding, algal genomics, pangenomics, and algal host–bacterial symbiont studies. We also investigated the current performance of SGS compared to traditional visual methods, such as microscopy, and discussed challenges and future directions for integrating LRS data into multi-omics studies.
2. Materials and Methods
This systematic review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [41]. The focus of the review was long-read sequencing in algal systems, not in clinical settings; thus, it was not registered.
2.1. Search Strategy
For this review, we focused on studies that examined the use of common LRS platforms (ONT, PacBio) in the following areas: algal metabarcoding, algal genomics, algal pangenome, and algae-host bacterial symbionts. To identify relevant studies, we conducted an initial search from 8 to 9 September 2025, across several electronic databases: Web of Science (WoS), Scopus, and Google Scholar. The following search terms were used to cover all relevant studies: algae (“algae”, “algal”), metabarcoding (“18S”, “16S”), genomics (“long read genome sequencing”, “whole genome”, “whole genome sequencing”, “genome”), pangenome (“pangenome”, “pan-genome”), host–symbiont (“symbiont”), and long-read sequencing (“nanopore sequencing”, “long read nanopore sequencing”, “PacBio”, “ONT”, “long read sequencing”, “next generation sequencing”, “long read”). The general search query we employed across all databases was algae AND area (metabarcoding OR genomics OR pangenome OR host-symbiont) AND long-read sequencing. The time limit for publications was set to 2015–2025. To ensure that we included articles that may have been missed in the initial search, we conducted an additional manual search. Only a select few studies comparing SGS with traditional methods were identified through citation searching, and these studies were not included in a systematic review because they were outside the scope of the research topic.
2.2. Inclusion and Exclusion Criteria
Following the removal of duplicates, articles were included for further screening if they met the following inclusion criteria: (1) written in English; (2) original research articles; (3) focused on algae-based system; (4) employed long-read sequencing platforms to study algal metabarcoding, genomics, pangenome, or algae host–bacterial symbionts; and (5) provided sufficient methodology reporting. Articles were excluded if they were not written in English, were not published as research articles (e.g., reviews, manuscripts, book chapters, conference papers, etc.), or were not relevant to the research topic and algae-based study system.
2.3. Data Extraction
We collected the following data for each publication: the first author’s name, the year of study, the algae system, DNA extraction method, the library preparation protocol, the sequencing platforms used, the read length (paired-end or single-end), and the sequencing depth/coverage per sample. Additionally, we included data on genome assembly and polishing tools for genomics studies and pangenome pipeline tools for pangenome studies.
2.4. Study Selection
Our systematic review focused on examining long-read sequencing in algal metabarcoding, genomics, pangenomics, and host–symbiont studies. The comprehensive search strategy allowed us to retrieve 679 articles from three databases (WoS, Scopus, and Google Scholar), and an additional manual search resulted in 47 studies (Figure 2). After removing duplicates, 593 articles were screened based on title, abstract, and keywords. At this stage, the articles were removed based on language, article type, and relevance to the four key research areas. This screening procedure resulted in the removal of 477 articles. During the inclusion step, the remaining 116 articles were assessed based on the inclusion criteria outlined in the Materials and Methods Section 2, resulting in the removal of 65 articles for different reasons highlighted in Figure 2. After this step-by-step selection process, a total of 51 articles were included for systematic review analysis. Of these studies, 25 articles were algal metabarcoding studies, 14 articles were algal genomics studies, 6 articles were algal pangenome studies, and 6 articles were algal host–symbiont studies.
Figure 2.
PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) flowchart summarizing the steps of article identification, selection, screening and inclusion stages. This systematic review adapted the PRISMA guidelines. This review, however, did not include clinical data and, therefore, was not registered.
3. Results and Discussion
3.1. Long-Read Metabarcoding
Algal metagenomics (untargeted sequencing of DNA from all organisms associated with a given algal sample) [42] and 16S rRNA and 18S rRNA gene amplicon sequencing have become instrumental methods in determining algal diversity, monitoring HABs, and characterizing algal microbiomes [28,30,43,44]. As proof of concept, LRS’s potential to detect toxic algae was observed in mock community studies, where the MinION (ONT) platform demonstrated capabilities to determine dinoflagellate species belonging to Alexandrium genus [44]. There is a wide collection of studies available that confirmed LRS capacity for detection and monitoring potentially toxic cyanobacteria (e.g., Microcystis) [21]. Baharudin and co-authors [26] have utilized LRS for monitoring toxic algae belonging to dinophytes in shrimp aquaculture ponds, highlighting its utility as a warning system in aquaculture industries that are often susceptible to HAB outbreaks. Moreover, several studies report the use of LRS in assessing microbial communities associated with causative agents such as Microcystis [30] and toxic dinoflagellates (Alexandrium tamarense and Cochlodinium polykrikoides) [27], which underscores the potential of LRS in evaluating complex HAB dynamics.
Currently, SRS is commonly used in algal metabarcoding, targeting specific hypervariable regions in two marker genes, the 16S and 18S rRNA genes, for the determination of cyanobacteria and eukaryotic algae [5,45]. The 16S rRNA is a gold standard for bacterial community profiling due to ubiquity, the low frequency of horizontal gene transfer (HGT), taxonomic information from hypervariable regions (V1–V9), and the presence of universally conserved regions that serve as primer-binding sites for PCR. The Tara Oceans expeditions produced a very large metabarcoding dataset capturing the highest diversity and large number of sequences from undescribed species among picoplankton groups [46]. The utility of metabarcoding and metagenomics lies in their potential to provide quantitative measures of algal biodiversity, albeit without characterizing newly discovered species beyond the sequences obtained [39]; however, it is increased by a hybrid sequencing approach (SRS and LRS together) [47].
Metagenomics answers research questions in two different ways: (1) shotgun metagenomics and (2) amplicon sequencing. In amplicon sequencing, the application of the method, called “meta-profiling”, utilizes a single standardized genetic marker, amplified to identify multiple species simultaneously from complex samples [48]. The whole genome shotgun sequencing approach utilizes total DNA, which is randomly fragmented during the library preparation step, fragments of which are sequenced in parallel, and later are assembled into contiguous sequences based on overlapping regions. Moreover, such approaches in SRS platforms may target short-amplicon sequences not covering the whole gene and usually extend only to several hundred bps [49,50,51,52]. Thus, for the 18S ribosomal rRNA gene, regions commonly used to characterize the diversity of eukaryotic communities by metabarcoding are the V4 and V9 gene regions.
Additionally, short-reads have difficulty reliably resolving taxonomic assignments at the species level, with most short-read studies showing a consistent decrease in taxonomic resolution from genus to species levels [49,50,51,53,54]. The growing preference for full-length 16S rRNA and 18S rRNA gene sequencing using long-read platforms reflects a trend towards species- and potentially strain-level resolution in microbial and microalgal research [55,56,57]. With the advent of LRS platforms, it has become possible to address the issues. Currently, LRS platforms are capable of targeting full-length 16S and 18S rRNA, with the possibility to sequence the entire rRNA genes 16S-ITS-23S [58] and 18S-ITS1-5.8S-ITS2-28S (Figure 3, Table A1). However, species-level identification is not always achievable, and early versions of ONT platforms were hampered by reads of low accuracy.
Figure 3.
Overview of long-read sequencing in algal metabarcoding. The step-by-step procedure: (1) algal sample collection; (2) full-length rRNA gene sequencing (16S-ITS-23S and 18S-ITS-28S); (3) OTUs/ASVs clustering; (4) taxonomic profiling at the species level. (ITS—internal transcribed spacer, OTUs—operational taxonomic units, ASVs—amplicon sequence variants, S—sample).
While partial 16S rRNA gene regions are generally sufficient for identifying of bacterial heterotrophs, sequencing the full-length 16S rRNA gene provides improved taxonomic resolution [59]. However, the reliable identification of oxygenic phototrophs may require additional 18S rRNA nuclear (nuITS1, nuITS2), plastid (rbcL, tufA, and Cp23S), and mitochondrial genes (cytochrome c oxidase subunit (COI)) [60,61,62]. However, DNA barcoding of microalgae remains limited by the lack of universally applicable markers that could distinguish closely related species. Using multiple genes for barcoding appears to be more effective [60,61,62]. Different groups of microalgae are often targeted with different barcode markers for SRS. Thus, for diatoms the best functioning barcode markers were reported as follows: (i) the 3′ end of the large subunit of the rbcL (rbcL-3 P), (ii) a 540 bp fragment situated 417 bp downstream of the start codon of the rbcL (540 bp rbcL), (iii) the 5′ end of the mitochondrial cytochrome c oxidase I gene (COI-5 P), (iv) a partial sequence of the large ribosomal subunit (D1–D3 LSU, usually either D1–D2 or D2–D3), and (v) the V4 sub-region of the small ribosomal subunit (V4 SSU) [63]. Recent advances in LRS accuracy have prompted the use of parallel sequencing for mitochondrial DNA (mtDNA) and other amplicons in biodiversity research. Thus, Karin and co-authors [64] and Vossen & Buermans [65] employed a methodology involving two asymmetrically indexed and overlapping fragments of mtDNA.
Notably, some of these studies compared the performance of short- and long-read sequencing in resolving taxonomic levels, especially at the genus and species level [26,27,66,67,68,69,70]. These comparisons have shown that LRS provides better taxonomic resolution at the species level [26,27,66]. Thus, Latz and co-authors [66] compared PacBio vs. Illumina platforms (MiSeq and NovaSeq) and reported that long-read amplicons and ITS1 alone provided higher taxonomic resolution than V4 region primers.
We examined SRS in comparison with traditional methods that rely on morphology and microscopy. Specifically, the concordance between these two methods was determined by the percentage overlap of taxonomic assignments at the genus and species levels. Importantly, to the best of our knowledge, the current literature is rich in comparative studies focusing on SRS, whereas such comparisons remain limited for LRS. Nevertheless, this analysis is essential in demonstrating the current performance of short-read-based tools compared to traditional morphological approaches.
Our analysis revealed that at the genus level, short-read-based sequencing demonstrates better concordance with traditional methods, averaging between 11.5% and 58.2%, than at the species level, between 2.5% and 20.1% (Figure 4) [49,50,51,53,71,72,73,74]. One possible explanation for this high variation is that different studies targeted different marker genes (16S V3-V4, 18S V4/V9, 23S, rbcL).
Figure 4.
Overlap % of the identification of algal taxa by SGS versus traditional methods at the genus and species levels (SRS—SRS-based methods, M—microscopy- or morphology-based methods). For studies with multiple comparative analyses, a single representative analysis was included [49,50,51,52,53,54,71,72,73,74].
At the species level, the overlap percentage across all studies decreased significantly, especially in those studies that conducted both genus- and species-level comparisons (Figure 4).
LRS and use of full-length 16S rRNA gene provided a higher proportion of reads that were further assigned to the species level [66,75,76]. However, the LRS’s potential to determine finer taxonomic levels is currently hindered by gaps in reference databases and in bioinformatics pipelines [68,69].
3.2. Long-Read Algal Genomics
With the advent of next-generation sequencing (NGS) platforms, whole-genome sequencing (WGS) has become the standard procedure in genomics [77] and environmental [78,79,80,81,82,83,84,85] research. Short-sequencing platforms made WGS widely acceptable, enabling the sequencing and assembly of many novel genomes and the generation of accessible genome data [86,87,88]. Billions of short reads spanning the length up to 300 bp can reliably provide information on single-nucleotide variants, copy-number variants, and insertions or deletions (indels) [89,90,91]. However, SRS platforms currently face challenges in resolving highly repetitive regions, genome structural variations, segmental duplications, and GC-rich regions [2,3,4,92].
In recent decades, LRS platforms have made great advancements in resolving these issues. LRS can generate reads ranging from 10 kilobases to several megabases, enabling the production of continuous sequences directly from native DNA [2]. The major advantage of ONT and PacBio technologies in constructing complete genomes is the ability to reduce the number of contigs while increasing the N50 metric (Figure 5, Table S1).
Figure 5.
Overview of long-read sequencing in algal genomics. The step-by-step assembly is illustrated in the following steps: (1) DNA extraction from an algal sample; (2) generation of long-reads using ONT and/or PacBio; (3) assembly of contigs from long-reads (as illustrated 1-6 long reads are assembled according to their overlapping regions to produce contigs); (4) quality control assessment based on N50 and BUSCO % (C); (5) assembly of complete genomes with a high N50 metric and a low number of contigs. (N50—contig length representing 50% of the total (contiguity), BUSCO—Benchmarking Universal Single-Copy Orthologs (completeness), S—sample).
This capability of LRS is evident in several studies that report highly contiguous and complete genomes for lichenized algae, HAB-forming, and psychrophilic organisms, as well as algicidal and intertidal algal species [31,93,94,95,96,97,98]. Numerous studies have successfully generated near-complete genomes at the chromosome scale and even at the telomere-to-telomere (T2T) scale [81,94,97,99], as evidenced by their high Benchmarking Universal Single-Copy Orthologs (BUSCO) scores, which provide evolutionarily sound quantitation of completeness and redundancy in terms of expected gene content [100]. Although LRS, in comparison to SRS, generates highly contiguous genomes, its per-read sequencing accuracy and the quantification of LRS data remain challenging.
Constructing a complete algal genome allows us to understand evolutionary dynamics and investigate the putative genes behind environmental adaptations. Several studies have employed WGS to examine the evolutionary changes in genome structure and organization. These include revealing whole-genome duplication events in Diplosphaera chodatii Bialosuknia and retrotransposon proliferation in Chlamydomonas sp. ICE-L [96,101,102]. Some studies further extended the WGS to examine HGT events using a three-step approach: (1) identifying candidate HGT genes by querying databases for homologous sequences using DIAMOND or BLAST; (2) evaluating candidate genes with Alien Index analysis; (3) constructing a maximum likelihood tree. Following this approach, Jian and co-authors [23] identified HGT events in obligate algal heterotrophs. Notably, this study used the HGTphyloDetect tool to identify HGT events. This tool integrates the three aforementioned steps into a single program, enabling a comprehensive HGT analysis [101].
Furthermore, Zhang and co-authors [102] investigated HGT-acquired proteins from prokaryotic donors as potential drivers for the emergence of a psychrophilic lifestyle in Chlamydomonas sp. ICE-L. Additionally, Chen and co-authors [95], as well as Wang and co-authors [93], examined various environmental adaptations obtained through putative bacterial HGT genes [93,95]. Thus, the complete and contiguous genomes generated through LRS significantly contribute to our understanding of the evolution and adaptation of algal lineages.
3.3. Pangenome Long-Read Sequencing
Delineating species boundaries has been a long-standing challenge in algal studies [103,104]. Methods of species delimitation in algae relying on morphology or sexual compatibility have gradually been replaced by molecular sequence data [105]. It is particularly important for small and morphologically indistinguishable organisms [106,107], or algae with high morphological variability and plasticity [108,109,110,111], such as colonial Microcystis Lemmermann. Several statistical approaches have been developed, including (1) distance-based approaches to determine barcode gaps, defined as the difference between intra- and interspecific genetic divergence [112,113], and (2) tree-based methods [114,115]; both approaches are typically applied to single-locus datasets. On the other side are multi-locus delimitation methods, based on the (3) multispecies coalescent framework that include Bayes Factor delimitation [116] and SpedeSTEM [117].
Pangenome studies allow reclassification of species, thereby changing and improving traditional taxonomic criteria [118]. It also allows cataloging multiple genomes from a species and identifies shared genes across lineages that are responsible for essential metabolic processes, as well as accessory genes in specific lineages that contribute to adaptive traits [119]. LRS platforms have become instrumental in establishing pangenomes, delineating species, identifying core and accessory genomes, and tracing HGT events across a phylogenetic tree (Figure 6). One major conclusion from pangenome research is that no single strain can ever be considered the typical reference since not all genes will be present in all strains, and perhaps not in the same order [120].
Figure 6.
Overview of long-read sequencing in algal pangenome studies. (A) Collection of genome data for pangenome analysis based on de novo assembly (G1–G2) and publicly available data (illustrated in the map); (B) multiple alignment of all included genomes against known reference genome data; (C) pangenome construction, with the dotted areas illustrating the shared/core genome; (D) Venn diagram of core and accessory genome data; (E) phylogenic analysis for species delimitation and HGT analysis to determine evolutionary and environmental adaptations (S—species, HGT—horizontal gene transfer).
In pangenome studies, an efficient sampling strategy is crucial to maximizing biogeographic and genomic diversity [121]. To investigate this diversity, some studies utilized available complete and draft algal genomes from NCBI databases [32,122,123], while others sampled algae from their local environments and generated genome assemblies using a hybrid short- and long-read sequencing approach [33,34,124,125]. Irrespective of the strategy employed, the geographic sources of these algal samples are predominantly concentrated in the USA, Canada, and Europe, followed by East Asian countries (Table S2). This pattern aligns with the sampling bias observed in many pangenome studies, highlighting the challenges associated with the overrepresentation of some regions [126]. However, making affordable LRS platforms, such as MinION (ONT), accessible to underrepresented regions could help bridge the sampling gap, providing missing data for ongoing pangenome projects.
Several studies have utilized LRS to construct pangenomes of HAB-forming species, including Microcystis, Prymnesium parvum N.Carter, and Aureococcus anophagefferens Hargraves & Sieburth [32,33,34,123,125]. Studies highlighted the remarkable plasticity of the Microcystis pangenome, reporting that 92.5% of the genes are dispensable and revealed the presence of 16 putative genospecies [32,123]. Similarly, the haptophyte HAB-forming P. parvum pangenome has a significant dispensable genome and includes at least three cryptic morphospecies [33]. On the other hand, haptophyte A. anophagefferens has a highly conserved core genome [34,125].
Many reported accessory genes in the dispensable genome may have originated from HGT events [33,34,122,123,127]. Using a three-step approach, these studies identified HGT events from prokaryotic, viral, and eukaryotic donors responsible for toxin metabolism in Microcystis and environmental adaptations in various algae, including shifts in lipid metabolism, halogenated hydrocarbons, carbon acquisition, and stress response. However, there is ongoing debate about whether some of these genes can truly be classified as HGT-derived. Challenges include gene erosion, bacterial contamination, and differential loss of genes during eukaryotic HGT [127]. The use of LRS can facilitate an acquisition of long fragment information and sequence regions that are problematic with short-read sequencing approaches. Overall, eukaryotic HGT events represent a minor percentage in whole pangenome analyses (e.g., HGT-derived genes in Cyanidiales: 1% [127]; in cryptophytes, rhizarians, alveolates, stramenopiles, and haptophytes (CRASH taxa group): 0.16–1.44% [122]). Despite their rarity, these HGT events may provide insight into evolutionary adaptation and contribute to the genetic and functional diversity observed in algal species.
3.4. Algal Host–Bacterial Symbiont Long-Read Sequencing
Bacteria for a long time have been considered as contaminants in microalgal cultures, with efforts directed toward obtaining axenic algal monocultures. However, the recognition of mutualistic microalgae–bacteria interactions has opened opportunities to use bacterial exosymbionts as growth-promoting partners in microalgal cultivation [128,129]. These species-specific phytoplankton–bacteria interactions should often be considered as symbiotic [130,131].
The successful separation of the host genome from prokaryotic contamination and associated symbionts presents a significant challenge in genomic studies. Several strategies have been developed to address this issue in algae [35,93,132,133,134] and computational tools developed to distinguish hosts from associated microbes and symbionts [134,135]. While combining these strategies with SRS has led to notable improvements in host–symbiont studies, major challenges remain in resolving structural variations (inter- and intra-genomic repeats, segmental duplications) of host–symbiont genomes [128]. LRS offers potential solutions to these challenges by facilitating the disentangling of the genomes of the algal host from the symbionts and associated microbial species.
Algae can host bacterial endosymbionts or a diverse group of exosymbionts inhabiting the phycosphere, a diffuse area immediately surrounding an algal cell (Figure 7). The symbiotic partners may provide various services to their algal hosts, such as nutrient exchange, and help algal hosts survive in environments that experience light- and heat stress [37,136,137,138,139]. The genomic basis of such symbiotic partnerships can be effectively studied using LRS and refined host–symbiont separation strategies, including metagenome-assembled genomes (MAGs) binning, Hi-C analysis, and a reference-based approach (Figure 7). Several research groups have implemented a combination of short and long reads to generate complete symbiont genomes via MAGs binning using tools such as MyCC, MaxBin2, and MetaBAT2 [140,141,142].
Figure 7.
Overview of long-read sequencing in algal host–bacterial symbiont studies. (A) Two common symbionts reported in algae—endosymbionts and symbionts—inhabiting the diffuse area surrounding phytoplankton (single cell or colony) known as the phycosphere. (B) Host–symbiont separation strategies: (1) MAGs binning: Clustering assembled contigs into representative bins based on nucleotide composition and coverage. (2) Hi-C analysis: Visual segregation of genome regions to reveal the genome’s 3D structure. (3) Reference-based approach: Assembled contigs undergo a targeted search and are evaluated against the reference genome. (C) Generation of symbiont-free host genomes and complete MAGs/symbiont genomes, and HGT analysis to characterize evolutionary dynamics and environmental adaptations. (MAGs—metagenome-assembled genomes, HGT—horizontal gene transfer).
In some studies, researchers combined PacBio high-fidelity reads and ONT reads with Hi-C analysis to visually segregate the host genome from symbionts [137,139]. For a more targeted approach, researchers preferred a reference-based strategy for isolating the symbiont’s genome, utilizing a BLAST search of long contigs against a reference genome [136,137,143]. Notably, the study by Wang and co-authors [137] employed the newly developed reference-free tool Symbiont Screener for generating high-quality host reads free from symbionts and associated microbial partners using a trio-based screening model [35]. These approaches demonstrate the effectiveness of LRS in addressing challenges in host–symbiont studies, including separating host and symbiont genomes and managing high levels of contamination from symbionts and other prokaryotes.
LRS tools are also invaluable for uncovering HGT events and their contribution to host adaptation. Wang and co-authors [137] identified 286 HGTs in the intertidal alga Pyropia haitanensis, with 50% of these transfers originating from symbiotic bacterial partners in the phycosphere, specifically from the Pseudomonas, Actinobacteria, and Bacteroidetes taxa. Among them, Saccharothrix sp. was isolated and shown to be associated with major environmental adaptation to heat stress in P. haitanensis. Likewise, Xu and co-authors [139] found 100 high-confidence HGTs distributed across 13 chromosomes of the intertidal algae Bryopsis corticulans Setchell. Although the exact taxonomic classification of the bacterial symbiont was not reported, this bacterium was shown to have conferred a cryptochrome gene to B. corticulans, thereby providing environmental adaptation to light-fluctuating environments. Thus, the capacity of LRS tools to clarify symbiotic partnerships can be utilized to investigate HGT events between hosts and their associated symbionts, helping to elucidate the environmental adaptations of algal species. LRS platforms demonstrate a wide range of applications in algal systematics. The advent of LRS on the ONT and PacBio platforms has resolved many challenges associated with SGS methods in areas like algal metabarcoding, genomics, pangenomics, and host–symbiont studies. Moreover, these new platforms have created opportunities for transcriptomic analyses, enabling the sequencing of full-length RNA molecules.
Advances in SRS technology provided a powerful foundation for metagenomics and the generation of high-quality genomes. However, genome assembly from short reads ranging between 100 and 300 bp in size due to the presence of repeated sequences [144] may result in fragmented and incomplete genome assemblies [92,145]. Since adopting long-read sequencing ONT and PacBio platforms, the field of genomics has undergone a revolutionary transformation [7,146]. These technologies represent a third generation in sequencing, addressing many challenges in SGS, including PCR amplification bias, difficulties resolving repeat regions, structural variations, GC-rich regions, and achieving finer levels of taxonomic resolution [28]. A key limitation of LRS is the availability of high molecular weight DNA, ideally, with large fragment sizes.
Following the introduction of platforms by ONT and PacBio, the LRS market has grown significantly, now offering a diverse array of long-sequencing instruments. Presently, there are several approaches that utilize nanopore chemistry [147]. Notable alternatives include the following: (1) QitanTech (China) technology has recently developed QNome-3841 nanopore system [148,149,150]; (2) AxBio (China, USA) technology has launched in 2023 AxiLona AXP100 instrument, which allows full-length 16S rRNA gene sequencing, providing higher-resolution bacterial community analysis [151,152]; (3) Bionano (USA) technology uses optical image mapping based on fluorescently labeled polynucleotides passing through a nanopore [153]; (4) Quantapore (USA) have been around since 2009 and developed an optical Nanopore DNA sequencing platform [147]; (5) Cambridge Nucleomics (UK) is another company which uses nanopore chemistry for direct quantification of tandem repeats in native RNA [154]. In the future, these nanopore-based systems could diversify existing LRS tools, thus offering more opportunities for long-read sequencing tools for algal research and beyond.
The widespread application of artificial intelligence (AI) methods, such as machine learning (ML), is rapidly transforming the computational aspects of LRS, including basecalling [155,156,157], methylation detection [158,159,160], and predictive analytics [161,162]. In the field of metagenomics, AI is advancing the detection of microbial signatures from complex microbiomes, thereby facilitating environmental monitoring and the characterization of key bioindicator species [163,164]. Currently, there are studies reporting on the use of AI in algal systematics. Notably, some research describes the use of ML approaches in identifying key algal taxa [165]. Additionally, deep learning-based approaches, such as Tiara, are being utilized to differentiate between host genomes and symbionts [139,166]. Overall, integrating AI into LRS shows promise for improving taxonomic classification, generating high-quality assemblies, and facilitating comprehensive environmental monitoring.
Some limitations of LRS should be considered when evaluating these review findings. First, the review was focused on ONT and PacBio long-read sequencing technologies, leaving out new platforms developed in China (QitanTech, AxBio, Geneus Gseq-500), the USA (Bionano, Nabsys, Quantapore), and the UK (Cambridge Nucleomics), among others [147,167,168]. Moreover, several studies have already been published that use QNome-3841 (QitanTech) nanopore sequencing platform in algal systematics [169]. Secondly, there is a significant challenge to develop bioinformatic methodologies, algorithms, and pipelines for these emerging technologies, as well as new statistical methods for long-read data.
Last but not least, the quality and completeness of available databases limit the utility of LRS. This issue is particularly evident in algal metabarcoding projects, where misidentified algal species can become prevalent depending on the workflow and the quality of the database used [68,170,171]. Furthermore, existing algal genome resources are heavily biased toward a few selected algal lineages, with several algal groups either underrepresented or lacking reference genomes altogether [172,173]. These challenges underscore the need for well-curated algal databases to enhance the utility of long-read sequencing-based approaches.
4. Conclusions
Long-read sequencing offers a scalable, rapid, and effective approach, providing numerous opportunities for algal metabarcoding, genomics, pangenomics, and host–symbiont studies. By utilizing long-read sequencing in these critical areas of algal research, we can accurately profile key algal taxa at the species level, characterize the algal microbiome, delimit species boundaries, and investigate the complex interactions between hosts and their symbionts. Coupling the advantages of long-read sequencing with bioinformatics pipelines enables the exploration of HGT, adaptations to environmental changes, and the evolutionary dynamics of algal lineages. In the future, emerging tools and advances in AI are expected to significantly advance the capabilities of long-read sequencing across various research domains. The valuable insights gained from algal studies can also be applied to other areas of environmental and conservation research.
Acknowledgments
We acknowledge help and advice from Ayagoz Meirkhanova, Nazarbayev University.
Abbreviations
The following abbreviations are used in this manuscript:
| LRS | long-read sequencing |
| SRS | short-read sequencing |
| TGS | third-generation sequencing |
| SGS | second-generation sequencing |
| NGS | next-generation sequencing |
| WGS | whole-genome sequencing |
| ONT | Oxford Nanopore Technology |
| PacBio | Pacific Biosciences |
| HGT | horizontal gene transfer |
| SNV | single-nucleotide variation |
| HAB | harmful algal bloom |
| BUSCO | Benchmarking Universal Single-Copy Orthologs |
| T2T | telomere-to-telomere |
| MAG | metagenome-assembled genome |
| AI | artificial intelligence |
| ML | machine learning |
| ITS | internal transcribed spacer |
| coi | cytochrome c oxidase subunit 1 |
| CTAB | cetyltrimethylammonium bromide |
| PE | paired-end |
| SE | single-end |
| mtDNA | mitochondrial DNA |
| eDNA | environmental DNA |
| N/A | not available for this study |
Supplementary Materials
The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/ijms27052415/s1. Reference [174] is cited in the Supplementary Materials.
Appendix A
Table A1.
Summary of long-read algal metabarcoding studies and associated wet-lab techniques.
| Study | Sequencing Strategy (16S or 18S) | Reported Algae | DNA Extraction Method/Kit | Library Preparation (Long-Read vs. Short-Read, Primers) | Sequencing Platforms | Read Length/Paired-End or Single-End |
|---|---|---|---|---|---|---|
| Shin et al., 2018 [27] | 16S rRNA gene V3-V4 region, 16S full-length rRNA gene | Algal microbiomes: Alexandrium tamarense, Cochlodinium polykrikoides | QIAamp DNA microbiome kit (Qiagen) # | Short-read: 16S rRNA gene V3-V4 region * Long-read: 16S full-length rRNA gene, S-D-bact-0008-c-S20/S-D-bact-1391-a-A-17 primers |
Illumina MiSeq, MinION (ONT) | MiSeq: 2 × 250 bp PE MinION: 1242 bp SE |
| Zhu et al., 2018 [175] | 18S full-length rRNA gene | Ulvophyceae, Trebouxiophyceae, Chlorophyceae, Dinophyceae, Eustigmatophyceae | Higher plant DNA kit (Omega) # | Long-read: 18S full-length rRNA gene, NS1F/NS8R and NS1F/1650R primers | PacBio Sequel |
1650–1850 bp SE |
| Curren et al., 2019 [43] | 16S full-length rRNA gene | 40 genera, 46 species | Phenol:chloroform:IAA method | Long-read: 16S full-length rRNA gene * | MinION R9.4 (ONT) | 1502 bp SE |
| Hatfield et al., 2020 [44] | 18S-ITS-28S rRNA gene | Alexandrium genus | Power Biofilm DNA isolation Kit (Qiagen, Hilden, Germany) | Long-read: 18S-ITS1-5.8S-ITS2-28S rRNA gene, 18ScomF1/D2C primers | MinION R9.4 (ONT) | 2928 bp SE |
| Schafran et al., 2020 [171] | 16S full-length rRNA gene, 23S full-length rRNA gene | Anabaena/Dolichospermum spp. | E.Z.N.A. Plant DNA kit (Omega Bio-tek) # | Long-read: 16S full-length rRNA gene, 23S full-length rRNA gene, N/A | MinION R9 (ONT) | ~1185 bp SE (16S), 2469 bp SE (23S) |
| Mirasbekov et al., 2021 [176] | 16S full-length rRNA gene | Phylum level | PowerWater DNA Isolation Kit (Qiagen, USA) | Long-read: 16S full-length rRNA gene * | MinION R9.4 (ONT) | N/A |
| van der Loos et al., 2021 [28] | 16S full-length rRNA gene | Algal microbiomes: Ulva australis, Ulva lacinulata | DNA PowerSoil kit (Qiagen) # | Long-read: 16S full-length rRNA gene, 27F/1492R primers | MinION R9.4.1 (ONT) | 1000–2000 bp SE |
| Koeppel et al., 2022 [177] | eDNA sequencing | Microcystis spp. (non-target) | DNeasy PowerWater kit # | Long-read: eDNA | MinION (ONT) | N/A |
| Latz et al., 2022 [66] | 18S rRNA gene V4, V6–V8 regions, 18S-ITS-28S rRNA gene | Algal groups: Stramenopiles, Alveolata, Archaeplastida, Hacrobia | ZymoBIOMICS DNA Miniprep kit (Zymo Research Corp) #, NucleoSpin Soil Kit (Macherey u nd Nagel) # | Short-read: 18S rRNA gene V4, V6–V8 regions * Long-read: 18S-ITS1-5.8S-ITS2-28S rRNA gene, V4_Balzano_F/V4F with 21R, 2742R and 3143R primers |
Illumina MiSeq, Illumina NovaSeq 6000, PacBio Sequel |
MiSeq: 2 × 300 bp PE NovaSeq: 2 × 150 bp PE PacBio: 4283 bp SE |
| O’Neill et al., 2022 [178] | 18S full-length rRNA gene | 341 genera, 982 species | High Pure PCR Template Preparation kit (Roche) # | Long-read: 18S full-length rRNA gene * | MinION (ONT) | MinION: ~1800 bp SE |
| He et al., 2023 [67] | 18S rRNA gene V4 region, 18S full-length rRNA gene | 118 species (37 species HAB-forming) | MicroElute Genomic DNA Kit (Omega) # | Short-read: 18S rRNA gene V4 region, 528F/706R primers Long-read: 18S full-length rRNA gene, 28F/42R primers | Illumina MiSeq, PacBio |
MiSeq: 398 bp PacBio: 1789 bp SE |
| Hu et al., 2023 [179] | 18S full-length rRNA gene | Algal genera: Gonium, Pandorina, Volvulina, Platydorina, Colemanosphaera, Yamagishiella, Eudorina, Pleodorina | DNA extraction Kit (Tiangen) # | Long-read: 18S full-length rRNA gene, Euk-A/Euk-B primers | PacBio Sequel II | 1690–1825 bp SE |
| Gaonkar & Campbell, 2024 [68] | 18S rRNA gene V4, V8–V9 regions, 18S full-length rRNA gene | Algal groups: Dinoflagellates, Diatoms, Chlorophyta, Rhodophyta, Haptophyta, Cryptophytes | AllPrep DNA/RNA MiniKit (Qiagen, USA) | Long-read: 18S full-length rRNA gene, SSUF/ITS-1dr primers Short-read: 18S rRNA gene V4, V8–V9 regions, Reuk454FWD1/ReukREV3 primers (V4), V8f/1510r primers (V8–V9) | MinION Mk1C (ONT), Illumina MiSeq | MinION: 1829 bp SE MiSeq: 2x300 bp PE |
| Judd et al., 2024 [29] | N/A | Algal microbiome: Amphidinium carterae | CTAB protocol | N/A | MinION, GridION, or PromethION (ONT) | N/A |
| Liu et al., 2024 [180] | 18S full-length rRNA gene | Algal classes: Ulvophyceae, Chlorophyceae, Trebouxiophyceae | HP PlantDNA Kit (Omega Bio-Tek, Norcross, GR, USA) | Long-read: 18S full-length rRNA gene, Euk18SA/Euk18SB primers | PacBio Sequel | N/A |
| Meirkhanova et al., 2024 [181] | 16S full-length rRNA gene | Synthetic algal-bacterial community: Cyanobacteria, Chlorophyta, Cryptophyta groups | DNEasy Power Water Kit (Qiagen, Hilden, Germany) | Long-read: 16S full-length rRNA gene, 27F/1492R primers | MinION Mk1C (ONT) | N/A |
| Mordret et al., 2024 [182] | 18S-ITS-28S rRNA gene | 73 phytoplankton strains | MasterPure Complete DNA and RNA Purification Kit (Epicenter Biotechnologies, USA), DNeasy Plant Pro Kit (Qiagen, USA) | Long-read: 18S-ITS1-5.8S-ITS2-28S rRNA gene, SSU-F and 3NDF with 21R, D3Ca-R primers |
MinION R9.4.1 (ONT) | 4391 bp SE |
| Baharudin et al., 2025 [26] | 18S rRNA gene V7-V9 region, 18S-ITS-28S rRNA gene | Harmful dinophytes: Pfiesteria piscicida, P. shumwayae, Luciella masanensis, Gyrodinium jinhaense, Malayana penaeicida gen. et sp. nov. | Toyobo MagExtractor Plant Genome kit (Toyobo, Tokyo, Japan) | Short-read: 18S rRNA gene V7–V9 region, 18S-V7F/18S-V9R primers Long-read: 18S-ITS1-5.8S-ITS2-28S rRNA gene, 18ScomF1/D2C primers |
Illumina MiSeq 2000, MinION R10.4.1 (ONT) | MiSeq: 2 × 300 bp PE MinION: >3000 bp SE |
| Gong et al., 2025 [21] | 16S full-length rRNA gene | Anabaena spp., Aphanizomenon spp., Cylindrospermum spp., Dolichospermum spp., Microcoleus/Phormidium spp., Lyngbya/Microseira spp., Microcystis spp., Nostoc spp., Synechocystis spp., Planktothrix spp., Pseudanabaena spp. | DNeasy Blood & Tissue Kits (Qiagen, German-town, MD, USA), DNeasy PowerSoil Pro Kit (Qiagen) # | Long-read: 16S full-length rRNA gene, 27F/1492R primers | GridION R10 (ONT) | 800–2000 bp SE |
| Marter et al., 2025 [183] | 16S-ITS rRNA gene | Algal microbiome: Coleofasciculus chthonoplastes | DNeasy Blood and Tissue Kit (Qiagen, Hilden, Germany) | Long-read: 16S-ITS rRNA gene, 16S_27f/23S_130r primers | PacBio Sequel IIe | 1827–3044 bp SE |
| Meirkhanova et al., 2025 [30] | 16S full-length rRNA gene | Algal microbiomes: Microcystis spp., Cryptomonas spp. | DNEasy Power Water Kit (Qiagen, Hilden, Germany) | Long-read: 16S full-length rRNA gene, 27F/1492R primers | MinION Mk1C (ONT) | N/A |
| Mordret et al., 2025 [22] | 18S-ITS-28S rRNA gene | Prorocentrum nux | DNeasy Plant Pro Kit (Qiagen, USA) | Long-read: 18S-ITS1-5.8S-ITS2-28S rRNA gene, SSU-F and 3NDF with 21R, D3Ca-R primers | MinION R9.4.1 (ONT) | N/A |
| Punnarak et al., 2025 [70] | 18S rRNA gene V4 region and coi gene, 18S full-length rRNA gene | Algal classes: Bacillariophyceae, Dictyochophyceae, Dinophyceae, Haptophyceae, Pelagophyceae, Raphidophyceae | ZymoBIOMICS DNA Miniptep Kit (Zymo Research, CA, USA), E.Z.N.A. Soil DNA kit (Omega Bio-Tek, Norcross, GA, USA) | Short-read: 18S rRNA gene V4 region, TAReuk454FWD1/TAReukREV3 primers coi gene, mlCOIinfF/HCO2198 primers Long-read: 18S full-length rRNA gene, Euk-F/Euk-R primers | Illumina NovaSeq 6000, PacBio Sequel II | NovaSeq: 2 × 250 PE PacBio: ~1800 bp SE |
| Rousseau et al., 2025 [69] | 16S rRNA gene V3–V4, V4–V5 regions, 16S full-length rRNA gene, 18S rRNA gene ITS1-ITS2 region | Algal microbiome: Ascophyllum nodosum | CTAB protocol | Short-read: 16S rRNA gene V3–V4, V4–V5 regions, 18S rRNA gene ITS2 regions, S-B-bact-0341-b-S-17F/799F_rc primers (V3-V4), 515F/926R primers (V4-V5), ITS1/PCR1-ITS4 primers (ITS2), 5,8S-FUN/ITS4-FUN primers (ITS2) Long-read: 16S full-length rRNA gene, 18S rRNA gene ITS1-ITS2 region, 27F/1492R primers (16S), ITS9-mum/LR3-I (ITS1-ITS2) | Illumina MiSeq, Illumina NovaSeq 6000, MinION Mk1C (ONT) | MiSeq: 2 × 300 bp PE NovaSeq: 2 × 250 bp PE MinION: 1326 bp SE, 1413 bp SE |
| Yu et al., 2025 [184] | 16S full-length rRNA gene | Algal microbiome: Microcystis aeruginosa | CTAB protocol | Long-read: 16S full-length rRNA gene, N/A | PacBio | N/A |
* where available the primer names were written, the full primer sequences can be accessed in the article; # manufacturer city/country information is unavailable for the given DNA extraction kit.
Author Contributions
K.K., A.A., A.S. and N.S.B.—conceptualization, methodology; K.K., A.A. and N.S.B.—writing—original draft preparation; K.K., A.A., A.S. and N.S.B.—writing—review and editing; N.S.B.—supervision. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
No new experimental data were created or analyzed in this study. Raw data sharing is not applicable to this article.
Conflicts of Interest
The authors declare no conflicts of interest.
Funding Statement
The authors declare that financial support was received for the research, authorship, and/or publication of this article. This research was supported by the Ministry of High Education and Sciences, Kazakhstan #AP26104995 and FDCRGP grant #SSH2024005 from Nazarbayev University to N.S.B.
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
References
- 1.Alberdi A., Aizpurua O., Gilbert M.T.P., Bohmann K. Scrutinizing Key Steps for Reliable Metabarcoding of Environmental Samples. Methods Ecol. Evol. 2018;9:134–147. doi: 10.1111/2041-210X.12849. [DOI] [Google Scholar]
- 2.Pollard M.O., Gurdasani D., Mentzer A.J., Porter T., Sandhu M.S. Long Reads: Their Purpose and Place. Hum. Mol. Genet. 2018;27:R234–R241. doi: 10.1093/hmg/ddy177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Goldstein S., Beka L., Graf J., Klassen J.L. Evaluation of Strategies for the Assembly of Diverse Bacterial Genomes Using MinION Long-Read Sequencing. BMC Genom. 2019;20:23. doi: 10.1186/s12864-018-5381-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Browne P.D., Nielsen T.K., Kot W., Aggerholm A., Gilbert M.T.P., Puetz L., Rasmussen M., Zervas A., Hansen L.H. GC Bias Affects Genomic and Metagenomic Reconstructions, Underrepresenting GC-Poor Organisms. Gigascience. 2020;9:giaa008. doi: 10.1093/gigascience/giaa008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.van der Loos L.M., Nijland R. Biases in Bulk: DNA Metabarcoding of Marine Communities and the Methodology Involved. Mol. Ecol. 2021;30:3270–3288. doi: 10.1111/mec.15592. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Kircher M., Sawyer S., Meyer M. Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform. Nucleic Acids Res. 2012;40:e3. doi: 10.1093/nar/gkr771. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Mohammadi M.M., Bavi O. DNA Sequencing: An Overview of Solid-State and Biological Nanopore-Based Methods. Biophys. Rev. 2022;14:99–110. doi: 10.1007/s12551-021-00857-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Orellana L.H., Krüger K., Sidhu C., Amann R. Comparing Genomes Recovered from Time-Series Metagenomes Using Long- and Short-Read Sequencing Technologies. Microbiome. 2023;11:105. doi: 10.1186/s40168-023-01557-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Deamer D., Akeson M., Branton D. Three Decades of Nanopore Sequencing. Nat. Biotechnol. 2016;34:518–524. doi: 10.1038/nbt.3423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Santos A., van Aerle R., Barrientos L., Martinez-Urtaza J. Computational Methods for 16S Metabarcoding Studies Using Nanopore Sequencing Data. Comput. Struct. Biotechnol. J. 2020;18:296–305. doi: 10.1016/j.csbj.2020.01.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Rhoads A., Au K.F. PacBio Sequencing and Its Applications. Genom. Proteom. Bioinform. 2015;13:278–289. doi: 10.1016/j.gpb.2015.08.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Depienne C., Mandel J.-L. 30 Years of Repeat Expansion Disorders: What Have We Learned and What Are the Remaining Challenges? Am. J. Hum. Genet. 2021;108:764–785. doi: 10.1016/j.ajhg.2021.03.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Chaushevska M., Alapont-Celaya K., Schack A.K., Krych L., Navas M.C.G., Krithara A., Madjarov G. Get Ready for Short Tandem Repeats Analysis Using Long Reads-the Challenges and the State of the Art. Front. Genet. 2025;16:1610026. doi: 10.3389/fgene.2025.1610026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Sanderson N.D., Hopkins K.M.V., Colpus M., Parker M., Lipworth S., Crook D., Stoesser N. Evaluation of the Accuracy of Bacterial Genome Reconstruction with Oxford Nanopore R10.4.1 Long-Read-Only Sequencing. Microb. Genom. 2024;10:001246. doi: 10.1099/mgen.0.001246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Amarasinghe S.L., Su S., Dong X., Zappia L., Ritchie M.E., Gouil Q. Opportunities and Challenges in Long-Read Sequencing Data Analysis. Genome Biol. 2020;21:30. doi: 10.1186/s13059-020-1935-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Delahaye C., Nicolas J. Sequencing DNA with Nanopores: Troubles and Biases. PLoS ONE. 2021;16:e0257521. doi: 10.1371/journal.pone.0257521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.MacKenzie M., Argyropoulos C. An Introduction to Nanopore Sequencing: Past, Present, and Future Considerations. Micromachines. 2023;14:459. doi: 10.3390/mi14020459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Travers K.J., Chin C.-S., Rank D.R., Eid J.S., Turner S.W. A Flexible and Efficient Template Format for Circular Consensus Sequencing and SNP Detection. Nucleic Acids Res. 2010;38:e159. doi: 10.1093/nar/gkq543. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Damaraju N., Miller A.L., Miller D.E. Long-Read DNA and RNA Sequencing to Streamline Clinical Genetic Testing and Reduce Barriers to Comprehensive Genetic Testing. J. Appl. Lab. Med. 2024;9:138–150. doi: 10.1093/jalm/jfad107. [DOI] [PubMed] [Google Scholar]
- 20.Wasswa F.B., Kassaza K., Nielsen K., Bazira J. MinION Whole-Genome Sequencing in Resource-Limited Settings: Challenges and Opportunities. Curr. Clin. Microbiol. Rep. 2022;9:52–59. doi: 10.1007/s40588-022-00183-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Gong P., Antrim A.K., Cicerrella A.S., Chung S.H., Luo X., Barker N.D., Cooley E., Ji Q. Design and Validation of Universal and Taxon-Specific 16S rRNA qPCR Primers for Detection of Freshwater Harmful Algal Bloom-Forming Cyanobacteria. J. Microbiol. Methods. 2025;235:107146. doi: 10.1016/j.mimet.2025.107146. [DOI] [PubMed] [Google Scholar]
- 22.Mordret S., MacKinnon J., Breglia S.A., Slamovits C.H., Chénard C. An Update on the Morphology and Phylogeny of the Nanoplanktonic Dinoflagellate Prorocentrum nux. J. Eukaryot. Microbiol. 2025;72:e70019. doi: 10.1111/jeu.70019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Jian J., Wang Z., Chen C., Workman C.T., Fang X., Larsen T.O., Guo J., Sonnenschein E.C. Two High-Quality Prototheca zopfii Genomes Provide New Insights into Their Evolution as Obligate Algal Heterotrophs and Their Pathogenicity. Microbiol. Spectr. 2024;12:e04148-23. doi: 10.1128/spectrum.04148-23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kuhl H., Strassert J.F.H., Čertnerová D., Varga E., Kreuz E., Lamatsch D.K., Wuertz S., Köhler J., Monaghan M.T., Stöck M. The Haplotype-Resolved Prymnesium Parvum (Type B) Microalga Genome Reveals the Genetic Basis of Its Fish-Killing Toxins. Curr. Biol. 2024;34:3698–3706.e4. doi: 10.1016/j.cub.2024.06.033. [DOI] [PubMed] [Google Scholar]
- 25.Fallon T.R., Shende V.V., Wierzbicki I.H., Pendleton A.L., Watervoort N.F., Auber R.P., Gonzalez D.J., Wisecaver J.H., Moore B.S. Giant Polyketide Synthase Enzymes in the Biosynthesis of Giant Marine Polyether Toxins. Science. 2024;385:671–678. doi: 10.1126/science.ado3290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Baharudin S.N., Hii K.S., Azmi N.F.M., Kassim N.S., Abdullah N., Teng S.T., Liu M., Kuwata K., Iwataki M., Gu H., et al. High Molecular Diversity of Potentially Harmful Heterotrophic Dinophytes in Tropical Shrimp Aquaculture Ponds Reveals a New Dinophyte Malayana penaeicida Gen. et Sp. Nov. (Peridiniales, Dinophyceae) Harmful Algae. 2025;148:102906. doi: 10.1016/j.hal.2025.102906. [DOI] [PubMed] [Google Scholar]
- 27.Shin H., Lee E., Shin J., Ko S.-R., Oh H.-S., Ahn C.-Y., Oh H.-M., Cho B.-K., Cho S. Elucidation of the Bacterial Communities Associated with the Harmful Microalgae Alexandrium tamarense and Cochlodinium polykrikoides Using Nanopore Sequencing. Sci. Rep. 2018;8:5323. doi: 10.1038/s41598-018-23634-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.van der Loos L.M., D’hondt S., Willems A., De Clerck O. Characterizing Algal Microbiomes Using Long-Read Nanopore Sequencing. Algal Res. 2021;59:102456. doi: 10.1016/j.algal.2021.102456. [DOI] [Google Scholar]
- 29.Judd M., Wira J., Place A.R., Bachvaroff T. Long-Read Sequencing Unlocks New Insights into the Amphidinium Carterae Microbiome. Mar. Drugs. 2024;22:342. doi: 10.3390/md22080342. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Meirkhanova A., Zhumakhanova A., Len P., Schoenbach C., Levi E.E., Jeppesen E., Davidson T.A., Barteneva N.S. Heatwave-Induced Thermal Stratification Shaping Microbial-Algal Communities Under Different Climate Scenarios as Revealed by Long-Read Sequencing and Imaging Flow Cytometry. Toxins. 2025;17:370. doi: 10.3390/toxins17080370. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Raymond B.B., Guenzi-Tiberi P., Maréchal E., Quarmby L.M. Snow Alga Sanguina aurantia as Revealed through de Novo Genome Assembly and Annotation. G3 Genes Genomes Genet. 2024;14:jkae181. doi: 10.1093/g3journal/jkae181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Cai H., McLimans C.J., Beyer J.E., Krumholz L.R., Hambright K.D. Microcystis Pangenome Reveals Cryptic Diversity within and across Morphospecies. Sci. Adv. 2023;9:eadd3783. doi: 10.1126/sciadv.add3783. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Wisecaver J.H., Auber R.P., Pendleton A.L., Watervoort N.F., Fallon T.R., Riedling O.L., Manning S.R., Moore B.S., Driscoll W.W. Extreme Genome Diversity and Cryptic Speciation in a Harmful Algal-Bloom-Forming Eukaryote. Curr. Biol. 2023;33:2246–2259.e1–e8. doi: 10.1016/j.cub.2023.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Sibbald S.J., Lawton M., Maclean C., Roger A.J., Archibald J.M. Pangenome Biology and Evolution in Harmful Algal-Bloom-Forming Pelagophytes. Curr. Biol. 2025;35:4215–4228. doi: 10.1016/j.cub.2025.07.055. [DOI] [PubMed] [Google Scholar]
- 35.Xu M., Guo L., Qi Y., Shi C., Liu X., Chen J., Han J., Deng L., Liu X., Fan G. Symbiont-Screener: A Reference-Free Tool to Separate Host Sequences from Symbionts for Error-Prone Long Reads. Front. Mar. Sci. 2023;10:1087447. doi: 10.3389/fmars.2023.1087447. [DOI] [Google Scholar]
- 36.Leonard G., Vitonytė I., Savory F.R., Hansson E.M., Cameron D.D., Brockhurst M.A., Richards T.A. De novo Genome sequence Assembly of the Model Algal Endosymbiont Micractinium conductrix Derived from its Host Paramecium bursaria 186b. bioRxiv. 2025 doi: 10.64898/2025.12.15.694297. [DOI] [Google Scholar]
- 37.Chandola U., Gaudin M., Trottier C., Lavier-Aydat L.-J., Manirakiza E., Menicot S., Fischer E.J., Louvet I., Lacour T., Chaumier T., et al. Non-Cyanobacterial Diazotrophs Support the Survival of Marine Microalgae in Nitrogen-Depleted Environment. Genome Biol. 2025;26:146. doi: 10.1186/s13059-025-03597-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Kim K.M., Park J.-H., Bhattacharya D., Yoon H.S. Applications of Next-Generation Sequencing to Unravelling the Evolutionary History of Algae. Int. J. Syst. Evol. Microbiol. 2014;64:333–345. doi: 10.1099/ijs.0.054221-0. [DOI] [PubMed] [Google Scholar]
- 39.Oliveira M.C., Repetti S.I., Iha C., Jackson C.J., Díaz-Tapia P., Lubiana K.M.F., Cassano V., Costa J.F., Cremen M.C.M., Marcelino V.R., et al. High-Throughput Sequencing for Algal Systematics. Eur. J. Phycol. 2018;53:256–272. doi: 10.1080/09670262.2018.1441446. [DOI] [Google Scholar]
- 40.Blaby-Haas C.E., Merchant S.S. Comparative and Functional Algal Genomics. Annu. Rev. Plant Biol. 2019;70:605–638. doi: 10.1146/annurev-arplant-050718-095841. [DOI] [PubMed] [Google Scholar]
- 41.Page M.J., McKenzie J.E., Bossuyt P.M., Boutron I., Hoffmann T.C., Mulrow C.D., Shamseer L., Tetzlaff J.M., Akl E.A., Brennan S.E., et al. The PRISMA 2020 Statement: An Updated Guideline for Reporting Systematic Reviews. BMJ. 2021;372:n71. doi: 10.1136/bmj.n71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Marchesi J.R., Ravel J. The Vocabulary of Microbiome Research: A Proposal. Microbiome. 2015;3:31. doi: 10.1186/s40168-015-0094-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Curren E., Yoshida T., Kuwahara V.S., Leong S.C.Y. Rapid Profiling of Tropical Marine Cyanobacterial Communities. Reg. Stud. Mar. Sci. 2019;25:100485. doi: 10.1016/j.rsma.2018.100485. [DOI] [Google Scholar]
- 44.Hatfield R.G., Batista F.M., Bean T.P., Fonseca V.G., Santos A., Turner A.D., Lewis A., Dean K.J., Martinez-Urtaza J. The Application of Nanopore Sequencing Technology to the Study of Dinoflagellates: A Proof of Concept Study for Rapid Sequence-Based Discrimination of Potentially Harmful Algae. Front. Microbiol. 2020;11:844. doi: 10.3389/fmicb.2020.00844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Kezlya E., Tseplik N., Kulikovskiy M. Genetic Markers for Metabarcoding of Freshwater Microalgae: Review. Biology. 2023;12:1038. doi: 10.3390/biology12071038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.de Vargas C., Audic S., Henry N., Decelle J., Mahé F., Logares R., Lara E., Berney C., Le Bescot N., Probert I., et al. Eukaryotic Plankton Diversity in the Sunlit Ocean. Science. 2015;348:1261605. doi: 10.1126/science.1261605. [DOI] [PubMed] [Google Scholar]
- 47.Bowles A.M.C., Williams T.A., Donoghue P.C.J., Campbell D.A., Williamson C.J. Metagenome-Assembled Genome of the Glacier Alga Ancylonema Yields Insights into the Evolution of Streptophyte Life on Ice and Land. New Phytol. 2024;244:1629–1643. doi: 10.1111/nph.19860. [DOI] [PubMed] [Google Scholar]
- 48.Ebenezer V., Medlin L.K., Ki J.-S. Molecular Detection, Quantification, and Diversity Evaluation of Microalgae. Mar. Biotechnol. 2012;14:129–142. doi: 10.1007/s10126-011-9427-y. [DOI] [PubMed] [Google Scholar]
- 49.Malashenkov D.V., Dashkova V., Zhakupova K., Vorobjev I.A., Barteneva N.S. Comparative Analysis of Freshwater Phytoplankton Communities in Two Lakes of Burabay National Park Using Morphological and Molecular Approaches. Sci. Rep. 2021;11:16130. doi: 10.1038/s41598-021-95223-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Gelis M.M., Canino A., Bouchez A., Domaizon I., Laplace-Treyture C., Rimet F., Alric B. Assessing the Relevance of DNA Metabarcoding Compared to Morphological Identification for Lake Phytoplankton Monitoring. Sci. Total Environ. 2024;914:169774. doi: 10.1016/j.scitotenv.2023.169774. [DOI] [PubMed] [Google Scholar]
- 51.Neri F., Ubaldi M., Accoroni S., Ricci S., Banchi E., Romagnoli T., Totti C. Comparative Analysis of Phytoplankton Diversity Using Microscopy and Metabarcoding: Insights from an eLTER Station in the Northern Adriatic Sea. Hydrobiologia. 2025;852:169–183. doi: 10.1007/s10750-024-05692-2. [DOI] [Google Scholar]
- 52.Kim Y.-S., Yun H.-S., Lee J.-H., Lee K.-L., Choi J.-S., Won D.H., Kim Y.J., Kim H.-S., Yoon H.-S. Comparison of Metabarcoding and Microscopy Methodologies to Analyze Diatom Communities in Five Estuaries Along the Southern Coast of the Korean Peninsula. Microb. Ecol. 2024;87:95. doi: 10.1007/s00248-024-02396-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Kulaš A., Udovič G.M., Tapolczai K., Žutinić P., Orlić S., Levkov Z. Diatom eDNA Metabarcoding and Morphological Methods for Bioassessment of Karstic River. Sci. Total Environ. 2022;829:154536. doi: 10.1016/j.scitotenv.2022.154536. [DOI] [PubMed] [Google Scholar]
- 54.Nunes M., Adams J., Van Aswegen S., Matcher G. A Comparison between the Morphological and Molecular Approach to Identify the Benthic Diatom Community in the St Lucia Estuary, South Africa. Afr. J. Mar. Sci. 2019;41:429–442. doi: 10.2989/1814232X.2019.1689169. [DOI] [Google Scholar]
- 55.Karst S.M., Ziels R.M., Kirkegaard R.H., Sørensen E.A., McDonald D., Zhu Q., Knight R., Albertsen M. High-Accuracy Long-Read Amplicon Sequences Using Unique Molecular Identifiers with Nanopore or PacBio Sequencing. Nat. Methods. 2021;18:165–169. doi: 10.1038/s41592-020-01041-y. [DOI] [PubMed] [Google Scholar]
- 56.Kerkhof L.J., Roth P.A., Deshpande S.V., Bernhards R.C., Liem A.T., Hill J.M., Häggblom M.M., Webster N.S., Ibironke O., Mirzoyan S., et al. A Ribosomal Operon Database and MegaBLAST Settings for Strain-Level Resolution of Microbiomes. FEMS Microbes. 2022;3:xtac002. doi: 10.1093/femsmc/xtac002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Zhang T., Li H., Ma S., Cao J., Liao H., Huang Q., Chen W. The Newest Oxford Nanopore R10.4.1 Full-Length 16S rRNA Sequencing Enables the Accurate Resolution of Species-Level Microbial Community Profiling. Appl. Environ. Microbiol. 2023;89:e00605-23. doi: 10.1128/aem.00605-23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Srinivas M., Walsh C.J., Crispie F., O’Sullivan O., Cotter P.D., van Sinderen D., Kenny J.G. Evaluating the Efficiency of 16S-ITS-23S Operon Sequencing for Species Level Resolution in Microbial Communities. Sci. Rep. 2025;15:2822. doi: 10.1038/s41598-024-83410-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Johnson J.S., Spakowicz D.J., Hong B.-Y., Petersen L.M., Demkowicz P., Chen L., Leopold S.R., Hanson B.M., Agresta H.O., Gerstein M., et al. Evaluation of 16S rRNA Gene Sequencing for Species and Strain-Level Microbiome Analysis. Nat. Commun. 2019;10:5029. doi: 10.1038/s41467-019-13036-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Mohammed-Geba K., Elamin A.M., Hassan A., Mohammed E., Salah-Eldin A.-E., Schott E.J., Galal-Khallaf A. Environmental DNA-Based Metabarcoding Reveals a High Animal Biodiversity Level within Red Sea Mangrove Beds. Front. Mar. Sci. 2025;12:1686361. doi: 10.3389/fmars.2025.1686361. [DOI] [Google Scholar]
- 61.Dehghani J., Atazadeh E., Omidi Y., Movafeghi A. The Use of 18S Ribosomal DNA, ITS and rbcL Molecular Markers to Study the Genus Dunaliella (Dunaliellaceae) in Iranian Samples: A Phylogenetic Approach. Oceanol. Hydrobiol. Stud. 2019;49:88–98. doi: 10.1515/ohs-2020-0009. [DOI] [Google Scholar]
- 62.Ballesteros I., Terán P., Guamán-Burneo C., González N., Cruz A., Castillejo P. DNA Barcoding Approach to Characterize Microalgae Isolated from Freshwater Systems in Ecuador. Neotrop. Biodivers. 2021;7:170–183. doi: 10.1080/23766808.2021.1920296. [DOI] [Google Scholar]
- 63.Kabiraj S., Bhuyan S.J., Goutam U. Species Identification in Diatoms Using DNA Barcoding: An Overview. Biol. Forum Int. J. 2022;14:979–985. [Google Scholar]
- 64.Karin B.R., Arellano S., Wang L., Walzer K., Pomerantz A., Vasquez J.M., Chatla K., Sudmant P.H., Bach B.H., Smith L.L., et al. Highly-Multiplexed and Efficient Long-Amplicon PacBio and Nanopore Sequencing of Hundreds of Full Mitochondrial Genomes. BMC Genom. 2023;24:229. doi: 10.1186/s12864-023-09277-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Vossen R.H.A.M., Buermans H.P.J. Full-Length Mitochondrial-DNA Sequencing on the PacBio RSII. Volume 1492. Humana Press; New York, NY, USA: 2017. pp. 179–184. Methods in Molecular Biology. [DOI] [PubMed] [Google Scholar]
- 66.Latz M.A.C., Grujcic V., Brugel S., Lycken J., John U., Karlson B., Andersson A., Andersson A.F. Short- and Long-Read Metabarcoding of the Eukaryotic rRNA Operon: Evaluation of Primers and Comparison to Shotgun Metagenomics Sequencing. Mol. Ecol. Resour. 2022;22:2304–2318. doi: 10.1111/1755-0998.13623. [DOI] [PubMed] [Google Scholar]
- 67.He L., Yu Z., Xu X., Zhu J., Yuan Y., Cao X., Song X. Metabarcoding Analysis Identifies High Diversity of Harmful Algal Bloom Species in the Coastal Waters of the Beibu Gulf. Ecol. Evol. 2023;13:e10127. doi: 10.1002/ece3.10127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Gaonkar C.C., Campbell L. A Full-Length 18S Ribosomal DNA Metabarcoding Approach for Determining Protist Community Diversity Using Nanopore Sequencing. Ecol. Evol. 2024;14:e11232. doi: 10.1002/ece3.11232. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Rousseau C., Henry N., Rousvoal S., Tanguy G., Legeay E., Leblanc C., Dittami S.M. A Practical Comparison of Short- and Long-Read Metabarcoding Sequencing: Challenges and Solutions for Plastid Read Removal and Microbial Community Exploration of Seaweed Samples. Mol. Ecol. Resour. 2025;25:e14129. doi: 10.1111/1755-0998.14129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Punnarak P., Tang S., Janpoom S., Prasertlux S., Khamnamtong B., Wimolsakcharoen W., Thitiphuree T., Getwech C., Dhanasin P., Klinbunga S., et al. Distribution of Eukaryotic Environmental DNA in Water and Sediment from Offshore Petroleum Platforms in the Gulf of Thailand. Diversity. 2025;17:179. doi: 10.3390/d17030179. [DOI] [Google Scholar]
- 71.Weydmann-Zwolicka A., Dąbrowska A.M., Mioduchowska M., Zwolicki A. Comparison of DNA Metabarcoding and Microscopy in Analysing Planktonic Protists from the European Arctic. Mar. Biodivers. 2024;54:44. doi: 10.1007/s12526-024-01436-6. [DOI] [Google Scholar]
- 72.Wang B., Li R., Lan X., Kong D., Liu X., Xie S. Benthic Diatom eDNA Metabarcoding for Ecological Assessment of an Urban River: A Comparison with Morphological Method. Ecol. Indic. 2024;166:112302. doi: 10.1016/j.ecolind.2024.112302. [DOI] [Google Scholar]
- 73.Vidaković D., Mayombo N.A.S., Castellanos A.B., Kloster M., Beszteri B. Diatom Metabarcoding as a Tool to Assess the Water Quality of Two Large Tributaries of the Danube River. Ecol. Indic. 2024;168:112793. doi: 10.1016/j.ecolind.2024.112793. [DOI] [Google Scholar]
- 74.Borrego-Ramos M., Bécares E., García P., Nistal A., Blanco S. Epiphytic diatom-based biomonitoring in Mediterranean ponds: Traditional microscopy versus metabarcoding approaches. Water. 2021;13:1351. doi: 10.3390/w13101351. [DOI] [Google Scholar]
- 75.Buetas E., Jordán-López M., López-Roldán A., D’Auria G., Martínez-Priego L., De Marco G., Carda-Diéguez M., Mira A. Full-Length 16S rRNA Gene Sequencing by PacBio Improves Taxonomic Resolution in Human Microbiome Samples. BMC Genom. 2024;25:310. doi: 10.1186/s12864-024-10213-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Esberg A., Fries N., Haworth S., Johansson I. Saliva Microbiome Profiling by Full-Gene 16S rRNA Oxford Nanopore Technology versus Illumina MiSeq Sequencing. npj Biofilms Microbiomes. 2024;10:149. doi: 10.1038/s41522-024-00634-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Ekblom R., Wolf J.B. A field guide to whole-genome sequencing, assembly and annotation. Evol. Appl. 2014;7:1026–1042. doi: 10.1111/eva.12178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Casabianca S., Cornetti L., Capellacci S., Vernesi C., Penna A. Genome complexity of harmful microalgae. Harmful Algae. 2017;63:7–12. doi: 10.1016/j.hal.2017.01.003. [DOI] [PubMed] [Google Scholar]
- 79.Parker M.S., Mock T., Armbrust E.V. Genomic insights into marine microalgae. Annu. Rev. Genet. 2008;42:619–645. doi: 10.1146/annurev.genet.42.110807.091417. [DOI] [PubMed] [Google Scholar]
- 80.Abbaszade G., Szabó A., Vajna B., Farkas R., Szabó C., Tóth E. Whole Genome Sequence Analysis of Cupriavidus Campinensis S14E4C, a Heavy Metal Resistant Bacterium. Mol. Biol. Rep. 2020;47:3973–3985. doi: 10.1007/s11033-020-05490-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Hulatt C.J., Wijffels R.H., Posewitz M.C. The Genome of the Haptophyte Diacronema Lutheri (Pavlova Lutheri, Pavlovales): A Model for Lipid Biosynthesis in Eukaryotic Algae. Genome Biol. Evol. 2021;13:evab178. doi: 10.1093/gbe/evab178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Liu C., Shi X., Wu F., Ren M., Gao G., Wu Q. Genome Analyses Provide Insights into the Evolution and Adaptation of the Eukaryotic Picophytoplankton Mychonastes Homosphaera. BMC Genom. 2020;21:477. doi: 10.1186/s12864-020-06891-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Cho C.H., Park S.I., Huang T.-Y., Lee Y., Ciniglia C., Yadavalli H.C., Yang S.W., Bhattacharya D., Yoon H.S. Genome-Wide Signatures of Adaptation to Extreme Environments in Red Algae. Nat. Commun. 2023;14:10. doi: 10.1038/s41467-022-35566-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Chen N., Xu Q., Zhu J., Song H., He L., Liu S., Song X., Yuan Y., Chen Y., Cao X., et al. Chromosome-Scale Genome Assembly Reveals Insights into the Evolution and Ecology of the Harmful Algal Bloom Species Phaeocystis Globosa Scherffel. iScience. 2024;27:110575. doi: 10.1016/j.isci.2024.110575. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Jian J., Wu Z., Silva-Núñez A., Li X., Zheng X., Luo B., Liu Y., Fang X., Workman C.T., Larsen T.O., et al. Long-Read Genome Sequencing Provides Novel Insights into the Harmful Algal Bloom Species Prymnesium parvum. Sci. Total Environ. 2024;908:168042. doi: 10.1016/j.scitotenv.2023.168042. Correction in Sci. Total Environ. 2025, 996, 180283. [DOI] [PubMed] [Google Scholar]
- 86.Metzker M.L. Sequencing Technologies—The next Generation. Nat. Rev. Genet. 2010;11:31–46. doi: 10.1038/nrg2626. [DOI] [PubMed] [Google Scholar]
- 87.Leinonen R., Sugawara H., Shumway M., on behalf of the International Nucleotide Sequence Database Collaboration The Sequence Read Archive. Nucleic Acids Res. 2011;39:D19–D21. doi: 10.1093/nar/gkq1019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Kodama Y., Shumway M., Leinonen R. The Sequence Read Archive: Explosive Growth of Sequencing Data. Nucleic Acids Res. 2012;40:D54–D56. doi: 10.1093/nar/gkr854. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Hayes J.L., Tzika A., Thygesen H., Berri S., Wood H.M., Hewitt S., Pendlebury M., Coates A., Willoughby L., Watson C.M., et al. Diagnosis of Copy Number Variation by Illumina next Generation Sequencing Is Comparable in Performance to Oligonucleotide Array Comparative Genomic Hybridisation. Genomics. 2013;102:174–181. doi: 10.1016/j.ygeno.2013.04.006. [DOI] [PubMed] [Google Scholar]
- 90.Sudmant P.H., Rausch T., Gardner E.J., Handsaker R.E., Abyzov A., Huddleston J., Zhang Y., Ye K., Jun G., Fritz M.H.-Y., et al. An Integrated Map of Structural Variation in 2,504 Human Genomes. Nature. 2015;526:75–81. doi: 10.1038/nature15394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Ratan A., Olson T.L., Loughran T.P., Miller W. Identification of Indels in Next-Generation Sequencing Data. BMC Bioinform. 2015;16:42. doi: 10.1186/s12859-015-0483-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Alkan C., Sajjadian S., Eichler E.E. Limitations of Next-Generation Genome Sequence Assembly. Nat. Methods. 2011;8:61–65. doi: 10.1038/nmeth.1527. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Wang D., Yu X., Xu K., Bi G., Cao M., Zelzion E., Fu C., Sun P., Liu Y., Kong F., et al. Pyropia yezoensis Genome Reveals Diverse Mechanisms of Carbon Acquisition in the Intertidal Environment. Nat. Commun. 2020;11:4028. doi: 10.1038/s41467-020-17689-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Dreher T.W., Davis E.W., Mueller R.S. Complete Genomes Derived by Directly Sequencing Freshwater Bloom Populations Emphasize the Significance of the Genus Level ADA Clade within the Nostocales. Harmful Algae. 2021;103:102005. doi: 10.1016/j.hal.2021.102005. [DOI] [PubMed] [Google Scholar]
- 95.Chen H., Chu J.S.-C., Chen J., Luo Q., Wang H., Lu R., Zhu Z., Yuan G., Yi X., Mao Y., et al. Insights into the Ancient Adaptation to Intertidal Environments by Red Algae Based on a Genomic and Multiomics Investigation of Neoporphyra haitanensis. Mol. Biol. Evol. 2022;39:msab315. doi: 10.1093/molbev/msab315. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Gueidan C., Mead O.L., Nazem-Bokaee H., Mathews S. First Draft of an Annotated Genome for a Lichenised Strain of the Green Alga Diplosphaera chodatii (Prasiolales, Trebouxiophyceae) Eur. J. Phycol. 2023;58:427–437. doi: 10.1080/09670262.2023.2165711. [DOI] [Google Scholar]
- 97.Moretto J.A., Berthold D.E., Lefler F.W., Mazzei V., Loftin K.A., Laughinghouse H.D., IV Genome Sequences of Toxigenic Cyanobacteria from a Bloom in Lake Mattamuskeet, North Carolina (United States) J. Phycol. 2024;60:1349–1355. doi: 10.1111/jpy.13523. [DOI] [PubMed] [Google Scholar]
- 98.Lai Y., Liu X., Chen Z., Li Y., Shi X. Comparative Genomics Reveals the Molecular Basis for Divergent Algicidal Strategies in Two Alteromonas Macleodii Strains. Appl. Environ. Microbiol. 2025;92:e01965-25. doi: 10.1128/aem.01965-25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Guo L., Liang S., Zhang Z., Liu H., Wang S., Pan K., Xu J., Ren X., Pei S., Yang G. Genome Assembly of Nannochloropsis oceanica Provides Evidence of Host Nucleus Overthrow by the Symbiont Nucleus during Speciation. Commun. Biol. 2019;2:249. doi: 10.1038/s42003-019-0500-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Manni M., Berkeley M.R., Seppey M., Zdobnov E.M. BUSCO: Assessing Genomic Data Quality and Beyond. Curr. Protoc. 2021;1:e323. doi: 10.1002/cpz1.323. [DOI] [PubMed] [Google Scholar]
- 101.Yuan L., Lu H., Li F., Nielsen J., Kerkhoven E.J. HGTphyloDetect: Facilitating the Identification and Phylogenetic Analysis of Horizontal Gene Transfer. Brief. Bioinform. 2023;24:bbad035. doi: 10.1093/bib/bbad035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Zhang Z., Qu C., Zhang K., He Y., Zhao X., Yang L., Zheng Z., Ma X., Wang X., Wang W., et al. Adaptation to Extreme Antarctic Environments Revealed by the Genome of a Sea Ice Green Alga. Curr. Biol. 2020;30:3330–3341.e7. doi: 10.1016/j.cub.2020.06.029. [DOI] [PubMed] [Google Scholar]
- 103.Coleman A.W., Suarez A., Goff L.J. Molecular Delineation of Species and Syngens in Volvocacean Green Algae (Chlorophyta) J. Phycol. 1994;30:80–90. doi: 10.1111/j.0022-3646.1994.00080.x. [DOI] [Google Scholar]
- 104.Read B.A., Kegel J., Klute M.J., Kuo A., Lefebvre S.C., Maumus F., Mayer C., Miller J., Monier A., Salamov A., et al. Pan Genome of the Phytoplankton Emiliania Underpins Its Global Distribution. Nature. 2013;499:209–213. doi: 10.1038/nature12221. [DOI] [PubMed] [Google Scholar]
- 105.Leliaert F., Verbruggen H., Vanormelingen P., Steen F., López-Bautista J.M., Zuccarello G.C., De Clerck O. DNA-Based Species Delimitation in Algae. Eur. J. Phycol. 2014;49:179–196. doi: 10.1080/09670262.2014.904524. [DOI] [Google Scholar]
- 106.Verbruggen H., Vlaeminck C., Sauvage T., Sherwood A.R., Leliaert F., De Clerck O. Phylogenetic Analysis of Pseudochlorodesmis Strains Reveals Cryptic Diversity Above the Family Level in the Siphonous Green Algae (Bryopsidales, Chlorophyta) J. Phycol. 2009;45:726–731. doi: 10.1111/j.1529-8817.2009.00690.x. [DOI] [PubMed] [Google Scholar]
- 107.Díaz-Tapia P., Ly M., Verbruggen H. Extensive Cryptic Diversity in the Widely Distributed Polysiphonia scopulorum (Rhodomelaceae, Rhodophyta): Molecular Species Delimitation and Morphometric Analyses. Mol. Phylogenet. Evol. 2020;152:106909. doi: 10.1016/j.ympev.2020.106909. [DOI] [PubMed] [Google Scholar]
- 108.Verbruggen H. Morphological Complexity, Plasticity, and Species Diagnosability in the Application of Old Species Names in DNA-Based Taxonomies. J. Phycol. 2014;50:26–31. doi: 10.1111/jpy.12155. [DOI] [PubMed] [Google Scholar]
- 109.Tran L.-A.T., Vieira C., Steinhagen S., Maggs C.A., Hiraoka M., Shimada S., Van Nguyen T., De Clerck O., Leliaert F. An Appraisal of Ulva (Ulvophyceae, Chlorophyta) Taxonomy. J. Appl. Phycol. 2022;34:2689–2703. doi: 10.1007/s10811-022-02815-x. [DOI] [Google Scholar]
- 110.Pinseel E., Sabbe K., Verleyen E., Vyverman W. A New Dawn for Protist Biogeography. Glob. Ecol. Biogeogr. 2024;33:e13925. doi: 10.1111/geb.13925. Correction in Glob. Ecol. Biogeogr. 2025, 34, e70042. [DOI] [Google Scholar]
- 111.Verbruggen H., Uthanumallian K., Powrie F., Jalali T., Cremen C., Preuss M., Duchene S., Diaz-Tapia P. Scaling Up Species Delimitation From DNA Barcodes to Whole Organelle Genomes: Strong Evidence for Discordance Among Genes and Methods for the Red Alga Dasyclonium. Mol. Ecol. Resour. 2025;25:e14132. doi: 10.1111/1755-0998.14132. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Puillandre N., Modica M.V., Zhang Y., Sirovich L., Boisselier M.-C., Cruaud C., Holford M., Samadi S. Large-Scale Species Delimitation Method for Hyperdiverse Groups. Mol. Ecol. 2012;21:2671–2691. doi: 10.1111/j.1365-294X.2012.05559.x. [DOI] [PubMed] [Google Scholar]
- 113.Puillandre N., Brouillet S., Achaz G. ASAP: Assemble Species by Automatic Partitioning. Mol. Ecol. Resour. 2021;21:609–620. doi: 10.1111/1755-0998.13281. [DOI] [PubMed] [Google Scholar]
- 114.Fujisawa T., Barraclough T.G. Delimiting Species Using Single-Locus Data and the Generalized Mixed Yule Coalescent Approach: A Revised Method and Evaluation on Simulated Data Sets. Syst. Biol. 2013;62:707–724. doi: 10.1093/sysbio/syt033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Zhang J., Kapli P., Pavlidis P., Stamatakis A. A General Species Delimitation Method with Applications to Phylogenetic Placements. Bioinformatics. 2013;29:2869–2876. doi: 10.1093/bioinformatics/btt499. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Leaché A.D., Fujita M.K., Minin V.N., Bouckaert R.R. Species Delimitation Using Genome-Wide SNP Data. Syst. Biol. 2014;63:534–542. doi: 10.1093/sysbio/syu018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Ence D.D., Carstens B.C. SpedeSTEM: A Rapid and Accurate Method for Species Delimitation. Mol. Ecol. Resour. 2011;11:473–480. doi: 10.1111/j.1755-0998.2010.02947.x. [DOI] [PubMed] [Google Scholar]
- 118.Medini D., Donati C., Rappuoli R., Tettelin H. The Pangenome: A Data-Driven Discovery in Biology. In: Tettelin H., Medini D., editors. The Pangenome: Diversity, Dynamics and Evolution of Genomes. Springer; Cham, Switzerland: 2020. [Google Scholar]
- 119.Vernikos G., Medini D., Riley D.R., Tettelin H. Ten Years of Pan-Genome Analyses. Curr. Opin. Microbiol. 2015;23:148–154. doi: 10.1016/j.mib.2014.11.016. [DOI] [PubMed] [Google Scholar]
- 120.Chan C., Salomé P.A. What Makes a Good Reference? First Steps toward a Chlamydomonas Pangenome. Plant Cell. 2022;35:628–629. doi: 10.1093/plcell/koac361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Füssy Z., Lampe R.H., Arrigo K.R., Barry K., Brisbin M.M., Brussaard C.P.D., Decelle J., de Vargas C., DiTullio G.R., Elbourne L.D.H., et al. Genome-Resolved Biogeography of Phaeocystales, Cosmopolitan Bloom-Forming Algae. Nat. Commun. 2025;16:8559. doi: 10.1038/s41467-025-63565-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.Fan X., Qiu H., Han W., Wang Y., Xu D., Zhang X., Bhattacharya D., Ye N. Phytoplankton Pangenome Reveals Extensive Prokaryotic Horizontal Gene Transfer of Diverse Functions. Sci. Adv. 2020;6:eaba0111. doi: 10.1126/sciadv.aba0111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 123.Cao H., Xu D., Zhang T., Ren Q., Xiang L., Ning C., Zhang Y., Gao R. Comprehensive and Functional Analyses Reveal the Genomic Diversity and Potential Toxicity of Microcystis. Harmful Algae. 2022;113:102186. doi: 10.1016/j.hal.2022.102186. [DOI] [PubMed] [Google Scholar]
- 124.Luo X., Su H., Guan G., Ren M. Chromosome-Level Genome Assembly for an Edible Protein Microalgae Auxenochlorella Pyrenoidosa. Sci. Data. 2025;13:60. doi: 10.1038/s41597-025-06358-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 125.Gann E.R., Truchon A.R., Papoulis S.E., Dyhrman S.T., Gobler C.J., Wilhelm S.W. Aureococcus anophagefferens (Pelagophyceae) Genomes Improve Evaluation of Nutrient Acquisition Strategies Involved in Brown Tide Dynamics. J. Phycol. 2022;58:146–160. doi: 10.1111/jpy.13221. [DOI] [PubMed] [Google Scholar]
- 126.Penot M., Dacks J.B., Read B., Dorrell R.G. Genomic and Meta-Genomic Insights into the Functions, Diversity and Global Distribution of Haptophyte Algae. Appl. Phycol. 2022;3:340–359. doi: 10.1080/26388081.2022.2103732. [DOI] [Google Scholar]
- 127.Rossoni A.W., Price D.C., Seger M., Lyska D., Lammers P., Bhattacharya D., Weber A.P. The Genomes of Polyextremophilic Cyanidiales Contain 1% Horizontally Transferred Genes with Diverse Adaptive Functions. eLife. 2019;8:e45017. doi: 10.7554/eLife.45017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Yao S., Lyu S., An Y., Lu J., Gjermansen C., Schramm A. Microalgae–Bacteria Symbiosis in Microalgal Growth and Biofuel Production: A Review. J. Appl. Microbiol. 2019;126:359–368. doi: 10.1111/jam.14095. [DOI] [PubMed] [Google Scholar]
- 129.Abd-El-Aziz A., Elnagdy S.M., Han J., Mihelič R., Wang X., Agathos S.N., Li J. Bacteria-Microalgae Interactions from an Evolutionary Perspective and Their Biotechnological Significance. Biotechnol. Adv. 2025;82:108591. doi: 10.1016/j.biotechadv.2025.108591. [DOI] [PubMed] [Google Scholar]
- 130.Cooper M.B., Smith A.G. Exploring Mutualistic Interactions between Microalgae and Bacteria in the Omics Age. Curr. Opin. Plant Biol. 2015;26:147–153. doi: 10.1016/j.pbi.2015.07.003. [DOI] [PubMed] [Google Scholar]
- 131.Seymour J.R., Amin S.A., Raina J.-B., Stocker R. Zooming in on the Phycosphere: The Ecological Interface for Phytoplankton–Bacteria Relationships. Nat. Microbiol. 2017;2:17065. doi: 10.1038/nmicrobiol.2017.65. [DOI] [PubMed] [Google Scholar]
- 132.Girard E.B., del Rio-Hortega L., Pratama A.M.A., Volkenandt S., Macher J.-N., Wilken S., Renema W. Specific Host-Algae Relationship, yet Flexible Bacterial Microbiome, in Diatom-Bearing Foraminifera. Sci. Adv. 2025;11:eadx4098. doi: 10.1126/sciadv.adx4098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 133.O’Brien P.A., Webster N.S., Miller D.J., Bourne D.G. Host-Microbe Coevolution: Applying Evidence from Model Systems to Complex Marine Invertebrate Holobionts. mBio. 2019;10:e02241-18. doi: 10.1128/mBio.02241-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 134.Maruyama S., Weis V.M. Limitations of Using Cultured Algae to Study Cnidarian-Algal Symbioses and Suggestions for Future Studies. J. Phycol. 2021;57:30–38. doi: 10.1111/jpy.13102. [DOI] [PubMed] [Google Scholar]
- 135.Meng A., Marchet C., Corre E., Peterlongo P., Alberti A., Da Silva C., Wincker P., Pelletier E., Probert I., Decelle J., et al. A de Novo Approach to Disentangle Partner Identity and Function in Holobiont Systems. Microbiome. 2018;6:105. doi: 10.1186/s40168-018-0481-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 136.Tschitschko B., Esti M., Philippi M., Kidane A.T., Littmann S., Kitzinger K., Speth D.R., Li S., Kraberg A., Tienken D., et al. Rhizobia–Diatom Symbiosis Fixes Missing Nitrogen in the Ocean. Nature. 2024;630:899–904. doi: 10.1038/s41586-024-07495-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 137.Wang W., Ge Q., Wen J., Zhang H., Guo Y., Li Z., Xu Y., Ji D., Chen C., Guo L., et al. Horizontal Gene Transfer and Symbiotic Microorganisms Regulate the Adaptive Evolution of Intertidal Algae, Porphyra Sense Lato. Commun. Biol. 2024;7:976. doi: 10.1038/s42003-024-06663-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 138.González-González L.M., de-Bashan L.E. Toward the Enhancement of Microalgal Metabolite Production through Microalgae-Bacteria Consortia. Biology. 2021;10:282. doi: 10.3390/biology10040282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 139.Xu P., Liu X., Ke L., Li K., Wang W., Jiao Y. The Genomic Insights of Intertidal Adaptation in Bryopsis corticulans. New Phytol. 2025;246:1691–1709. doi: 10.1111/nph.70083. [DOI] [PubMed] [Google Scholar]
- 140.Lin H.-H., Liao Y.-C. Accurate Binning of Metagenomic Contigs via Automated Clustering Sequences Using Information of Genomic Signatures and Marker Genes. Sci. Rep. 2016;6:24175. doi: 10.1038/srep24175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 141.Wu Y.-W., Simmons B.A., Singer S.W. MaxBin 2.0: An Automated Binning Algorithm to Recover Genomes from Multiple Metagenomic Datasets. Bioinformatics. 2016;32:605–607. doi: 10.1093/bioinformatics/btv638. [DOI] [PubMed] [Google Scholar]
- 142.Kang D.D., Li F., Kirton E., Thomas A., Egan R., An H., Wang Z. MetaBAT 2: An Adaptive Binning Algorithm for Robust and Efficient Genome Reconstruction from Metagenome Assemblies. PeerJ. 2019;7:e7359. doi: 10.7717/peerj.7359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 143.Schvarcz C.R., Stancheva R., Turk-Kubo K.A., Wilson S.T., Zehr J.P., Edwards K.F., Steward G.F., Archibald J.M., Oatley G., Sinclair E., et al. The Genome Sequences of the Marine Diatom Epithemia pelagica Strain UHM3201 (Schvarcz, Stancheva & Steward, 2022) and Its Nitrogen-Fixing, Endosymbiotic Cyanobacterium. Wellcome Open Res. 2024;9:232. doi: 10.12688/wellcomeopenres.21534.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 144.Pop M., Salzberg S.L. Bioinformatics Challenges of New Sequencing Technology. Trends Genet. 2008;24:142–149. doi: 10.1016/j.tig.2007.12.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 145.Zoonomia Consortium A comparative genomics multitool for scientific discovery and conservation. Nature. 2020;587:240–245. doi: 10.1038/s41586-020-2876-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 146.Schadt E.E., Turner S., Kasarskis A. A Window into Third-Generation Sequencing. Hum. Mol. Genet. 2010;19:R227–R240. doi: 10.1093/hmg/ddq416. [DOI] [PubMed] [Google Scholar]
- 147.Dorey A., Howorka S. Nanopore DNA Sequencing Technologies and Their Applications towards Single-Molecule Proteomics. Nat. Chem. 2024;16:314–334. doi: 10.1038/s41557-023-01322-x. [DOI] [PubMed] [Google Scholar]
- 148.Peng K., Yin Y., Li Y., Qin S., Liu Y., Yang X., Wang Z., Li R. QitanTech Nanopore Long-Read Sequencing Enables Rapid Resolution of Complete Genomes of Multi-Drug Resistant Pathogens. Front. Microbiol. 2022;13:778659. doi: 10.3389/fmicb.2022.778659. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 149.Peng K., Li C., Wang Q., Xin X., Wang Z., Li R. The Applications and Advantages of Nanopore Sequencing in Bacterial Antimicrobial Resistance Surveillance and Research. npj Antimicrob. Resist. 2025;3:87. doi: 10.1038/s44259-025-00157-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 150.Yang T.-T., Zhang J.-R., Xie Z.-H., Ren Z.-L., Yan J.-W., Ni M. Nanopore Sequencing of Forensic Short Tandem Repeats Using QNome of Qitan Technology. Electrophoresis. 2024;45:1535–1545. doi: 10.1002/elps.202300270. [DOI] [PubMed] [Google Scholar]
- 151.Sun B., Guo J., Jin H., Jin Z., Sun Y., Mao Y., Xie F., He Y., Sun Z., Li W., et al. MetaCONNET: A Metagenomic Polishing Tool for Long-Read Assemblies. PLoS ONE. 2024;19:e0313515. doi: 10.1371/journal.pone.0313515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 152.Curtis-Joseph N., Peterson R., Brown C.E., Beekman C., Belenky P. Mouse Diet and Vendor Impact Microbiome Perturbation and Recovery from Early-Life Pulses of Amoxicillin. Front. Microbiomes. 2024;3:1432202. doi: 10.3389/frmbi.2024.1432202. [DOI] [Google Scholar]
- 153.Chen P., Jing X., Ren J., Cao H., Hao P., Li X. Modelling BioNano Optical Data and Simulation Study of Genome Map Assembly. Bioinformatics. 2018;34:3966–3974. doi: 10.1093/bioinformatics/bty456. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 154.Patiño-Guillén G., Pešović J., Panić M., Earle M., Ninković A., Petruşca S., Savić-Pavićević D., Keyser U.F., Bošković F. Quantification of Disease-Associated RNA Tandem Repeats by Nanopore Sensing. bioRxiv. 2025 doi: 10.1101/2025.05.15.653973. [DOI] [Google Scholar]
- 155.Boža V., Brejová B., Vinař T. DeepNano: Deep Recurrent Neural Networks for Base Calling in MinION Nanopore Reads. PLoS ONE. 2017;12:e0178751. doi: 10.1371/journal.pone.0178751. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 156.Huang N., Nie F., Ni P., Luo F., Wang J. SACall: A Neural Network Basecaller for Oxford Nanopore Sequencing Data Based on Self-Attention Mechanism. IEEE/ACM Trans. Comput. Biol. Bioinform. 2022;19:614–623. doi: 10.1109/TCBB.2020.3039244. [DOI] [PubMed] [Google Scholar]
- 157.Senanayake A., Gamaarachchi H., Herath D., Ragel R. DeepSelectNet: Deep Neural Network Based Selective Sequencing for Oxford Nanopore Sequencing. BMC Bioinform. 2023;24:31. doi: 10.1186/s12859-023-05151-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 158.Ni P., Huang N., Zhang Z., Wang D.-P., Liang F., Miao Y., Xiao C.-L., Luo F., Wang J. DeepSignal: Detecting DNA Methylation State from Nanopore Sequencing Reads Using Deep-Learning. Bioinformatics. 2019;35:4586–4595. doi: 10.1093/bioinformatics/btz276. [DOI] [PubMed] [Google Scholar]
- 159.Chen Y., Wu B., Ding Y.-Y., Niu L.-J., Bai X., Lin Z.-B., Xiao C.-L. High Accuracy Methylation Identification Tools on Single Molecular Level for PacBio HiFi Data. bioRxiv. 2024 doi: 10.1101/2024.08.14.607879. [DOI] [Google Scholar]
- 160.Bai X., Yao H.-C., Wu B., Liu L.-R., Ding Y.-Y., Xiao C.-L. DeepBAM: A High-Accuracy Single-Molecule CpG Methylation Detection Tool for Oxford Nanopore Sequencing. Brief. Bioinform. 2024;25:bbae413. doi: 10.1093/bib/bbae413. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 161.Kumar B., Lorusso E., Fosso B., Pesole G. A Comprehensive Overview of Microbiome Data in the Light of Machine Learning Applications: Categorization, Accessibility, and Future Directions. Front. Microbiol. 2024;15:1343572. doi: 10.3389/fmicb.2024.1343572. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 162.Santucci K., Cheng Y., Xu S.-M., Janitz M. Enhancing Novel Isoform Discovery: Leveraging Nanopore Long-Read Sequencing and Machine Learning Approaches. Brief. Funct. Genom. 2024;23:683–694. doi: 10.1093/bfgp/elae031. [DOI] [PubMed] [Google Scholar]
- 163.Asnicar F., Thomas A.M., Passerini A., Waldron L., Segata N. Machine Learning for Microbiologists. Nat. Rev. Microbiol. 2024;22:191–205. doi: 10.1038/s41579-023-00984-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 164.Pita-Galeana M.A., Ruhle M., López-Vázquez L., de Anda-Jáuregui G., Hernández-Lemus E. Computational Metagenomics: State of the Art. Int. J. Mol. Sci. 2025;26:9206. doi: 10.3390/ijms26189206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 165.Hu H., Wei X.-Y., Liu L., Wang Y.-B., Jia H.-J., Bu L.-K., Pei D.-S. Supervised Machine Learning Improves General Applicability of eDNA Metabarcoding for Reservoir Health Monitoring. Water Res. 2023;246:120686. doi: 10.1016/j.watres.2023.120686. [DOI] [PubMed] [Google Scholar]
- 166.Karlicki M., Antonowicz S., Karnkowska A. Tiara: Deep Learning-Based Classification System for Eukaryotic Sequences. Bioinformatics. 2022;38:344–350. doi: 10.1093/bioinformatics/btab672. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 167.Abdirassilova A.A., Yessimseit D.T., Rysbekova A.K., Kassenova A.K., Abdeliyev B.Z., Zhumadilova Z.B., Tokmurziyeva G.Z., Dikhanbayev A.S., Agzam S.D., Motin V.L., et al. Application of Long-Read Sequencing for Genotyping, Epigenetic Profiling and Surveillance of Yersinia pestis Isolates from Natural Foci and Disease Outbreaks in Central Asia. bioRxiv. 2025 doi: 10.1101/2025.10.14.682323. [DOI] [PubMed] [Google Scholar]
- 168.Wang L., Wang Q., Tang Z., Wang Y., Qu Y., Wen D., Zhong Q., Hu H., Liu Y., He M. Concurrent Classical and Hypervirulent Klebsiella Pneumoniae Infection in Distinct Host Niches Revealed by a Rapid Nanopore Whole-Genome and Plasmid Sequencing Method. Front. Cell. Infect. Microbiol. 2025;15:1633833. doi: 10.3389/fcimb.2025.1633833. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 169.Wu Q., Gao J., Sa B., Cong H., Deng W., Zhang Y., Zhong X., Zhang J., Wang L., Liu H., et al. Genomes of Prochlorococcus, Synechococcus, Bacteria, and Viruses Recovered from Marine Picocyanobacteria Cultures Based on Illumina and Qitan Nanopore Sequencing. Sci. Data. 2025;12:612. doi: 10.1038/s41597-025-04762-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 170.Hu T., Chitnis N., Monos D., Dinh A. Next-Generation Sequencing Technologies: An Overview. Hum. Immunol. 2021;82:801–811. doi: 10.1016/j.humimm.2021.02.012. [DOI] [PubMed] [Google Scholar]
- 171.Schafran P.W., Cai V., Yang H.-P., Li F.-W. Metagenomic Characterization of a Harmful Algal Bloom Using Nanopore Sequencing. bioRxiv. 2020 doi: 10.1101/2020.11.13.381525. [DOI] [Google Scholar]
- 172.Hanschen E.R., Starkenburg S.R. The State of Algal Genome Quality and Diversity. Algal Res. 2020;50:101968. doi: 10.1016/j.algal.2020.101968. [DOI] [Google Scholar]
- 173.Kwon T., Hanschen E.R., Hovde B.T. Addressing the Pervasive Scarcity of Structural Annotation in Eukaryotic Algae. Sci. Rep. 2023;13:1687. doi: 10.1038/s41598-023-27881-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 174.Petroll R., West J.A., Ogden M., McGinley O., Craig R.J., Coelho S.M., Borg M. The Expanded Bostrychia Moritziana Genome Unveils Evolution in the Most Diverse and Complex Order of Red Algae. Curr. Biol. 2025;35:2771–2788.e8. doi: 10.1016/j.cub.2025.04.044. [DOI] [PubMed] [Google Scholar]
- 175.Zhu H., Li S., Hu Z., Liu G. Molecular Characterization of Eukaryotic Algal Communities in the Tropical Phyllosphere Based on Real-Time Sequencing of the 18S rDNA Gene. BMC Plant Biol. 2018;18:365. doi: 10.1186/s12870-018-1588-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 176.Mirasbekov Y., Abdimanova A., Sarkytbayev K., Samarkhanov K., Abilkas A., Potashnikova D., Arbuz G., Issayev Z., Vorobjev I.A., Malashenkov D.V., et al. Combining Imaging Flow Cytometry and Molecular Biological Methods to Reveal Presence of Potentially Toxic Algae at the Ural River in Kazakhstan. Front. Mar. Sci. 2021;8:680482. doi: 10.3389/fmars.2021.680482. [DOI] [Google Scholar]
- 177.Koeppel A.F., Goodrum W.J., Steffen M.M., Wurch L.L., Turner S.D. Environmental DNA Sequencing Dataset from Lake Erie Algal Blooms Using Oxford Nanopore MinION. Data Brief. 2022;45:108688. doi: 10.1016/j.dib.2022.108688. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 178.O’Neill E.A., Fehrenbach G., Murphy E., Alencar S.A., Pogue R., Rowan N.J. Use of next Generation Sequencing and Bioinformatics for Profiling Freshwater Eukaryotic Microalgae in a Novel Peatland Integrated Multi-Trophic Aquaculture (IMTA) System: Case Study from the Republic of Ireland. Sci. Total Environ. 2022;851:158392. doi: 10.1016/j.scitotenv.2022.158392. [DOI] [PubMed] [Google Scholar]
- 179.Hu Y., Zhang J., Huang J., Zhou M., Hu S. The Biogeography of Colonial Volvocine Algae in the Yangtze River Basin. Front. Microbiol. 2023;14:1078081. doi: 10.3389/fmicb.2023.1078081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 180.Liu B., Liu Y., Dai Q., Zhu H., Liu G. Molecular Insights into Hidden Diversity of the Epilithic Green Algae in Chishui River, China. Eur. J. Phycol. 2024;59:243–253. doi: 10.1080/09670262.2024.2317803. [DOI] [Google Scholar]
- 181.Meirkhanova A., Marks S., Feja N., Vorobjev I.A., Barteneva N.S. Spectral Algal Fingerprinting and Long Sequencing in Synthetic Algal–Microbial Communities. Cells. 2024;13:1552. doi: 10.3390/cells13181552. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 182.Mordret S., MacKinnon J., Behnke J., O’Leary S.J., Chénard C. Identification of Phytoplankton Isolates from the Eastern Canadian Waters Using Long-Read Sequencing. J. Plankton Res. 2024;46:527–541. doi: 10.1093/plankt/fbae043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 183.Marter P., Freese H.M., Ringel V., Brinkmann H., Pradella S., Rohde M., Jarek M., Spröer C., Wagner-Döbler I., Overmann J., et al. Superior Resolution Profiling of the Coleofasciculus Microbiome by Amplicon Sequencing of the Complete 16S rRNA Gene and ITS Region. Environ. Microbiol. Rep. 2025;17:e70066. doi: 10.1111/1758-2229.70066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 184.Yu B.S., Park Y., Park S., Cho J.-W., Han K. Control System for Soil Microbes Detrimental to Trees Using Metabolites Derived from Algal Bloom-Causing Microorganisms. Sci. Total Environ. 2025;999:180359. doi: 10.1016/j.scitotenv.2025.180359. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
No new experimental data were created or analyzed in this study. Raw data sharing is not applicable to this article.







