Skip to main content
aBIOTECH logoLink to aBIOTECH
. 2025 Oct 21;7(1):100004. doi: 10.1016/j.abiote.2025.100004

Advances and applications in sequencing-based pathogen surveillance

Hao Luo 1,1, Yao Wang 1,1, Huiyu Hou 1, Junbo Yang 1,, Yong-Xin Liu 1,⁎⁎
PMCID: PMC12973409  PMID: 41940150

Abstract

The ongoing emergence of infectious diseases necessitates cutting-edge diagnostic methodologies. Traditional diagnostic methods are constrained by limited range, lengthy processing times, and inadequate sensitivity. High-throughput sequencing technologies, particularly multiplex polymerase chain reaction (PCR)-based targeted sequencing, have emerged as transformative tools for pathogen detection, offering enhanced sensitivity, specificity, and cost efficiency. However, challenges in primer design, such as dimerization and bias, limit the effectiveness of these approaches. This review explores advances in sequencing technologies, emphasizing the roles of culturomics, metagenomics, and metatranscriptomics in pathogen discovery. We spotlight innovative strategies for error-tolerant primer design that address existing limitations by balancing coverage and specificity, thereby optimizing the multiplex PCR process. Furthermore, integration of artificial intelligence enhances the precision and scalability of sequencing, enabling real-time diagnostics. Collectively, these advances offer promising pathways to bolster global health, food security, and ecological resilience through robust and sustainable pathogen-detection systems.

Keywords: Pathogen detection, High-throughput sequencing, Artificial intelligence, Diagnostic methodologies

1. Introduction

The livestock industry regularly faces significant economic losses due to infectious animal diseases, with recent examples including avian influenza, African swine fever, and foot-and-mouth disease [1,2]. At the same time, zoonotic pathogens pose critical threats to food safety and public health, with approximately 60 ​% of global emerging infectious diseases and 75 ​% of newly emerging human pathogens originating from animals, according to the World Health Organization [3]. Notable zoonotic pathogens include Salmonella spp., Campylobacter spp., Brucella spp., and the Betacoronavirus species SARS-CoV-2, all of which have caused significant public health challenges [4,5]. The COVID-19 pandemic, caused by SARS-CoV-2, has highlighted the devastating potential of zoonotic pathogens to emerge from animal reservoirs, such as bats or other intermediate hosts [6], and rapidly evolve into global health crises [7]. This pandemic not only claimed millions of lives but also disrupted economies, food systems, and global health infrastructure on an unprecedented scale. Similarly, plant pathogens have caused significant losses in agricultural productivity and biodiversity. Examples include Phytophthora infestans, the causative agent of late blight in potatoes [8,9], Xylella fastidiosa, responsible for diseases such as olive quick decline syndrome [10], Magnaporthe oryzae, the rice blast fungus [11,12], and Fusarium graminearum, the causative agent of wheat Fusarium head blight [13]. The globalization of agricultural trade has facilitated the movement of infected plant materials, seeds, and soil, further exacerbating the spread of these pathogens. The convergence of plant, animal, and human health underscores the importance of adopting a “One Health” approach, which acknowledges the interconnectedness of these systems [14]. Whether addressing zoonotic diseases such as COVID-19, livestock infections such as foot-and-mouth disease, or plant pathogens such as Phytophthora infestans, integrated efforts are essential for safeguarding global health, food security, and ecological resilience. The increasing pace of globalization and international trade has further exacerbated the cross-border spread of pests and pathogens, leading to novel host–pathogen interactions that threaten agricultural productivity, biodiversity, and ecosystem stability [3,15].

Effective and precise diagnostic methods are essential for mitigating these risks. However, traditional diagnostic approaches for human, animal, and plant pathogens, such as morphological observation, microorganism isolation and cultivation, biochemical identification, and immunological assays, are often time consuming, have low detection rates, and require specialized expertise [16,17]. Molecular biology techniques such as real-time PCR have enabled rapid pathogen identification and resistance-gene detection, but their limited detection range poses challenges, particularly in cases of mixed or rare infections [[18], [19], [20]].

By contrast, sequencing-based detection techniques, such as high-throughput sequencing for entire or specific nucleotide sequences [21,22], are gaining attention for their speed and enhanced sensitivity (Fig. 1A). In practical applications, whole-genome sequencing (WGS) combined with bioinformatics analysis has been widely used for the surveillance of Salmonella and other pathogens, as well as their antimicrobial resistance genes [23]. These methodologies enable detailed analysis of the microbiome at the species level and offer notable advantages in terms of detection range. Culturomics [24], metagenomics [25] (Fig. 1B–E), metatranscriptomics [26] (Fig. 1C–E), and targeted sequencing [22] enable in-depth analysis of microbial genomes (Fig. 1D and E). Beyond human health, these technologies are broadly applicable in veterinary and agricultural contexts, addressing the urgent need for advanced and environmentally sustainable diagnostic solutions.

Fig. 1.

Fig. 1

High-throughput sequencing technologies for pathogen detection. A High-throughput sequencing technologies for molecular characterization of pathogen infections. B Metagenomic next-generation sequencing (mNGS) technologies for pathogen detection. C Metatranscriptomic next-generation sequencing (mtNGS) technologies for pathogen detection. D Targeted sequencing technologies for pathogen detection. E Library construction and sequencing.

Compared with previous studies that have often focused on a single technology or a limited application area, our review offers a more comprehensive and systematic perspective. We synthesize the latest advances and cross-disciplinary applications of a range of cutting-edge sequencing technologies in the context of global infectious disease surveillance. Furthermore, we highlight how these technologies collectively contribute to transforming the paradigm of pathogen detection, thereby providing critical support for the development of sustainable One Health systems.

2. Exploring novel pathogen phylotypes through culturomics

Approximately 62 ​% of the bacterial phylotypes identified in human colon mucosal tissue and fecal samples are novel, with 80 ​% of the species previously considered unculturable [27]. Researchers have therefore developed innovative methodologies to explore these newfound bacteria and species [28,29]. Culturomics employs a range of culture conditions to culture and isolate species, using matrix-assisted laser desorption/ionization time-of-flight (MALDI–TOF) mass spectrometry or 16S rRNA gene sequencing for bacterial identification and WGS for resequencing of species [30], thus enabling precise identification. In one example, Clostridium butyricum, whose toxin secretion could not be detected using conventional metagenomic approaches, was successfully isolated and confirmed to be associated with necrotizing enterocolitis in preterm infants [31]. This approach has been invaluable for characterizing human microbiomes and extending insights into other domains. In animals, culturomics has been used to study gut microbiomes in livestock such as cattle and swine, revealing microbial diversity and its role in health and disease [32,33]. These findings support strategies to enhance livestock productivity and control zoonotic infections. Culturomics in plants has focused on rhizosphere, phyllosphere, and endophytic microbes, which are critical for plant growth, disease resistance, and nutrient cycling [[34], [35], [36]]. For instance, the approach has been used to isolate beneficial microbes that protect crops against fungal pathogens, reducing reliance on chemical pesticides [37]. Notably, some bacteria initially isolated as beneficial microorganisms have been re-evaluated in clinical settings. For example, Akkermansia muciniphila and Christensenella minuta, widely recognized for their probiotic properties [38,39], have been identified as potential pathogens in clinical specimens [40,41]. These findings underscore the dual roles of certain bacterial species in health and disease, highlighting the need for further research to determine their potential pathogenicity. However, one challenge faced by culturomics is the necessity for specific culture conditions to facilitate the growth of certain microorganisms that will thrive only if particular nutrients or specific environmental conditions are present. For instance, obligate anaerobes are unable to grow in the presence of oxygen. Another challenge is the time investment: the process of culturing and isolating microorganisms is time consuming, leading to delays in obtaining identification results [42]. Consequently, culturomics may not be the most suitable approach in urgent situations.

3. Pathogen detection by metagenomic and metatranscriptomic approaches

Metagenomic and metatranscriptomic next-generation sequencing approaches involve the direct sequencing of nucleotides from specimens, enabling the comprehensive acquisition of both host and microorganism data [43]. In contrast to culturomics, these approaches enable the broad identification of both known and unexpected pathogens, as well as the discovery of new organisms [[44], [45], [46]]. Metagenomic next-generation sequencing (mNGS) is an emerging technology that is transforming the diagnosis of complex clinical infections. It provides an unbiased method for detecting diverse pathogens, including bacteria, viruses, fungi, and parasites, by sequencing all microbial and host genetic material (DNA and RNA) within a patient sample [47]. mNGS has been used in clinical settings to identify pathogens associated with neonatal pneumonia, sepsis, and meningitis deaths, identifying fatal pathogens in 90 ​% of cases [48]. In foodborne outbreaks caused by bacteria such as Listeria monocytogenes, mNGS has proven effective for identifying pathogens and accurately tracing contamination sources through genomic comparison of patient and food samples [5]. This technology is also useful for diagnosing invasive fungal infections in immunocompromised individuals, as it can rapidly detect pathogens such as Aspergillus fumigatus and the multidrug-resistant yeast Candida auris directly from blood or tissue, enabling prompt and targeted therapy [49]. A large-scale virome study using high-throughput metatranscriptomics analyzed 1941 wild animals across five mammalian orders in China, identifying 102 mammalian-infecting viruses, including 65 novel viruses from 13 virus families [50]. This approach effectively revealed known and unknown pathogens and identified zoonotic threats, highlighting the utility of metatranscriptomics for pathogen discovery and surveillance. Furthermore, the use of metagenomic and metatranscriptomic technologies to monitor antibiotic-resistance genes is becoming increasingly important. Metagenomics can identify the potential presence of resistance genes, whereas metatranscriptomics can evaluate their actual expression. Together, these techniques provide a comprehensive view of the resistome's status and transmission risk [[51], [52], [53]]. In addition, metagenomic sequencing of sugar beet roots grown in disease-suppressive soils has identified microbial communities and functional profiles associated with disease suppression, highlighting biosynthetic gene clusters upregulated during infection to reconstruct the endophytic microbiome [54]. However, like other tests, metagenomic techniques also have drawbacks. A key challenge is the high host-to-microbial nucleic acid ratio in the sequencing results. Levels of human DNA often exceed those of microbial nucleic acids in samples, with over 99 ​% of reads deriving from the host, thus reducing the sensitivity of mNGS for host-DNA-rich samples and limiting detection by reducing sequencing depth and increasing error rate [55,56]. Similarly, in the study of plant endophytes, host DNA obscures microbial reads, reducing the overall analytical sensitivity for pathogen detection [57]. This predominance of host-derived sequences limits the detection of low-abundance microorganisms owing to the relatively sparse sequencing of microbial non-host reads. Although host-depletion methods can reduce the influence of host contamination, they also pose the risk of inadvertently removing low-abundance microbial sequences [43]. Moreover, WGS for microorganism identification requires a very high sequencing depth to obtain satisfactory coverage of the complete microbiome, potentially increasing the time and cost of detection.

4. Microorganism identification and surveillance through targeted sequencing

WGS may not be widely adopted for microorganism identification because of its high costs and the complexity involved in data analysis. Targeted sequencing, exemplified by 16S rRNA gene amplification and sequencing, has emerged as a cost-effective and expeditious alternative that focuses on specific genes or regions for identification purposes [58]. Compared with other sequencing-based methods, targeted sequencing is significantly faster, as it eliminates the need for experiments and analyses such as host removal, and it does not require resequencing of the entire genome [59]. Two primary strategies are available for targeted sequencing [1]: hybridization-based targeted sequencing, which selectively captures and sequences targeted gene segments through complementary interactions with specific probes [60]; and [2] multiplex PCR-based targeted sequencing [61], which uses multiplex PCR to enrich specific gene segments for sequencing. In contrast with hybridization-based targeted sequencing, multiplex PCR-based targeted sequencing requires only one or two rounds of PCR to complete library construction and sequencing, without the need for chip fabrication or hybridization processes. This not only renders it more cost effective but also reduces the barrier to entry, giving it significant potential for use in point-of-care tests. However, the selection of primers or probes is crucial for ensuring accurate amplicon enrichment or capture. To date, over 500 primer pairs have been used in 16S rRNA testing, and findings suggest that inappropriate primer choice can cause under-representation or bias against individual species [62]. Therefore, the design and selection of primers represent key challenges in targeted sequencing.

Among targeted-sequencing approaches, multiplex PCR-based targeted sequencing is particularly noteworthy because it simultaneously enriches and sequences multiple targets, making it highly adept at cost-effective detection of low-content and heterogeneous samples. Reports indicate that the cost of multiplex PCR-based targeted sequencing for the detection of respiratory pathogens is only a quarter of that associated with metagenomic sequencing [22]. During the COVID-19 pandemic, targeted sequencing was widely used to analyze SARS-CoV-2 lineage variations in human and wastewater samples [63], monitor viral abundance [64], trace sources of infection, and assess individual treatment responses [65]. Its application has since been expanded to other human-associated pathogens [66]. Furthermore, amplicon sequencing, as part of multiplex PCR-based targeted sequencing, has emerged as a nearly ubiquitous tool for the characterization of bacterial populations across diverse environments and host systems [67,68,69]. Methods based on 16S and ITS amplicon sequencing have been used to assess the similarity of pathogenic fungi on leaves of invasive Eupatorium adenophorum and coexisting native plants [70]. These methods have also been used to study soil-borne pathogens in cabbage, as well as bacterial and fungal communities in 12 microbiomes associated with the surrounding soil, roots, stems, and fruits of Capsicum annuum (chili pepper) [71,72]. This technique has significantly expanded our understanding of the microbiomes present in environmental DNA. However, although multiplex PCR-based targeted sequencing is cost effective and widely used, it can encounter challenges in accurately discerning species-level differences and evenly identifying all targets. These difficulties arise from the diverse and abundant nature of microorganisms and the fact that primer selection can markedly influence the representation of specific bacterial clades [62]. The accuracy of multiplex PCR-based targeted sequencing depends strongly on the choice of primers, complicating the primer-design process. In addition, as the number of primers increases, issues such as primer dimerization and off-target effects are exacerbated, leading to conflicts between detection range and compatibility with multiple primers.

5. Challenges and strategies for multiplex PCR-based targeted sequencing

Multiplex PCR-based targeted sequencing has been widely used by researchers and commercial entities because of its user-friendly nature and exceptional efficiency in enriching targets within samples [[73], [74], [75]]. This approach is particularly effective for detecting low-frequency somatic mutations in tumors and identifying pathogens by analyzing conserved sequences with ample variability [[76], [77], [78]]. Primers are crucial for the detection capacity of PCR-based targeted sequencing. However, the design of universal PCR primers for the detection of infectious pandemic pathogens, such as primers targeting the 16S rDNA region of bacteria, has proven challenging owing to the high diversity of target sequences. In addition to bacteria, the possible presence in samples of other infectious agents, including viruses, fungi, mycoplasma, chlamydia, and parasites, further complicates primer design for targeted sequencing [79]. A common strategy for addressing the diversity of target pathogens is to design separate primers for each pathogen type and then combine them. However, this approach increases the risks of primer-dimer formation and off-target amplification, potentially leading to PCR failure. The optimization of primer design thus remains a key challenge in multiplex PCR-based targeted sequencing.

Three strategies and various software tools have been used to design primers for multiplex PCR systems [80]. In the first strategy, primers are designed for individual sequences using software such as Primer3, followed by consolidation [81]. Candidate primers undergo two rounds of quality control to check their efficiency, specificity, and compatibility. In the initial quality check (QC1), online blast tools such as Primer-BLAST are used to detect potential off-target issues [82], and the secondary structures of the primers are assessed to improve efficiency. In the second round of quality checking (QC2), the compatibility between candidate primers, as well as the primer specificity, is thoroughly examined to eliminate primer dimers and off-target effects [83]. Clearly, this strategy is time consuming and labor intensive, making it unsuitable for high-throughput applications (Fig. 2A). In the second strategy, primers are designed by K-mer creation [84]. This approach offers high target coverage while minimizing the number of primers, thereby reducing the workload associated with compatibility and specificity assessments (Fig. 2B). In this method, input sequences are fragmented into short K-mers, and high-frequency K-mers are selected to design high-coverage primers, which are then assembled into a primer set. The first round of quality control evaluates parameters such as GC content and K-mer frequency in the target and host genomes. In the third strategy, multiple alignments are used to identify conserved regions for the design of degenerate primers (Fig. 2C) [85]; quality control is analogous to that used in the second strategy. Notably, degenerate primers provide broader target coverage and simplify the design process by focusing primarily on conserved regions. Although one might propose integrating degenerate primers into the second strategy, the high diversity of K-mers would require additional computational resources and time to incorporate degenerate bases effectively. Overall, multiple methods have been used for primer design, but most do not use degenerate primers and have limited applications [86,87].

Fig. 2.

Fig. 2

Advantages of error-tolerant degenerate primers. A Primer set design sequence by sequence. B Primer set design through K-mer creation. C Primer set design through multiple alignment. D Designing primers using the complete-match mode. E Error-tolerant mode. QC: quality control.

6. Error-tolerant strategy: improving PCR-based targeted sequencing

In general, primers are designed to minimize incorrect base pairings while ensuring comprehensive coverage, specificity, and sensitivity [88]. However, studies indicate that PCR can still be initiated by primers that contain mismatched bases when the thermodynamic stability of matched bases exceeds that of mismatched bases (also called errors) [89,90]. Thus, to achieve complete coverage, primers that contain mismatched bases can be used [85,91]. Most primer-design methods are limited to generating primers with maximal coverage based solely on complete matches between primers and templates [81,84,85,88,92]. Consequently, tools such as “TestPrime” in databases such as SILVA (https://www.arb-silva.de/search/testprime/) are used to evaluate the mismatch coverage of primers [93]. However, this approach does not offer maximal coverage when allowing for a specified number of mismatches (X errors, where X ​= ​0, 1, 2) (Fig. 2D). A tradeoff between X errors and primer specificity and sensitivity should be considered when designing primers with higher coverage and X errors (Fig. 2E).

In theory, a set of 512 primers should be sufficient for complete coverage of 10 targets (Fig. 3A). However, this ideal situation is rarely encountered, as the co-existence of 512 primers would dramatically increase the chances of primer-dimer formation and off-target effects. Moreover, the final concentration of each primer would be 512 times lower than that used in normal PCR, which would affect the annealing of primers to templates and thus reduce PCR efficiency. If the degeneracy does not exceed 4, the maximal coverage with complete match is only 4 out of 10 targets (Fig. 3B). To attain full coverage requires at least four rounds of primer design, each time choosing primers with the highest coverage. This method not only expands the number of primers used but also complicates primer design, because each round needs to consider the compatibility between newly designed and existing primers.

Fig. 3.

Fig. 3

Coverage and error-tolerant models for multiplex PCR primer design. A Illustration of the desired targets for multiplex PCR. Complete match: primers that completely match with templates. Complete coverage: a degenerate primer capable of binding to all input sequences in the complete-match model. 1-Mismatch: primers that contain a one-base mismatch with templates. B Representation of the complete-coverage design method and error-tolerant model. Degenerate primers with maximum complete coverage are highlighted in red, and degenerate primers with maximum error-tolerant coverage are highlighted in green. Cov: primers with complete coverage; E-Cov: primers with error-tolerant coverage; Deg: Degeneracy.

Using the error-tolerant mode, two candidate primers with a degeneracy of 4 can achieve 100 ​% coverage if a maximum of one error is allowed between the primer and the target sequences (Fig. 3B). Consequently, the final concentration of each primer will be reduced to 1/4 of the initial concentration. The use of limited primer degeneracy therefore results in relatively high final concentrations of all primers. Notably, the composition of degenerate bases in a primer designed using this error-tolerant mode differs from that of a primer designed for maximum coverage using the complete-match method (Fig. 3B, red and green backgrounds). Further refinement can be undertaken to optimize the placement of degenerate bases and mismatch positions. For instance, primer dimerization occurs primarily through complementary pairing at the 3′ end of primers [89,90,94,95]. Therefore, positioning degenerate bases away from the 3′ ends of primers can reduce the likelihood of dimer formation. In addition, mismatches at the 3′ end may significantly affect the binding efficiency of primers with templates. Hence, it is advisable to avoid base mismatches at the 3′ end to maintain the specificity and sensitivity of the primer.

7. Challenges in the design of error-tolerant degenerate primers

The design of error-tolerant primers is a classic nondeterministic polynomial-time (NP)-complete problem [91]. Several programs have been developed for the design of degenerate primers. A classic program for the design of error-tolerant degenerate primers is HYDEN, which accepts DNA sequences in a multiple-alignment format as input and generates a candidate primer set that covers all input sequences [96]. By contrast, Primux does not require a multiple sequence alignment and uses a strategy based on K-mer creation. However, both programs are time consuming, which has been attributed to the extensive sequence diversity and the sheer number of K-mers [97]. Both are also sensitive to the positions of base mismatches, but they do not have a module to control error positions and identify error types. By contrast, MultiPrime appears to deal well with these issues. It first clusters input sequences, then randomizes samples from the clusters, and finally designs primers in accordance with customer requirements, such as degeneracy, degenerate base positions, error numbers, and mismatch positions [98]. However, it cannot comprehensively evaluate the efficiency and specificity of error-tolerant primers [90]. Overall, relatively few experiments have focused on optimizing error properties. The optimal approach for designing error-tolerant primers with maximum error-tolerant coverage has not yet been identified. Thus, it is critical to precisely measure the effects of the numbers, positions, and types of base mismatches on primer efficiency and specificity in order to make multiplex PCR-based targeted sequencing more practical for targeted detection.

8. Artificial intelligence in sequencing technologies: revolutionizing pathogen detection and primer design

Artificial intelligence (AI) will continue to drive advances in omics technologies and targeted sequencing techniques, playing a significant role in the detection, prevention, and treatment of pathogenic microorganisms (Fig. 4). The combination of AI with sequencing technologies is revolutionizing pathogen detection in humans, animals, and plants, enabling the rapid, accurate, and scalable identification of microbial pathogens. In human pathogen detection, AI has advanced metagenomic sequencing by overcoming host-dominated nucleic acid signals, enhancing diagnostic sensitivity for low-abundance pathogens, and improving the identification of polymicrobial infections. For example, the combination of mNGS with AI algorithms enables the isolation of microbial signatures from complex clinical samples, which is crucial for the detection of emerging infectious agents and antimicrobial-resistant organisms, as demonstrated in mixed infections [99]. Moreover, machine learning (ML) has been effectively applied to process microscopy datasets related to infection biology. Various microscopy techniques, including light and electron microscopy, have contributed to the development of ML models capable of detecting bacteria, fungi, parasites, and viruses in host cells [[100], [101], [102]]. In veterinary diagnostics, AI-enhanced sequencing tools have been used to address major livestock pathogens such as African swine fever virus, avian influenza virus, and foot-and-mouth disease virus, enabling rapid pathogen identification to prevent outbreaks [103]. In addition, AI-optimized nanopore sequencing has been used to detect foodborne zoonotic pathogens, offering high sensitivity and specificity [[104], [105], [106]]. The integration of sequencing technologies into veterinary practice has also facilitated the monitoring of microbial resistance genes in livestock [107]. In agriculture, AI-driven sequencing technologies have enhanced the identification of plant pathogens, including bacteria, fungi, and viruses, through methods such as 16S rDNA sequencing. These techniques use AI to optimize primer design, improving detection specificity for low-abundance microbial populations, as shown in ornamental plants [108]. AI has been used to predict the development of winter barley net blotch, offering new opportunities for its diagnosis that show high diagnostic accuracy, predictive precision, and strong applicability under field conditions [109]. Such innovations support the early detection of crop diseases and can reduce reliance on chemical pesticides, promoting environmentally sustainable agricultural practices. Across these sectors, AI-powered sequencing has introduced innovations such as error-tolerant primer design [110], host-DNA depletion for improved sensitivity, and real-time diagnostics with portable sequencing platforms, enabling on-site diagnostics in resource-limited or field settings [111]. The synergy between AI and sequencing technologies is transforming microbial surveillance and disease management across human, veterinary, and agricultural domains.

Fig. 4.

Fig. 4

Role of artificial intelligence in advancing culturomics, other omics technologies, and targeted sequencing techniques.

9. Conclusions

The battle against infectious diseases necessitates a multifaceted approach that integrates cutting-edge molecular technologies with a deep understanding of pathogen diversity and diagnostic challenges. High-throughput sequencing, particularly multiplex PCR-based targeted sequencing, has demonstrated its potential to transform pathogen detection by providing rapid, precise, and cost-effective solutions. Its ability to simultaneously enrich and sequence multiple targets makes it an invaluable tool for addressing the complexities of heterogeneous samples and diverse pathogens. Nonetheless, no single technology can serve as a universal solution. For instance, although culturomics offers high accuracy by providing quality isolates, it does not detect unculturable species. Metagenomics can provide a comprehensive survey of a microbial community but is computationally complex and costly. Metatranscriptomics can reveal the active state of microorganisms but is even more demanding in terms of cost and handling. Finally, targeted sequencing, while offering a low-cost, high-sensitivity solution for known targets, lacks the capacity for novel pathogen discovery. Therefore, the judicious selection and integration of these strategies, tailored to specific diagnostic needs and research goals, is of paramount importance.

In direct response to the need for clear, application-specific guidance, we offer the following recommendations. For the detection of known pathogens, a targeted strategy should be used: reference genomes can be sourced from NCBI's RefSeq [112], with primers designed to target specific genes using MultiPrime [98], and the data can be processed using analysis pipelines involving aligners such as Bowtie 2 [113,114] and variant callers such as SAMtools and BCFtools [115]. By contrast, the detection of novel pathogens requires a reference-free metagenomic approach that uses K-mer-based classifiers such as Kraken 2 [116] for rapid identification, assemblers such as MEGAHIT [117] and SPAdes [118] for genome reconstruction, and comprehensive discovery pipelines such as EasyMetagenome [119]. Pathogen-detection strategies should also be tailored to the specific host system. When targeting animal pathogens, it is critical to avoid host cross-reactivity by confirming primer specificity against the host genome (e.g., with Primer-BLAST [120]). For detection of plant pathogens, in which host-DNA contamination and pathogen variability are major challenges, resources such as VEuPathDB [121] are invaluable, and analysis can be streamlined with dedicated pipelines such as PhytoPipe [122] for RNA-seq.

Despite the considerable promise of sequencing-based pathogen detection, several critical challenges must be addressed to ensure its reliable and widespread implementation. A primary challenge is the management and interpretation of bioinformatics data, particularly the risks of false positives and inconsistencies introduced by variable performance across analytical tools. Given the potentially high cost of misdiagnosis, future work must prioritize the development of robust and standardized analytical frameworks that incorporate stringent controls, consensus-based algorithmic approaches, and orthogonal validation to ensure data reliability [[123], [124], [125]]. Furthermore, because NGS uncovers novel microorganisms, establishing clear protocols for risk assessment when moving from in-silico prediction of virulence to functional validation is crucial for global biosafety [126].

Looking forward, the integration of advanced computational tools, particularly AI, holds immense promise. Beyond improvements in analytical speed and scalability, the next frontier is the development of explainable AI (XAI), which offers improved model interpretability by incorporating domain-specific knowledge such as biological networks. Such advances will be crucial for optimizing error-tolerant primer design, improving detection accuracy, and making sense of complex metagenomic data.

When these current limitations are addressed through innovative and collaborative research, sequencing-based technologies can achieve widespread adoption in clinical, veterinary, and agricultural settings. These advances not only promise improved diagnostics but also contribute to global health resilience, sustainable agricultural practices, and robust disease-management frameworks. Continued exploration and refinement of these technologies, with a strong emphasis on data integrity and responsible innovation, are imperative for safeguarding health and biodiversity in an increasingly interconnected world.

CRediT authorship contribution statement

Hao Luo: Writing – original draft, Visualization. Yao Wang: Writing – original draft. Huiyu Hou: Writing – original draft. Junbo Yang: Writing – review & editing, Writing – original draft. Yong-Xin Liu: Writing – review & editing.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

This study was financially supported by the Natural Science Foundation of China (32470055), and the Agricultural Science and Technology Innovation Program (CAAS-BRC-CB-2025-01).

Contributor Information

Junbo Yang, Email: 1806389316@pku.edu.cn.

Yong-Xin Liu, Email: liuyongxin@caas.cn.

Data availability

Data availability is not applicable to this article, as no new data were created or analyzed in this study.

References

  • 1.Nguyen-Thi T., Pham-Thi-Ngoc L., Nguyen-Ngoc Q., Dang-Xuan S., Lee H.S., Nguyen-Viet H., et al. An assessment of the economic impacts of the 2019 African swine fever outbreaks in Vietnam. Front Vet Sci. 2021;8 [Google Scholar]
  • 2.Chakraborty S., Kumar N., Dhama K., Verma A.K., Tiwari R., Kumar A., et al. Foot-and-mouth disease, an economically important disease of animals. Adv Anim Vet Sci. 2014;2(2S):1–18. [Google Scholar]
  • 3.Jones K.E., Patel N.G., Levy M.A., Storeygard A., Balk D., Gittleman J.L., et al. Global trends in emerging infectious diseases. Nature. 2008;451(7181):990–993. doi: 10.1038/nature06536. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Lamers M.M., Haagmans B.L. SARS-CoV-2 pathogenesis. Nat Rev Microbiol. 2022;20(5):270–284. doi: 10.1038/s41579-022-00713-0. [DOI] [PubMed] [Google Scholar]
  • 5.Mather A.E., Gilmour M.W., Reid S.W.J., French N.P. Foodborne bacterial pathogens: genome-based approaches for enduring and emerging threats in a complex and changing world. Nat Rev Microbiol. 2024;22(9):543–555. doi: 10.1038/s41579-024-01051-z. [DOI] [PubMed] [Google Scholar]
  • 6.Andersen K.G., Rambaut A., Lipkin W.I., Holmes E.C., Garry R.F. The proximal origin of SARS-CoV-2. Nat Med. 2020;26(4):450–452. doi: 10.1038/s41591-020-0820-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Yang H., Rao Z. Structural biology of SARS-CoV-2 and implications for therapeutic development. Nat Rev Microbiol. 2021;19(11):685–700. doi: 10.1038/s41579-021-00630-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Lin X., Jia Y., Heal R., Prokchorchik M., Sindalovskaya M., Olave-Achury A., et al. Solanum americanum genome-assisted discovery of immune receptors that detect potato late blight pathogen effectors. Nat Genet. 2023;55(9):1579–1588. doi: 10.1038/s41588-023-01486-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Torres Ascurra Y.C., Zhang L., Toghani A., Hua C., Rangegowda N.J., Posbeyikian A., et al. Functional diversification of a wild potato immune receptor at its center of origin. Science. 2023;381(6660):891–897. doi: 10.1126/science.adg5261. [DOI] [PubMed] [Google Scholar]
  • 10.Zarco-Tejada P.J., Camino C., Beck P.S.A., Calderon R., Hornero A., Hernández-Clemente R., et al. Previsual symptoms of Xylella fastidiosa infection revealed in spectral plant-trait alterations. Nat Plants. 2018;4(7):432–439. doi: 10.1038/s41477-018-0189-7. [DOI] [PubMed] [Google Scholar]
  • 11.Liu M., Wang F., He B., Hu J., Dai Y., Chen W., et al. Targeting Magnaporthe oryzae effector MoErs1 and host papain-like protease OsRD21 interaction to combat rice blast. Nat Plants. 2024;10(4):618–632. doi: 10.1038/s41477-024-01642-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Cruz-Mireles N., Osés-Ruiz M., Derbyshire P., Jégousse C., Ryder L.S., Bautista M.J.A., et al. The phosphorylation landscape of infection-related development by the rice blast fungus. Cell. 2024;187(10) 2557-73.e18. [Google Scholar]
  • 13.Xu S., Liu Y.-X., Cernava T., Wang H., Zhou Y., Xia T., et al. Fusarium fruiting body microbiome member Pantoea agglomerans inhibits fungal pathogenesis by targeting lipid rafts. Nat Microbiol. 2022;7(6):831–843. doi: 10.1038/s41564-022-01131-x. [DOI] [PubMed] [Google Scholar]
  • 14.Wang Y., Hou H., Luo H., Xun J., Ma C., Yang H., et al. iMeta Conference 2024: building an innovative scientific research ecosystem for microbiome and one health. iMeta. 2024;3(6) [Google Scholar]
  • 15.Rohr J.R., Barrett C.B., Civitello D.J., Craft M.E., Delius B., DeLeo G.A., et al. Emerging human infectious diseases and the links to global food production. Nat Sustain. 2019;2(6):445–456. doi: 10.1038/s41893-019-0293-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Michel A.L., Van Heerden H., Crossley B.M., Al Dahouk S., Prasse D., Rutten V. Pathogen detection and disease diagnosis in wildlife: challenges and opportunities. Rev Sci Tech. 2021;40(1):105–118. doi: 10.20506/rst.40.1.3211. [DOI] [PubMed] [Google Scholar]
  • 17.Khakimov A., Salakhutdinov I., Omolikov A., Utaganov S., editors. IOP conference series: earth and environmental science. IOP Publishing; 2022. Traditional and current-prospective methods of agricultural plant diseases detection: a review. [Google Scholar]
  • 18.Artika I.M., Dewi Y.P., Nainggolan I.M., Siregar J.E., Antonjaya U. Real-time polymerase chain reaction: current techniques, applications, and role in COVID-19 diagnosis. Genes. 2022;13(12):2387. doi: 10.3390/genes13122387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Rybicka M., Miłosz E., Bielawski K.P. Superiority of MALDI-TOF mass spectrometry over real-time PCR for SARS-CoV-2 RNA detection. Viruses. 2021;13(5):730. doi: 10.3390/v13050730. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Lee J., Song J.-U., Shim S.R. Comparing the diagnostic accuracy of rapid antigen detection tests to real time polymerase chain reaction in the diagnosis of SARS-CoV-2 infection: a systematic review and meta-analysis. J Clin Virol. 2021;144 [Google Scholar]
  • 21.Gao Y., Li D., Liu Y.X. Microbiome research outlook: past, present, and future. Protein Cell. 2023 [Google Scholar]
  • 22.Li S., Tong J., Liu Y., Shen W., Hu P. Targeted next generation sequencing is comparable with metagenomic next generation sequencing in adults with pneumonia for pathogenic microorganism detection. J Infect. 2022;85(5):e127–e129. doi: 10.1016/j.jinf.2022.08.022. [DOI] [PubMed] [Google Scholar]
  • 23.Wang Y., Xu X., Jia S., Qu M., Pei Y., Qiu S., et al. A global atlas and drivers of antimicrobial resistance in Salmonella during 1900-2023. Nat Commun. 2025;16(1):4611. doi: 10.1038/s41467-025-59758-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Lagier J.C., Dubourg G., Million M., Cadoret F., Bilen M., Fenollar F., et al. Culturing the human microbiota and culturomics. Nat Rev Microbiol. 2018;16:540–550. doi: 10.1038/s41579-018-0041-0. [DOI] [PubMed] [Google Scholar]
  • 25.Takeuchi T., Kubota T., Nakanishi Y., Tsugawa H., Suda W., Kwon A.T., et al. Gut microbial carbohydrate metabolism contributes to insulin resistance. Nature. 2023;621(7978):389–395. doi: 10.1038/s41586-023-06466-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Franzosa E.A., McIver L.J., Rahnavard G., Thompson L.R., Schirmer M., Weingart G., et al. Species-level functional profiling of metagenomes and metatranscriptomes. Nat Methods. 2018;15(11):962–968. doi: 10.1038/s41592-018-0176-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Eckburg P.B., Bik E.M., Bernstein C.N., Purdom E., Dethlefsen L., Sargent M., et al. Diversity of the human intestinal microbial flora. Science. 2005;308(5728):1635–1638. doi: 10.1126/science.1110591. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Hugon P., Dufour J.-C., Colson P., Fournier P.-E., Sallah K., Raoult D. A comprehensive repertoire of prokaryotic species identified in human beings. Lancet Infect Dis. 2015;15(10):1211–1219. doi: 10.1016/S1473-3099(15)00293-5. [DOI] [PubMed] [Google Scholar]
  • 29.Hill M.J., Drasar B.S. The normal colonic bacterial flora. Gut. 1975;16(4):318. doi: 10.1136/gut.16.4.318. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Zhang J., Liu Y.X., Guo X., Qin Y., Garrido-Oter R., Schulze-Lefert P., et al. High-throughput cultivation and identification of bacteria from the plant root microbiota. Nat Protoc. 2021;16(2):988–1012. doi: 10.1038/s41596-020-00444-7. [DOI] [PubMed] [Google Scholar]
  • 31.Cassir N., Benamar S., Khalil J.B., Croce O., Saint-Faust M., Jacquot A., et al. Clostridium butyricum strains and dysbiosis linked to necrotizing enterocolitis in preterm neonates. Clin Infect Dis. 2015;61(7):1107–1115. doi: 10.1093/cid/civ468. [DOI] [PubMed] [Google Scholar]
  • 32.Yang H., Wu J., Huang X., Zhou Y., Zhang Y., Liu M., et al. ABO genotype alters the gut microbiota by regulating GalNAc levels in pigs. Nature. 2022;606(7913):358–367. doi: 10.1038/s41586-022-04769-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Wallace R.J., Sasson G., Garnsworthy P.C., Tapio I., Gregson E., Bani P., et al. A heritable subset of the core rumen microbiome dictates dairy cow productivity and emissions. Sci Adv. 2019;5(7) [Google Scholar]
  • 34.Sahu K.P., Patel A., Kumar M., Sheoran N., Mehta S., Reddy B., et al. Integrated metabarcoding and culturomic-based microbiome profiling of rice phyllosphere reveal diverse and functional bacterial communities for blast disease suppression. Front Microbiol. 2021;12 [Google Scholar]
  • 35.Sahu K.P., Kumar A., Sakthivel K., Reddy B., Kumar M., Patel A., et al. Deciphering core phyllomicrobiome assemblage on rice genotypes grown in contrasting agroclimatic zones: implications for phyllomicrobiome engineering against blast disease. Environ microbiome. 2022;17(1):28. doi: 10.1186/s40793-022-00421-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Compant S., Cambon M.C., Vacher C., Mitter B., Samad A., Sessitsch A. The plant endosphere world – bacterial life within plants. Environ Microbiol. 2021;23(4):1812–1829. doi: 10.1111/1462-2920.15240. [DOI] [PubMed] [Google Scholar]
  • 37.Li M., Raza M., Song S., Hou L., Zhang Z.-F., Gao M., et al. Application of culturomics in fungal isolation from mangrove sediments. Microbiome. 2023;11(1):272. doi: 10.1186/s40168-023-01708-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Derrien M., Vaughan E.E., Plugge C.M., de Vos W.M. Akkermansia muciniphila gen. nov., sp. Nov., a human intestinal mucin-degrading bacterium. Int J Syst Evol Microbiol. 2004;54(5):1469–1476. doi: 10.1099/ijs.0.02873-0. [DOI] [PubMed] [Google Scholar]
  • 39.Morotomi M., Nagai F., Watanabe Y. Description of Christensenella minuta gen. nov., sp. Nov., isolated from human faeces, which forms a distinct branch in the order Clostridiales, and proposal of Christensenellaceae fam. nov. Int J Syst Evol Microbiol. 2012;62(1):144–149. doi: 10.1099/ijs.0.026989-0. [DOI] [PubMed] [Google Scholar]
  • 40.Dubourg G., Cornu F., Edouard S., Battaini A., Tsimaratos M., Raoult D. First isolation of <em>Akkermansia muciniphila</em> in a blood-culture sample. Clin Microbiol Infection. 2017;23(9):682–683. [Google Scholar]
  • 41.Lagier J.-C., Dubourg G., Million M., Cadoret F., Bilen M., Fenollar F., et al. Culturing the human microbiota and culturomics. Nat Rev Microbiol. 2018;16(9):540–550. doi: 10.1038/s41579-018-0041-0. [DOI] [PubMed] [Google Scholar]
  • 42.Armetta J., Li S.S., Vaaben T.H., Vazquez-Uribe R., Sommer M.O.A. Metagenome-guided culturomics for the targeted enrichment of gut microbes. Nat Commun. 2025;16(1):663. doi: 10.1038/s41467-024-55668-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Wu-Woods N.J., Barlow J.T., Trigodet F., Shaw D.G., Romano A.E., Jabri B., et al. Microbial-enrichment method enables high-throughput metagenomic characterization from host-rich samples. Nat Methods. 2023;20(11):1672–1682. doi: 10.1038/s41592-023-02025-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Zhou H., Larkin P.M.K., Zhao D., Ma Q., Yao Y., Wu X., et al. Clinical impact of metagenomic next-generation sequencing of bronchoalveolar lavage in the diagnosis and management of pneumonia: a multicenter prospective observational study. J Mol Diagn. 2021;23(10):1259–1268. doi: 10.1016/j.jmoldx.2021.06.007. [DOI] [PubMed] [Google Scholar]
  • 45.Yang A., Chen C., Hu Y., Zheng G., Chen P., Xie Z., et al. Application of metagenomic next-generation sequencing (mNGS) using bronchoalveolar lavage fluid (BALF) in diagnosing pneumonia of children. Microbiol Spectr. 2022;10(5) 22. [Google Scholar]
  • 46.Chen H., Yin Y., Gao H., Guo Y., Dong Z., Wang X., et al. Clinical utility of In-house metagenomic next-generation sequencing for the diagnosis of lower respiratory tract infections and analysis of the host immune response. Clin Infect Dis. 2020;71(Supplement_4):S416–S426. doi: 10.1093/cid/ciaa1516. [DOI] [PubMed] [Google Scholar]
  • 47.Gu W., Miller S., Chiu C.Y. Clinical metagenomic next-generation sequencing for pathogen detection. Annu Rev Pathol. 2019;14(14):319–338. doi: 10.1146/annurev-pathmechdis-012418-012751. 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Baillie V.L., Madhi S.A., Ahyong V., Olwagen C.P. Metagenomic sequencing of post-mortem tissue samples for the identification of pathogens associated with neonatal deaths. Nat Commun. 2023;14(1):5373. doi: 10.1038/s41467-023-40958-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Wang C., You Z., Fu J., Chen S., Bai D., Zhao H., et al. Application of metagenomic next-generation sequencing in the diagnosis of pulmonary invasive fungal disease. Front Cell Infect Microbiol. 2022:12–2022. [Google Scholar]
  • 50.He W.-T., Hou X., Zhao J., Sun J., He H., Si W., et al. Virome characterization of game animals in China reveals a spectrum of emerging pathogens. Cell. 2022;185(7) 1117-29.e8. [Google Scholar]
  • 51.Wang Y., Hu Y., Gao G.F. Combining metagenomics and metatranscriptomics to study human, animal and environmental resistomes. Med Microecol. 2020;3 [Google Scholar]
  • 52.Rong L., Zhang B., Qiu H., Zhang H., Yu J., Yuan Q., et al. Significant generational effects of tetracyclines upon the promoting plasmid-mediated conjugative transfer between typical wastewater bacteria and its mechanisms. Water Res. 2025 [Google Scholar]
  • 53.Wu L., Luo H., Xu J., Yu L., Xiong J., Liu Y., et al. Vital role of CYP450 in the biodegradation of antidiabetic drugs in the aerobic activated sludge system and the mechanisms. J Hazard Mater. 2024;469 [Google Scholar]
  • 54.Carrión V.J., Perez-Jaramillo J., Cordovez V., Tracanna V., de Hollander M., Ruiz-Buck D., et al. Pathogen-induced activation of disease-suppressive functions in the endophytic root microbiome. Science. 2019;366(6465):606–612. doi: 10.1126/science.aaw9285. [DOI] [PubMed] [Google Scholar]
  • 55.Pereira-Marques J., Hout A., Ferreira R.M., Weber M., Pinto-Ribeiro I., van Doorn L.J., et al. Impact of host DNA and sequencing depth on the taxonomic resolution of whole metagenome sequencing for microbiome analysis. Front Microbiol. 2019;10:1277. doi: 10.3389/fmicb.2019.01277. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Gu W., Deng X., Lee M., Sucu Y.D., Arevalo S., Stryke D., et al. Rapid pathogen detection by metagenomic next-generation sequencing of infected body fluids. Nat Med. 2021;27(1):115–124. doi: 10.1038/s41591-020-1105-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Wang Y., Yang J., Hou H., Song L., Cheng X., Liu Y. -X. Advancing Plant Microbiome Research Through Host DNA Depletion Techniques. Plant Biotechnology Journal. 2025 [Google Scholar]
  • 58.Rehm H.L. Disease-targeted sequencing: a cornerstone in the clinic. Nat Rev Genet. 2013;14(4):295–300. doi: 10.1038/nrg3463. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Kumar K.R., Cowley M.J., Davis R.L., editors. Seminars in thrombosis and hemostasis. Thieme Medical Publishers; 2024. Next-generation sequencing and emerging technologies. [Google Scholar]
  • 60.Avivi M.Y., Touitou N., Rohana H., Lerrer B., Shav-Tal Y., Peretz A., et al. Nucleic acid hybridization-based detection of pathogenic RNA using microscale thermophoresis. J Biol Chem. 2024;300(3) [Google Scholar]
  • 61.Zheng Z., Liebers M., Zhelyazkova B., Cao Y., Panditi D., Lynch K.D., et al. Anchored multiplex PCR for targeted next-generation sequencing. Nat Med. 2014;20(12):1479–1484. doi: 10.1038/nm.3729. [DOI] [PubMed] [Google Scholar]
  • 62.Klindworth A., Pruesse E., Schweer T., Peplies J., Quast C., Horn M., et al. Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies. Nucleic Acids Res. 2012;41(1):e1–e. doi: 10.1093/nar/gks808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Viana R., Moyo S., Amoako D.G., Tegally H., Scheepers C., Althaus C.L., et al. Rapid epidemic expansion of the SARS-CoV-2 Omicron variant in southern Africa. Nature. 2022;603(7902):679–686. doi: 10.1038/s41586-022-04411-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Baaijens J.A., Zulli A., Ott I.M., Nika I., van der Lugt M.J., Petrone M.E., et al. Lineage abundance estimation for SARS-CoV-2 in wastewater using transcriptome quantification techniques. Genome Biol. 2022;23(1):236. doi: 10.1186/s13059-022-02805-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Hoang V.T., Assoumani L., Delerce J., Houhamdi L., Bedotto M., Lagier J.-C., et al. Introduction of the SARS-CoV-2 Beta variant from Comoros into the Marseille geographical area. Trav Med Infect Dis. 2022;46 [Google Scholar]
  • 66.Gandhi S., Klein J., Robertson A.J., Peña-Hernández M.A., Lin M.J., Roychoudhury P., et al. De novo emergence of a remdesivir resistance mutation during treatment of persistent SARS-CoV-2 infection in an immunocompromised patient: a case report. Nat Commun. 2022;13(1):1547. doi: 10.1038/s41467-022-29104-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Liu Y.-X., Chen L., Ma T., Li X., Zheng M., Zhou X., et al. EasyAmplicon: an easy-to-use, open-source, reproducible, and community-based pipeline for amplicon data analysis in microbiome research. iMeta. 2023;2(1) [Google Scholar]
  • 68.Yousuf S., Luo H., Zeng M., Chen L., Ma T., Li X., et al. Unveiling microbial communities with EasyAmplicon: a user-centric guide to perform amplicon sequencing data analysis. iMetaOmics. 2024;1(2) [Google Scholar]
  • 69.Luo H, Bai D., Zhu Z., Yousuf S., Yang H., Xun J., et al. EasyAmplicon 2: Expanding PacBio and Nanopore Long Amplicon Sequencing Analysis Pipeline for Microbiome. Advanced Science. 2025 [Google Scholar]
  • 70.Chen L., Zhou J., Zeng T., Miao Y.-F., Mei L., Yao G.-B., et al. Quantifying the sharing of foliar fungal pathogens by the invasive plant Ageratina adenophora and its neighbours. New Phytol. 2020;227(5):1493–1504. doi: 10.1111/nph.16624. [DOI] [PubMed] [Google Scholar]
  • 71.Ping X., Khan R.A.A., Chen S., Jiao Y., Zhuang X., Jiang L., et al. Deciphering the role of rhizosphere microbiota in modulating disease resistance in cabbage varieties. Microbiome. 2024;12(1):160. doi: 10.1186/s40168-024-01883-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Gao M., Xiong C., Gao C., Tsui C.K.M., Wang M.-M., Zhou X., et al. Disease-induced changes in plant microbiome assembly and functional adaptation. Microbiome. 2021;9(1):187. doi: 10.1186/s40168-021-01138-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Cowman S., van Ingen J., Griffith D.E., Loebinger M.R. Non-tuberculous mycobacterial pulmonary disease. Eur Respir J. 2019;54(1) [Google Scholar]
  • 74.van Ingen J. Microbiological diagnosis of nontuberculous mycobacterial pulmonary disease. Clin Chest Med. 2015;36(1):43–54. doi: 10.1016/j.ccm.2014.11.005. [DOI] [PubMed] [Google Scholar]
  • 75.Yin Y., Zhu P., Guo Y., Li Y., Chen H., Liu J., et al. Enhancing lower respiratory tract infection diagnosis: implementation and clinical assessment of multiplex PCR-based and hybrid capture-based targeted next-generation sequencing. EBioMedicine. 2024;107 [Google Scholar]
  • 76.Das J., Ivanov I., Sargent E.H., Kelley S.O. DNA clutch probes for circulating tumor DNA analysis. J Am Chem Soc. 2016;138(34):11009–11016. doi: 10.1021/jacs.6b05679. [DOI] [PubMed] [Google Scholar]
  • 77.Chen W., Xu H., Dai S., Wang J., Yang Z., Jin Y., et al. Detection of low-frequency mutations in clinical samples by increasing mutation abundance via the excision of wild-type sequences. Nat Biomed Eng. 2023;7(12):1602–1613. doi: 10.1038/s41551-023-01072-8. [DOI] [PubMed] [Google Scholar]
  • 78.Ellis P., Moore L., Sanders M.A., Butler T.M., Brunner S.F., Lee-Six H., et al. Reliable detection of somatic mutations in solid tissues by laser-capture microdissection and low-input DNA sequencing. Nat Protoc. 2021;16(2):841–871. doi: 10.1038/s41596-020-00437-6. [DOI] [PubMed] [Google Scholar]
  • 79.Schmitz J.E., Stratton C.W., Persing D.H., Tang Y.-W. Forty years of molecular diagnostics for infectious diseases. J Clin Microbiol. 2022;60(10) 21. [Google Scholar]
  • 80.Guo J., Starr D., Guo H. Classification and review of free PCR primer design software. Bioinformatics. 2020;36(22–23):5263–5268. [Google Scholar]
  • 81.Wingo T.S., Kotlar A., Cutler D.J. MPD: Multiplex primer design for next-generation targeted sequencing. BMC Bioinf. 2017;18(1):14. [Google Scholar]
  • 82.Ye J., Coulouris G., Zaretskaya I., Cutcutache I., Rozen S., Madden T.L. Primer-BLAST: a tool to design target-specific primers for polymerase chain reaction. BMC Bioinf. 2012;13:134. [Google Scholar]
  • 83.Wang K., Li H., Xu Y., Shao Q., Yi J., Wang R., et al. MFEprimer-3.0: quality control for PCR primers. Nucleic Acids Res. 2019;47(W1) W610-w3. [Google Scholar]
  • 84.Hendling M., Pabinger S., Peters K., Wolff N., Conzemius R., Barišic I. Oli2go: an automated multiplex oligonucleotide design tool. Nucleic Acids Res. 2018;46(W1) W252-w6. [Google Scholar]
  • 85.Hugerth L.W., Wefer H.A., Lundin S., Jakobsson H.E., Lindberg M., Rodin S., et al. DegePrime, a program for degenerate primer design for broad-taxonomic-range PCR in microbial ecology studies. Appl Environ Microbiol. 2014;80(16):5116–5123. doi: 10.1128/AEM.01403-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Tournayre O., Tian H., Lougheed D.R., Windle M.J.S., Lambert S., Carter J., et al. Enhancing metabarcoding of freshwater biotic communities: a new online tool for primer selection and exploring data from 14 primer pairs. Environ DNA. 2024;6(4) [Google Scholar]
  • 87.McNichol J., Berube P.M., Biller S.J., Fuhrman J.A. Evaluating and improving small subunit rRNA PCR primer coverage for bacteria, archaea, and eukaryotes using metagenomes from global Ocean surveys. mSystems. 2021;6(3) doi: 10.1128/msystems.00565-21. [DOI] [Google Scholar]
  • 88.Chukwuemeka P.O., Umar H.I., Olukunle O.F., Oretade O.M., Olowosoke C.B., Akinsola E.O., et al. In silico design and validation of a highly degenerate primer pair: a systematic approach. J Genet Eng Biotechnol. 2020;18(1):72. doi: 10.1186/s43141-020-00086-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Zhang D.Y., Chen S.X., Yin P. Optimizing the specificity of nucleic acid hybridization. Nat Chem. 2012;4(3):208–214. doi: 10.1038/nchem.1246. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Rejali N.A., Moric E., Wittwer C.T. The effect of single mismatches on primer extension. Clin Chem. 2018;64(5):801–809. doi: 10.1373/clinchem.2017.282285. [DOI] [PubMed] [Google Scholar]
  • 91.Linhart C., Shamir R. The degenerate primer design problem: theory and applications. J Comput Biol. 2005;12(4):431–456. doi: 10.1089/cmb.2005.12.431. [DOI] [PubMed] [Google Scholar]
  • 92.Clarke E.L., Sundararaman S.A., Seifert S.N., Bushman F.D., Hahn B.H., Brisson D. Swga: a primer design toolkit for selective whole genome amplification. Bioinformatics. 2017;33(14):2071–2077. doi: 10.1093/bioinformatics/btx118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Klindworth A., Pruesse E., Schweer T., Peplies J., Quast C., Horn M., et al. Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies. Nucleic Acids Res. 2013;41(1) [Google Scholar]
  • 94.Brownie J., Shawcross S., Theaker J., Whitcombe D., Ferrie R., Newton C., et al. The elimination of primer-dimer accumulation in PCR. Nucleic Acids Res. 1997;25(16):3235–3241. doi: 10.1093/nar/25.16.3235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Garafutdinov R.R., Galimova A.A., Sakhabutdinova A.R. The influence of quality of primers on the formation of primer dimers in PCR. Nucleosides Nucleotides Nucleic Acids. 2020;39(9):1251–1269. doi: 10.1080/15257770.2020.1803354. [DOI] [PubMed] [Google Scholar]
  • 96.Linhart C., Shamir R. Degenerate primer design: theoretical analysis and the HYDEN program. Methods Mol Biol. 2007;402:221–244. doi: 10.1007/978-1-59745-528-2_11. [DOI] [PubMed] [Google Scholar]
  • 97.Hysom D.A., Naraghi-Arani P., Elsheikh M., Carrillo A.C., Williams P.L., Gardner S.N. Skip the alignment: degenerate, multiplex primer and probe design using K-mer matching instead of alignments. PLoS One. 2012;7(4) [Google Scholar]
  • 98.Xia H., Zhang Z., Luo C., Wei K., Li X., Mu X., et al. MultiPrime: a reliable and efficient tool for targeted next-generation sequencing. iMeta. 2023;2(4) [Google Scholar]
  • 99.Narayanan S., Aakur S.N., Ramamurthy P., Bagavathi A., Ramnath V., Ramachandran A. Scalable pathogen detection from next generation DNA sequencing with deep learning. arXiv preprint arXiv:221200015. 2022 [Google Scholar]
  • 100.Ashdown G.W., Dimon M., Fan M., Sánchez-Román Terán F., Witmer K., Gaboriau D.C.A., et al. A machine learning approach to define antimalarial drug action from heterogeneous cell-based screens. Sci Adv. 2020;6(39) [Google Scholar]
  • 101.Boeck L., Burbaud S., Skwark M., Pearson W.H., Sangen J., Wuest A.W., et al. Mycobacterium abscessus pathogenesis identified by phenogenomic analyses. Nat Microbiol. 2022;7(9):1431–1441. doi: 10.1038/s41564-022-01204-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Wong F., de la Fuente-Nunez C., Collins J.J. Leveraging artificial intelligence in the fight against infectious diseases. Science. 2023;381(6654):164–170. doi: 10.1126/science.adh1114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Zhou Y., Ren M., Zhang P., Jiang D., Yao X., Luo Y., et al. Application of nanopore sequencing in the detection of foodborne microorganisms. Nanomaterials. 2022;12(9):1534. doi: 10.3390/nano12091534. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Li Y., Lyu H., Yang H., Ju Z., Ma C., Hou H., et al. FoodMicroDB: a microbiome database for composition and time-series research in food. iMetaOmics. 2024;1(2) [Google Scholar]
  • 105.Zhang T., Li H., Jiang M., Hou H., Gao Y., Li Y., et al. Nanopore sequencing: flourishing in its teenage years. J Genet Genom. 2024;51(12):1361–1374. [Google Scholar]
  • 106.Tan P., Chen X., Zhang H., Wei Q., Luo K. Artificial intelligence aids in development of nanomedicines for cancer management. Semin Cancer Biol. 2023;89:61–75. doi: 10.1016/j.semcancer.2023.01.005. [DOI] [PubMed] [Google Scholar]
  • 107.Wang Y., Lyu N., Liu F., Liu W.J., Bi Y., Zhang Z., et al. More diversified antibiotic resistance genes in chickens and workers of the live poultry markets. Environ Int. 2021;153 [Google Scholar]
  • 108.Kini K., Agnimonhan R., Dossa R., Silué D., Koebnik R. A diagnostic multiplex PCR scheme for identification of plant-associated bacteria of the genus Pantoea. bioRxiv. 2018 [Google Scholar]
  • 109.Arinichev I.V., Arinicheva I.V., Volkova G.V., Yakhnik Y.V. AI solutions for digital diagnostics of grain crop diseases (based on the example of Pyrenophora teres in Winter barley) Russ Agric Sci. 2024;50(2):207–211. [Google Scholar]
  • 110.Han D., Yu F., Zhang D., Hu J., Zhang X., Xiang D., et al. Molecular rapid diagnostic testing for bloodstream infections: nanopore targeted sequencing with pathogen-specific primers. J Infect. 2024;88(6) [Google Scholar]
  • 111.Hunjan M. University of Waterloo; 2023. AlignDx: enabling automated, cloud-based workflows for streamlined bioinformatic-focused pathogen surveillance. [Google Scholar]
  • 112.Sayers E.W., Bolton E.E., Brister J.R., Canese K., Chan J., Comeau Donald C., et al. Database resources of the National Center for Biotechnology Information in 2023. Nucleic Acids Res. 2022;51(D1):D29–D38. [Google Scholar]
  • 113.Langmead B., Salzberg S.L. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9(4):357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Gao Y., Luo H., Lyu H., Yang H., Yousuf S., Huang S., et al. Benchmarking short-read metagenomics tools for removing host contamination. GigaScience. 2025;14 [Google Scholar]
  • 115.Danecek P., Bonfield J.K., Liddle J., Marshall J., Ohan V., Pollard M.O., et al. Twelve years of SAMtools and BCFtools. GigaScience. 2021;10(2) [Google Scholar]
  • 116.Wood D.E., Lu J., Langmead B. Improved metagenomic analysis with Kraken 2. Genome Biol. 2019;20(1):257. doi: 10.1186/s13059-019-1891-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Li D., Liu C.-M., Luo R., Sadakane K., Lam T.-W. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics. 2015;31(10):1674–1676. doi: 10.1093/bioinformatics/btv033. [DOI] [PubMed] [Google Scholar]
  • 118.Prjibelski A., Antipov D., Meleshko D., Lapidus A., Korobeynikov A. Using SPAdes de novo assembler. Curr Protoc Bioinformatics. 2020;70(1) [Google Scholar]
  • 119.Bai D., Chen T., Xun J., Ma C., Luo H., Yang H., et al. EasyMetagenome: a user-friendly and flexible pipeline for shotgun metagenomic analysis in microbiome research. iMeta. 2025;4(1) [Google Scholar]
  • 120.Ye J., Coulouris G., Zaretskaya I., Cutcutache I., Rozen S., Madden T.L. Primer-BLAST: a tool to design target-specific primers for polymerase chain reaction. BMC Bioinf. 2012;13(1):134. [Google Scholar]
  • 121.Amos B., Aurrecoechea C., Barba M., Barreto A., Basenko Evelina Y., Bażant W., et al. VEuPathDB: the eukaryotic pathogen, vector and host bioinformatics resource center. Nucleic Acids Res. 2021;50(D1):D898–D911. [Google Scholar]
  • 122.Hu X., Hurtado-Gonzales O.P., Adhikari B.N., French-Monar R.D., Malapi M., Foster J.A., et al. PhytoPipe: a phytosanitary pipeline for plant pathogen detection and diagnosis using RNA-seq data. BMC Bioinf. 2023;24(1):470. [Google Scholar]
  • 123.Xuan J., Yu Y., Qing T., Guo L., Shi L. Next-generation sequencing in the clinic: promises and challenges. Cancer Lett. 2013;340(2):284–295. doi: 10.1016/j.canlet.2012.11.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 124.Ulahannan D., Kovac M.B., Mulholland P.J., Cazier J.B., Tomlinson I. Technical and implementation issues in using next-generation sequencing of cancers in clinical practice. Br J Cancer. 2013;109(4):827–835. doi: 10.1038/bjc.2013.416. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Forde B.M., O'Toole P.W. Next-generation sequencing technologies and their impact on microbial genomics. Brief Funct Genomics. 2013;12(5):440–453. doi: 10.1093/bfgp/els062. [DOI] [PubMed] [Google Scholar]
  • 126.Boers S.A., Jansen R., Hays J.P. Understanding and overcoming the pitfalls and biases of next-generation sequencing (NGS) methods for use in the routine clinical microbiological diagnostic laboratory. Eur J Clin Microbiol Infect Dis. 2019;38(6):1059–1070. doi: 10.1007/s10096-019-03520-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data availability is not applicable to this article, as no new data were created or analyzed in this study.


Articles from aBIOTECH are provided here courtesy of Elsevier

RESOURCES