Abstract
Bacterial pathogens are one of the major threats to biosafety and environmental health, and advanced assessment is a prerequisite to combating bacterial pathogens. Currently, 16S rRNA gene sequencing is efficient in the open‐view detection of bacterial pathogens. However, the taxonomic resolution and applicability of this method are limited by the domain‐specific pathogen database, taxonomic profiling method, and sequencing target of 16S variable regions. Here, we present a pipeline of multiple bacterial pathogen detection (MBPD) to identify the animal, plant, and zoonotic pathogens. MBPD is based on a large, curated database of the full‐length 16S genes of 1986 reported bacterial pathogen species covering 72,685 sequences. In silico comparison allowed MBPD to provide the appropriate similarity threshold for both full‐length and variable‐region sequencing platforms, while the subregion of V3−V4 (mean: 88.37%, accuracy rate compared to V1−V9) outperformed other variable regions in pathogen identification compared to full‐length sequencing. Benchmarking on real data sets suggested the superiority of MBPD in a broader range of pathogen detections compared with other methods, including 16SPIP and MIP. Beyond detecting the known causal agent of animal, human, and plant diseases, MBPD is capable of identifying cocontaminating pathogens from biological and environmental samples. Overall, we provide a MBPD pipeline for agricultural, veterinary, medical, and environmental monitoring to achieve One Health.
Keywords: bacterial pathogen detection, 16S rRNA gene sequencing, One Health
Multiple bacterial pathogen detection (MBPD) provides the accurate and comprehensive detection of bacterial pathogens from biological and environmental samples based on 16S rRNA gene sequencing. By constructing a relatively complete database of 72,685 sequences for wide identification across a broad range of bacterial pathogens causing animal, plant, and zoonotic diseases, MBPD can provide the appropriate threshold for common variable regions of 16S rRNA gene in taxonomy alignment based on the recommendation of in silico experiment. MBPD is freely available at https://github.com/LorMeBioAI/MBPDLorMeBioAI/MBPD.

Highlights
Multiple bacterial pathogen detection (MBPD) is a 16S‐based pipeline for detecting animal, plant, and zoonotic bacterial pathogens.
The curated reference database of MBPD contains 72,685 sequences from 1986 known pathogen species.
MBPD provides appropriate thresholds for taxonomic profiling of common 16S variable regions.
MBPD outperforms other methods in detecting both causal and coinfecting pathogens.
INTRODUCTION
Pathogenic infections are a formidable challenge in One Health, which is an approach that recognizes that the health of people is closely connected to the health of animals, plants, and our shared environment [1]. During the past two decades, approximately 15 million global deaths per year have been attributed to infectious diseases [2], and approximately 30% of global food production is lost to plant diseases every year [3]. Today, the serious damage to humans and the global economy caused by COVID‐19, the risk of increased zoonotic host diversity, and the pandemic of plant pathogenic bacteria indicate the importance of whole‐habitat (human‐animal‐plant‐environment) detection of pathogens and surveillance for disease management and ecological health [4–6]. In addition, coinfection is common in both natural and agricultural environments [7]. The main problem is that coinfecting pathogens interact synergistically with each other, with the presence of one enhancing the abundance and/or virulence of the other [8]. However, there is no reliable means of detecting coinfection [9, 10]. Many national/international organizations, such as the National Center for Biotechnology Information (NCBI), World Health Organization, and International Society of Plant Pathology Committee on the Taxonomy of Plant Pathogenic Bacteria (ISPP‐CTPPB), have been gathering information on pathogens, but the information is fragmentary and unable to satisfy the requirement of a holistic view of pathogen infections/contamination in biological and environmental samples for implementation of the One Health approach [11].
Pathogen detection is mainly categorized into culture‐dependent and culture‐independent approaches [12]. The former approaches relying on cultivation and biochemical testing are classic experiments for pathogen detection, which provide critical information, including the ability to satisfy Koch's postulates, drug susceptibility, and biochemical tests, but are highly time‐consuming and limited by the culturability of pathogens [13]. Culture‐independent approaches include PCR‐ and high‐throughput sequencing‐based methods. PCR or probe microarrays reduce time in pathogen identification but are overreliant on the specificity and number of primers and prone to false‐positive rates in pathogen detection [14, 15]. Common high‐throughput sequencing‐based methods involve 16S/ITS/18S/rpoB/cpn60 amplicon sequencing and metagenomic sequencing [16]. Given the reported pathogen sequences, sequencing methods are capable of identifying various pathogens within almost all types of microorganisms in the samples [17]. Compared with culture‐dependent and PCR methods, the main benefit of sequencing methods is being high throughput, but the matching analysis pipelines need to be improved to achieve high sensitivity [18].
16S rRNA gene sequencing is widely used in bacterial pathogen detection in animal, food, water, soil, and plant samples [16, 19–23]. However, the scope and sensitivity of pathogen detection using 16S sequencing are highly dependent on the sequencing methods, hypervariable regions sequenced, pathogen database, and analytic pipelines [9, 18]. Short‐read sequencing is high‐throughput and cost‐effective but has limitations in read length [18]. Long‐read sequencing is able to generate long reads for high‐resolution taxonomy but has a limitation of sequencing error rates of approximately 13% [17]. Additionally, the composition and abundance of pathogens can be affected by the use of different bioinformatics tools [12]. For example, the common approaches to data processing include operational taxonomic unit (OTU) clustering or amplicon sequence variant (ASV) approaches [24]. Compared with the OTU‐based approach, the ASV‐based approach has the advantages of fewer sequencing errors and more accurate taxonomic resolution [25, 26]. Moreover, the outcome of pathogen detection fluctuates depending on the reference database, sequencing region, and similarity threshold [27, 28]. Several pipelines for pathogen identification have been developed based on 16S rRNA amplicon data, such as 16SPIP [29], MIP [30], and Krishna [31], but they only target pathogens in relation to human infection or water quality [29–31]. The widespread distribution and transmission of pathogens among multiple hosts and environments require a tool for systematic and rapid detection of pathogens suitable for all habitats to provide a basis for One Health diagnosis and therapy of multiple pathogens that coexist in humans, animals, and plants [1, 32].
In this study, we developed a bacterial pathogen detection pipeline, multiple bacterial pathogen detection (MBPD), to accurately detect pathogens based on 16S rRNA gene sequencing. To comprehensively identify pathogens in samples and meet the demand for accurate pathogen identification under different variable regions in 16S rRNA sequencing, we constructed a curated full‐length 16S database matching the DADA2 pipeline [25] of the ASV‐based approach for data processing and evaluated the optimal similarity based on in silico experiments. MBPD is able to detect a broad range of animal, plant, and zoonotic pathogens, which is essential for food safety, epidemic prevention, disease diagnosis, and environmental monitoring. MBPD is publicly available at https://github.com/LorMeBioAI/MBPD.
RESULTS
Overview of MBPD
We developed the MBPD approach for holistic pathogen detection based on 16S sequencing. The input of MBPD is 16S rRNA amplicon data without primers and barcodes, and the output is an ASV table with pathogen taxonomy. The pipeline consists of the following steps: (1) building the MBPD database based on the alignment between collected pathogen information and the Silva 16S database; (2) use of the MBPD database in in silico and sequence‐based experiments to explore the optimal similarity and the accuracy of species/strain annotation using the UCLUST algorithm for the different variable regions of 16S; (3) use of the DADA2 tool for the conversion from 16S rRNA amplicon data to ASV sequences; (4) alignment of ASV sequences with the MBPD database using an appropriate similarity threshold based on various sequencing regions, where 90% is recommended for the V4 and V1−V2 regions, and 80% is recommended for other variable regions (Figure 1).
Figure 1.

MBPD workflow for open‐view bacterial pathogen detection based on 16S rRNA‐encoding gene sequencing. Bacterial pathogens causing animal, plant, and zoonotic diseases were first collected and curated from publicly available literature, databases, and web resources to construct the multiple bacterial pathogen database (MBPD) with a total of 72,685 full‐length sequences of 16S. Then, the accuracy and appropriate threshold of each variable region of 16S were evaluated through an in silico experiment. The clean data from 16S sequencing of bio/eco samples were subjected to the DADA2 pipeline using the MBPD database to obtain the amplicon sequence variant (ASV) sequences. The feature table of the ASV sequences was assigned using the UCLUST algorithm with an in silico experiment to optimize the cutoffs of similarity for the different variable regions of 16S. MBPD, multiple bacterial pathogen detection.
A large curated pathogen reference database of 72,685 16S sequences for MBPD
To construct a comprehensive database of bacterial pathogens causing animal, plant, and zoonotic diseases, we collected a total of 1986 pathogen species from public agencies, databases, and papers (Supporting Information: Tables S1 and S2). In general, they mainly belonged to four phyla, namely, Proteobacteria (758), Firmicutes (558), Actinobacteria (388), and Bacteroidetes (123) (Figure 2A). Then, we extracted sequences corresponding to those pathogens at the species or strain level from the Silva database, called the MBPD database, to improve data compatibility and better identify pathogen diversity. Briefly, the MBPD database is primarily composed of Proteobacteria (39,582), Firmicutes (24,284), Actinobacteria (4660), and Bacteroidetes (1449) at the phylum level (Figure 2B) and Bacillus (6945), Escherichia‐Shigella (6187), Salmonella (5251), and Pseudomonas (4360) at the genus level (Figure 2C). In addition, a high number of these pathogens originated from the “zoonotic” group (35,493 out of 72,685, 48.9%) and “animal” group (33,832 out of 72,685, 46.5%) compared to the “plant” group (3360 out of 72,685, 4.6%) (Figure 2B and Supporting Information: Figure S4). Details of the sequence number in each species are presented in Supporting Information: Table S5.
Figure 2.

The curated bacterial pathogen database of MBPD. (A) Phylogenetic tree based on the 16S rRNA gene sequences of 1986 reference pathogens. Colors of branches represent the corresponding phyla, and the outer ring denotes animal, plant, and zoonotic types of pathogens. (B) Basic taxonomic composition of the MBPD database at the phylum level. Only the top 10 phyla are shown in the figure, and the remaining phyla are merged as others. (C) Basic taxonomic composition of the MBPD database at the genus level. Only the top 10 genera are shown in the figure.
In silico experiments under different thresholds for sequencing platforms and amplicon targets
The 16S rRNA gene comprises nine variable regions interspersed throughout the highly conserved 16S sequence (Figure 3A). To provide better detection efficiency for various sequencing platforms and amplicon targets, we conducted an in silico experiment to compare the missing rate, accuracy rate, and runtime under different thresholds (Figure 3, Supporting Information: Figures S1 and S2). We found that subregions differed substantially in the extent to which they could match the PCR primer. The V1−V2, V1−V3, and V6−V9 regions performed worst, with 29.3%−33.2% of in silico amplicons unmatched (Supporting Information: Figure S2A), mainly belonging to three phyla (Proteobacteria, Firmicutes, and Actinobacteria) (Figure 3B and Supporting Information: Figure S1). Meanwhile, certain pathogen genera were probably unstably detected by the sequencing of variable regions with a high missing rate by in silico amplicons (Figure 3B). For example, phytopathogenic Candidatus Phytoplasma spp. is highly missing in the V1−V2, V1−V3, and V6−V9 regions, while human pathogenic Mycoplasma spp. is not well assigned in V3−V5, V4−V5, and V5−V7 (Figure 3B). Next, various sequencing regions showed a distinct discrepancy in the accuracy of pathogen classification (Supporting Information: Figure S2B). The V1−V9 region (full‐length) performed best (analysis of variance [ANOVA]: F 8,1341 = 183.9, p < 0.001, Supporting Information: Figure S2B), and the subregions of V1−V3 and V3−V4 were second to V1−V9 after eliminating the difference in the similarity threshold at the sequence level (ANOVA: F 7,1192 = 174.7, p < 0.001, Figure 3E). Above all, the choice of similarity threshold substantially affected the accuracy of pathogen identification and runtime. In terms of accuracy, most subregions performed well at the 80% similarity threshold, but the V4 and V1−V2 regions performed best at the 90% threshold (Figure 3C). Although the similarity threshold of 80% is recommended for both full‐length and V3−V4 sequencing approaches to obtain the optimal accuracy of bacterial pathogen detection, it took more processing time at the 80% similarity threshold than at other thresholds (Figure 3D).
Figure 3.

In silico comparison of 16S rRNA variable regions. (A) Common sequencing target of variable regions in Illumina and full‐length sequencing. The 16S full‐length database of MBPD was trimmed and generated in silico amplicons for different subregions based on the location of PCR primers commonly used in microbiome studies. Then, 10,000 sequences were extracted and repeated 30 times for pathogen alignment. (B) Missing rate of pathogens across bacterial genera under in silico experiments. Various facets denote the sequencing target of 16S variable regions (V). Colors denote the missing pathogen at the phylum level. C‐Phytoplasma and BCP refer to Candidatus Phytoplasma and Burkholderia‐Caballeronia‐Paraburkholderia, respectively. (C) Accuracy rate of taxonomy assignment for in silico amplicons of the MBPD database with varying similarity thresholds (80%, 90%, 95%, 97%, and 99%) for different variable regions of 16S. (D) Runtime of taxonomy assignment for in silico amplicons with varying similarity thresholds for different variable regions of 16S. (E) Accuracy rate of the subregion in pathogen detection compared to the V1−V9 region. Different lowercase letters indicate significant differences in disease incidence across varieties (HSD post hoc test: p < 0.05). HSD, honest significant difference.
MBPD outperformed other approaches in pathogen detection
We selected 16S data of 50 patient skin samples sourced from Lax et al. [33] to compare MBPD with two published similar pipelines (16SPIP [29] and MIP [30]) using the same data set. For MBPD, we used the similarity threshold of 90% in the process of pathogen alignment based on the results of the in silico experiment (Figure 3C) because they all amplified the V4 region of 16S. In this study, MBPD contained the largest number of identified pathogens among all the pipelines, with the number of detected pathogens reaching 310, and contained most of the other two, especially in MIP (Figure 4A). According to the evaluation of processing time, MBPD is as time‐effective as the MIP pipeline in detecting pathogenic species with the same sample sizes (Figure 4B). Next, we compared the relative abundance of dominant genera (the six most abundant genera) with MIP because 16SPIP can be applied to pathogen identification but not abundance. We identified the dominant genus Staphylococcus, which was also detected in MIP, and detected genera not identified in MIP, such as Corynebacterium and Finegoldia (Supporting Information: Figure S5A).
Figure 4.

Performance of MBPD, 16SPIP, and MIP in bacterial pathogen detection. (A) Venn diagram displaying the shared and specific species of pathogens detected by MBPD, 16SPIP, and MIP. (B) Comparison of runtime among MBPD, MIP, and 16SPIP. Runtime in min. The colors green, red, and brown denote MIP, MBPD, and 16SPIP, respectively.
MBPD accurately identified the dominant and coinfecting pathogens in real samples
We tested MBPD on paired samples between healthy and diseased individuals for human periodontitis, white shrimp disease, and plant bacterial wilt to further evaluate the performance of MBPD in real environmental samples, which are important sectors of One Health (human, animal, and environment). We used the threshold of 90% for pathogen alignment of all samples because they all amplified the V4 region of the 16S rRNA gene. The abundance of the pathogen community and causal agent of diseases was significantly higher in diseased samples than in healthy samples (Student's t‐test, p < 0.001, Figure 5A and Supporting Information: Figure S3A−C), but the evenness of the pathogen community in healthy samples was significantly higher than that in diseased samples (Supporting Information: Figure S3D−F). In addition, MBPD was able to identify other potential coinfecting pathogens (Figure 5B−D). For example, the highly enriched genera in diseased samples of humans and animals (Treponema and Photobacterium; p < 0.001 by Student's t‐test) could be isolated from the same environment as the causal agent of diseases and seriously threaten the host.
Figure 5.

Pathogen detection in the healthy and diseased samples. Differences in the relative abundance of causal agents of diseases in human, animal, and plant rhizosphere samples (A). Relative abundances of other potential genera were enriched in the diseased group between human (B), animal (C), and plant rhizosphere (D) samples. Pairwise Student's t‐test was used for statistical analyses (*p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001).
DISCUSSION
We present MBPD, a method to comprehensively identify bacterial pathogens from bio/eco‐samples using 16S rRNA gene sequencing, focusing on assessing pathogenic contamination in environmental and biological samples. To the best of our knowledge, MBPD represents a One Health‐oriented pathogen detection approach fueled by a large, current reference pathogen database. Key advances underlying our pipeline include (i) a relatively complete database convenient for detection across a broad range of pathogenic bacteria infecting animals and plants and zoonoses; (ii) appropriate thresholds provided for current common 16S amplicon regions for sequence alignment; (iii) high performance in the identification and quantification of pathogens; (iv) accurate identification for both dominant pathogens and their cocontaminating pathogens from biological and environmental samples; and (v) an open‐source tool suitable for constant updating of pathogen databases and processes.
The performance of pathogen identification is affected by the reference database, sequencing region, and similarity threshold [27, 34]. Small databases limit detected pathogen richness [28], and diverse sequencing regions lead to differences in species [27]. Here, we introduced a reference database for pathogen identification based on the Silva database because Silva is comprehensive and updated in a timely manner [35]. However, the optimal similarity threshold under different sequencing regions is still unknown. We conducted an in silico comparison of 16S rRNA gene sequencing under different thresholds [27] and chose UCLUST [36] as the tool for pathogen alignment because UCLUST achieved significantly better performance than all other methods for 16S rRNA gene classifications at the species level [34]. Our assessment showed that 16S full‐length sequencing at the 80% threshold performed best using MBPD, but variable regions will likely reduce the pathogen richness of a microbiome sample. In addition, considering the wider use of the Illumina sequencing platform (lower cost and production of short sequences), we also provided the optimal threshold under various common sequencing regions (V1−V2, V1−V3, V3−V4, V3−V5, V4, V4−V5, V5−V7, and V6−V9) for the best detection results. Next, MBPD enables the detection of cocontaminating pathogens from samples. For example, there are many cross‐infections occurring in a hostile environment [37, 38], but specific pathogens are not clear, or the densities of other pathogens will increase after treatment with the current pathogen. MBPD can also be used for a deeper understanding of pathogens, for example, to elucidate the pathogen diversity in samples and its response to environmental changes.
Although several tools are available to identify pathogens using 16S sequences [29, 30], a benchmark study to assess the accuracy of these pipelines were needed. We downloaded 16S rRNA amplicon‐based sequencing data of 50 patient skin samples from Lax et al. [33] and tested them with these tools. Here, we showed that MBPD might outperform other existing tools when processing the same data: MBPD contains the highest pathogen richness, and the dominant pathogens are consistent with those from other tools, which may result from the more comprehensive database providing a wider detection of pathogens [9]. In addition, we looked at the accuracy of MBPD using 16S rRNA amplicon data of humans, animals, and plants as well as diseased and healthy samples. Each type of sample was adequate for detecting the causal agent of diseases and identifying the potential pathogens.
Although MBPD showed high performance in pathogen detection, there are still limitations and challenges in 16S sequencing diagnosis. First, short‐read sequencing is limited in taxonomical resolution [12, 27, 39, 40]. The performance of the 16S subregion reached a maximum relative accuracy of 90% at the species/strain level compared to 16S full‐length sequencing (Figure 3D). Different strains at the species level or closely related species are occasionally unstably detected using short‐read sequencing [12, 27]. For example, Staphylococcus spp. was abundantly recognized as innocuous S. epidermidis by MIP but identified as pathogenic S. aureus by MBPD (Supporting Information: Figure S5B,C). To date, higher taxonomical resolution in differentiating bacterial strains at the subgenus level has been achieved by long‐read sequencing (PacBio [27] and MinION™ [41]) and protein‐coding taxonomic markers (rpoB [42] and cpn60 [43]), but the target does not perform well in discerning microorganism below the genus level. Second, pathogen identification has primer bias. Studies have indicated that V4 is unstable in identifying enteric pathogens and phytopathogenic Robbsia spp. using OTU clustering [40] or the Greengenes database [39]. The current MBPD shows a decent resolution at the species/strain level and accurately detects these two pathogens using the ASV approach with our curated pathogen database (Figure 4A and Supporting Information: Figure S1). This suggested analytical method and database are critical for taxonomic profiling [25, 28]. However, some pathogen species, such as phytopathogenic Candidatus Phytoplasma spp., are still difficult to detect (missing rate >75%, Supporting Information: Figure S1) due to the limited strain resolution of 16S for identification [40]. Thus, the identification of Phytoplasmas spp. may rely on some protein‐coding taxonomic markers (rpoB [42] and cpn60 [43]) or PCR‐based sequencing approaches applied to multiple genes other than 16S [12]. Finally, sequencing approaches lack direct evidence of lesions. Although the composition and risk of pathogen contamination have been assessed by amplicon and metagenome sequencing approaches [9, 18], the ground truth of causal pathogenic agents, lesions, and disease types require further experimental validations and antilytic method advances. For example, some pipelines, such as MIP, were developed to understand the relationship among pathogens, disease types, and human lesions [30, 44].
CONCLUSION
We developed a 16S‐based pipeline, MBPD, for the holistic detection of animal, plant, and zoonotic pathogens. MBPD contains the most comprehensive reference database for pathogen identification and quantification with feasible updating. This pipeline outperforms the existing tools for detecting bacterial pathogens and provides the optimal similarity threshold for pathogen alignment from varying biome samples. We anticipate a broad application of MBPD for assessment and diagnosis in the clinic, agriculture, fisheries, and veterinary medicine in the current One Health era.
METHODS
MBPD database construction
We first collected information on animal, plant, and zoonotic pathogens from various studies and databases, such as 16SPIP [29], FAPROTAX [45], and Bull et al. [46], [47], [48] (for details, see Supporting Information: Table S1). Based on the species level, a pathogenic reference database containing 72,685 sequences was obtained by alignment with the Silva 138.1 SSU Ref NR 99 bacterial database, which was developed with a 99% identity to remove redundant sequences [35], and a label on pathogenic types was added to the basic taxonomy of the Silva database (Supporting Information: Table S4, type). For unclassified pathogens, we further searched for relatively accurate taxonomic information in the NCBI Taxonomy Database (https://www.ncbi.nlm.nih.gov/taxonomy). However, considering that the species and strain levels in the Silva database are not well distinguished [49], we added a label for the corrected species level (Supporting Information: Table S4, species_manual). To construct and explore the phylogenetic relationships of pathogens, the sequences of all species (1986) were aligned using MAFFT [50], and a de novo phylogenetic tree was constructed using FastTree2 based on the maximum‐likelihood method [51]. The phylogenetic tree was further visualized using the iTOL web tool (https://itol.embl.de).
In silico comparison of 16S rRNA gene sequencing under different thresholds
The in silico analysis was carried out separately using the MBPD database. In silico amplicons demarcating different subregions of the 16S gene were generated by trimming regions defined by established primer sets (Supporting Information: Table S2) using the USEARCH search_pcr2 command [36]. Sequences were discarded if one or more variable regions could not be matched to a primer pair [27]. To determine the taxonomic resolution afforded by different variable regions and thresholds, 10,000 sequences were randomly selected using SeqKit [52] from each in silico amplicon, and classification was repeated 30 times against the MBPD database using the UCLUST algorithm [36, 53] with a variety of similarity thresholds (80%, 90%, 95%, 97%, and 99%). The accuracy rate, calculated as Accuracy rate = 100 × (accuracyfull‐length − accuracysubregions)/accuracyfull‐length, was employed to explore the best performance of various subregions in pathogen identification. The missing rate, calculated as Missing rate = 100 × (total numberphyla − miss numberphyla)/total numberphyla, was used to reveal the proportion of sequences for each variable region that could not be matched in the in silico experiment.
Data processing
Sequences with primers and barcodes removed were processed using the DADA2 pipeline [25], which is designed to obtain sequences with a single‐nucleotide difference, known as ASVs. Trimming and filtering were performed on paired reads with a maximum of one expected error per read (maxEE = 1). After merging paired reads and chimera filtering, the phylogenetic affiliation of each 16S rRNA gene sequence was analyzed by the UCLUST algorithm [36, 53] against the MBPD database using the appropriate threshold based on the results of in silico experiments.
Amplicon sequencing data for testing the performance of MBPD
To compare the performance of MBPD with that of 16SPIP and MIP, we randomly picked 50 patient skin samples from projects [33] targeting the V4 region of 16S for taxonomic profiling, which had been previously tested on the MIP pipeline [30]. In addition, three published V4 regions of 16S data from human, animal, and plant samples, comprising paired healthy versus diseased samples, were downloaded from the NCBI according to accession number: periodontitis (pathogen: Porphyromonas gingivalis) [20], white shrimp disease (Vibrio parahaemolyticus) [19], and bacterial wilt (Ralstonia solanacearum) [54]. The raw sequences were first processed using fastp [55] to truncate low‐quality reads, potential primers, and barcodes. Then, paired‐end sequences were merged using the QIIME join_paired_ends.py command [53]. Finally, cleaned fastq files were processed using the MBPD pipeline.
Statistical analysis
Student's t‐tests (two‐sided) were used to test for statistical significance between pairs of samples, where p values below 0.05 were considered statistically significant. ANOVA and Tukey's honest significant difference tests was used to determine the statistical significance of multiple comparisons. All statistical analyses were carried out using the R 4.1.2 program (www.r-project.org).
AUTHOR CONTRIBUTIONS
Conceptualization: Xinrun Yang, Gaofei Jiang, Zhong Wei. Resources: Gaofei Jiang, Zhong Wei. Methodology: Xinrun Yang, Gaofei Jiang, Zhong Wei. Data curation: Xinrun Yang, Gaofei Jiang, Yuling Zhang, Ningqi Wang. Formal analysis: Xinrun Yang, Gaofei Jiang, Yaozhong Zhang, Ningqi Wang, Yuling Zhang. Funding acquisition: Gaofei Jiang, Zhong Wei, Xiaofang Wang, Fang‐Jie Zhao, Yangchun Xu, Qirong Shen. Investigation: Xinrun Yang, Gaofei Jiang, Yaozhong Zhang. Project administration: Xinrun Yang, Gaofei Jiang, Zhong Wei, Xiaofang Wang, Fang‐Jie Zhao, Yangchun Xu, Qirong Shen. Supervision: Xinrun Yang, Gaofei Jiang, Zhong Wei, Yaozhong Zhang. Software: Xinrun Yang, Gaofei Jiang, Yaozhong Zhang, Zhong Wei, Yangchun Xu. Visualization: Xinrun Yang, Gaofei Jiang, Yaozhong Zhang. Writing—original draft: Xinrun Yang. Writing—review & editing: Gaofei Jiang, Yaozhong Zhang, Zhong Wei, Fang‐Jie Zhao.
CONFLICT OF INTEREST
The authors declare no conflict of interest.
Supporting information
Supporting information.
Supporting information.
ACKNOWLEDGMENTS
We thank Chen Liu and Ye Tao for their suggestions. This research was financially supported by the National Natural Science Foundation of China (42090062, 42090064, 31972504, and 42007038), the National Key Research and Development Program of China (2022YFC3501501), and the Fundamental Research Funds for the Central Universities (XUEKEN2022025‐XUEKEN2022026), and China National Tobacco Corporation (110202101047, LS‐07), Jiangxi Branch of China National Tobacco Corporation (2021.01.010).
Yang, Xinrun , Jiang Gaofei, Zhang Yaozhong, Wang Ningqi, Zhang Yuling, Wang Xiaofang, Zhao Fang‐Jie, Xu Yangchun, Shen Qirong, and Wei Zhong. 2023. “MBPD: A Multiple Bacterial Pathogen Detection Pipeline for One Health practices.” iMeta 2, e82. 10.1002/imt2.82
Xinrun Yang and Gaofei Jiang contributed equally to this work.
Contributor Information
Gaofei Jiang, Email: gjiang@njau.edu.cn.
Zhong Wei, Email: weizhong@njau.edu.cn.
DATA AVAILABILITY STATEMENT
MBPD is publicly available on GitHub (https://github.com/LorMeBioAI/MBPD). Supplementary materials (figures, tables, scripts, graphical abstract, slides, videos, Chinese translated version, and updated materials) may be found in the online DOI or iMeta Science http://www.imeta.science/.
REFERENCES
- 1. Ghai, Ria R. , Wallace Ryan M., Kile James C., Shoemaker Trevor R., Vieira Antonio R., Negron Maria E., and Shadomy Sean V., et al. 2022. “A Generalizable One Health Framework for the Control of Zoonotic Diseases.” Scientific Reports 12: 8588. 10.1038/s41598-022-12619-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Fauci, Anthony S. , and Morens David M.. 2012. “The Perpetual Challenge of Infectious Diseases.” New England Journal of Medicine 366: 454–61. 10.1056/NEJMra1108296 [DOI] [PubMed] [Google Scholar]
- 3. Savary, Serge , Willocquet Laetitia, Pethybridge Sarah Jane, Esker Paul, McRoberts Neil, and Nelson Andy. 2019. “The Global Burden of Pathogens and Pests on Major Food Crops.” Nature Ecology & Evolution 3: 430–39. 10.1038/s41559-018-0793-y [DOI] [PubMed] [Google Scholar]
- 4. Gruetzmacher, Kim , Karesh William B., Amuasi John H., Arshad Adnan, Farlow Andrew, Gabrysch Sabine, Jetzkowitz Jens, et al. 2021. “The Berlin Principles on One Health—Bridging Global Health and Conservation.” Science of The Total Environment 764: 142919. 10.1016/j.scitotenv.2020.142919 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Gibb, Rory , Redding David W., Chin Kai Qing, Donnelly Christl A., Blackburn Tim M., Newbold Tim, and Jones Kate E.. 2020. “Zoonotic Host Diversity Increases in Human‐Dominated Ecosystems.” Nature 584: 398–402. 10.1038/s41586-020-2562-8 [DOI] [PubMed] [Google Scholar]
- 6. Delgado‐Baquerizo, Manuel , Guerra Carlos A., Cano‐Díaz Concha, Egidi Eleonora, Wang Jun‐Tao, Eisenhauer Nico, Singh Brajesh K., and Maestre Fernando T.. 2020. “The Proportion of Soil‐Borne Pathogens Increases with Warming at the Global Scale.” Nature Climate Change 10: 550–54. 10.1038/s41558-020-0759-3 [DOI] [Google Scholar]
- 7. Tollenaere, Charlotte , Susi Hanna, and Laine Anna‐Liisa. 2016. “Evolutionary and Epidemiological Implications of Multiple Infection in Plants.” Trends in Plant Science 21: 80–90. 10.1016/j.tplants.2015.10.014 [DOI] [PubMed] [Google Scholar]
- 8. Griffiths, Emily C. , Pedersen Amy B., Fenton Andy, and Petchey Owen L.. 2011. “The Nature and Consequences of Coinfection in Humans.” Journal of Infection 63: 200–206. 10.1016/j.jinf.2011.06.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Han, Dongsheng , Li Ziyang, Li Rui, Tan Ping, Zhang Rui, and Li Jinming. 2019. “mNGS in Clinical Microbiology Laboratories: On the Road to Maturity.” Critical Reviews in Microbiology 45: 668–85. 10.1080/1040841X.2019.1681933 [DOI] [PubMed] [Google Scholar]
- 10. Sundin, George W. , and Wang Nian. 2018. “Antibiotic Resistance in Plant‐Pathogenic Bacteria.” Annual Review of Phytopathology 56: 161–80. 10.1146/annurev-phyto-080417-045946 [DOI] [PubMed] [Google Scholar]
- 11. Timme, Ruth E. , Wolfgang William J., Balkey Maria, Venkata Sai Laxmi Gubbala, Randolph Robyn, Allard Marc, and Strain Errol. 2020. “Optimizing Open Data to Support One Health: Best Practices to Ensure Interoperability of Genomic Data from Bacterial Pathogens.” One Health Outlook 2: 20. 10.1186/s42522-020-00026-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Muhamad Rizal, Nurnabila Syafiqah , Neoh Hui‐min, Ramli Ramliza, Periyasamy Petrick Ramesh A/L K, Hanafiah Alfizah, Samat Muttaqillah Najihan Abdul, Tan Toh Leong, et al. 2020. “Advantages and Limitations of 16S rRNA Next‐Generation Sequencing for Pathogen Identification in the Diagnostic Microbiology Laboratory: Perspectives from a Middle‐Income Country.” Diagnostics 10: 816. 10.3390/diagnostics10100816 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Lagier, Jean‐Christophe , Edouard Sophie, Pagnier Isabelle, Mediannikov Oleg, Drancourt Michel, and Raoult Didier. 2015. “Current and Past Strategies for Bacterial Culture in Clinical Microbiology.” Clinical Microbiology Reviews 28: 208–36. 10.1128/CMR.00110-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Palacio‐Bielsa, A. , Cambra M. A., and López M. M.. 2009. “PCR Detection and Indentification of Plant‐Pathogenic Bacteria: Updated Review and Identification of Plant‐Pathogenic Bacteria: Updated Review of Protocols (1989−2007).” Journal of Plant Pathology 91: 249–97. 10.4454/jpp.v91i2.957 [DOI] [Google Scholar]
- 15. Allicock, Orchid M. , Guo Cheng, Uhlemann Anne‐Catrin, Whittier Susan, Chauhan Lokendra V., Garcia Joel, Price Adam, et al. 2018. “BacCapSeq: A Platform for Diagnosis and Characterization of Bacterial Infections.” mBio 9: e02007–02018. 10.1128/mBio.02007-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Garner, Emily , Davis Benjamin C., Milligan Erin, Blair Matthew Forrest, Keenum Ishi, Maile‐Moskowitz Ayella, Pan Jin, et al. 2021. “Next Generation Sequencing Approaches to Evaluate Water and Wastewater Quality.” Water Research 194: 116907. 10.1016/j.watres.2021.116907 [DOI] [PubMed] [Google Scholar]
- 17. He, Xuejun , Zhang Ningzhi, Cao Wenye, Xing Yiqiao, and Yang Ning. 2022. “Application Progress of High‐Throughput Sequencing in Ocular Diseases.” Journal of Clinical Medicine 11: 3485. 10.3390/jcm11123485 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Chen, Peixin , Sun Wenwen, and He Yayi. 2020. “Comparison of the Next‐Generation Sequencing (NGS) Technology with Culture Methods in the Diagnosis of Bacterial and Fungal Infections.” Journal of Thoracic Disease 12: 4924–29. 10.21037/jtd-20-930 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Huang, Zhijian , Zeng Shenzheng, Xiong Jinbo, Hou Dongwei, Zhou Renjun, Xing Chengguang, Wei Dongdong, et al. 2020. “Microecological Koch's Postulates Reveal that Intestinal Microbiota Dysbiosis Contributes to Shrimp White Feces Syndrome.” Microbiome 8: 32. 10.1186/s40168-020-00802-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Chen, Casey , Hemme Chris, Beleno Joan, Shi Zhou Jason, Ning Daliang, Qin Yujia, and Tu Qichao, et al. 2018. “Oral Microbiota of Periodontal Health and Disease and Their Changes after Nonsurgical Periodontal Therapy.” The ISME Journal 12: 1210–24. 10.1038/s41396-017-0037-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Lewis, Elizabeth , Hudson John A., Cook Nigel, Barnes Jeremy D., and Haynes Edward. 2020. “Next‐Generation Sequencing as a Screening Tool for Foodborne Pathogens in Fresh Produce.” Journal of Microbiological Methods 171: 105840. 10.1016/j.mimet.2020.105840 [DOI] [PubMed] [Google Scholar]
- 22. Wei, Zhong , Hu Jie, Gu Yi'an, Yin Shixue, Xu Yangchun, Jousset Alexandre, Shen Qirong, and Friman Ville‐Petri. 2018. “ Ralstonia solanacearum Pathogen Disrupts Bacterial Rhizosphere Microbiome during an Invasion.” Soil Biology & Biochemistry 118: 8–17. 10.1016/j.soilbio.2017.11.012 [DOI] [Google Scholar]
- 23. Chen, Tao , Nomura Kinya, Wang Xiaolin, Sohrabi Reza, Xu Jin, Yao Lingya, Paasch Bradley C., et al. 2020. “A Plant Genetic Network for Preventing Dysbiosis in the Phyllosphere.” Nature 580: 653–57. 10.1038/s41586-020-2185-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Edgar, Robert C. 2013. “UPARSE: Highly Accurate OTU Sequences from Microbial Amplicon Reads.” Nature Methods 10: 996–98. 10.1038/nmeth.2604 [DOI] [PubMed] [Google Scholar]
- 25. Callahan, Benjamin J. , McMurdie Paul J., Rosen Michael J., Han Andrew W., Johnson Amy Jo A., and Holmes Susan P.. 2016. “DADA2: High‐Resolution Sample Inference from Illumina Amplicon Data.” Nature Methods 13: 581–83. 10.1038/nmeth.3869 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Prodan, Andrei , Tremaroli Valentina, Brolin Harald, Zwinderman Aeilko H., Nieuwdorp Max, and Levin Evgeni. 2020. “Comparing Bioinformatic Pipelines for Microbial 16S rRNA Amplicon Sequencing.” Plos One 15: e0227434. 10.1371/journal.pone.0227434 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Johnson, Jethro S. , Spakowicz Daniel J., Hong Bo‐Young, Petersen Lauren M., Demkowicz Patrick, Chen Lei, Leopold Shana R., et al. 2019. “Evaluation of 16S rRNA Gene Sequencing for Species and Strain‐Level Microbiome Analysis.” Nature Communications 10: 5029. 10.1038/s41467-019-13036-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Liao, Chun‐Chieh , Fu Po‐Ying, Huang Chih‐Wei, Chuang Chia‐Hsien, Yen Yun, Lin Chung‐Yen, and Chen Shu‐Hwa. 2022. “MetaSquare: An Integrated Metadatabase of 16S rRNA Gene Amplicon for Microbiome Taxonomic Classification.” Bioinformatics 38: 2930–31. 10.1093/bioinformatics/btac184 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Miao, Jiaojiao , Han Na, Qiang Yujun, Zhang Tingting, Li Xiuwen, and Zhang Wen. 2017. “16SPIP: A Comprehensive Analysis Pipeline for Rapid Pathogen Detection in Clinical Samples Based on 16S Metagenomic Sequencing.” BMC Bioinformatics 18: 568. 10.1186/s12859-017-1975-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Sun, Zheng , Liu Xudong, Jing Gongchao, Chen Yuzhu, Jiang Shuaiming, Zhang Meng, Liu Jiquan, Xu Jian, and Su Xiaoquan. 2022. “Comprehensive Understanding to the Public Health Risk of Environmental Microbes via a Microbiome‐Based Index.” Journal of Genetics and Genomics 49: 685–88. 10.1016/j.jgg.2021.12.011 [DOI] [PubMed] [Google Scholar]
- 31. Bal Krishna, K. C. , Sathasivan Arumugam, and Ginige Maneesha P.. 2021. “An Assessment of the Persistence of Putative Pathogenic Bacteria in Chloraminated Water Distribution Systems.” Water Research 190: 116677. 10.1016/j.watres.2020.116677 [DOI] [PubMed] [Google Scholar]
- 32. Bird, Brian H. , and Mazet Jonna A. K.. 2018. “Detection of Emerging Zoonotic Pathogens: An Integrated One Health Approach.” Annual Review of Animal Biosciences 6: 121–39. 10.1146/annurev-animal-030117-014628 [DOI] [PubMed] [Google Scholar]
- 33. Lax, Simon , Sangwan Naseer, Smith Daniel, Larsen Peter, Handley Kim M., Richardson Miles, Guyton Kristina, et al. 2017. “Bacterial Colonization and Succession in a Newly Opened Hospital.” Science Translational Medicine 9: eaah6500. 10.1126/scitranslmed.aah6500 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Bokulich, Nicholas A. , Kaehler Benjamin D., Rideout Jai Ram, Dillon Matthew, Bolyen Evan, Knight Rob, Huttley Gavin A., and Gregory Caporaso J.. 2018. “Optimizing Taxonomic Classification of Marker‐Gene Amplicon Sequences with QIIME 2's q2‐Feature‐Classifier Plugin.” Microbiome 6: 90. 10.1186/s40168-018-0470-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Quast, Christian , Pruesse Elmar, Yilmaz Pelin, Gerken Jan, Schweer Timmy, Yarza Pablo, Peplies Jörg, and Glöckner Frank Oliver. 2012. “The SILVA Ribosomal RNA Gene Database Project: Improved Data Processing and Web‐Based Tools.” Nucleic Acids Research 41: D590–596. 10.1093/nar/gks1219 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Edgar, Robert C . 2010. “Search and Clustering Orders of Magnitude Faster than BLAST.” Bioinformatics (Oxford, England) 26: 2460–61. 10.1093/bioinformatics/btq461 [DOI] [PubMed] [Google Scholar]
- 37. Wang, Ji‐Xiang , Cao Xiang, and Chen Yong‐Ping. 2021. “An Air Distribution Optimization of Hospital Wards for Minimizing Cross‐Infection.” Journal of Cleaner Production 279: 123431. 10.1016/j.jclepro.2020.123431 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Son, Sungwan , and Jang Choon‐Man. 2022. “Effects of Internal Airflow on IAQ and Cross‐Infection of Infectious Diseases between Students in Classrooms.” Atmospheric Environment 279: 119112. 10.1016/j.atmosenv.2022.119112 [DOI] [Google Scholar]
- 39. Greay, Telleasha L. , Gofton Alexander W., Zahedi Alireza, Paparini Andrea, Linge Kathryn L., Joll Cynthia A., and Ryan Una M.. 2019. “Evaluation of 16S Next‐Generation Sequencing of Hypervariable Region 4 in Wastewater Samples: An Unsuitable Approach for Bacterial Enteric Pathogen Identification.” Science of The Total Environment 670: 1111–24. 10.1016/j.scitotenv.2019.03.278 [DOI] [PubMed] [Google Scholar]
- 40. Mechan‐Llontop, Marco Enrique , Tian Long, Bernal‐Galeano Vivian, Reeves Ella, Hansen Mary Ann, Bush Elizabeth Ann, and Vinatzer Boris Alexander. 2019. “Assessing the Potential of Culture‐Independent 16S rRNA Microbiome Analysis in Disease Diagnostics: The Example of Dianthus gratianopolitanus and Robbsia andropogonis .” European Journal of Plant Pathology 155: 1211–23. 10.1007/s10658-019-01850-8 [DOI] [Google Scholar]
- 41. Matsuo, Yoshiyuki , Komiya Shinnosuke, Yasumizu Yoshiaki, Yasuoka Yuki, Mizushima Katsura, Takagi Tomohisa, Kryukov Kirill, et al. 2021. “Full‐Length 16S rRNA Gene Amplicon Analysis of Human Gut Microbiota Using MinION™ Nanopore Sequencing Confers Species‐Level Resolution.” BMC Microbiology 21: 35. 10.1186/s12866-021-02094-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Ogier, Jean‐Claude , Pagès Sylvie, Galan Maxime, Barret Matthieu, and Gaudriault Sophie. 2019. “ rpoB, A Promising Marker for Analyzing the Diversity of Bacterial Communities by Amplicon Sequencing.” BMC Microbiology 19: 171. 10.1186/s12866-019-1546-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Vancuren, Sarah J. , and Hill Janet E.. 2019. “Update on cpnDB: A Reference Database of Chaperonin Sequences.” Database 2019: baz033. 10.1093/database/baz033 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Chen, Yuzhu , Li Jian, Zhang Yufeng, Zhang Mingqian, Sun Zheng, Jing Gongchao, Huang Shi, and Su Xiaoquan. 2022. “Parallel‐Meta Suite: Interactive and Rapid Microbiome Data Analysis on Multiple Platforms.” iMeta 1: e1. 10.1002/imt2.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Louca, S. , Parfrey L. W., and Doebeli M.. 2016. “Decoupling Function and Taxonomy in the Global Ocean Microbiome.” Science 353: 1272–77. 10.1126/science.aaf4507 [DOI] [PubMed] [Google Scholar]
- 46. Bull, Carolee T. , De Boer Solke H., Denny Timothy P., Firrao Giuseppe, Saux Marion Fischer‐Le, Saddler Gerry S., Scortichini Marco, Stead David E., and Takikawa Yuichi. 2010. “Comprehensive List of Names of Plant Pathogenic Bacteria, 1980−2007.” Journal of Plant Pathology 92: 551–92. 10.4454/jpp.v92i3.302 [DOI] [Google Scholar]
- 47. Bull, Carolee T. , De Boer Solke H., Denny Timothy P., Firrao Giuseppe, Fischer‐Le Saux Marion, Saddler Gerry S., yScortichini Marco, Stead David E., and Takikawa Yuichi. 2012. “List of New Names of Plant Pathogenic Bacteria (2008−2010).” Journal of Plant Pathology 94: 21–27. 10.4454/jpp.fa.2011.003 [DOI] [Google Scholar]
- 48. Bull, Carolee T. , Coutinho Teresa A., Denny Timothy P., Firrao Giuseppe., Fischer‐Le Saux Marco, Li Xuguang, Saddler Gerry S., et al. 2014. “List of New Names of Plant Pathogenic Bacteria (2011−2012).” Journal of Plant Pathology 96: 223–26. 10.4454/jpp.v96i2.026 [DOI] [Google Scholar]
- 49. Yilmaz, Pelin , Parfrey Laura Wegener, Yarza Pablo, Gerken Jan, Pruesse Elmar, Quast Christian, Schweer Timmy, et al. 2014. “The SILVA and “All‐Species Living Tree Project (LTP)” Taxonomic Frameworks.” Nucleic Acids Research 42: D643–648. 10.1093/nar/gkt1209 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Katoh, Kazutaka . 2002. “MAFFT: A Novel Method for Rapid Multiple Sequence Alignment Based on Fast Fourier Transform.” Nucleic Acids Research 30: 3059–66. 10.1093/nar/gkf436 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Price, Morgan N. , Dehal Paramvir S., and Arkin Adam P.. 2010. “FastTree 2–Approximately Maximum‐Likelihood Trees for Large Alignments.” Plos One 5: e9490. 10.1371/journal.pone.0009490 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Shen, Wei , Le Shuai, Li Yan, and Hu Fuquan. 2016. “SeqKit: A Cross‐Platform and Ultrafast Toolkit for FASTA/Q File Manipulation.” Plos One 11: e0163962. 10.1371/journal.pone.0163962 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Caporaso, J. Gregory , Kuczynski Justin, Stombaugh Jesse, Bittinger Kyle, Bushman Frederic D., Costello Elizabeth K., Fierer Noah, et al. 2010. “QIIME Allows Analysis of High‐Throughput Community Sequencing Data.” Nature Methods 7: 335–36. 10.1038/nmeth.f.303 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Jiang, Gaofei , Wang Ningqi, Zhang Yaoyu, Wang Zhen, Zhang Yuling, Yu Jiabao, Zhang Yong, et al. 2021. “The Relative Importance of Soil Moisture in Predicting Bacterial Wilt Disease Occurrence.” Soil Ecology Letters 3: 356–66. 10.1007/s42832-021-0086-2 [DOI] [Google Scholar]
- 55. Chen, Shifu , Zhou Yanqing, Chen Yaru, and Gu Jia. 2018. “fastp: An Ultra‐Fast All‐in‐One FASTQ Preprocessor.” Bioinformatics 34: i884–890. 10.1093/bioinformatics/bty560 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supporting information.
Supporting information.
Data Availability Statement
MBPD is publicly available on GitHub (https://github.com/LorMeBioAI/MBPD). Supplementary materials (figures, tables, scripts, graphical abstract, slides, videos, Chinese translated version, and updated materials) may be found in the online DOI or iMeta Science http://www.imeta.science/.
