Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2026 Mar 14:2026.03.11.710980. [Version 1] doi: 10.64898/2026.03.11.710980

A Multi-Omics Processing Pipeline (MOPP) for Extracting Taxonomic and Functional Insights from Metaribosome Profiling (metaRibo-Seq) data

Yuhan Weng, Oriane Moyne, Corinn Walker, Eli Haddad, Chloe Lieng, Loryn Chin, Gibraan Rahman, Daniel McDonald, Rob Knight, Karsten Zengler
PMCID: PMC13061286  PMID: 41959051

ABSTRACT

Metaribosome profiling (metaRibo-Seq) enables genome-wide measurement of translation across complex microbial communities by sequencing ribosome-protected mRNA fragments, but the short length of these footprints creates substantial nonspecific mapping against large reference genome collections, leading to spurious taxonomic and functional assignments. Here we present MOPP (Multi-Omics Processing Pipeline), a modular reference-based workflow that denoises metaRibo-Seq data by leveraging matched metagenomic coverage breadth to identify genomes likely to be truly present in a sample before aligning metatranslatomic and optional metatranscriptomic reads. MOPP generates taxon-by-gene count tables across genomic, transcriptional and translational layers, enabling integrated downstream analyses of microbial function. We evaluated MOPP using a defined 79-member synthetic human gut community profiled by metagenomics and metaRibo-Seq. Coverage breadth filtering markedly improved detection accuracy relative to a standard baseline workflow, with performance remaining robust across a broad intermediate threshold range and peaking at 92-95% coverage breadth. At a 92% threshold, MOPP reduced the number of distinct detected operational genomic units by 99.4% while retaining 87.8% of aligned metaRibo-Seq reads on average, and increased the F1 score from 0.02 to 0.61. Residual false positives were predominantly attributable to genomes with extremely high nucleotide similarity to true community members, whereas false negatives were enriched among low-abundance taxa, indicating that remaining errors are driven primarily by biological similarity and detection limits rather than widespread nonspecific mapping. Together, these results establish MOPP as a high-throughput workflow for robust processing of metaRibo-Seq in the context of matched metagenomics and position it as a scalable framework for integrated taxonomic and functional analysis of microbial communities across genomic, transcriptional and translational layers.

Full Text

The Full Text of this preprint is available as a PDF (605.9 KB). The Web version will be available soon.


Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES