Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2025 Nov 11:2025.04.30.651563. [Version 4] doi: 10.1101/2025.04.30.651563

An AI-Guided Framework Reveals Conserved Features Governing microRNA Strand Selection

Dalton Meadows, Hailee Hargis, Amanda Ellis, Heewook Lee, Marco Mangone
PMCID: PMC12132293  PMID: 40463156

Abstract

MicroRNAs (miRNAs) are central regulators of gene expression, yet how cells choose between the two strands (5p or 3p) of a miRNA duplex during biogenesis remains unresolved. Here, we present a comprehensive, experimentally grounded framework that decodes the logic of miRNA strand selection. Using Caenorhabditis elegans as a model system, we developed a high-throughput qPCR platform enabling precise quantification of strand usage across developmental stages and in specific somatic tissues. To uncover the molecular grammar guiding this process, we built a predictive machine learning model trained on experimentally validated strand usage data. This AI-driven model, integrating 77 biologically informed features, accurately predicts strand preference not only in nematodes but also across vertebrates, including humans, revealing compositional and structural biases that are conserved yet functionally repurposed. Our analysis shows that strand selection is not stochastic but follows conserved, context-dependent rules shaped by cellular and developmental cues. To support the research community, we provide open-access resources: a database of strand usage profiles, predictive scores across species, and code and protocols via GitHub. This work offers the first unified, generalizable model for miRNA strand selection, establishing a paradigm that combines large-scale experimentation with AI to reveal a conserved, programmable layer of gene regulation.

Full Text Availability

The license terms selected by the author(s) for this preprint version do not permit archiving in PMC. The full text is available from the preprint server.


Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES