Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2024 May 29:2024.05.24.595722. [Version 1] doi: 10.1101/2024.05.24.595722

micronuclAI: Automated quantification of micronuclei for assessment of chromosomal instability

Miguel A Ibarra-Arellano, Lindsay A Caprio, Aroj Hada, Niklas Stotzem, Luke Cai, Shivem Shah, Johannes C Melms, Florian Wünneman, Benjamin Izar, Denis Schapiro
PMCID: PMC11160592  PMID: 38854106

Abstract

Chromosomal instability (CIN) is a hallmark of cancer that drives metastasis, immune evasion and treatment resistance. CIN results from chromosome mis-segregation events during anaphase, as excessive chromatin is packaged in micronuclei (MN), that can be enumerated to quantify CIN. Despite recent advancements in automation through computer vision and machine learning, the assessment of CIN remains a predominantly manual and time-consuming task, thus hampering important work in the field. Here, we present micronuclAI , a novel pipeline for automated and reliable quantification of MN of varying size, morphology and location from DNA-only stained images. In micronucleAI , single-cell crops are extracted from high-resolution microscopy images with the help of segmentation masks, which are then used to train a convolutional neural network (CNN) to output the number of MN associated with each cell. The pipeline was evaluated against manual single-cell level counts by experts and against routinely used MN ratio within the complete image. The classifier was able to achieve a weighted F1 score of 0.937 on the test dataset and the complete pipeline can achieve close to human-level performance on various datasets derived from multiple human and murine cancer cell lines. The pipeline achieved a root-mean-square deviation (RMSE) value of 0.0041, an R 2 of 0.87 and a Pearson’s correlation of 0.938 on images obtained at 10X magnification. We tested the approach in otherwise isogenic cell lines in which we genetically dialed up or down CIN rates, and also on a publicly available image data set (obtained at 100X) and achieved an RMSE value of 0.0159, an R 2 of 0.90, and a Pearson’s correlation of 0.951. Given the increasing interest in developing therapies for CIN-driven cancers, this method provides an important, scalable, and rapid approach to quantifying CIN on routinely obtained images. We release a GUI-implementation for easy access and utilization of the pipeline.

Full Text Availability

The license terms selected by the author(s) for this preprint version do not permit archiving in PMC. The full text is available from the preprint server.


Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES