Skip to main content
eLife logoLink to eLife
. 2025 Jun 2;13:RP101579. doi: 10.7554/eLife.101579

Image-based identification and isolation of micronucleated cells to dissect cellular consequences

Lucian DiPeso 1,2, Sriram Pendyala 3, Heather Z Huang 1, Douglas M Fowler 3, Emily M Hatch 1,4,
Editors: Silke Hauf5, Adèle L Marston6
PMCID: PMC12129451  PMID: 40455079

Abstract

Recent advances in isolating cells based on visual phenotypes have transformed our ability to identify the mechanisms and consequences of complex traits. Micronucleus (MN) formation is a frequent outcome of genome instability, triggers extensive changes in genome structure and signaling coincident with MN rupture, and is almost exclusively defined by visual analysis. Automated MN detection in microscopy images has proved challenging, limiting discovery of the mechanisms and consequences of MN. In this study we describe two new MN segmentation modules: a rapid model for classifying micronucleated cells and their rupture status (VCS MN), and a robust model for accurate MN segmentation (MNFinder) from a broad range of cell lines. As proof-of-concept, we define the transcriptome of non-transformed human cells with intact or ruptured MN after chromosome missegregation by combining VCS MN with photoactivation-based cell isolation and RNASeq. Surprisingly, we find that neither MN formation nor rupture triggers a strong unique transcriptional response. Instead, transcriptional changes appear correlated with small increases in aneuploidy in these cell classes. Our MN segmentation modules overcome a significant challenge with reproducible MN quantification, and, joined with visual cell sorting, enable the application of powerful functional genomics assays to a wide-range of questions in MN biology.

Research organism: Human

eLife digest

Healthy cells house most of their genetic information, such as their chromosomes, within a dedicated compartment known as the nucleus. Micronuclei, on the other hand, are small cellular compartments which contain pieces of DNA left behind after improper cell divisions or other abnormal events. Detectable under the microscope, their presence is usually interpreted as a sign of aging, disease or negative perturbations such as exposure to toxic chemicals.

However, new research suggests that micronuclei could also be directly harmful. When they rupture – which they almost always do – the sudden presence of unprotected genetic information in the ‘incorrect’ part of the cell could trigger dangerous cascades of events. To better understand these processes, researchers need to be able to quickly identify and isolate micronuclei-carrying cells within a large population, but such techniques are currently lacking.

DiPeso et al. aimed to address this gap by developing an AI tool, MNFinder, which can analyse images of live cells and automatically identify micronuclei. This allowed the team to mark and isolate cells in which these compartments were either absent, present, or had ruptured. Further experiments showed that, unexpectedly, patterns of gene expression did not differ between these different groups of cells. This suggests that the cell cannot ‘see’ that it has too many nuclei, or sense when its own DNA is in the wrong place. The disease-associated changes caused by the rupturing of micronuclei may therefore only emerge later, when the cell next divides. At this point, the DNA is heavily damaged and can activate immune checkpoint pathways, which can generate cancer-driving mutations and cellular self-destruct signals.

By developing and making MNFinder freely available, DiPeso et al. hope that micronuclei will be easier to study for scientists in a wide range of fields. Interestingly, micronuclei occur frequently during early human embryo development and may affect fertility, highlighting another research area that could benefit from these tools. As with all image analysis programs, however, increased usage, data, and model training will expand the use and ease of applying these tools to critical questions in human health.

Introduction

Recent advances in automated image analysis have led to the development of high-throughput platforms to isolate specific cell classes and match visual phenotypes to specific genetic and expression profiles. These platforms bring the power of population-based analyses and pooled genetic screening to a large range of phenotypes that are defined solely by visual changes in subcellular features. One such feature are micronuclei (MN), nuclear compartments containing a few chromosomes or chromatin fragments that result from mitotic segregation errors or DNA damage (Bona and Bakhoum, 2024; Guo et al., 2019). Increased MN frequency is a hallmark of carcinogen exposure, cancer development, and aging, and MN are potent drivers of massive genome structural changes, pro-inflammatory and metastasis signaling, and senescence (Bakhoum et al., 2018; Dou et al., 2017; Harding et al., 2017; He et al., 2019; Mackenzie et al., 2017; Mohr et al., 2021; Soto et al., 2018; Zhang et al., 2015). These processes are linked to MN rupture, which exposes the chromatin to the cytosol for the duration of interphase (Hatch et al., 2013), and may contribute to tumorigenesis, metastasis, aging, and immune disorders (Bona and Bakhoum, 2024; Guo et al., 2019).

Most studies on the biology and consequences of MN formation and rupture take advantage of the fact that MN can be induced with high frequency in cultured cells, for instance by inhibiting the spindle assembly checkpoint kinase Mps1 (Krupina et al., 2021). However, these interventions cause diverse additional nuclear and cellular changes, including nuclear atypia, chromatin bridges, aneuploidy, and DNA damage (Flynn et al., 2021; Santaguida et al., 2017). Sophisticated techniques have been developed to overcome this challenge by identifying micronucleated cells in single cell arrays, inducing missegregation of a single chromosome or chromosome arm without global ploidy changes by CRISPR, and purifying MN from lysed cells (Agustinus et al., 2023; Ly et al., 2017; Mohr et al., 2021; Papathanasiou et al., 2023; Zhang et al., 2015). These techniques led to new insights into MN functional defects and their mechanisms but have a limited ability to define the cellular consequences of MN formation and rupture. To understand these consequences across biologically relevant contexts, what is needed is a method that can identify and isolate all micronucleated cells from a mixed population in a broad range of cell types and conditions.

Automated detection of MN using conventional image analysis methods is challenging due to the diversity of MN sizes, their morphological similarity to frequently co-occurring nuclear features, including nuclear blebs and chromatin bridges, and their proximity to nuclei. Machine learning methods, specifically deep neural nets, are a powerful approach to discriminate cellular organelles and were used to develop programs to identify nuclei, mitochondria, and spindle poles with high fidelity (Caicedo et al., 2019b; Dang et al., 2023; Fischer et al., 2020). In this study, we apply neural network-based pixel classification to rapidly identify micronucleated cells (VCS MN) or pixel and instance classification to segment MN with high sensitivity across multiple cell types and imaging conditions (MNFinder). We demonstrate the utility of these analyses for understanding the consequences of micronucleation by combining VCS MN with a phenotype-based cell isolation method, called Visual Cell Sorting (VCS), to define the transcriptomic profile by RNAseq of hTERT-RPE1 cells that have none, intact, or ruptured MN after induced chromosome instability. During visual cell sorting, live single cells expressing nuclear-localized Dendra2 are photoconverted on-demand based on the results of the VCS MN classifier. Specific cell classes are then isolated by FACS (Hasle et al., 2020). We show that we can recapitulate an established aneuploidy signature using this method and find that neither micronucleation nor rupture is sufficient to induce significant transcriptional changes. We anticipate that visual cell sorting will be critical to address fundamental questions about how MN contribute to cancer development and enable new high-throughput screens for MN stability and survivability regulators. In addition, we expect MNFinder to have a broad utility for experimenters requiring accurate quantification of MN frequency and characteristics in cultured cells, including those in cancer research, nucleus biology, and virology.

Results

Development and training of VCS MN to rapidly classify micronucleated cells by fluorescence imaging

To isolate live micronucleated cells from a mixed population by visual cell sorting, we developed an automated pipeline to rapidly identify nuclei associated with one or more MN. For these studies, we focused on hTERT RPE-1 (RPE1) cells, a near-diploid, non-transformed human cell line that has been extensively used to study chromosome missegregation, aneuploidy, and micronucleation (He et al., 2019; Kneissig et al., 2019; Mammel et al., 2021; Santaguida et al., 2017; Zhang et al., 2015). We co-expressed a fluorescent chromatin marker (H2B-emiRFP703) to identify nuclei and MN with 3xDendra2-NLS (nuclear localization signal) to identify ruptured MN and photoactivate selected cells (Hasle et al., 2020; Hatch et al., 2013; Matlashov et al., 2020; Figure 1a). We refer to this cell line as RFP703/Dendra. In the absence of mitotic perturbations, RPE1 cells have a very low frequency of MN and nuclear atypia, including blebs and chromatin bridges (Figure 1—figure supplement 1a–b). Addition of an Mps1 kinase inhibitor (Mps1i), which blocks the spindle checkpoint and accelerates the metaphase-anaphase transition, induces a high level of whole chromosome missegregation and micronucleation (Garribba et al., 2023). In RFP703/Dendra cells, Mps1i incubation for 24 hr increased MN frequency from 3% to ~50% of cells (Figure 1—figure supplement 1b). Mps1i also increased the frequency of misshapen nuclei and chromatin bridges, but to substantially lower levels than MN (Figure 1—figure supplement 1b), and only misshapen nuclei were slightly enriched in the MN +population (Figure 1—figure supplement 1c). Most MN +nuclei had only one associated MN and these MN were most likely to contain a single chromosome, consistent with previous analyses (Figure 1—figure supplement 1d–e; Garribba et al., 2023). Thus, the major effect of short-term Mps1 inhibition on nuclear morphology in RPE1 cells was increased micronucleation, specifically micronucleation of single chromosomes.

Figure 1. VCS MN neural net module identifies micronucleated cells.

(a) Diagrams of the two constructs transduced into hTERT-RPE1 cells to make the RFP703/Dendra cell line and how the proteins localize in micronucleated (MN+) cells before and after MN rupture. (b) Diagram of VCS MN image analysis pipeline on RFP703/Dendra cells incubated in Mps1i. Raw image is of H2B-emiRFP703 taken on a 20 x widefield objective. Nuclei are first segmented using the Deep Retina segmenter and these masks are used to generate image crops centered on each nucleus. Image crops are then resized to fit the ResNet18 UNet model (see Figure 1—figure supplement 1) and analyzed by VCS MN to generate MN label predictions, which are mapped back to the raw image. After image reconstruction, MN are assigned to the nearest nucleus based on proximity. Nucleus masks from the Deep Retina segmenter are then classified as MN +or MN- to generate the nuclei labels. Scale bar = 40 µm. (c) Recall and positive predictive value (PPV) of MN identification within the image crops. MN were manually labeled as intact or ruptured using an image of NLS-3xDendra2 acquired at the same time. N=5, n=264, 158, 365, 283, 249. (d) Recall and PPV of MN- and MN +labeled nuclei. N=2, n=328, 186. (e) Same as for (d) except images were of U2OS cells expressing H2B-emiRFP703 and 3xDendra2-NLS. N=1, n=85, 95. (f) RFP703/Dendra cells with ruptured (arrows) and intact (arrowhead) MN showing a loss of NLS-3xDendra2 signal in ruptured MN. Scale bar = 10 µm. (g) Recall and PPV for MN rupture-based nucleus classification. N=3, n=120, 91, 82.

Figure 1.

Figure 1—figure supplement 1. Analysis of nuclear and MN characteristics after Mps1i addition to RPE1 RFP703/Dendra cells.

Figure 1—figure supplement 1.

(a) Representative images of nucleus morphology classes in RFP703/Dendra cells after 24 hr Mps1i treatment. The NLS-3xDendra2 channel (NLS-Dend.) is blown out to highlight the cytoplasmic signal. H2B=H2B-emiRFP703 channel. Arrows indicate chromatin bridges. Scale bar=10 µm. (b) Nuclei were manually scored for indicated phenotypes. Rare multinucleated cells were included in the misshapen nuclei class. Nuclei clearly in late telophase based on the cup shape of the chromatin were scored as normal, not misshapen. Msp1i- (DMSO): N=2, n=531, 833. Mps1i +: N=3, n=647, 597, 627 p: **≤0.01, ***≤0.001, ****≤0.0001 by Barnard’s test. (c) Quantification of misshapen nuclei in cells manually labeled as MN +and MN- from (b). N=3, n=839, 1032. p: **≤0.01, by Barnard’s test. (d) Histogram of number of MN per cell in Mps1i treated RFP703/Dendra cells. N=5, n=1323. (e) Quantification of MN chromosome content in Mps1i treated RFP703/Dendra cells. Chromosomes enumerated using a CENPB PNA probe. N=3, n=86, 91, 82. Single chromosome MN are highly enriched in this population and MN containing chromatin fragments (0 foci) are rare.
Figure 1—figure supplement 2. Description of VCS MN classifier training and assessment of MN assignment accuracy.

Figure 1—figure supplement 2.

(a) Graphic depiction of pipeline for generating MN ground truth labels for VCS MN training and testing datasets. Two-channel images of Mps1i treated RPE1 RFP703/Dendra cells were acquired using 20 x widefield epifluorescence. Nuclei were segmented on the H2B-emiRFP703 channel using the Deep Retina segmenter. These labels were used to generate 48x48 pixel crops centered on each nucleus. Crops underwent histogram normalization and filtering out of nuclei touching the image border. MN pixels were then manually annotated on each crop by an expert user. 2341 crops from 100 images (90 treated with Mps1i, 10 untreated) were annotated. Scale bar = 40 µm. (b) Quantification of nucleus solidity in all nuclei versus nuclei labeled by the Deep Retina segmenter in RFP703/Dendra cells after Mps1i treatment. Highly lobulated nuclei (low solidity) were routinely missed by the Deep Retina segmenter. N=1, n=1357, 1492. p: ****≤0.0001 by KS test. (c) Graphic depiction of pipeline for training the VCS MN classifier. VCS MN was trained on the image crops and MN ground truth labels generated in (a). To accommodate Torchvision’s ResNet18’s expectation of a 3-channel 96x96 px image, each crop was resized and then expanded as follows: the first 2 channels are duplicates of the H2B-emiRFP703 image and the final channel is the result of Sobel edge detection. This image set was split into three parts for training, validation, and testing, as indicated, and fed into the VCS MN classifier. (d) Representative images of MN assignment pipeline results. Automatic assignment depicted in top row (arrow) and manual assignment based on the NLS-3xDendra signal (NLS-Dend.) (arrow) in bottom row. Examples of correct and incorrect proximity-based assignment are shown. Scale bar=10 µm. (e) Quantification of proportion correctly assigned MN by proximity. Pooled proportion=97%. N=5 experiments, n=1319 MN. (f) Quantification of cell confluency in training images. Proportion of field covered by cells was defined by thresholding on the total NLS-3xDendra2 signal. Median=33.2%. n=30 images from three experiments. (g) Distance between MN, nearest nucleus, and second nearest nucleus, centroid to centroid, in pixels. Median distances are 17.1 px (nearest) and 57.1 px (second nearest). n=981 MN from images from five experiments. (h) Distance between nuclei, border to border, in pixels in training images. Median = 30 px. N=11 images, n=507 nuclei.
Figure 1—figure supplement 3. Analysis of VCS MN classification accuracy.

Figure 1—figure supplement 3.

(a) Representative images of VCS MN analysis of RFP703/Dendra cells after Mps1i addition. Arrows on H2B-emiRFP703 channel indicate nuclear feature on left. Ground truth masks were annotated manually for nuclei (N) and micronuclei (MN). VCS masks and labels were generated automatically. In the VCS masks column, true positive MN are white, missed MN are green, and false positive MN are magenta. Nucleus blebs and chromatin bridges were rarely identified as MN by VCS MN. Instead, false positives were frequently associated with dim H2B signal (bottom). Examples of false negative MN show enrichment for large MN, MN overlapping the nucleus, and MN with dim H2B signal compared to nearby nuclei. Scale bars = 10 µm. (b) Quantification of H2B-emiRFP703 intensity, MN area, and distance to nucleus for MN that were properly classified by VCS MN (true positives, TP), missed (false negatives, FN), or misclassified (false positives, FP). N=5, 5, 1. n=654, 327, 80. p: ****≤0.0001 by KS test with Bonferroni multiple test correction. (c) MN:nucleus NLS-3xDendra2 intensity ratios for manually classified intact and ruptured MN. Solid gray line=set VCS MN threshold. N=3, n=179, 113. (d, e) Recall and rupture frequency for MN +nuclei by # MN. MN + cells were manually classified. N=2, n values on graph. Data show that nuclei associated with more than 1 MN are more likely to be correctly identified as MN +by VCS MN (d) and that the presence of multiple MN increases the likelihood that at least 1 MN will be ruptured (e).

The major challenges for classical segmentation of MN using DNA markers alone are the (1) similarity in shape and size of MN to nuclear blebs and chromatin bridges, (2) highly variable MN sizes (1 µm diameter to 10 µm diameter), (3) proximity to the nucleus, and (4) similar chromatin labeling intensities and patterns between MN and nuclei. Therefore, we decided to apply machine learning to overcome current challenges in identifying MN from single-channel fluorescence images. For visual cell sorting, we developed a classifier, VCS MN, based on the widely used U-Net architecture (Ronneberger et al., 2015) using Torchvision’s pretrained ResNet18 model as its encoder (Figure 1b). The goal for its design was to rapidly classify MN-associated nuclei in low resolution images of H2B-emiRFP703 fluorescence for subsequent photoconversion and sorting. A second goal was to capture a large proportion of the cells with small MN close to the nucleus, which represent a highly biologically relevant class of MN that is particularly challenging to segment using classical methods (Mammel et al., 2021; Joo et al., 2023). To generate training data, one hundred images of RFP703/Dendra cells either untreated or incubated for 24 hr in Mps1i were acquired on a widefield microscope at ×20. To facilitate ground truth annotation and match the 96x96 pixel input required for ResNet18, we used the nucleus segmenter Deep Retina (Caicedo et al., 2019a) to generate nucleus masks, which were then used to crop each field into smaller images centered around each identified nucleus. MN pixels were then manually annotated on each image crop and labeled as either intact (3xDendra-NLS+) or ruptured (3xDendra-NLS-; Figure 1—figure supplement 2a). Analysis of the Deep Retina results found that it frequently missed highly lobulated MN, resulting in an image set with a higher mean nuclear solidity than observed for bulk Mps1i treated cells (Figure 1—figure supplement 2b). In total, approximately 2000 single-nucleus crops were used as training data, with 164 crops held back for validation and a further 177 for testing. Because the ResNet18 model takes a three-channel image as input, nuclear crops were expanded to three-channels by duplicating the H2B-emiRFP703 channel and adding a third channel containing the results of Sobel edge detection (Figure 1—figure supplement 2c).

The initial output of VCS MN are MN label predictions for each image crop (Figure 1b). To identify nuclei associated with at least one MN, these predictions are converted to masks that are mapped back onto the whole field image. MN objects are then assigned to ‘parent’ nuclear masks by proximity. Nucleus masks associated with at least one MN are labeled as MN +and those associated with no MN are labeled as MN-. MN over 40 pixels (26 µm) away from any nuclear mask (~10% of objects) or within a nucleus mask are discarded as likely false positives signals from cellular debris or inhomogeneity in H2B-emiRFP703 fluorescence.

The accuracy of assigning all MN to the closest nucleus was quantified using the dim cytoplasmic 3xDendra-NLS signal to define the cell boundaries and ground truth MN-nucleus associations (Figure 1—figure supplement 1a). Quantification of correct and incorrect nuclei assignments found that the nearest nucleus mask to the MN was in the same cell 97% of the time (Figure 1—figure supplement 2d–e). Cells in the training and testing datasets were imaged at subconfluent densities varying between 10% and 50% confluency (Figure 1—figure supplement 2f), which is optimal for single cell photoconversion (Hasle et al., 2020). In these conditions, MN were on average 3.3 times closer to the nearest nucleus compared to the next nearest one (Figure 1—figure supplement 2g). However, 10% of nuclei had borders less than 6 µm apart (Figure 1—figure supplement 2h), suggesting that proximity could robustly MN-nucleus pairs even at high cell densities.

VCS MN classifier performance

VCS MN model performance was determined by analyzing the pixel overlap between segmented MN and the ground truth annotations on the test set. A single pixel overlap between the predicted and actual MN masks was sufficient to call an MN as a true positive since any MN signal classified the adjacent nucleus as MN+. We quantified the positive predictive value (PPV, the proportion of true positives, i.e. specificity) and recall (the proportion of MN found by the classifier, i.e. sensitivity) of VCS MN. MN were identified with a recall of 70% and a PPV of 89% (Figure 1c). This translated to a recall of 86% and 65%, respectively, for MN- and MN +nuclei, and a PPV 73% and 93% (Figure 1d). Together these indicate that the MN +nuclei class is highly specific for micronucleated cells and suggest that some MN +nuclei are misclassified as MN-, likely due to VCS MN missing the MN in the initial pixel classification.

We next analyzed the source of MN identification errors by VCS MN. Chromatin bridges and nuclear blebs were rarely misidentified as MN (Figure 1—figure supplement 3a). Instead, false positives were most often dim and likely out-of-focus objects (Figure 1—figure supplement 3a–b) that were misidentified as MN. False negatives were the much larger error class and several features were enriched in missed MN, including low H2B-emiRFP703 fluorescence intensity, very small or very large size, and being either very close or very far away from the nucleus (Figure 1—figure supplement 3a–b). Consistent with smaller MN being missed, we also observed a small, but statistically significant, reduction in recall for ruptured MN (Figure 1c), which compact after rupture (Hatch et al., 2013; Vietri et al., 2020).

To determine whether VCS MN could achieve similar specificity and sensitivity in another cell line, we performed additional training on images of micronucleated cells in other cell types, including U2OS, HeLa, and human fetal fibroblasts (HFF). For testing, U2OS cells expressing H2B-emiRFP703 and 3xDendra2-NLS were treated with Mps1i to induce MN and imaged and annotated as for the RPE1 RFP703/Dendra cells. Analysis of this dataset yielded recall values of 82% and 83% for MN- and MN + cells, respectively, and PPVs of 81% and 83%. MN were identified with a specificity of 88% (Figure 1e). In summary, VCS MN can automatically identify micronucleated and non-micronucleated cells with high specificity in low-resolution images from multiple cell lines containing a mix of contaminating objects.

MN rupture classification

To identify nuclei with intact or ruptured MN, we added a step to the VCS MN module to determine whether any MN assigned to a nucleus were ruptured. Ruptured MN were identified based on 3xDendra2-NLS intensity, which is present in intact MN and very dim in ruptured MN (Figure 1a and f). The pipeline measures the maximum Dendra2 green fluorescence intensity in the nucleus and associated MN and MN with a signal less than 0.16 of the nucleus are classified as ‘ruptured’. This threshold correctly classified approximately 90% of MN (Figure 1—figure supplement 3c). When appended to the VCS MN classifier, this analysis correctly identified 60% of rupture- nuclei (nuclei associated with only intact MN) and 70% of rupture +nuclei (nuclei with at least 1 ruptured MN) with a PPV near 75% in both cases (Figure 1g). The lower sensitivity for rupture MN- nuclei is likely because nuclei associated with multiple MN are both more likely to be scored correctly as MN +and have at least 1 ruptured MN (Figure 1—figure supplement 3d–e).

Development of MNFinder for fixed cell MN segmentation

To determine whether the VCS MN classifier could be broadly used to quantify MN frequency, fluorescence, and morphological characterisitics across different imaging modalities and chromatin labels, we analyzed its performance on RPE1 cells treated with Mps1i, fixed and labeled with DAPI, and imaged at ×40 on a confocal microscope. Although nuclear shape and confluency were similar to that in the training set, VCS MN performed substantially worse on the ×40 images (compare to Figure 1d) and MN masks failed to cover most of the MN area (Figure 2a). Thus, we decided to develop a new classifier for broad MN quantification in fixed cells, called MNFinder.

Figure 2. MNFinder robustly segments MN across cell types and imaging conditions.

(a) Representative image showing undersegmentation and low recall by the VCS MN neural net on DAPI-labeled RPE1 cells imaged at ×40 magnification after Mps1i incubation. Images were scaled down prior to VCS MN analysis to match ×20 pixel size. Mean intersection-over-union (mIoU) and recall quantified from N=2, n=33, 31 images. Scale bar = 10 µm. (b) Overview of MNFinder module for classifying and segmenting MN and nuclei. Raw images of chromatin are first tiled by a sliding window and then processed by two groups of neural nets: one classifies pixels as either nuclei or MN (Nuc/MN) and one is an instance classifier for cells, with cells being defined as the smallest object that encloses a nucleus and its associated MN. Nuc/MN classifier results are post-processed to reassign MN that were misclassified as small nuclei and to expand MN masks. The cell instance classifier outputs gradient maps, which are used to define cells through watershed-based post-processing (see Figure 2—figure supplement 1). Nuc/MN results are integrated with cell results to produce final labels of individual MN, nuclei, and their associations. Image crops are reassembled onto the final image by linear blending. Scale bar = 40 µm. (c–d) Example images and MN pixel predictions using MNFinder on multiple cell types (RPE1 H2B-emiRFP703, U2OS, U2OS H2B-emiRFP703, HeLa H2B-GFP, and HFF), chromatin labels (DAPI, H2B-FP), and magnifications (×20, ×40). Scale bar = 10 µm. MN recall, PPV, and mIOU were quantified and averaged per image for each condition. Dotted line=performance of the VCS neural net on RPE1 H2B-emiRFP703 ×20 images. MNFinder performance is similar across conditions except images of H2B-GFP in fixed HeLa cells. N=1. n (on graph)=cells.

Figure 2.

Figure 2—figure supplement 1. Details of UNet architectures and object processing by the MNFinder module.

Figure 2—figure supplement 1.

(a) The nucleus and MN (Nuc/MN) pixel classifier takes as input a cropped single channel image of chromatin and feeds it into two parallel, attention-gated UNets, one of which also has multiscale downsamplers (yellow). The nucleus weights from the basic UNet (top) are retained and MN weights from both UNets are fed to a third UNet for ensembling to produce the final MN predictions. (b) Diagram of the triple decoder cell instance classifier. Two of the decoders have a UNet3+-like architecture with multiple skip connections and deep supervision during training. Feature depths are kept constant and most concatenation/max-pooling operations are replaced with addition to reduce training time. One decoder generates distance maps of a concave hull containing each nucleus and any associated MN (a ‘cell’) and the other generates a proximity map of each cell’s distance to all others. A third decoder uses a standard UNet with attention gates (cyan) to classify foreground pixels (nuclei and MN) and is used as input into every level of the distance- and proximity-map decoders via an integration block (magenta). (c) Results from the nucleus/MN pixel classifier and cell instance classifier are further processed to improve accuracy. To limit misclassification of large MN as small nuclei, nuclei under a user-set area threshold are reclassified as MN. To limit MN undersegmentation, MN pixel groups are expanded by transforming each object to its convex hull. Scale bar = 5 µm. (d) Distance and proximity maps from the cell instance classifier are combined to generate seeds for watershed segmentation. To correct for oversegmentation, only labels with boundaries that intersect a skeletonized version of the proximity map or background pixels are retained. Scale bar = 10 µm.
Figure 2—figure supplement 2. Schematic of training data generation and use in MNFinder classifiers.

Figure 2—figure supplement 2.

(a) Graphic depiction of pipeline to generate training data for the MNFinder MN and nucleus pixel classifier. A collection of single channel fluorescence images from several cell lines, chromatin labels, and microscope settings were manually annotated for nuclei and MN. Training, validating, and testing data were generated from these images by first normalizing image dimensions so that nuclei were approximately 30 px across their shortest diameter, generating random 128 by 128 px crops from all images, augmenting the data with a series of transformations, and then adding a second channel from a Sobel filter. This yielded a dataset of ~8800 images, which was split for training, validation, and testing as indicated. (b) Graphic depiction of pipeline to generate training data for MNFinder cell instance classifier, where cell is a defined as a nucleus and its associated MN. The same starting image set was used as in (a). Ground truth associated nuclei and MN groups were manually labeled using the NLS-3xDendra2 cytoplasm signal. Grouped objects were then transformed into cells by drawing the convex hulls. These hulls were then transformed into distance and proximity maps using Euclidian distance formulas. These maps were combined with a mask of the foreground pixels to generate a three-channel image. These images were randomly cropped and transformed to generate a training dataset of ~8800 images. This dataset was then split for training, validating, and testing of the cell instance classifier as indicated. Scale bars = 40 µm.
Figure 2—figure supplement 3. Analysis of MNFinder errors.

Figure 2—figure supplement 3.

(a) Representative images of MNFinder analysis from human fetal fibroblasts (DAPI, top), RPE1 RFP703/Dendra (H2B-emiRFP703, second) or U2OS RFP703/Dendra (H2B-emiRFP703, bottom two) cells. Arrows on the chromatin images point to the nuclear feature listed at left. Ground truth annotations for nuclei (N) and micronuclei (MN) were generated manually. Nucleus and MN masks were automatically defined using MNFinder. In the mask channel, true positive MN are white, missed MN are green, and false positive MN are blue. Neither nuclear blebs nor chromatin bridges were frequently miscategorized as MN. False negatives were enriched in small MN, and false positives were frequently nuclear lobes. Scale bars=10 µm. (b) Quantification of false negative (FN) MN area. MN missed by MNFinder were enriched in small MN across the testing dataset. TP=true positives, N=12 images, n=329, 60 MN. p: *≤0.05 by KS test. (c) Quantification of nucleus solidity in nuclei associated with false positive (FP) MN. False positives were frequently associated with highly lobulated nuclei with low solidity. N=12, n=329, 21. p: *≤0.05 by KS test.

MNFinder takes a single or multi-channel image of a chromatin-labeled sample at a resolution between 1.5 and 2.8 pixels per µm as input and uses two sets of neural nets to segment nuclei and MN and associate them into ‘cells’. MNFinder first crops the image into overlapping 128x128 pixel sections. These are analyzed by (1) an ensemble of three attention-gated U-Nets (Nuc/MN pixel classifier; Oktay et al., 2018; Ronneberger et al., 2015) and by (2) a multi-pathway UNet3+-based ‘cell’ instance classifier (Huang et al., 2020) based on the architecture described in Mahbod et al., 2022. The output masks and labels from both U-Nets then undergo additional filtering and morphological transformations before they are combined into a final multi-channel mask of the nuclei, MN and their associations. A default data output lists individual MN and nuclei objects, thire relationships, and several object features. Additional outputs include multi-channel fluorescence intensity and a wide range nuclear and MN morphometry available via the python scikit tool.

Nucleus and MN pixel segmentation is organized as follows: crops are converted into 2-channel images by adding the result of Sobel filtering to the raw image. These images are then fed into two U-Net inputs, one incorporating multiscaling downsample blocks (Su et al., 2021) in the encoder pathway and one without. The MN predictions from both U-Nets are then combined and fed into a third U-Net classifier to produce a consensus output (Figure 2—figure supplement 1a). Each of the three neural nets were trained separately and the ensembling neural net was trained with fixed weights set for each input neural net (Figure 2b, top). We chose this architecture because incorporating multiscale downsample blocks identified some MN otherwise missed, but produced an overall reduction in performance. Using an ensembling neural net allowed us to retain these MN while maintaining the high peformance observed with a single U-Net (Table 1). To adjust for the imbalanced nature of our data, where MN make up only a small proportion of any given image, we incorporated attention gates in the upsampling blocks and used the Focal Loss function to direct training to the complex parts of the image (Lin et al., 2018).

Table 1. Comparison of neural net model metrics for MN segmentation across cell lines.

Model PPV (%) Recall (%) mIoU, per object (%)
Cell type RPE-1 U2OS HeLa HFF U2OS (Broad) RPE-1 U2OS HeLa HFF U2OS (Broad) RPE-1 U2OS HeLa HFF U2OS (Broad)
UNet 89.3 (±4.5) 88.2 N/A N/A 44.8 50.8 (±6) 52 N/A N/A 59.1 48.1 (±1.7) 40.8 N/A N/A 33.7
Attention 100.0 (±0) 92.6 (±6.5) 63.9 (±12.7) 90.8 (±7.3) 88.9 (±19.2) 85.8 (±7.3) 82.8 (±2.5) 85.6 (±17.1) 92.7 (±5.4) 76.4 (±10.5) 78.7 (±6.5) 75.0 (±13.6) 69.0 (±13.7) 74.4 (±13.3) 75.5 (±2.8)
MSAttention 94.8 (±7.3) 92.2 (±6.8) 68.8 (±27.2) 87.6 (±6.6) 87.9 (±21.0) 84.4 (±0.7) 83.2 (±2.75) 74.4 (±13.5) 92.3 (±6.4) 80.6 (±12.0) 75.2 (±0.4) 73.6 (±9.2) 74.9 (±8.4) 71.3 (±13.4) 75.6 (±4.0)
Ensemble 100.0 (±0) 93.8 (±5.4) 75.0 (±25.0) 89.9 (±8.74) 88.9 (±19.2) 82.6 (±7.4) 82.8 (±2.5) 85.6 (±17.1) 92.7 (±5.4) 76.4 (±10.5) 79.3 (±6.0) 74.2 (±12.8) 68.8 (±13.1) 72.4 (±13.4) 74.8 (±2.3)

The ‘cell’ instance classifier is organized as follows: crops are fed into a U-Net with two U-Net3 +decoder (Huang et al., 2020) pathways organized in parallel: one that predicts distance from the cell center, where a ‘cell’ is defined as the smallest convex shape encompassing a nucleus and its associated MN, and one that predicts proximity to neighboring nuclei. A third decoder pathway predicts foreground pixels and is used as input for the first two decoder pathways, an architecture that we found substantially improved training speed and model performance (Figure 2b, bottom). To reduce neural net complexity, concatenation operations were replaced with addition (Lu et al., 2022) and the foreground/ background decoder pathway foregoes the additional skip connections found in U-Net3 +designs (Figure 2—figure supplement 1b).

The outputs from the nucleus/MN and cell classifiers are further processed to improve segmentation and nucleus/MN association accuracy. Masks from the nucleus/MN segmenter are passed to a filter that reclassifies nuclei with an area below 250 px as MN and then expands MN mask boundaries to their calculated convex hull to improve pixel capture (Figure 2—figure supplement 1c). ‘Cell’ masks are derived from the cell instance classifier by first summing the distance and proximity predictions and using the output as seeds for watershed segmentation. To correct for oversegmentation, boundary pixels in these objects are then filtered to retain those that touch either a background pixel or a pixel in the skeletonized proximity map (Figure 2—figure supplement 1d). After post-processing, the results of the nucleus/MN pixel classifier and the ‘cell’ instance classifier are integrated to produce a final set of labels and masks and a data file (Figure 2b).

To train, validate, and test MNFinder, we generated a dataset of 46 images of multiple cell lines, including RPE1, U2OS, HeLa, and human fetal fibroblasts (HFF), after incubation in Msp1i, labeled with dyes or a fluorophore-tagged histone, and imaged at multiple magnifications on multiple microscopes as either a single section or a maximum intensity projection (Table 2), after incubation in Mps1i for 24 hr, imaged on multiple microscopes at multiple resolutions. Nuclei and MN pixels were manually annotated on each image to generate the ground truth dataset. To train the nucleus/MN pixel classifier, thirty images with their annotations were cropped, augmented by translation, rescaling, rotation, and flipping, and then converted to 2-channel images to generate a final set of 8800 images. Training was performed on 65% of the image collection, with 7% held back for validation, and the final 28% used for testing (Figure 2—figure supplement 2a).

Table 2. List of image sets used for training, validating, and testing MNFinder.

Cell_type DNA_Label Integrity_label Max_Int_Proj Objective mag. Microscope Initial image resolution (px/um) Scaled_image_resolution Number_images Training Validation Test
HeLa H2B-GFP None Y 40 Confocal_LSM 5.6 2.8 1 1
HeLa DAPI None Y 40 Confocal_LSM 5.6 2.8 6 3 1 2
HeLa DAPI None N 20 Confocal_LSM 2.8 2.8 4 4
hTERT-HFF DAPI H3K27-acetyl Y 40 Confocal_Spinning_Disk 3.9 1.95 4 4
hTERT-RPE1 DAPI 2xRFP-NLS Y 40 Confocal_LSM 5.6 2.8 2 2
hTERT-RPE1 H2B-emiRFP703 NLS-3xDendra2 N 20 Widefield 1.55 1.55 16 14 1 1
MCC13 DAPI None N 20 Confocal_LSM 2.8 2.8 3 3
U2OS DAPI H3K27-acetyl Y 20 Confocal_LSM 2.8 2.8 1 1
U2OS DAPI H3K27-acetyl Y 40 Confocal_LSM 5.6 2.8 1 1
U2OS H2B-emiRFP703 NLS-3xDendra2 N 20 Widefield 1.55 1.55 8 6 1 1
Total: 46 30 3 13

To train the ‘cell’ instance classifier, annotations of the foreground pixels, distance map, and proximity maps of the full field image were generated on the same starting images by drawing the smallest convex shape that enclosed a nucleus and any associated MN, based on cytoplasmic 3xDendra2-NLS signal or another cell marker. These convex hulls were then converted to distance and proximity maps by scaling the pixel intensity based on distance from the hull’s center and distance to adjacent hulls, respectively. These annotated images were then cropped, augmented, and split into training, validation, and test data sets as for the pixel classifier (Figure 2—figure supplement 2b).

MNFinder model performance was determined using the test dataset and publicly available images of unperturbed U2OS cells from Broad Bioimage dataset BBBC039v1. We calculated that the unperturbed U2OS cells had an MN frequency of ~8%, compared to the frequency of ~30–70% in the rest of our dataset (Table 3). MNFinder showed significant improvement in recall over VCS MN for all image types with an additional improvement in PPV for 3/5 conditions (Figure 2c–d). Importantly, recall and PPV were largely insensitive to image resolution, DNA label, and cell type. HeLa H2B-GFP ×40 images were an outlier in terms of performance for unclear reasons, but also had the lowest MN frequeny. We calculated the per object mIoU to determine the quality of the segmentation and found that most MN were accurately segmented with mIoU values between 74% and 79% despite the median MN area being only 34 pixels after resizing (Figure 2d, Table 1).

Table 3. Quantification of MN frequency (propotion of cells with at least 1 MN) in images used to evaluate MNFinder.

RPE-1 U2OS HeLa HFF U2OS (Broad)
0.706 0.623 0.286 0.5 0.078

Analysis of MNFinder model errors found that chromatin bridges and nucleus blebs were rarely misclassified as MN (Figure 2—figure supplement 3a). Instead, highly lobulated nuclei were the main source of false positive errors (Figure 2—figure supplement 3a, c). Although small MN were enriched in the false negative error class (Figure 2—figure supplement 3a–b), over 75% of MN under 5 µm2 were properly identified. These metrics indicate that MNFinder provides accurate and robust MN segmentation across multiple cell lines and image acquisition settings. MNFinder identifies MN with similar sensitivity and substantially improved specificity compared to existing MN enumeration programs (Table 4; Ibarra-Arellano et al., 2024; Pons and Mauvezin, 2024) and is the only one to report a high mIoU, which is necessary for quantifying MN characteristics. This module is available as a Python package, mnfinder, via pip and on the Hatch Lab GitHub repository, along with the training, testing, and validation images and ground truth labels.

Table 4. Comparison of MNFinder to existing programs that quantify MN frequency in adherent cell images.

Study Cell lines tested Multiple imaging conditions assessed? PPV Recall Only MN identified? mIoU
Pons and Mauvezin, 2024; QATS U2OS and HeLa cells No 99% 60% Yes ND
Ibarra-Arellano et al., 2024; micronuclAI Multiple human and mouse cell lines Yes 93.20% 98.70% No, nuclear buds classified as MN ND
This study; MNFinder Multiple human cell lines Yes 75.0–100% 83.2–92.7% Yes 74.4–79.3%

Visual cell sorting with VCS MN can isolate live micronucleated and non-micronucleated cells

To test whether visual cell sorting could be used to understand how micronucleation contributes to aneuploid cell phenotypes, we first asked whether using visual cell sorting with the VCS MN classifier could yield cell populations enriched in either micronucleated or non-micronucleated cells. Visual cell sorting takes advantage of recent advances in digital micromirror technology coupled with expression of a convertible fluorophore to label multiple populations of cells with single cell accuracy. By illuminating different classes of cells with different lengths of UV (405 nm) pulses, it can generate up to four distinct proportions of converted Dendra2, which can be sorted by FACS (Hasle et al., 2020; Figure 3a). To optimize labeling and sorting conditions, we mixed RPE1 cells labeled with CellTrace far-red at a 1:1 ratio with unlabeled cells and targeted the two populations with different UV pulses based on thresholding of the far-red fluorescence intensity (Figure 3—figure supplement 1a). We found that activating cells with either an 800ms or a 200ms UV pulse allowed us to define two populations with high precision by FACs and with minimal activation of adjacent cells of the wrong class (Figure 3—figure supplement 1b). Reanalysis of sorted populations demonstrated that our protocol yielded a 95% sorting efficiency (Figure 3—figure supplement 1c).

Figure 3. Visual cell sorting can isolate RPE1 RFP703/Dendra micronucleated cells.

(a) Overview of visual cell sorting protocol. Cells are plated in multi-well plates, cellular phenotypes are quantified on-demand during imaging by VCS MN, and labeled nuclei (e.g. MN +and MN-) are photoconverted for either 200 or 800ms, yielding two different ratios of red:green fluorescence (red and yellow nuclei). These differences are quantified by FACs and gated on the red:green ratios for cell sorting. Graphic created with BioRender.com. (b) Histogram of nuclear red:green ratio quantification measured on repeated imaging of the same fields 0, 4, and 8 hr after photoconversion. Representative images are pseudo-colored by log10 Dendra red:green ratio (below). N=1, n=82, 353, 285, 313. Scale bar = 40 µm. (c) Design of MN cell isolation validation experiment. VCS MN classifier PPV calculated on images acquired during activation and frequency of MN- or MN + cells manually quantified in cells plated and fixed post-FACs sorting. Pre-FACS: N=2, n=251, 263. post-FACS N=1, n=338, 353.

Figure 3.

Figure 3—figure supplement 1. Controls for VCS MN isolation experiments.

Figure 3—figure supplement 1.

(a) Outline of RFP703/Dendra visual cell sorting validation experiment using CellTrace labeling as the activation trigger. Cells were incubated with CellTrace far-red and mixed with unlabeled cells at a 1:1 ratio. Nuclei were labeled based on CellTrace fluorescence intensity and converted with either an 800ms (CellTrace+) or 200ms (CellTrace-) UV pulse. The well was only partially converted prior to FACs analysis and sorting. Representative image of the mixed population prior to photoconversion is shown. Scale bar=10 µm. (b) FACS plot of Dendra2 red:green ratio versus CellTrace fluorescence. Colored bars represent gates. As expected, CellTrace + cells generally have a high red:green ratio, CellTrace- cells a middle red:green ratio, and unanalyzed cells from both classes a low red:green ratio. Percentages on graph are the percentage of CellTrace- and CellTrace + cells in their expected gate, i.e. the population purity. (c) Histogram of CellTrace fluorescence in cells sorted on the Dendra2 red:green ratio in (b) after re-analysis by FACs. Values show a slight drop in cell purity for both populations after sorting. (d) Data from Figure 3b replotted with NLS-3xDendra2 red and green fluorescence intensity values plotted separately. Two peaks of red fluorescence intensity are visible immediately after photoconversion (0 hr) and maintained over time. The increased population of nuclei at 4 and 8 hr with only background levels of red fluorescence (Dendra2:Red <101.5) likely represents unconverted nuclei moving into the imaging frame. The downward shift in red fluorescence intensity over time likely represents turnover of converted red fluorophores within the cells. N=1, n=82, 353, 285, 313. p: ****≤0.0001 by KS test. (e) Predicted classifier PPV (population purity) for low MN frequency U2OS cells (U2OS Broad). We observe a lower, but still substantial, enrichment of micronucleated cells in the MN +population compared to a highly micronucleated population (e.g. Figure 3c). N=1, n=17 cells.

We next tested whether Dendra2 red:green ratios were stable for the duration of a cell isolation experiment, based on the estimated time required for imaging, analyzing, and converting multiple six-well plate wells. RPE1 RFP703/Dendra cells were randomly converted with either a 200ms or 800ms UV pulse and nuclear green and red fluorescence intensities quantified at the same positions 0, 4, and 8 hr after conversion. Analysis of the red:green ratios found that, although the intensity of the red fluorescence decreased over time (Figure 3—figure supplement 1d), the two activated populations and a small unactivated one (arising from unconverted cells moving into the field of view) remained distinct through the 8 hr time point (Figure 3b). These data demonstrate that the change in Dendra red:green ratio after photoconversion persists long enough for accurate cell isolation at the end of a multi-well experiment.

We next asked whether VCS MN classification plus visual cell sorting could accurately activate and isolate MN +and MN- cells. RPE1 RFP703/Dendra cells were incubated with Mps1i for 24 hr prior to imaging to generate MN and Cdk1i was added 1 hr prior to imaging to prevent mitosis, which dilutes the Dendra2(red) signal and frequently leads to MN + cells becoming MN- daughter cells (Hatch et al., 2013; Zhang et al., 2015). Nuclei were classified by VCS MN as either MN +or MN-, converted with 200ms and 800ms pulses, respectively, and isolated by FACS. Sorted cells were replated in medium containing Cdk1i, fixed, and MN frequency quantified manually. Comparison of classifier PPV for MN +and MN- cells during activation and MN +and MN- cell frequency after FACs found a strong enrichment for the correct cell type in each group, with the increased purity of MN +classified cells being retained during sorting (Figure 3c). An increase in MN- cells in the MN +group after sorting is likely due to limitations in sorting efficiency (Figure 3—figure supplement 1c) and cells overcoming the Cdk1i block during sorting, plating, and entering mitosis and either dying or producing MN- daughter cells.

To determine how MN frequency affects our ability to accurately isolate MN + cells, we estimated sorted population purity of untreated U2OS cells, which we quantified as having 7.8% MN + cells (Table 3). We combined our previously obtained PPV and recall values with a predicted 95% sorting accuracy. The decreased frequency of MN +nuclei meant that the proportion of false positives appeared larger, which decreased purity of the MN +population compared to Msp1i treated RPE1 cells (Figure 3—figure supplement 1e). However, the MN +population had a nearly 7-fold enrichment of MN + cells compared to the starting population, similar to the enrichment of tumor cells in patient biopsies (Wu et al., 2021) that have been the basis of cancer driver identification by transcriptomic and genetic analysis. Thus, our data strongly suggest that visual cell sorting can be combined with the VCS MN classifier to generate cell populations enriched for MN- and MN + cells from a variety of conditions and MN frequencies that are highly suitable for multiple downstream applications, including genetic screening, bulk RNA and proteomic analyses, and single cell sequencing.

Validation of visual cell sorting for transcriptome analysis of aneuploid cells

To validate visual cell sorting for aneuploidy-associated transcriptome analysis, we used RNASeq to define gene expression changes in RPE1 cells after Mps1i incubation and optical isolation. Cells were incubated in either DMSO or Mps1i for 24 hr and each population was randomly activated with short and long UV pulses using nuclear masks generated by the Deep Retina segmenter (Figure 4a). Conversion of 1500–2000 fields at ×20 magnification allowed us to isolate 13 k cells after FACS for each photoconverted population in each condition. Isolated populations were then processed for RNASeq. Principal component analysis (PCA) of the results revealed that, as expected, results clustered first by treatment group (Figure 4b). Analysis of the DMSO samples found minimal differences in gene expression associated with UV pulse duration (Figure 4—figure supplement 1a–b), consistent with previous results (Hasle et al., 2020). Therefore, data from cells activated at both pulse lengths were pooled in subsequent analyses.

Figure 4. Visual cell sorting pipeline identifies Mps1i transcriptional response.

(a) Outline of experiment. Nuclei identified by the Deep Retina segmenter were randomly activated for 800 or 200ms. (b) PCA plot showing clustering of Mps1i-treated and DMSO-treated cells by treatment (major) and by replicate (minor). Each experimental replicate represents two technical replicates. (c) MA plot of genes identified by RNASeq. Differentially expressed genes (FDR adjusted p-value <0.05) are in green. Gray lines represent 1.5-fold-change in expression. (d) Heatmap of Hallmark pathway enrichment between visual cell sorting data and data from Santaguida et al., 2017 and He et al., 2019 analyses of RPE1 cells after chromosome missegregation. Hallmark pathways (bottom) were clustered based on manually annotated categories (top).

Figure 4.

Figure 4—figure supplement 1. Differential UV pulses do not induce substantial transcriptional changes.

Figure 4—figure supplement 1.

(a) PCA plot of cells treated with DMSO and exposed to 800ms or 200ms UV. (b) MA plot of the data in (a). Only six differentially expressed genes were identified in cells exposed to 800ms vs 200ms UV pulses and only three were downregulated over 1.5-fold: DDX39B, FASN, RGPD6.

Analysis of gene expression differences identified 2200 differentially expressed genes (DEGs) in Mps1i versus DMSO treated cells, 63 of which had absolute changes >1.5-fold (Figure 4c, Figure 4—figure supplement 1b-c). We used gene set enrichment analysis (GSEA) to compare these genes to those previously identified in RPE1 cells after induction of chromosome missegregation (Figure 1—figure supplement 1d-e; He et al., 2018; Santaguida et al., 2017). This analysis found substantial overlap between the Hallmark pathways changing in Mps1i cells isolated by visual cell sorting and in previous studies (Figure 4—figure supplement 1f-h), including increased expression of inflammation, EMT, and p53 associated genes (Figure 4d). Additional DEGs observed in visual cell sorting samples fell into similar function categories and potentially resulted from increased sequencing depth and increased sensitivity from the exclusion of highly lobulated nuclei from our analysis by the Deep Retina segmenter. These data confirm that visual cell sorting can accurately and sensitively identify biologically relevant transcriptional changes in aneuploid RPE1 cells.

MN rupture induces few transcriptional changes

MN formation frequently coincides with increased aneuploidy, as both a cause and a consequence. Therefore, we asked whether the presence or rupture of an MN contributed to the genome-wide aneuploidy transcriptional response (Andrade et al., 2023; He et al., 2018; Santaguida et al., 2017). Although it has been suggested that MN rupture induces rapid upregulation of inflammatory genes via activation of cGAS-STING signaling, initial and subsequent studies have demonstrated that this response does not occur during the first cell cycle and also can occur in RPE1 cells that lack cGAS via a RIG-I dependent mechanism (Chen et al., 2020; Flynn et al., 2021; Harding et al., 2017; Mackenzie et al., 2017; Mohr et al., 2021; Sato and Hayashi, 2024). Thus, we do not expect the lack of cGAS expression in RPE1 cells to alter MN transcriptional responses.

We first analyzed gene expression changes in MN +versus MN- RPE1 cells after Mps1i treatment (Figure 5a). Quantification of nuclear atypia in the masked nuclei found no difference in the proportion of MN +nuclei with lobulated nuclei, chromatin bridges, or rounded up nuclei compared to MN- ones (Figure 5—figure supplement 1a), likely due to exclusion of most of these nuclei by the Deep Retina segmenter (Figure 1—figure supplement 1b). RNASeq results clustered largely by replicate rather than by the presence of an MN (Figure 5b). Consistent with a lack of a strong transcriptional effect, DEG analysis identified few statistically significant changes in gene expression (Figure 5c, Figure 5—figure supplement 1i). The only two genes that were upregulated more than 1.5-fold, EGR1 and TNFAIP3, were also strongly upregulated in the Mps1i bulk analysis (Figure 5—figure supplement 1cj). Together, these results suggest that micronucleation does not drive substantial changes in the transcriptional program of aneuploid cells.

Figure 5. Micronucleation and rupture result in few transcriptional changes.

(a) Outline of experiment for MN +and MN- cell isolation from Mps1i treated RFP703/Dendra cells. (b) PCA plot showing clustering of MN + and MN- cells by replicate (major) and condition (minor). (c) MA plot of genes identified in MN +/-RNASeq. Of the identified DEGs, only two have fold-changes larger than 1.5. Both, TNFAIP3 and EGR1, are also significantly upregulated in Mps1i-treated cells. (d) Outline of experiment for MN rupture + and rupture- cell isolation. (e) PCA plot showing clustering of intact MN and ruptured MN cells by condition and replicate. (f) MA plot of genes identified in MN rupture +/-RNASeq. The three DEGs with a fold-change higher than 1.5 and unique to this dataset are labeled on the plot. (g) Heatmap of Hallmark pathway enrichment in datasets of DMSO vs Mps1i, Mps1i-treated cells with and without MN, and synchronized, Mps1i-treated, MN + cells with and without MN rupture. All DEGs, defined as an FDR less than 0.05, are included. Pathways are grouped based on manual annotation (right). 6/43 pathways are unique to the MN +/- profile compared to Mps1i+/-, and 2/43 pathways are unique to MN rupture +/-compared to Mps1i+/-.

Figure 5.

Figure 5—figure supplement 1. Analysis of visual cell sorting MN images.

Figure 5—figure supplement 1.

(a) Manual analysis of nuclei features in cells labeled as MN + or MN- by VCS MN in RNASeq experiments. Categories defined as in Figure 1—figure supplement 1. N=3, n=875 MN +nuclei and 1375 MN- nuclei. p: ns >0.05 by Barnard’s test. (b) Change in MN rupture frequency over time in asynchronous and synchronized cells treated with Cdk1i. Other=mitotic, MN-, or Dendra2- cells. N=1, n=~200 cells per time point. (c) Change in MN rupture frequency between the start and end of a visual cell sorting experiment (4 hr) and predicted change in classifier PPV due to ongoing rupture of intact MN based on values in (b).

We next asked whether MN rupture affected transcription by using visual cell sorting to isolate MN + cells with all intact MN (rupture- nuclei) and MN + cells with at least 1 ruptured MN (rupture +nuclei). To minimize the number of MN rupturing during the experiment, and thereby decreasing the purity of the intact MN population, we synchronized the cells prior to Mps1i treatment and converted them in the following G1 phase (Figure 5—figure supplement 1b). Analysis of MN rupture by live-cell imaging found this led to only about 10% of intact MN cells becoming ruptured MN cells over 4 hr, the time to photo-convert one well of a six-well plate. We estimated this would reduce the intact MN population PPV from 70.6% to 61.9% (Figure 5—figure supplement 1c).

Analysis of gene expression changes between intact and ruptured MN + cells by RNAseq showed that samples clustered first by condition (Figure 5e) and identified a large number of DEGs, 14 of which had absolute changes greater than 1.5-fold (Figure 5f, Figure 5—figure supplement 1k). Of these 14, 3 were not previously identified in the Mps1i+analysis (Figure 5—figure supplement 1l) and GSEA analysis confirmed that almost all of the pathways altered in either MN +and MN rupture + cells overlapped with those identified in the bulk Mps1i population (Figure 5g, Figure 5—figure supplement 1m-n). These data suggest that MN rupture has minimal unique effects on transcription in the first interphase after whole chromosome missegregation.

MN formation and rupture do not contribute to the aneuploidy transcription signature

Due to the large overlap between DEGs in MN rupture cells compared to the bulk aneuploid population, we tested the hypothesis that MN rupture is the cause of these transcriptional changes. If this is the case, then we expect to observe the changes in gene expression to get stronger as cells with ruptured MN are more enriched in the population. We used k-means clustering to visualize changes in DEG expression between Mps1i treated cells, MN + cells, and MN rupture cells. To minimize noise from potential batch effects in the MN +dataset, we included individual replicates from that and the MN rupture experiment in our analysis. This analysis identified one gene cluster that showed potentially elevated expression in MN rupture cells, which included all the most highly upregulated genes after MN rupture (Figure 6a, Figure 5—figure supplement 1o).

Figure 6. Micronucleation and rupture do not significantly alter the aneuploidy transcription response.

(a) Heatmap of Mps1i+DEGs (cutoff: absolute fold-change ≥1.5) compared to MN +and rupture +replicates. Euclidean distances calculated for features and samples and clustered by complete-linkage. Genes lacking values for one more experiment types were excluded. Line=gene cluster upregulated in MN rupture + cells. (b) Representative images and quantification of ATF3 and EGR1 labeling in RPE1 2xRFP-NLS cells after Mps1i incubation. Arrows = ruptured MN, arrowheads=intact MN. Scale bar=20 µm. (c) Quantification of normalized ATF3 and EGR1 mean nuclear intensity in manually classified cells. N=2 (graph colors), n=on graph, p: ns >0.05, *≤0.05, ***≤0.001 by GEE. (d) Representative images of DNA FISH for chromosomes 1, 11, and 18 and H3K27Ac identification of intact MN. Arrows=ruptured MN, arrowheads=intact MN. (e) Quantification of aneuploidy frequency (foci do not equal 2) per chromosome. Cells manually classified as MN- or MN+, and MN rupture- or rupture+. MN: Chr 1: N=2, n=429, 158; Chr 11: N=3, n=406, 313, 160; Chr 18: N=3, n=425, 202, 230. Rupture: Chr 1: N=2, n=187, 74; Chr 11: N=3, n=190, 108, 71; Chr 18: N=3, n=186, 102, 101.

Figure 6.

Figure 6—figure supplement 1. Controls related to Figure 6.

Figure 6—figure supplement 1.

(a) Representative images and quantification of ATF3 nuclear mean fluorescence intensity in cells treated with DMSO or doxorubicin (Doxo.). N=2 (colors on graph), n=on graph. Scale bar = 20 µm. (b) Representative images and quantification of EGR1 nuclear mean fluorescence intensity in cells treated with DMSO or hEGF. N=2 (colors on graph), n=on graph. For a-b: p: ***≤0.001, by GEE. Scale bar=20 µm. (c) Same analysis as Figure 6b, but with chromosomes in ruptured MN excluded from the foci count. Similar levels of aneuploidy were observed between groups as in Figure 6b.

We next attempted to validate increased expression of two of these genes, EGR1 and ATF3, after MN rupture by immunofluorescence. As a positive control, increased protein levels were observed after addition of hEGF or doxorubicin, a DNA damaging agent, to RPE1 cells (Figure 6—figure supplement 1a–b). Both EGR1 and ATF3 showed increased expression after Mps1i treatment, however, no increase in EGR1 and only a small shift in ATF3 was observed in cells with ruptured MN (Figure 6b–c). Based on these results, we conclude that MN rupture is strongly unlikely to be a significant driver of transcriptional changes in aneuploid cells.

Cells with ruptured MN have higher levels of aneuploidy

One explanation for our observation that some genes increase in expression strength in cells with ruptured MN is that this population is enriched in aneuploid cells. We therefore analyzed aneuploidy levels in Mps1i-treated RPE1 cells by quantifying the number of chromosomes 1, 11, and 18 by DNA-FISH. Cells were manually classified as MN- or MN +by DAPI labeling, and MN classified as ruptured based on loss of H3K27Ac labeling (Mammel et al., 2021; Mohr et al., 2021; Figure 6d). Aneuploidy frequency varied between chromosomes but was consistently higher for MN +compared to MN- cells and higher for 2/3 chromosomes for ruptured compared to intact MN + cells (Figure 6e). Similar results were obtained when transcription loss due to MN rupture was considered (functional aneuploidy; Figure 6—figure supplement 1c). Overall, our results strongly suggest that micronucleated cells likely have slightly higher levels of aneuploidy and that MN formation and rupture do not cause specific transcriptional changes in the first cell cycle.

Discussion

In this study, we present two machine-learning based modules to identify MN and micronucleated cells from single channel fluorescence images of chromatin or DNA and combine one with visual cell sorting to profile the transcriptional responses to MN formation and rupture. We demonstrate that our MN segmentation pipeline MNFinder can robustly classify and segment MN across multiple adherent cell types and fluorescent imaging conditions. Further, we demonstrate that a separate MN cell classifier, VCS MN, rapidly and robustly identifies micronucleated cells from low resolution images and can be combined with single-cell photoconversion to accurately isolate live cells with none, intact, or ruptured MN from a mixed population. Using this platform, we find that, unexpectedly, neither micronucleation nor rupture triggers gene expression changes beyond those associated with a slight increase in aneuploidy. Overall, our study brings a powerful high-throughput optical isolation strategy to MN biology that enables new types of investigations and a new module for quantifying MN characteristics in adherent cells that will allow naïve users to rapidly and robustly quantify these compartments.

Visual cell sorting isolation of micronucleated cells has several advantages over current methods to identify the mechanisms and consequences of MN formation and rupture. First, it can be used on any adherent cell line. This overcomes challenges involved with using lamin B2 overexpression to inhibit MN rupture, which is limited to specific cell lines and MN types (Hatch et al., 2013; Mammel et al., 2021; Xia et al., 2019) and is complicated by additional changes in mitosis and gene expression (Agustinus et al., 2023; Han et al., 2020; Kuga et al., 2014; Pujadas Liwag et al., 2024). In addition, visual cell sorting overcomes cell line and MN content restrictions present in systems that induce missegregation of single chromosomes or chromosome arms by enabling analysis of all missegregation events in any genetic background (Lin et al., 2023; Ly et al., 2019; Ly et al., 2017; Shoshani et al., 2021; Trivedi et al., 2023). Unlike live single-cell assays, it is highly scalable and eliminates selection pressures and restrictions added by clonal expansion (Mohr et al., 2021; Papathanasiou et al., 2023; Zhang et al., 2015). Importantly, VCS MN isolation captures whole live cells, overcoming limitations associated with MN purification (Agustinus et al., 2023; Klaasen et al., 2022; Mohr et al., 2021; Papathanasiou et al., 2023; Tang et al., 2022), and permits time-resolved analyses of cellular changes and MN chromatin. Visual cell sorting also has an advantage over other optical isolation methods, including image-enabled cell sorting (Schraivogel et al., 2022), as it can be adapted to any wide-field microscope by adding a digital micromirror to existing equipment, and can be performed on attached cells, which is critical for accurate MN identification (Li et al., 2015).

Visual cell sorting of MN and our MN segmentation modules do have limitations. Due to Dendra2 signal decay and ongoing MN rupture, only about 200,000 live cells can be analyzed and targeted per visual cell sorting experiment with VCS MN. For optical pooled screening or analysis of rare cells, this limits the number of genes or depth of analysis that can be achieved. Cell fixation would overcome this issue and efforts to improve cell sorting and sample extraction after fixation are ongoing (Kanfer et al., 2021; Yan et al., 2021). Visual cell sorting also requires the introduction of at least one photoconvertible or activatable protein to mark the cells and a second fluorescent protein to identify ruptured MN. This limits the channels available for additional phenotype identification. However, recent advances in cell structure prediction (Johnson et al., 2023) may vastly expand the phenotypic information available from limited cell labels. VCS MN and MNFinder precision vary across cell types and widely divergent nuclear morphologies from the training set could impair performance. Additional training of the neural net should improve this metric, as has been the case with other cellular structures (Caicedo et al., 2019b; von Chamier et al., 2021), but different algorithm architectures will likely be required to identify MN in signal-rich environments like organoids or tissue samples. In these cases, correct association of MN to their parent nuclei may also require using a cytoplasm or cell surface label to define ground truth MN-nucleus associations and/or developing a new module to perform this task.

Comparison of our RNASeq analysis of aneuploid RPE1 cells with previously published datasets demonstrates that visual cell sorting is highly suitable for bulk genomic analyses. We observed a large overlap between upregulated pathways in our study, previous analyses of Mps1i-treated RPE1 cells, and aneuploidy more broadly, including inflammation, endothelial-to-mesenchymal transition, and p53 (He et al., 2019; Santaguida et al., 2017). We did observe a globally lower change in gene expression levels compared to previous studies. Although the values are within the variation anticipated for different preparation and sequencing protocols, it could also reflect the exclusion of cells with highly lobulated nuclei from our analysis and suggests that nuclear atypia may be a significant contributor to aneuploidy signaling changes. Additional visual cell sorting experiments using nuclear solidity to define cell populations could be used to accurately quantify this contribution.

Our analysis of micronucleated cells and cells with ruptured MN found only a handful of genes that were uniquely upregulated after MN rupture and we were unable to validate a contribution of either condition to protein changes in the cell. One potential interpretation of these results is that MN rupture or micronucleation does cause transcriptional changes, but that they are masked by signals from co-occurring events. Although we cannot rule this out, we favor the alternative interpretation that MN formation and rupture do not lead to biologically relevant transcriptional changes for several reasons. The first is that, of potential co-occurring nuclear phenotypes we quantified, only nuclear lobulation was frequently observed in Mps1i treated cells and was only slightly enriched in micronucleated cells (Figure 1—figure supplement 1b–c). Furthermore, this frequency was significantly reduced and MN + cell enrichment was lost when nuclei were classified using VCS MN (Figure 1—figure supplement 2b, Figure 2—figure supplement 2a). Thus, nuclear atypia, chromatin bridges, and mitotic DNA damage are unlikely masking other transcriptional signals in our RNAseq results. Second, we were unable to validate protein-level increases of two increasingly upregulated genes in cells with ruptured MN by immunofluorescence, suggesting that any missed DEGs are unlikely to be biologically relevant. Third, we were able to identify small changes in aneuploidy gene upregulation in both MN +and ruptured MN cells, strongly suggesting that our analysis was sensitive to small differences in transcript abundance. In addition, we observed an increase in aneuploidy levels in cells with MN and ruptured MN that correlated with the increase in aneuploidy gene expression. Taken together, these data strongly suggest that the most likely explanation is that MN, intact or ruptured, do not induce a meaningful transcriptional signal in the first cell cycle.

A potential limitation of our current data is that the cell line used, RPE1, has reduced cGAS expression. MN rupture has been proposed to activate cGAS/STING signaling (Bakhoum et al., 2018; Harding et al., 2017; Mackenzie et al., 2017) and the strong localization of cGAS to ruptured MN led to a widespread model where this activation occurred in the first cell cycle. However, the original studies did not observe upregulation of interferon-stimulated genes until several days after MN rupture, suggesting that cGAS recruitment to MN does not activate it, and now multiple lines of evidence strongly suggest cGAS bound to ruptured MN is incapable of activating STING (Flynn et al., 2021; Harding et al., 2017; Mohr et al., 2021; Sato and Hayashi, 2024). Therefore, we expect that VCS analysis of MN-associated gene expression in a highly active cGAS/STING cell line will yield the same results as what we observed in RPE1 cells.

Visual cell sorting is a highly flexible platform that substantially expands our ability address fundamental questions in MN biology. Visual cell sorting MN isolation can be used for optical pooled screening, an unbiased method that would be ideal to identify mechanisms of MN rupture, and, in combination with dCas9-based chromosome labeling (Chen et al., 2013; Maass et al., 2018; Tanenbaum et al., 2014), to identify mechanisms that enrich specific chromosomes in MN and drive cancer-specific aneuploidies (Ben-David and Amon, 2020). Techniques to improve retrieval and sorting of formaldehyde-fixed cells would significantly facilitate these analyses by enabling collection of a larger number of cells per experiment and allowing use of slower but more accurate MN segmentation modules, such as MNFinder, to label cell populations.

A current strength of visual cell sorting with VCS MN is the ability to recover a population of live cells with either none, intact, or ruptured MN. This allows, for the first time, large-scale analyses of post-mitotic genetic and functional changes caused by micronucleation. For instance, aneuploid cells with and without MN could be analyzed for acquisition of disease-associated behaviors, including proliferation and migration, and used in in vivo tumorigenesis and metastasis assays to directly assess their contribution to cancer development. In addition, a similar method developed in the Korbel lab shows the power of this technique used to quantify the frequency and extent of genetic changes after chromosome missegregation (Cosenza et al., 2024). The ability of visual cell sorting to use nucleus masks generated by any image analysis program means that the user can employ any open-source MN segmentation module to optimize for higher purity for these experiments, or a broader recall to identify rare populations in single-cell experiments. In summary, automated MN segmentation and visual cell sorting are poised to provide critical insights into a wide range of questions about how MN form, rupture, and contribute to disease pathologies.

Materials and methods

Key resources table.

Reagent type (species) or resource Designation Source or reference Identifiers Additional information
Cell line (Homo sapiens) HEK293T (transformed, kidney) ATCC CRL-3216, RRID:CVCL_0063
Cell line (Homo sapiens) U2OS (transformed, osteosarcoma) ATCC HTB-96. RRID:CVCL_0042
Cell line (Homo sapiens) HeLa (transformed, cervical carcinoma) ATCC CRM-CCL-2, RRID:CVCL_0030
Cell line (Homo sapiens) hTERT RPE-1; RPE1 (immortalized, epithelial) ATCC CRL-4000, RRID:CVCL_4388
Cell line (Homo sapiens) hTERT HFF; HFF (immortalized, fibroblast) PMID:9817205 Cell line maintained in D. Galloway lab.
Cell line (Homo sapiens) MCC13 (transformed, Merkel cell carcinoma) ECACC 10092302, RRID:CVCL_2583 Cell line maintained in P. Nghiem lab
Cell line (Homo sapiens) HeLa H2B-GFP Millipore SCC117, RRID:CVCL_ZM02 Cell line maintained in D. Avgousti lab
Cell line (Homo sapiens) hTERT RPE-1 NLS-3xDendra2, H2B-emiRFP703; RFP703/Dendra This study RPE1 cells used for VCS MN experiments
Cell line (Homo sapiens) U2OS NLS-3xDendra2, H3B-emiRFP703 This study U2OS cells used for MNFinder training
Cell line (Homo sapiens) hTERT RPE-1 NLS-3xDendra2-P2A-H2B-miRFP703 This study RPE1 cells used for VCS MN training
Antibody Anti-gH2AX phospho (Ser139) (mouse monoclonal) Biolegend Cat# 613401, RRID:AB_315794 IF (1:500)
Antibody Anti-ATF3 (rabbit monoclonal) CST Cat# 18665, RRID:AB_2827506 IF (1:400)
Antibody Anti-EGR1 (rabbit monoclonal) CST Cat# 4154, RRID:AB_2097035 IF (1:1600)
Antibody Anti-H3K27Ac (rabbit polyclonal) Abcam Cat# ab4729, RRID:AB_2118291 IF (1:500)
Antibody AF647 goat-anti-mouse (polyclonal, secondary) Life Technologies Cat# A21236 IF (1:1000)
Antibody AF488 goat anti-rabbit (polyclonal, secondary) Life Technologies Cat# A11034 IF (1:2000)
Recombinant DNA reagent pH2B-emiRP703 Addgene Cat# 136567, RRID:Addgene_136567 Verkhusha lab
Recombinant DNA reagent pLVX-EF1a-NLS-3xDendra2-blast (plasmid) This study Lentiviral plasmid based on pLVX-puro backbone (Clontech) to express NLS-3xDendra2
Recombinant DNA reagent pLVX-EF1a-H2B-emiRFP703-neo (plasmid) This study Lentiviral plasmid based on pLVX-puro backbone (Clontech) to express H2B-emiRFP703
Recombinant DNA reagent pLenti-CMV-NLS-Dendra2x3-P2A-H2B-miRFP PMCID:PMC7273721 Lentiviral plasmid to co-express H2B-miRFP703 and NLS-Dendra2
Commercial assay or kit RNAqueous micro Thermo Fisher AM1931
Commercial assay or kit SMART-Seq v4 ultra-low input RNA Takara 634894
Commercial assay or kit Nextera XT DNA library preparation Illumina FC-131–1024
Chemical compound, drug DAPI Life Technologies D1306 (1 µg/mL)
Chemical compound, drug Vectashield Vector Labs H-1000
Chemical compound, drug RO-3306; Cdk1i Sigma-Aldrich SML0569 (10 µM)
Chemical compound, drug BAY1217389; Mps1i Fisher Scientific 501872752 Msp1 inhibitor (100 nM)
Chemical compound, drug PD-0332991; Cdk4/6i Sigma-Aldrich PZ0199 Cdk4/6 inhibitor (1 µM)
Chemical compound, drug doxorubicin hydrochloride Fisher Scientific BP25165 (2 µg/mL)
Chemical compound, drug hEGF Peprotech AF-100–15- (5 ng/mL)
Chemical compound, drug Phenol-red free DMEM/F12 GIBCO 21041025 Used during VCS imaging
Chemical compound, drug CellTrace Far Red Thermo Fisher C34572 Cell label dye used in VCS validation
Software, algorithm Metamorph (v7.10.1.161) Molecular Devices RRID:SCR_002368
Software, algorithm Leica Application Suite X Leica RRID:SCR_013673
Software, algorithm VCS MN This study Available at https://github.com/hatch-lab/fast-mn
Software, algorithm MNFinder This study Available at https://github.com/hatch-lab/mnfinder
Other Chromosome 1 XCE – orange (Homo sapiens) MetaSystems D-0801–050-OR DNA-FISH probe
Other Chromosome 11 XCE – orange (Homo sapiens) MetaSystems D-0811–050-OR DNA-FISH probe
Other Chromosome 18 XCE – orange (Homo sapiens) MetaSystems D-0818–050-OR DNA-FISH probe
Other PNA CENPB-Cy5 PNA Bio F3005 DNA-FISH probe
Other Leica DMi8 with Adaptive Focus Leica Microscope for VCS
Other Mosaic 3 Digital Micromirror Device Andor Microscope component for VCS
Other Leica DMi8 laser scanning confocal microscope Leica Confocal microscope
Other CSU spinning disk unit Yokagawa Confocal microscope component
other FACS Aria II BD Biosciences RRID:SCR_018934 Cell sorter

Plasmid construction

pLVX-EF1a-NLS-3xDendra2-blast was created by PCR of NLS-3xDendra2 from pLenti-CMV-NLS-Dendra2x3-P2A-H2B-miRFP (Hasle et al., 2020) using primer sequences 5'-caagtttgtacaaaaaagttggcaccATGG-3' and 5'-TTAGGAAAAATTCGTTGCGCCGCTCCC-3', followed by ligation into pLVX-EF1a-blast. pLVX-EF1a-H2B-emiRFP703-neo was created by PCR of H2B-emiRFP703 from pH2B-emiRP703 (a gift from Vladislav Verkhusha, AddGene #136567) with primers 5'-ATGCCAGAGCCAGCGAAG-3' and 5’-TTAGCTCTCAAGCGCGGTGATC-3', followed by ligation into pLVX-EF1a-neo. pLVX-EF1a was derived from pLVX-puro (Clontech) by replacing the CMV promoter with EF1a and replacing the selection marker with blast or neo.

Cell culture and construction of cell lines

hTERT RPE-1 cells were cultured in DMEM/F12 (Gibco) supplemented with 10% FBS (Sigma), 1% penicillin-streptomycin (Sigma), and 0.01 mg/mL hygromycin B (Sigma) at 5% CO2 and 37 °C. U2OS, hTERT-human fetal fibroblasts (HFF), and HeLa cells were cultured in DMEM (Gibco) supplemented with 10% FBS and 1% pen/strep at 10% CO2 at 37 °C. RPE1 and U2OS cells were validated by STR sequencing and all cell lines were routinely tested for mycoplasma negativity. For ATF3 validation, cells were incubated in 2 µg/mL doxorubicin hydrochloride (Fisher Sci) for 1 hr prior to fixation. For EGR1 validation, cells were incubated in 5 ng/mL hEGF (Peprotech) for 1 hr prior to fixation. For MN induction, cells were incubated 100 nM BAY1217389 (Msp1i, Fisher Sci) for the indicated times. Where noted, Cdk1i (RO-3306, Sigma) was added to 10 µM.

hTERT RPE-1 NLS-3xDendra2/H2B-emiRFP703 and U2OS NLS-3xDendra2/H2B-emiRFP703 cell lines were produced through serial transduction of lentiviruses. Lentivirus was produced in HEK293T cells using standard protocols and filtered medium was added with polybrene (Sigma, #H9268) for transduction. Cells were selected with 10 µg/mL blasticidin (Invivogen) and 500 µg/mL active G418 (Gibco) and FACS sorted on an Aria II sorter (BD Biosciences) for the top 20% brightest double positive cells. hTERT RPE-1 NLS-3xDendra2-P2A-H2B-miRFP703 cells were created through viral transduction and FACs sorting for the brightest double positive population. MCC13 cells were a gift from Dr. Paul Nghiem (University of Washington). HeLa H2B-GFP cells were a gift from Dr. Daphne Avgousti (Fred Hutchinson Cancer Center) and were originally acquired from Millipore (SCC117). hTERT-HFF cells were a gift from Dr. Denise Galloway (Fred Hutchinson Cancer Center) (Kiyono et al., 1998).

Microscopy

Visual cell sorting experiments were performed on a Leica DMi8 widefield fluorescence microscope with Adaptive Focus outfitted with an i8 incubation chamber (Leica) with temperature (PeCon: TempController 2000–1) and gas control (Oko) and a Mosaic 3 Digital Micromirror (Andor). Images were acquired with a 20x0.8 NA apochromatic objective (Leica) using an iXon Ultra 888 EMCCD camera (Andor) and MetaMorph v7.1.0.1.161 (Molecular Devices). Camera binning was set to 2 and single section images were acquired. For quality control, the first and last 5 positions were reimaged at the end of each well for in all channels and images of an unactivated well were acquired for comparison.

Fixed cell training, validation, and testing images for VCS MN and MNFinder were acquired with a Leica DMi8 laser scanning confocal microscope using the Leica Application Suite (LAS X) software and a 40 x/1.3 NA Oil APO CS objective (Leica) or 20 x/0.7 NA PLAN APO objective, or a Leica DMi8 microscope outfitted with a Yokogawa CSU spinning disk unit, Andor Borealis illumination, ASI automated Stage with Piezo Z, with an environmental chamber and Automatic Focus using a 40 x/1.3 NA Oil PLAN APO objective. Images on the spinning disk microscope were captured using an iXon Ultra 888 EMCCD camera and MetaMorph software (v7.10.4).

Micronucleus segmentation and cell classifiers

VCS MN: The neural net was created using the FastAI 1.0 library in Python, a UNet with Torchvision’s ResNet18 pre-trained model as its base architecture (Ronneberger et al., 2015). Training for MN recognition was performed using ~2000 images of individual cells as training data, a further 164 for validation, and 177 for testing. Training images were of RPE-1 NLS-3xDendra2-P2A-H2B-miRFP703 cells after incubation in 0.5 µM reversine (an Mps1 inhibitor, EMD Millipore) or DMSO for 24 hr and taken with a ×20 widefield objective on the visual cell sorting microscope. Nuclei were segmented on H2B channel images using the Deep Retina segmenter (Caicedo et al., 2019a) and 48x48 px image crops were generated centered on each nucleus. For training, MN pixels in cropped images were manually annotated by an MN expert. MN associated with chromatin bridges were not annotated to ensure that labeled MN were discrete nuclear compartments.

The VCS MN classifier takes as input a two-channel ×20 image. It applies the Deep Retina segmenter to the H2B channel to segment nuclei, discards any nuclei touching the edge of the image, and generates a 48x48 px crops centered on each nucleus. Each crop is processed with Sobel edge detection and linearly enlarged to 96x96 px. To accommodate the ResNet18 3-channel architecture, each crop is expanded to the H2B channel, a duplicate of the H2B channel, and the results of Sobel edge detection. Identified MN are mapped back to the full image and assigned to the closest segmented nucleus. MN more than 40 px away from a nucleus are discarded.

Once MN are assigned to nuclei, the classifier calculates the maximum Dendra2 MN/nucleus intensity ratio for each MN. MN with a ratio below 0.16 are classified as ruptured. This threshold was identified using the JRip classifier in Weka 3.8.6 to define the optimal threshold to separate manually annotated intact and ruptured MN (Cohen, 1995; Witten et al., 2017). Nucleus masks are classified as MN +or MN- based on the presence or absence of at least one associated MN. MN +nuclei are then further classified into those associated with only intact MN (rupture-) or those with at least one ruptured MN (rupture+).

For analysis of MN recall and PPV, MN ground truth labels were generated using PixelStudio 4.5 on an iPad (Apple) for annotation. Recall was calculated as the proportion of all MN that overlapped with a predicted segment. Positive predictive value was calculated as the proportion of all predicted segments that overlapped with a MN. Mean Intersection over Union (mIoU) was calculated per object by quantifying the overlap between groups of true positive pixels and their respective ground truths.

Prior to analysis of U2OS cells, the VCS MN segmentation module was retrained on a collection of images of RPE1, U2OS, HFF, and HeLa cells that had been incubated in 100 nM BAY1217389 or 0.5 µM reversine for 24 hr to increase MN frequency. Live images of RPE1 and U2OS NLS-3xDendra2/H2B-emiRFP703 cells were acquired on the VCS microscope at ×20. Images of fixed cells were taken on either the LSM or spinning disk confocal microscopes at ×40 after fixation in 4% paraformaldehyde (Electron Microscopy Sciences, #15710) for 5 min at room temperature. Cells were labeled with DAPI where indicated. ~2300 crops of U2OS NLS-3xDendra2/H2B-emiRFP703 cells taken on the VCS microscope at ×20 were used for training with another 233 held back for validation and 910 for testing. Three images of Hoechst labeled U2OS cells taken at ×20 on a widefield microscope at 16-bit depth were downloaded from the Broad Bioimage Benchmark Collection (BBBC039v1, Bray et al., 2016; Caicedo et al., 2019b; Ljosa et al., 2012) and linearly scaled up by 0.5 for testing. For all images in the training, validation, and testing datasets, Deep Retina segmentation was bypassed by manually annotating the nuclei and generating image crops with a custom python program. These crops and masks were then fed into VCS MN to obtain MN masks and nucleus/MN assignments.

MNFinder: The MNFinder neural nets were created using TensorFlow 2.0 without transfer learning. Training was performed on 128x128 px crops generated from widefield images of RFP703/Dendra RPE-1 and U2OS cells of H2B-RFP703 and confocal images of DAPI-labeled MCC13 and HeLa cells after Mps1i incubation (Table 3). Nuclei and MN pixels were annotated by hand by an MN expert.

For nucleus/MN segmentation, predictions are taken from two UNet-based neural nets, with MN predictions fed into a third ensembling UNet. All UNets are identical, save for the incorporation of multiscale downsampling into one of the input UNets. For “cell” segmentation (instance segmentation) a UNet architecture incorporating 3 decoder pathways is used to predict distance maps, proximity maps, and foreground pixels. The distance and proximity map decoders incorporate features from a UNet3 +design: specifically, additional skip connections from multiple layers of the encoder and decoder pathways and deep supervision during training (Huang et al., 2020). Training data were generated from annotated nuclei and MN images by generating a concave hull grouping a nucleus and associated MN using the cdBoundary package in Python (Duckham et al., 2008). This hull was transformed into a distance map by calculating the Euclidean distance transform (EDT) with each pixel value encoding the shortest distance between that pixel and the background. Proximity maps were generated by setting all pixels as foreground pixels except for those belonging to other hulls and applying an EDT, masked by the cell’s boundaries, and raising this to the 4th power to sharpen edges. Both maps are scaled from 0 to 1 for each cell.

MNFinder requires images with an image resolution of 1.55–2.8 px/µm, or nuclei around 30 px in diameter. Images taken at other resolutions need to be linearly scaled to within this range prior to analysis. Both single channel chromatin and multi-channel images can be used as input. MNFinder processing proceeds by first cropping input images using a 128x128 px sliding window, advancing the window by 96 px horizontally and vertically to oversample the image. Crops are then expanded into two-channel images, with the second channel the result of Sobel edge detection. These images are processed by the neural nets, post-processed as described, and reassembled by linear blending into a complete field. Recall and positive predictive values were calculated using the same metrics as for VCS classifier validation.

MNFinder was validated on images of RPE1, U2OS, HFF, and HeLa cells described in Table 3. PPV and recall for MN segmentation were calculated for individual input UNets and the ensemble UNet (Table 1).

Outline of visual cell sorting cell isolation experiments

Cells for visual cell sorting were plated onto six-well glass-bottom, black-walled plates (CellVis) at a density of 50,000–225,000 cells per well 1–2 days before activation. An extra unactivated well and a well of non-fluorescent RPE1 cells were plated as controls. One day before imaging, 100 nM Mps1i was added to the medium. One hour prior to imaging, cells were washed 1 x in PBS and medium changed to phenol red free (GIBCO) containing 10 µM RO-3306 (Sigma). The plate was transferred to the microscope, the plate center and micromirror device were aligned, and the appropriate journals (see Hasle et al., 2020) were initiated for visual cell sorting. Imaging conditions were optimized for each experiment. Images were acquired using MetaMorph and analyzed on a dedicated linked computer. 1-bit masks of MN +nuclei and MN- nuclei were transmitted back to MetaMorph, which directed UV pulses at the segmented nuclei. Activation occurred using either a 200ms or 800ms pulse of the 405 nm laser. Classifier predictions were compared to the first 3 and last 3 images from each VCS experiment, each manually annotated prior to downstream analysis, including RNA extraction.

Activated and unactivated cells were trypsinized (0.25% Trypsin-EDTA, GIBCO), suspended in 2% FBS in PBS, and sorted using a FACS Aria II (BD Biosciences). Compensation for PE-blue excitation of unconverted Dendra2 was performed on the unactivated cells. Dendra2 activation-based sorting gates were defined on single cells positive for both Dendra2 and emiRFP703 using the PE-Blue-A/FITC-A ratio, which was found to be more distinct than the PE-YG/FITC ratios. Cells were sorted into 2% FBS then pelleted and either flash frozen on dry ice or replated onto poly-L-lysine coated coverslips.

CellTrace

Activation and sorting accuracy were analyzed for RPE1 RFP703/Dendra2 cells by incubating cells in CellTrace far-red (Thermo Fisher) for 10 min at 37 °C, trypsinizing and pelleting cells, mixing 1:1 with unlabeled cells, and plating. A classifier segmented nuclei with the Deep Retina segmenter on Dendra (green) fluorescence and measured the mean far-red intensity in the nucleus (Hasle et al., 2020). Threshold intensity for activation was experimentally determined. Cells were sorted by FACs for Dendra2 ratio and CellTrace intensity using compensation to eliminate emiRFP703 spectral overlap and then reanalyzed on the same machine.

Mps1i+/-isolation

Cells incubated in Msp1i or DMSO were imaged and activated using a random classifier. 1-bit masks of nuclei generated using the Deep Retina segmenter were randomly assigned to receive 800ms or 200ms pulses. At least 13 k cells were collected per sorting bin and samples were pelleted, flash frozen, and stored at –80 °C.

Micronucleus+/- isolation

Two wells were imaged sequentially per experiment with the activation time for MN+ and MN- nuclei reversed between wells.

Rupture +/-isolation

Cells were plated 2 days before imaging in medium containing 1 µM Cdk4/6i (PD-0332991, Sigma). Twenty-four hours later, cells were rinsed 3 x with PBS and the medium replaced with 100 nM BAY1217389. Only one well was imaged per experiment with rupture- cells receiving 800ms and rupture + cells receiving 200ms pulses.

RNA isolation and sequencing

We extracted RNA from frozen cell pellets using the RNAqueous micro kit (Thermo Fisher), according to the manufacturer’s protocol. Residual DNA was removed by DNase I treatment and RNA was further purified by glycogen precipitation (RNA-grade glycogen; Thermo Fisher) and resuspension in ultra-pure H2O heated to 65 °C. RNA quality and concentration was checked by the Genomics Core at the Fred Hutchinson Cancer Center with an Agilent 4200 Tapestation HighSense RNA assay and only samples with RIN scores above 8 and 28 S/18 S values above 2 were further processed. cDNA synthesis and library preparations were performed by the Genomics Core using the SMARTv4 for ultra-low RNA input and Nextera XT kits (Takara). Sequencing was also performed on an Illumina NextSeq 2000 sequencing system with paired-end, 50 bp reads.

RNAseq and gene-set enrichment analysis

We quantified transcripts with Salmon to map reads against the UCSC hg38 assembly at http://refgenomes.databio.org (digest: 2230c535660fb4774114bfa966a62f823fdb6d21acf138d4), using bootstrapped abundance estimates and corrections for GC bias (Patro et al., 2017). For comparisons with data from He, et al. and Santaguida, et al., the original FASTA files deposited at the Sequence Read Archive were downloaded with NCBI’s SRA Toolkit and quantified with Salmon (He et al., 2019; Santaguida et al., 2017). No GC-bias correction was applied as only single-end reads were available.

Transcript abundances were processed to find differentially expressed genes (DEGs) with the R package DESeq2 version 3.16 in R 4.2.1, RStudio 2022.07.2 build 576, and Sublime Text build 4143. Files were imported into DESeq2 with the R package tximeta (Love et al., 2020; Love et al., 2014), estimated transcript counts were summarized to gene-level, and low-abundance genes were filtered by keeping only those genes with estimated counts ≥700 in at least two samples. DEGs were identified using a likelihood ratio test comparing the full model with one with the condition of interest dropped and an FDR of 0.05. Log-fold changes were corrected using empirical Bayes adaptive shrinkage (Stephens, 2017). Operations were performed before pseudogenes were filtered from dataset.

GSEA was performed using the R package fgsea version 1.25.1, comparing log-fold changes of all DEGs against the full Homo sapiens Hallmark Gene Sets version 2022.1, part of the MSigDB resource (UC San Diego, Broad Institute) (Crameri, 2018; Greene et al., 2017; Korotkevich et al., 2021; Liberzon et al., 2015; Subramanian et al., 2005).

Live-cell imaging for MN rupture frequency analysis

RPE1 NLS-3xDendra2/H2B-emiRFP703 cells were plated 2 days before imaging and treated for 24 hr with either 1 µM Cdk4/6i or DMSO. One day before imaging, cells were rinsed and incubated in 100 nM BAY1217389. Nineteen hours later, the media was exchanged for Cdk1i medium, five positions were imaged in each well and rupture- cells were activated. These positions and the surrounding area were imaged every hour for 11 hr and the status of photoconverted cells manually recorded.

Immunofluorescence (IF)

Cells plated on poly-L-lysine coated coverslips or glass bottomed plates were fixed for IF in 4% paraformaldehyde for 5 min unless otherwise indicated. Cells were permeabilized for 30 min at RT in PBSBT (1xPBS (GIBCO), 3% BSA, 0.4% Triton X-100, 0.02% sodium azide (all Sigma)), followed by incubation in primary antibodies diluted in PBSBT for 30 min, secondary antibodies diluted in same for 30 min, and 5 min in 1 µg/mL DAPI (Invitrogen). Coverslips were mounted in VectaShield (VectorLabs) and sealed with nail polish before imaging. Primary antibodies used were: mouse-a-γH2AX-phospho Ser139 (1:500; BioLegend, 613401), rabbit-a-ATF3 (1:400; Cell Signaling Technology, 18665), rabbit-a-EGR1 (1:1600; Cell Signaling Technology, 4154), and rabbit α H3K27Ac (1:500; Abcam, ab4729). Secondary antibodies used were: AF647 goat-a-mouse (1:1000; Life Technologies, A21236) and AF488 goat-a-rabbit (1:2000; Life Technologies, A11034).

DNA FISH

RPE1 cells plated onto poly-L lysine coverslips were fixed in –20 °C 100% methanol for 10 min, rehydrated for 10 min in 1xPBS and processed for IF. Cells were then refixed in 4% PFA for 5 min at RT then incubated in 2xSSC (Sigma) for 2x5 min RT. Cells were permeabilized in 0.2 M HCl (Sigma), 0.7% TritonX-100 in H2O for 15 min at RT, washed in 2xSSC, and incubated for 1 hr at RT in 50% formamide (Millipore). Cells were rewashed in 2xSSC, inverted onto chr 1, 11, or 18 XCE probes (MetaSystems), and the coverslips sealed with rubber cement. Probes were hybridized at 74 °C for 3 min and then incubated for 4 hr (chr 18) or overnight (chr.s 1 and 11) at 37 °C. After hybridization, coverslips were washed in 0.4xSSC at 74 °C for 5 min, then 2xSCC 0.1% Tween20 (Fisher) for 2x5 min at RT. DNA was labeled by incubation in 1 µg/mL DAPI for 5 min at RT, and coverslips mounted in VectaShield. Images were acquired as 0.45 µm step z-stacks through the cell on the confocal LSM with a ×40 objective. Cells that had more or less than two FISH foci were classified as aneuploid for that chromosome.

For MN chromosome enumeration, Mps1i treated RPE1 cells were labeled with a PNA CENPB-Cy5 probe (PNA Bio). The same DNA FISH protocol was followed until formamide denaturation. At that point cells were incubated in 50% formamide in 2 x SSC for 30 min at 85 °C then rinsed three times in ice cold 2xSSC. PNA probes were diluted to 50 μM in 85 °C hybridization buffer (60% formamide +20 mM Tris, pH 7.4+0.1 μg/mL salmon sperm DNA (Trevigen)) and coverslips were simultaneously washed at 85 °C in 2xSSC. Coverslips were then incubated in 10 μL of the PNA probe for 10 min at 85 °C and then 2 hr at RT. Coverslips were then washed twice with 2 x SSC +0.1% Tween-20 at 55 °C for 10 min and once with 2 x SSC +0.1% Tween-20 at RT. before incubation in DAPI and mounting in Prolong Gold (Life Technologies).

Image analysis

Mps1i nuclear morphology

Analysis was performed on images plus ground truth annotations and Deep Retina masks acquired as part of the DMSO versus Mps1i visual cell sorting RNAseq experiment. Solidity was quantified on ground truth and Deep Retina nuclear masks and chromatin bridges, misshapen nuclei, and dead or mitotic nuclei were manually scored. Nuclear atypia was defined as a cell with multiple large nuclei (rare) or a nuclear shape that multiple significant indentations. Nuclei that curved around a single focus were scored as normal.

Cell confluency

A subset of images of RFP703/Dendra cells from the MN +/-RNASeq experiments were thresholded on the 3xDendra-NLS signal at a value that included the cytoplasmic signal and the proportion of thresholded pixels was calculated per image using ImageJ. Distances between nuclei borders were calculated using ground truth masks of the same images using tools packaged in the 3D ImageJ Suite (Ollion et al., 2013).

Dendra2 ratio stability

Nuclei were segmented on images taken at the start and end of an Mps1i+/-VCS experiment by thresholding on the GFP channel, measuring the mean intensity of GFP and RFP, and calculating the RFP:GFP ratio per nucleus for each image group.

MN +/-sorting accuracy

Cells replated and fixed after sorting were imaged on the LSM confocal at 40 x with 0.45 µm z-stacks through the cell. Image names were randomized prior to quantification of MN + cells.

ATF3 and EGR1 intensity

Images were acquired as 0.45 µm step z-stacks through the cell on the confocal LSM with a ×40 objective. Images were corrected for illumination inhomogeneity by dividing by a dark image and background subtracted using a 60 px radius rolling-ball in FIJI (Schindelin et al., 2012) (v2.9.0). Single in focus sections of each nucleus was selected and nuclei masks generated by thresholding on RFP-NLS. Mean intensity of ATF3 or EGR1 were calculated for each nucleus and normalized for each replicate by scaling to the median value for the DMSO control. Statistics were calculated on the raw values.

Statistical analyses

Shorthand p-values are as follows:

  • ns: p-value ≥ 0.05

  • *: p-value <0.05

  • **: p-value <0.01

  • ***: p-value <0.001

  • ****: p-value <0.0001

Generalized estimating equations (GEE) were used to determine statistical differences for nominal data with multiple variables using binomial distributions and a logit link function (Halekoh et al., 2006). For Figure 1c, data were assessed using the formula: (# recalled, # missed)~MN status where MN status is whether ruptured or intact. For Figure 6c, we also used a binomial distribution and a logit link function. For Figure 5c, Figure 6—figure supplement 1c, we used the formula: (# aneuploid, # normal)~Status × Chr where Status is whether the cell was MN +/-or Rupture +/-and Chr is chromosome identity. p-values for each individual property were calculated using the drop1 function in R. In Figure 6c, Figure 6—figure supplement 1a-b, we used a gamma distribution and the formula mean intensity ~Population. Statistical significance for differences between single nominal variables in other figures were by Barnard’s exact test. KS test=Kolmogorov-Smirnov test to compare distributions of non-Gaussian data.

The predicted change to classifier PPV in Figure 5—figure supplement 1c was determined by reducing the true positive rate in the rupture- population by the difference in mean rupture frequencies between the beginning and end of the experiments and increasing the true positive rate in the rupture +population by the same.

Acknowledgements

This work was supported by a National Institutes of Health grant (R35GM124766, awarded to EM Hatch), a National Human Genome Research Institute grant (RM1HG010461, awarded to DM Fowler), a training grant (T32CA009657, awarded to L DiPeso), the Rita Allen Foundation Scholars program (awarded to EM Hatch), and the Fred Hutchinson Cancer Center Bioinformatics and Genomics cores (funded by NIH grant P30CA015704).

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Emily M Hatch, Email: ehatch@fredhutch.org.

Silke Hauf, Virginia Tech, United States.

Adèle L Marston, University of Edinburgh, United Kingdom.

Funding Information

This paper was supported by the following grants:

  • National Institute of General Medical Sciences R35GM124766 to Lucian DiPeso, Heather Z Huang, Emily M Hatch.

  • National Human Genome Research Institute RM1HG010461 to Sriram Pendyala, Douglas M Fowler.

  • National Cancer Institute T32CA009657 to Lucian DiPeso.

  • Rita Allen Foundation Scholars Program to Emily M Hatch.

  • National Cancer Institute P30CA015704 to Lucian DiPeso, Heather Z Huang, Emily M Hatch.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing - original draft.

Software, Formal analysis, Methodology.

Formal analysis, Methodology.

Conceptualization, Supervision, Funding acquisition, Project administration.

Conceptualization, Supervision, Funding acquisition, Project administration, Writing – review and editing.

Additional files

Supplementary file 1. Additional tables containing transcriptomics data.

(a) DMSO 800 vs 200 DEG. List of differentially expressed genes (FDR ≤0.05) in DMSO treated RPE1 cells isolated after an 800ms or 200ms UV pulse. For all DEG analyses: padj = false-discovery rate adjusted p-values. (b) Table 2: Msp1i DEG. List of differentially expressed genes (FDR ≤0.05) in Mps1i versus DMSO RPE1 cells. (c) Msp1i DEG log2FC. List of differentially expressed genes in Mps1i versus DMSO RPE1 cells filtered for log2 fold change above 0.58 or below –0.58. (d) He et al DEG. List of differentially expressed genes (FDR ≤0.05) in RPE1 cells treated with nocodazole for 8 h versus control, initially reported in He et al., 2018. Overlap with Mps1i DEG list noted in last column. (e) Santaguida et al DEG. List of differentially expressed genes (FDR ≤0.05) in RPE1 cells that were treated with the Mps1i molecule reversine for 12 h, released, and ceased to divide, compared to control cells. Results initially reported in Santaguida et al., 2017. Overlap with Mps1i DEG list noted in last column. (f) Hallmark Mps1i. List of MSigDB Hallmark gene sets that are significantly enriched (FDR ≤0.05) in Mps1i treated RPE1 cells. For all Hallmark lists, NES=normalized enrichment score, padj = false discovery rate adjusted p-values, size = number of genes in gene set, leading_edge = , sig_genes_in_geneset=the gene names of set genes that were significantly upregulated. (g) Hallmark He et al. List of MSigDB Hallmark gene sets that are significantly enriched in nocodazole treated RPE1 cells from He et al., 2018. (h) Hallmark Santaguida et al. List of MSigDB Hallmark gene sets that are significantly enriched in reversine treated RPE1 cells from Santaguida et al., 2017. (i) MN +DEG. List of differentially expressed genes (FDR ≤0.05) in Mps1i treated RPE1 cells with MN versus without MN. (j) MN +DEG log2FC. List of differentially expressed genes in Mps1i treated RPE1 cells with MN versus without MN filtered for log2 fold change above 0.58 or below –0.58. Overlap with Mps1i DEG list noted in last column. (k) Ruptured +DEG. List of differentially expressed genes (FDR ≤0.05) in Mps1i treated micronucleated RPE1 cells with only intact MN versus at least 1 ruptured MN. (l) Ruptured DEG +log2 FC. List of differentially expressed genes in Mps1i treated micronucleated RPE1 cells with only intact MN versus at least 1 ruptured MN MN filtered for absolute log2 fold change ≥0.58. Overlap with Mps1i DEG list noted in last column. (m) Hallmark MN+. List of MSigDB Hallmark gene sets that are significantly enriched in Mps1i treated RPE1 cells with MN. (n) Hallmark ruptured+. List of MSigDB Hallmark gene sets that are significantly enriched in Mps1i treated RPE1 cells with MN and at least 1 ruptured MN versus no ruptured MN. (o) Log2FC per replicate. List of log2 fold change values for Mps1i DEGs with absolute log2FC ≥0.58 broken out by MN +and rupture +replicate. Genes with all NA values for either all MN +or all rupture +replicates were excluded from analysis. enriched_in_rupture+_cluster=genes present in cluster enriched in increased expression over Mps1i DEGs.

elife-101579-supp1.xlsx (426.9KB, xlsx)
MDAR checklist

Data availability

All cell lines and plasmids made for this study are available upon request. Both the MNFinder and VCS MN modules and supporting code are available on the Hatch Lab GitHub repository, along with all training, testing, and validation images and their ground truth labels (https://github.com/hatch-lab/mnfinder and https://github.com/hatch-lab/fast-mn). Additional validation images and data are available on BioImage Archive. Raw and processed RNA-seq datasets were deposited in the National Center for Biotechnology Information (NCBI) Gene Expression Omnibus (GEO), accession number GSE291811.

The following datasets were generated:

DiPeso L, Pendyala S, Huang HZ, Fowler DM, Hatch EM. 2025. Image-based identification and isolation of micronucleated cells to dissect cellular consequences. NCBI Gene Expression Omnibus. GSE291811

Hatch E. 2025. Image-based identification and isolation of micronucleated cells to dissect cellular consequences. BioImage Archive. S-BIAD1990

The following previously published datasets were used:

Santaguida S, Richardson A, Iyer D, M'Saad O, Zasadil L, Knouse K, Wong Y, Rhind N, Desai A, Amon A. 2017. Aneuploidy triggers an immune response. NCBI Gene Expression Omnibus. GSE83647

He Q, Au B, Kulkarni M, Shen Y, KahJ L, Maimaiti J, Chong HC, Lim EH, Rancati G, Sinha I, Fu Z, Wang X, Crasta KC, ChengKit W, MoniqueNH L, JohnE C. 2019. Genome-wide transcriptional response to random aneuploidy in human cells. NCBI Gene Expression Omnibus. GSE109519

References

  1. Agustinus AS, Al-Rawi D, Dameracharla B, Raviram R, Jones BSCL, Stransky S, Scipioni L, Luebeck J, Di Bona M, Norkunaite D, Myers RM, Duran M, Choi S, Weigelt B, Yomtoubian S, McPherson A, Toufektchan E, Keuper K, Mischel PS, Mittal V, Shah SP, Maciejowski J, Storchova Z, Gratton E, Ly P, Landau D, Bakhoum MF, Koche RP, Sidoli S, Bafna V, David Y, Bakhoum SF. Epigenetic dysregulation from chromosomal transit in micronuclei. Nature. 2023;619:176–183. doi: 10.1038/s41586-023-06084-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Andrade JR, Gallagher AD, Maharaj J, McClelland SE. Disentangling the roles of aneuploidy, chromosomal instability and tumour heterogeneity in developing resistance to cancer therapies. Chromosome Research. 2023;31:28. doi: 10.1007/s10577-023-09737-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bakhoum SF, Ngo B, Laughney AM, Cavallo JA, Murphy CJ, Ly P, Shah P, Sriram RK, Watkins TBK, Taunk NK, Duran M, Pauli C, Shaw C, Chadalavada K, Rajasekhar VK, Genovese G, Venkatesan S, Birkbak NJ, McGranahan N, Lundquist M, LaPlant Q, Healey JH, Elemento O, Chung CH, Lee NY, Imielenski M, Nanjangud G, Pe’er D, Cleveland DW, Powell SN, Lammerding J, Swanton C, Cantley LC. Chromosomal instability drives metastasis through a cytosolic DNA response. Nature. 2018;553:467–472. doi: 10.1038/nature25432. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Ben-David U, Amon A. Context is everything: aneuploidy in cancer. Nature Reviews Genetics. 2020;21:44–62. doi: 10.1038/s41576-019-0171-x. [DOI] [PubMed] [Google Scholar]
  5. Bona MD, Bakhoum SF. Micronuclei and cancer. Cancer Discovery. 2024;14:214–226. doi: 10.1158/2159-8290.cd-23-1073. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bray MA, Singh S, Han H, Davis CT, Borgeson B, Hartland C, Kost-Alimova M, Gustafsdottir SM, Gibson CC, Carpenter AE. Cell Painting, a high-content image-based assay for morphological profiling using multiplexed fluorescent dyes. Nature Protocols. 2016;11:1757–1774. doi: 10.1038/nprot.2016.105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Caicedo JC, Goodman A, Karhohs KW, Cimini BA, Ackerman J, Haghighi M, Heng C, Becker T, Doan M, McQuin C, Rohban M, Singh S, Carpenter AE. Nucleus segmentation across imaging experiments: the 2018 Data Science Bowl. Nature Methods. 2019a;16:1247–1253. doi: 10.1038/s41592-019-0612-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Caicedo JC, Roth J, Goodman A, Becker T, Karhohs KW, Broisin M, Molnar C, McQuin C, Singh S, Theis FJ, Carpenter AE. Evaluation of deep learning strategies for nucleus segmentation in fluorescence images. Cytometry. Part A. 2019b;95:952–965. doi: 10.1002/cyto.a.23863. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Chen B, Gilbert LA, Cimini BA, Schnitzbauer J, Zhang W, Li GW, Park J, Blackburn EH, Weissman JS, Qi LS, Huang B. Dynamic imaging of genomic loci in living human cells by an optimized CRISPR/Cas system. Cell. 2013;155:1479–1491. doi: 10.1016/j.cell.2013.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Chen J, Harding SM, Natesan R, Tian L, Benci JL, Li W, Minn AJ, Asangani IA, Greenberg RA. Cell cycle checkpoints cooperate to suppress dna- and rna-associated molecular pattern recognition and anti-tumor immune responses. Cell Reports. 2020;32:108080. doi: 10.1016/j.celrep.2020.108080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Cohen WW. In: Machine Learning Proceedings 1995. Prieditis A, Russell S, editors. Morgan Kaufmann; 1995. Fast effective rule induction in; pp. 115–123. [DOI] [Google Scholar]
  12. Cosenza MR, Gaiatto A, Erarslan Uysal B, Andrades Delgado Á, Sautter NL, Jendrusch MA, Zumalave Duro S, Rausch T, Halavatyi A, Geissen E-M, Hasenfeld P, Cortes-Ciriano I, Kulozik A, Pepperkok R, Korbel JO. Origins of de Novo Chromosome Rearrangements Unveiled by Coupled Imaging and Genomics. bioRxiv. 2024 doi: 10.1101/2024.08.15.607890. [DOI]
  13. Crameri F. Geodynamic diagnostics, scientific visualisation and StagLab 3.0. Geoscientific Model Development. 2018;11:2541–2562. doi: 10.5194/gmd-11-2541-2018. [DOI] [Google Scholar]
  14. Dang D, Efstathiou C, Sun D, Yue H, Sastry NR, Draviam VM. Deep learning techniques and mathematical modeling allow 3D analysis of mitotic spindle dynamics. The Journal of Cell Biology. 2023;222:e202111094. doi: 10.1083/jcb.202111094. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Dou Z, Ghosh K, Vizioli MG, Zhu J, Sen P, Wangensteen KJ, Simithy J, Lan Y, Lin Y, Zhou Z, Capell BC, Xu C, Xu M, Kieckhaefer JE, Jiang T, Shoshkes-Carmel M, Tanim K, Barber GN, Seykora JT, Millar SE, Kaestner KH, Garcia BA, Adams PD, Berger SL. Cytoplasmic chromatin triggers inflammation in senescence and cancer. Nature. 2017;550:402–406. doi: 10.1038/nature24050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Duckham M, Kulik L, Worboys M, Galton A. Efficient generation of simple polygons for characterizing the shape of a set of points in the plane. Pattern Recognition. 2008;41:3224–3236. doi: 10.1016/j.patcog.2008.03.023. [DOI] [Google Scholar]
  17. Fischer CA, Besora-Casals L, Rolland SG, Haeussler S, Singh K, Duchen M, Conradt B, Marr C. MitoSegNet: easy-to-use deep learning segmentation for analyzing mitochondrial morphology. iScience. 2020;23:101601. doi: 10.1016/j.isci.2020.101601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Flynn PJ, Koch PD, Mitchison TJ. Chromatin bridges, not micronuclei, activate cGAS after drug-induced mitotic errors in human cells. PNAS. 2021;118:e2103585118. doi: 10.1073/pnas.2103585118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Garribba L, De Feudis G, Martis V, Galli M, Dumont M, Eliezer Y, Wardenaar R, Ippolito MR, Iyer DR, Tijhuis AE, Spierings DCJ, Schubert M, Taglietti S, Soriani C, Gemble S, Basto R, Rhind N, Foijer F, Ben-David U, Fachinetti D, Doksani Y, Santaguida S. Short-term molecular consequences of chromosome mis-segregation for genome stability. Nature Communications. 2023;14:1353. doi: 10.1038/s41467-023-37095-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Greene D, Richardson S, Turro E. ontologyX: a suite of R packages for working with ontological data. Bioinformatics. 2017;33:1104–1106. doi: 10.1093/bioinformatics/btw763. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Guo X, Ni J, Liang Z, Xue J, Fenech MF, Wang X. The molecular origins and pathophysiological consequences of micronuclei: New insights into an age-old problem. Mutation Research. 2019;779:1–35. doi: 10.1016/j.mrrev.2018.11.001. [DOI] [PubMed] [Google Scholar]
  22. Halekoh U, Højsgaard S, Yan J. The R package geepack for generalized estimating equations. Journal of Statistical Software. 2006;15:i02. doi: 10.18637/jss.v015.i02. [DOI] [Google Scholar]
  23. Han L, Choudhury S, Mich-Basso JD, Ammanamanchi N, Ganapathy B, Suresh S, Khaladkar M, Singh J, Maehr R, Zuppo DA, Kim J, Eberwine JH, Wyman SK, Wu YL, Kühn B. Lamin B2 levels regulate polyploidization of cardiomyocyte nuclei and myocardial regeneration. Developmental Cell. 2020;53:42–59. doi: 10.1016/j.devcel.2020.01.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Harding SM, Benci JL, Irianto J, Discher DE, Minn AJ, Greenberg RA. Mitotic progression following DNA damage enables pattern recognition within micronuclei. Nature. 2017;548:466–470. doi: 10.1038/nature23470. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Hasle N, Cooke A, Srivatsan S, Huang H, Stephany JJ, Krieger Z, Jackson D, Tang W, Pendyala S, Monnat RJ, Trapnell C, Hatch EM, Fowler DM. High-throughput, microscope-based sorting to dissect cellular heterogeneity. Molecular Systems Biology. 2020;16:e9442. doi: 10.15252/msb.20209442. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Hatch EM, Fischer AH, Deerinck TJ, Hetzer MW. Catastrophic nuclear envelope collapse in cancer cell micronuclei. Cell. 2013;154:47–60. doi: 10.1016/j.cell.2013.06.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. He Q, Au B, Kulkarni M, Shen Y, KahJ L, Maimaiti J, ChengKit W, MoniqueNH L, Chong HC, Lim EH, Rancati G, Sinha I, Fu Z, Wang X, JohnE C, Crasta KC. Chromosomal instability-induced senescence potentiates cell non-autonomous tumourigenic effects. Oncogenesis. 2018;7:62. doi: 10.1038/s41389-018-0072-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. He B, Gnawali N, Hinman AW, Mattingly AJ, Osimani A, Cimini D. Chromosomes missegregated into micronuclei contribute to chromosomal instability by missegregating at the next division. Oncotarget. 2019;10:2660–2674. doi: 10.18632/oncotarget.26853. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Huang H, Lin L, Tong R, Hu H, Zhang Q, Iwamoto Y, Han X, Chen YW, Wu J. UNet 3+: A Full-Scale Connected UNet for Medical Image Segmentation. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); Barcelona, Spain. 2020. pp. 1055–1059. [DOI] [Google Scholar]
  30. Ibarra-Arellano MA, Caprio LA, Hada A, Stotzem N, Cai L, Shah S, Melms JC, Wünneman F, Izar B, Schapiro D. micronuclAI: Automated Quantification of Micronuclei for Assessment of Chromosomal Instability. bioRxiv. 2024 doi: 10.1101/2024.05.24.595722. [DOI] [PMC free article] [PubMed]
  31. Johnson GT, Agmon E, Akamatsu M, Lundberg E, Lyons B, Ouyang W, Quintero-Carmona OA, Riel-Mehan M, Rafelski S, Horwitz R. Building the next generation of virtual cells to understand cellular biology. Biophysical Journal. 2023;122:3560–3569. doi: 10.1016/j.bpj.2023.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Joo YK, Black EM, Trier I, Haakma W, Zou L, Kabeche L. ATR promotes clearance of damaged DNA and damaged cells by rupturing micronuclei. Mol Cell. 2023;83:3642–3658. doi: 10.1016/j.molcel.2023.09.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Kanfer G, Sarraf SA, Maman Y, Baldwin H, Dominguez-Martin E, Johnson KR, Ward ME, Kampmann M, Lippincott-Schwartz J, Youle RJ. Image-based pooled whole-genome CRISPRi screening for subcellular phenotypes. Journal of Cell Biology. 2021;220:e202006180. doi: 10.1083/jcb.202006180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Kiyono T, Foster SA, Koop JI, McDougall JK, Galloway DA, Klingelhutz AJ. Both Rb/p16INK4a inactivation and telomerase activity are required to immortalize human epithelial cells. Nature. 1998;396:84–88. doi: 10.1038/23962. [DOI] [PubMed] [Google Scholar]
  35. Klaasen SJ, Truong MA, van Jaarsveld RH, Koprivec I, Štimac V, de Vries SG, Risteski P, Kodba S, Vukušić K, de Luca KL, Marques JF, Gerrits EM, Bakker B, Foijer F, Kind J, Tolić IM, Lens SMA, Kops GJPL. Nuclear chromosome locations dictate segregation error frequencies. Nature. 2022;607:604–609. doi: 10.1038/s41586-022-04938-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Kneissig M, Keuper K, de Pagter MS, van Roosmalen MJ, Martin J, Otto H, Passerini V, Campos Sparr A, Renkens I, Kropveld F, Vasudevan A, Sheltzer JM, Kloosterman WP, Storchova Z. Micronuclei-based model system reveals functional consequences of chromothripsis in human cells. eLife. 2019;8:e50292. doi: 10.7554/eLife.50292. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Korotkevich G, Sukhov V, Budin N, Shpak B, Artyomov MN, Sergushichev A. Fast Gene Set Enrichment Analysis. bioRxiv. 2021 doi: 10.1101/060012. [DOI]
  38. Krupina K, Goginashvili A, Cleveland DW. Causes and consequences of micronuclei. Current Opinion in Cell Biology. 2021;70:91–99. doi: 10.1016/j.ceb.2021.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Kuga T, Nie H, Kazami T, Satoh M, Matsushita K, Nomura F, Maeshima K, Nakayama Y, Tomonaga T. Lamin B2 prevents chromosome instability by ensuring proper mitotic chromosome segregation. Oncogenesis. 2014;3:e94. doi: 10.1038/oncsis.2014.6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Li Y, Lovett D, Zhang Q, Neelam S, Kuchibhotla RA, Zhu R, Gundersen GG, Lele TP, Dickinson RB. Moving cell boundaries drive nuclear shaping during cell spreading. Biophysical Journal. 2015;109:670–686. doi: 10.1016/j.bpj.2015.07.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Liberzon A, Birger C, Thorvaldsdóttir H, Ghandi M, Mesirov JP, Tamayo P. The molecular signatures database (MSigDB) hallmark gene set collection. Cell Systems. 2015;1:417–425. doi: 10.1016/j.cels.2015.12.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Lin T-Y, Goyal P, Girshick R, He K, Dollar P. Focal loss for dense object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2018;42:318–327. doi: 10.1109/TPAMI.2018.2858826. [DOI] [PubMed] [Google Scholar]
  43. Lin YF, Hu Q, Mazzagatti A, Valle-Inclán JE, Maurais EG, Dahiya R, Guyer A, Sanders JT, Engel JL, Nguyen G, Bronder D, Bakhoum SF, Cortés-Ciriano I, Ly P. Mitotic clustering of pulverized chromosomes from micronuclei. Nature. 2023;618:1041–1048. doi: 10.1038/s41586-023-05974-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Ljosa V, Sokolnicki KL, Carpenter AE. Annotated high-throughput microscopy image sets for validation. Nature Methods. 2012;9:637. doi: 10.1038/nmeth.2083. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Love MI, Soneson C, Hickey PF, Johnson LK, Pierce NT, Shepherd L, Morgan M, Patro R. Tximeta: Reference sequence checksums for provenance identification in RNA-seq. PLOS Computational Biology. 2020;16:e1007664. doi: 10.1371/journal.pcbi.1007664. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Lu H, She Y, Tie J, Xu S. Half-UNet: A simplified u-net architecture for medical image segmentation. Frontiers in Neuroinformatics. 2022;16:911679. doi: 10.3389/fninf.2022.911679. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Ly P, Teitz LS, Kim DH, Shoshani O, Skaletsky H, Fachinetti D, Page DC, Cleveland DW. Selective Y centromere inactivation triggers chromosome shattering in micronuclei and repair by non-homologous end joining. Nature Cell Biology. 2017;19:68–75. doi: 10.1038/ncb3450. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Ly P, Brunner SF, Shoshani O, Kim DH, Lan W, Pyntikova T, Flanagan AM, Behjati S, Page DC, Campbell PJ, Cleveland DW. Chromosome segregation errors generate a diverse spectrum of simple and complex genomic rearrangements. Nature Genetics. 2019;51:705–715. doi: 10.1038/s41588-019-0360-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Maass PG, Barutcu AR, Shechner DM, Weiner CL, Melé M, Rinn JL. Spatiotemporal allele organization by allele-specific CRISPR live-cell imaging (SNP-CLING) Nature Structural & Molecular Biology. 2018;25:176–184. doi: 10.1038/s41594-017-0015-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Mackenzie KJ, Carroll P, Martin CA, Murina O, Fluteau A, Simpson DJ, Olova N, Sutcliffe H, Rainger JK, Leitch A, Osborn RT, Wheeler AP, Nowotny M, Gilbert N, Chandra T, Reijns MAM, Jackson AP. cGAS surveillance of micronuclei links genome instability to innate immunity. Nature. 2017;548:461–465. doi: 10.1038/nature23449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Mahbod A, Schaefer G, Dorffner G, Hatamikia S, Ecker R, Ellinger I. A dual decoder U-Net-based model for nuclei instance segmentation in hematoxylin and eosin-stained histological images. Frontiers in Medicine. 2022;9:978146. doi: 10.3389/fmed.2022.978146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Mammel AE, Huang HZ, Gunn AL, Choo E, Hatch EM. Chromosome length and gene density contribute to micronuclear membrane stability. Life Sci Alliance. 2021;5:e202101210. doi: 10.26508/lsa.202101210. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Matlashov ME, Shcherbakova DM, Alvelid J, Baloban M, Pennacchietti F, Shemetov AA, Testa I, Verkhusha VV. A set of monomeric near-infrared fluorescent proteins for multicolor imaging across scales. Nature Communications. 2020;11:239. doi: 10.1038/s41467-019-13897-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Mohr L, Toufektchan E, von Morgen P, Chu K, Kapoor A, Maciejowski J. ER-directed TREX1 limits cGAS activation at micronuclei. Molecular Cell. 2021;81:724–738. doi: 10.1016/j.molcel.2020.12.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Oktay O, Schlemper J, Folgoc LL, Lee M, Heinrich M, Misawa K, Mori K, McDonagh S, Hammerla NY, Kainz B, Glocker B, Rueckert D. Attention u-net: learning where to look for the pancreas. arXiv. 2018 https://arxiv.org/abs/1804.03999
  57. Ollion J, Cochennec J, Loll F, Escudé C, Boudier T. TANGO: a generic tool for high-throughput 3D image analysis for studying nuclear organization. Bioinformatics. 2013;29:1840–1841. doi: 10.1093/bioinformatics/btt276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Papathanasiou S, Mynhier NA, Liu S, Brunette G, Stokasimov E, Jacob E, Li L, Comenho C, van Steensel B, Buenrostro JD, Zhang C-Z, Pellman D. Heritable transcriptional defects from aberrations of nuclear architecture. Nature. 2023;619:184–192. doi: 10.1038/s41586-023-06157-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Patro R, Duggal G, Love MI, Irizarry RA, Kingsford C. Salmon provides fast and bias-aware quantification of transcript expression. Nature Methods. 2017;14:417–419. doi: 10.1038/nmeth.4197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Pons C, Mauvezin C. QATS: an ImageJ plugin for the quantification of toroidal nuclei in biological images. Bioinformatics. 2024;40:btae026. doi: 10.1093/bioinformatics/btae026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Pujadas Liwag EM, Wei X, Acosta N, Carter LM, Yang J, Almassalha LM, Jain S, Daneshkhah A, Rao SSP, Seker-Polat F, MacQuarrie KL, Ibarra J, Agrawal V, Aiden EL, Kanemaki MT, Backman V, Adli M. Depletion of lamins B1 and B2 promotes chromatin mobility and induces differential gene expression by a mesoscale-motion-dependent mechanism. Genome Biology. 2024;25:77. doi: 10.1186/s13059-024-03212-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Ronneberger O, Fischer P, Brox T. Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015. MICCAI 2015, 18th International Conference; 2015. [DOI] [Google Scholar]
  63. Santaguida S, Richardson A, Iyer DR, M’Saad O, Zasadil L, Knouse KA, Wong YL, Rhind N, Desai A, Amon A. Chromosome mis-segregation generates cell-cycle-arrested cells with complex karyotypes that are eliminated by the immune system. Developmental Cell. 2017;41:638–651. doi: 10.1016/j.devcel.2017.05.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Sato Y, Hayashi MT. Micronucleus is not a potent inducer of the cGAS/STING pathway. Life Science Alliance. 2024;7:e202302424. doi: 10.26508/lsa.202302424. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez JY, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A. Fiji: an open-source platform for biological-image analysis. Nature Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Schraivogel D, Kuhn TM, Rauscher B, Rodríguez-Martínez M, Paulsen M, Owsley K, Middlebrook A, Tischer C, Ramasz B, Ordoñez-Rueda D, Dees M, Cuylen-Haering S, Diebold E, Steinmetz LM. High-speed fluorescence image-enabled cell sorting. Science. 2022;375:315–320. doi: 10.1126/science.abj3013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Shoshani O, Brunner SF, Yaeger R, Ly P, Nechemia-Arbely Y, Kim DH, Fang R, Castillon GA, Yu M, Li JSZ, Sun Y, Ellisman MH, Ren B, Campbell PJ, Cleveland DW. Chromothripsis drives the evolution of gene amplification in cancer. Nature. 2021;591:137–141. doi: 10.1038/s41586-020-03064-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Soto M, García-Santisteban I, Krenning L, Medema RH, Raaijmakers JA. Chromosomes trapped in micronuclei are liable to segregation errors. Journal of Cell Science. 2018;131:jcs214742. doi: 10.1242/jcs.214742. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Stephens M. False discovery rates: a new deal. Biostatistics. 2017;18:275–294. doi: 10.1093/biostatistics/kxw041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Su R, Zhang D, Liu J, Cheng C. MSU-Net: multi-scale u-net for 2d medical image segmentation. Frontiers in Genetics. 2021;12:639930. doi: 10.3389/fgene.2021.639930. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. PNAS. 2005;102:15545–15550. doi: 10.1073/pnas.0506580102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Tanenbaum ME, Gilbert LA, Qi LS, Weissman JS, Vale RD. A protein-tagging system for signal amplification in gene expression and fluorescence imaging. Cell. 2014;159:635–646. doi: 10.1016/j.cell.2014.09.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Tang S, Stokasimov E, Cui Y, Pellman D. Breakage of cytoplasmic chromosomes by pathological DNA base excision repair. Nature. 2022;606:930–936. doi: 10.1038/s41586-022-04767-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Trivedi P, Steele CD, Au FKC, Alexandrov LB, Cleveland DW. Mitotic tethering enables inheritance of shattered micronuclear chromosomes. Nature. 2023;618:1049–1056. doi: 10.1038/s41586-023-06216-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Vietri M, Schultz SW, Bellanger A, Jones CM, Petersen LI, Raiborg C, Skarpen E, Pedurupillay CRJ, Kjos I, Kip E, Timmer R, Jain A, Collas P, Knorr RL, Grellscheid SN, Kusumaatmaja H, Brech A, Micci F, Stenmark H, Campsteijn C. Unrestrained ESCRT-III drives micronuclear catastrophe and chromosome fragmentation. Nature Cell Biology. 2020;22:856–867. doi: 10.1038/s41556-020-0537-5. [DOI] [PubMed] [Google Scholar]
  76. von Chamier L, Laine RF, Jukkala J, Spahn C, Krentzel D, Nehme E, Lerche M, Hernández-Pérez S, Mattila PK, Karinou E, Holden S, Solak AC, Krull A, Buchholz T-O, Jones ML, Royer LA, Leterrier C, Shechtman Y, Jug F, Heilemann M, Jacquemet G, Henriques R. Democratising deep learning for microscopy with ZeroCostDL4Mic. Nature Communications. 2021;12:2276. doi: 10.1038/s41467-021-22518-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Witten IH, Frank E, Hall MA. In: Data Mining. Witten IH, Frank E, Hall MA, editors. Morgan Kaufmann; 2017. He WEKA workbench; pp. 553–571. [DOI] [Google Scholar]
  78. Wu F, Fan J, He Y, Xiong A, Yu J, Li Y, Zhang Y, Zhao W, Zhou F, Li W, Zhang J, Zhang X, Qiao M, Gao G, Chen S, Chen X, Li X, Hou L, Wu C, Su C, Ren S, Odenthal M, Buettner R, Fang N, Zhou C. Single-cell profiling of tumor heterogeneity and the microenvironment in advanced non-small cell lung cancer. Nature Communications. 2021;12:2540. doi: 10.1038/s41467-021-22801-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Xia Y, Pfeifer CR, Zhu K, Irianto J, Liu D, Pannell K, Chen EJ, Dooling LJ, Tobin MP, Wang M, Ivanovska IL, Smith LR, Greenberg RA, Discher DE. Rescue of DNA damage after constricted migration reveals a mechano-regulated threshold for cell cycle. The Journal of Cell Biology. 2019;218:2545–2563. doi: 10.1083/jcb.201811100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Yan X, Stuurman N, Ribeiro SA, Tanenbaum ME, Horlbeck MA, Liem CR, Jost M, Weissman JS, Vale RD. High-content imaging-based pooled CRISPR screens in mammalian cells. The Journal of Cell Biology. 2021;220:e202008158. doi: 10.1083/jcb.202008158. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Zhang CZ, Spektor A, Cornils H, Francis JM, Jackson EK, Liu S, Meyerson M, Pellman D. Chromothripsis from DNA damage in micronuclei. Nature. 2015;522:179–184. doi: 10.1038/nature14493. [DOI] [PMC free article] [PubMed] [Google Scholar]

eLife Assessment

Silke Hauf 1

This valuable paper reports machine learning-based image analysis pipelines for the automated segmentation of micronuclei and the detection and sorting of micronuclei-containing cells. These are powerful new tools for researchers who study micronuclei and their physiologic consequences. The analysis of the new tools and their benchmarking is rigorous and convincing; applications and remaining limitations are well explained in the paper.

Reviewer #1 (Public review):

Anonymous

DiPeso et al. develop two tools to (i) classify micronucleated (MN) cells, which they call VCS MN, and (ii) segment micronuclei and nuclei with MNFinder. They then use these tools to identify transcriptional changes in MN cells.

The strengths of this study are:

- Developing highly specialized tools to speed up the analysis of specific cellular phenomena such as MN formation and rupture is likely valuable to the community and neglected by developers of more generalist methods.

- A lot of work and ideas have gone into this manuscript. It is clearly a valuable contribution.

- Combining automated analysis, single-cell labeling, and cell sorting is an exciting approach to enrich for phenotypes of interest, which the authors demonstrate here.

The authors addressed my original concerns related to the first version of this manuscript.

Reviewer #2 (Public review):

Anonymous

Summary:

Micronuclei are aberrant nuclear structures frequently seen following the missegregation of chromosomes. The authors present two image analysis methods, one robust and another rapid, to identify micronuclei (MN) bearing cells. To analyse their software efficacy, the authors study images of cells treated with MPS1 inhibitor to induce chromosome missegregation. Next, the authors use RNA-seq to assess the outcomes of their MN-identifying methods: they do not observe a transcriptomic signature specific to MN but find changes that correlate with aneuploidy status. Overall, this work offers new tools to identify MN-presenting cells, and it sets the stage with clear benchmarks for further software development.

Strengths:

Currently, there are no robust MN classifiers with a clear quantification of their efficiency across cell lines (mIoU score). The software presented here tries to address this gap. GitHub material (images, ground truth labels, tools, protocols, etc.) provided is a great asset to computational biologists. The method has been tested in more than one cell line. This method can help integrate cell biology and 'omics' data, making it suitable for multimodal studies.

Weaknesses:

Although the classifier outperforms available tools for MN segmentation by providing mIoU, it's not yet at a point where it can be reliably applied to functional genomics assays where we expect a range of phenotypic penetrance in most cell lines (e.g., misshapen, multinucleated, and lagging DNA in addition to micronucleated cells). The discussion considers the nature and proportion of MN in RPE1 cells, and how the classifier is well-suited for RPE1 that predominantly display MN structures. Whether the classifier can rigorously assign MN-presenting cells amidst drastic nuclear aberrancies following a spindle checkpoint loss needs to be tested in the future.

Reviewer #3 (Public review):

Anonymous

Summary:

The authors develop automated methods to visually identify micronuclei (MN) and MN-containing cells. The authors then use these methods to isolate MN-containing RPE-1 cells post-photoactivation and analyze transcriptional changes in cells with and without micronuclei. The authors find that RPE-1 cells with MN have similar transcriptomic changes as aneuploid cells and that MN rupture does not lead to vast changes in the transcriptome.

Strengths:

The authors develop a method that allows for automating measurements and analysis of micronuclei. This has been something that the field has been missing for a long time. Using such a method has the potential to greatly enhance the field's ability to analyze micronuclei and understand the downstream consequences. The authors also develop a method to identify cells with micronuclei in real-time, mark them using photoconversion, and then isolate them via cell sorting, which could change the way we isolate and study MN-containing cells, and the scale at which we do it. The authors use this method to look at the transcriptome. This method is very powerful as it can allow for the separation of a heterogenous population and subsequent analysis with a much higher sample number than previously possible.

Weaknesses:

The major weakness of this paper is the transcriptomic analysis of MN. There is in general large variance between replicates in experiments looking at cells with ruptured versus intact micronuclei. This limits our ability to assess if lack of changes are due to truly not having changes between these populations or experimental limitations. More transcriptomic analysis will be necessary to fully understand the downstream consequences of MN rupture.

eLife. 2025 Jun 2;13:RP101579. doi: 10.7554/eLife.101579.3.sa4

Author response

Lucian DiPeso 1, Sriram Pendyala 2, Heather Z Huang 3, Douglas M Fowler 4, Emily M Hatch 5

The following is the authors’ response to the original reviews.

Public Reviews:

Reviewer #1 (Public review):

DiPeso et al. develop two tools to (i) classify micronucleated (MN) cells, which they call VCS MN, and (ii) segment micronuclei and nuclei with MMFinder. They then use these tools to identify transcriptional changes in MN cells.

The strengths of this study are:

(1) Developing highly specialized tools to speed up the analysis of specific cellular phenomena such as MN formation and rupture is likely valuable to the community and neglected by developers of more generalist methods.

(2) A lot of work and ideas have gone into this manuscript. It is clearly a valuable contribution.

(3) Combining automated analysis, single-cell labeling, and cell sorting is an exciting approach to enrich phenotypes of interest, which the authors demonstrate here.

Weaknesses:

(1) Images and ground truth labels are not shared for others to develop potentially better analysis methods.

We regret this omission and thank the reviewer for pointing it out. Both the images and ground truth labels for VCS MN and MNFinder are now available on the lab’s github page and described in the README.txt files. VCS MN: https://github.com/hatch-lab/fast-mn. MNFinder: https://github.com/hatch-lab/mnfinder.

(2) Evaluations of the methods are often not fully explained in the text.

The text has been extensively updated to include a full description of the methods and choices made to develop the VCS MN and MNFinder image segmentation modules.

(3) To my mind, the various metrics used to evaluate VCS MN reveal it not to be terribly reliable. Recall and PPV hover in the 70-80% range except for the PPV for MN+. It is what it is - but do the authors think one has to spend time manually correcting the output or do they suggest one uses it as is?

VCS MN attempts to balance precision and recall with speed to reduce the fraction of MN changing state from intact to ruptured during a single cell cycle during a live-cell isolation experiment. In addition, we chose to prioritize inclusion of small MN adjacent to the nucleus in our positive calls. This meant that there were more false positives (lower PPV) than obtained by other methods but allowed us to include this highly biologically relevant class of MN in our MN+ population. Thus, for a comprehensive understanding of the consequences of MN formation and rupture, we recommend using the finder as is. However, for other visual cell sorting applications where a small number of highly pure MN positive and negative cells is preferred, such as clonal outgrowth or metastasis assays, we would recommend using the slower, but more precise, MNFinder to get a higher precision at a cost of temporal resolution. In addition, MNFinder, with its higher flexibility and object coverage, is recommended for all fixed cell analyses.

Reviewer #2 (Public review):

Summary:

Micronuclei are aberrant nuclear structures frequently seen following the missegregation of chromosomes. The authors present two image analysis methods, one robust and another rapid, to identify micronuclei (MN) bearing cells. The authors induce chromosome missegregation using an MPS1 inhibitor to check their software outcomes. In missegregation-induced cells, the authors do not distinguish cells that have MN from those that have MN with additional segregation defects. The authors use RNAseq to assess the outcomes of their MN-identifying methods: they do not observe a transcriptomic signature specific to MN but find changes that correlate with aneuploidy status. Overall, this work offers new tools to identify MN-presenting cells, and it sets the stage with clear benchmarks for further software development.

Strengths:

Currently, there are no robust MN classifiers with a clear quantification of their efficiency across cell lines (mIoU score). The software presented here tries to address this gap. GitHub material (tools, protocols, etc) provided is a great asset to naive and experienced computational biologists. The method has been tested in more than one cell line. This method can help integrate cell biology and 'omics' studies.

Weaknesses:

Although the classifier outperforms available tools for MN segmentation by providing mIOU, it's not yet at a point where it can be reliably applied to functional genomics assays where we expect a range of phenotypic penetrance.

We agree that the MNFinder module has limitations with regards to the degree of nuclear atypia and cell density that can be tolerated. Based on the recall and PPV values and their consistency across the majority conditions analyzed, we believe that MNFinder can provide reliable results for MN frequency, integrity, shape, and label characteristics in a functional genomics assay in many commonly used adherent cell lines. We also added a discussion of caveats for these analyses, including the facts that highly lobulated nuclei will have higher false positive rates and that high cell confluency may require additional markers to ensure highly accurate assignment of MN to nuclei.

Spindle checkpoint loss (e.g., MPS1 inhibition) is expected to cause a variety of nuclear atypia: misshapen, multinucleated, and micronucleated cells. It may be difficult to obtain a pure MN population following MPS1 inhibitor treatment, as many cells are likely to present MN among multinucleated or misshapen nuclear compartments. Given this situation, the transcriptomic impact of MN is unlikely to be retrieved using this experimental design, but this does not negate the significance of the work. The discussion will have to consider the nature, origin, and proportion of MN/rupture-only states - for example, lagging chromatids and unaligned chromosomes can result in different states of micronuclei and also distinct cell fates.

We appreciate the reviewer’s comments and now quantify the frequency of other nuclear atypias and MN chromosome content in RPE1 cells after 24 h Mps1 inhibition (Fig. S1). In summary, we find only small increases in nuclear atypia, including multinucleate cells, misshapen nuclei, and chromatin bridges, compared to the large increase in MN formation. This contrasts with what is observed when mitosis is delayed using nocodazole or CENPE inhibitors where nuclear atypia is much more frequent. Importantly, after Mps1 inhibition, RPE1 cells with MN were only slightly more likely to have a misshapen nucleus compared to cells without MN (Fig. S1C).

Interestingly, this analysis showed that the VCS MN pipeline, which uses the Deep Retina segmenter to identify nuclei, has a strong bias against lobulated nuclei and frequently fails to find them (Fig. S2B). Therefore, the cell populations analyzed by RNAseq were largely depleted of highly misshapen nuclei and differences in nuclear atypia frequency between MN+ and MN- cells in the starting population were lost (Fig. S9A, compare to Fig. S1C). This strongly suggests that the transcript changes we observed reflect differences in MN frequency and aneuploidy rather than differences in nuclei morphology.

We agree with the reviewer that MN rupture frequency and formation, and downstream effects on cell proliferation and DNA damage, are sensitive to the source of the missegregated chromatin. In the revised manuscript we make clear that we chose Mps1 inhibition because it is strongly biased towards whole chromosome MN (Fig. S1E), limiting signal from DNA damage products, including chromosome fragments and chromatin bridges. This provides a base line to disambiguate the consequences of micronucleation and DNA damage in more complex chromosome missegregation processes, such as DNA replication disruption and irradiation.

Reviewer #3 (Public review):

Summary:

The authors develop a method to visually analyze micronuclei using automated methods. The authors then use these methods to isolate MN post-photoactivation and analyze transcriptional changes in cells with and without micronuclei of RPE-1 cells. The authors observe in RPE-1 cells that MN-containing cells show similar transcriptomic changes as aneuploidy, and that MN rupture does not lead to vast changes in the transcriptome.

Strengths:

The authors develop a method that allows for automating measurements and analysis of micronuclei. This has been something that the field has been missing for a long time. Using such a method has the potential to advance micronuclei biology. The authors also develop a method to identify cells with micronuclei in real time and mark them using photoconversion and then isolate them via FACS. The authors use this method to study the transcriptome. This method is very powerful as it allows for the sorting of a heterogenous population and subsequent analysis with a much higher sample number than could be previously done.

Weaknesses:

The major weakness of this paper is that the results from the RNA-seq analysis are difficult to interpret as very few changes are found to begin with between cells with MN and cells without. The authors have to use a 1.5-fold cut-off to detect any changes in general. This is most likely due to the sequencing read depth used by the authors. Moreover, there are large variances between replicates in experiments looking at cells with ruptured versus intact micronuclei. This limits our ability to assess if the lack of changes is due to truly not having changes between these populations or experimental limitations. Moreover, the authors use RPE-1 cells which lack cGAS, which may contribute to the lack of changes observed. Thus, it is possible that these results are not consistent with what would occur in primary tissues or just in general in cells with a proficient cGAS/STING pathway.

We agree with the reviewer’s assessment of the limitations of our RNA-Seq analysis. After additional analysis, we propose an alternative explanation for the lower expression changes we observe in the MN+ and Mps1 inhibitor RNA-Seq experiments. In summary, we find that VCS MN has a strong bias against highly lobulated nuclei that depletes this class of cells from both the bulk analysis and the micronucleated cell populations (Fig. S9A). Based on this result, we propose that our analysis reduces the contribution of nuclear atypia to these transcriptional changes and that nuclear morphology changes are likely a signaling trigger associated with aneuploidy.

We believe that this finding strengthens our overall conclusion that MN formation and rupture do not cause transcriptional changes, as suppressing the signaling associated with nuclei atypia should increase sensitivity to changes from the MN. However, we cannot completely rule out that MN formation or rupture cause a broad low-level change in transcription that is obscured by other signals in the dataset.

As to cGAS signaling, several follow up papers and even the initial studies from the Greenburg lab show that MN rupture does not activate cGAS and does not cause cGAS/STING-dependent signaling in the first cell cycle (see citations and discussion in text). Therefore, we expect the absence of cGAS in RPE1 cells will have no effect in the first cell cycle, but could alter the transcriptional profile after mitosis. Although analysis of RPE1 cGAS+ cells or primary cells in these experiments will be required to definitively address this point, we believe that our interpretation of our RNAseq results is sufficiently backed up by the literature to warrant our conclusion that MN formation and rupture do not induce a transcriptional response in the first cell cycle.

Reviewer #1 (Recommendations for the authors):

I do not recommend additional experimental or computational work. Instead, I just recommend adapting the claims of the manuscript to what has been done. I am just asking for further clarification and minor rewriting.

(1) The manuscript is written like a molecular biology paper with sparse explanations of the authors' reasoning, especially in the development of their algorithms. I was often lost as to why they did things in one way or another.

The revised manuscript has thorough explanations and additional data and graphics defining how and why the VCS MN and MNFinder modules were developed. We hope that this clears up many of the questions the reviewer had and appreciate their guidance on making it more readable for scientists from different backgrounds.

(2) Evaluations of their method are often not fully explained, for example:

"On average, 75% of nuclei per field were correctly segmented and cropped."

"MN segments were then assigned to 'parent' nuclei by proximity, which correctly associated 97% of MN."

Were there ground truth images and labels created? How many? For example, I don't know how the authors could even establish a ground-truth for associating MNs to nuclei if MNs happened to be almost equidistant between two nuclei in their images.

I suggest a separate subsection early in the Results section where the underlying imaging data + labels are presented.

We added new sections to the text and figures at the beginning of the VCS MN and MNFinder subsections (Fig. S2 and Fig. S5) with specific information about how ground truth images and labels were generated for both modules and how these were broken up for training, validation, and testing.

We also added information and images to explain how ground truth MN/nucleus associations were derived. In summary, we took advantage of the fact that 2xDendra-NLS is present at low levels in the cytoplasm to identify cell boundaries. This combined with a subconfluent cell population allowed us to unambiguously group MN and nuclei for 98% of MN, we estimate. These identifications were used to generate ground truth labels and analyze how well proximity defines MN/nuclei groups (Fig.s S1 and S2).

(3) Overall, I find the sections long and more subtitles would help me better navigate the manuscript.

Where possible, we have added subtitles.

(4) Everything following "To train the model, H2B channel images were passed to a Deep Retina neural net ..." is fully automated, it seems to me. Thus, there seems to be no human intervention to correct the output before it is used to train the neural network. Therefore, I do not understand why a neural network was trained at all if the pipeline for creating ground truth labels worked fully automatically. At least, the explanations are insufficient.

We apologize for the initial lack of clarity in the text and included additional details in the revision. We used the Deep Retina segmenter to crop the raw images to areas around individual nuclei to accelerate ground truth labeling of MN. A trained user went through each nucleus crop and manually labeled pixels belonging to MN to generate the ground truth dataset for training, validation, and imaging in VCS MN (Fig. S2A).

(5) To my mind, the various metrics used to evaluate VCS MN reveal it not to be terribly reliable. Recall and PPV hover in the 70-80% range except for the PPV for MN+. It is what it is - but do the authors think one has to spend time manually correcting the output or do they suggest one uses it as is? I understand that for bulk transcriptomics, enrichment may be sufficient but for many other questions, where the wrong cell type could contaminate the population, it is not.

Remarks in the Results section on what the various accuracies mean for different applications would be good (so one does not need to wait for the Discussion section).

One of the strengths of the visual cell sorting system is that any image analysis pipeline can be used with it. We used VCS MN for the transcriptomics experiment, but for other applications a user could run visual cell sorting in conjunction with MNFinder for increased purity while maintaining a reasonable recall or use a pre-existing MN segmentation program that gives 100% purity but captures only a specific subgroup of micronucleated cells (e.g. PIQUE).

To maintain readability, especially with the expansion of the results sections, we kept the discussion of how we envision using visual cell sorting for other MN-based applications in the discussion section.

(6) I am confused about what "cell" is referring to in much of the manuscript. Is it the nucleus + MNs only? Is it the whole cell, which one would ordinarily think it is? If so, are there additional widefield images, where one can discern cell boundaries? I found the section "MNFinder accurately ..." very hard to read and digest for this reason and other ambiguous wording. I suggest the authors take a fresh look at their manuscript and see whether the text can be improved for clarity. I did not find it an easy read overall, especially the computational part.

After re-examining how “cell” was used, we updated the text to limit its use to the MNFinder arm tasked with identifying MN-nucleus associations where the convex hull defined by these objects is used to determine the “cell” boundary. In all other cases we have replaced cell with “nucleus” because, as the reviewer points out, that is what is being analyzed and converted. We hope this is clearer.

(7) Post-FACS PPVs are not that great (Figure 3c). It depends on the question one wants to answer whether ~70% PPV is good enough. Again, would be good to comment on.

We added discussion of this result to the revision. In summary, a likely reason for the reduced PPV is that, although we maintain the cells in buffer with a Cdk1 inhibitor, we know that some proportion of the cells go through mitosis post-sorting. Since MN are frequently reincorporated into the nucleus after mitosis (Hatch et al, 2013; Zhang et al., 2015), we expect this to reduce the MN+ population. Thus, we expect that the PPV in the RNAseq population is higher than what we can measure by analyzing post-sorted cells that have been plated and analyzed later.

(8) I am thoroughly confused as to why the authors claim that their system works in the "absence of genetic perturbations" and why they emphasize the fact that their cells are non-transformed: They still needed a fluorescent label and they induce MNs with a chemical Mps1 inhibitor. (The latter is not a genetic manipulation, of course, but they still need to enrich MNs somehow. That is, their method has not been tested on a cell population in which MNs occur naturally, presumably at a very low rate, unless I missed something.) A more careful description of the benefits of their method would be good.

We apologize for the confusion on these points and hope this is clarified in the revision. We were comparing our system, which can be made using transient transfection, if desired, to current tools that disambiguate aneuploidy and MN formation by deleting parts of chromosomes or engineering double strand breaks with CRISPR to generate single chromosome-specific missegregation events. Most of these systems require transformed cancer cells to obtain high levels of recombination. In contrast, visual cell sorting can isolate micronucleated cells from any cell line that can exogenously express a protein, including primary cells and non-transformed cells like RPE1s.

Other minor points:

(1) The authors should not refer to "H2B channels" but to "H2B-emiRFP703 channels". It may seem obvious to the authors but for someone reading the manuscript for the very first time, it was not. I was not sure whether there were additional imaging modalities used for H2B/nucleus/chromatin detection before I went back and read that only fluorescence images of H2B-emiRFP703 were used. To put it another way, the authors are detecting fluorescence, not histones -- unless I misunderstood something.

To address this point, we altered the text to read “H2B-emiRFP703” when discussing images of this construct. For MNFinder some images were of cells expressing H2B-GFP, which has also been clarified.

(2) If the level of zoom on my screen is such that I can comfortably read the text, I cannot see much in the figure panels. The features that I should be able to see are the size of a title. The image panels should be magnified.

In the revision, the images are appended to the end at full resolution to overcome this difficulty. Thank you for your forbearance.

Reviewer #2 (Recommendations for the authors):

The methods are adequately explained. The Results text narrating experiments and data analysis is clear. Interpretation of a few results could be clarified and strengthened as explained below.

(1) RNAseq experiments are a good proof of principle. To strengthen their interpretation in Figures 4 and 6, I would recommend the authors cite published work on checkpoint/MPS1 loss-induced chromosome missegregation (PMID: 18545697, PMID: 33837239, PMC9559752) and consider in their discussion the 'origin' and 'proportion' of micronucleated cells and irregularly shaped nuclei expected in RPE1 lines. This will help interpret Figure 6 findings on aneuploidy signature accurately. Not being able to see an MN-specific signature could be due to the way the biological specimen is presented with a mixture of cells with 'MN only' or 'rupture' or 'MN along with misshapen nuclei'. These features may all link to aneuploidy rather than 'MN' specifically.

We appreciate the reviewer’s suggestion and added a new analysis of nuclear atypia after Mps1 inhibition in RPE1 cells to Fig. S1. Overall, we found that Mps1 inhibition significantly, but modestly, increased the proportion of misshapen nuclei and chromatin bridges. Multinucleate cells were so rare that instead of giving them their own category we included them in “misshapen nuclei.” These results are consistent with images of Msp1i treated RPE1 cells from He et al. 2019 and Santaguida et al. 2017 and distinct from the stronger changes in nuclear morphology observed after delaying mitosis by nocodazole or CENPE inhibition.

We also found that the Deep Retina segmenter used to identify nuclei in VCS MN had a significant bias against highly lobulated nuclei (Fig. S2B) that led to misshapen nuclei being largely excluded from the RNAseq analyses. As a result we found no enrichment of misshapen nuclei, chromatin bridges, or dead/mitotic nuclear morphologies in MN+ compared to MN- nuclei in our RNASeq experiments (Fig. S9A).

(2) As the authors clarify in the response letter, one round of ML is unlikely to result in fully robust software; additional rounds of ML with other markers will make the work robust. It will be useful to indicate other ML image analysis tools that have improved through such reiterations. They could use reviews on challenges and opportunities using ML approaches to support their statement. Also in the introduction, I would recommend labelling as 'rapid' instead of 'rapid and precise' method.

We updated the text to reference review articles that discuss the benefit of additional training for increasing ML accuracy and changed the text to “rapid.”

(3) The lack of live-cell studies does not allow the authors to distinguish the origin of MN (lagging chromatids or unaligned chromosomes). As explained in 1, considering these aspects in discussion would strengthen their interpretation. Live-cell studies can help reduce the dependencies on proximity maps (Figure S2).

The revised text includes new references and data (Fig. S1E) demonstrating that Mps1 inhibition strongly biases towards whole chromosome missegregation and that MN are most likely to contain a single centromere positive chromosome rather than chromatin fragments or multiple chromosomes.

(4) Mean Intersection over Union (mIOU) is a good measure to compare outcomes against ground truth. However, the mIOU is relatively low (Figure 2D) for HeLa-based functional genomics applications. It will help to discuss mIOU for other classifiers (non-MN classifiers) so that they can be used as a benchmark (this is important since the authors state in their response that they are the first to benchmark an MN classifier). There are publications for mitochondria, cell cortex, spindle, nuclei, etc. where IOU has been discussed.

We added references to classifiers for other small cellular structures. We also evaluated major sources of error in MNFinder found that false negatives are enriched in very small MN (3 to 9 pixels, or about 0.4 µm2 – 3 µm2, Fig. S6B). A similar result was obtained for VCS MN (Fig. S3B). Because small changes in the number of pixels identified in small objects can have outsized effects on mIoU scores, we suspect that this is exerting downward pressure on the mIoU value. Based on the PPV and recall values we identified, we believe that MNFinder is robust enough to use for functional genomics and screening applications with reasonable sample sizes.

(5) Figure 5 figure legend title is an overinterpretation. MN and rupture-initiated transcriptional changes could not be isolated with this technique where several other missegregation phenotypes are buried (see point 1 above).

We decided to keep the figure title legend based on our analysis of known missegregation phenotypes in Fig. S1 and S9 showing that there is no difference in major classes of nuclear atypia between MN+ and MN- populations in this analysis. Although we cannot rule out that other correlated changes exist, we believe that the title represents the most parsimonious interpretation.

Minor comments

(1) The sentence in the introduction needs clarification and reference. "However, these interventions cause diverse "off-target" nuclear and cellular changes, including chromatin bridges, aneuploidy, and DNA damage." Off-target may not be the correct description since inhibiting MPS1 is expected to cause a variety of problems based on its role as a master kinase in multiple steps of the chromosome segregation process. Consider one of the references in point 1 for a detailed live-cell view of MPS1 inhibitor outcomes.

We have changed “off-target” to “additional” for clarity.

(2) In Figure 3 or S3, did the authors notice any association between the cell cycle phase and MN or rupture presence? Is this possible to consider based on FACS outcomes or nuclear shapes?

Previous work by our lab and others have shown that MN rupture frequency increases during the cell cycle (Hatch et al., 2013; Joo et al., 2023). Whether this is stochastic or regulated by the cell cycle may depend on what chromosome is in the MN (Mammel et al., 2021) and likely the cell line. Unfortunately, the H2B-emiRFP703 fluorescence in our population is too variable to identify cell cycle stage from FACS or nuclear fluorescence analysis.

(3) Figure 5 - Please explain "MA plot".

An MA plot, or log fold-change (M) versus average (A) gene expression, is a way to visualize differently expressed genes between two conditions in an RNASeq experiment and is used as an alternative to volcano plots. We chose them for our paper because most of the expression changes we observed were small and of similar significance and the MA plot spreads out the data compared to a volcano plot and allowed a better visualization of trends across the population.

(4) Page 7: "our results strongly suggest that protein expression changes in MN+ and rupture+ cells are driven mainly by increased aneuploidy rather than cellular sensing of MN formation and rupture.". This is an overstatement considering the mIOU limits of the software tool and the non-exclusive nature of MN in their samples.

We agree that we cannot rule out that an unknown masking effect is inhibiting our ability to observe small broad changes in transcription after MN formation or rupture. However, we believe we have minimized the most likely sources of masking effects, including nuclear atypia and large scale aneuploidy differences, and thus our interpretation is the most likely one.

Reviewer #3 (Recommendations for the authors):

Overall, the authors need to explain their methods better, define some technical terms used, and more thoroughly explain the parameters and rationale used when implementing these two protocols for identifying micronuclei; primarily as this is geared toward a more general audience that does not necessarily work with machine learning algorithms.

(1) A clearer description in the methods as to how accuracy was calculated. Were micronuclei counted by hand or another method to assess accuracy?

We significantly expanded the section on how the machine learning models were trained and tested, including how sensitivity and specificity metrics were calculated, in both the results and the methods sections. The code used to compare ground truth labels to computed masks is also now included in the MNFinder module available on the lab github page.

(2) Define positive predictive value.

The text now says “the positive predictive value (PPV, the proportion of true positives, i.e. specificity) and recall (the proportion of MN found by the classifier, i.e. sensitivity)…”.

(3) Why is it a problem to use the VCS MN at higher magnifications where undersegmentation occurs? What do the authors mean by diminished performance (what metrics are they using for this?).

We have included a representative image and calculated mIoU and recall for 40x magnification images analyzed by MNFinder after rescaling in Fig. 2A. In summary, VCS MN only correctly labeled a few pixels in the MN, which was sufficient to call the adjacent nucleus “MN+” but not sufficient for other applications, such as quantifying MN area. In addition, VCS MN did much worse at identifying all the MN in 40x images with a recall, or sensitivity, metric of 0.36. We are not sure why. Developing MNFinder provided a module that was well suited to quantify MN characteristics in fixed cell images, an important use case in MN biology.

(4) The authors should compare MN that are analyzed and not analyzed using these methods and define parameters. Is there a size limitation? Closeness to the main nucleus?

We added two new figures defining what contributes to module error for both VCS MN (Fig. S3) and MNFinder (Fig. S6). For VCS MN, false negatives are enriched in very large or very small MN and tend to be dimmer and farther from the nucleus than true positives. False positives are largely misclassification of small dim objects in the image as MN. For MNFinder, the most missed class of MN are very small ones (3-9 px in area) and the majority of false positives are misclassifications of elongated nuclear blebs as MN.

(5) Are there parameters in how confluent an image must be to correctly define that the micronucleus belongs to the correct cell? The authors discussed that this was calculated based on predicted distance. However, many factors might affect proper calling on MN. And the authors should test this by staining for a cytosolic marker and calculating accuracy.

We updated the text with more information about how the cytoplasm was defined using leaky 2x-Dendra2-NLS signal to analyze the accuracy of MN/nucleus associations (Fig. S2G-H). In addition, we quantified cell confluency and distance to the first and second nearest neighbor for each MN in our training and testing image datasets. We found that, as anticipated, cells were imaged at subconfluent concentrations with most fields having a confluency around 30% cell coverage (Fig. S2E) and that the average difference in distance between the closest nucleus to an MN and the next closest nucleus was 3.3 fold (Fig. S2F). We edited the discussion section to state that the ability of MN/nuclear proximity to predict associations at high cell confluencies would have to be experimentally validated.

(6) The authors measure the ratio of Dendra2(Red) v. Dendra2 (Green) in Figure 3B to demonstrate that photoconversion is stable. This measurement, to me, is confusing, as in the end, the authors need to show that they have a robust conversion signal and are able to isolate these data. The authors should directly demonstrate that the Red signal remains by analyzing the percent of the Red signal compared to time point 0 for individual cells.

We found a bulk analysis to be more powerful than trying to reidentify individual cells due to how much RPE1 cells move during the 4 and 8 hours between image acquisitions. In addition, we sort on the ratio between red and green fluorescence per cell, rather than the absolute fluorescence, to compensate for variation in 2xDendra-NLS protein expression between cells. Therefore, demonstrating that distinct ratios remained present throughout the time course is the most relevant to the downstream analysis.

To address the reviewer’s concern, we replotted the data in Fig. 3B to highlight changes over time in the raw levels of red and green Dendra fluorescence (Fig. S7D). As expected, we see an overall decrease in red fluorescence intensity, and complementary increase in green fluorescence intensity, over 8 hours, likely due to protein turnover. We also observe an increase in the number of nuclei lacking red fluorescence. This is expected since the well was only partially converted and we expect significant numbers of unconverted cells to move into the field between the first image and the 8 hour image.

(7) The authors isolate and subsequently use RNA-sequencing to identify changes between Mps1i and DMSO-treated cells. One concern is that even with the less stringent cut-off of 1.5 fold there is a very small change between DMSO and MPS1i treated cells, with only 63 genes changing, none of which were affected above a 2-fold change. The authors should carefully address this, including why their dataset sees changes in many more pathways than in the He et al. and Santaguida et al. studies. Is this due to just having a decreased cut-off?

The reviewer correctly points out that we observed an overall reduction in the strength of gene expression changes between our dataset of DMSO versus Mps1i treated RPE1 cells compared to similar studies. We suggest a couple reasons for this. One is that the log2 fold changes observed in the other studies are not huge and vary between 2.5 and -3.8 for He et al., 3.3 and -2.3 for Santaguida et al., and -0.8 and 1.6 for our study. This variability is within a reasonable range for different experimental conditions and library prep protocols. A second is that our protocol minimizes a potential source of transcriptional change – nuclear lobulation – that is present in the other datasets.

For the pathway analysis we did not use a fold-change cut-off for any data set, instead opting to include all the genes found to be significantly different between control and Mps1i treated cells for all three studies. Our read-depth was higher than that of the two published experiments, which could contribute to an increased DEG number. However, we hypothesize that our identification of a broader number of altered pathways most likely arises from increased sensitivity due to the loss of covering signal from transcriptional changes associated with increased nuclear atypia. Additional visual cell sorting experiments sorting on misshapen nuclei instead of MN would allow us to determine the accuracy of this hypothesis.

(8) Moreover, clustering (in Figure 5E) of the replicates is a bit worrisome as the variances are large and therefore it is unclear if, with such large variance and low screening depth, one can really make such a strong conclusion that there are no changes. The authors should prove that their conclusion that rupture does not lead to large transcriptional changes, is not due to the limitations of their experimental design.

We agree with the reviewers that additional rounds of RNAseq would improve the accuracy of our transcriptomic analysis and could uncover additional DEGs. However, we believe the overall conclusion to be correct based on the results of our attempt to validate changes in gene expression by immunofluorescence. We analyzed two of the most highly upregulated genes in the ruptured MN dataset, ATF3 and EGR1. Although we saw a statistically significant increase in ATF3 intensity between cells without MN and those with ruptured MN, the fold change was so small compared to our positive control (100x less) that we believe it is it is more consistent with a small increase in the probability of aneuploidy rather than a specific signature of MN rupture.

(9) The authors also need to address the fact that they are using RPE-1 cells more clearly and that the lack of effect in transcriptional changes may be simply due to the loss of cGAS-STING pathway (Mackenzie et al., 2017; Harding et al., 2017; etc.).

As we discuss above in the public comments section, the literature is clear that MN do not activate cGAS in the first cell cycle after their formation, even upon rupture. Therefore, we do not expect any changes in our results when applied to cGAS-competent cells. However, this expectation needs to be experimentally validated, which we plan to address in upcoming work.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. DiPeso L, Pendyala S, Huang HZ, Fowler DM, Hatch EM. 2025. Image-based identification and isolation of micronucleated cells to dissect cellular consequences. NCBI Gene Expression Omnibus. GSE291811 [DOI] [PMC free article] [PubMed]
    2. Hatch E. 2025. Image-based identification and isolation of micronucleated cells to dissect cellular consequences. BioImage Archive. S-BIAD1990 [DOI] [PMC free article] [PubMed]
    3. Santaguida S, Richardson A, Iyer D, M'Saad O, Zasadil L, Knouse K, Wong Y, Rhind N, Desai A, Amon A. 2017. Aneuploidy triggers an immune response. NCBI Gene Expression Omnibus. GSE83647
    4. He Q, Au B, Kulkarni M, Shen Y, KahJ L, Maimaiti J, Chong HC, Lim EH, Rancati G, Sinha I, Fu Z, Wang X, Crasta KC, ChengKit W, MoniqueNH L, JohnE C. 2019. Genome-wide transcriptional response to random aneuploidy in human cells. NCBI Gene Expression Omnibus. GSE109519

    Supplementary Materials

    Supplementary file 1. Additional tables containing transcriptomics data.

    (a) DMSO 800 vs 200 DEG. List of differentially expressed genes (FDR ≤0.05) in DMSO treated RPE1 cells isolated after an 800ms or 200ms UV pulse. For all DEG analyses: padj = false-discovery rate adjusted p-values. (b) Table 2: Msp1i DEG. List of differentially expressed genes (FDR ≤0.05) in Mps1i versus DMSO RPE1 cells. (c) Msp1i DEG log2FC. List of differentially expressed genes in Mps1i versus DMSO RPE1 cells filtered for log2 fold change above 0.58 or below –0.58. (d) He et al DEG. List of differentially expressed genes (FDR ≤0.05) in RPE1 cells treated with nocodazole for 8 h versus control, initially reported in He et al., 2018. Overlap with Mps1i DEG list noted in last column. (e) Santaguida et al DEG. List of differentially expressed genes (FDR ≤0.05) in RPE1 cells that were treated with the Mps1i molecule reversine for 12 h, released, and ceased to divide, compared to control cells. Results initially reported in Santaguida et al., 2017. Overlap with Mps1i DEG list noted in last column. (f) Hallmark Mps1i. List of MSigDB Hallmark gene sets that are significantly enriched (FDR ≤0.05) in Mps1i treated RPE1 cells. For all Hallmark lists, NES=normalized enrichment score, padj = false discovery rate adjusted p-values, size = number of genes in gene set, leading_edge = , sig_genes_in_geneset=the gene names of set genes that were significantly upregulated. (g) Hallmark He et al. List of MSigDB Hallmark gene sets that are significantly enriched in nocodazole treated RPE1 cells from He et al., 2018. (h) Hallmark Santaguida et al. List of MSigDB Hallmark gene sets that are significantly enriched in reversine treated RPE1 cells from Santaguida et al., 2017. (i) MN +DEG. List of differentially expressed genes (FDR ≤0.05) in Mps1i treated RPE1 cells with MN versus without MN. (j) MN +DEG log2FC. List of differentially expressed genes in Mps1i treated RPE1 cells with MN versus without MN filtered for log2 fold change above 0.58 or below –0.58. Overlap with Mps1i DEG list noted in last column. (k) Ruptured +DEG. List of differentially expressed genes (FDR ≤0.05) in Mps1i treated micronucleated RPE1 cells with only intact MN versus at least 1 ruptured MN. (l) Ruptured DEG +log2 FC. List of differentially expressed genes in Mps1i treated micronucleated RPE1 cells with only intact MN versus at least 1 ruptured MN MN filtered for absolute log2 fold change ≥0.58. Overlap with Mps1i DEG list noted in last column. (m) Hallmark MN+. List of MSigDB Hallmark gene sets that are significantly enriched in Mps1i treated RPE1 cells with MN. (n) Hallmark ruptured+. List of MSigDB Hallmark gene sets that are significantly enriched in Mps1i treated RPE1 cells with MN and at least 1 ruptured MN versus no ruptured MN. (o) Log2FC per replicate. List of log2 fold change values for Mps1i DEGs with absolute log2FC ≥0.58 broken out by MN +and rupture +replicate. Genes with all NA values for either all MN +or all rupture +replicates were excluded from analysis. enriched_in_rupture+_cluster=genes present in cluster enriched in increased expression over Mps1i DEGs.

    elife-101579-supp1.xlsx (426.9KB, xlsx)
    MDAR checklist

    Data Availability Statement

    All cell lines and plasmids made for this study are available upon request. Both the MNFinder and VCS MN modules and supporting code are available on the Hatch Lab GitHub repository, along with all training, testing, and validation images and their ground truth labels (https://github.com/hatch-lab/mnfinder and https://github.com/hatch-lab/fast-mn). Additional validation images and data are available on BioImage Archive. Raw and processed RNA-seq datasets were deposited in the National Center for Biotechnology Information (NCBI) Gene Expression Omnibus (GEO), accession number GSE291811.

    The following datasets were generated:

    DiPeso L, Pendyala S, Huang HZ, Fowler DM, Hatch EM. 2025. Image-based identification and isolation of micronucleated cells to dissect cellular consequences. NCBI Gene Expression Omnibus. GSE291811

    Hatch E. 2025. Image-based identification and isolation of micronucleated cells to dissect cellular consequences. BioImage Archive. S-BIAD1990

    The following previously published datasets were used:

    Santaguida S, Richardson A, Iyer D, M'Saad O, Zasadil L, Knouse K, Wong Y, Rhind N, Desai A, Amon A. 2017. Aneuploidy triggers an immune response. NCBI Gene Expression Omnibus. GSE83647

    He Q, Au B, Kulkarni M, Shen Y, KahJ L, Maimaiti J, Chong HC, Lim EH, Rancati G, Sinha I, Fu Z, Wang X, Crasta KC, ChengKit W, MoniqueNH L, JohnE C. 2019. Genome-wide transcriptional response to random aneuploidy in human cells. NCBI Gene Expression Omnibus. GSE109519


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES