Abstract
The use of digital image analysis and count regression models contributes to the reproducibility and rigor of histological studies in cardiovascular research. The use of formalized computer-based quantification strategies of histological images essentially removes potential researcher bias, allows for higher analysis throughput, and enables easy sharing of formalized quantification tools, contributing to research transparency, and data transferability. Moreover, the use of count regression models rather than ratios in statistical analysis of cell population data incorporates the extent of sampling into analysis and acknowledges the non-Gaussian nature of count distributions. Using quantification of proliferating cardiomyocytes in embryonic murine hearts as an example, we describe how these improvements can be implemented using open-source artificial intelligence-based image analysis tools and novel count regression models to efficiently analyze real-life data.
Keywords: artificial intelligence, count regression, image analysis, quantitative microscopy, statistical analysis
INTRODUCTION
Retrieval and analysis of quantitative readouts from histological images have become an increasingly important component of cardiovascular research. Quantitative histology heavily relies on computer-aided image analysis, resulting in increased transparency and reproducibility of generated data due to machine-based formalized quantification and increased throughput (1–3). Identification and quantification of specific cardiac cell populations according to their developmental origins (4), activated signaling pathways (5), or cellular processes (6) is achieved using quantitative analysis of histological images. In most cases, the scientific readout is not absolute cell counts but the relative frequency of studied cells, obtained by dividing the number of cells of interest by the total cell count. For example, the rate of 5-ethynyl-2′-deoxyuridine (EdU) positive cardiomyocytes/total number of cardiomyocytes is used to assess cardiomyocyte proliferation in embryonic hearts (6). The statistical analysis of those data is directed to determine how experimental factor(s) and a combination thereof affect these processes. The rigor of such analyses depends on the magnitude of the denominator, in other words, how many cells are screened in total. Consider 20% EdU+ cardiomyocytes rates obtained from three measurements: 2/10, 20/100, and 200/1,000. Although the rate is the same in all three measurements, one would consider 200/1,000 as the most robust estimate. However, the magnitude of denominator is often left for a scientist to choose arbitrarily and is not included in the statistical analysis framework, as data are transformed into ratio variables by calculating the rate; obtained rates are then analyzed by t test-based statistical methods or their nonparametric counterparts. We propose to improve quality of such analyses by quantifying the whole population of identifiable cells per section/biological replicate using formalized machine learning-based image analysis algorithms, and analyzing nontransformed data using statistical methods tailored explicitly for count and rate data. The ultimate goal of obtaining rates of proliferating cardiomyocytes can be divided into two primary tasks: 1) identifying all cells and 2) classifying them into proliferating EdU+ cardiomyocytes, EdU− cardiomyocytes, and the rest of the cardiac cells. Below we provide a concise description of such improved pipeline, using quantification of EdU+ positive cardiomyocytes in embryonic murine hearts as an example.
DATASET DESCRIPTION
Previously analyzed and published data set (7) consists of control and Prdm16 cardiomyocyte-specific knockout animal (cKO) heart sections collected at embryonic days E13.5 and E15.5. Edu was administered to pregnant females 2 h before embryo harvesting, and incorporation was detected via Alexa Fluor 647 azide coupling using commercially available click chemistry-based assay, followed by NKX2-5 and DNA staining. Prepared slides were imaged using an Olympus FluoView FV1000 confocal microscope.
IMAGE QUANTIFICATION
Image Segmentation
Nuclear EdU signal is considered a marker of proliferation in embryonic cardiomyocytes, and the positive NKX2-5 signal is used to identify cardiomyocytes. As those markers are detected in nucleus, we choose nuclei as objects of interest, for instance, segmentation, which in embryonic hearts corresponds to cardiomyocyte number because of developmental mononuclearity. Technically, identification of nuclei involves instance segmentation, a process to delineate distinct objects of interest in an image. In our work, we have achieved the best results in segmenting densely packed DAPI-stained nuclei in embryonic heart sections using the deep learning-based Stardist algorithm (8) on ZeroCostDL4Mic, a freely available platform for image analysis based on Google Drive and Google Colab (9, 10). DAPI channel images and pretrained Stardist model (11) were uploaded via Google Drive and processed with Stardist (2-D) yupyter notebook maintained at ZeroCostDL4Mic. However, segmentation using pretrained model was not satisfactory, with a significant portion of nuclei missing an assigned mask. Therefore, misclassified regions and corresponding masks were cropped in Fiji (12) and manually adjusted using LabKit (13). Manually curated image-mask pairs were used as training material to finetune the initial classification model (Fig. 1A). Once a reasonable level of nuclei segmentation was achieved, the DAPI images of the whole data set were automatically segmented.
Feature Extraction
The classification task can be subdivided into two subtasks: feature extraction and classification itself. Nuclei borders identified with Stardist were used as frames to measure different parameters of DAPI, EdU, and NKX2-5 signals. This subtask is achieved using CellProfiler, a free, open-source software for measuring and analyzing cell images (14). CellProfiler converts visual information from the images of single nuclei into a row of different extracted features for each identified nucleus (Fig. 1B). Set of object intensity, intensity distribution, and texture parameters was measured for Edu, DAPI, and NKX2.5 channel, as well as shape and size parameters of the mask. Although simple classification tasks can be performed in CellProfiler, we found that more complex, multifeature, machine learning-based classification models are required to achieve satisfactory classification accuracy. Therefore, the classification of nuclei into those from EdU+ cardiomyocytes, EdU− cardiomyocytes, and noncardiomyocytes was performed in CellProfiler-Analyst (15) using extracted feature database generated by CellProfiler. CellProfiler-Analyst provides object tiles, and snapshots of single nuclei from their original images for the researcher to manually sort into classification bins to form an annotated set. Based on these provided examples, the software builds a classifier using CellProfiler extracted features and one of the available machine learning algorithms. The researcher can provide more object examples and vary the machine learning model and the number of features used for classification to achieve optimal classification accuracy, as assessed by evaluating the obtained confusion matrix (Fig. 1B). To classify nuclei from our studied data set, we trained a random forest classifier by manually assigning more than 70 nuclei to each class. Once classifier achieved >95% recall and precision for each class the whole data set was scored. Noteworthy, Edu and NKX2-5 signal texture and not the intensity parameters were the key features in classifier distinguishing between EdU+ cardiomyocytes, EdU− cardiomyocytes, and noncardiomyocytes. The training data set and the classifier were saved and exported for data reproducibility and transparency purposes.
STATISTICAL ANALYSIS
The numerical outcome of such analysis is the count data: numbers of EdU+ cardiomyocytes, EdU− cardiomyocytes, and noncardiomyocytes. The statistical analysis of such data aims to evaluate how and if experimental factors and their combinations affect these counts. This can be achieved by performing regression analysis for count data. Poisson regression serves as a classic tool to model and assess the association between the count response variable and explanatory variables. In Poisson regression, an occurrence count recorded for a particular measurement window is modeled. If counts are recorded at different measurement windows, an adjustment called an offset must be introduced in the regression equation, and thus the rate or the count per measurement unit is modeled. In the presented example, number of proliferating cardiomyocytes is offset by a total number of all cardiomyocytes counted. However, applicability of Poisson regression is limited because of the assumption of data equidispersion, rooted in the nature of the Poisson distribution, which is often violated in real count data that express variance higher (overdispersion) or lower (underdispersion) than a mean. Mean-parametrized Conway-Maxwell Poisson (COM-Poisson) regression does not require the equidispersion assumption and can be used to efficiently model data with under- or overdispersion (16, 17), whereas negative binomial regression can be used for overdispersed data (18). The studied data set consists of nuclei numbers quantified from 53 slides, prepared from the heart sections of 25 embryos (EmbryoID), harvested at two embryonic stages (E13.5 and E15.5), and belonging to control or knockout groups (Fig. 2A). In this analysis, we did not sum nuclei counts from different microscopic slides from a single sectioned heart, therefore, to avoid pseudoreplication (19, 20) and honor the hierarchical nature of this study design, we treated embryo as a random factor (EmbryoID), whereas embryonic stage and genotype were analyzed as fixed experimental factors. This statistical analysis aims to assess whether genotype, embryonic stage, and their combination affect counts of proliferating CMs. To answer this question, we modeled rates of EdU+ cardiomyocytes (positive) as a linear combination of experimental (stage, group) and random (1 | EmbryoID) factors and the offset {offset[log(offset)]} parameter (Fig. 2B). Offset variable is total number of counted cardiomyocytes. We used glmmTMB package in R to perform COM-Poisson regression (21). Calling function summary on Regression object outputs table with information on how experimental factors affect EdU+ cardiomyocyte counts. When compared with reference genotype category control, cKO animals have 23% [exp(−0.25713) = 0.77 times, or 23%] statistically significantly (P value = 1.32e-06) lower EdU+ cardiomyocyte counts. Analogically, compared with E13.5, proliferation rates increased by 15% (exp(0.13947)) at E15.5 and the interaction of both factors was significant (P value = 0.00557). Although we obtained statistically significant effect of experimental factors on rates of proliferating CMs, it is necessary to assess that observed differences are coming from the model that fits well-studied data set. The quality of regression model was studied using DHAMRa package in R (22). DHARMa simulates new values from the fitted model for each observation and creates readily interpretable standardized residuals, that can be intuitively interpreted as residuals from linear regression, which allow us to evaluate how well-generated model fits the studied data set (Fig. 2C). Residuals are simulated by calling function to simulate residuals for generated regression model Regression and statistically assessed by plotting the output of this function. The most common issue with the count regression models is over/underdispersion, which means that residual variance is larger or smaller than expected under used model. Overdispersion causes inflation of type I error, whereas underdispersion leads to loss of test power. The unaccounted over/underdispersion would cause deviation of DHARMa residuals. More formally over/underdispersion could be checked in DHARMa package by calling test dispersion function on simulated residuals, which would statistically compare the dispersion of simulated residuals to the observed residuals. Quantile-quantile plot and corresponding statistical tests performed on generated COM-Poisson regression model indicate that residuals are normally distributed, and the residuals calculated for individual genotypes are normally distributed and have comparable variance. Taken together, model quality assessment indicates that obtained regression model accurately fits studied data set and provides robust statistical estimates. Alternatively to DHARMa, model diagnostics could be performed using performance or easystats collection of packages (23, 24).
Post hoc comparisons for data analyzed using count regression can be performed by assessing estimated marginal means and calculating contrasts. Several packages, including marginaleffects and modelbased from easystats (24) bundle or emmeans (25) can be used to perform post hoc analysis on count regression models. We have chosen emmeans, as it provides a more straightforward and intuitive application programming interface than easystats, especially for specifying contrasts. Post hoc analysis wastargeted toward comparing the differences between genotypes (Group) within individual embryonic stages (Stage), by calling function emmeans, for regression object regression, and specifying the contrast of interest (poly ∼ group | stage). The function output provided estimated marginal means and confidence intervals for each experimental group as well as contrasts. Obtained results indicate that at both studied developmental stages knockout mice have lower CM proliferation rates, by 23% [1 − exp(−0.257)] at E13.5 and by 40% at E15.5 [1 − exp(−0.508)] when compared with control. Finally, obtained results were plotted using ggplot2 (26) in R (Fig. 2E). Obtained results are in excellent agreement with published manual analysis of studied data set (7).
DISCUSSION
Here we present a pipeline for automated quantification and statistical assessment of differences in rates of proliferating cardiomyocytes in embryonic murine hearts. To improve reproducibility and rigor of such studies, we propose two interconnected improvements. First, we use computer-aided quantification of proliferating cardiomyocytes. Computer rather than human-lead quantification of proliferating cardiomyocytes offers several advantages. Use of formalized methods allows researchers to export, save, and share algorithms and parameters used in every step of the pipeline, thus making analysis process transparent, transferable, and reproducible. Moreover, once parameters and algorithms are optimized, complete images/sections can be quantified in an unbiased way, rather than performing manual quantification in arbitrarily selected area or number of nuclei. Second, we use statistical methods tailored for the count and rate data. It has been reported that count and rate data should not be analyzed after transformation, but instead models based on Poisson distribution should be used (27–30). This approach not only acknowledges non-Gaussian distribution of count data but as well takes into consideration of denominator in rate data. However, often distribution of real-life data does not meet core assumption of equidispersion required for Poisson regression. To overcome this limitation, we use a COM-Poisson regression that can model data with both under- and over-dispersion (16). Alternatively, negative binomial regression can be used to model overdispersed data (18). Researchers can perform count regressions using alternative model structures and distributions implemented in glmmTMB and choose the best model for studied data set based on outcome of aforementioned diagnostic tests and complimentary model comparisons implemented in performance package from easystats bundle. Not only proliferation but also other various cellular processes or signaling pathways can be visualized by nuclear fluorescence. Moreover, nuclear signal can facilitate segmentation of cells (31). Therefore, our pipeline can be easily customized and adapted for quantifying other various experimental readouts.
GRANTS
J. Chen is funded by National Heart, Lung, and Blood Institute Grants HL155826, HL157119, and HL153032. Microscopy work was performed at the University of California–San Diego Neuroscience Microscopy Shared Facility and was supported by the National Institute of Neurological Disorders and Stroke Grant P30NS047101.
DISCLOSURES
J.C. holds an American Heart Association Endowed Chair in Cardiovascular Research. J.B. is a consultant for Rocket Pharmaceuticals on image analysis. None of the other authors has any conflicts of interest, financial or otherwise, to disclose.
AUTHOR CONTRIBUTIONS
J.B., Z.Z., and J.C. conceived and designed research; J.B., Z.Z., and T.W. performed experiments; J.B., T.W., and J.C. analyzed data; J.B., T.W., and J.C. interpreted results of experiments; J.B. prepared figures; J.B. drafted manuscript; J.B., Z.Z., T.W., and J.C. edited and revised manuscript; J.B., Z.Z., T.W., and J.C. approved final version of manuscript.
ACKNOWLEDGMENTS
We thank Li Wang for critical reading of this manuscript.
REFERENCES
- 1. Büllow RD, Marsh JN, Swamidass SJ, Gaut JP, Boor P. The potential of artificial intelligence-based applications in kidney pathology. Curr Opin Nephrol Hypertens 31: 251–257, 2022. doi: 10.1097/MNH.0000000000000784. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Daunoravicius D, Besusparis J, Zurauskas E, Laurinaviciene A, Bironaite D, Pankuweit S, Plancoulaine B, Herlin P, Bogomolovas J, Grabauskiene V, Laurinavicius A. Quantification of myocardial fibrosis by digital image analysis and interactive stereology. Diagn Pathol 9: 114, 2014. doi: 10.1186/1746-1596-9-114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Niazi MKK, Parwani AV, Gurcan MN. Digital pathology and artificial intelligence. Lancet Oncol 20: e253–e261, 2019. doi: 10.1016/S1470-2045(19)30154-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Moore-Morris T, Cattaneo P, Guimarães-Camboa N, Bogomolovas J, Cedenilla M, Banerjee I, Ricote M, Kisseleva T, Zhang L, Gu Y, Dalton ND, Peterson KL, Chen J, Pucéat M, Evans SM. Infarct fibroblasts do not derive from bone marrow lineages. Circ Res 122: 583–590, 2018. doi: 10.1161/CIRCRESAHA.117.311490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Bogomolovas J, Feng W, Yu MD, Huang S, Zhang L, Trexler C, Gu Y, Spinozzi S, Chen J. Atypical ALPK2 kinase is not essential for cardiac development and function. Am J Physiol Heart Circ Physiol 318: H1509–H1515, 2020. doi: 10.1152/ajpheart.00249.2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Boogerd CJ, Zhu X, Aneas I, Sakabe N, Zhang L, Sobreira DR, Montefiori L, Bogomolovas J, Joslin AC, Zhou B, Chen J, Nobrega MA, Evans SM. Tbx20 is required in mid-gestation cardiomyocytes and plays a central role in atrial development. Circ Res 123: 428–442, 2018. doi: 10.1161/CIRCRESAHA.118.311339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Wu T, Liang Z, Zhang Z, Liu C, Zhang L, Gu Y, Peterson KL, Evans SM, Fu XD, Chen J. PRDM16 is a compact myocardium-enriched transcription factor required to maintain compact myocardial cardiomyocyte identity in left ventricle. Circulation 145: 586–602, 2022. doi: 10.1161/CIRCULATIONAHA.121.056666. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Weigert M, Schmidt U, Haase R, Sugawara K, Myers EW. Star-convex polyhedra for 3D object detection and segmentation in microscopy. In: Proceedings of the 2020 IEEE Winter Conference on Applications of Computer Vision (WACV), Snowmass, CO, 20, p. 3655–3662. [Google Scholar]
- 9. von Chamier L, Laine RF, Jukkala J, Spahn C, Krentzel D, Nehme E, Lerche M, Hernández-Pérez S, Mattila PK, Karinou E, Holden S, Solak AC, Krull A, Buchholz TO, Jones ML, Royer LA, Leterrier C, Shechtman Y, Jug F, Heilemann M, Jacquemet G, Henriques R. Democratising deep learning for microscopy with ZeroCostDL4Mic. Nat Commun 12: 2276, 2021. doi: 10.1038/s41467-021-22518-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Gómez-de-Mariscal E, García-López-de-Haro C, Ouyang W, Donati L, Lundberg E, Unser M, Muñoz-Barrutia A, Sage D. DeepImageJ: a user-friendly environment to run deep learning models in ImageJ. Nat Methods 18: 1192–1195, 2021. doi: 10.1038/s41592-021-01262-9. [DOI] [PubMed] [Google Scholar]
- 11. Schmidt U, Weigert M. StarDist Fluorescence Nuclei Segmentation (Online). Zenodo, 2022. doi: 10.5281/zenodo.6348085. [last accessed, 11 March 2022]. [DOI] [Google Scholar]
- 12. Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez JY, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A. Fiji: an open-source platform for biological-image analysis. Nat Methods 9: 676–682, 2012. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Arzt M, Deschamps J, Schmied C, Pietzsch T, Schmidt D, Tomancak P, Haase R, Jug F. LABKIT: labeling and segmentation toolkit for big image data. Front Comput Sci 4: 777728, 2022. doi: 10.3389/fcomp.2022.777728. [DOI] [Google Scholar]
- 14. Stirling DR, Swain-Bowden MJ, Lucas AM, Carpenter AE, Cimini BA, Goodman A. CellProfiler 4: improvements in speed, utility and usability. BMC Bioinformatics 22: 433, 2021. doi: 10.1186/s12859-021-04344-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Dao D, Fraser AN, Hung J, Ljosa V, Singh S, Carpenter AE. CellProfiler Analyst: interactive data exploration, analysis and classification of large biological image sets. Bioinformatics 32: 3210–3212, 2016. doi: 10.1093/bioinformatics/btw390. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Huang A. Mean-parametrized Conway–Maxwell–Poisson regression models for dispersed counts. Stat Modelling 17: 359–380, 2017. doi: 10.1177/1471082X17697749. [DOI] [Google Scholar]
- 17. Shmueli G, Minka TP, Kadane JB, Borle S, Boatwright P. A useful distribution for fitting discrete data: revival of the Conway–Maxwell–Poisson distribution. J R Stat Soc Ser C Appl Stat 54: 127–142, 2005. doi: 10.1111/j.1467-9876.2005.00474.x. [DOI] [Google Scholar]
- 18. Johnson TR, Hilbe JM. Negative binomial regression. Psychometrika 77: 611–612, 2012. doi: 10.1007/s11336-012-9263-7. [DOI] [Google Scholar]
- 19. Eisner DA. Pseudoreplication in physiology: more means less. J Gen Physiol 153: e202012826, 2021. doi: 10.1085/jgp.202012826. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Krzywinski M, Altman N, Blainey P. Points of significance: nested designs. For studies with hierarchical noise sources, use a nested analysis of variance approach. Nat Methods 11: 977–978, 2014. [Erratum in Nat Methods 17: 1060, 2020]. doi: 10.1038/nmeth.3137. [DOI] [PubMed] [Google Scholar]
- 21. Brooks ME, Kristensen K, van Benthem KJ, Magnusson A, Berg CW, Nielsen A, Skaug HJ, Mächler M, Bolker BM. glmmTMB balances speed and flexibility among packages for zero-inflated generalized linear mixed modeling. The R Journal 9: 378–400, 2017. doi: 10.32614/RJ-2017-066. [DOI] [Google Scholar]
- 22. Hartig F. DHARMa : Residual Diagnostics for Hierarchical (Multi-Level/Mixed) Regression Models R package version 0.4.6., University of Regensburg, 2022. http://florianhartig.github.io/DHARMa/ [last accessed, 31 August 2022]. [Google Scholar]
- 23. Daniel L, Mattan SB-S, Indrajeet P, Philip W, Dominique M. performance: An R Package for Assessment, Comparison and Testing of Statistical Models. J Open Source Softw 6: 3139, 2021. doi: 10.21105/joss.03139. [DOI] [Google Scholar]
- 24. Daniel L, Mattan SB-S, Indrajeet P, Brenton MW, Dominique M. easystats: Framework for Easy Statistical Modeling, Visualization, and Reporting (Online). CRAN, 2022. https://easystats.github.io/easystats/ [last accessed, 31 August 2022]. [Google Scholar]
- 25. Russell VL. emmeans: Estimated Marginal Means, aka Least-Squares Means (Online). https://github.com/rvlenth/emmeans.
- 26. Hadley W. ggplot2: Elegant Graphics for Data Analysis. New York: Springer-Verlag, 2016. [Google Scholar]
- 27. O'Hara R, Kotze J. Do not log-transform count data. Methods Ecol Evol 1: 118–122, 2010. doi: 10.1111/j.2041-210X.2010.00021.x. [DOI] [Google Scholar]
- 28. St-Pierre AP, Shikon V, Schneider DC. Count data in biology-Data transformation or model reformation? Ecol Evol 8: 3077–3085, 2018. doi: 10.1002/ece3.3807. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Gardner W, Mulvey EP, Shaw EC. Regression analyses of counts and rates: Poisson, overdispersed Poisson, and negative binomial models. Psychol Bull 118: 392–404, 1995. doi: 10.1037/0033-2909.118.3.392. [DOI] [PubMed] [Google Scholar]
- 30. Frome EL. The analysis of rates using Poisson regression models. Biometrics 39: 665–674, 1983. doi: 10.2307/2531094. [DOI] [PubMed] [Google Scholar]
- 31. Stringer C, Wang T, Michaelos M, Pachitariu M. Cellpose: a generalist algorithm for cellular segmentation. Nat Methods 18: 100–106, 2021. doi: 10.1038/s41592-020-01018-x. [DOI] [PubMed] [Google Scholar]