Skip to main content
Briefings in Bioinformatics logoLink to Briefings in Bioinformatics
. 2026 Apr 25;27(2):bbag197. doi: 10.1093/bib/bbag197

ZeCardioAI: combining zebrafish, AI, and xAI for an in-depth cardiac phenotyping platform

Ferran Arqué 1,2,3,#, Carole Jung 4,#, Beatriz Muriel-Moreno 5, Laura López-Blanch 6, Eduardo Saman 7, Davide D’Amico 8, Javier Terriente 9,, Paula Petrone 10,11,, Sylvia Dyballa 12,
PMCID: PMC13112437  PMID: 42041226

Abstract

Artificial intelligence (AI) integrated with high-throughput assays offers a powerful route to accelerate discovery in relevant biological models. Functional cardiac imaging is a prime application, where deep learning (DL) and explainable AI (xAI) can overcome limitations of traditional phenotyping methods, such as manual analysis, subjective interpretation, and low scalability. In cardiovascular research, the zebrafish model is highly valuable due to its translational relevance and accessibility for high-throughput applications. Here, we present ZeCardioAI, a computational platform combining zebrafish experimental advantages with DL and xAI methodologies. The platform automatically extracts comprehensive cardiac phenotypes from live imaging, achieving high precision while maintaining interpretability, critical for mechanistic insight and translational validation. ZeCardioAI, when applied to zebrafish models of dilated and hypertrophic cardiomyopathy (CM), detected subtle yet clinically relevant phenotypic differences. Machine learning classifiers achieved robust separation of disease from healthy phenotypes, and xAI revealed discriminative features aligning with established clinical markers. Our developments should prove valuable in addressing the unmet medical need in CMs to find new, specific treatments. The platform’s modular architecture supports future adaptation to diverse disease contexts beyond CMs, enabling large-scale, fully automated phenotyping at a throughput unattainable by manual approaches. ZeCardioAI establishes a new standard for AI-powered biological research, offering transformative potential for accelerating drug discovery, advancing precision medicine approaches, and deepening fundamental understanding of complex biological systems across multiple therapeutic areas.

Keywords: zebrafish, artificial intelligence, explainable AI, machine learning, cardiomyopathy, phenotyping

Graphical Abstract

Graphical Abstract.

Graphical Abstract

Introduction

Cardiomyopathies (CMs) are diseases of the heart muscle causing structural and functional abnormalities and are a leading global cause of heart failure. Affecting 3.73 million people worldwide, their prevalence rose by 41.5% between 1990 and 2019 [1], imposing a major healthcare and socioeconomic burden. The two most common forms are dilated cardiomyopathy (DCM) and hypertrophic cardiomyopathy (HCM).

DCM is characterized by left ventricle dilation with wall stretching and thinning, resulting in systolic dysfunction due to weakened cardiac contraction [2]. HCM presents excessive left ventricular wall thickening, decreasing chamber volume and impairing diastolic function [3].

CM disease biology is complex due to (i) the complex genetic architecture; over 20 sarcomere and sarcomere-associated genes are known to cause HCM, while DCM involves >40 genes affecting multiple cellular structures, including the sarcomere, cytoskeleton, and nuclear envelope. Even within the same gene, different mutations can lead to distinct disease patterns, and the same mutation can manifest differently among individuals and families. These effects are further shaped by environmental modifiers, making genotype–phenotype relationships difficult to predict [4–7]. (ii) the impact of disease progression; How CMs develop over time can be highly variable, influenced by additional genetic background, environmental modifiers, and other yet-unknown contributors [8]. Suitable biological models and analytical frameworks to enable precision medicine approaches for CM patients are currently missing.

Current therapeutic solutions for CMs are often limited to surgical interventions and palliative heart failure management rather than addressing specific underlying disease mechanisms [9, 10]. Two emerging targeted therapies show promise: Mavacamten, recently approved for HCM treatment [11], and Levosimendan, currently in clinical trials for DCM [12]. However, their full effects and mechanisms are still being elucidated. This therapeutic gap highlights an urgent need for more effective, mechanism-based treatments for CMs.

Zebrafish has emerged as a powerful translational animal model for cardiovascular research and high-throughput drug discovery. Its small size and optical transparency enable sophisticated live microscopy, while high progeny rates support extensive screening [13]. Furthermore, zebrafish share Inline graphic82% genetic homology with humans for orthologous disease-related genes, ensuring biological translatability and clinical relevance [14]. The genome is readily editable via CRISPR/ Cas9 for precise modeling of genetic diseases, and fluorescent transgenic lines specifically label cardiovascular components for detailed in vivo analysis [15]. In vivo imaging of the zebrafish heart has proven particularly useful to address off-target cardiotoxic effects of drugs [16].

Traditional zebrafish cardiac analysis relies on fractional shortening, a 1D proxy for chamber volume variation derived from kymographs along manually drawn axes [16]. While valuable, its subjectivity and limited dimensional scope cannot report subtle functional abnormalities. Artificial intelligence (AI)-assisted image processing offers a means to overcome these limitations, providing objective and scalable quantification of cardiac dynamics.

As such, AI has become a valuable tool in clinical imaging, enabling automated volumetric analyses that extract detailed cardiac metrics from large datasets while reducing experimenter bias [17–19]. However, limited research applies these methodologies to zebrafish, and critical parameters like myocardial strain, an early detector of DCM and HCM in humans [20], remain unmeasured in zebrafish larvae.

AI approaches are valuable in cardiac phenotyping, and to ensure transparency and validate model decisions from a biological perspective, explainable AI (xAI) has become essential [21]. xAI methods like SHAP (SHapley Additive exPlanations) quantify individual feature contributions to predictions, enabling a clearer understanding of model decisions and identifying consistent biological patterns and clinical biomarkers [22].

Despite technological advances, identifying precise CM therapies remains challenging due to limited understanding of disease progression. Current diagnostic approaches (echocardiography, MRI), while providing valuable clinical metrics, may not sufficiently detect early CM stages [23]. This shortcoming, coupled with the complex genetic architecture and variable clinical presentations, necessitates innovative approaches for drug discovery and target identification.

To address these challenges, we developed ZeCardioAI, a fully automated framework for CM phenotyping that supports accurate model characterization, high-throughput drug screening, and target discovery. ZeCardioAI builds on the ZeCardio Screening Platform (ZeCardio1.0) [16], a semi-automated tool for cardiotoxicity prediction from in vivo imaging. While ZeCardio1.0 relied on manual annotation and was limited in scope, ZeCardioAI now enables the computation of key physiological metrics such as AV delay, contraction and relaxation times, and myocardial strain, in addition to standard cardiac parameters. It operates without user interaction and is suitable for both local and cloud-based execution. In addition, it provides automated organization, indexing, and large-scale analysis of experimental datasets, which allows to obtain high-quality datasets for model training.

The aim of this work was to develop ZeCardioAI, an in-depth cardiac phenotyping platform based on zebrafish CM models. The platform integrates fluorescence microscopy for real-time cardiac imaging with deep learning(DL)-based chamber segmentation and border tracking, enabling computation of both conventional and advanced functional parameters, including strain. These quantitative metrics were used to train machine learning (ML) classifiers that distinguish dilated and hypertrophic CM models from wild-type (WT) larvae, while xAI identified the features most relevant to classification. By combining automated imaging, advanced cardiac quantification, ML, and interpretability, ZeCardioAI provides a systematic framework for in vivo CM characterization and high-throughput drug screening, advancing precision medicine in CMs and offering translational potential for other disease areas.

Materials and methods

Zebrafish husbandry and mutant generation

Zebrafish were maintained under standard conditions at 28.5Inline graphic1Inline graphicC with a 14 h light/10 h dark cycle. Embryos were obtained by natural mating and maintained in E3 medium at 28Inline graphicC until 120 hours past fertilization (hpf). Mutant (MUT) lines for nexn (DCM) and mybpc3 (HCM) were generated in the background Tg[myl7:GFP] [24] using CRISPR/Cas9 with two sgRNAs per gene. Stable lines were established by outcrossing and validated via PCR and gel electrophoresis. See Supplementary Materials.

Zebrafish larvae preparation and imaging

At 120 hpf, larvae were treated with PTU and immobilized in tricaine for imaging. High-throughput cardiac imaging was performed using the ZeCardio Screening platform previously described [16], combining a VAST BioImager (Union Biometrica) [25] and Leica DM6B microscope. Larvae were oriented at 144Inline graphic (0Inline graphic being the dorsal position and rotation always towards the left body side), and time-lapse videos of beating hearts were recorded at 100 fps for 20 s using 10Inline graphic magnification. See Supplementary Materials.

Video quality control and preprocessing

Experiment quality control involved checking metadata for completeness and integrity and then subjecting the videos to a quality check using an ML binary classifier. The quality check was performed on a single frame sampled near the end of each video to enable rapid assessment and capture potential late-acquisition artifacts (e.g. a larva drifting out of frame). This classifier was trained via five-fold cross-validation on a balanced dataset of manually selected suitable and unsuitable images (namely out-of-focus or overexposed, cropped, not detected, badly oriented, deformed, corrupted, or pigmented), also with balanced representation across defect types. To ensure generalization, a 10% random, balanced validation set was held out, maintaining a balanced selection of the classes (including subclasses within the defective class), and selected from different batches generated in experiments performed on different days. Evaluated models included RegNetY_400MF, ResNet18, SqueezeNet 1.1, xResNet18, and XResNet34 [26–28].

Videos that passed the quality control were cropped to Inline graphic pixels. A binary heart mask was generated by the Sobel filter [29]. The mask was used to compute the centroid of the heart, which was then served as the cropping reference point. Heart orientation was standardized (anterior to the right and dorsal to the top) by image flipping. Videos were downscaled to 8 bits and saved as uncompressed.avi files. See Table S1 in Supplementary Materials for the number of experiments and videos that passed the quality control.

Annotation and feature extraction for cardiac function analysis

The following metrics were extracted from zebrafish heart videos: beats per minute (BPM), ejection fraction, arrhythmias, contraction and relaxation times, atrioventricular (AV) delay (time between atrial and ventricular beats), chamber diameters and volumes, and ventricle strain (the deformation of the cardiac tissue during contraction and relaxation) and strain rate. Ventricle and atrium semantic segmentation was performed, saving the mask coordinates at each frame. From the signal of the mask areas over time, the BPM was computed via its discrete Fourier transform. The AV delay was computed as the cross-correlation between the atrium and ventricle signals. Chamber diameters were derived from mask diameters, and volumes approximated using the biplane method of disks [30]. End-diastolic and end-systolic points, identified by signal peaks/valleys, enabled extraction of all cardiac metrics except strain.

For strain analysis, since the data consist of 2D fluorescence microscopy projections rather than Doppler or speckle-tracking echocardiography commonly used for strain analysis [30], two new types of strain were defined: boundary segmental strain, and boundary radial strain. The ventricle border was divided into four 5-marker point regions, with markers automatically tracked. Boundary segmental strain was calculated as the Lagrangian strain [30] (lengthening/shortening relative to maximum length) of each 5-point segment. That is, for each segment, its Lagrangian strain is Inline graphic, where Inline graphic is the segment’s length at time Inline graphic and Inline graphic is the reference maximum segment length (end-diastole). For the boundary radial strain, we first defined a fixed center point C in the ventricle. For each point P in one of the aforementioned 5-point segments, we calculate the Lagrangian strain of the segment P-C. The boundary radial strain for that segment is the average of these individual strain values.

Further details and custom Python code to extract the clinical metrics from the zebrafish heart videos is provided in the Supplementary Materials.

Cardiac chamber segmentation

The DeepLabV3 convolutional neural network was employed for automatic segmentation of the zebrafish heart chambers [31]. A training dataset was built of manually annotated key frames extracted from videos capturing different cardiac cycle phases. Three trained scientists independently annotated the dataset, split into three subsets, and a consensus protocol developed with cardiac biology experts was followed to ensure consistency.

To render the segmentation model invariant to rotation, translation, and deformation, data augmentation was performed including random translations in the range of Inline graphic pixels along x–y axes, random rotations within Inline graphic degrees, and random elastic deformations [32]. Transfer learning was applied by initializing DeepLabV3 with weights pretrained on a subset of the COCO train2017 dataset [33], specifically on the 20 categories included in the Pascal VOC dataset [34].

The model was trained for nine epochs with an Adam optimizer, learning rate of Inline graphic, mean squared error loss, and a batch size of four. In total,20% of the data was used for validation, ensuring that all frames from the same video were assigned to the same split to prevent data leakage. Performance was evaluated using Dice similarity coefficient, Hausdorff distance, and balanced average Hausdorff distance [35], with Euclidean distance as the ground distance for Hausdorff metrics.

Border local marker tracking

To compute heart strain, 20 equidistant ventricle border markers (a density selected to balance spatial resolution with annotation efficiency) were automatically tracked across video frames using DeepLabCut (DLC), a DL-based pose estimation software [36] (version 2.3.4). Annotation was performed on videos by two trained scientists by selecting representative frames capturing the full cardiac cycle.

A ResNet-50 model was used as backbone network [26, 37], using default DeepLabCut parameters and training for 100 000 iterations. Performance was evaluated by comparing DLC labels to manual annotations (average pixel distance).

Feature selection and machine learning classification

Univariate analysis and feature selection

Data preprocessing was done in the following order to ensure biological validity and cross-experiment comparability: automated removal of physiologically implausible samples (e.g. negative ejection fraction), normalization of contraction/relaxation times and AV delay by heart rate, batch-wise normalization of all features relative to WT controls to reduce inter-experiment variability, and final z-score standardization, with scaler parameters saved for consistent application to future data.

Feature distributions and independent t-tests identified inconsistencies and non-significant variables (Inline graphic-value >.05). However, subsequent analyses were performed both with and without non-significant variables to assess whether they contributed predictive signal. From feature pairs with a Pearson correlation coefficient > 0.8, one feature was removed.

Machine learning classification

For each gene, ML algorithms (logistic regression, random forest, support vector machine, multi-layer perceptron, XGBoost, AdaBoost, and Gaussian Naive Bayes) were evaluated to classify WT or MUT data derived from the imaging (a list of variables with mean values is provided in Table 1).

Table 1.

Baseline characteristics of heart metrics in dilated and hypertrophic models (nexn and mybpc3), reporting mean Inline graphic standard deviation and independent samples t-test P-values for mutant versus wild-type comparisons, with only global strain metrics shown (except top SHAP strain variables) and bold values indicating features used in each classifier.

Heart metric nexn Inline graphic (Inline graphic) nexn − / − (Inline graphic) P-value mybpc3 Inline graphic (Inline graphic) mybpc3 − / − (Inline graphic) P-value
Atrial beats per minute (1/min) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Ventricular beats per minute (1/min) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Ventricular mean ejection fraction (%) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Ventricular mean contraction time (s) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Ventricular mean relaxation time (s) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Atrial mean contraction time (s) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Atrial mean relaxation time (s) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
AV delay (s) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Atrial mean mask area at end-systole (Inline graphicm) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Ventricular mean mask area at end-systole (Inline graphicm) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Atrial mean mask area at end-diastole (Inline graphicm) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Ventricular mean mask area at end-diastole (Inline graphicm) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Atrium maximum diameter (Inline graphicm) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Ventricle maximum diameter (Inline graphicm) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Atrium minimum diameter (Inline graphicm) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Ventricle minimum diameter (Inline graphicm) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Global mean boundary segmental strain (%) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Global peak boundary segmental strain (%) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Global mean segmental strain rate (%/s) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Global peak boundary segmental strain rate (%/s) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Left-right wall segmental strain delay (s) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Global mean boundary radial strain (%) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Global peak boundary radial strain (%) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Global mean boundary radial strain rate (%/s) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Global peak boundary radial strain rate (%/s) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Left-right wall radial strain delay (s) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Ventricular peak left boundary segmental strain (%) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Ventricular mean bottom-left boundary segmental strain (%) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Ventricular mean left boundary radial strain (%) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Ventricular peak left boundary radial strain rate (%/s) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Ventricular mean bottom-left boundary radial strain (%) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Ventricular mean right boundary radial strain (%) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Ventricular peak right boundary radial strain rate (%/s) Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic

A nested -fold cross-validation, maintaining class distribution, was used for model selection, with inner folds for hyperparameter tuning and outer for evaluation. Performance was assessed via precision, recall, F1 score, and area under the receiver operating characteristic curve (AUROC), averaged across folds.

Explainable artificial intelligence and phenotypic clustering

To ensure interpretability and biological plausibility of the classifiers, we used SHAP, an xAI technique used to quantify feature contributions [22]. The computed SHAP values were visualized using beeswarm plots, highlighting the most influential features for classification. The top five SHAP-ranked features were further used as input for Uniform Manifold Approximation and Projection (UMAP) [38] to visualize clustering in a 2D representation (parameters: 50 neighbors, 0 minimum distance, and Euclidean distance). Abnormal data points (outliers in UMAP, defined as points lying in regions of overlap on the genotype-specific clusters) were manually selected for study. They were compared with baseline ranges using the top 10 SHAP features, and verified with data ranges and ML prediction scores/probabilities. Outlier analysis enabled biological interpretation of genotype–phenotype mismatches, technical artifacts, or natural variability, reinforcing the robustness and interpretability of the classification framework.

Institutional review board statement

The animal study was approved by Generalitat de Catalunya “Direcció General de Polítiques Ambientals i Medi Natural” under Reial Decret 53/2013, d’1 de febrer, referential procedure number 11925, under the title “Evaluación de la función cardíaca en pez cebra.” The study was conducted in accordance with the local legislation and institutional requirements.

Results

The goal of this research was to develop an AI-based phenotyping platform which would enable a deep characterization of zebrafish cardiac disease models, allow the validation of new therapeutic targets, and the assessment of the efficacy of new potential drugs, using HCM and DCM as a case study for platform development and proof of concept. For reference, Table S5 in Supplementary Materials provides an overview of the experimental setup and analysis pipeline of ZeCardioAI, alongside a comparison with recent zebrafish cardiac analysis frameworks.

Nexilin1 and Myosin binding protein C3 zebrafish knock-out models facilitate the study of cardiomyopathies in vivo

To model DCM and HCM, zebrafish transgenic lines carrying nexn and mybpc3 mutations were used, respectively. These lines were selected from a broader set of generated lines based on (i) the strength of their cardiac phenotypes, (ii) their viability and (iii) their suitability for high-throughput analysis (data not shown). NEXILIN encodes an actin binding protein and component of the Z-disk, essential for calcium handling and Z-disk and sarcomere stabilization, thereby ensuring optimal muscle contraction. Dysfunctions in Nexn have recently been strongly linked to DCM [39, 40]. MYBPC3 encodes cardiac myosin-binding protein C, a key sarcomeric regulator, and its mutations, especially truncating variants, are a major cause of HCM by disrupting contractile function and promoting myocardial thickening, thereby impairing cardiac relaxation [41, 42].

Homozygous MUT fish were created and grown to adulthood for nexn and mybpc3. Those MUT fish (and their WT cousins) were incrossed to produce the respective MUT and WT progeny, containing either both +/+ or no − / − functional allele for respective gene. The use of − / − parents ensured a complete loss of function. All zebrafish models were created in the transgenic background of Tg[myl7:GFP] [24], which expresses GFP in cardiomyocytes (Fig. 1A) allowing for in vivo live imaging to assess heart morphology and function. Neither reduced viability nor macroscopic morphological defects were observed in homozygous mutants of nexn and mybpc3 (Fig. 1B). Using the previously developed ZeCardio HTS platform [16], we observed expected alterations, namely increased ventricular size at end systole for nexn, decreased ventricular size at end diastole for mybpc3, and reduced cardiac output in both models. To enable deeper phenotypic characterization, we developed ZeCardioAI as a successor platform, extending the analytical capabilities of the original pipeline.

Figure 1.

A multi-panel figure showing zebrafish larvae. Panel A features an image of a larva with a zoomed-in, green fluorescent inset labeling the atrium and ventricle chambers. Panels B and C show comparative side-view images of zebrafish larvae from the nexn and mybpc3 genetic lines across wild-type and mutant genotypes. All images include directional axes and a 500 µm scale bar

Hypertrophic and dilated CM models and datasets. (A) GFP expression in the zebrafish heart at 5 dpf. The orientation for optimal view of the heart is quasi lateral, at 144Inline graphic (0Inline graphic = dorsal (D)). Note how both of the heart chambers lie approximately in the same imaging plane. (B) Lateral view of dilated (nexn) and (C) Hypertrophic (mybpc3) CM zebrafish knockout models (− / −) and their controls (+/+) at 5dpf in lateral (L) view imaged with the VAST onboard camera. dpf: days post-fertilization, scale bar 500µm.

ZeCardioAI platform: an innovative artificial intelligence framework for in vivo cardiac phenotyping and disease classification

We developed ZeCardioAI, a DL-powered analysis platform to achieve detailed assessment of heart metrics.

Hearts of zebrafish larvae were imaged and the datasets were subjected to ZeCardioAI platform’s first component addressing data quality assessment. A ResNet18 classifier was selected as the best performing model to discard poorly imaged hearts (Fig. S1 in Supplementary Materials). It was trained on 800 manually selected high- and low-quality images (400 each, from which 56 corresponded to out-of-focus, 60 to overexposed, 50 to cropped, 50 to not detected, 50 to badly oriented, 41 to deformed, 50 to corrupted, and 43 to pigmented), and achieved an accuracy of 0.98 Inline graphic 0.02 (validation accuracy: 0.99). This ensured using high-quality datasets for subsequent model training. Tables S2 and S1 in Supplementary Materials show performance metrics for all models tested, and the total amount of zebrafish imaged and how many passed the initial quality check.

Cardiac chamber segmentation allows for refined chronotropic assessment

To enable unbiased and detailed characterization of myocardial contraction dynamics, we developed a DeepLabV3-based segmentation model for automatic tracking of full cardiac chamber contours (Fig. 2B). The model was trained on 494 manually annotated images from 107 videos, ensuring generalization across diverse cardiac phenotypes (see Fig. S2 in Supplementary Materials).

Figure 2.

A multi-panel figure displaying ZeCardioAI’s cardiac analysis in zebrafish. Panel A shows a frame of the zebrafish heart and panel B shows the heart chamber segmentation, while panels D to F illustrate strain marker tracking on the ventricle. Panel C provides a comparative analysis of chamber mask area over time between wild-type and mutant, showing periodic heart contractions. Panel G presents line graphs for radial strain percentages across four heart segments (left, bottom-left, bottom-right, and right), highlighting differences in contractile performance between the nexn genotypes

Cardiac chamber segmentation model and ventricle border marker tracking model. (A) Zebrafish heart at 120 hpf, imaged via brightfield fluorescence microscopy. (B) atrium (top chamber) and ventricle (bottom chamber) outlined as marked in the annotation tool. Note how the ventricle outline does not encompass the bulbus arteriosus (arrowhead). (C) Semantic segmentation of a nexnInline graphic and nexnInline graphic larva heart. A still from a movie shows the heart and segmented chambers and the area change over time of each chamber mask (green, atrium; blue, ventricle). Note the comparable beat rate but reduced area changes in both chambers in the nexnInline graphic larva. (D) 20 equidistant points along the ventricle border were annotated excluding the bulbus arteriosus. (E) Boundary segmental strain was computed for four segments (each made up of five equidistant dots). For each segment, the boundary segmental strain was measured by computing their Lagrangian strain. (F) Boundary radial strain was computed for the same four segments as the segmental strain by calculating the Lagrangian strain for each point to the center point of the chamber (marked as a dot). The individual strains were then averaged to denote the boundary radial strain of each segment. (G) Line plot of the mean boundary radial strain over time (three heartbeats) in representative nexnInline graphic and nexnInline graphic larvae. Note that the boundary radial strain of different segments is different and is reduced about half in the nexnInline graphic larva.

DeepLabV3 was chosen for its ability to capture multiscale contextual information while maintaining high-resolution feature maps, improving boundary precision, a key aspect to obtain smooth transitions across frames. To improve generalization and reduce the need for large amounts of data and long training times, we applied transfer learning and data augmentation. In particular, elastic deformations proved effective in capturing the diversity of cardiac morphologies in zebrafish larvae.

The model achieved strong segmentation accuracy for both chambers, with test set Dice scores of 0.98 Inline graphic 0.003. Hausdorff metrics showed a maximum deviation of 3.5 Inline graphic 0.9 pixels, suggesting only minor localized errors. The balanced average Hausdorff distance further confirmed strong boundary alignment with low variability (0.028 Inline graphic 0.007). Overall, the model produced reliable and smooth segmentations frame-to-frame (Fig. 2C, data for mybpc3 not shown). From chamber masks and their area changes over time, several metrics were extracted: heart rate, chamber diameter and approximate volume at end-systole and end-diastole, AV delay, and percentage of arrhythmic beats and parameters such as ejection fraction and contraction and relaxation times could be derived (Table 1).

For reference, automated chamber segmentation required Inline graphic2 min per heart (2000 frames), whereas manual segmentation of only end-systolic and end-diastolic frames (300 frames) typically required 2.5 h per heart. Measuring was done using NVIDIA Tesla T4 GPU acceleration (CPU-only execution is Inline graphic10Inline graphic slower).

Ventricle border marker tracking allows for assessment of heart strain parameters

To measure heart strain, we used DeepLabCut (DLC) to track 20 equidistant markers along the ventricle border across frames (Fig. 2D). The network, trained on 10 videos with 15 annotated frames each, achieved an accuracy of 1.44 pixels (mean distance between DLC and manual labels).

Two types of strain were defined for this case study: boundary segmental strain and boundary radial strain. Both, nexnInline graphic compared with nexnInline graphic and mybpc3Inline graphic compared with mybpc3Inline graphic showed a reduction in strain parameters (Fig. 2G, data for mybpc3 not shown), consistent with findings in human patients [43, 44] and underlining the value of capturing local contractility impairments in biological models for DCM and HCM.

Automated ventricular border marker tracking required Inline graphic11 s per heart video, encompassing 75 cardiac cycles (with NVIDIA Tesla T4 GPU acceleration, while 10Inline graphic slower on CPU), compared with 8 min for manual tracking of one single cardiac cycle.

Univariate analysis confirms known human phenotypes in zebrafish cardiomyopathy models

To enable classification, we generated two balanced datasets: DCM-nexn (427 nexnInline graphic, 427 nexnInline graphic) and HCM-mybpc3 (186 mybpc3Inline graphic, 186 mybpc3Inline graphic). Data preprocessing involved removing outliers, with thresholds for heart rate (<50 or >350 BPM) and absolute strain (>40%), filtering physiologically implausible values.

For the DCM-nexn dataset, after feature filtering, the dataset retained 15 features, encompassing ejection fraction, contraction times, AV delay, ventricle size, and strain. These variables reflected hallmark DCM phenotypes, including enlarged ventricles, reduced systolic function, and impaired myocardial strain (Fig. 3A and B).

Figure 3.

A multi-panel figure comparing cardiac metrics between wild-type and mutant zebrafish across two genetic models: nexn and mybpc3. Panels A and C display box plots for “Ventricle mask area,” “Ventricle mean ejection fraction,” and “Ventricle peak radial strain,” with statistical significance indicated by asterisks. Panels B and D feature polar plots (bullseye plots) mapping the “wild-type versus mutant mean boundary radial strain” across a 360 degree ventricular circumference, using a color-coded heatmap scale to represent absolute strain values

Univariate analyses and strain profiles for DCM and HCM models. (A) Boxplots of the normalized features: minimum ventricle mask area, ventricular mean ejection fraction (EjF), and ventricular peak radial strain for nexnInline graphic and nexnInline graphic. (B) Polar plot of the mean radial boundary strain for the 20 tracked markers along the ventricle border for nexnInline graphic and nexnInline graphic. (C) Boxplots of the normalized features: maximum ventricle mask area, ventricular mean ejection fraction, and ventricular peak radial strain for mybpc3Inline graphic and mybpc3Inline graphic. (D) Polar plot of the mean radial boundary strain for the 20 tracked markers along the ventricle border for mybpc3Inline graphic and mybpc3Inline graphic. Statistical significance (independent samples t-test): ****Inline graphic-value<.0001. WT: wild-type; MUT: mutant.

For the HCM-mybpc3 dataset, the same preprocessing and filtering pipeline resulted in 23 retained features, including ejection fraction, ventricle relaxation time, AV delay, chamber sizes at end-systole and end-diastole, and strain. Retained features captured known HCM traits such as reduced diastolic ventricular size, lower ejection fraction, and diminished strain, consistent with patient phenotypes (Fig. 3C and D).

Summary statistics for both datasets are presented in Table 1.

Cardiomyopathy classification models separate conditions and explainable artificial intelligence identifies key phenotypic features

We trained and evaluated classifiers to distinguish nexnInline graphic (WT) from nexnInline graphic (DCM) larvae and mybpc3Inline graphic (WT) from mybpc3Inline graphic (HCM) larvae. Below we present a breakdown of model performance, feature contributions, and data visualization techniques used to further explore the classification results.

For DCM-nexn classification, XGBoost was selected as the winning model due to performance (Supplementary Material, Table S3) and interpretability advantage. XGBoost achieved the highest average accuracy (0.7727 Inline graphic 0.0342) and F1-score (0.7620 Inline graphic 0.0349), along with strong and balanced precision and recall, and high mean AUROC across the five splits (0.8404 Inline graphic 0.0394). Train, test, and validation ROC curves are provided in the Supplementary Materials (Fig. S3). XGBoost showed robust performance in distinguishing nexnInline graphic from nexnInline graphic zebrafish, with 85% accuracy for nexnInline graphic and 77% for nexnInline graphic on the testing dataset, and mirroring the results on the validation set, indicating no overfitting. Additionally, its tree-based nature allows for more efficient SHAP analysis [45], an xAI technique to interpret the model’s decisions.

SHAP analysis identified ventricular mask area at end-systole, radial strain at right and bottom-left boundaries, AV delay, and average atrial contraction time as top predictors (Fig. 4B), consistent with DCM phenotypes, validating the animal model and the XGBoost classifier. Consistently, retraining the classifier using only the top 10 SHAP-ranked features yielded performance metrics comparable to those obtained with the full feature set (Supplementary Material, Table S3). UMAP embeddings of those features (Fig. 4A) showed good WT/MUT clustering with only small overlap, which could indicate some phenotypic heterogeneity. Outlier analysis revealed genotype–phenotype mismatches and borderline cases near the decision boundary. Comparison with XGBoost prediction scores confirmed this ambiguity, as borderline cases showed scores near 0.5.

Figure 4.

A multi-panel figure displaying machine learning explainability results for nexn and mybpc3 zebrafish models. Panels A and C show UMAP scatter plots with marginal density plots, visualizing the clustering and separation between wild-type and mutant populations. Panels B and D provide SHAP summary plots, ranking cardiac features such as “Ventricular mask area,” “atrioventricular delay,” and “Ventricular ejection fraction” by their impact on model output. The SHAP plots use a color gradient to represent feature values from low to high

Explainability of the DCM-nexn and HCM-mybpc3 WT versus MUT classifiers. (A) and (C) UMAP embeddings in 2D of the five most influential features identified by SHAP for nexn (Top, XGBoost classifier) and mybpc3 (Bottom, SVM classifier). The density plot of the UMAP points visualizes the distribution, highlighting the densest clusters and illustrating the spread of variability within each class. Density marginal distributions are shown on the corresponding axes. (B) and (D) SHAP beeswarm plots illustrating model decision-making for the 10 most influential features. Positive SHAP values indicate a stronger contribution toward the MUT classification, whereas negative values favor the WT class.

For HCM-mybpc3 classification, a Support Vector Machine (SVM) with radial basis function kernel was chosen as the best model (Supplementary Material, Table S4), with the highest average accuracy (0.773 Inline graphic 0.071), precision (0.794 Inline graphic 0.068), and mean AUROC across the five splits (0.854 Inline graphic 0.044), along with competitive recall (0.732 Inline graphic 0.099). Train, test, and validation ROC curves are provided in the Supplementary Materials (Fig. S4). SVM achieved 91% accuracy for mybpc3Inline graphic and 86% for mybpc3Inline graphic on the testing dataset, with similar validation results, indicating no overfitting or bias. SHAP analysis (Fig. 4D) revealed key features: AV delay, ventricular and atrial mask areas at end-diastole and end-systole, and ventricular ejection fraction, strain, and strain rate, all aligning with HCM phenotypes in patients and reinforcing the biological relevance. Similarly, retraining the classifier using only the top 10 SHAP-ranked features resulted in performance comparable with that obtained with the full feature set (Supplementary Material, Table S4). UMAP embeddings showed clear clusters with limited overlap (Fig. 4C). Outliers again corresponded to phenotype–genotype mismatches or inherent biological variability, supported by SVM decision function scores, with borderline cases showing values near zero. Notably, including non-significant variables did not affect the classification outcome and these did not rank among the top features in the SHAP analysis.

Overall, both classifiers effectively distinguished WT from MUT larvae using biologically relevant cardiac features, supporting their potential for assessing therapeutic effects in zebrafish CM models.

Discussion

We developed ZeCardioAI, a platform for high-throughput imaging and automated phenotypic analysis of in vivo cardiac function in zebrafish larval hearts. ZeCardioAI allows for characterization of zebrafish disease models, representing an important step towards precision medicine and safety understanding, since not only specific genetic mutations can be modeled but also the effect of environmental, dietary modifiers, and potential cardio-toxic substances can be assessed. The platform provides a means to gain mechanistic insight into CM progression and establishes the foundation for future assessment of therapeutic and repurposing candidates. ZeCardioAI extracts clinically relevant cardiac parameters through DL-based chamber segmentation and ventricle border tracking, enabling automatic calculation of inotropic and chronotropic metrics, and ventricular strain. The developed ML classifiers (XGBoost for DCM, SVM for HCM) effectively distinguished WT from MUT larvae and SHAP analysis identified key features driving classification which align with known characteristics in human patients.

In disease biology research and drug discovery, animal model translatability to human is critical. Rather than preselecting features based on expected human phenotypes, we quantified a broad range of cardiac metrics to allow the ML algorithms to objectively identify which parameters are most discriminative for each specific genetic perturbation, acknowledging that while key disease markers are conserved, their relative contribution may differ between zebrafish and humans. We hypothesized that not all extracted features would be affected by the mutations, thus, the feature importance analysis served to detect biologically relevant phenotypes. The DCM-nexn model showed changes in ventricular size, ejection fraction, and strain that parallel clinical findings of systolic dysfunction and ventricular remodeling in DCM patients [46], including those with nexn mutations [47]. Notably, strain is emerging as an early diagnostic tool in patients with preserved ejection fraction but familial predisposition [20, 48] and, in our case, strain metrics were highly relevant, revealing subtle changes. AV delay, crucial for synchronized AV activation and optimal ventricular filling, was also among the top features in the DCM-nexn model identified by SHAP. Prolongation can cause suboptimal filling and diastolic mitral regurgitation and, while not a primary hallmark, is frequently observed in DCM patients due to chamber dilation disrupting conduction pathways, leading to bundle branch block, and higher heart failure risk [49]. Prolonged contraction time was also relevant, consistent with other DCM cellular models [50], and mice [51]. To further support the SHAP analysis, comparable classification performance was retained when retraining using only the top 10 SHAP-ranked features, indicating that SHAP successfully identified a set of biologically relevant parameters sufficient to characterize the nexn CM model.

The HCM-mybpc3 model recapitulated key aspects of human HCM, including altered conductivity, chamber dimensions, and contractility [52]. AV delay, while not a primary symptom, was the most influential feature, potentially due to myocardial disarray and fibrosis disrupting conduction [53]. Following the same approach as with the DCM-nexn model, comparable classification performance was observed when the model was retrained using only the top-10 SHAP-ranked features, further reinforcing the validity SHAP analysis. Prolonged relaxation time, though not among the top-10 features, was significantly increased, consistent with what is observed in HCM patients [54].

Overall, strain and chronotropic features such as AV delay and contraction/relaxation times emerge as sensitive, potentially early indicators. While abnormal strain has been shown to predict potential adverse outcomes [55], further research on AV delay as an early CM marker is needed.

The ability to detect clinically meaningful phenotypes underscores the value of zebrafish in cardiovascular research. Despite intrinsic differences, such as a single-chambered circulatory loop, absence of pulmonary circulation, and the capacity for heart regeneration, the zebrafish is well established for heart failure studies [56]. Its optical transparency, conserved signaling pathways, ease of genetic manipulation to mimic human mutations, similar heart rate, morphology, and function make it a powerful model for cardiac disease research [57].

However, cardiac imaging pipelines remain limited by imaging quality, fish line, and developmental stage. Moreover, to our knowledge, strain analysis has not been reported in larvae and has only recently been explored in adult fish [58, 59]. Our research expands on previous studies by leveraging zebrafish HTS with AI and xAI to improve phenotyping and model interpretability. While recent works also capitalize on the HTS potential of zebrafish [60, 61], ventricular geometry is simplified, and only few pipelines employ DL-based segmentation of both chambers, limiting their sensitivity to complex morphologies [17–19]. Although a direct quantitative comparison is not technically feasible due to substantial differences in input data, our review comparing recent similar works highlights that ZeCardioAI addresses a specific gap: the combination of high-throughput screening with advanced cardiac phenotyping, including strain analysis. Moreover, as emphasized across these studies, manual processing would be unfeasible at this scale. This further stresses the need for automation in cardiac analysis pipelines.

ZeCardioAI addresses these limitations with chamber-specific segmentation, regional strain analysis, and interpretable classification in a scalable, generalizable, and automated pipeline. Its throughput and interpretability make ZeCardioAI well suited for early-stage in vivo drug discovery, where sensitive phenotyping is essential for screening therapeutic candidates. We also note that this study focused on single-gene mutants, nexn and mybpc3, to isolate mutation-specific effects. While human DCM and HCM are genetically complex [4, 62], zebrafish enable future expansion to double mutants or inducible knockouts. The findings for the nexn and mybpc3 models, spanning morphology, strain, and conductivity abnormalities, align and contribute to the existing knowledge of CM pathophysiology, reinforcing the translational relevance of zebrafish models in advancing CV research.

In clinical settings, cardiac imaging often relies on 3D acquisition and established speckle-tracking techniques for strain estimation [20, 30]. In contrast, imaging in this study was limited to 2D, requiring proxy metrics in place of clinical standards. Furthermore, while speckle-tracking is applicable to adult zebrafish using high-frequency transducers to achieve the necessary axial and lateral resolution, the much smaller size of zebrafish larvae approaches the technical resolution limits of current ultrasound technology [63]. While we acknowledge that 2D imaging simplifies the heart’s 3D complexity and may overlook features along the depth axis, this approach represents a necessary trade-off to enable high-throughput screening. Implementing 3D or multi-angle acquisition would allow for even more detailed phenotyping but it would prolong imaging time, typically entail a loss of temporal resolution, and increase data processing load, rendering the platform unsuitable for large-scale cardiac screening. Accordingly, our bespoke boundary segmental strain serves as a 2D proxy for clinically established longitudinal strain, measuring myocardial wall shortening, while boundary radial strain approximates wall thickening and thinning typically assessed in parasternal short-axis echocardiographic views. Although these metrics lack full volumetric context, they successfully capture the contractility defects characteristic of CM, as reflected by our ability to accurately classify disease phenotypes and their high ranking in the explainability analysis. Still, 3D or multi-angle 2D imaging could be employed for lower-throughput additional validation. Moreover, inconsistent fluorescence patterns across the cardiac cycle precluded reliable speckle tracking. To address this, we implemented DL-based marker tracking to follow anatomical landmarks, enabling accurate strain calculation. The DLC network was trained on representative frames from 10 videos (totaling 150 annotated frames). This relatively small dataset size aligns with established DLC benchmarks, which show that transfer learning enables human-level accuracy with as few as 100–200 training frames [36]. Furthermore, unlike freely moving animal behaviors, our videos from zebrafish hearts present a constrained tracking challenge characterized by periodic, smooth motion and high-contrast fluorescent boundaries against a uniform background. These standardized imaging conditions significantly reduce the diversity of visual features the model must learn, allowing for robust generalization even with a small training set. Still, while DLC reports validation error on held-out annotated frames, frame-level validation is limited when the training data are small and temporally correlated. In practice, by qualitative visual inspection, we observed consistent tracking across independent datasets, validating the model’s ability to generalize to data unseen during training.

One key module of ZeCardioAI focused on data quality checks and standardization to obtain well-curated, high-quality datasets. We implemented a DL-based quality control model because manual video assessment is a major bottleneck in high-throughput screening, being time-consuming, repetitive, and prone to operator bias, thereby undermining end-to-end automation. Simpler automated metrics (e.g. intensity thresholds or blur measures) proved insufficient, as they failed to capture complex morphological artifacts such as bad orientation or cardiac deformation. Therefore, a CNN-based approach was chosen to robustly identify diverse quality failures. Nevertheless, some variability is unavoidable. For instance, WT and MUT samples showed partial overlap in UMAP projections. Some of the outliers could be traced to segmentation errors, while others reflected genotype-positive/phenotype-negative or genotype-negative/phenotype-positive cases, consistent with clinical CM observations [64, 65]. Additional sources of overlap likely arose from natural variation and dimensionality reduction constraints.

This variability is not necessarily detrimental. In fact, training on heterogeneous data may still support highly predictive models. In our study, XGBoost achieved strong performance in the DCM-nexn dataset, whereas its lower accuracy in the HCM-mybpc3 model may stem from limited data, as the algorithm typically requires larger datasets to generalize effectively. Expanding the mybpc3 dataset and refining curation will therefore be key to improving performance, particularly for downstream drug screening applications. Moreover, integration with transcriptomic datasets could further strengthen this framework by linking phenotypes to disease-driving pathways and prioritizing druggable targets, advancing mechanistic understanding [66]. Future efforts should also validate candidate compounds and gene targets through pharmacological treatments and CRISPR/Cas9-generated zebrafish mutants, while expanding datasets across additional genetic backgrounds.

Conclusion

ZeCardioAI’s combination of automated cardiac quantification and interpretable classification makes it well suited for HTS and drug discovery. Its modular design also supports adaptation to cardio-toxicity assays and broader cardiovascular research, positioning ZeCardioAI as a scalable tool for zebrafish-based translational studies. We anticipate that the integration of zebrafish and AI will be a transformative approach in the discovery of new therapies for complex diseases, such as CMs.

Finally, in alignment with the principles of open science and to ensure reproducibility, the complete ZeCardioAI framework is publicly available as a modular software suitable for local or cloud-based deployment (see Data availability section).

Key Points.

  • Zebrafish genetic models for dilated and hypertrophic cardiomyopathy (CM) recapitulate clinical phenotypes.

  • Fully automated, high-throughput fluorescent in-vivo imaging of zebrafish hearts allows for generation of large-scale and high-quality datasets for artificial intelligence applications.

  • Specific deep learning models can successfully extract a plethora of metrics aiding the deep phenotypic description of CM disease models.

  • Classifiers trained on heart metrics are able to distinguish healthy from diseased hearts with high fidelity, and explainable artificial intelligence enables the understanding of the parameter prioritization in the classification decision.

  • The analytical capabilities provided in the ZeCardioAI platform can enable drug discovery in CMs and can be easily scaled up or expanded to other cardiovascular indications.

Supplementary Material

bbag197_Supplemental_Files

Acknowledgements

The authors thank Christian Cortés for assistance with code implementation, data review, and manuscript revision, and Irina Suárez for support with experiments and data acquisition. The authors would also like to thank the anonymous reviewers for their valuable suggestions.

Contributor Information

Ferran Arqué, Early Drug Discovery Department, ZeCardio Therapeutics, Carrer de Laureà Miró, 408, 410, 08980 Barcelona, Spain; Technology and Innovation Department, ZeClinics SL, Carrer de Laureà Miró, 408, 410, 08980 Barcelona, Spain; Biomedical Data Science Group, Institut de Salut Global de Barcelona (ISGlobal), Carrer del Doctor Aiguader, 88, 08003 Barcelona, Spain.

Carole Jung, Technology and Innovation Department, ZeClinics SL, Carrer de Laureà Miró, 408, 410, 08980 Barcelona, Spain.

Beatriz Muriel-Moreno, Technology and Innovation Department, ZeClinics SL, Carrer de Laureà Miró, 408, 410, 08980 Barcelona, Spain.

Laura López-Blanch, Technology and Innovation Department, ZeClinics SL, Carrer de Laureà Miró, 408, 410, 08980 Barcelona, Spain.

Eduardo Saman, Technology and Innovation Department, ZeClinics SL, Carrer de Laureà Miró, 408, 410, 08980 Barcelona, Spain.

Davide D’Amico, Technology and Innovation Department, ZeClinics SL, Carrer de Laureà Miró, 408, 410, 08980 Barcelona, Spain.

Javier Terriente, Technology and Innovation Department, ZeClinics SL, Carrer de Laureà Miró, 408, 410, 08980 Barcelona, Spain.

Paula Petrone, Biomedical Data Science Group, Institut de Salut Global de Barcelona (ISGlobal), Carrer del Doctor Aiguader, 88, 08003 Barcelona, Spain; Life Sciences - Digital Health Unit, Barcelona Supercomputing Center (BSC-CNS), Plaça d’Eusebi Güell, 1-3, 08034 Barcelona, Spain.

Sylvia Dyballa, Technology and Innovation Department, ZeClinics SL, Carrer de Laureà Miró, 408, 410, 08980 Barcelona, Spain.

Author contributions

Ferran Arqué (Conceptualization, Methodology, Software, Validation, Writing, Review), Carole Jung (Conceptualization, Validation, Project Administration, Writing, Review), Beatriz Muriel-Moreno (Investigation, Validation), Laura López-Blanch (Investigation, Validation), Eduardo Saman (Methodology, Software, Validation), Davide DAmico (Funding Acquisition), Javier Terriente (Conceptualization, Funding Acquisition, Writing, Review), Paula Petrone (Conceptualization, Supervision, Writing, Review), and Sylvia Dyballa (Conceptualization, Methodology, Software, Validation, Project Administration, Writing, Review)

Conflicts of interest

F.A., C.J., B.M.-M., L.L.-B., E.S., D.D., J.T., and S.D. are affiliated with ZeCardio Therapeutics SL, a pharmaceutical company using zebrafish in cardiovascular disease modeling and drug discovery. F.A., J.T., and S.D. are affiliated with ZeClinics SL, a Contract Research Organization (CRO) using the zebrafish model for disease modeling, target validation, and drug screening.

Funding

This work was supported by the NextGenEU programme “Artificial Intelligence and Other Digital Technologies to Drive Their Integration into Value Chains” (NextGenEU-C005/21-ED 2021/C005/00149851); and the Ministerio de Ciencia e Innovación, Agencia Estatal de Investigación, “Ayudas para Doctorados Industriales” (DIN 2022-012817 to F.A.).

Data availability

The data underlying this article will be shared on reasonable request to the corresponding author. The cardio-deeplab and cardio-deeplabcut Python packages are publicly available, along with the supporting code and classes used to run them and extract cardiac metrics from zebrafish heart videos. A Jupyter notebook demonstrating the extraction of cardiac metrics from an example video is provided for illustrative purposes. All resources are accessible at https://doi.org/10.5281/zenodo.17457010.

References

  • 1. Cheng  S, Han  Y, Jiang  L  et al. National, regional, and global cardiomyopathy burden from 1990 to 2019. Front Cardiovasc Med  2022; 9:1042448. 10.3389/fcvm.2022.1042448 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Heymans  S, Lakdawala  NK, Tschöpe  C  et al. Dilated cardiomyopathy: causes, mechanisms, and current and future treatment approaches. Lancet (London, England)  2023; 402:998–1011. 10.1016/S0140-6736(23)01241-2 [DOI] [PubMed] [Google Scholar]
  • 3. Argirò A, Parikh V, Jurcut R  et al. Hypertrophic cardiomyopathy. Nat Rev Dis Primers 2025;11:58. 10.1038/s41572-025-00643-0 [DOI] [PubMed] [Google Scholar]
  • 4. Lopes  LR, Ho  CY, Elliott  PM. Genetics of hypertrophic cardiomyopathy: established and emerging implications for clinical practice. Eur Heart J  2024; 45:2727–34. 10.1093/eurheartj/ehae421 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Pérez-Sánchez  I, Romero-Puche  AJ, Sáez  EG-M  et al. Factors influencing the phenotypic expression of hypertrophic cardiomyopathy in genetic carriers. Revista Española de Cardiología (English Edition)  2018; 71:146–54. 10.1016/j.rec.2017.06.002 [DOI] [PubMed] [Google Scholar]
  • 6. Kim  MJ, Cha  S, Baek  JS  et al. Genetic heterogeneity of cardiomyopathy and its correlation with patient care. BMC Med Genomics  2023; 16:270. 10.1186/s12920-023-01639-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Jordan  E, Peterson  L, Ai  T  et al. Evidence-based assessment of genes in dilated cardiomyopathy. Circulation  2021; 144:7–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Arbelo  E, Protonotarios  A, Gimeno  JR  et al. 2023 ESC guidelines for the management of cardiomyopathies: developed by the task force on the management of cardiomyopathies of the European Society of Cardiology (ESC). Eur Heart J  2023; 44:3503–626. 10.1093/eurheartj/ehad194 [DOI] [PubMed] [Google Scholar]
  • 9. Sorella  A, Galanti  K, Iezzi  L  et al. Diagnosis and management of dilated cardiomyopathy: a systematic review of clinical practice guidelines and recommendations. Eur Heart J  2025; 11:206–22. 10.1093/ehjqcco/qcae109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Packard  E, deFeria  A, Peshin  S  et al. Contemporary therapies and future directions in the management of hypertrophic cardiomyopathy. Cardiol Ther  2022; 11:491–507. 10.1007/s40119-022-00283-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Chase Cole  J, Benvie  SF, DeLosSantos  M. Mavacamten: a novel agent for hypertrophic cardiomyopathy. Clin Ther  2024; 46:368–73. 10.1016/j.clinthera.2024.02.007 [DOI] [PubMed] [Google Scholar]
  • 12. Beitzke  D, Gremmel  F, Senn  D  et al. Effects of Levosimendan on cardiac function, size and strain in heart failure patients. Int J Cardiovasc Imaging  2021; 37:1063–71. 10.1007/s10554-020-02077-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Elizabeth Patton  E, Zon  LI, Langenau  DM. Zebrafish disease models in drug discovery: from preclinical modelling to clinical trials. Nat Rev Drug Discov  2021; 20:611–28. 10.1038/s41573-021-00210-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Rubbini  D, Cornet  C, Terriente  J  et al. CRISPR meets zebrafish: accelerating the discovery of new therapeutic targets. SLAS Discovery  2020; 25:552–67. 10.1177/2472555220926920 [DOI] [PubMed] [Google Scholar]
  • 15. Pott  A, Rottbauer  W, Just  S. Streamlining drug discovery assays for cardiovascular disease using zebrafish. Expert Opin Drug Discovery  2020; 15:27–37. 10.1080/17460441.2020.1671351 [DOI] [PubMed] [Google Scholar]
  • 16. Dyballa  S, Miñana  R, Rubio-Brotons  M  et al. Comparison of zebrafish larvae and hiPSC cardiomyocytes for predicting drug-induced cardiotoxicity in humans. Toxicol Sci  2019; 171:283–95. 10.1093/toxsci/kfz165 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Akerberg  AA, Burns  CE, Geoffrey Burns  C  et al. Deep learning enables automated volumetric assessments of cardiac function in zebrafish. Dis Model Mech  2019; 12:dmm040188. 10.1242/dmm.040188 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Naderi  AM, Haisong  B, Jingcheng  S  et al. Deep learning-based framework for cardiac function assessment in embryonic zebrafish from heart beating videos. Comput Biol Med  2021; 135:104565. 10.1016/j.compbiomed.2021.104565 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Zhang  B, Pas  KE, Ijaseun  T  et al. Automatic segmentation and cardiac mechanics analysis of evolving zebrafish using deep learning. Front Cardiovasc Med  2021; 8:675291. 10.3389/fcvm.2021.675291 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Smiseth  OA, Rider  O, Cvijic  M  et al. Myocardial strain imaging: theory, current practice, and the future. JACC Cardiovasc Imaging  2025; 18:340–81. 10.1016/j.jcmg.2024.07.011 [DOI] [PubMed] [Google Scholar]
  • 21. Salih  A, Galazzo  IB, Gkontra  P  et al. Explainable artificial intelligence and cardiac imaging: toward more interpretable models. Circ Cardiovasc Imaging  2023; 16:e014519. 10.1161/CIRCIMAGING.122.014519 [DOI] [PubMed] [Google Scholar]
  • 22. Lundberg  SM, Lee  S-I. A unified approach to interpreting model predictions. In: Guyon I et al. (eds.) Proceedings of the 31st International Conference on neural information processing systems, pp. 4768–77. Red Hook, NY, USA: Curran Associates Inc, 2017. [Google Scholar]
  • 23. Ottaviani  A, Mansour  D, Molinari  LV  et al. Revisiting diagnosis and treatment of hypertrophic cardiomyopathy: current practice and novel perspectives. J Clin Med  2023; 12:5710. 10.3390/jcm12175710 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Huang  C-J, Chi-Tang  T, Hsiao  C-D  et al. Germ-line transmission of a myocardium-specific GFP transgene reveals critical regulatory elements in the cardiac myosin light chain 2 promoter of zebrafish. Dev Dyn  2003; 228:30–40. 10.1002/dvdy.10356 [DOI] [PubMed] [Google Scholar]
  • 25. Pardo-Martin  C, Chang  T-Y, Koo  BK  et al. High-throughput in vivo vertebrate screening. Nat Methods  2010; 7:634–6. 10.1038/nmeth.1481 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. He  K, Zhang  X, Ren  S  et al. Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–8. Las Vegas, NV, USA: IEEE, 2016.
  • 27. Iandola  FN, Han  S, Moskewicz  MW  et al. SqueezeNet: AlexNet-Level Accuracy with 50x Fewer Parameters and <0.5MB Model Size. arXiv preprint arXiv:1602.07360. 2016.
  • 28. Radosavovic  I, Kosaraju  RP, Girshick  R  et al. Designing network design spaces. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 10425–33. Seattle, WA, USA: IEEE, 2020. [Google Scholar]
  • 29. Kanopoulos  N, Vasanthavada  N, Baker  RL. Design of an image edge detection filter using the Sobel operator. IEEE J Solid-State Circuits  1988; 23:358–67. 10.1109/4.996 [DOI] [Google Scholar]
  • 30. Lang  RM, Badano  LP, Mor-Avi  V  et al. Recommendations for cardiac chamber quantification by echocardiography in adults: an update from the American Society of Echocardiography and the European Association of Cardiovascular Imaging. Eur Heart J Cardiovasc Imaging  2015; 16:233–71. 10.1093/ehjci/jev014 [DOI] [PubMed] [Google Scholar]
  • 31. Chen  L-C, Papandreou  G, Schroff  F  et al. Rethinking Atrous Convolution for Semantic Image Segmentation. arXiv preprint arXiv:1706.05587. 2017. [Google Scholar]
  • 32. Ronneberger  O, Fischer  P, Brox  T. U-net: convolutional networks for biomedical image segmentation. In: Navab  N, Hornegger  J, Wells  WM, Frangi  AF (eds.), Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, Cham: Springer International Publishing, 2015, 234–41. 10.1007/978-3-319-24574-4_28. [DOI] [Google Scholar]
  • 33. Lin  T-Y, Maire  M, Belongie  S  et al. Microsoft COCO: common objects in context. In: Fleet  D, Pajdla  T, Schiele  B, Tuytelaars  T (eds.), Computer Vision – ECCV 2014, Cham: Springer International Publishing, 2014, 740–55. 10.1007/978-3-319-10602-1_48. [DOI] [Google Scholar]
  • 34. Everingham  M, Van Gool  L, Williams  CKI  et al. The Pascal visual object classes (VOC) challenge. Int J Comput Vis  2010; 88:303–38. 10.1007/s11263-009-0275-4 [DOI] [Google Scholar]
  • 35. Aydin  OU, Taha  AA, Hilbert  A  et al. On the usage of average Hausdorff distance for segmentation performance assessment: hidden error when used for ranking. Eur Radiol Exp  2021; 5:4. 10.1186/s41747-020-00200-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Mathis  A, Mamidanna  P, Cury  KM  et al. DeepLabCut: markerless pose estimation of user-defined body parts with deep learning. Nat Neurosci  2018; 21:1281–9. 10.1038/s41593-018-0209-y [DOI] [PubMed] [Google Scholar]
  • 37. Insafutdinov  E, Pishchulin  L, Andres  B  et al. DeeperCut: a deeper, stronger, and faster multi-person pose estimation model. In: Leibe  B, Matas  J, Sebe  N, Welling  M (eds.), Computer Vision – ECCV 2016, Cham: Springer International Publishing, 2016, 34–50. 10.1007/978-3-319-46466-4_3. [DOI] [Google Scholar]
  • 38. McInnes  L, Healy  J, Saul  N  et al. UMAP: uniform manifold approximation and projection. J open source softw  2018; 3:861. 10.21105/joss.00861 [DOI] [Google Scholar]
  • 39. Noureddine  M, Gehmlich  K. Structural and signaling proteins in the Z-disk and their role in cardiomyopathies. Front Physiol  2023; 14:1143858. 10.3389/fphys.2023.1143858 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Hofeichner  J, Gahr  BM, Huber  M  et al. CRISPR/Cas9-mediated nexilin deficiency interferes with cardiac contractile function in zebrafish in vivo. Sci Rep  2023; 13:22679. 10.1038/s41598-023-50065-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Toepfer  CN, Wakimoto  H, Garfinkel  AC  et al. Hypertrophic cardiomyopathy mutations in MYBPC3 dysregulate myosin. Sci Transl Med  2019; 11:eaat1199. 10.1126/scitranslmed.aat1199 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Muddasir Ashraf  M, Jan  F, Jahangir  A  et al. Genotype-phenotype correlations in hypertrophic cardiomyopathy: insights from an HCM Center of Excellence. Curr Probl Cardiol  2025; 50:102996. 10.1016/j.cpcardiol.2025.102996 [DOI] [PubMed] [Google Scholar]
  • 43. Yaohan  Y, Sisi  Y, Tang  X  et al. Evaluation of left ventricular strain in patients with dilated cardiomyopathy. J Int Med Res  2017; 45:2092–100. 10.1177/0300060517712164 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Qiao  J, Zhao  P, Jianyao  L  et al. Diastolic dysfunction assessed by cardiac magnetic resonance imaging tissue tracking on normal-thickness wall segments in hypertrophic cardiomyopathy. BMC Med Imaging  2023; 23:7. 10.1186/s12880-022-00955-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Lundberg  SM, Erion  G, Chen  H  et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell  2020; 2:56–67. 10.1038/s42256-019-0138-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Reichart  D, Magnussen  C, Zeller  T  et al. Dilated cardiomyopathy: from epidemiologic to genetic phenotypes: a translational review of current literature. J Intern Med  2019; 286:362–72. 10.1111/joim.12944 [DOI] [PubMed] [Google Scholar]
  • 47. Hassel  D, Dahme  T, Erdmann  J  et al. Nexilin mutations destabilize cardiac Z-disks and lead to dilated cardiomyopathy. Nat Med  2009; 15:1281–8. 10.1038/nm.2037 [DOI] [PubMed] [Google Scholar]
  • 48. Chevalier  C, Kremer  K, Cavus  E  et al. CMR feature tracking in patients with dilated cardiomyopathy: patterns of myocardial strain and focal fibrosis. Open Heart  2022; 9:e002013. 10.1136/openhrt-2022-002013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Mages  C, Gampp  H, Syren  P  et al. Electrical ventricular remodeling in dilated cardiomyopathy. Cells  2021; 10:2767. 10.3390/cells10102767 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Dai  Y, Amenov  A, Ignatyeva  N  et al. Troponin destabilization impairs sarcomere-cytoskeleton interactions in iPSC-derived cardiomyocytes from dilated cardiomyopathy patients. Sci Rep  2020; 10:209. 10.1038/s41598-019-56597-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Ramratnam  M, Salama  G, Sharma  RK  et al. Gene-targeted mice with the human troponin T R141W mutation develop dilated cardiomyopathy with calcium desensitization. PLoS One  2016; 11:e0167681. 10.1371/journal.pone.0167681 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Wijnker  PJM, Sequeira  V, Kuster  DWD  et al. Hypertrophic cardiomyopathy: a vicious cycle triggered by sarcomere mutations and secondary disease hits. Antioxid Redox Signal  2019; 31:318–58. 10.1089/ars.2017.7236 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Ashkir  Z, Samat  AHA, Ariga  R  et al. Myocardial disarray and fibrosis across hypertrophic cardiomyopathy stages associate with ECG markers of arrhythmic risk. Eur Heart J Cardiovasc Imaging  2025; 26:218–28. 10.1093/ehjci/jeae260 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Alizadehasl  A, Sadeghpour  A, Akiash  N. Isovolumic relaxation time as an indicator of diastolic dysfunction in hypertrophic cardiomyopathy. Iran Heart J  2018; 19:44–9. [Google Scholar]
  • 55. Chadalavada  S, Fung  K, Rauseo  E  et al. Myocardial strain measured by cardiac magnetic resonance predicts cardiovascular morbidity and death. J Am Coll Cardiol  2024; 84:648–59. 10.1016/j.jacc.2024.05.050 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. González-Rosa  JM. Zebrafish models of cardiac disease: from fortuitous mutants to precision medicine. Circ Res  2022; 130:1803–26. 10.1161/CIRCRESAHA.122.320396 [DOI] [PubMed] [Google Scholar]
  • 57. Giardoglou  P, Beis  D. On zebrafish disease models and matters of the heart. Biomedicines  2019; 7:15. 10.3390/biomedicines7010015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Huang  H, Machikhin  A, Chen  D-Q  et al. High-resolution tissue doppler and strain imaging for adult zebrafish myocardial tissue through ultrafast high-frequency ultrasound vector doppler estimation. IEEE Trans Ultrason Ferroelectr Freq Control  2024; 71:1006–18. 10.1109/TUFFC.2024.3414856 [DOI] [PubMed] [Google Scholar]
  • 59. Weeks  O, Gao  X, Basu  S  et al. Embryonic alcohol exposure in zebrafish predisposes adults to cardiomyopathy and diastolic dysfunction. Cardiovasc Res  2024; 120:1607–21. 10.1093/cvr/cvae139 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Duan  J, Lin  G, Jiao  K  et al. Fully automated in vivo screening system for multi-organ imaging and pharmaceutical evaluation. Microsyst Nanoeng  2025; 11:22. 10.1038/s41378-024-00852-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Suryanto  ME, Saputra  F, Kurnia  KA  et al. Using DeepLabCut as a real-time and Markerless tool for cardiac physiology assessment in zebrafish. Biology  2022; 11:1243. 10.3390/biology11081243 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Gigli  M, Stolfo  D, Merlo  M  et al. Pathophysiology of dilated cardiomyopathy: from mechanisms to precision medicine. Nat Rev Cardiol  2025; 22:183–98. 10.1038/s41569-024-01074-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Hein  SJ, Lehmann  LH, Kossack  M  et al. Advanced echocardiography in adult zebrafish reveals delayed recovery of heart function after myocardial cryoinjury. PLoS One  2015; 10:e0122665. 10.1371/journal.pone.0122665 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Paldino  A, Rossi  M, Dal Ferro  M  et al. Sport and exercise in genotype positive (+) phenotype negative (−) individuals: current dilemmas and future perspectives. Eur J Prev Cardiol  2023; 30:871–83. 10.1093/eurjpc/zwad079 [DOI] [PubMed] [Google Scholar]
  • 65. Schoonvelde  SAC, Nollet  EE, Zwetsloot  P-P  et al. Genotype-negative hypertrophic cardiomyopathy: exploring the role of cardiovascular risk factors in disease expression. Int J Cardiol  2025; 437:133444. 10.1016/j.ijcard.2025.133444 [DOI] [PubMed] [Google Scholar]
  • 66. Minhas  R, Loeffler-Wirth  H, Siddiqui  YH  et al. Transcriptome profile of the sinoatrial ring reveals conserved and novel genetic programs of the zebrafish pacemaker. BMC Genomics  2021; 22:715. 10.1186/s12864-021-08016-z [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

bbag197_Supplemental_Files

Data Availability Statement

The data underlying this article will be shared on reasonable request to the corresponding author. The cardio-deeplab and cardio-deeplabcut Python packages are publicly available, along with the supporting code and classes used to run them and extract cardiac metrics from zebrafish heart videos. A Jupyter notebook demonstrating the extraction of cardiac metrics from an example video is provided for illustrative purposes. All resources are accessible at https://doi.org/10.5281/zenodo.17457010.


Articles from Briefings in Bioinformatics are provided here courtesy of Oxford University Press

RESOURCES