Skip to main content
NPJ Digital Medicine logoLink to NPJ Digital Medicine
. 2026 Feb 19;9:263. doi: 10.1038/s41746-026-02442-7

Robust and interpretable unit level causal inference in neural networks for pediatric myopia

Zihui Jin 1,#, Mengtian Kang 2,3,4,#, Wuyan Zhao 1, Wenjin Gui 1, He Li 5, Yongfang Tu 5, Yongjun Huo 5, Canqing Yu 6,7, Weihua Song 8,9,, Ningli Wang 2,3,4,, Xu Yang 1,, Shi-Ming Li 2,3,4,
PMCID: PMC13032011  PMID: 41714699

Abstract

Understanding causal mechanisms in deep learning is essential for clinical adoption, where interpretability and reliability are critical. Most existing AI systems act as black boxes, limiting transparency in medicine. We propose a causal inference framework integrated into neural networks to assess the influence of individual features on predictions. Using a prospective pediatric ophthalmology cohort of over 3000 children with longitudinal follow-up, our method estimates direct and indirect causal effects through intervention. Applied to myopia progression in children, our approach not only achieved good performance but also identified clinically plausible causal pathways. Refutation experiments with multiple falsification strategies confirm the robustness and reliability of causal effects. The approach is model-agnostic and suitable for digital health interventions requiring explainability. By incorporating unit-level causal reasoning into deep learning, this work advances transparent and reliable AI systems aligned with the goals of precision medicine and equitable healthcare.

graphic file with name 41746_2026_2442_Figa_HTML.jpg

Subject terms: Computational biology and bioinformatics, Diseases, Health care, Mathematics and computing, Medical research

Introduction

Myopia has become the most prevalent refractive disorder and a leading cause of vision impairment globally1,2. In East Asia, the adolescent myopia rate has reached alarming levels of 80–90%, with up to 20% developing high myopia—associated with sight-threatening complications, such as retinal detachment and macular degeneration35. The increasing incidence in younger populations calls for a deeper understanding of the multifactorial etiology of myopia68.

Traditional statistical methods, such as linear and logistic regression, have been widely used to explore myopia-related risk factors9,10. However, these models primarily capture correlations and often fail to represent the complex, nonlinear interactions inherent in biological systems1113. Furthermore, few studies incorporate causal frameworks, limiting mechanistic insight and weakening the translational value for designing targeted interventions14.

With the rise of digital medicine, artificial neural networks (ANNs) offer an opportunity to leverage large-scale, real-world health data for predictive modeling1518. Nevertheless, deep learning models are frequently criticized for their “black-box" nature, making it difficult to explain predictions or guide clinical action1923. To address this limitation, post-hoc interpretability techniques, such as SHapley Additive exPlanations (SHAP), have been widely adopted to attribute model outputs to input features24,25. While such methods improve transparency, they primarily provide correlation-based explanations and do not fully capture the underlying causal mechanisms. Interpretability and reliability are particularly essential in medical settings, where clinical decisions must be transparent, justifiable, and safe.

To bridge this gap, there is growing interest in combining causal inference with neural architectures, enabling not only accurate predictions but also insight into the underlying mechanisms2629. Such approaches are especially valuable for digital interventions, where understanding the impact pathways of behavioral, physiological, or environmental factors can inform scalable and personalized healthcare strategies.

In this study, we propose a causal modeling framework that integrates intervention-based causal attribution within deep neural networks. Using a prospective pediatric ophthalmology cohort tracking over 3000 children longitudinally, our framework disentangles both direct and indirect causal effects of input features on myopia progression, enhancing model transparency and clinical interpretability.

To achieve this, we categorize input neurons into three functional types—Isolated, Pure, and Confounded Units—based on their positions in the learned causal structure. We design targeted attribution strategies for each, including a domain-adaptive meta-learning approach for estimating causal effects under confounding bias. In addition to robust predictive performance, our model enables the reconstruction of interpretable causal pathways within the network, offering a mechanistic view of how input signals propagate through the architecture.

Importantly, our approach is model-agnostic and can be generalized to other clinical contexts that demand trustworthy, explainable AI. By integrating deep learning, causal reasoning, and real-world validation, our work contributes to the development of reliable digital interventions that meet the interpretive and ethical standards of modern healthcare.

Results

Neurons’ causal structure reveals functional categories of input neurons

The causal explanation framework proposed in this study is model-agnostic, requiring that the prediction model be trained prior to causal analysis. We first constructed a binary classification model for myopia prediction using a feedforward neural network (as shown in Fig. 1b) trained on a pediatric ophthalmic cohort dataset collected via annual surveys in City Anyang, Henan Province, China, from 2011 to 2017 (see the section “Dataset”). As illustrated in Fig. 1a, the model input consists of 16 variables spanning behavioral, physiological, dietary, environmental, and genetic domains (Table 1). The result of this model for myopia classification is summarized in Table 2, the model achieved an accuracy of 0.933, indicating strong discriminative ability between the two classes. This result confirms that the binary myopia classification outcomes provide a robust foundation for subsequent causal attribution analyses, ensuring that the interpretability of the model is built upon clinically reliable predictions.

Fig. 1. The workflow of causal structure discovery and functional categorization of input neurons.

Fig. 1

a Data preprocessing. Sixteen input variables were selected across five domains: behavioral, physiological, environmental, dietary, and hereditary. b Architecture of the neural network (NN). c Flow of removing edges via causal discovery algorithm. d Four rules for determining edge directions. e The directed causal graph of key behavioral and genetic factors related to myopia, involving 10 of the 16 input variables, connected by a total of 15 edges. f Three categorized units of the corresponding input-layer neurons in NN.

Table 1.

Variables of the pediatric myopia dataset from the Anyang Childhood Eye Study

Variable Variable meaning Data type Data range Units
CR Cycloplegic refraction Continuous −5.4 to 8.7 diopter (D)
AL Axial length Continuous 20–34 mm
NW Total nearwork load Continuous 0–102 hour (h)
DTO Distance-viewing time outdoors Continuous 0–57 hour (h)
NAR Near accommodative ability of right eye Continuous −5.1 to 2.2 diopter (D)
DAR Distance accommodative ability of right eye Continuous −6.6 to 4.4 diopter (D)
HEIGHT Height Continuous 97–143 cm
PULSE Pulse rate Continuous 52–140 Bpm
GENDER Gender Binary 0.1 0 represents a girl, 1 represents a boy
CB Carbonated beverage consumption frequency Discrete 1–5 times per week
EGG Egg consumption frequency Discrete 1–4 times per week
REDM Red meat consumption frequency Discrete 1–5 times per week
WHIM White meat consumption frequency Discrete 1–5 times per week
PWG Number of parents wearing glasses Discrete 0–2 person
K1 Corneal keratometry in both eyes Continuous 38–71 mm
K2 Corneal keratometry in both eyes Continuous 38–183 mm
Myopia Myopia or not Binary 0.1 0 represents non-myopia, 1 represents myopia

Table 2.

Performance metrics of the four-layer feedforward neural network and traditional machine learning models

Model Accuracy Sensitivity Specificity F1 score AUC
4-layer FNN 0.935 ± 0.011 0.939 ± 0.010 0.927 ± 0.019 0.947 ± 0.009 0.976 ± 0.007
SVM 0.863 ± 0.008 0.867 ± 0.013 0.854 ± 0.016 0.888 ± 0.007 0.921 ± 0.014
Logistic regression 0.935 ± 0.016 0.941 ± 0.009 0.926 ± 0.030 0.948 ± 0.013 0.977 ± 0.008
Naive Bayes 0.845 ± 0.018 0.783 ± 0.027 0.949 ± 0.011 0.864 ± 0.018 0.948 ± 0.010
K-Nearest neighbors 0.922 ± 0.012 0.936 ± 0.015 0.899 ± 0.011 0.938 ± 0.010 0.978 ± 0.008
Random forest 0.936 ± 0.014 0.940 ± 0.007 0.928 ± 0.027 0.948 ± 0.011 0.977 ± 0.009

To uncover the underlying causal relationships among input features, we applied a constraint-based causal discovery algorithm combining the PC algorithm30 with Degenerate Gaussian Scoring and dimensionality reduction via PCA. PCA was used solely as a preprocessing step to mitigate collinearity and improve numerical stability in conditional independence testing, while ensuring that the final causal graph is defined exclusively over the original 16 clinical and physiological variables (as shown in Fig. 1c, d). The final causal graph was constructed using the TETRAD software suite31. As shown in Fig. 1e, the resulting directed acyclic graph (DAG) reveals 15 causal edges connecting 10 of the 16 variables. Notably, six variables—parental myopia (PWG), gender (GENDER), near and distance accommodative ability (NAR, DAR), and corneal curvature (K1, K2)—were found to be direct causes of both axial length (AL) and cycloplegic refraction (CR). Height and red meat consumption (REDM) showed direct effects on AL only, while AL was identified as a direct cause of CR.

Based on Fig. 1f, we categorized the corresponding input-layer neurons in the trained neural network into three structurally distinct types: Isolated Units, Pure Units, and Confounded Units. This categorization is not merely descriptive; rather, it enables a principled decomposition of causal contributions within the neural network. By aligning unit types with distinct structural roles in the causal graph, we introduce a unified framework to estimate causal effects under different confounding conditions, thus bridging structural causal inference and neural attribution in a theoretically grounded manner.

Isolated Units (Fig. 1f, left one) are input neurons that do not have any causal relation with other input features. Their contributions to the prediction model are assumed to be independent. Six variables fall under this category: CB, EGG, NW, PULSE, WHIM, and DTO.

Pure Units (Fig. 1f, middle one) are input neurons that influence the output only through downstream mediators. For example, variable A causally influences variable B, which in turn affects the prediction output. This corresponds to a chain-like causal path. Variables such as PWG, GENDER, NAR, DAR, K1, K2, HEIGHT, and REDM are identified as Pure Units based on their positions in the causal graph.

Confounded Units (Fig. 1f, right one) are input neurons whose effect on the output is entangled with that of another causally related variable, forming a v-structure or a backdoor path. In our study, AL and CR are identified as Confounded Units, both influenced by multiple upstream variables and interlinked causally.

This categorization is central to the subsequent causal attribution analysis. By structurally disentangling how input neurons contribute to predictions, we establish a scalable and interpretable framework for assessing causal roles within neural networks in medical applications.

Causal attribution experiments on isolated units reveal consistency and deviation from prior knowledge

We performed causal attribution experiments on the six Isolated Units identified in the causal graph to assess the individual causal effect of each input-layer neuron on the output predictions of the trained neural network. In this context, attribution refers specifically to estimating the average treatment effect (ATE) of a single input neuron on each output class in the binary classification task (myopic vs. non-myopic).

Each Isolated Unit was treated as an intervention variable and directly input into a causal attribution algorithm to quantify its effect. Since the model produced a binary prediction for myopia status (Yes/No), we conducted separate interventional analyses on both output classes.

Pulse rate (PULSE) has been previously reported to be negatively associated with myopia risk3235, possibly due to increased ocular blood flow influencing eye growth. As shown in Fig. 2a, increasing the intervention value of PULSE leads to a rising trend in the ’No’ (non-myopic) output neuron’s ATE and a decreasing trend in the ’Yes’ (myopic) output neuron’s ATE. This suggests a negative causal relationship between PULSE and myopia, in agreement with prior clinical findings and indicating that the model has accurately captured this relationship.

Fig. 2. Causal attribution results of Isolated Units.

Fig. 2

a Intervention value of PULSE. b Intervention value of CB. c Intervention value of EGG. The difference between the ATE value and its average value is presented as the green line, which shows the increasing/decreasing trend of the variable for myopic. 'Yes' means myopic and 'No' means non-myopic. The vertical dotted line indicates that the ATE at this location is 0.

In contrast, carbonated beverage consumption frequency (CB) and egg consumption frequency (EGG) have been reported to be positively associated with myopia3639. However, as shown in Fig. 2b, c, the model exhibits an inverted trend: CB shows a positive effect on the ’No’ neuron and a negative effect on the ’Yes’ neuron, while EGG shows the opposite pattern. These observations indicate that the model’s learned causal behavior for these features is inconsistent with established domain knowledge. A likely explanation is the uneven distribution of CB and EGG consumption across subgroups in our dataset, which may have influenced the model’s learned patterns.

In summary, among the six Isolated Units, the model correctly captures causal trends for PULSE and DTO, but misrepresents those for NW, CB, and EGG. The effect of WHIM appears negligible and is excluded from further interpretation4045. Detailed attribution results of variables other than CB, EGG, and PULSE are shown in Supplementary material Figs. S1S3.

Causal attribution on pure units confirms model reliability across mediated paths

For Pure Units, causal attribution can be conducted without adjusting for additional covariates, as their causal effects are either direct or fully mediated by identified intermediate variables. The attribution process involves identifying all causal paths—both direct and indirect—between the input neuron and the output neuron. Assuming linear causal relationships among the variables in the dataset, we performed causal inference using linear regression applied to the structure encoded in the causal graph. The resulting causal effect graph is shown in Fig. 3a, b, where solid lines represent positive causal effects and dashed lines indicate negative effects.

Fig. 3. Causal attribution results of Pure Units.

Fig. 3

a Result of positive causal effects. b Result of negative causal effects. Solid lines represent positive causal effects and dashed lines indicate negative effects. Numbers on each line represent the causal effect value of the corresponding causal links. c–e Intervention value of HEIGHT, GENDER, and REDM. The difference between the ATE and its average value is presented as the blue line, showing the increasing/decreasing trend of variable for myopic. 'YES' means myopic and 'NO' means non-myopic. The vertical dotted line indicates that the ATE at this location is 0. f Refutation results of the 15 causal effects.

Two representative examples are shown in Fig. 3c, d, where we analyze the effect of height (HEIGHT) and gender (GENDER). As the intervention value of HEIGHT increases, the response of the ’No’ (non-myopic) output neuron shows a decreasing trend, while the ’Yes’ (myopic) output neuron shows an increasing trend. This trend is consistent with previous research findings that people with greater height are more likely to suffer from myopia4652. It should be emphasized, however, that the causal pathway identified by our framework is HEIGHT → AL → CR, rather than a direct HEIGHT → Myopia link. This indicates that height acts indirectly through ocular growth and refractive status, and should not be interpreted as a direct or modifiable risk factor for myopia. The trend after the intervention on gender (GENDER) is the same as previous findings, that is, among adolescents, the incidence of myopia in females is higher than that in males5360. However, such associations may vary with individual-level heterogeneity.

Among the eight Pure Units, the model correctly reflects the causal trends for NAR, DAR, K1, K2, HEIGHT, GENDER, and PWG6189. Notably, while axial length (AL) is widely recognized as the primary anatomical determinant of myopia and a direct driver of cycloplegic refraction (CR), several clinical studies have reported that accommodative function—including near and distance accommodative ability (NAR and DAR)—may also contribute to AL elongation and myopia progression9093. Therefore, the NAR/DAR → AL path revealed by our framework should be interpreted as a data-driven attribution pattern that aligns with certain clinical observations, while requiring further validation. Detailed attribution results of variables other than HEIGHT and GENDER are shown in Supplementary material Figs. S4S7. The effect of REDM remains inconclusive in the existing literature and is therefore not interpreted in terms of fit correctness. However, our method revealed a weak but directional causal path from REDM to AL (as shown in Fig. 3b S5S9), followed by a stronger negative causal influence from AL to CR (as shown in Fig. 3b S9S10). Although the direct effect of REDM on AL is marginal, as shown in Fig. 3e, the existence of this indirect pathway suggests a potential mediating mechanism by which dietary patterns might influence refractive development. This finding, uncovered through our causal attribution framework, suggests a hypothesis-generating signal that warrants further investigation in nutritional ophthalmology and longitudinal dietary studies.

Meta-learning-based attribution for confounded units demonstrates stable causal estimation

To robustly assess causal effects under confounding, especially in cases where input neurons share common ancestors with output nodes, conventional regression-based methods often yield biased estimates. To address this challenge, we introduce a domain-adaptive meta-learning algorithm that leverages covariate balancing through propensity scores. The key idea is to learn a causal representation that generalizes across input distributions, enabling unbiased estimation even when confounders are present. This framework not only corrects for selection bias but also enhances robustness under covariate shift—making it well-suited for causal inference in complex medical prediction tasks. Given that these input neurons are causally entangled with others and subject to confounding with the output layer, we first identified and controlled for confounding variables using the backdoor criterion, enabling the estimation of unbiased causal effects.

To evaluate the performance of different causal effect estimation models, we adopt the R score metric based on the R-Learner framework. The R score is defined as

Rscore=1L^RLbase 1

where L^R is the R-loss obtained from cross-validation using the estimated treatment effect model τ^(x), and Lbase is the baseline loss computed using a constant average treatment effect.

This metric assesses the ability of the estimated individualized treatment effect to explain residual variation, in comparison to a constant effect. A higher R score indicates better performance in modeling heterogeneous treatment effects. Our proposed meta-learning model achieved superior accuracy, with the best result obtained using a GBDT-based estimator (R score = 0.3850). Further improvements were achieved via an ensemble method combining GBDT and Random Forest with a 10:1 weight ratio, yielding an R score of 0.6277, indicating strong attribution performance (see Supplementary material for details).

To mitigate bias introduced by discretizing continuous variables, we applied both equal-width and equal-frequency discretization strategies to the two Confounded Units—axial length (AL) and cycloplegic refraction (CR)—and estimated their average causal effects (ACE) on the model’s binary outputs (’No’ and ’Yes’). As shown in Fig. 4a, c, AL exhibits a positive causal effect on myopia risk, consistent with established clinical findings9496. Similarly, CR shows a pattern where lower refractive power corresponds to increased myopia risk9799. Equal-width discretization provides a clearer causal trend compared to equal-frequency discretization.

Fig. 4. Causal attribution results of Confounded Units.

Fig. 4

a and c Causal effects of AL and CR on output neurons (Yes/No) under equal-width and equal-frequency discretization, respectively. Equal-width discretization divides AL and CR into 10 groups within their value intervals and calculates the ACE between adjacent discrete groups. The equal-frequency discretization divides variables into 10 groups, and the number of data points in each group is the same. The output neuron value (Yes/No) was benchmarked at 0. The corresponding causal effect value was added to each intervention interval, from which the dashed line of the output neuron value change showed. b and d Refutation results of equal-width and equal-frequency discretization, respectively.

Importantly, the causal attribution trends for both AL and CR are aligned with current medical understanding, indicating that the model correctly internalized known relationships between these variables and myopia.

Refutation experiments validate causal robustness

To evaluate the internal validity and robustness of our estimated causal effects, we conducted a series of refutation experiments inspired by the DoWhy framework100. These experiments were designed to determine whether our results hold under systematic perturbations to data and model assumptions.

Five types of intervention strategies were employed: bootstrap resampling, random addition of artificial confounders, subset validation, placebo treatment, and dummy outcome substitution.

The results of the Pure Units refutation experiment are shown in Fig. 3f. It can be seen from the figure that the error rates (err%) obtained in the five refutation experiments are all <10%, and there is no situation where the error rates of the five refutation experiments are all >1%.

The results of the Confounded Units refutation experiment are shown in Fig. 4b, d. For AL, equal-frequency discretization produced lower refutation error, while CR showed greater stability with equal-width discretization. These results collectively demonstrate that the model is capable of producing accurate and stable causal attributions even under confounding conditions.

Disscusion

In this study, we present a causal learning-based interpretability framework for neural networks, tailored to the clinical prediction of pediatric myopia. By abstracting a feedforward neural network into a structured causal system, we quantify the causal contribution of each input neuron to the model output and implement targeted attribution strategies according to the neuron’s structural role in the learned causal graph. Using a longitudinal dataset collected in City Anyang, Henan Province, China (2011–2017), our method integrates causal discovery, mediation analysis, and domain-adaptive estimation to produce interpretable and robust neural reasoning.

The inferred causal graph, constructed using a constraint-based algorithm (PC with Degenerate Gaussian Scoring), revealed stable directed dependencies among the 16 input features. Based on this structure, we categorized input neurons into three functionally distinct types—Isolated, Pure, and Confounded Units—each representing a unique causal configuration. Attribution results for Isolated Units aligned with established risk factors in several cases: for instance, increased pulse rate and outdoor activity were found to have protective effects against myopia, consistent with previous studies101. In contrast, misalignment was observed for some dietary variables (e.g., carbonated beverage and egg consumption), suggesting either data limitations or prior model biases. Pure Units, evaluated through linear path-based inference, demonstrated biologically plausible effects—e.g., the positive causal influence of height on myopia risk was found to be mediated through axial elongation. For Confounded Units such as axial length (AL) and cycloplegic refraction (CR), our meta-learning-based estimation—guided by the backdoor criterion—produced coherent and stable causal effects. This was further validated by a suite of counterfactual perturbation experiments.

These results demonstrate that black-box neural networks can be decomposed into biologically meaningful, causally interpretable substructures102,103. The model’s ability to internalize known physiological mechanisms, particularly around AL and CR—which directly modulate the eye’s focal characteristics—confirms its biological plausibility. Overall, the causal pathways identified by our framework demonstrate strong biological plausibility and align closely with established clinical knowledge. Throughout the study, we placed particular emphasis on clinical interpretability, and for each major result, we cross-referenced supporting ophthalmic literature to ensure consistency with prior findings. We fully acknowledge the central role of axial length (AL) as the dominant anatomical determinant of myopia, and our model’s detection of the AL → CR relationship is fully consistent with this consensus. At the same time, our framework is inherently data-driven: the causal paths it reveals should be understood as internal attribution patterns within the predictive model, which require cautious interpretation in light of existing clinical evidence. Notably, we found that many of these data-driven paths are corroborated by previous studies, reinforcing the clinical validity of our approach. Compared to conventional attribution methods like SHAP or LIME104108, which often fail under confounding or produce unstable saliency maps, our causal framework ensures interpretability by explicitly modeling conditional dependencies and causal paths. This not only improves explanation reliability but also enables insight into systemic bias or misattribution patterns—critical for deploying AI safely in clinical settings.

Importantly, our method has direct implications for the design of digital interventions and early prevention strategies. The identification of modifiable behavioral risk factors, such as outdoor activity, as causal precursors to myopia progression suggests that individualized lifestyle adjustments could be guided by our model outputs. Additionally, we identify discrepancies between learned model logic and medical consensus, which could be used to iteratively improve neural architectures or training protocols, enhancing both performance and clinical acceptability.

At the same time, we note that several features exhibited attribution patterns that diverged from previously established clinical or epidemiological knowledge. These discrepancies may, at least in part, arise from distributional characteristics of the dataset—such as uneven representation of certain behaviors across age or socioeconomic subgroups—which can shape the model’s learned associations. It is important to emphasize that our causal framework is primarily intended to disentangle and visualize the internal attribution logic of predictive models, rather than to fully correct for population-level confounding. Consequently, while the framework enhances interpretability and highlights biologically plausible pathways, residual distributional imbalances may still produce effects inconsistent with consensus knowledge. We regard this as a constructive signal, since such mismatches can reveal potential data artifacts or, alternatively, point to novel hypotheses warranting further validation.

Despite these promising findings, several limitations remain. First, the accuracy of the causal discovery process relies on the completeness and quality of observational data. In particular, the presence of missing covariates or unobserved confounders may bias the inferred graph structure. Furthermore, our method was validated on complete longitudinal data, which may not fully reflect real-world scenarios where data are often incomplete or irregularly sampled. Using complete cases may introduce selection bias if missingness is not completely at random. To address incomplete data in future applications, our framework could be extended through multiple imputation techniques for missing values before causal discovery, or through modified causal discovery algorithms that handle missing data directly. Second, our mediation model currently assumes linear causal effects, while robustness is verified through counterfactual tests, highly nonlinear interactions may not be fully captured. Kernel-based or neural causal inference methods may be needed to improve generalizability. Third, while this study chose a feedforward architecture, our framework is designed to be model-agnostic due to the decoupling of causal discovery from neural attribution. In practice, extending this method to other architectures would primarily involve substituting the attribution module while maintaining the structural causal categorization logic. For instance, when applying the framework to convolutional neural networks (CNNs) for retinal imaging, input units could be defined as high-level features extracted from specific regions of interest, and attribution could be estimated via Grad-CAM or Integrated Gradients. For transformer-based models handling genomic sequences, the nodes could represent token embeddings or attention heads, utilizing Attention Rollout or perturbation-based attribution to quantify causal contributions.

Looking ahead, our work opens several directions for innovation: (1) Embedding causal graphs into neural network training could lead to structure-aware architectures that align model connectivity with known biomedical pathways. (2) Extending our framework to cross-modal causal interpretation—bridging electronic health records with imaging or sensor data—would further improve clinical relevance. Such an extension would leverage the framework’s modularity, where the universal “causal skeleton" derived from data governs the interpretation of diverse neural components across different sub-networks. (3) As causal attributions can highlight mislearned associations, they may serve as real-time monitors for algorithmic fairness and bias. (4) With temporally dense datasets, our framework can evolve into dynamic causal modeling, enabling the simulation of patient-specific interventions for myopia prevention or progression control.

In conclusion, we introduce a rigorous and generalizable framework for unit-level causal attribution in neural networks, validated in a clinically relevant setting. By focusing on the interpretation of the model in causal principles, we offer a pathway toward more transparent, robust, and trustworthy AI systems for digital medicine.

Methods

Dataset

We used the Dataset of a prospective longitudinal school-based cohort study—the Anyang Childhood Eye Study (ACES)109, one of the largest longitudinal studies of myopia in school-aged children in China. The dataset was collected through annual follow-up surveys from 2011 to 2017 and included 3112 first-grade students at baseline, who were followed through sixth grade (school grades 1–6, spanning six academic years). Each year corresponded to a single visit per participant, with examinations conducted once annually at their schools. At baseline, participants had a mean age of 7.1 years (range: 5.7–9.3 years), and 57.8% were male. Data collection was conducted in collaboration with local primary schools under the supervision of certified medical professionals and epidemiologists. The study received ethics approval from the Institutional Review Board of Beijing Tongren Hospital, Capital Medical University, and informed consent was obtained from all participants and their guardians. Written informed consent was obtained from all participants and their parents or legal guardians.

All children provided a written informed consent form signed by their parents, and verbal consent was also obtained from each child. This study adhered to the tenets of the Declaration of Helsinki. Ethics committee approval was obtained from the Institutional Review Board of Beijing Tongren Hospital, Capital Medical University (TRECKY2018-030).

The survey captured a broad range of data across behavioral, physiological, environmental, dietary, and hereditary domains. After preprocessing, 2748 records with complete and valid longitudinal data were retained for analysis (88.3% of baseline participants). These records spanned a 6-year follow-up period, enabling the construction of a rich temporal feature space suitable for causal discovery and model training. In this survey, myopia was classified as a binary outcome (0 = non-myopia, 1 = myopia) according to cycloplegic refraction (CR > −0.5 D for non-myopia, CR ≤ −0.5 D for myopia). The myopia classification was determined based on whether the child developed myopia at any point during the 6-year follow-up period, rather than solely at the last visit, thereby capturing incident myopia cases throughout the study period.

To minimize potential methodological bias, different preprocessing strategies were applied depending on the temporal characteristics of each variable. For dynamic ophthalmic measures (e.g., CR), we derived annual progression rates rather than simple averages. For behavioral and dietary variables (e.g., NW, DTO, CB, EGG, REDM, WHIM), cumulative values across 6 years were used to capture long-term exposure. For relatively stable biometric and physiological measures (e.g., NAR, DAR, PULSE, AL, K1, K2), 6-year averages were computed to reduce random fluctuations and measurement error. Static variables such as gender and parental myopia (PWG) were retained as baseline values.

After data preprocessing (see Supplementary material for details), 16 variables were finally selected and used as input for model training and causal attribution. A full list of selected variables and their definitions is provided in Table 1.

Neural network architecture

We constructed a feedforward neural network (FNN) model to predict myopia status based on the preprocessed features derived from the dataset. The network architecture consists of an input layer with 16 neurons corresponding to the selected variables, two fully connected hidden layers with 12 and 8 neurons, respectively, and an output layer with 2 neurons representing binary classification outcomes (myopia vs. non-myopia). The network structure is illustrated in Fig. 1b.

Each neuron in the hidden layers uses the Rectified Linear Unit (ReLU) as the activation function to introduce non-linearity. The final output layer employs the softmax function to produce probability distributions over the two classes. The model is trained using the cross-entropy loss function and optimized via stochastic gradient descent with backpropagation.

To identify optimal model performance, we conducted a series of hyperparameter tuning experiments, adjusting the learning rate, batch size, and number of training epochs. The best-performing configuration was selected based on validation accuracy. Final performance metrics of the four-layer FNN model on the test set are reported in Table 2, demonstrating high accuracy and robustness across multiple evaluation criteria.

To benchmark performance, we also implemented five traditional machine learning models—logistic regression, Naive Bayes, random forest, K-nearest neighbors, and support vector machine—using the same input features. As shown in Table 2, the 4-layer FNN achieved consistently high performance across all evaluation metrics. Its sensitivity was comparable to that of SVM and logistic regression, all of which reached the highest values among the tested models (within the allowable statistical error), thereby reducing the risk of missed myopia cases. Compared with other baseline models, the FNN also provided a more balanced trade-off between sensitivity and specificity, indicating its robustness in distinguishing myopia versus non-myopia. Furthermore, given its representative “black-box” nature, the FNN serves as an appropriate foundation for our proposed causal explanation framework, which aims to move beyond predictive accuracy to provide interpretable and clinically meaningful insights.

Causal discovery

Causal discovery, a key approach for inferring causal relationships from observational data via graphical models like DAGs, enables researchers to uncover intervention-relevant structures in the absence of randomized trials110. In this study, we adopted a constraint-based causal discovery approach, specifically the Peter-Clark (PC) algorithm30, to identify causal relationships among the 16 input variables of the myopia prediction model. The PC algorithm begins with a fully connected undirected graph and iteratively removes edges by testing conditional independence. It then determines edge directions using v-structure identification and Meek rules111, producing a partially directed acyclic graph (CPDAG) that represents the underlying causal structure.

A critical challenge in biomedical datasets is the presence of mixed-type variables, including both continuous and categorical features. Traditional independence tests (e.g., Pearson correlation, Chi-squared test, or Fisher’s Z-test) are limited to continuous or discrete data and often fail in mixed domains.

To address this, we integrated the degenerate Gaussian likelihood ratio test (DG-LRT) into the PC algorithm. This method, originally proposed by Andrews112,113 and refined for mixed data, uses a transformed representation of categorical variables via one-hot encoding, followed by degenerate Gaussian modeling. The resulting structure is especially well-suited for medical applications where variable types and scales vary widely.

This design highlights the cross-disciplinary adaptability. It is capable of capturing statistically reasonable causal structures across hybrid domains, bridging data mining, statistical learning, and clinical reasoning.

For stability and completeness, the causal discovery process was conducted on the entire dataset rather than a training split, ensuring that the learned structure reflected global dependencies among variables.

The final causal graph extracted from our dataset included 10 out of the 16 input variables, forming 15 directed edges (as shown in Fig. 1e). This structure serves as the backbone for the following causal attribution and interpretation. The detailed steps of the algorithm are presented in Algorithm 1.

Algorithm 1

PC algorithm with a degenerate Gaussian likelihood ratio test Inline graphic

Attribution algorithm

In this study, causal inference is conducted at two levels: estimating the effect of input-layer neurons on output neurons within a neural network (i.e., attribution) and evaluating inter-feature causal effects in the dataset using regression-based techniques.

Causal effects can be defined at different levels. Assuming binary treatment T ∈ {0, 1}, the most commonly used is

Average treatment effect (ATE):

ATE=E[Y(1)]E[Y(0)]

Isolated Units: Isolated Units refer to specific input neurons in a neural network whose causal effects on output neurons are to be independently assessed. Based on the operator framework introduced in Section Causal Inference, we formalize the average causal effect (ACE) from input neuron xi to output neuron y, as shown in Fig. 1f left one.

For a binary variable x, its ACE on another variable y is defined as

ACE(x)=E[ydo(x=1)]E[ydo(x=0)] 2

For continuous-valued variables, we extend this to

ACExi=1highilowilowihighiE[y|do(xi=α)]dα 3

To mitigate the inefficiency and variance introduced by high-dimensional sampling, we approximate the intervention expectation using a second-order Taylor expansion of the network function f.

Let μ be the mean input vector, and f(μ) be the smooth neural network function. For a small perturbation δ in the input, the Taylor expansion around μ is

f(μ+δ)f(μ)+f(μ)δ+12δ2f(μ)δ 4

Taking expectation over δ where E[δ]=0 leads to:

E[y|do(xi=α)]f(μ)+12tr(2f(μ)Cov) 5

This allows efficient estimation of intervention effects without full re-evaluation over the dataset.

The algorithm estimates the causal effect of input neuron xi on the output neuron y over a specified intervention interval [lowi, highi] by dividing it into num uniform sub-intervals.

The full procedure is shown in Algorithm 2.

Algorithm 2

Approximate causal effect for isolated input unit Inline graphic

Pure Units: This section discusses how to attribute causal effects for input neurons that are causally dependent on other inputs but are not affected by confounders—referred to as Pure Units. We assume these neurons are connected to other input neurons via causal pathways, but no common ancestor influences both the input and output neurons (i.e., no confounding exists).

As illustrated in Fig. 1f, middle one, let l11 be the intervened input neuron and ln1 be the output neuron. In this structure, l11 can influence ln1 through multiple paths, such as the direct path P1 and an indirect path P2 via intermediate neuron l12, but no confounders are present.

In this scenario, the causal effect of l11 on ln1 includes both direct and indirect components:

ACE(l11ln1)=ACEP1+ACEP2+ 6

Each term corresponds to a causal path, and the effect along each path can be estimated as the product of edge weights along that path. For instance, the average causal effect along the indirect path P2 from l11l12ln1 is given by

ACEP2=E[l12do(l11=α2)]E[l12do(l11=α1)]βln1~l12 7

where βln1~l12 denotes the regression coefficient from l12 to ln1. Equation (7) reflects the mediated effect of l11 on ln1 through l12. The direct effect βln1~l11 accounts for path P1.

Once all relevant paths are identified and estimated, we update the mean vector μ and covariance matrix Σ of the training data to reflect the intervened distribution. The expected effect of l11 on ln1 under intervention do (l11 = α) is then computed using the intervention expectation approximation from the previous subsection:

E[ln1|do(l11=α)]f(μ)+12tr(2f(μ)Σ) 8

This method effectively aggregates the impact of multiple causal paths while maintaining computational efficiency. By correcting for potential bias induced by intermediaries, it enables robust causal interpretability in neural networks.

The full procedure is shown in Algorithm 3.

Algorithm 3

Estimate causal effect for pure input unit Inline graphic

Confounded Units: In real-world neural networks, causal dependencies among input neurons are often interleaved with hidden confounding variables. A confounder is a variable that simultaneously influences both the intervention (input neuron) and the outcome (output neuron), thereby creating backdoor paths that induce selection bias and invalidate naive causal estimation.

As shown in Fig. 1f, right one, the input neuron l11 affects output neuron ln1 via the direct causal path P1: l11ln1. However, the presence of neuron l12, which also influences ln1 and is influenced by l11, forms a backdoor path l11l12ln1. This makes l12 a confounder and it needs to be adjusted to avoid biased estimation of the average treatment effect (ATE) of l11 on ln1.

To estimate unbiased causal effects in the presence of confounders, we adopt a four-stage framework that integrates graphical causal identification, propensity-based domain adaptation, and ensemble-based meta-learning114. Specifically:

Confounder identification: Based on a predefined causal graph, we identify a sufficient set of covariates Z that satisfy the backdoor criterion, thereby blocking all spurious associations between the treatment variable X and the outcome Y.

Propensity score estimation: We estimate the propensity score e(z) = P(X = 1∣Z = z) using a supervised learning model. These scores are subsequently used to reweight samples and mitigate distributional imbalance across treatment groups, enabling domain adaptation.

Ensemble meta-learning for outcome modeling: To model potential outcomes, we train ensemble-based predictors using gradient boosted decision trees (GBDT) and random forests (RF), combined with a fixed weight ratio of 10:1. These models are trained separately on the treatment and control groups with inverse-propensity weighting to account for confounding.

Causal effect estimation: Individual treatment effects (ITEs) are approximated by computing residuals between observed and predicted counterfactual outcomes. A final regression model is fitted to these residuals to estimate the conditional average treatment effect τ^(x). Prior to modeling, continuous covariates such as axial length (AL) and cycloplegic refraction (CR) are discretized using both equal-width and equal-frequency discretization to satisfy the discrete input requirements of the meta-learning model.

Algorithm 4

Domain-adaptive causal attribution via ensemble-based meta-learning Inline graphic

Refutation experiments

To evaluate the robustness and reliability of the estimated causal effects, we conduct a series of refutation experiments following the methodology proposed in DoWhy100. These experiments are designed to examine whether the estimated effect is stable under a variety of controlled perturbations. Specifically, five types of falsification strategies are applied:

Bootstrap validation (BV): This method uses resampling with replacement from the original dataset to generate synthetic bootstrap datasets. A reliable causal effect estimator should produce consistent estimates across these samples. The error rate is defined as

ERRBV=NewEffectEstimatedEffectEstimatedEffect 9

Add random common cause (ARCC): This method randomly introduces artificial confounding variables into the dataset. A robust estimator should show minimal change in causal effect when noise variables are added. The deviation indicates sensitivity to confounding bias. The error rate is calculated as

ERRARCC=NewEffectEstimatedEffectEstimatedEffect 10

Data subsets validation (DSV): This method randomly selects a subset of the data as the new evaluation set and recomputes the causal effect. A stable estimation algorithm should yield similar results across data splits. The error rate is defined as

ERRDSV=NewEffectEstimatedEffectEstimatedEffect 11

Placebo treatment (PT): In this test, the original treatment variable is replaced by a randomly permuted variable (i.e., placebo). Since there is no actual treatment effect, a valid causal estimator should report an effect close to zero. The error rate is defined as

ERRPT=NewEffect 12

Dummy outcome (DO): In this setting, the outcome variable Y is replaced with a randomly generated variable. If the causal inference method is valid, it should yield an estimated causal effect near zero. The error rate is

ERRDO=NewEffect 13

These falsification strategies provide a comprehensive suite of counterfactual diagnostics. A reliable causal estimation method should demonstrate stability across bootstrap samples, insensitivity to irrelevant variables, robustness to subsampling, and null effects under placebo or dummy variable conditions.

Supplementary information

Supplementary Information (602.8KB, pdf)

Acknowledgements

This research was funded by National Key Research and Development Program of China 2025YFE0112100, the Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education (Grant No. 2025102), the National Outstanding Young Physician Project, the Beijing High-Level Innovation and Entrepreneurship Talent Support Program Leading Talent Projects (G202512030), the National Key R&D Program of China (2022YFC3502502), the Fundamental Research Funds for the Central Universities 2025XC11020, the National Natural Science Foundation of China (82471113), the Beijing Natural Science Foundation (L248023), the Excellent Youth Talents Program of Capital Medical University (A2307), the Capital Health Research and Development of special grant (2024-2G-1081), and the Beijing New-star Plan of Science and Technology Cross-cooperation Project (20250484983).

Author contributions

Conceptualization: X.Y., Z.J., and W.S.; methodology: Z.J. and M.K.; investigation: Z.J., W.Z., W.G., and W.S.; visualization: W.Z. and Z.J.; data collection: H.L., Y.T., and Y.H.; data analysis: N.W. and S.L.; funding acquisition: X.Y., C.Y., and S.L.; project administration: X.Y. and W.S.; supervision: X.Y.; writing—original draft: Z.J., M.K., and W.Z.; writing—review and editing: all authors.

Data availability

The datasets analyzed in the current study are not publicly available due to patient privacy purposes, but are available upon reasonable request to the corresponding author Shi-Ming Li (lishiming81@163.com).

Code availability

The code can be made available upon reasonable request to the corresponding author Xu Yang (pyro_yangxu@bit.edu.cn).

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Zihui Jin, Mengtian Kang.

Contributor Information

Weihua Song, Email: liuliyue1118@163.com.

Ningli Wang, Email: wningli@vip.163.com.

Xu Yang, Email: pyro_yangxu@bit.edu.cn.

Shi-Ming Li, Email: lishiming81@163.com.

Supplementary information

The online version contains supplementary material available at 10.1038/s41746-026-02442-7.

References

  • 1.Liang, J. Trend and projection of myopia in children and adolescents from 1990 to 2050: a comprehensive systematic review and meta-analysis. Br. J. Ophthalmol.362, 109 (2025). [DOI] [PubMed] [Google Scholar]
  • 2.Lee, Y., Keel, S. & Yoon, S. Evaluating the effectiveness and scalability of the World Health Organization myopiaed digital intervention: mixed methods study. JMIR Public Health Surveill.10, e66052 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Yang, B.-Y. Significance of school greenspaces in preventing childhood myopia. Innovation5, 100551 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Morgan, I. G., Ohno-Matsui, K. & Saw, S.-M. Myopia. Lancet379, 1739–1748 (2012). [DOI] [PubMed] [Google Scholar]
  • 5.Rose, K. A., French, A. N. & Morgan, I. G. Environmental factors and myopia: paradoxes and prospects for prevention. Asia-Pac. J. Ophthalmol.5, 403–410 (2016). [DOI] [PubMed] [Google Scholar]
  • 6.Dolgin, E. The myopia boom. Nature519, 276 (2015). [DOI] [PubMed] [Google Scholar]
  • 7.Morgan, I. G., He, M. & Rose, K. A. Epidemic of pathologic myopia: what can laboratory studies and epidemiology tell us?. Retina37, 989–997 (2017). [DOI] [PubMed] [Google Scholar]
  • 8.Ng, D. S. & Lai, T. Y. Insights into the global epidemic of high myopia and its implications. JAMA Ophthalmol.140, 123–124 (2022). [DOI] [PubMed] [Google Scholar]
  • 9.Ghorbani Mojarrad, N., Williams, C. & Guggenheim, J. A. A genetic risk score and number of myopic parents independently predict myopia. Ophthalmic Physiol. Opt.38, 492–502 (2018). [DOI] [PubMed] [Google Scholar]
  • 10.Chua, S. Y. et al. Age of onset of myopia predicts risk of high myopia in later childhood in myopic Singapore children. Ophthalmic Physiol. Opt.36, 388–394 (2016). [DOI] [PubMed] [Google Scholar]
  • 11.Huang, T. et al. Artificial intelligence for medicine: progress, challenges, and perspectives. Innov. Med1, 100030 (2023). [Google Scholar]
  • 12.Xie, Y. Values and limitations of statistical models. Res. Soc. Stratif. Mobil.29, 343–349 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Yang, Y. et al. Computational modeling for medical data: from data collection to knowledge discovery. Innov. Life2, 100079–1 (2024). [Google Scholar]
  • 14.Grebovic, M., Filipovic, L., Katnic, I., Vukotic, M. & Popovic, T. Overcoming limitations of statistical methods with artificial neural networks. In 2022 International Arab Conference on Information Technology (ACIT), 1–6 (IEEE, 2022).
  • 15.Almeida, J. S. Predictive non-linear modeling of complex data by artificial neural networks. Curr. Opin. Biotechnol.13, 72–76 (2002). [DOI] [PubMed] [Google Scholar]
  • 16.Patra, J. C., Pal, R. N., Chatterji, B. & Panda, G. Identification of nonlinear dynamic systems using functional link artificial neural networks. IEEE Trans. Syst. Man Cybern. Part b (Cybern.)29, 254–262 (1999). [DOI] [PubMed] [Google Scholar]
  • 17.Aziz, R., Verma, C. & Srivastava, N. Artificial neural network classification of high dimensional data with novel optimization approach of dimension reduction. Ann. Data Sci.5, 615–635 (2018). [Google Scholar]
  • 18.Aggarwal, C. C. et al. Neural Networks and Deep Learning Vol. 10 (Springer, 2018).
  • 19.Luo, M. et al. Artificial intelligence for life sciences: a comprehensive guide and future trends. Innov. Life2, 100105–1 (2024). [Google Scholar]
  • 20.He, C., Ma, M. & Wang, P. Extract interpretability-accuracy balanced rules from artificial neural networks: a review. Neurocomputing387, 346–358 (2020). [Google Scholar]
  • 21.Zhang, Y., Tiňo, P., Leonardis, A. & Tang, K. A survey on neural network interpretability. IEEE Trans. Emerg. Top. Comput. Intell.5, 726–742 (2021). [Google Scholar]
  • 22.Fan, F.-L., Xiong, J., Li, M. & Wang, G. On interpretability of artificial neural networks: a survey. IEEE Trans. Radiat. Plasma Med. Sci.5, 741–760 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Zhang, Z. et al. Opening the black box of neural networks: methods for interpreting neural network models in clinical applications. Ann. Transl. Med.6, 216 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Lundberg, S. M. & Lee, S.-I. A unified approach to interpreting model predictions. Adv. Neural Inf. Process. Syst30, 4765–4774 (2017). [Google Scholar]
  • 25.Cremades, A., Hoyas, S. & Vinuesa, R. Additive-feature-attribution methods: a review on explainable artificial intelligence for fluid dynamics and heat transfer. Int. J. Heat Fluid Flow112, 109662 (2025). [Google Scholar]
  • 26.Moraffah, R., Karami, M., Guo, R., Raglin, A. & Liu, H. Causal interpretability for machine learning-problems, methods and evaluation. ACM SIGKDD Explor. Newsl.22, 18–33 (2020). [Google Scholar]
  • 27.Ahmad, O., Béreux, N., Baret, L., Hashemi, V. & Lecue, F. Causal analysis for robust interpretability of neural networks. In Proc. of the IEEE/CVF Winter Conference on Applications of Computer Vision 4685–4694 (IEEE, 2024).
  • 28.Xu, G., Duong, T. D., Li, Q., Liu, S. & Wang, X. Causality learning: A new perspective for interpretable machine learning. arXiv preprint arXiv:2006.16789 (2020).
  • 29.Pearl, J. Causality (Cambridge University Press, 2009).
  • 30.Spirtes, P., Glymour, C. N. & Scheines, R.Causation, Prediction, and Search (MIT Press, 2000).
  • 31.Ramsey, J. D. et al. Tetrad–a toolbox for causal discovery. In Proc. 8th International Workshop on Climate Informatics (eds Ebert-Uphoff, I., Gagne II, D. J. & Monteleoni. C.) 1–4 (National Center for Atmospheric Research, 2018).
  • 32.Baksh, J. et al. Myopia is an ischemic eye condition: a review from the perspective of choroidal blood flow. J. Clin. Med.13, 2777 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Li, S. et al. Effects of physical exercise on macular vessel density and choroidal thickness in children. Sci. Rep.11, 2015 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Yang, Y. S. & Koh, J. W. Choroidal blood flow change in eyes with high myopia. Korean J. Ophthalmol.29, 309 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.SHIH, Y.-F. et al. Ocular pulse amplitude in myopia. J. Ocul. Pharmacol. Ther.7, 83–87 (1991). [DOI] [PubMed] [Google Scholar]
  • 36.Zhang, D. et al. Correlation analysis of myopia and dietary factors among primary and secondary school students in Shenyang, China. Sci. Rep.14, 20619 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Hao, Z. et al. Association between sugar-sweetened beverages intake and myopia: results from the National Health and Nutrition Examination Survey 2001–2008. Ophthal. Epidemiol. 1–8 10.1080/09286586.2025.2548001 (2025). [DOI] [PubMed]
  • 38.Kim, J.-M. & Choi, Y. J. Association between dietary nutrient intake and prevalence of myopia in Korean adolescents: evidence from the 7th Korea National Health and Nutrition Examination Survey. Front. Pediatr.11, 1285465 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Mrugacz, M. et al. Interdependence of nutrition, physical activity, and myopia. Nutrients16, 3331 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Dutheil, F. et al. Myopia and near work: a systematic review and meta-analysis. Int. J. Environ. Res. Public Health20, 875 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Huang, H.-M., Chang, D. S.-T. & Wu, P.-C. The association between near work activities and myopia in children—a systematic review and meta-analysis. PLoS ONE10, e0140419 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Lingham, G., Mackey, D. A., Lucas, R. & Yazar, S. How does spending time outdoors protect against myopia? A review. Br. J. Ophthalmol.104, 593–599 (2020). [DOI] [PubMed] [Google Scholar]
  • 43.Mei, Z. et al. Efficacy of outdoor interventions for myopia in children and adolescents: a systematic review and meta-analysis of randomized controlled trials. Front. Public Health12, 1452567 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Philipp, D. et al. The relationship between myopia and near work, time outdoors and socioeconomic status in children and adolescents. BMC Public Health22, 2058 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.He, M. et al. Effect of time spent outdoors at school on the development of myopia among children in China: a randomized clinical trial. JAMA. 314, 1142–1148 (2015). [DOI] [PubMed] [Google Scholar]
  • 46.Kearney, S., Strang, N. C., Cagnolati, B. & Gray, L. S. Change in body height, axial length and refractive status over a four-year period in caucasian children and young adults. J. Optom.13, 128–136 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Tao, L. et al. Correlation between increase of axial length and height growth in Chinese school-age children. Front. Public Health9, 817882 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Kuoliene, K., Danieliene, E. & Tutkuviene, J. Eye morphometry, body size, and flexibility parameters in myopic adolescents. Sci. Rep.14, 6787 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Zhang, J. et al. The association between physical stature and myopia in elementary and junior high school graduates in Chongqing, China. Front. Med.12, 1530960 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Wang, D., Zhao, C., Huang, S., Huang, W. & He, M. Longitudinal relationship between axial length and height in Chinese children: Guangzhou twin eye study. Eye Sci.30, 12 (2015). [PubMed] [Google Scholar]
  • 51.Wang, D., Ding, X., Liu, B., Zhang, J. & He, M. Longitudinal changes of axial length and height are associated and concomitant in children. Investig. Ophthalmol. Vis. Sci.52, 7949–7953 (2011). [DOI] [PubMed] [Google Scholar]
  • 52.Lu, T. L. et al. Axial length and associated factors in children: the Shandong children eye study. Ophthalmologica235, 78–86 (2016). [DOI] [PubMed] [Google Scholar]
  • 53.Alrasheed, S. H. & Alghamdi, W. Systematic review and meta-analysis of the prevalence of myopia among school-age children in the Eastern Mediterranean region. East. Mediterr. Health J.30, (2024). [DOI] [PubMed]
  • 54.Zhao, L. et al. Prevalence and risk factors of myopia among children and adolescents in Hangzhou. Sci. Rep.14, 24615 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Rudnicka, A. R. et al. Global variations and time trends in the prevalence of childhood myopia, a systematic review and quantitative meta-analysis: implications for aetiology and early prevention. Br. J. Ophthalmol.100, 882–890 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Groot, A. L., Lissenberg-Witte, B. I., van Rijn, L. J. & Hartong, D. T. Meta-analysis of ocular axial length in newborns and infants up to 3 years of age. Surv. Ophthalmol.67, 342–352 (2022). [DOI] [PubMed] [Google Scholar]
  • 57.Terasaki, H. et al. Sex differences in rate of axial elongation and ocular biometrics in elementary school students. Clin. Ophthalmol15, 4297–4302 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Zadnik, K. et al. Ocular component data in schoolchildren as a function of age and gender. Optom. Vis. Sci.80, 226–236 (2003). [DOI] [PubMed] [Google Scholar]
  • 59.He, X. et al. Normative data and percentile curves for axial length and axial length/corneal curvature in Chinese children and adolescents aged 4–18 years. Br. J. Ophthalmol.107, 167–175 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Hashemi, H., Fotouhi, A. & Mohammad, K. The age-and gender-specific prevalences of refractive errors in Tehran: the Tehran eye study. Ophthalmic Epidemiol.11, 213–225 (2004). [DOI] [PubMed] [Google Scholar]
  • 61.Lee, D. C., Lee, S. Y. & Kim, Y. C. An epidemiological study of the risk factors associated with myopia in young adult men in Korea. Sci. Rep.8, 511 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Pandian, A. et al. Accommodative facility in eyes with and without myopia. Investig. Ophthalmol. Vis. Sci.47, 4725–4731 (2006). [DOI] [PubMed] [Google Scholar]
  • 63.Scheiman, M. et al. Longitudinal changes in corneal curvature and its relationship to axial length in the correction of myopia evaluation trial (COMET) cohort. J. Optom.9, 13–21 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Jiang, L. et al. Associations between corneal curvature and other anterior segment biometrics in young myopic adults. Sci. Rep.14, 8305 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Carney, L. G., Mainstone, J. C. & Henderson, B. A. Corneal topography and myopia. A cross-sectional study. Investig. Ophthalmol. Vis. Sci.38, 311–320 (1997). [PubMed] [Google Scholar]
  • 66.Fan, Q. et al. Genome-wide association meta-analysis of corneal curvature identifies novel loci and shared genetic influences across axial length and refractive error. Commun. Biol.3, 133 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Wang, Y.-M. et al. Myopia genetics and heredity. Children9, 382 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Jiang, X. et al. Association of parental myopia with higher risk of myopia among multiethnic children before school age. JAMA Ophthalmol.138, 501–509 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Martínez-Albert, N., Bueno-Gimeno, I. & Gené-Sampedro, A. Risk factors for myopia: a review. J. Clin. Med.12, 6062 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Koomson, N. Y. et al. Relationship between peripheral refraction, axial lengths and parental myopia of young adult myopes. J. Optom.15, 122–128 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Kurtz, D. et al. Role of parental myopia in the progression of myopia and its interaction with treatment in COMET children. Investig. Ophthalmol. Vis. Sci.48, 562–570 (2007). [DOI] [PubMed] [Google Scholar]
  • 72.Ishikuro, M. et al. Parental axial lengths and prenatal conditions related to child axial length: the TMM birthree cohort study. AJO Int.2, 100088 (2025). [Google Scholar]
  • 73.Mutti, D. O., Mitchell, G. L., Moeschberger, M. L., Jones, L. A. & Zadnik, K. Parental myopia, near work, school achievement, and children’s refractive error. Investig. Ophthalmol. Vis. Sci.43, 3633–3640 (2002). [PubMed] [Google Scholar]
  • 74.Hui, J., Peck, L. & Howland, H. C. Correlations between familial refractive error and children’s non-cycloplegic refractions. Vis. Res.35, 1353–1358 (1995). [DOI] [PubMed] [Google Scholar]
  • 75.Xiang, F., He, M. & Morgan, I. G. The impact of parental myopia on myopia in Chinese children: population-based evidence. Optom. Vis. Sci.89, 1487–1496 (2012). [DOI] [PubMed] [Google Scholar]
  • 76.Ma, Y. et al. Effect of parental myopia on change in refraction in Shanghai preschoolers: a 1-year prospective study. Front. Pediatr.10, 864233 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Tang, S. M. et al. Independent influence of parental myopia on childhood myopia in a dose-related manner in 2,055 trios: the Hong Kong Children Eye Study. Am. J. Ophthalmol.218, 199–207 (2020). [DOI] [PubMed] [Google Scholar]
  • 78.Onyeahiri, C. Approach to cycloplegic refraction. Can. J. Optom.86, 11–30 (2024). [Google Scholar]
  • 79.Hinkley, S., Iverson-Hill, S. & Haack, L. The correlation between accommodative lag and refractive error in minors under 18. Austin J. Clin. Ophthalmol.1, 1–5 (2014). [Google Scholar]
  • 80.Lim, L. S. et al. Dietary factors, myopia, and axial dimensions in children. Ophthalmology117, 993–997 (2010). [DOI] [PubMed] [Google Scholar]
  • 81.Xue, C. C. et al. Omega-3 polyunsaturated fatty acids as a protective factor for myopia. Am. J. Ophthalmol.268, 368–377 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Yin, C. et al. Dietary patterns and associations with myopia in Chinese children. Nutrients15, 1946 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Jin, G. et al. Corneal biometric features and their association with axial length in high myopia. Am. J. Ophthalmol.238, 45–51 (2022). [DOI] [PubMed] [Google Scholar]
  • 84.Merriam, J. & Zheng, L. The relationship of corneal curvature and axial length in adults. Investig. Ophthalmol. Vis. Sci.46, 864–864 (2005). [Google Scholar]
  • 85.Aljuhani, G., Alharbi, M., Alsaidi, R. & Alharbi, A. Axial length and keratometry characteristics of patients undergoing cataract surgery in Saudi Arabia. J. Med. Life17, 620 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Li, S.-M. et al. Corneal power, anterior segment length and lens power in 14-year-old Chinese children: the Anyang Childhood Eye Study. Sci. Rep.6, 20243 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Tideman, J. W. L., Polling, J. R., Jaddoe, V. W., Vingerling, J. R. & Klaver, C. C. Environmental risk factors can reduce axial length elongation and myopia incidence in 6-to 9-year-old children. Ophthalmology126, 127–136 (2019). [DOI] [PubMed] [Google Scholar]
  • 88.He, X. et al. Axial length/corneal radius ratio: association with refractive state and role on myopia detection combined with visual acuity in Chinese schoolchildren. PLoS ONE10, e0111766 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Grosvenor, T. & Scott, R. Role of the axial length/corneal radius ratio in determining the refractive state of the eye. Optom. Vis. Sci.71, 573–579 (1994). [DOI] [PubMed] [Google Scholar]
  • 90.Angie, E., Amra, A. A. & Sari, M. D. Correlation between axial length and amplitude of accommodation in myopia patients. J. Public Health Pharm.4, 300–317 (2024). [Google Scholar]
  • 91.Hughes, R. P., Read, S. A., Collins, M. J. & Vincent, S. J. Changes in ocular biometry during short-term accommodation in children. Ophthalmic Physiol. Opt.40, 584–594 (2020). [DOI] [PubMed] [Google Scholar]
  • 92.Woodman, E. C., Read, S. A. & Collins, M. J. Axial length and choroidal thickness changes accompanying prolonged accommodation in myopes and emmetropes. Vis. Res.72, 34–41 (2012). [DOI] [PubMed] [Google Scholar]
  • 93.Read, S. A., Collins, M. J., Woodman, E. C. & Cheong, S.-H. Axial length changes during accommodation in myopes and emmetropes. Optom. Vis. Sci.87, 656–662 (2010). [DOI] [PubMed] [Google Scholar]
  • 94.Flitcroft, D. I. et al. Imi–defining and classifying myopia: a proposed set of standards for clinical and epidemiologic studies. Investig. Ophthalmol. Vis. Sci.60, M20–M30 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Mutti, D. O. et al. Refractive error, axial length, and relative peripheral refractive error before and after the onset of myopia. Investig. Ophthalmol. Vis. Sci.48, 2510–2519 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Meng, W., Butterworth, J., Malecaze, F. & Calvas, P. Axial length of myopia: a review of current research. Ophthalmologica225, 127–134 (2011). [DOI] [PubMed] [Google Scholar]
  • 97.Tharwat, E. et al. Effect of cycloplegia on the refractive status of children. Afr. Vis. Eye Health83, 916 (2024). [Google Scholar]
  • 98.Fotedar, R. et al. Necessity of cycloplegia for assessing refractive error in 12-year-old children: a population-based study. Am. J. Ophthalmol.144, 307–309 (2007). [DOI] [PubMed] [Google Scholar]
  • 99.Khan, H. A. et al. Comparison between cycloplegic and noncycloplegic refraction in young adult myopes. Optom. Vis. Sci.101, 470–476 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Sharma, A. & Kiciman, E. Dowhy: An end-to-end library for causal inference. arXiv preprint arXiv:2011.04216 (2020).
  • 101.Li, S.-M. et al. Effect of text messaging parents of school-aged children on outdoor time to control myopia: a randomized clinical trial. JAMA Pediatr.176, 1077–1083 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Doshi-Velez, F. & Kim, B. Towards a rigorous science of interpretable machine learning. arXiv preprint arXiv:1702.08608 (2017).
  • 103.Sundararajan, M., Taly, A. & Yan, Q. Axiomatic attribution for deep networks. In International Conference on Machine Learning (eds. Precup D. & Whye Teh Y.) 3319–3328 (PMLR, 2017).
  • 104.Adebayo, J. et al. Sanity checks for saliency maps. In Advances in Neural Information Processing Systems (ed. Bengio, S., Wallach, H., Larochelle, H., Grauman, K., Cesa-Bianchi, N. & Garnett R.) Vol. 31 (Curran Associates, Inc., 2018).
  • 105.Samek, W., Wiegand, T. & Müller, K.-R. Explainable artificial intelligence: understanding, visualizing and interpreting deep learning models. ITU Journal: ICT Discoveries 1, 39–48 (2018).
  • 106.Alvarez-Melis, D. & Jaakkola, T. S. On the robustness of interpretability methods. arXiv preprint arXiv:1806.08049 (2018).
  • 107.Salih, A. M. et al. A perspective on explainable artificial intelligence methods: Shap and lime. Adv. Intell. Syst.7, 2400304 (2025). [Google Scholar]
  • 108.Kelodjou, G. et al. Shaping up shap: enhancing stability through layer-wise neighbor selection. In Proc. of the AAAI Conference on Artificial Intelligence (ed. Michael W., Jennifer N. & Iryna G.) Vol. 38, 13094–13103 (AAAI Press, 2024).
  • 109.Li, S.-M. et al. Design, methodology and baseline data of a school-based cohort study in central China: the Anyang Childhood Eye Study. Ophthalmic Epidemiol.20, 348–359 (2013). [DOI] [PubMed] [Google Scholar]
  • 110.Pearl, J. & Mackenzie, D. The Book of Why: The New Science of Cause and Effect (Basic Books, 2018).
  • 111.Meek, C. Causal inference and causal explanation with background knowledge. In Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence (UAI'95), 403–410 (Morgan Kaufmann, 1995).
  • 112.Andrews, B., Ramsey, J. & Cooper, G. F. Learning high-dimensional directed acyclic graphs with mixed data-types. In The 2019 ACM SIGKDD Workshop on Causal Discovery, (ed. Zhang, K. Huang, B., and Glymour, C.)4–21 (PMLR, 2019). [PMC free article] [PubMed]
  • 113.Venter, J. H. On estimation of the mode. Ann. Math. Stat38, 1446–1455 (1967). [Google Scholar]
  • 114.Keith. Battocchi., et al. EconML: A Python Package for ML-Based Heterogeneous Treatment Effects Estimation, Version 0.xhttps://github.com/py-why/EconML (2019).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information (602.8KB, pdf)

Data Availability Statement

The datasets analyzed in the current study are not publicly available due to patient privacy purposes, but are available upon reasonable request to the corresponding author Shi-Ming Li (lishiming81@163.com).

The code can be made available upon reasonable request to the corresponding author Xu Yang (pyro_yangxu@bit.edu.cn).


Articles from NPJ Digital Medicine are provided here courtesy of Nature Publishing Group

RESOURCES