Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2024 Oct 7;14:23282. doi: 10.1038/s41598-024-72158-9

Prediction of adolescent depression from prenatal and childhood data from ALSPAC using machine learning

Arielle Yoo 1,2,3,#, Fangzhou Li 1,2,3,#, Jason Youn 1,2,3,#, Joanna Guan 4,5, Amanda E Guyer 5,6, Camelia E Hostinar 4,5, Ilias Tagkopoulos 1,2,3,
PMCID: PMC11458604  PMID: 39375420

Abstract

Depression is a major cause of disability and mortality for young people worldwide and is typically first diagnosed during adolescence. In this work, we present a machine learning framework to predict adolescent depression occurring between ages 12 and 18 years using environmental, biological, and lifestyle features of the child, mother, and partner from the child’s prenatal period to age 10 years using data from 8467 participants enrolled in the Avon Longitudinal Study of Parents and Children (ALSPAC). We trained and compared several cross-sectional and longitudinal machine learning techniques and found the resulting models predicted adolescent depression with recall (0.59 ± 0.20), specificity (0.61 ± 0.17), and accuracy (0.64 ± 0.13), using on average 39 out of the 885 total features (4.4%) included in the models. The leading informative features in our predictive models of adolescent depression were female sex, parental depression and anxiety, and exposure to stressful events or environments. This work demonstrates how using a broad array of evidence-driven predictors from early in life can inform the development of preventative decision support tools to assist in the early detection of risk for mental illness.

Subject terms: Psychology, Computer science

Introduction

Depression is an impairing and prevalent disorder, affecting approximately 34% of adolescents aged 10–19 years globally1. Depression is also one of the leading causes of non-fatal disability2 and a major risk factor for suicide, the second leading cause of death for people aged 10–34 in the United States3. The main symptoms of depression include low mood, diminished interest or pleasure, change in sleep, weight or appetite, decrease in energy, feelings of worthlessness or guilt, and frequent thoughts of death or suicide4. Although these symptoms most often reach clinical levels of concern leading to diagnosis during adolescence, the developmental processes resulting in adolescent depression start years prior5, potentially even during gestation6. Through a hypothesized mechanism of stress sensitization, exposure to adverse early-life experiences such as parental mental illness or financial hardship is thought to increase the risk of future depression7,8. The connection between challenging childhood experiences and the risk of depression offers a chance to identify at-risk children early. Identification can be done by combining information about stressful events children have faced with data from comprehensive developmental assessments linked to depression risk (e.g., socio-emotional, cognitive, and biological). Early identification by age 10 of children at risk of developing depression during their adolescent years would provide new avenues for preemptive interventions, thereby reducing suffering, adolescent mortality, and treatment costs associated with depression9,10.

One prominent depression model is the multilevel biopsychosocial model put forth by Garber5. This model is based on evidence that depression is a complex condition influenced by various social-contextual, psychological, and biological factors, with no single factor accounting for a significant portion of the risk for depression5. Accordingly, individual factors (e.g., cognitive vulnerabilities, temperament, biological factors such as elevated inflammation or physical health problems) and social-contextual risk factors (e.g., stressful life events, poor quality of relationships with parents and peers, family socioeconomic status, neighborhood conditions) combine dynamically across development to heighten the risk for depression5. More recent evidence supports this model, illustrating the premise that no single risk factor causes depression and rather that multiple social, cognitive, and biological factors are linked to increased risk of depression1113. Several of these factors or features have been identified as being particularly important for predicting depression. One of the most important features is parental depression, as parental history of major depression disorder (MDD) increases a child’s likelihood of being diagnosed with MDD three to fivefold1416. Other important features include child gender (adolescent girls and women are at higher risk for depression)17, stressful family and social circumstances1820, and cognitive and emotion regulation deficits21,22. Some of these features also relate to other features that influence mental health in general, such as parental incarceration affecting income and employment opportunities23,24, which makes this problem complex as non-linear interactions between multiple features can contribute to depression. Challenges encountered within previous studies aiming to predict adolescent depression include small sample sizes25 and the use of cross-sectional data that does not assess depression frequently26. Because depression is complex and multifaceted, predicting it accurately requires methods that can consider numerous factors and offer reliable measures for each potential outcome.

Machine Learning (ML) is a process where machines use algorithms to detect patterns from data without being explicitly programmed27. ML has led to unprecedented advances in many areas of medicine, from diagnosing heart disease28 and cancer29 to detecting schizophrenia through speech recognition30. ML is well-suited for learning from large multidimensional datasets in order to predict later clinical outcomes. More specifically, supervised ML methods use labeled data to make predictions on unseen data by minimizing the predictive error on the observed training data31 and could be useful for efforts to predict clinical outcomes such as depression. This process allows the method to learn the relations between the independent variables and the labeled dependent variable31. Some of the ML algorithms perform predictions by analyzing data cross-sectionally (i. e. without taking into account when each feature was recorded, thus making each observation a separate feature). In contrast, there are also time-series ML methods where data for the same construct recorded at different times is considered as the same feature containing a series of observations indexed by time32. Generally, cross-sectional data is much easier to obtain than time-series data thus making cross-sectional ML methods easier to apply. However, time-series ML methods are better for modeling temporal disease dynamics33,34. While there is some consensus on the kinds of ML algorithms that are appropriate for certain tasks, in general, it is difficult to determine a priori the algorithm that will perform best on a specific dataset35. Since prior work comparing models for depression prediction has not considered time-series model architectures36,37, it remains unknown whether models designed for time-series data could provide an advantage for depression prediction in comparison to simpler models that are more agnostic to temporality. Thus, we evaluated both approaches, one that considers the temporal sequence of the features in its architectural design and one that does not, as each approach could yield unique insights into the most predictive features.

Depression is typically diagnosed using clinical interviews and screening questionnaires with thresholds to determine whether the individual meets the criteria for a diagnosis of depression (e.g., the Diagnostic and Statistical Manual of Mental Disorders (DSM) criteria for MDD)38. For predicting depression, ML has been used in a sample of patients with brain injury39 in a large study of 6588 Korean adults40. ML has also been used to detect current depression based on the content of Twitter posts41, wearable mobile sensor data42, patterns of brain activity captured using electroencephalography data43, and to predict treatment outcomes for depression in adults4446. For predicting adolescent mental health and depression trajectories years in advance, it was found that prediction accuracy results were mixed in terms of clinical relevance47,48. Additionally, prior research used ML methods for predicting depression applied to the Avon Longitudinal Study of Parents and Children (ALSPAC) dataset which includes demographic, clinical, and survey features49,50. This prior work focused on classifying depression trajectories from adolescence through adulthood51 and predicting depression during early adulthood (ages 23 through 28)52. Although features related to adult outcomes of depression are important to consider in depression development, we were interested in identifying a general model for predicting adolescent depression since the first onset of depression is often in adolescence53 and early intervention opportunities may improve adult outcomes9,10.

In this work, we built ML predictors for adolescent depression using the ALSPAC dataset, focusing on features from the prenatal period to age 10. To this end, we generated six different datasets to predict adolescent depression at five different time points (ages 12, 13, 16, 17, and 18 years) as well as to predict having depression at any time during these time points. We focused on these time points because the incidence of depression in the population increases steadily from age 12 to 1854, with many individuals having their first depression onset within this developmental period53. Through both statistical feature analysis and recursive feature elimination (RFE)55, we were able to identify and select the subset of features critical to the accurate prediction of adolescent depression and build predictors with an average recall of 0.59 and specificity of 0.61.

Methods

Dataset

The samples in this study consist of participants from the ALSPAC49,50. ALSPAC is an ongoing birth cohort study that has been following more than 14,000 participants from the prenatal period into adulthood to understand the role of environmental and genetic factors in shaping a wide range of developmental and health outcomes. Mothers were recruited if they had an expected delivery date between April 1, 1991, and December 31, 1992, and lived in the former county of Avon in the United Kingdom (UK). There were 20,248 eligible pregnancies in this region and time period. The initial recruitment resulted in a sample of 14,541 pregnant mothers, resulting in 14,676 fetuses, 14,062 of whom were alive at birth and 13,988 children who were alive at one year of age. Later efforts to bolster the initial sample with eligible cases who had failed to join the initial study yielded a total sample size for analyses of 15,645. A 10% sample of the ALSPAC cohort, known as the Children in Focus (CiF) group, attended clinics at the University of Bristol at various time intervals between 4 and 61 months of age. The CiF group was chosen at random from the last 6 months of ALSPAC births (1432 families attended at least one clinic). Excluded were those mothers who had moved out of the area or were lost to follow-up and those partaking in another study of infant development in Avon. The ALSPAC dataset includes many waves of data collection, including questionnaires completed by children, parents, and teachers; administrative records; observational data; clinical assessments; and biological samples. The study website contains details of all available data through a fully searchable data dictionary and variable search tool: http://www.bristol.ac.uk/alspac/researchers/our-data/. For further information regarding sample enrollment, participant characteristics, and general study methodology, refer to publications from the ALSPAC team that have profiled this cohort49,50. Ethical approval for the study was obtained from the ALSPAC Ethics and Law Committee and the Local Research Ethics Committees. Informed consent for the use of the data collected via questionnaires and clinics was obtained from participants following the recommendations of the ALSPAC Ethics and Law Committee at the time. All methods were performed in accordance with relevant guidelines and regulations, and all participants gave informed consent.

Sample description

From the original ALSPAC dataset, we gained access to 6163 features from 15,645 participants measured at different time points and used data from 8467 participants with depression data at one or more time points from age 12 to age 18. Features represented major domains of child functioning (cognitive, social, emotional, and biological), as well as captured known risk factors for depression, based on prior theory5,11,12 (Fig. 1, Table 1). Some of these domains include socioeconomic factors, child-peer relationships, child psychopathology, child physical health, and parents’ perceived social support. See Table 1 for a list of all domains and ages recorded. We pruned the dataset by merging the features that are potential duplicates, relevant to only a specific subset of samples, and consistent with prior publications (Supplementary Information 1.1), resulting in 885 features.

Fig. 1.

Fig. 1

Overview of the adolescent depression prediction framework. (a) The Avon Longitudinal Study of Parents and Children (ALSPAC) dataset is a long-term study spanning over two decades since the early 90 s in the Bristol, UK area, which includes features like questionnaires, hospital records, and lab samples of the child, mother, and her partner from the gestation stage through adolescence. We use this ALSPAC dataset to generate 6 derivative datasets, five for predicting depression at each target age (12, 13, 16, 17, and 18) and one for predicting depression diagnosis any time between ages 12–18, using features from the gestation stage to age 10. These are represented in the figure as Dep12, Dep13, Dep16, Dep17, Dep18, and Dep12-18. (b) The model selection pipeline selects the best combination of feature selection (FS), missing value imputation (MVI), outlier detection (OD), and binary classification (CLS) for each derived dataset. For each combination, we also performed hyperparameter tuning using a fivefold cross-validation optimized for F1-score. (c) Once the best model pipeline is selected for each dataset, we run the recursive feature elimination (RFE) to reduce the number of features while retaining the model performance. TPR stands for true positive rate and FPR stands for false positive rate.

Table 1.

Domains and periods covered by the unique features used across the models.

Domains Prenatal Ages 0–2 Ages 3–5 Ages 6–7 Age 8 Age 9 Age 10
Child sex at birth X
Socioeconomic factors (parents, grandparents, neighborhood)* X X X X X X X
Parental marital status (divorce, separation, etc.) X X X X X X X
Parent–child relationship (e.g., observed interaction, self-reports) X X X X X
Child-peer relationships (e.g., friendships, popularity, prosociality, antisociality) X X X
Child temperament (e.g., activity, approach, adaptability, emotionality, distractibility, shyness) X X X
Child psychopathology (e.g., depression, emotional and behavioral difficulties) X X X X
Child self-esteem & locus of control X X
Child biomarkers (e.g., CRP, IL6, parathyroid, cotinine, HDL, Vitamin D, calcium, albumin, hemoglobin, leptin) X X
Child anthropometrics & physiology (BMI, fat mass assessed with DEXA, hip, waist, height, heart rate) X X X X
Child cognition, school-related variables, & extracurriculars X X X
Child physical health (e.g., general health, hospitalizations) X X X X X
Child athletics/physical activity & physical appearance X X X
Child pubertal development X X X
Child stressful life events, trauma, maltreatment (e.g., physical abuse, sexual abuse, separations from parents) X X X X X X
Parental psychopathology (e.g., depression) & stress X X X X X X
Parents’ perceived social support and social network index X X X X X

*Socioeconomic factors included measures of educational attainment, income, social class score based on occupation (CAMSIS score) assessed at multiple ages, as well as the neighborhood quality index, neighborhood stress score, and mother’s and father’s score for their opinion of whether their neighborhood is a good place to live.

Depression variable description

We used the Short Mood and Feelings Questionnaire (SMFQ)56 from the ALSPAC dataset as our depression variable. The SMFQ is a validated depression screening questionnaire5658 that scores questions related to core depression symptoms between 0 and 26 using the interval scale56. We used the SMFQ to assign the dependent binary variable (depressed versus not depressed) at each target age (12, 13, 16, 17, and 18). Children with a score < 12 were assigned to the class ‘not depressed’ (label 0)’ and children with a score of 12 or higher were assigned to the class ‘depressed’ (label 1)’. Note that ALSPAC did not include the SMFQ measure for ages 11, 14, and 15, and thus, our analysis excludes the prediction at these ages. Features from the gestation period up to 10 years old (inclusive of age 10) were used as independent features.

Cross-sectional data cleaning

We then processed this ALSPAC dataset in two different ways to be used for downstream ML analyses, resulting in two types of derivative datasets. The first dataset type was used to predict adolescent depression at each of the five target ages. To this end, for each target age, we cleaned the dataset by first removing the samples with a missing target variable, reducing the sample size (Supplementary Table 1, Supplementary Spreadsheet 1). We then removed the independent features that were obtained after ten years old, were constant, or had a proportion of missing values greater than or equal to 60% (Supplementary Figs. 17, Supplementary Table 2). We chose this missing value cutoff because it retained the most number of features without necessitating imputing a large majority of the data. Finally, we reduced the feature space by calculating the Pearson correlation coefficient between all pairwise combinations of the independent features and the target variable. We used a threshold of p-value < 0.05 to discard the independent features that were not significantly correlated with the target depression outcome variable (Supplementary Table 1, Supplementary Spreadsheet 1).

In this work, we refer to these cross-sectional datasets prepared for each target age as Dep12 (6715 samples, 255 features), Dep13 (6015 samples, 279 features), Dep16 (4993 samples, 338 features), Dep17 (4496 samples, 318 features), and Dep18 (3334 samples, 299 features), respectively. We also created an additional type of dataset to predict adolescent depression anytime between ages 12 and 18. We were interested in predicting adolescent depression anytime between ages 12 and 18 in order to assess the general risk of a child becoming depressed during adolescence in addition to specific target ages. For this objective, we merged the dependent variables at five target ages into one by assigning ‘not depressed (label 0)’ if the child had a score below the clinical cutoff at all five time points and ‘depressed (label 1)’ if the child had a score that was above the clinical cutoff for at least one of the five time points. We refer to the resulting dataset that has 1799 samples and 266 features as Dep12-18 in the manuscript. The number of samples and features after each data cleaning step can be found in Supplementary Table 1. Correlational statistics linking the predictors to depression at each time point and missing value ratios for the predictors can be found in Supplementary Spreadsheet 1.

Cross-sectional model selection

To select the best model configuration that predicts the depression status of an adolescent, we designed an exhaustive model selection pipeline to select the optimal combination of preprocessing options and a classifier. The pipeline was developed primarily using the scikit-learn60, imbalanced-learn61, and MLxtend62 Python packages. The overall steps included preprocessing, which transformed the data such that it can be used for ML methods, and classification, which used the transformed data to learn relations among the predictors and depression. During the preprocessing step, the pipeline included, in the mentioned order, a categorical encoding method (one-hot encoding), three feature scaling (FS) methods (standard, minmax, robust63), three missing value imputation (MVI) methods (k-Nearest Neighbors (KNN)64, Multivariate Imputation by Chained Equations (MICE)59, MissForest65), and three outlier detection (OD) methods (isolation forest (IF)66, local outlier factor (LOF)67, no outlier removal (none)).

During the classification step, the pipeline included seven binary classification (CLS) methods (decision tree68, gaussian naïve Bayes (NB)69, multinomial NB70, support vector classifier (SVC)71, AdaBoost72, RandomForest73, multilayer perceptron (MLP)74) with the synthetic minority over-sampling technique (SMOTE)75 for up-sampling and a grid search to find the optimal hyperparameters. Up-sampling makes the dataset balanced while training so that the classifier is less biased toward predicting the majority class75, which in this case was “not depressed”. For each dataset, we randomly selected 20% for a held-out test set, and the rest was used for fivefold cross-validation, where the validation set was used to find the optimal hyperparameters of the classifiers using grid search as well as the best model combination using the F1-score. See Supplementary Tables 3 and 4 for the hyperparameters space used during the fivefold cross-validation grid search and the best hyperparameters selected from the grid search.

Feature analysis

To understand which features are important in improving the classification performance, we performed sequential backward feature selection (SBFS)55, a specific form of recursive feature elimination (RFE). SBFS attempts to reduce the number of feature spaces to a smaller size by sequentially removing features until the best subset of features that is most relevant to the prediction is obtained. We applied the SBFS to the six best models selected from the model selection pipeline above and chose the optimal subset of features based on the smallest subset of features within one standard deviation of the best cross-validation performance.

Time-series data cleaning

We converted the pruned ALSPAC dataset to a time-series format. We first unified similar features’ coding to have the same meaning across all the time points at which they were recorded, such that they could be part of the same time series. Even if a feature was recorded for only one time point, we still treated it as a time-series feature. We also removed 40 features averaged across multiple time points and removed one cross-sectional duplicate variable. After this processing, we had 380 time-series features, which consisted of 377 independent features and three features for the participant identification (ID), timestamp, and target feature.

To reduce the feature space, we calculated a t-test between the depressed and not depressed samples for each feature and time stamp. If an independent feature did not have a p-value of < 0.05 for at least one of its time stamps, the feature was not included in the dataset. For the 12 through 18 age group, we removed 68.4% of the features. In this work, we refer to these datasets prepared for the 12 through 18 age group as Dep12-18TS (1799 samples, 120 features). See Supplementary Spreadsheet 2 for more details. The remaining data preprocessing steps were the same as that of the cross-sectional analysis.

Time-series model selection

To select the best model configuration for predicting depression from the time-series data, we designed a time-series model selection pipeline to find the optimal preprocessing options and classifier, similar to the cross-sectional depression prediction. The pipeline was developed primarily using the scikit-learn60, imbalanced-learn61, MLxtend62, PyTorch76, and skorch77 Python packages. The time-series pipeline followed the same overall steps as the cross-sectional pipeline, which were preprocessing and classification. During the preprocessing step, the pipeline included three MVI methods (Last Observation Carried Forward (LOCF)78, Next Observation Carried Backwards (NOCB)78, Simple Imputer79), a categorical encoding method (one-hot encoding), and two FS methods (standard63, standard by sample) in the order listed. Since features in our time-series data were not measured frequently, we decided to skip outlier detection because many time-series outlier detection methods rely on data from neighboring time points for detection and smoothing. For additional information on how MVI methods were modified, see Supplementary Information 1.2.

For classification, we tested recurrent neural network (RNN)80,81 and Long Short-Term Memory (LSTM)82 with random resampling to upsample the minority class while training and a grid search to find the optimal hyperparameters. For each dataset, we randomly selected 20% for a held-out test set. The rest was used for fivefold cross-validation, where the validation set was used to find the optimal hyperparameters of the classifiers and the best model combination using the F1-score using grid search. See Supplementary Tables 5 and 6 for the hyperparameter space used during the fivefold cross-validation grid search and the best hyperparameters selected from the grid search.

Significance statement

Depression is a major cause of disability and mortality for young people worldwide. Although depression is typically first diagnosed during adolescence, this outcome results from a developmental process that begins many years prior, as early as the prenatal period for some children, creating opportunities for early identification of children at risk. Unresolved questions are whether depression can be accurately predicted early in life and what factors are most predictive. The current study used ALSPAC data from the prenatal period to age ten years for 8467 participants to predict depressed status between ages 12–18 years. Overall prediction accuracy was 64%, and female sex, parental depression and anxiety, and exposure to stressful events or environments were identified as leading predictors of adolescent depression.

Results

Descriptive statistics

The number of participants with depression data in the dataset was 6715 at the age of 12 and it dropped to 3334 at the age of 18 due to missing data, while the number of independent variables included ranged from 255 at the age of 12 to 338 at the age of 16 (Supplementary Table 1, Fig. 2a). The class distribution in the datasets was unbalanced, such that the prevalence of depression was as little as 5.3% at age 12 and 33.9% when considering youth who were above the clinical cutoff for at least one time point between age 12 and age 18 (Fig. 2a). This prevalence is comparable to other estimates of the lifetime prevalence of depression obtained with self-report mental health questionnaires of participants from the UK Biobank study (29 to 35% depending on the measure, definition, and subsample used)83. The application of RFE for feature selection provided better visualization of the underlying structure of the adolescent population, as the t-SNE shows the Dep12-18 data forms groups after RFE that were relevant to depression prediction (Fig. 2b,c, Supplementary Fig. 8, Supplementary Spreadsheet 3).

Fig. 2.

Fig. 2

Statistics of the preprocessed ALSPAC data. (a) Number of participants and features for each derivative dataset generated from ALSPAC (Dep12, Dep13, Dep16, Dep17, Dep18, Dep12-18, from left to right). The percentages of depressed participants for each dataset are also on the graph in orange. (b) t-distributed Stochastic Neighbor Embedding (t-SNE) plot of the Dep12-18 data where all 266 independent features were used to generate the plot. (c) t-SNE plot of the Dep12-18 data where only the subset of 14 features after performing recursive feature elimination (RFE) was used to generate the plot. (d) Top 10 features (from top to bottom) highly correlated to the target variables. For each dataset, we calculated the Pearson correlation between the independent features and the target variable. We then assigned a rank to each feature, where the feature with the highest absolute correlation coefficient was assigned rank 1. We then averaged these ranks across all 6 datasets and identified the top 10 features (left). The respective correlation coefficient is shown on the right. The box represents the interquartile range, the middle line represents the median, the whisker line extends from minimum to maximum values, and the diamond represents outliers. The colored circles denote the raw data points (n = 6, 6 datasets). SDQ stands for the Strengths and Difficulties Questionaire while SMFQ stands for the Short Mood and Feelings Questionnaire. For the characters indicating age, w indicates weeks, m indicates months, and g indicates gestation. One duplicate feature was removed from the plot (Supplementary Information 1.1). (e) Top 10 features (from top to bottom) identified after performing RFE. For each dataset, we performed the model selection pipeline and performed RFE on the best model pipeline. Then, we ranked the RFE selected features according to the RFE results and sorted the features according to their number of appearances across the 6 datasets (e.g., # of appearance = 6 means that this feature was selected by RFE for all 6 datasets). The features that share the same # of appearances were further sorted incrementally by their average rank to identify the top 10 features (left). We also included the Pearson correlation from these features in the training data for all 6 datasets (right). The box represents the interquartile range, the middle line represents the median, the whisker line extends from minimum to maximum values, and the diamond represents outliers. The colored circles denote the raw data points (n = 6, 6 datasets), however, some features do not have data points for all datasets due to data preprocessing. SDQI stands for the Self Description Questionnaire-I. Freq stands for frequency. One duplicate feature was removed from the plot (Supplementary Information 1.1).

A rich repertoire of diverse features correlates with depression

In this work, we predicted adolescent depression from ALSPAC data. We generated six cross-sectional datasets from the ALSPAC dataset for each target age (predicting depression at ages 12, 13, 16, 17, and 18 years using features from the prenatal period to age ten entered simultaneously) and all target ages combined (predicting depression anytime between ages 12 to 18) by cleaning the feature set and keeping only the features correlated to the target variable (p-value < 0.05, see Methods). To identify the features consistently correlated with depression, we calculated the Pearson correlation coefficient between the independent and dependent variables for each of the six datasets. Finding the features that ranked at the top across all the datasets revealed that the child-reported depression score at age 10 was most strongly correlated to predicting depression across all ages (ρ = 0.17 ± 0.03, p-value < 5.55 × 10–9) (Fig. 2d), while the Strengths and Difficulties Questionnaire (SDQ) total difficulties score at 108 months, which is a composite measure of multiple behavioral and emotional indices including hyperactivity, emotional symptoms, conduct problems, and peer problems, ranked as the second most important feature (ρ = 0.12 ± 0.03, p-value < 1.76 × 10–8) (Fig. 2d). Next, the child’s SMFQ depression score reported at 91 months was positively correlated with depression (ρ = 0.12 ± 0.03, p-value < 1.07 × 10–7), females were more likely to have depression than males (ρ = 0.13 ± 0.04, p-value < 1.76 × 10–8), low math abilities were associated with greater likelihood of depression (ρ = − 0.10 ± 0.02, p-value < 4.90 × 10–6), a child whose mother had a higher Edinburgh postnatal maternal depression score when the child was 33 months was more likely to have depression (ρ = 0.10 ± 0.02, p-value < 2.57 × 10–6), a child with higher self-esteem was less likely to have depression (ρ = − 0.11 ± 0.01, p-value < 8.95 × 10–5), a child whose mother reported high maternal depression on the Crown Crisp Experiential Index (CCEI) at 18 weeks gestation was more likely to have depression (ρ = 0.10 ± 0.02, p-value < 1.61 × 10–6), a child with a mother who had a high Edinburgh maternal depression score at 32 weeks gestation was more likely to have depression (ρ = 0.10 ± 0.02, p-value < 2.64 × 10–5), and a child who scored higher on the difficulties score from the SDQ at 108 months was more likely to have depresssion (ρ = 0.10 ± 0.03, p-value < 2.42 × 10–5) (Fig. 2d). The ranked lists of the features assigned using the Pearson correlation coefficient for each dataset are similar to each other when compared to the random baseline (rank-based overlap (RBO)84 = 0.48 ± 0.08 vs. 0.10 ± 0.02, respectively, p-value = 2.87 × 10–16; Supplementary Information 1.3).

Adolescent depression can be predicted but with low confidence

We ran the model selection pipeline to select the best combination of preprocessing steps (feature scaling, missing value imputation, and outlier detection) and binary classifiers for all six datasets, as well as the RFE (see Methods and Fig. 1). Evaluation of these model selection pipelines, which have been optimized for the F1-score using a fivefold cross-validation and held-out test set, showed that robust for FS (4 out of 6), either KNN or MICE for MVI (3 for each), either none or LOF for OD (3 for each), and MLP for the classifier (4 out of 6) were the best combination (Supplementary Table 7). The prediction performance increased as the target age increased, with an F1-score of 0.13 at age 12 to 0.37 at age 18 (173.9% increase), while the baseline (always predict ‘depressed’) was 0.09 and 0.32, respectively (Fig. 3a). We obtained the best F1-score of 0.54 for the merged dataset Dep12-18 (baseline 0.52). It is possible that this trend occured because the class imbalance between depressed and not depressed samples decreases as age increases, which reduced the amount of up-sampling occurring (Fig. 2a). The best-performing models for all datasets also outperformed their respective baselines in both the area under the precision-recall curve (AUCPR) and the area under the receiver operating characteristic curve (AUROC) (Fig. 3b,c; Supplementary Tables 813). The time-series model optimized for the Dep12-18TS data (standard for FS, NOCB for MVI, and RNN for classifier) did not perform better than the baseline nor the best cross-sectional F1 score (0.49, 0.51, and 0.54, respectively), although the models AUCPR and AUROC were better than the baseline (see Fig. 3 and Supplementary Tables 13 and 14). See Supplementary Fig. 17 and Supplementary Information 1.4 for precision-recall (PR) and receiver operating characteristic (ROC) curves when taking a subset of the features of the Dep12-18TS data.

Fig. 3.

Fig. 3

Performance and evaluation of the machine learning models. (a) Precision, recall, specificity, accuracy, and F1-score for the 6 best performing models for each cross-sectional dataset after performing the RFE and for the best-performing time-series model for the Dep12-18TS dataset. The whisker line and the corresponding percentage value denote the change in score for the cross-sectional models after the RFE was performed. (b,c) The precision-recall (PR) and receiver operating characteristic (ROC) curves of the 6 best performing models for each cross-sectional dataset after the RFE and of the best-performing time-series model for the Dep12-18TS dataset obtained from the held-out test set. The operating point was selected using the F1-score. The area under the precision-recall curve (AUCPR) and the area under the receiver operating characteristic curve (AUROC) are shown in the legends. See Supplementary Figs. 9 and 10 for PR and ROC curves before RFE and Supplementary Figs. 1116 for the RFE curves. (d) Number of features before and after running the RFE for each cross-sectional dataset.

A few features are adequate to predict adolescent depression

Figure 2e depicts the top features for each model, with “Child sex” and “Child depression score at Focus 10 assessment” being some of the top features, consistent with the univariate analysis (Fig. 2d). Through RFE, we were able to significantly reduce the feature size from 73.0% (318 to 86) for Dep17 up to 94.7% (266 to 14) for Dep12-18 (Fig. 3d) with insignificant changes to all of the test metrics before and after the application of the RFE (precision p-value = 4.6 × 10–1, recall p-value = 1.9 × 10–1, specificity p-value = 2.4 × 10–1, accuracy p-value = 3.2 × 10–1, and F1-score p-value = 9.8 × 10–1; Supplementary Tables 813). Although the ranked lists of the remaining features after the RFE for each dataset were similar to each other when compared to the random baseline (RBO = 0.14 ± 0.06 vs. 0.10 ± 0.06, respectively, p-value = 3.30 × 10–2; Supplementary Information 1.3), only child sex appeared in all six post-RFE subsets, whereas child-reported depression score at age 10 appeared in all subsets except for Dep18 (Fig. 2e, Supplementary Table 15, Supplementary Fig. 18, Supplementary Spreadsheet 4). Interestingly, the RFE-ranked subfeatures as a whole were not correlated with the features ranked using the Pearson correlation coefficient for any of the datasets (τ = 0.04, p-value = 3.51 × 10–1 for Dep 12; τ = 0.05, p-value = 2.00 × 10–1 for Dep 13; τ = − 0.02, p-value = 5.22 × 10–1 for Dep 16; τ = 0.04, p-value = 2.50 × 10–1 for Dep 17; τ = 0.02, p-value = 6.70 × 10–1 for Dep 18; τ = − 0.01, p-value = 8.14 × 10–1 for Dep 12–18). Comparing the lists of the top five predictors identified with RFE for each age when the depression outcome was measured revealed both common predictors across different ages and predictors that were more influential in predicting depression at a specific age (see Supplementary Table 16). Female sex and various indices of maternal depression in childhood were among the top five predictors for depression at multiple ages. The quality of the relationship with parents at age 10 was among the top five predictors for depression at age 12 but not for any later time points. Childhood problems with peers and not being good with athletics were among the top five predictors for depression at age 13 but not other ages. Participants’ scholastic competence score and teacher ratings of their general knowledge in childhood were among the top five predictors for depression at ages 16 and 18, respectively.

Discussion

Our study revealed multiple statistically significant associations of early life environmental, socio-emotional, cognitive, and biological variables with adolescent depression, several of which lend empirical support, based on a large sample and numerous variables, to the stress accumulation and stress sensitization theories of depression7,8. Although we predicted adolescent depression with adequate performance, we were unable to reach the recommended level for high clinical relevance as it has been proposed that precision of 80% or more would be necessary to reach high clinical relevance (i.e., if 80% of patients predicted to have a disorder using a prediction algorithm would be true positives according to subsequent diagnosis)85. This may relate to this dataset being collected from a community sample as opposed to a clinic-referred cohort. While we used time-series models like RNN and LSTM, we did not observe a performance gain over the cross-sectional models, which we hypothesized to be due to the sparsity of the dataset with many missing values, such that 38.8% of the features (343 out of 885) have more than 50% missing values. As a whole, the large amount of missing data and the limitations of imputation procedures may have also constrained the prediction performance of the final models.

We identified a subset of features that carried more predictive power for each derivative dataset and are consistent with prior theory and empirical research. Female sex was a leading feature across all models, consistent with the 2:1 female-to-male prevalence ratio found in epidemiological studies8688 and emerging research about the role of the estrogen receptor in depression89. Parental depression and anxiety also emerged as leading features (e.g., Edinburgh postnatal depression scores at 32 weeks gestation and 33 months postnatal and maternal depression as measured on the CCEI, measured at 18 weeks gestation from the top 10 Pearson correlated features (Fig. 2d), and frequency the mother was depressed when the child was aged 33 months from the top 10 RFE features (Fig. 2e)), consistent with a prior study using ML to predict depression in youth using the Adolescent Brain Cognitive Development dataset90 and other studies on the genetic and environmental impacts of being raised by parents who suffer from mood and anxiety disorders91,92. Furthermore, other recent work using ML found that the parental emotional state was an important predictor of depression trajectories in early adolescence48. Consistent with leading theories of depression as a disorder of stress accumulation and stress sensitization7,8 and prior empirical evidence48, features indicating exposure to stressful events or stressful environments (financial hardship, parental unemployment, poor neighborhood quality, child life events, etc.) emerged as important in our analysis (e.g., SDQ total difficulties score at 108 months from the top 10 Pearson correlated features (Fig. 2d); weighted life events score at 108 months from the top 10 RFE features (Fig. 2e); variables related to job and employment status of the mother and partner being selected by RFE for all except Dep12 (Supplementary Spreadsheet 4); and variables related to the mother and partner’s opinion of the neighborhood being selected by RFE for Dep17 and Dep12-18 (Supplementary Spreadsheet 4)). The predictive value of composite variables such as child life events scores supports stress accumulation theories of depression, given that children who accumulated more adverse events were more vulnerable to depression during adolescence. Additionally, the predictive value of early-life variables, such as maternal perceived social support during her child’s infancy, provides support for early-life stress sensitization theoretical models, which postulate that stressful early-life experiences can leave an imprint on subsequent developmental trajectories.

In addition, we found that some predictors were influential in predicting depression outcomes at multiple ages across adolescence, whereas others were especially salient in predicting depression at a specific age (Supplementary Table 16, Supplementary Fig. 18). For example, female sex was among the top five predictors for depression at ages 12, 13, 16, 18, and of suffering from depression for at least one time point in the 12–18 age range, suggesting girls have heightened vulnerability across all of adolescence. However, the quality of the relationship with parents only appeared among the top five predictors when predicting depression at age 12, but not at later ages, which may be explained by developmental trends towards increasing independence from parents and increased reliance on peers across adolescence93,94. Consistent with the puberty-related shift from relying on parents to relying on peers, the quality of peer relationships rose among the top five predictors of depression at age 13. Scholastic competence and teacher ratings of general knowledge in childhood rose among the top five predictors for depression at ages 16 and 18, when youth were approaching the end of high school or preparing for university entry, suggesting that a history of academic difficulties may be particularly influential for depression risk around the end of high school. If replicated by future work, these results suggest possible avenues for age-specific prevention and intervention strategies to ameliorate adolescent depression.

Predicting the onset of depression among adolescents is a challenging task because humans function as open dynamic systems in a constant transaction with and adaptation to the environment, and therefore, long-term future outcomes may be causally indeterminate and remain open to multiple developmental pathways at any given time95,96. This may be particularly true when forecasting adolescent outcomes, given that adolescence has been recognized as a developmental period with heightened neural and behavioral plasticity and more openness to the environment compared to other periods97,98. Thus, outcomes during this period may be inherently less predictable than outcomes during adulthood40. Additionally, longitudinal studies on the development of psychopathology, including depression, have also shown substantial evidence of equifinality (individuals can develop the same disorder despite having different developmental histories) and multifinality (individuals can develop different disorders or no disorders despite having similar developmental pathways and growing up in the same homes)99. Equifinality and multifinality pose significant challenges to long-term prediction from prior developmental variables and may highlight the role of stochastic events in development. Stochastic events may make prediction difficult at the individual level, even when research has identified a large number of useful predictors at the population level100. Generally, ML studies have higher prediction accuracy if conducted with adults40, or if they use ML to detect current depressed status106 or aim for short-term prediction101 than studies with adolescents aiming for long-term prediction47.

One potential solution for improving prediction accuracy in future studies would be to use recently obtained features (as opposed to past or chronic life events) to predict depression over shorter time frames (e.g., 1 month or 3 months prior). Recent findings8,101 suggest that combining information on childhood developmental history (e.g., data from gestation to age 10 years as included in the current project) with information about recent acute stressors in adolescence (e.g., from the previous 0–3 months) may be necessary to achieve high accuracy for predicting the onset of depression, though this possibility would limit our ability to implement interventions in a timely fashion several months or years before the onset of depressive symptoms or disorder. Incorporating more in-depth data on stress physiology (e.g., multiple days of cortisol output or hair cortisol measures) may also improve prediction accuracy given prior meta-analytic evidence linking depression to elevated cortisol output107. In addition, the missing data due to attrition over time is a well-recognized challenge in human longitudinal research and is also a limitation of the current cohort study, as it can introduce selection bias. Nevertheless, depression prevalence rates at age 18 in this study were comparable to other reports from studies in the UK, suggesting that results may generalize. Future research should incentivize participant retention. To improve the prediction accuracy, our next steps are to further extend the results by updating our model selection pipeline with additional tools and incorporating multimodal neurobiological48,102104 and genomic features. Whether diversifying the data sources and reducing data missingness will improve prediction accuracy for distant psychiatric outcomes remains to be determined. Because ML methods have provided useful insights in a number of biomedical fields, future research should aim to examine methods of improving prediction accuracy for psychological outcomes. If short- and long-term individualized prediction proves difficult with current methods and assessment tools, cost-effective universal prevention programs may be delivered to all children (e.g., via schools) as an alternative or complementary prevention strategy.

Conclusions

Although prediction performance did not reach clinical relevance in the current study or in other recent attempts to use developmental datasets to predict long-term psychological or behavioral outcomes47,105, this study provides insight into additional approaches that might be successful. Additionally, the predictors influencing depression prediction from our models were consistent with previous work. The leading informative features in our predictive models of adolescent depression were sex, parental depression and anxiety, and exposure to stressful events or environments. This study demonstrates how using a broad array of evidence-driven predictors from early in life can inform the development of preventative decision support tools to assist in the early detection of risk for mental illness.

Supplementary Information

Acknowledgements

We are extremely grateful to all the families who took part in this study, the midwives for their help in recruiting them, and the whole ALSPAC team, which includes interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers, receptionists, and nurses. The UK Medical Research Council, Wellcome (Grant ref: 217065/Z/19/Z), and the University of Bristol provide core support for ALSPAC. A comprehensive list of grant funding is available on the ALSPAC website (http://www.bristol.ac.uk/alspac/external/documents/grant-acknowledgements.pdf). This research was specifically funded by MRC Grant G0401540 73080. This publication is the work of the authors (Yoo, Li, Youn, Guan, Guyer, Hostinar, and Tagkopoulos), who will serve as guarantors for the contents of this paper. AEG, CEH, and IT were supported by R21MH125346. We would also like to thank LillyBelle Deer, Ph.D., for her assistance with the initial stages of this project.

Author contributions

A.Y., F.L., and J.Y. have an equal contribution. A.Y., F.L., and J.Y. performed all computational analyses. J.Y. and A.Y. created the figures. J.G. converted the ALSPAC data into a time-series format. J.Y., A.G., C.H., and I.T. contributed to the critical analysis and wrote the manuscript. C.H. and I.T. conceived and supervised all aspects of the project.

Data availability

All code and instructions on how to reproduce the results can be found at https://github.com/IBPA/DepressionProject.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Arielle Yoo, Fangzhou Li, and Jason Youn.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-024-72158-9.

References

  • 1.Shorey, S., Ng, E. D. & Wong, C. H. J. Global prevalence of depression and elevated depressive symptoms among adolescents: A systematic review and meta-analysis. Br. J. Clin. Psychol.61(2), 287–305 (2022). [DOI] [PubMed] [Google Scholar]
  • 2.Friedrich, M. J. Depression is the leading cause of disability around the world. Jama317(15), 1517 (2017). [DOI] [PubMed] [Google Scholar]
  • 3.Hedegaard, H., & Warner, M. Suicide mortality in the United States, 1999–2019. (2021). [PubMed]
  • 4.Fried, E. I. & Nesse, R. M. Depression sum-scores don’t add up: Why analyzing specific depression symptoms is essential. BMC Med.13(1), 1–11 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Garber, J. Depression in youth: A developmental psychopathology perspective. Multilevel Dyn. Dev. Psychopathol. Pathw. Futur.34, 181–242 (2007). [Google Scholar]
  • 6.Monk, C., Lugo-Candelas, C. & Trumpff, C. Prenatal developmental origins of future psychopathology: Mechanisms and pathways. Ann. Rev. Clin. Psychol.15, 317–344 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Heim, C., Newport, D. J., Mletzko, T., Miller, A. H. & Nemeroff, C. B. The link between childhood trauma and depression: Insights from HPA axis studies in humans. Psychoneuroendocrinology33(6), 693–710 (2008). [DOI] [PubMed] [Google Scholar]
  • 8.Hammen, C. Depression and stressful environments: Identifying gaps in conceptualization and measurement. Anxiety Stress Coping.29(4), 335–351 (2016). [DOI] [PubMed] [Google Scholar]
  • 9.Gariepy, G., Honkaniemi, H. & Quesnel-Vallee, A. Social support and protection from depression: Systematic review of current findings in Western countries. Br. J. Psychiatry209(4), 284–293 (2016). [DOI] [PubMed] [Google Scholar]
  • 10.Andrés, M. L., de Minzi, M. C., Castañeiras, C., Canet-Juric, L. & Rodríguez-Carvajal, R. Neuroticism and depression in children: The role of cognitive emotion regulation strategies. J. Genet. Psychol.177(2), 55–71 (2016). [DOI] [PubMed] [Google Scholar]
  • 11.Thapar, A., Eyre, O., Patel, V. & Brent, D. Depression in young people. Lancet400(10352), 617–631 (2022). [DOI] [PubMed] [Google Scholar]
  • 12.Pfeifer, J. H. & Allen, N. B. Puberty initiates cascading relationships between neurodevelopmental, social, and internalizing processes across adolescence. Biol. Psychiatry89(2), 99–108 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Chahal, R., Gotlib, I. H. & Guyer, A. E. Research review: Brain network connectivity and the heterogeneity of depression in adolescence—A precision mental health perspective. J. Child Psychol. Psychiatry61(12), 1282–1298 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Lieb, R., Isensee, B., Höfler, M., Pfister, H. & Wittchen, H.-U. Parental major depression and the risk of depression and other mental disorders in offspring: A prospective-longitudinal community study. Arch. Gen. Psychiatry59(4), 365–374 (2002). [DOI] [PubMed] [Google Scholar]
  • 15.Williamson, D. E., Birmaher, B., Axelson, D. A., Ryan, N. D. & Dahl, R. E. First episode of depression in children at low and high familial risk for depression. J. Am. Acad. Child Adolesc. Psychiatry43(3), 291–297 (2004). [DOI] [PubMed] [Google Scholar]
  • 16.Hirshfeld-Becker, D. R. et al. Intrinsic functional brain connectivity predicts onset of major depression disorder in adolescence: A pilot study. Brain Connect.9(5), 388–398 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Nolen-Hoeksema, S. & Girgus, J. S. The emergence of gender differences in depression during adolescence. Psychol. Bull.115(3), 424 (1994). [DOI] [PubMed] [Google Scholar]
  • 18.Goodman, S. H. & Gotlib, I. H. Risk for psychopathology in the children of depressed mothers: A developmental model for understanding mechanisms of transmission. Psychol. Rev.106(3), 458 (1999). [DOI] [PubMed] [Google Scholar]
  • 19.Hammen, C., Shih, J. H. & Brennan, P. A. Intergenerational transmission of depression: Test of an interpersonal stress model in a community sample. J. Consult. Clin. Psychol.72(3), 511 (2004). [DOI] [PubMed] [Google Scholar]
  • 20.Hammen, C. Stress exposure and stress generation in adolescent depression. In Handbook of Depression in Adolescents (Routledge/Taylor & Francis Group, 2009). [Google Scholar]
  • 21.Lakdawalla, Z., Hankin, B. L. & Mermelstein, R. Cognitive theories of depression in children and adolescents: A conceptual and quantitative review. Clin. Child Fam. Psychol. Rev.10(1), 1–24 (2007). [DOI] [PubMed] [Google Scholar]
  • 22.Hammen, C., Brennan, P. A. & Keenan-Miller, D. Patterns of adolescent depression to age 20: The role of maternal depression and youth interpersonal dysfunction. J. Abnorm. Child Psychol.36(8), 1189–1198 (2008). [DOI] [PubMed] [Google Scholar]
  • 23.Esposito, M. H., Lee, H., Hicken, M. T., Porter, L. C. & Herting, J. R. The consequences of contact with the criminal justice system for health in the transition to adulthood. Longitud. Life Course Stud.8(1), 57 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Ross, C. & Mirowsky, J. Education, social status, and health (social institutions and social change) (Taylor & Francis Group, 2003). [Google Scholar]
  • 25.Gao, S., Calhoun, V. D. & Sui, J. Machine learning in major depression: From classification to treatment outcome prediction. CNS Neurosci. Ther.24(11), 1037–1052 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Rost, N., Binder, E. B. & Brückl, T. M. Predicting treatment outcome in depression: An introduction into current concepts and challenges. Eur. Arch. Psychiatry Clin. Neurosci.273, 1–15 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Wazid, M., Das, A. K., Chamola, V. & Park, Y. Uniting cyber security and machine learning: Advantages, challenges and future research. ICT Express8(3), 313–321 (2022). [Google Scholar]
  • 28.Hannun, A. Y. et al. Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network. Nat. Med.25(1), 65–69 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Mariani, M. C., Tweneboah, O. K. & Bhuiyan, M. A. M. Supervised machine learning models applied to disease diagnosis and prognosis. AIMS Public Health6(4), 405–423 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Rezaii, N., Walker, E. & Wolff, P. A machine learning approach to predicting psychosis using semantic density and latent content analysis. npj Schizophr5(1), 1–12 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Mohri, M., Rostamizadeh, A. & Talwalkar, A. Foundations of Machine Learning (MIT Press, 2018). [Google Scholar]
  • 32.Bock, C., Moor, M., Jutzeler, C. R. & Borgwardt, K. Machine learning for biomedical time series classification: From shapelets to deep learning. In Artificial Neural Networks (ed. Cartwright, H.) 33–71 (Springer, 2021). 10.1007/978-1-0716-0826-5_2. [DOI] [PubMed] [Google Scholar]
  • 33.Li, Y., Swift, S. & Tucker, A. Modelling and analysing the dynamics of disease progression from cross-sectional studies. J. Biomed. Inform.46(2), 266–274 (2013). [DOI] [PubMed] [Google Scholar]
  • 34.Morid, M. A., Sheng, O. R. L. & Dunbar, J. Time series prediction using deep learning methods in healthcare. ACM Trans. Manag. Inf. Syst.14(1), 2:1-2:29. 10.1145/3531326 (2023). [Google Scholar]
  • 35.Ahmed, N. K., Atiya, A. F., Gayar, N. E. & El-Shishiny, H. An empirical comparison of machine learning models for time series forecasting. Econom. Rev.29(5–6), 594–621. 10.1080/07474938.2010.481556 (2010). [Google Scholar]
  • 36.Zulfiker, M. S., Kabir, N., Biswas, A. A., Nazneen, T. & Uddin, M. S. An in-depth analysis of machine learning approaches to predict depression. Curr. Res. Behav. Sci.2, 100044 (2021). [Google Scholar]
  • 37.Haque, U. M., Kabir, E. & Khanam, R. Detection of child depression using machine learning methods. PLOS ONE16(12), e0261131. 10.1371/journal.pone.0261131 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Goldman, L. S., Nielsen, N. H., Champion, H. C., for the Council on Scientific Affairs American Medical Association Awareness. Awareness, diagnosis, and treatment of depression. J. Gen. Intern. Med.14(9), 569–580. 10.1046/j.1525-1497.1999.03478.x (1999). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Levin, H. S. et al. Predicting depression following mild traumatic brain injury. Arch. Gen. Psychiatry62(5), 523–528 (2005). [DOI] [PubMed] [Google Scholar]
  • 40.Na, K.-S., Cho, S.-E., Geem, Z. W. & Kim, Y.-K. Predicting future onset of depression among community dwelling adults in the Republic of Korea using a machine learning algorithm. Neurosci. Lett.721, 134804 (2020). [DOI] [PubMed] [Google Scholar]
  • 41.Orabi, A. H., Buddhitha, P., Orabi, M. H., & Inkpen, D. Deep learning for depression detection of twitter users. In Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: From Keyboard to Clinic 88–97 (2018).
  • 42.Mullick, T., Radovic, A., Shaaban, S. & Doryab, A. Predicting depression in adolescents using mobile and wearable sensors: Multimodal machine learning-based exploratory study. JMIR Form. Res.6(6), e35807 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Uyulan, C. et al. Major depressive disorder classification based on different convolutional neural network models: Deep learning approach. Clin. EEG Neurosci.52(1), 38–51 (2021). [DOI] [PubMed] [Google Scholar]
  • 44.Foland-Ross, L. C. et al. Cortical thickness predicts the first onset of major depression in adolescence. Int. J. Dev. Neurosci.46, 125–131 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Chekroud, A. M. et al. Cross-trial prediction of treatment outcome in depression: A machine learning approach. Lancet Psychiatry3(3), 243–250 (2016). [DOI] [PubMed] [Google Scholar]
  • 46.Kannampallil, T. et al. Cross-trial prediction of depression remission using problem-solving therapy: A machine learning approach. J. Affect. Disord.308, 89–97 (2022). [DOI] [PubMed] [Google Scholar]
  • 47.Tate, A. E. et al. Predicting mental health problems in adolescence using machine learning techniques. PloS One15(4), e0230389 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Xiang, Q. et al. Prediction of the trajectories of depressive symptoms among children in the adolescent brain cognitive development (ABCD) study using machine learning approach. J. Affect. Disord.310, 162–171 (2022). [DOI] [PubMed] [Google Scholar]
  • 49.Fraser, A. et al. Cohort profile: The Avon Longitudinal Study of Parents and Children: ALSPAC mothers cohort. Int. J. Epidemiol.42(1), 97–110 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Boyd, A. et al. Cohort profile: The ‘children of the 90s’—the index offspring of the Avon Longitudinal Study of Parents and Children. Int. J. Epidemiol..42(1), 111–127 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Weavers, B. et al. The antecedents and outcomes of persistent and remitting adolescent depressive symptom trajectories: A longitudinal, population-based English study. Lancet Psychiatry8(12), 1053–1061 (2021). [DOI] [PubMed] [Google Scholar]
  • 52.Fraser, H., Kwong, A. S. F., Brooks, M., Davidson, B. I., McConville, R., & Pearson, R. M. Modelling the risk ecosystem of depression using machine learning in a population of young adults. medRxiv; 2023 p. 2023.08.15.23294062. 10.1101/2023.08.15.23294062v1@@
  • 53.Solmi, M. et al. Age at onset of mental disorders worldwide: Large-scale meta-analysis of 192 epidemiological studies. Mol. Psychiatry27(1), 281–295 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Merikangas, K. R. et al. Lifetime prevalence of mental disorders in U.S. adolescents: Results from the National Comorbidity Survey Replication-Adolescent Supplement (NCS-A). J. Am. Acad. Child Adolesc. Psychiatry49(10), 980–989 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Raschka, S. MLxtend: Providing machine learning and data science utilities and extensions to Python’s scientific computing stack. J. Open Sour. Softw.3(24), 638 (2018). [Google Scholar]
  • 56.Messer, S. C. et al. Development of a short questionnaire for use in epidemiological studies of depression in children and adolescents: Factor composition and structure across development. Int. J. Methods Psychiatr. Res.5, 251–262 (1995). [Google Scholar]
  • 57.Turner, N., Joinson, C., Peters, T. J., Wiles, N. & Lewis, G. Validity of the short mood and feelings questionnaire in late adolescence. Psychol. Assess.26(3), 752–762 (2014). [DOI] [PubMed] [Google Scholar]
  • 58.Sharp, C., Goodyer, I. M. & Croudace, T. J. The short mood and feelings questionnaire (SMFQ): A unidimensional item response theory and categorical data factor analysis of self-report ratings from a community sample of 7-through 11-year-old children. J. Abnorm. Child Psychol.34(3), 365–377. 10.1007/s10802-006-9027-x (2006). [DOI] [PubMed] [Google Scholar]
  • 59.Van Buuren, S. & Groothuis-Oudshoorn, K. mice: Multivariate imputation by chained equations in R. J. Stat. Softw.45, 1–67 (2011). [Google Scholar]
  • 60.Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res.12, 2825–2830 (2011). [Google Scholar]
  • 61.Lemaître, G., Nogueira, F. & Aridas, C. K. Imbalanced-learn: A Python toolbox to tackle the curse of imbalanced datasets in machine learning. J. Mach. Learn. Res.18, 1–5 (2017). [Google Scholar]
  • 62.Raschka, S. MLxtend: Providing machine learning and data science utilities and extensions to Python’s scientific computing stack. J. Open Sour. Softw.3(24), 638. 10.21105/joss.00638 (2018). [Google Scholar]
  • 63.Thara, D. K., PremaSudha, B. G. & Xiong, F. Auto-detection of epileptic seizure events using deep neural network with different feature scaling techniques. Pattern Recognit. Lett.128, 544–550 (2019). [Google Scholar]
  • 64.Troyanskaya, O. et al. Missing value estimation methods for DNA microarrays. Bioinformatics17(6), 520–525 (2001). [DOI] [PubMed] [Google Scholar]
  • 65.Stekhoven, D. J. & Bühlmann, P. MissForest—non-parametric missing value imputation for mixed-type data. Bioinformatics28(1), 112–118 (2012). [DOI] [PubMed] [Google Scholar]
  • 66.Liu, F. T., Ting, K. M., & Zhou, Z.-H. Isolation forest. In 2008 Eighth Ieee International Conference on Data Mining 413–22 (2008).
  • 67.Breunig, M. M., Kriegel, H.-P., Ng, R. T., & Sander, J. LOF: Identifying density-based local outliers. In Proceedings of the 2000 ACM SIGMOD international conference on Management of data 93–104 (2000).
  • 68.Breiman, L., Friedman, J. H., Olshen, R. A. & Stone, C. J. Classification and regression trees (Routledge, 2017). [Google Scholar]
  • 69.Zhang, H. The optimality of naive Bayes. Aa1(2), 3 (2004). [Google Scholar]
  • 70.Domingos, P. & Pazzani, M. On the optimality of the simple Bayesian classifier under zero-one loss. Mach. Learn.29(2), 103–130 (1997). [Google Scholar]
  • 71.Platt, J. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Adv. Large Margin Classif.10(3), 61–74 (1999). [Google Scholar]
  • 72.Freund, Y., Schapire, R. & Abe, N. A short introduction to boosting. J. Jpn. Soc. Artif. Intell.14(771–780), 1612 (1999). [Google Scholar]
  • 73.Breiman, L. Random forests. Mach. Learn.45(1), 5–32 (2001). [Google Scholar]
  • 74.Hinton, G. E. Connectionist learning procedures. In Machine Learning 555–610 (Elsevier, 1990). [Google Scholar]
  • 75.Chawla, N. V., Bowyer, K. W., Hall, L. O. & Kegelmeyer, W. P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res.16, 321–357 (2002). [Google Scholar]
  • 76.Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G. et al. PyTorch: An imperative style, high-performance deep learning library. (2019) Available from: http://arxiv.org/abs/1912.01703
  • 77.Tietz, M., Fan, T. J., Nouri, D., & Bossan, B. skorch Developers. skorch: A scikit-learn compatible neural network library that wraps PyTorch. (2017 July). Available from: https://skorch.readthedocs.io/en/stable/
  • 78.Engels, J. M. & Diehr, P. Imputation of missing longitudinal data: A comparison of methods. J. Clin. Epidemiol.56(10), 968–976 (2003). [DOI] [PubMed] [Google Scholar]
  • 79.Che, Z., Purushotham, S., Cho, K., Sontag, D. & Liu, Y. Recurrent neural networks for multivariate time series with missing values. Sci. Rep.8(1), 6085 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Rumelhart, D. E., Hinton, G. E. & Williams, R. J. Learning representations by back-propagating errors. Nature323(6088), 533–536 (1986). [Google Scholar]
  • 81.Cho, K., Van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., et al. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint http://arXiv.org/abs/14061078. (2014).
  • 82.Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput.9(8), 1735–1780 (1997). [DOI] [PubMed] [Google Scholar]
  • 83.Howard, D. M. et al. Genetic stratification of depression in UK Biobank. Transl. Psychiatry10(1), 1–8 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Webber, W., Moffat, A. & Zobel, J. A similarity measure for indefinite rankings. ACM Trans. Inf. Syst. (TOIS)28(4), 1–38 (2010). [Google Scholar]
  • 85.Savitz, J. B., Rauch, S. L. & Drevets, W. C. Clinical application of brain imaging for the diagnosis of mood disorders: The current state of play. Mol. Psychiatry18(5), 528–539 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Salk, R. H., Hyde, J. S. & Abramson, L. Y. Gender differences in depression in representative national samples: Meta-analyses of diagnoses and symptoms. Psychol. Bull.143(8), 783 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Mojtabai, R., Olfson, M. & Han, B. National trends in the prevalence and treatment of depression in adolescents and young adults. Pediatrics138, 1. 10.1542/peds.2016-1878 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Breslau, J. et al. Sex differences in recent first-onset depression in an epidemiological sample of adolescents. Transl. Psychiatry7(5), e1139–e1139 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Ryan, J. & Ancelin, M.-L. Polymorphisms of estrogen receptors and risk of depression. Drugs72(13), 1725–1738 (2012). [DOI] [PubMed] [Google Scholar]
  • 90.Ho, T. C., Shah, R., Mishra, J., May, A. C. & Tapert, S. F. Multi-level predictors of depression symptoms in the Adolescent Brain Cognitive Development (ABCD) study. J. Child Psychol. Psychiatry63(12), 1523–1533. 10.1111/jcpp.13608 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Goodman, S. H. Intergenerational transmission of depression. Ann. Rev. Clin. Psychol.16(1), 213–238. 10.1146/annurev-clinpsy-071519-113915 (2020). [DOI] [PubMed] [Google Scholar]
  • 92.Gotlib, I. H., Goodman, S. H. & Humphreys, K. L. Studying the intergenerational transmission of risk for depression: Current status and future directions. Curr. Dir. Psychol. Sci.29(2), 174–179 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Hostinar, C. E., Johnson, A. E. & Gunnar, M. R. Parent support is less effective in buffering cortisol stress reactivity for adolescents compared to children. Dev. Sci.18(2), 281–297. 10.1111/desc.12195 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Furman, W. & Buhrmester, D. Age and sex differences in perceptions of networks of personal relationships. Child Dev.63(1), 103–115. 10.1111/j.1467-8624.1992.tb03599.x (1992). [DOI] [PubMed] [Google Scholar]
  • 95.Richters, J. E. The Hubble hypothesis and the developmentalist’s dilemma. Dev. psychopathol.9(2), 193–229 (1997). [DOI] [PubMed] [Google Scholar]
  • 96.Newman, B. M. & Newman, P. R. Theories of Adolescent Development (Academic Press, 2020). [Google Scholar]
  • 97.Fuhrmann, D., Knoll, L. J. & Blakemore, S.-J. Adolescence as a sensitive period of brain development. Trends Cognit. Sci.19(10), 558–665 (2015). [DOI] [PubMed] [Google Scholar]
  • 98.Guyer, A. E. Adolescent psychopathology: The role of brain-based diatheses, sensitivities, and susceptibilities. Child Dev. Perspect.14(2), 104–109 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Cicchetti, D. & Rogosch, F. A. Equifinality and multifinality in developmental psychopathology. Dev. Psychopathol.8(4), 597–600 (1996). [Google Scholar]
  • 100.Chen, J. H. & Asch, S. M. Machine learning and prediction in medicine—beyond the peak of inflated expectations. N. Engl. J. Med.376(26), 2507 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Cellini, P., Pigoni, A., Delvecchio, G., Moltrasio, C. & Brambilla, P. Machine learning in the prediction of postpartum depression: A review. J. Affect. Disord.309, 350 (2022). [DOI] [PubMed] [Google Scholar]
  • 102.Schmaal, L. et al. Cortical abnormalities in adults and adolescents with major depression based on brain scans from 20 cohorts worldwide in the ENIGMA Major Depressive Disorder Working Group. Mol. Psychiatry22(6), 900–909 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Schmaal, L. et al. Subcortical brain alterations in major depressive disorder: Findings from the ENIGMA Major Depressive Disorder working group. Mol. Psychiatry21(6), 806–812 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.De Asis-Cruz, J., Andescavage, N. & Limperopoulos, C. Adverse prenatal exposures and fetal brain development: Insights from advanced fetal MRI. Biol. Psychiatry Cognit. Neurosci. Neuroimaging7, 480 (2021). [DOI] [PubMed] [Google Scholar]
  • 105.Salganik, M. J. et al. Measuring the predictability of life outcomes with a scientific mass collaboration. Proc. Natl. Acad. Sci.117(15), 8398–8403 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Oh, J., Yun, K., Maoz, U., Kim, T.-S. & Chae, J.-H. Identifying depression in the National Health and Nutrition Examination Survey data using a deep learning algorithm. J. Affect. Disord.257, 623–631 (2019). [DOI] [PubMed] [Google Scholar]
  • 107.Stetler, C. & Miller, G. E. Depression and hypothalamic-pituitary-adrenal activation: A quantitative summary of four decades of research. Psychosom. Med.73(2), 114–126 (2011). [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

All code and instructions on how to reproduce the results can be found at https://github.com/IBPA/DepressionProject.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES