Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2025 Nov 1.
Published in final edited form as: Soc Psychiatry Psychiatr Epidemiol. 2024 Jun 7;59(11):2073–2082. doi: 10.1007/s00127-024-02694-2

Delaying high school start times impacts depressed mood among students: evidence from a natural experiment

Ekaterina Sadikova 1, Rachel Widome 2, Elise Robinson 3,4, Izzuddin M Aris 5, Henning Tiemeier 1
PMCID: PMC11846681  NIHMSID: NIHMS2052730  PMID: 38847813

Abstract

Purpose

Delaying high school start times prolongs weekday sleep. However, it is not clear if longer sleep reduces depression symptoms and if the impact of such policy change is the same across groups of adolescents.

Methods

We examined how gains in weekday sleep impact depression symptoms in 2,134 high school students (mean age 15.16 ± 0.35 years) from the Minneapolis metropolitan area. Leveraging a natural experiment design, we used the policy change to delay school start times as an instrument to estimate the effect of a sustained gain in weekday sleep on repeatedly measured Kandel-Davies depression symptoms. We also evaluated whether allocating the policy change to subgroups with expected benefit could improve the impact of the policy.

Results

Over 2 years, a sustained half-hour gain in weekday sleep expected as a result of the policy change to delay start times decreased depression symptoms by 0.78 points, 95%CI (−1.32,−0.28), or 15.6% of a standard deviation. The benefit was driven by a decrease in fatigue and sleep-related symptoms. While symptoms of low mood, hopelessness, and worry were not affected by the policy on average, older students with greater daily screen use and higher BMI experienced greater improvements in mood symptoms than would be expected on average, signaling heterogeneity. Nevertheless, universal implementation outperformed prescriptive strategies.

Conclusion

High school start time delays are likely to universally decrease fatigue and overall depression symptoms in adolescents. Students who benefit most with respect to mood are older, spend more time on screens and have higher BMI.

Keywords: School start time delays, Adolescent depression, Causal inference, Instrumental variable, Heterogeneity of treatment effect

Introduction

The prevalence of mood disorders, including major depression, increases sharply from 1%–17–25% as children progress through adolescence, with the fastest incidence rate evident in ages 15 to 18 [1, 2]. Numerous underlying processes make adolescence a period of vulnerability to mood dysregulation. Adolescence is marked by rapid physiological changes, development of social networks, increased academic pressures, a greater propensity for risk-taking behaviors, and heightened reactivity to emotional stimuli [35]. Puberty – which demarcates adolescence from childhood – also brings about greater sensitivity to gonadal and adrenal hormones, and morphological changes in brain regions responsible for emotion regulation [69].

Moreover, extensive changes in sleep physiology occur over the course of puberty [10]. Overall melatonin levels decrease and the timing of melatonin peak shifts later as children undergo puberty, making it difficult to fall asleep before 11PM [1113]. The homeostatic accumulation of sleep pressure slackens, making it easier for adolescents to forego sleep and accumulate sleep debt [14, 15]. In addition, adolescents are confronted with earlier school start times as they progress from elementary to middle and high school. More than 36% of U.S. public high schools start before 8 a.m. and approximately 70% of U.S. high school students sleep less than the recommended 8–10 h per night [1619]. The confluence of biological and structural shifts that arise during adolescence and results in short and ill-timed weeknight sleep has been termed a “the perfect storm” [20, 21].

Sleep deprivation in adolescence can disrupt normative development of the prefrontal cortex and multiple brain networks involved in cognitive control and emotional regulation [22]. Short sleep duration has been implicated in greater odds of depressed mood in adolescents [23]. However, several questions remain unanswered. First, if a structural intervention such as a policy to delay school start times is implemented and successfully increases weekday sleep duration, is there a sustained protective effect on depression symptoms? Second, would such a structural intervention affect all adolescents homogeneously, or would identifiable subgroups of adolescents benefit more than others? Lastly, changes in sleep patterns and reductions in motivation due to fatigue are two of the minimally five diagnostic criteria required for a diagnosis of a major depressive episode [24]. As such, does increasing sleep prospectively predict lower depression symptoms overall or does it simply ameliorate the dysregulated sleep component of the syndrome? There is a lack of clarity regarding these questions in epidemiologic findings to date.

We strive to address these questions with data from a natural experiment conducted in the Minneapolis metropolitan area – the START study [25]. Prior work using this data demonstrates a sustained gain in expected sleep duration following a delay in start times [26]. We will assess the impact of the sustained gain in sleep attributable to delaying school start times on depression symptoms, and examine the subgroups – determined by baseline socioeconomic and lifestyle covariates – that benefit most from the implementation of a school start time delay policy.

Method

Study & setting

START collected three waves of data on students from five high schools, two of which delayed their start time. Baseline data was collected from 9th -graders in the spring of 2016, while all the schools were on an early start schedule. In the fall of 2016, two schools shifted their start time from 7:30AM and 7:45AM to 8:20AM and 8:50AM, respectively, while three comparison schools, similar on demographic, cultural, and geographic characteristics to the delay schools, continued to start at 7:30AM. Follow-up assessments were conducted in the spring of 2017 and 2018, when the enrolled students were in 10th and 11th grade. Detailed study procedures have been previously described [25]. The conduct of this analysis was approved by institutional review boards of the University of Minnesota and Harvard University. This study followed the STrengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines.

Measurement and variables

Weekday sleep duration was self-reported. The students were asked when they usually go to bed on school days, how long it takes them to fall asleep after going to bed (less than 10 min, 10–20 min, more than 20 min, or unknown), and when they usually wake up on school days. Bedtimes and wake-up times were collected using multiple-choice questions with answers binned into 15-minute intervals (bedtimes between 9PM and 2AM and wake-up times between 5AM and 8:30AM). Weekday sleep duration was calculated as the hours between going to bed and waking up, minus the approximate amount of time it takes to fall asleep.

Depression symptoms were measured at all three time points with the six-item Kandel-Davies questionnaire, which is validated in adolescents aged 12–18 years [27]. Participants were asked how often they have been bothered or troubled by the following during the last two weeks: (a) feeling too tired to do things, (b) having trouble going to sleep or staying asleep, (c) feeling unhappy, sad, or depressed, (d) feeling hopeless about the future, (e) feeling nervous or tense, and (f) worrying too much about things. Likert-scale responses of “not at all”, “somewhat”, and “very much” conferred 1, 2 and 3 points, respectively. The primary outcome was overall depression symptoms operationalized as the average of the individual item scores, multiplied by 10 for a final score ranging from 10 (no depression symptoms) to 30 (most severe symptoms). As secondary outcomes, we analyzed average scores on fatigue symptoms (items a and b) as well as items on low mood (item c), hopelessness (item d), and worry symptoms (items e and f) – together termed “mood” symptoms henceforth. These subdomains have been operationalized in prior work utilizing the Kandel-Davies questionnaire [28].

We obtained a comprehensive set of covariates to control for possible confounding and to study the heterogeneity of the policy’s effect: participants’ biological sex (male or female), Hispanic ethnicity, race (American Indian or Alaska Native; Asian; Black, African, or African American; Native Hawaiian or other Pacific Islander; or White), an indicator of a one-parent household, the highest level of parental education (less than high school, high school or GED, some college, college degree, or advanced degree), time in minutes that it takes to get to school, body-mass index (BMI), an indicator of a goal to lose weight, an exertion-weighted total of hours of mild (weight of 1), moderate (weight of 2) and strenuous (weight of 3) exercise in the past week, a count of days with at least 60 min of physical activity in the past week, severity of uncontrollable eating behavior (reverse-coded average of Likert-scale responses of “Hardly ever”(= 1), “Sometimes”(= 2), “Most of the time”(= 3), “Almost always”(= 4) to “How often you stop eating when you feel full?” and “How often you trust your body to tell you how much to eat?”), an ordinal measure of feeling hungry at school (“Hardly ever”(= 1), “Sometimes”(= 2), “Most of the time”(= 3), “Almost always”(= 4)), an indicator of having no meals with family in the past week, a count of caffeinated beverages consumed in the past month (including energy drinks, coffee and tea), an indicator of morning obligations (school activities, homework, chores, or work for pay), an indicator of working for pay, eligibility for free or reduced-price lunch, the number of hours of screen time dedicated to things other than school work on an average weekday, and the count of nights per week using a device with a screen in bed. Asian and Native Hawaiian and Other Pacific Islander race categories, and parental education of less than high school and high school or GED were combined for analysis due to small cell counts.

Missing data procedures

Of the 2,134 students enrolled in the study, 2,054 fully completed the baseline survey. Attrition over the 2-year follow-up was modest; 1,917 participants contributed data at either the first or second follow-up. Item-level missing data at baseline was imputed using hot-deck imputation, an approach well suited for the mostly ordinal and categorical data collected in START [29]. Non-response at follow-up was addressed using stabilized inverse probability of censoring weights, which involved conducting logistic regressions for the numerator and denominator to predict probabilities of remaining in the study at the first and second follow-up times [30]. At each time point, the probability of remaining in the study was modeled separately for those in the delay and early start groups, with numerator models reflecting the group-wise probabilities (intercept-only models). The denominator models accounted for the school, eligibility for free or reduced-price lunch, parental education, one-parent household status, not having any family meals in the past week, uncontrollable eating behavior, caffeine intake, BMI, depression symptoms, and weekday sleep duration at baseline. Weights were computed as inverted quotients of the numerator and denominator predicted probabilities and cumulated over time.

Statistical analyses

We compared the distributions of baseline covariates between groups of students from schools that implemented the delay policy (the policy change group) and schools that remained on an early schedule (the comparison group) over the duration of the study using studentized t-tests for continuous variables and Chi-square tests for categorical variables. Using hierarchical targeted maximum likelihood estimation (htmle), we estimated the effects of the policy change on weekday sleep duration, overall depression symptoms, and fatigue and mood symptom clusters at 1- and 2-years post-baseline. Htmle is an efficient, doubly robust algorithm that combines inverse probability of treatment weighting and outcome regression to estimate average treatment effects in data where the intervention is assigned to clusters rather than individuals [31, 32].

We next examined whether the policy change had a heterogeneous effect on depression symptom outcomes. Our aim was to learn which subgroups of students experienced the greatest reduction in symptoms following the school start time delay. The data was split into 70% training and 30% testing sets. We used an X-learner algorithm to estimate the conditional average treatment effect for each individual in the training set [33]. Estimates were calculated by fitting SuperLearner models for the outcome within strata of the delay variable and cross-applying the predictions. The difference in the predictions was relearned with another SuperLearner, adjusting for non-random allocation of the delay policy and potentially informative censoring using the product of inverse probability of treatment and censoring weights. This step directly estimated the conditional average treatment effect, eliminating the reliance on the correct specification of the model for the outcome under no intervention.

Trained models for the conditional average treatment effect were applied to the testing sample to identify those for whom the delay policy is prescribed: those with negative predicted conditional average treatment effect values. Htmle was used to compute the contrast in the expected depression outcomes had everyone been assigned to their prescribed school start time regime (delayed or early) versus (a) had the delay policy been assigned universally or (b) had the policy been applied to a random 50% of the population. Improvement given prescribed rather than universal assignment indicates the existence of identifiable subgroups determined by combinations of baseline prescriptive covariates for whom the policy is both more beneficial and more harmful than expected on average. Improvement relative to at-random allocation, on the other hand, only signified subgroups with greater than average benefit expected from the policy change. Contrast with either universal or at-random implementation indicated heterogeneity of the delay policy’s effect. Where heterogeneity was found, an xgboost variable importance analysis identified student characteristics most predictive of the contrast in the testing sample using the gain and frequency metrics [34]. Section 1 of the online supplement provides a detailed overview of the computational algorithms used to detect heterogeneity.

Lastly, leveraging the policy change as an instrumental variable, we homed in on the effect of a sustained sleep gain that is expected had everyone transitioned to a delayed start schedule versus stayed on an early schedule. To do so, we utilized g-estimation of the parameters of a structural nested mean model (SNMM) comprising two equations for the two follow-up periods [35]. These models were run for outcomes which were homogeneously impacted by the delay policy. Standard errors for the estimates were bootstrapped with 1000 samples.

Initial data wrangling and imputation were conducted using SAS 9.4 (SAS Institute Inc., Cary, NC, USA) and subsequent analysis of the data was conducted using R version 4.2.1 (R Core Team (2022)).

Results

Of the 1,917 students who contributed data over the 2-year follow-up, 1,157 (60.4%) experienced the policy change to delay start times in the fall of their sophomore year. At baseline, students who were about to undergo the policy change were less likely to experience hunger during the school day, reported less difficulty with uncontrollable eating, lower intake of caffeinated beverages, slightly longer weekday sleep duration (7.2 (SD = 1.1) versus 7.1 (SD = 1.2) hours on average) and lower baseline depression scores (15.9 (SD = 4.8) versus 17.0 (SD = 5.2) on average) relative to students in comparison schools. A comprehensive comparison of baseline student characteristics – measured while all students were still on an early schedule – is outlined in Table 1.

Table 1.

Distributions of Baseline Covariates Compared by Policy Change Among Participants with Follow-Up Data from the START Study, Set in the Minneapolis Metropolitan Area, 2016 (N = 1,917)

Policy changea Comparisonb Overall

n (%) 1157 (60.4) 760 (39.6) 1,917 P c
Age in years: mean(SD) 15.1 (0.3) 15.2 (0.4) 15.2 (0.4) < 0.01
Female biological sex: n(%) 573 (49.5) 368 (48.4) 941 (49.1) 0.67
Race/ethnicity: n(%) d
 Hispanic ethnicity 39 (3.4) 18 (2.4) 57 (3.0) 0.26
 American Indian or Alaska Native 33 (2.9) 30 (3.9) 63 (3.3) 0.24
 Asian 150 (13.0) 28 (3.7) 178 (9.3) < 0.01
 Native Hawaiian or Other Pacific Islander 4 (0.3) 8 (1.1) 12 (0.6) 0.10
 Black, African American 78 (6.7) 23 (3.0) 101 (5.3) < 0.01
 White 968 (83.7) 711 (93.6) 1679 (87.6) < 0.01
One parent household: n(%) 38 (3.3) 24 (3.2) 62 (3.2) 0.98
Highest parental education: n(%) < 0.01
 Less than high school 13 (1.1) 7 (0.9) 20 (1.0)
 High school degree 57 (6.0) 65 (9.7) 122 (7.4)
 Some college 113 (9.8) 118 (15.5) 231 (12.1)
 College degree 460 (39.8) 337 (44.3) 797 (41.6)
 Advanced degree 514 (44.4) 233 (30.7) 747 (39.0)
Time (hours) to get to school: mean(SD) : n(%) 0.8 (0.3) 0.8 (0.4) 0.8 (0.3) < 0.01
Student has work, chores, activities, or schoolwork obligations in the morning before school: n(%) 295 (25.5) 221 (29.1) 516 (26.9) 0.09
Student works for pay: n(%) 172 (14.9) 112 (14.7) 284 (14.8) 0.99
Eligible for free or reduced-price lunch: n(%) 164 (14.2) 112 (14.7) 276 (14.4) 0.78
Body mass index: mean(SD) 21.5 (4.2) 21.7 (3.9) 21.6 (4.1) 0.30
Intention to lose weight: n(%) 312 (27.0) 214 (28.2) 526 (27.4) 0.60
Weighted total of physical activity in the past week: mean(SD) 18.5 (9.1) 19.8 (9.7) 19.0 (9.3) < 0.01
Days with at least 60 min of physical activity in the past week: mean(SD) 4.9 (1.8) 4.9 (1.9) 4.9 (1.8) 0.82
Difficulty controlling eating: mean(SD) 1.8 (0.7) 2.0 (0.7) 1.9 (0.7) < 0.01
Hungry at school: n(%) < 0.01
 Hardly ever 104 (9.0) 35 (4.6) 139 (7.3)
 Sometimes 560 (48.4) 277 (36.4) 837 (43.7)
 Much of the time 325 (28.1) 214 (28.2) 539 (28.1)
 Almost always 168 (14.5) 234 (30.8) 402 (21.0)
No meals with family in the past week: n(%) 85 (7.3) 53 (7.0) 138 (7.2) 0.83
# Caffeinated beverages in the past month: mean(SD) 9.0 (20.4) 11.0 (23.9) 9.8 (21.9) 0.05
Hours of screen time on average weekday: mean(SD) 2.4 (1.4) 2.7 (1.4) 2.5 (1.4) < 0.01
Nights per week using device in bed: mean(SD) 3.6 (1.9) 4.0 (1.7) 3.8 (1.8) < 0.01
Weekday sleep (hours) : mean(SD) 7.1 (1.1) 7.1 (1.2) 7.1 (1.1) 0.21
Overall depression symptoms: mean(SD) 15.9 (4.8) 17.0 (5.1) 16.3 (5.0) < 0.01

Note. SD = standard deviation

a

The policy change group reflects schools that delayed their start times from 7:30AM and 7:45AM to 8:20AM and 8:50AM, respectively

b

The comparison school group reflects the schools that remained on an early schedule (start time of 7:30AM)

c

Double-sided p-values from studentized t-tests for continuous variables and Chi-sq tests for categorical variables

d

Percent do not add to 100, multiple categories allowed as choices

The comparison of baseline characteristics between those who contributed data at follow-up and those who were censored post baseline is shown in Table S1 in the online supplement. Remaining on an early school start schedule, eligibility for free or reduced-price school lunch, having higher BMI, exercising less, experiencing shorter weekday sleep, and higher baseline depression symptoms all signaled a greater likelihood of dropping out of the study after baseline.

In Fig. 1, we compare counterfactual sleep and depression symptom outcomes across follow up under the policy change and comparison conditions. The policy change condition reflects a delay in school start times from 7:30AM and 7:45AM to 8:20AM and 8:50AM, respectively, whereas the comparison condition reflects a school start time of 7:30AM. Weeknight sleep duration would be expected to increase from baseline if the policy change was universally implemented, but gradually decrease if all had remained on an early schedule. Depression symptoms – overall, mood and fatigue symptoms – are expected to increase under both counterfactual conditions, but to a lesser degree for overall and fatigue symptoms if school start times were delayed. Average effects of the policy change, estimated as differences in counterfactual outcomes, are outlined in Table 2. Given a delayed versus early schedule, overall depression symptoms are expected to be on average 0.13 points lower, 95% CI (−0.17,−0.08)) and 0.26 points lower, 95% CI (−0.50,−0.02) at 1- and 2-years post baseline, respectively, had everyone transitioned to a later start time. Larger effects are expected for fatigue symptoms if the delay policy is implemented (−0.54 points, 95% CI (−0.73,−0.35) and − 0.91 points, 95% CI (−1.45,−0.38) at 1- and 2-years post-baseline, respectively). There is no average effect of the policy change on mood symptoms. Baseline differences in weekday sleep and depression symptom outcomes were minimized by using inverse probability weighting, as demonstrated in Fig. 1, Table S2 and Table S3 in the online supplement, and are not expected to influence expected changes in the outcomes across follow-up.

Fig. 1.

Fig. 1

Counterfactual sleep and depression symptom outcomes across follow-up (a. weeknight sleep duration; b. overall depression symptom score; c. mood symptom score; d. fatigue and sleep-related symptom score)

Table 2.

Average Effects of the Policy Change on Sleep and Depression Outcomes Over Follow-up among Participants of the START Study, Set in the Minneapolis Metropolitan Area, 2017–2018 (N = 1,917)

1-year post-baseline 2-years post-baseline

Est (95% CI) Est (95% CI)

Weekday sleep duration 0.53(0.34,0.72) 0.45(0.23,0.66)
Depression symptoms
 Overall −0.13(−0.17,−0.08) −0.26(−0.50,−0.02)
 Mood −0.04(−0.46,0.38) −0.03(−0.09,0.02)
 Fatigue −0.54(−0.73,−0.35) −0.91(−1.45,−0.38)

Note. Est = point estimate of the average effect of the delay estimated using htmle; CI = confidence interval

Counterfactual conditions reflect the policy change where schools delay their start time from 7:30AM and 7:45AM to 8:20AM and 8:50AM, respectively, and a comparison condition where schools continue to start at 7:30AM.

We next addressed whether the impact of the delay was heterogeneous with respect to depression symptoms over time. In Table 3, we compared population outcomes if everyone followed their prescribed assignment versus (a) if the delay policy was implemented universally or (b) at-random. Prescriptive allocation of the delay did not yield benefit relative to universal implementation. However, a 0.76-point decrease, 95% CI (−1.43,−0.08) in mood symptoms 1-year post-baseline is expected if the delay is implemented in subgroups with expected benefit of the policy rather than at-random in the population. Overall depression symptoms and fatigue symptoms did not exhibit heterogeneous response to the policy. The putative benefits of universal relative to prescribed and prescribed relative to at-random implementation did not survive multiple testing correction using the Benjamini-Hochberg method. However, multiple testing correction may be an overly conservative strategy in this case, given that the outcomes are all based on the same depression symptom scale.

Table 3.

Comparison of Prescribed vs. Universal or At-Random Policy Change Implementation on Counterfaetual Depression Symptom Outcomes among Participants of the START Study, Set in the Minneapolis Metropolitan Area, 2017–2018 (N = 1,917)

Universal Implementation Random implementation


1-year post-baseline 2-years post-baseline 1-year post-baseline 2-years post-baseline

Depression symptoms Est (95% CI) Est (95% CI) Est (95% CI) Est(95% CI)
 Overall 0.03(−1.24,1.29) 0.77(−0.46,1.99) −0.03(−1.30,1.23) 0.21(−1.03,1.44)
 Mood 0.85(0.19,1.51) 0.85(−0.35,2.04) −0.76(−1.43,−0.08) 0.49(−0.72,1.70)
 Fatigue −0.13(−1.44,1.18) 1.32(−0.12,2.75) −0.24(−1.55,1.07) 0.01(−1.44,1.45)

Note. Est = point estimate of the average effect of prescribing the delay contrasted with a counterfactual strategy (universal or at-random implementation) estimated using htmle; CI = confidence interval

Table 4 summarizes the variable importance statistics (gain and frequency) for the conditional average effect on mood symptoms and outlines the top 10 variables that in combination contribute most strongly to the accuracy of its prediction. Student age, daily school-unrelated screen-time, BMI, physical activity, and the intension to lose weight are statistically different for those for whom the delay is prescribed (46.7%). The delay is expected to improve low mood for students who are older (15.28 (SD = 0.32) vs. 15.05 (SD = 0.35)), have higher BMI (21.94 (SD = 4.02) vs. 21.20 (SD = 3.62)), spend more time on screens on an average day (3.12 h (SD = 1.42) vs. 2.02 h (SD = 1.31)), exercise less often (4.74 days (SD = 1.95) vs. 5.12 days (SD = 1.16) per week), are more likely to want to lose weight (37% vs. 19%), and consume more caffeine in a given month (10.71 cups (SD = 22.56) vs. 7.59 cups (SD = 19.06)).

Table 4.

Variable Importance Analysis Outlining the Top 10 Characteristics Most Predictive of the Conditional Average Treatment Effect for the 1-year Mood Symptom Outcome; Testing Sample of START Study Participants from the Minneapolis Metropolitan Area, 2017 (N = 534)

Characteristic Gaina Frequencyb Delay prescribed Delay not prescribed P c

n = 240 n = 274

Mean(SD) Mean(SD)

Age (years) 0.21 0.15 15.28 (0.32) 15.05 (0.35) < 0.01
Hours of screen time on average weekday 0.12 0.04 3.12 (1.42) 2.02 (1.31) < 0.01
Weekday sleep duration (hours) 0.10 0.17 7.16 (1.13) 7.13 (1.07) 0.73
Weighted hours of physical activity (past week) 0.10 0.10 19.37 (9.80) 19.40 (9.00) 0.97
Body mass index 0.09 0.14 21.94 (4.02) 21.20 (3.62) 0.03
Difficulty controlling eating 0.06 0.04 2.04 (0.75) 1.69 (0.61) < 0.01
# Caffeinated beverages in the past month 0.05 0.06 10.71 (22.56) 7.59 (19.06) 0.09
Days with 60 + min of physical activity (past week) 0.04 0.04 4.74 (1.95) 5.12 (1.61) 0.02
Intension to lose weight (proportion) 0.04 0.01 0.37 (0.48) 0.19 (0.40) < 0.01
Time it takes to get to school (hours) 0.03 0.04 0.79 (0.37) 0.78 (0.31) 0.73

Note.SD = standard deviation

a

Gain statistic quantifies each characteristic’s contribution to the accuracy of the counterfactual contrast prediction

b

Frequency quantifies the relative number of times a feature appears in regression trees

c

Double-sided p-values from studentized t-tests

We used the policy change as an instrumental variable to estimate the impact of a sustained 30-minute sleep gain – as approximately expected following the school start time delay – for overall and fatigue symptoms, which were homogeneously impacted by the delay. We estimated a reduction in overall depression symptoms by 0.78, 95%CI (−1.32,−0.28) points over 2 years and a more pronounced impact on fatigue (−1.36 points, 95% CI (−2.19−0.69)), which likely drives the reductions in overall symptoms.

Discussion

This study examined how a structural intervention to delay school start times impacts depression symptoms in a population of high school students. We found that start time delay policies have the potential to reduce population-level depression symptom burden in high school students, particularly improving self-reported difficulty due to fatigue. While the delay does not confer an average impact on mood symptoms, students who are older, have higher BMI, express the intension to lose weight, spend more time on screens, exercise less, and consume more caffeine may experience improvements in mood symptoms following a shift to later school start times.

Insufficient sleep is a prevalent issue among adolescents. According to a 2018 report of nationally representative data from the Youth Risk Behavior Surveillance System, 72.7% of high school students in the U.S. get less than the American Academy of Sleep Medicine’s recommended 8–10 h of sleep per night [17, 18]. Nearly half of 16-year-olds in the U.S. regularly sleep less than 7 h per night [36]. Sustained sleep deprivation increases the risk of developing chronic health conditions including major depressive disorder [37]. Recent work theorizes that sleep deficits in adolescence may increase internalizing symptoms by promoting functional imbalance in the default brain network, promoting repetitive negative thoughts and rumination, further compromising sleep [38].

School-based interventions have been shown to increase sleep duration in adolescents. A successful reduction of insufficient sleep (< 7 h/night) was evidenced by an experimental school-based sleep education curriculum, but the intervention group reported no change in perceived stress over 1 year of follow-up [39]. Interventions targeting individual behaviors to improve sleep may be insufficient to safeguard downstream mental health outcomes when chronobiological changes in adolescents collide with structural morning-side schedule demands. The START study demonstrates that a school start delay results in a meaningful and sustained average gain in weekday sleep duration – self-reported and objectively-measured with actigraphy in a sub-sample [26]. Our findings confirm that a structural change towards later school start times reduces depression symptoms in high school-aged youth [40]. We expect students who maintain a 30-minute gain in weekday sleep over two years to have a lower burden of depression symptoms longitudinally. Overall depression symptoms are expected to be lower than if students remained on an early schedule, largely driven by a pronounced reduction in fatigue.

This study is novel in demonstrating that delaying high school start times may be particularly protective for students who are older for their cohort, who have higher BMI and indicate struggling with excess weight, who are more sedentary than their peers, and spend more time on screens. Struggling with excess weight and spending more time on screens have been cited as independent risk factors for dysregulated mood [4143]. A large study found significant increases in adolescent depression symptoms over time associated with greater time spent on social media and watching television, revealing reinforcing interactions between the within-person and between-person impacts of media use on deteriorating self-esteem [44]. Overweight and obesity have a strong bidirectional connection with major depression, particularly among female adolescents and young adults [45, 46]. There is a potential link between excess adipose tissue and adolescent-onset major depression via inflammatory mechanisms that are mediated by increases in sex hormones during puberty [47]. In adolescence, being heavier considerably limits exercise and promotes sedentary activities – including time spent on screens [48]. Notably, children growing up in disadvantaged socioeconomic circumstances are more likely to fit the profile of characteristics identifying those with greatest benefits from the school start time delay intervention [4951]. Our findings suggest that delaying start times in underserved school districts – as part of a universal implementation strategy – may lower population-level depression symptom burden and potentially reduce observed disparities in depression.

Our finding that prescribed assignment is better than at-random but not universal allocation of delayed start times indicates that the subgroups with empirically identified harm are not well-characterized by the set of prescriptive covariates examined. The impact of the delay policy is likely heterogeneous with respect to the mood symptom cluster, but the information we have only characterizes subgroups who have better-than-average mood symptoms as a result of the policy change. This work may help inform the choice of school if alternatives are available to a family with a high school student. Otherwise, schools may consider scheduling study hall or free periods at the start of the school days such that students who need extra sleep can opt out of early morning obligations. In addition, future research should assess if school bus routes can be optimized to provide students who benefit most from later school start times with extra time in the mornings. Such studies would need to connect individual characteristics to geocoded addresses of participating students.

There are notable limitations and strengths to this work. First, the school start time delay was not randomly assigned; however, we used inverse probability of treatment weighting to control for imbalances in measured student characteristics that may affect depressed mood over follow-up. Second, while overall retention was good, loss to follow-up depended on baseline student characteristics. We ameliorated this potential source of bias by using stabilized inverse probability weights. Third, we recognize that unmeasured school-district level characteristics such as average household income were not well-accounted for and may have been drivers in the school districts’ decisions to delay start times, study attrition, and levels of depression symptoms in the study population. The geographic proximity of the participating schools and efforts made to select schools similar on economic and demographic characteristics gives us hope that the delayed and early-start schools were comparable on unmeasured cluster-level characteristics. On the other hand, the selection of schools from a proximal geographic area limits generalizability. For estimation and inference of overall and heterogeneous effects, we utilized hierarchical targeted learning, a doubly robust algorithm that accounts for clustering of the delay implementation, but with only 5 clusters, our precision may have been reduced. As such, evidence for heterogeneity of treatment response was limited after correction for multiple testing. Lastly, we were not able to assess gender identity as a potential prescriptive factor – only biological sex (male or female) was collected in this study. Despite limitations, the START study gave us an opportunity to leverage the delay policy as an instrumental variable and thus control for unmeasured confounding of the relationship between individual weekday sleep duration and self-reported mood.

Our findings stand in favor of universal high school start time delays, which result in meaningful reductions in fatigue, driving modest reductions in overall depression symptoms over two years of follow-up. Delays are expected to have greater than average impact on low mood in older students who have higher average BMI, indicate a goal to lose weight, spend more time on screens, exercise less, and consume more caffeine. Universal delays of high school start times have the potential to not only sustainably prolong weekday sleep among high school students but improve depression symptom outcomes.

Supplementary Material

supplemental

Acknowledgements

The authors would like to thank the adolescents participating in the START study, the districts that welcomed us to do research in their schools, the START data collectors, and Bill Baker for his work to manage the data. Thank you to Kate Bauer for sharing your great ideas. This study was supported by funding from the National Institutes of Health’s (NIH) Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) (R01 HD088176). Additionally, the authors gratefully acknowledge support from the Minnesota Population Center (P2C HD041023) funded through a grant from NICHD. Ekaterina Sadikova was supported by T32 MH 17119–36.

Funding

This study is supported by funding from the National Institutes of Health’s (NIH) Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) (R01 HD088176). Additionally, the authors gratefully acknowledge support from the Minnesota Population Center (P2C HD041023) funded through a grant from NICHD. Ekaterina Sadikova was supported by T32 MH 17119–36.

Footnotes

Competing interests The authors have no relevant financial or non-financial interests to disclose.

Supplementary Information The online version contains supplementary material available at https://doi.org/10.1007/s00127-024-02694-2.

Ethics approval The research presented has been approved by the Institutional Review Boards of the University of Minnesota and Harvard University.

Consent Informed consent was obtained from parents of the START study participants each year data was collected. Assent was obtained from the participants, who were informed that they were free to refuse participating in any element of data collection with no ensuing consequences.

Data The data underlying this article will be shared on reasonable request to the corresponding author.

Data availability

No datasets were generated or analysed during the current study.

References

  • 1.Emslie GJ, Mayes TL, Ruberu M (2005) Continuation and maintenance therapy of early-onset major depressive disorder. Paediatr Drugs 7(4):203–217. 10.2165/00148581-200507040-00001 [DOI] [PubMed] [Google Scholar]
  • 2.Kessler RC, Avenevoli S, Ries Merikangas K (2001) Mood disorders in children and adolescents: an epidemiologic perspective. Biol Psychiatry (1969) 49(12):1002–1014. 10.1016/S0006-3223(01)01129-5 [DOI] [PubMed] [Google Scholar]
  • 3.Ge X, Natsuaki MN (2009) In search of explanations for early pubertal timing effects on Developmental psychopathology. Current directions in psychological science: a. J Am Psychol Soc 18(6):327–331. 10.1111/j.1467-8721.2009.01661.x [DOI] [Google Scholar]
  • 4.Spear LP (2009) Heightened stress responsivity and emotional reactivity during pubertal maturation: implications for psychopathology. Dev Psychopathol 21(1):87–97. 10.1017/S0954579409000066 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Steinberg L (2008) A Social Neuroscience Perspective on adolescent risk-taking. Dev Rev 28(1):78–106. 10.1016/j.dr.2007.08.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Andersen SL, Teicher MH (2008) Stress, sensitive periods and maturational events in adolescent depression. Trends in neurosciences (regular ed). 31(4):183–191. 10.1016/j.tins.2008.01.004 [DOI] [PubMed] [Google Scholar]
  • 7.Lord C, Sekerovic Z, Carrier J (2014) Sleep regulation and sex hormones exposure in men and women across adulthood. Pathol Biol (Paris) 62(5):302–310. 10.1016/j.patbio.2014.07.005 [DOI] [PubMed] [Google Scholar]
  • 8.Andersen SL (2003) Trajectories of brain development: point of vulnerability or window of opportunity? Neuroscience and biobehavioral reviews 27. 13–18. 10.1016/S0149-7634(03)00005-8 [DOI] [PubMed] [Google Scholar]
  • 9.Perlman WR, Webster MJ, Herman MM, Kleinman JE, Weickert CS (2007) Age-related differences in glucocorticoid receptor mRNA levels in the human brain. Neurobiol Aging 28(3):447–458. 10.1016/j.neurobiolaging.2006.01.010 [DOI] [PubMed] [Google Scholar]
  • 10.Meers J, Stout-Aguilar J, Nowakowski S (2019) Chap. 3 - Sex differences in sleep health. In: Grandner MA(ed) Sleep and Health. Academic Press, pp 21–29. 10.1016/B978-0-12-815373-4.00003-4 [DOI] [Google Scholar]
  • 11.Crowley SJ, Acebo C, Carskadon MA (2006) Sleep, circadian rhythms, and delayed phase in adolescence. Sleep Med 8(6):602–612. 10.1016/j.sleep.2006.12.002 [DOI] [PubMed] [Google Scholar]
  • 12.Silman RE, Leone RM, Hooper RJ, Preece MA (1979) Melatonin, the pineal gland and human puberty. Nature 282(5736):301–303. 10.1038/282301a0 [DOI] [PubMed] [Google Scholar]
  • 13.Zisapel N (2018) New perspectives on the role of melatonin in human sleep, circadian rhythms and their regulation. Br J Pharmacol 175(16):3190–3199. 10.1111/bph.14116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Jenni OG, Achermann P, Carskadon MA (2005) Homeostatic sleep regulation in adolescents. Sleep 28(11):1446–1454. 10.1093/sleep/28.11.1446 [DOI] [PubMed] [Google Scholar]
  • 15.Taylor DJ, Jenni OG, Acebo C, Carskadon MA (2005) Sleep tendency during extended wakefulness: insights into adolescent sleep regulation and behavior. J Sleep Res 14(3):239–244. 10.1111/j.1365-2869.2005.00467.x [DOI] [PubMed] [Google Scholar]
  • 16.Owens J, GROUP ASW, ADOLESCENCE CO, Au R, Carskadon M, Millman R, Wolfson A, Braverman PK, Adelman WP, Breuner CC, Levine DA, Marcell AV, Murray PJ, O’Brien RF (2014) Insufficient sleep in adolescents and young adults: an update on causes and consequences. Pediatrics 134(3):e921–e932. 10.1542/peds.2014-1696 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Paruthi S, Brooks LJ, D’Ambrosio C, Hall WA, Kotagal S, Lloyd RM, Malow BA, Maski K, Nichols C, Quan SF, Rosen CL, Troester MM, Wise MS (2016) Consensus Statement of the American Academy of Sleep Medicine on the recommended amount of sleep for healthy children: methodology and discussion. J Clin Sleep Med 12(11):1549–1561. 10.5664/jcsm.6288 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Wheaton AG, Jones SE, Cooper AC, Croft JB (2018) Short Sleep Duration among Middle School and High School Students — United States, 2015. MMWR Morbidity Mortal Wkly Rep 67(3):85–90. 10.15585/mmwr.mm6703a1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Taie SL, Laurie (2022) Characteristics of 2020–21 public and private K–12 schools in the United States: results from the National teacher and principal survey. Institute of Education Sciences, U.S. Department of Education [Google Scholar]
  • 20.Carskadon MA (2011) Sleep in adolescents: the perfect storm. Pediatr Clin North Am 58(3):637–647. 10.1016/j.pcl.2011.03.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Crowley SJ, Wolfson AR, Tarokh L, Carskadon MA (2018) An update on adolescent sleep: new evidence informing the perfect storm model. J Adolesc 67:55–65. 10.1016/j.adolescence.2018.06.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Galván A (2020) The need for Sleep in the adolescent brain. Trends Cogn Sci 24(1):79–89. 10.1016/j.tics.2019.11.002 [DOI] [PubMed] [Google Scholar]
  • 23.Short MA, Booth SA, Omar O, Ostlundh L, Arora T (2020) The relationship between sleep duration and mood in adolescents: a systematic review and meta-analysis. Sleep Med Rev 52:101311–101311. 10.1016/j.smrv.2020.101311 [DOI] [PubMed] [Google Scholar]
  • 24.APA (2013) Diagnostic and Statistical Manual of Mental Disorders. 5th edn., Washington, DC [Google Scholar]
  • 25.Widome R, Wahlstrom KL, Laska MN, Erickson DJ, Berger A, Iber C, Kilian G (2020) The START study: an evaluation to study the impact of a natural experiment in high school start times on adolescent weight and related behaviors. Obs Stud 6:66–86 [PMC free article] [PubMed] [Google Scholar]
  • 26.Widome R, Berger AT, Iber C, Wahlstrom K, Laska MN, Kilian G, Redline S, Erickson DJ (2020) Association of Delaying School Start Time with Sleep Duration, timing, and Quality among adolescents. JAMA Pediatr 174(7):697–704. 10.1001/jamapediatrics.2020.0344 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Kandel DB, Davies M (1982) Epidemiology of depressive mood in adolescents: an empirical study. Arch Gen Psychiatry 39(10):1205–1212. 10.1001/archpsyc.1982.04290100065011 [DOI] [PubMed] [Google Scholar]
  • 28.Roane BM, Seifer R, Sharkey KM, Van Reen E, Bond TLY, Raffray T, Carskadon MA (2013) Reliability of a Scale assessing depressed Mood in the Context of Sleep. TPM Test Psychom Methodol Appl Psychol 20(1):3–11. 10.4473/TPM20.1.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Ono M, Miller HP (1969) Income nonresponses in the current population survey. US Bureau of the Census [Google Scholar]
  • 30.Robins JM, Finkelstein DM (2000) Correcting for non-compliance and dependent censoring in an AIDS Clinical Trial with inverse probability of censoring weighted (IPCW) log-rank tests. Biometrics 56(3):779–788. 10.1111/j.0006-341x.2000.00779.x [DOI] [PubMed] [Google Scholar]
  • 31.Balzer LB, Zheng W, van der Laan MJ, Petersen ML (2019) A new approach to hierarchical data analysis: targeted maximum likelihood estimation for the causal effect of a cluster-level exposure. Stat Methods Med Res 28(6):1761–1780. 10.1177/0962280218774936 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.van der Laan MJ, Polley EC, Hubbard AE (2007) Super learner. Stat Appl Genet Mol Biol 6:Article25. 10.2202/1544-6115.1309 [DOI] [PubMed] [Google Scholar]
  • 33.Luedtke AR, van der Laan MJ (2016) Super-learning of an optimal dynamic treatment rule. Int J Biostat 12(1):305–332. 10.1515/ijb-2015-0052 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Chen T, Guestrin C, XGBoost: A Scalable Tree Boosting System. In, Ithaca, 2016 2016. Cornell University Library, arXiv.org. 10.1145/2939672.2939785 [DOI] [Google Scholar]
  • 35.Robins JM, Blevins D, Ritter G, Wulfsohn M (1992) G-estimation of the effect of prophylaxis therapy for Pneumocystis carinii pneumonia on the survival of AIDS patients. Epidemiology:319–336 [DOI] [PubMed] [Google Scholar]
  • 36.Keyes KM, Maslowsky J, Hamilton A, Schulenberg J (2015) The great sleep recession: changes in sleep duration among US adolescents, 1991–2012. Pediatr (Evanston) 135(3):460–468. 10.1542/peds.2014-2707 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Colten HR, Altevogt BM (2006) Sleep disorders and sleep deprivation: an unmet public health problem [PubMed] [Google Scholar]
  • 38.Akbar SA, Mattfeld AT, Laird AR, McMakin DL (2022) Sleep to Internalizing Pathway in Young adolescents (SIPYA): a proposed neurodevelopmental model. Neurosci Biobehav Rev 140:104780–104780. 10.1016/j.neubiorev.2022.104780 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Bauducco SV, Flink IK, Boersma K, Linton SJ (2020) Preventing sleep deficit in adolescents: long-term effects of a quasi-experimental school-based intervention study. J Sleep Res 29(1):e12940. 10.1111/jsr.12940 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Berger AT, Erickson DJ, Wahlstrom K, Laska MN, Lammert S, Widome R (Under review) Consequences of delaying school start time on mental health among adolescents: the START study [Google Scholar]
  • 41.Twenge JM, Campbell WK (2018) Associations between screen time and lower psychological well-being among children and adolescents: evidence from a population-based study. Prev Med Rep 12:271–283. 10.1016/j.pmedr.2018.10.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Twenge JM, Cooper AB, Joiner TE, Duffy ME, Binau SG (2019) Age, period, and cohort trends in mood disorder indicators and suicide-related outcomes in a nationally representative dataset, 2005–2017. J Abnorm Psychol 128(3):185–199. 10.1037/abn0000410 [DOI] [PubMed] [Google Scholar]
  • 43.Lindberg L, Hagman E, Danielsson P, Marcus C, Persson M (2020) Anxiety and depression in children and adolescents with obesity: a nationwide study in Sweden. BMC Med 18(1):30. 10.1186/s12916-020-1498-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Boers E, Afzali MH, Newton N, Conrod P (2019) Association of Screen Time and depression in adolescence. JAMA Pediatr 173(9):853–859. 10.1001/jamapediatrics.2019.1759 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Marmorstein NR, Iacono WG, Legrand L (2014) Obesity and depression in adolescence and beyond: reciprocal risks. Int J Obes 38(7):906–911. 10.1038/ijo.2014.19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Markowitz S, Friedman MA, Arent SM (2008) Understanding the relation between obesity and depression: causal mechanisms and implications for treatment. Clinical psychology. (New York NY) 15(1):1–20. 10.1111/j.1468-2850.2008.00106.x [DOI] [Google Scholar]
  • 47.Byrne ML, O’Brien-Simpson NM, Mitchell SA, Allen NB (2015) Adolescent-onset depression: are obesity and inflammation Developmental mechanisms or outcomes? Child Psychiatry Hum Dev 46(6):839–850. 10.1007/s10578-014-0524-9 [DOI] [PubMed] [Google Scholar]
  • 48.Wójcik M, Goncerz D, Piasny M, Surówka A, Mazurek E, Drożdż D, Kozioł-Kozakowska A, Starzyk JB, Makara-Studzińska M (2022) Obesity in adolescents may be associated with limitations in daily activities and an increased level of anxiety in patients and their parents – preliminary results of a pilot study. Front Endocrinol 13. 10.3389/fendo.2022.1007765 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Lee H, Harris KM, Gordon-Larsen P (2009) Life course perspectives on the links between poverty and obesity during the transition to young adulthood. Popul Res Policy Rev 28:505–532 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Armstrong S, Wong CA, Perrin E, Page S, Sibley L, Skinner A (2018) Association of Physical Activity with Income, Race/Ethnicity, and sex among adolescents and young adults in the United States: findings from the National Health and Nutrition Examination Survey, 2007–2016. JAMA Pediatr 172(8):732–740. 10.1001/jamapediatrics.2018.1273 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Lee S, Kim S, Suh S, Han H, Jung J, Yang S, Shin Y (2022) Relationship between screen time among children and lower economic status during elementary school closures due to the coronavirus disease 2019 pandemic. BMC Public Health 22(1):160–160. 10.1186/s12889-022-12559-5 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supplemental

Data Availability Statement

No datasets were generated or analysed during the current study.

RESOURCES