Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Dec 20.
Published in final edited form as: N Engl J Med. 2013 May 21;368(25):2366–2376. doi: 10.1056/NEJMoa1215881

A Randomized Trial of Adenotonsillectomy for Childhood Sleep Apnea

Carole L Marcus, Reneé H Moore, Carol L Rosen, Bruno Giordani, Susan L Garetz, H Gerry Taylor, Ron B Mitchell, Raouf Amin, Eliot S Katz, Raanan Arens, Shalini Paruthi, Hiren Muzumdar, David Gozal, Nina Hattiangadi Thomas, Janice Ware, Dean Beebe, Karen Snyder, Lisa Elden, Robert C Sprecher, Paul Willging, Dwight Jones, John P Bent, Timothy Hoban, Ronald D Chervin, Susan S Ellenberg, Susan Redline; for the Childhood Adenotonsillectomy Trial (CHAT)
PMCID: PMC3756808  NIHMSID: NIHMS503890  PMID: 23692173

Abstract

BACKGROUND

Adenotonsillectomy is commonly performed in children with the obstructive sleep apnea syndrome, yet its usefulness in reducing symptoms and improving cognition, behavior, quality of life, and polysomnographic findings has not been rigorously evaluated. We hypothesized that, in children with the obstructive sleep apnea syndrome without prolonged oxyhemoglobin desaturation, early adenotonsillectomy, as compared with watchful waiting with supportive care, would result in improved outcomes.

METHODS

We randomly assigned 464 children, 5 to 9 years of age, with the obstructive sleep apnea syndrome to early adenotonsillectomy or a strategy of watchful waiting. Polysomnographic, cognitive, behavioral, and health outcomes were assessed at baseline and at 7 months.

RESULTS

The average baseline value for the primary outcome, the attention and executive-function score on the Developmental Neuropsychological Assessment (with scores ranging from 50 to 150 and higher scores indicating better functioning), was close to the population mean of 100, and the change from baseline to follow-up did not differ significantly according to study group (mean [±SD] improvement, 7.1±13.9 in the early-adenotonsillectomy group and 5.1±13.4 in the watchful-waiting group; P = 0.16). In contrast, there were significantly greater improvements in behavioral, quality-of-life, and polysomnographic findings and significantly greater reduction in symptoms in the early-adenotonsillectomy group than in the watchful-waiting group. Normalization of polysomnographic findings was observed in a larger proportion of children in the early-adenotonsillectomy group than in the watchful-waiting group (79% vs. 46%).

CONCLUSIONS

As compared with a strategy of watchful waiting, surgical treatment for the obstructive sleep apnea syndrome in school-age children did not significantly improve attention or executive function as measured by neuropsychological testing but did reduce symptoms and improve secondary outcomes of behavior, quality of life, and polysomnographic findings, thus providing evidence of beneficial effects of early adenotonsillectomy. (Funded by the National Institutes of Health; CHAT ClinicalTrials.gov number, NCT00560859.)


The childhood obstructive sleep apnea syndrome is associated with numerous adverse health outcomes, including cognitive and behavioral deficits.1 The most commonly identified risk factor for the childhood obstructive sleep apnea syndrome is adenotonsillar hypertrophy. Thus, the primary treatment is adenotonsillectomy, which accounts for more than 500,000 procedures annually in the United States alone.2 Nevertheless, there has been no controlled study evaluating the benefits and risks of adenotonsillectomy, as compared with watchful waiting, for the management of the obstructive sleep apnea syndrome.

The Childhood Adenotonsillectomy Trial (CHAT) was designed to evaluate the efficacy of early adenotonsillectomy versus watchful waiting with supportive care, with respect to cognitive, behavioral, quality-of-life, and sleep factors at 7 months of follow-up, in children with the obstructive sleep apnea syndrome. Our primary outcome was a neurobehavioral measure of attention and executive function, a domain that has been shown to be sensitive to intermittent hypoxemia related to the obstructive sleep apnea syndrome.3 Given the prevalence of this syndrome among black children and obese children,4,5 we also evaluated whether the relative efficacy of the treatment differed according to race, weight, or baseline severity of the syndrome.

METHODS

STUDY DESIGN AND PATIENTS

We conducted this multicenter, single-blind, randomized, controlled trial at seven academic sleep centers. Methodologic details have been published previously6 and are provided in the full protocol and in the Supplementary Appendix (available with the full text of this article at NEJM.org).

Eligible children were 5 to 9 years of age, had the obstructive sleep apnea syndrome without prolonged oxyhemoglobin desaturation, and were considered to be suitable candidates for adenotonsillectomy. The obstructive sleep apnea syndrome was defined as an obstructive apnea–hypopnea index (AHI) score of 2 or more events per hour or an obstructive apnea index (OAI) score of 1 or more events per hour.710 Children with an AHI score of more than 30 events per hour, an OAI score of more than 20 events per hour, or arterial oxyhemoglobin saturation of less than 90% for 2% or more of the total sleep time were not eligible, owing to the severity of the polysomnographic findings. Exclusion criteria included recurrent tonsillitis, a z score based on the body-mass index (the weight in kilograms divided by the square of the height in meters) of 3 or more, and medication for attention deficit–hyperactivity disorder (ADHD).

Children were randomly assigned to early adenotonsillectomy (surgery within 4 weeks after randomization) or a strategy of watchful waiting. At the baseline visit, children with coexisting conditions that could exacerbate the obstructive sleep apnea syndrome (e.g., allergies and poorly controlled asthma) were referred for treatment as needed.

STUDY OVERSIGHT

The study was approved by the institutional review board at each participating site. Written informed consent was obtained from caregivers, and assent from children who were 7 years of age or older. An independent data and safety monitoring board reviewed interim data on safety and study quality. An external medical monitor adjudicated treatment failures, defined as changes in clinical status requiring a change in the assigned therapy.6 All the authors vouch for the completeness and accuracy of the data and the fidelity of the study to the protocol (available at NEJM.org). There was no commercial support for this study.

ASSESSMENTS

Children underwent standardized polysomnographic testing with scoring at a centralized sleep reading center, cognitive and behavioral testing, and other clinical and laboratory evaluations at baseline and 7 months after randomization.6 At both examinations, caregivers were asked to complete survey instruments, and teachers were mailed behavioral assessments (see the Supplementary Appendix).

OUTCOMES

The primary study outcome was the change in the attention and executive-function score on the Developmental Neuropsychological Assessment (NEPSY; scores range from 50 to 150, with 100 representing the population mean and higher scores indicating better functioning).11 This test has well-established psychometric properties11 and comprises three tasks (tower building, visual attention, and auditory attention) performed under the supervision of a psychometrist.

Other outcomes included caregiver and teacher ratings of behavior (Conners’ Rating Scale Revised: Long Version Global Index, comprising Restless–Impulsive and Emotional Lability factor sets [caregiver-rated T scores range from 38 to 90, and teacher-rated T scores range from 40 to 90, with higher scores indicating worse functioning]),12 and the Behavior Rating Inventory of Executive Function [BRIEF] Global Executive Composite T score, comprising summary measures of behavioral regulation and metacognition [caregiver-rated scores range from 28 to 101, and teacher-rated scores range from 37 to 131, with higher scores indicating worse functioning])13; symptoms of the obstructive sleep apnea syndrome, as assessed by means of the Pediatric Sleep Questionnaire sleep-related breathing disorder scale (PSQ-SRBD), in which scores range from 0 to 1, with higher scores indicating greater severity14; sleepiness, as assessed with the use of the Epworth Sleepiness Scale modified for children, in which scores range from 0 to 24, with higher scores indicating greater daytime sleepiness15; global quality of life (caregiver-rated total score from the Pediatric Quality of Life Inventory [PedsQL], in which scores range from 0 to 100, with higher scores indicating better quality of life)16; disease-specific quality of life (total score on the 18-item Obstructive Sleep Apnea-18 assessment tool, in which scores range from 18 to 126, with higher scores indicating worse quality of life)17; generalized intellectual functioning (General Conceptual Ability score from the Differential Ability Scales-II [DAS], in which scores range from 30 to 170, with higher scores indicating better functioning)18; and polysomnographic indexes.

STATISTICAL ANALYSIS

We calculated that with a sample of 400 children, randomly assigned in a 1:1 ratio to early adenotonsillectomy or a strategy of watchful waiting, the study would have 90% power to detect an effect size of 0.32 or more (on the basis of an estimate from one previous study19) for the primary outcome. We planned to enroll 460 children to compensate for withdrawal from the study. Children who crossed over to the other treatment were included in their assigned study groups for the primary analysis, consistent with the intention-to-treat principle.

Primary and secondary outcomes were evaluated with the use of an analysis of covariance with adjustment for the stratification factors of age, race, weight status, and study site. Additional prespecified analyses included adjustments for other factors and restrictions to certain subgroups (see the Supplementary Appendix). Models evaluating possible effect modification of treatment according to race, obesity status, AHI score, and age were tested by including terms for interactions between the two groups and by the effect of each of these factors on each of the study outcomes. A sensitivity analysis was performed with the use of multiple imputation to assess the effect of missing values on the primary outcome.20

RESULTS

STUDY OVERVIEW

Figure 1 shows the enrollment and randomization of the participants. From January 2008 through September 2011, a total of 464 children underwent randomization. Follow-up visits were conducted for 400 children (86%), with 397 children having measurements of attention and executive function on the NEPSY that could be evaluated. A comparison of children who completed the study and those who did not showed a significant difference only with respect to race; black children were less likely to complete the study (P = 0.04), but this trend was evident in both study groups.

Figure 1. Study Enrollment and Randomization.

Figure 1

A total of 464 children underwent randomization, with 226 children assigned to early adenotonsillectomy and 227 to a strategy of watchful waiting with supportive care. A total of 194 children in the early-adenotonsillectomy group and 203 in the watchful-waiting group were included in the analysis of the primary outcome, the attention and executive-function score on the Developmental Neuropsychological Assessment (NEPSY).

Baseline characteristics are shown according to study group (Tables 1 and 2, and Table S1 in the Supplementary Appendix). Baseline demographic and clinical characteristics were generally well balanced between the study groups, and cognitive and behavioral scores were close to population means. Nearly half the participants were overweight or obese. A similar number of children in each group used nasal glucocorticoids (19 children in the early-adenotonsillectomy group and 8 in the watchful-waiting group) or montelukast (7 in the early-adenotonsillectomy group and 8 in the watchful-waiting group) for allergic rhinitis or asthma; the data were the same at baseline and at 7 months of follow-up.

Table 1.

Baseline Characteristics of Patients Who Completed the Study.*

haracteristic Watchful
Waiting
(N = 203)
Early
Adenotonsillectomy
(N = 194)
Age — yr 6.5±1.4 6.5±1.4
Male sex — no. (%) 106 (52) 89 (46)
Race — no. (%)
  Black 108 (53) 103 (53)
  White 76 (37) 67 (35)
  Other 19 (9) 24 (12)
Hispanic ethnicity — no. (%) 17 (8) 15 (8)
Height — cm 124.7±10.5 125.1±11.2
Height z score 0.6±1.0 0.7±1.0
Weight — kg 30.1±11.7 31.2±13.1
Weight z score 1.0±1.2 1.0±1.3
Weight class — no. (%)
  Overweight or obese 94 (46) 93 (48)
  Obese 67 (33) 68 (35)
  Failure to thrive 3 (1) 4 (2)
Maternal educational level less
    than high school
    — no. (%)
64 (32) 62 (32)
Annual household income
    <$30,000 — no. (%)
82 (40) 73 (38)
*

Plus–minus values are means ±SD. There were no significant differences between the study groups. The characteristics of the sample included in the primary outcome analysis were similar to those of the baseline population, except that black children were less likely to complete the study (P = 0.04), but this trend was evident in both study groups (Table S1 in the Supplementary Appendix).

Race reported by caregivers.

Overweight or obese was defined as a body-mass index (BMI; the weight in kilograms divided by the square of the height in meters) in the 85th percentile or higher, obese as a BMI in the 95th percentile or higher, and failure to thrive as a BMI in less than the 5th percentile.

Table 2.

Outcome Measures.*

Outcome Normative
Mean
Watchful Waiting Early
Adenotonsillectomy
Effect
Size
P Value
Baseline Change from
Baseline
to 7 Mo
Baseline Change from
Baseline
to 7 Mo
Primary outcome

NEPSY attention and executive-function
    score
100±15 101.1±14.6 5.1±13.4 101.5±15.9 7.1±13.9 0.15 0.16

Secondary outcomes

Conners’ Rating Scale score§ 50±10

  Caregiver rating 52.6±11.7 −0.2±9.4 52.5±11.6 −2.9±9.9 0.28 0.01

  Teacher rating 55.1±12.8 −1.5±10.7 56.4±14.4 −4.9±12.9 0.29 0.04

BRIEF score 50±10

  Caregiver rating 50.1±11.5 0.4±8.8 50.1±11.2 −3.3±8.5 0.28 <0.001

  Teacher rating 56.4±11.7 −1.0±11.2 57.2±14.1 −3.1±12.6 0.18 0.22

PSQ-SRBD score 0.2±0.1 0.5±0.2 −0.0±0.2 0.5±0.2 −0.3±0.2 1.50 <0.001

PedsQL score** 78±16 76.5±15.7 0.9±13.3 77.3±15.3 5.9±13.6 0.37 <0.001

Apnea-hypopnea index — no. of events/hr†† NA

  Median 4.5 −1.6 4.8 −3.5 0.57 <0.001‡‡

  Interquartile range 2.5 to 8.9 −3.7 to 0.5 2.7 to 8.8 −7.1 to −1.8
*

Plus–minus values are means ±SD, and nonnormally distributed data are medians with interquartile ranges. All P values were adjusted for the stratification factors of age (5 to 7 years of age vs. 8 to 9 years of age), race (black vs. other), weight status (overweight or obese vs. nonoverweight), and study site. NA denotes not applicable.

Effect sizes were calculated with the use of Cohen’s d, relating the magnitude of group difference to the standard deviation, and may be interpreted as follows: small, more than 0.20 to 0.49; medium, 0.50 to 0.79; and large, 0.80 or more.

Scores on the attention and executive-function domain of the Developmental Neuropsychological Assessment (NEPSY) range from 50 to 150, with higher scores indicating better functioning. Data are shown for 203 patients in the watchful-waiting group and 194 in the early-adenotonsillectomy group.

§

Scores on the Conners’ Parent Rating Scale Revised: Long Version Global Index, comprising the Restless–Impulsive and Emotional Lability factor sets, range from 38 to 90, with higher scores indicating worse functioning.12Data are shown for 199 patients in the watchful-waiting group and 193 in the early-adenotonsillectomy group. Scores on the Conners’ Teacher Rating Scale Revised (with scores ranging from 40 to 90 and higher scores indicating worse functioning) are shown for 109 patients in the watchful-waiting group and 103 in the early-adeno-tonsillectomy group.

On the Behavior Rating Inventory of Executive Function (BRIEF) Global Executive Composite section, comprising summary measures of behavioral regulation and metacognition, higher scores indicate worse functioning.13Data on the caregiver ratings, with a range of 28 to 101, are shown for 197 patients in the watchful-waiting group and 195 in the early-adenotonsillectomy group. Data on the teacher ratings, with a range from 37 to 131, are shown for 103 patients in the watchful-waiting group and 104 in the early-adenotonsillectomy group.

Scores on the Pediatric Sleep Questionnaire sleep-related breathing disorder scale (PSQ-SRBD) range from 0 to 1, with higher scores indicating greater severity.14Data are shown for 202 patients in the watchful-waiting group and 194 in the early-adenotonsillectomy group.

**

Scores on the Pediatric Quality of Life Inventory (PedsQL) range from 0 to 100, with higher scores indicating better quality of life.16,21 Data are shown for 204 patients in the watchful-waiting group and 195 in the early-adenotonsillectomy group.

††

A score of 2 or more on the apnea–hypopnea index indicated the obstructive sleep apnea syndrome. Higher scores indicate more severe obstructive sleep apnea.

‡‡

Testing was performed on naturally log-transformed variables.

STUDY OUTCOMES

The baseline attention and executive function score on the NEPSY was close to the population mean of 100 in both groups. Average scores increased in both groups; the difference between the groups favored early adenotonsillectomy but was not significant (P = 0.16). A sensitivity analysis to assess the possible effect of missing data yielded results that were essentially identical to those presented in Table 2.

There was a significantly greater improvement on the caregiver-reported Conners’ Rating Scale among children randomly assigned to early adenotonsillectomy than among those assigned to watchful waiting. Teacher-reported data for this measure, which were available for 212 children, also showed significantly greater improvement in the early-adenotonsillectomy group. The caregiver-reported BRIEF score was lower (indicating an improvement) in the early-adenotonsillectomy group, with a small increase in score in the watchful-waiting group; the teacher-reported version, which was available for 207 children, showed changes that paralleled the caregiver data but were not significantly different between the groups.

Symptoms of the obstructive sleep apnea syndrome were measured with the use of the PSQ-SRBD and the Epworth Sleepiness Scale, and generic and disease-specific measures of quality of life were assessed by means of the PedsQL and OSA-18, respectively. All these instruments showed a significantly greater reduction in symptoms in the early-adenotonsillectomy group than in the watchful-waiting group. The DAS score did not change significantly in either study group (data not shown).

The AHI score improved in both groups but significantly more so in the early-adenotonsillectomy group. Similar results were observed for the oxygen desaturation index (the number of times per hour of sleep that the blood oxygen level drops by 3% or more from baseline) and the level of hypercapnia. The early-adenotonsillectomy group had significantly larger decreases in the arousal index and in the percentage of sleep time in stage N1 (light sleep), consistent with improved sleep continuity. However, no changes were observed in stage N3 or in rapid-eye-movement sleep. Normalization of the obstructive sleep apnea syndrome, as defined by a reduction in both the AHI score to fewer than two events per hour and the OAI score to fewer than one event per hour,710 was more common in the early-adenotonsillectomy group than in the watchful-waiting group (79% vs. 46%, P<0.001).

EFFECT OF BASELINE CHARACTERISTICS

For all the outcomes, relative improvements with early adenotonsillectomy, as compared with watchful waiting, were similar in subgroups defined according to age or obesity status. However, relative improvements with early adenoton-sillectomy were significantly smaller among black children than among children of other races on the Conners’ Rating Scale, the BRIEF, and the PSQ-SRBD scale (see the Supplementary Appendix). Among children with more severe obstructive sleep apnea syndrome at baseline (i.e., AHI score above the median), those randomly assigned to early adenotonsillectomy had a greater improvement in AHI score than did those assigned to watchful waiting (Fig. S1 in the Supplementary Appendix). There was no interaction between the severity of the obstructive sleep apnea syndrome and treatment with respect to the attention and executive-function score on the NEPSY or the behavioral or symptom outcomes.

SUBGROUP ANALYSES

Substantial subgroup differences with regard to the normalization of polysomnographic findings were observed within each study group. Regardless of the assigned treatment, normalization of polysomnographic findings was seen less frequently in black children than in children of other races, in obese children than in nonobese children, and in children with a baseline AHI score above the median than in those with a baseline AHI score at or below the median (Fig. 2).

Figure 2. Normalization of Polysomnographic Findings.

Figure 2

The percentage of participants with normalization of polysomnographic findings (defined as an apnea–hypopnea index [AHI] score of <2 events per hour or an obstructive apnea index score of <1 event per hour) is shown for black children versus children of other races (Panel A), obese children versus nonobese children (Panel B), and children with baseline AHI scores above the median baseline level of 4.7 events per hour versus those with baseline AHI scores at or below the median baseline level (Panel C). Among children randomly assigned to early adenotonsillectomy, the incidence of normalized polysomnographic findings was significantly higher among children who were not black (P<0.001), children who were not obese (P<0.001), and children who had a baseline AHI score at or below the median value (P<0.05). Similarly, among children randomly assigned to watchful waiting, the incidence of normalized polysomnographic findings was significantly higher among nonblack children (P<0.05), nonobese children (P<0.001), and children with a baseline AHI score at or below the median value (P<0.001).

OTHER EXPLORATORY ANALYSES

No consistent relationships were observed between changes in AHI score and changes in other study outcomes, nor were substantive differences seen in models adjusted for additional covariates or restricted to children with low cognitive scores at baseline. The exclusion from the analyses of 24 children who did not receive the assigned intervention (8 children in the early-adenotonsillectomy group who declined surgery and 16 in the watchful-waiting group who underwent early surgery) did not yield any appreciable changes in the study results (see the Supplementary Appendix).

ADVERSE EVENTS

There were 15 serious adverse events after randomization, 6 of which occurred in children randomly assigned to early adenotonsillectomy and 9 in those assigned to watchful waiting (Table 3, and Results and Table S3 in the Supplementary Appendix). A total of 8 events were associated with perioperative complications, 3 of which occurred in children who were randomly assigned to watchful waiting but who had crossed over to surgery. Nine treatment failures were identified, all in the watchful-waiting group.

Table 3.

Adverse Events.*

Event Watchful
Waiting
(N = 203)
Early
Adenotonsillectomy
(N = 194)
no. of patients with event
Total 225 160

Tonsillar hemorrhage 1 2

Postoperative pain 0 3

Asthma 18 3

Lower respiratory tract illness 6 8

Upper respiratory tract or ear illness 90 67

Cough 11 15

Gastrointestinal tract illness 12 11

Dehydration 1 3

ADHD 4 3

Other infection 40 17

Hypersomnolence 2 1

Exacerbation of sleep-apnea symptoms 6 0

Other 34 27
*

Serious adverse events included tonsillar hemorrhage in three children randomly assigned to early adenotonsillectomy and in one child randomly assigned to watchful waiting who crossed over to early adenotonsillectomy; see Table S3 in the Supplementary Appendix for details. ADHD denotes attention deficit– hyperactivity disorder.

DISCUSSION

This large, randomized, controlled trial of therapy for the pediatric obstructive sleep apnea syndrome included rigorous assessments of cognitive and behavioral measurements, sleep apnea symptoms, and sleep. After a 7-month intervention period, school-age children with the obstructive sleep apnea syndrome without prolonged oxyhemoglobin desaturation who underwent surgery did not have significantly greater improvement in attention and executive function, as measured by means of neuropsychological testing, than did children in the watchful-waiting group. However, surgery resulted in greater reductions in symptoms and greater improvements in behavior, quality of life, and polysomnographic findings, with effect sizes in the moderate-to-large range. Polysomnographic findings were normalized in the majority of children (79%) in the early-adenotonsillectomy group, although polysomnographic abnormalities also resolved in 46% of the children randomly assigned to watchful waiting. Among obese children, those randomly assigned to early adenotonsillectomy had greater reductions in symptoms and greater improvement in behavioral and polysomnographic outcomes than did those in the watchful-waiting group.

Potential cognitive and behavioral effects of the obstructive sleep apnea syndrome are of major concern to parents of children with this syndrome.2230 The plausibility of the obstructive sleep apnea syndrome contributing to cognitive deficits is supported by research showing impaired learning in juvenile rats exposed to intermittent hypoxemia3 and by imaging studies showing cerebral neuronal injury in children with the syndrome.30 Previous studies have shown differences in the cognitive function of children with the obstructive sleep apnea syndrome, as compared with controls.19,27,31 However, baseline cognitive scores of children with the obstructive sleep apnea syndrome fell within the normal range, a finding similar to that in the current study. Cognitive and behavioral abnormalities have been shown to be reduced after adenotonsillectomy in some,2427,29 but not all,30 nonrandomized studies, with inconsistency in the reported effect after treatment.25,29,32 Previous studies have been limited by small samples, lack of randomization or appropriate controls, heterogeneous study groups, and sole reliance on parent questionnaires rather than including neuropsychological testing.

We observed no significant difference between the two groups in the change from base-line to follow-up in our primary outcome, the attention and executive-function score of the NEPSY; thus our trial is a negative one. However, other tests showed evidence of changes in behavior. There were greater improvements in the BRIEF, which assesses executive function and behavior on the basis of children’s performance in activities of daily living, among children randomly assigned to early adenotonsillectomy than among those assigned to watchful waiting. Since caregivers were aware of the intervention, it is possible that the improvements in the BRIEF score in the early-adenotonsillectomy group were influenced by parental expectations. Alternatively, the children treated with early adenotonsillectomy might have been better able to attend to tasks in less controlled, real-world settings (such as those in which the BRIEF scores were assessed) than in the closely supervised environment of the assessment of the NEPSY attention and executive-function domain, in which children directly interacted with a psychometrist. Results of previous research showing more robust effects on caregiver ratings of behavior than on psychometric tests are consistent with either possibility.33

There were also significantly larger improvements in the scores on the Conners’ Rating Scale, measuring restlessness and impulsiveness and emotional lability, in the early-adenotonsillectomy group than in the watchful-waiting group. The improvements, which were considered to be small to moderate in size, were observed in both the caregiver-reported and teacher-reported scores, suggesting that the positive behavioral change is due not solely to the influence of unblinded assessment or parental expectation.

The reduction in symptoms of the obstructive sleep apnea syndrome and improvement in quality of life in children treated surgically are noteworthy, owing to the importance of symptoms and health-related quality of life to patients and their families. Polysomnographic findings improved markedly after adenotonsillectomy, with more children in the early-adenotonsillectomy group than in the watchful-waiting group having normalization of polysomnographic findings (79% vs. 46%). Normalization of the polysomnographic findings in the large majority of children randomly assigned to early adenotonsillectomy is consistent with some,34,35 but not all,36,37 studies. Previous nonrandomized studies may have been limited by differential follow-up of more symptomatic children.

Nearly half the children in the watchful-waiting group showed normalization of the AHI score. This improvement may have been due to growth of the airway or regression of lymphoid tissue, routine medical care, or regression to the mean.

This study did not show a correlation between the severity of the obstructive sleep apnea syndrome, as measured by means of polysomnography, and neurobehavioral outcomes — a finding similar to that in other studies.15,25 The lack of correlation may be due to the influence of insufficient sleep or other environmental or genetic influences.

Studies on the efficacy of adenotonsillectomy in obese children have shown contradictory results.38 In CHAT, obese children in both groups had lower rates of normalization of polysomnographic findings than did nonobese children, although the prevalence of residual obstructive sleep apnea syndrome in the obese group postoperatively (33% of children) was lower than has been reported in some studies,5,38,39 possibly because CHAT was limited to preadolescents and excluded extremely obese patients. Polysomnographic findings, as well as symptoms and behavior, improved more with early adenotonsillectomy than with watchful waiting, however, in both obese and nonobese children. These results support a strategy of early adenotonsillectomy in both obese and nonobese children for the treatment of the physiological disturbances of the obstructive sleep apnea syndrome and associated symptoms, but they underscore the need to carefully follow obese children after surgery.

Black children have been reported to have more severe cases of the obstructive sleep apnea syndrome than white children4; we also observed more severe cases of this syndrome among black children at baseline. Black children in both groups had lower rates of normalization of polysomnographic findings than did children of other races but, similar to children of other races, had relatively greater improvement on polysomnographic findings with early adenotonsillectomy than with watchful waiting. However, early adenotonsillectomy, as compared with watchful waiting, was associated with less relative improvement in caregiver-reported measures of behavior and symptoms in black children than in children of other races. Differences persisted in analyses that were adjusted for obesity, baseline behavioral scores, and household income and in analyses that were restricted to children in whom the obstructive sleep apnea syndrome resolved. The reasons for this racial disparity are unclear. Possible explanations include differences in parents’ expectations, coping mechanisms, or perceptions of their child’s behavior and the presence of risk factors for behavioral problems unrelated to the obstructive sleep apnea syndrome.

Children with more severe cases of the obstructive sleep apnea syndrome showed larger absolute improvements in polysomnographic findings with early adenotonsillectomy than with watchful waiting, although, as a group, they were less likely to show normalization of the findings than were children with less severe cases. More severe cases of the obstructive sleep apnea syndrome were not associated with differences between the two groups in cognitive, behavioral, or symptom outcomes.

Surgery was associated with a low rate of perioperative complications; the treatment failure rate was also low but was limited to the watchful-waiting group. Thus, this trial supports the overall safety of both early adenotonsillectomy and watchful waiting but suggests the need for clinical monitoring of children who are being treated conservatively.

The strengths of this study include the large sample, randomized design, standardization of measurements, blinding of key personnel, wide geographic and racial representation, and high follow-up rates. Data from both caregivers and teachers provided independent behavioral assessments.

The study also had limitations. It did not include children younger than 5 years of age, in whom the obstructive sleep apnea syndrome is common. Since children who had prolonged oxyhemoglobin desaturation or who were taking medications for ADHD were excluded, the study results cannot be extrapolated to these vulnerable groups. It is possible that the follow-up period was not long enough to show the full response to surgery. There are insufficient longitudinal data to determine when recovery is maximal. It is possible that neurobehavioral sequelae related to sleepiness resolve quickly, whereas those related to hypoxemia may result from neuronal damage and take longer to resolve.

A total of 200 interaction tests were conducted, yielding 23 significant interactions; 10 significant interactions would be expected by chance. Therefore, the results of exploratory analyses must be viewed cautiously.

Among school-age children with the obstructive sleep apnea syndrome without prolonged oxyhemoglobin desaturation, early adenotonsillectomy, as compared with a strategy of watchful waiting with supportive care, did not result in significantly greater improvement in our prespecified primary outcome, scores on a formal test of attention and executive function after a period of 7 months. However, early adenotonsillectomy was associated with significant improvements in several other prespecified secondary outcomes, many with effect sizes considered to be moderate to large40 and thus likely to be clinically significant, including polysomnographic findings; caregiver-reported measures of executive function, behavior, and sleep apnea symptoms; and teacher reports of behavior. Beneficial effects of early adenotonsillectomy were observed in nonobese children as well as in obese children. Normalization of polysomnographic findings in a large number of children in the watchful-waiting group and an absence of significant cognitive decline in this group indicate that medical management and reassessment after a period of observation may be a valid therapeutic option.

Supplementary Material

Supplement1

Acknowledgments

Supported by grants (HL083075, HL083129, UL1 RR024134, and UL1 RR024989) from the National Institutes of Health.

Dr. Marcus reports receiving a loan of research equipment from Philips Respironics and Ventus Medical. Dr. Gozal reports receiving consulting fees from Galleon Pharmaceuticals. Dr. Hoban reports receiving payment for expert testimony for cases regarding pediatric neurology. Dr. Chervin reports serving as a board member for Sweet Dreamzzz and Pavad Medical; receiving stock options from Pavad Medical; receiving consulting fees from Arena Pharmaceuticals, Guidepoint Global, OrbiMed Advisors, Procter & Gamble, and Zansors; having patents and patents pending, owned by his institution, for signal-analysis algorithms relevant to diagnosis of sleep disorders and having a patent pending, owned by his institution, for a novel device to treat obstructive sleep apnea (none of these patents are currently licensed); and receiving gifts to his institution for educational purposes from Philips Respironics and Fisher and Paykel.

Disclosure forms provided by the authors are available with the full text of this article at NEJM.org.

We thank the Childhood Adenotonsillectomy Trial (CHAT) research staff, including Jean Arnold, Mary Ellen Carroll, Mary Anne Cornaglia, Beth Ann Compton, Casey Critchlow, Judith Emancipator, Melissa Fernando, Theresa Friederich, Amanda Goodman, Xiaoling Hou, Elise Hodges, Laurie Karamessinis, Kim Lacy, Megan McDougall, Daniel Mobley, Michelle Nicholson, Angela Orlando, Deborah L. Ruzicka, Gauri Sathe, Nancy Scott, Susan Surovec, Omarya Vega, Xingmei Wang, and Catherine Williams; the families of the children enrolled in the study; and the members of the data and safety monitoring board, including Lynn Taussig, M.D., (chair), Thomas Anders, M.D., Julie Buring, Sc.D., Karina Davidson, Ph.D., Estelle Gauda, M.D., Steven Piantadosi, M.D., Ph.D., Bennett Shaywitz, M.D., Benjamin Wilfond, M.D., Tucker Woodson, M.D., and Robert Zeiger, M.D.

APPENDIX

The authors’ affiliations are as follows: the Department of Pediatrics, Sleep Center (C.L.M.), Department of Child and Adolescent Psychiatry and Behavioral Sciences (N.H.T.), and Department of Otolaryngology (L.E.), Children’s Hospital of Philadelphia, and the Department of Biostatistics and Epidemiology, School of Medicine (R.H.M., S.S.E.), University of Pennsylvania, Philadelphia; the Department of Statistics, North Carolina State University, Raleigh (R.H.M.); the Departments of Pediatrics (C.L.R., H.G.T.) and Otolaryngology (R.C.S.), Rainbow Babies and Children’s Hospital, University Hospitals Case Medical Center, Case Western Reserve University, Cleveland; the Departments of Psychiatry and Psychology (B.G.), Pediatrics and Neurology (B.G., T.H., R.D.C.), and Otolaryngology (S.L.G.), University of Michigan, Ann Arbor; the Departments of Otolaryngology (R.B.M.) and Pediatrics (S.P., K.S.), Cardinal Glennon Children’s Medical Center, Saint Louis University, St. Louis; the Departments of Otolaryngology and Pediatrics, University of Texas Southwestern Medical Center, Dallas (R.B.M.); the Departments of Pediatrics (R. Amin, D.B.) and Otolaryngology (P.W.), Cincinnati Children’s Hospital Medical Center, Cincinnati; the Departments of Pediatrics (E.S.K.), Medicine and Psychiatry (J.W.), and Otorhinolaryn-gology (D.J.), Boston Children’s Hospital, and the Department of Medicine, Brigham and Women’s Hospital and Beth Israel Deaconess Medical Center, Harvard Medical School, Boston (S.R.); the Departments of Pediatrics (R. Arens, H.M.) and Otolaryngology–Head and Neck Surgery (J.P.B.), Children’s Hospital at Montefiore and Montefiore Medical Center, Albert Einstein College of Medicine, Bronx, NY; and the Department of Pediatrics, University of Chicago, Chicago (D.G.).

Footnotes

No other potential conflict of interest relevant to this article was reported.

REFERENCES

  • 1.Marcus CL, Brooks LJ, Draper KA, et al. Diagnosis and management of childhood obstructive sleep apnea syndrome. Pediatrics. 2012;130:576–584. doi: 10.1542/peds.2012-1671. [DOI] [PubMed] [Google Scholar]
  • 2.Bhattacharyya N, Lin HW. Changes and consistencies in the epidemiology of pediatric adenotonsillar surgery 1996–2006. Otolaryngol Head Neck Surg. 2010;143:680–684. doi: 10.1016/j.otohns.2010.06.918. [DOI] [PubMed] [Google Scholar]
  • 3.Row BW, Kheirandish L, Neville JJ, Gozal D. Impaired spatial learning and hyperactivity in developing rats exposed to intermittent hypoxia. Pediatr Res. 2002;52:449–453. doi: 10.1203/00006450-200209000-00024. [DOI] [PubMed] [Google Scholar]
  • 4.Redline S, Tishler PV, Schluchter M, Aylor J, Clark K, Graham G. Risk factors for sleep-disordered breathing in children: associations with obesity, race, and respiratory problems. Am J Respir Crit Care Med. 1999;159:1527–1532. doi: 10.1164/ajrccm.159.5.9809079. [DOI] [PubMed] [Google Scholar]
  • 5.Mitchell RB, Kelly J. Outcome of adeno-tonsillectomy for obstructive sleep apnea in obese and normal-weight children. Otolaryngol Head Neck Surg. 2007;137:43–48. doi: 10.1016/j.otohns.2007.03.028. [DOI] [PubMed] [Google Scholar]
  • 6.Redline S, Amin R, Beebe D, et al. The Childhood Adenotonsillectomy Trial (CHAT): rationale, design, and challenges of a randomized controlled trial evaluating a standard surgical procedure in a pediatric population. Sleep. 2011;34:1509–1517. doi: 10.5665/sleep.1388. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Marcus CL, Omlin KJ, Basinki DJ, et al. Normal polysomnographic values for children and adolescents. Am Rev Respir Dis. 1992;146:1235–1239. doi: 10.1164/ajrccm/146.5_Pt_1.1235. [DOI] [PubMed] [Google Scholar]
  • 8.Witmans MB, Keens TG. Davidson Ward SL, Marcus CL. Obstructive hypop-neas in children and adolescents: normal values. Am J Respir Crit Care Med. 2003;168:1540. doi: 10.1164/ajrccm.168.12.954. [DOI] [PubMed] [Google Scholar]
  • 9.Uliel S, Tauman R, Greenfeld M, Sivan Y. Normal polysomnographic respiratory values in children and adolescents. Chest. 2004;125:872–878. doi: 10.1378/chest.125.3.872. [DOI] [PubMed] [Google Scholar]
  • 10.Montgomery-Downs HE, O’Brien LM, Gulliver TE, Gozal D. Polysomnographic characteristics in normal preschool and early school-aged children. Pediatrics. 2006;117:741–753. doi: 10.1542/peds.2005-1067. [DOI] [PubMed] [Google Scholar]
  • 11.Korkman N, Kirk U, Kemp S. NEPSY: a developmental neuropsychological assessment manual. New York: Psychological Corporation; 1998. [Google Scholar]
  • 12.Conners CK. Conners Rating Scales — Revised Technical Manual. 5th ed. North Tonawanda, NY: Multi-Health Systems; 2001. [Google Scholar]
  • 13.Gioia GA, Isquith PK, Guy PK, Kenworthy L. Behavior Rating Inventory of Executive Function (BRIEF) Odessa, FL: Psychological Assessment Resources; 2000. [Google Scholar]
  • 14.Chervin RD, Hedger K, Dillon JE, Pituch KJ. Pediatric Sleep Questionnaire (PSQ): validity and reliability of scales for sleep-disordered breathing, snoring, sleepiness, and behavioral problems. Sleep Med. 2000;1:21–32. doi: 10.1016/s1389-9457(99)00009-x. [DOI] [PubMed] [Google Scholar]
  • 15.Melendres MC, Lutz JM, Rubin ED, Marcus CL. Daytime sleepiness and hy-peractivity in children with suspected sleep-disordered breathing. Pediatrics. 2004;114:768–775. doi: 10.1542/peds.2004-0730. [DOI] [PubMed] [Google Scholar]
  • 16.Varni JW, Seid M, Kurtin PS. PedsQL 4.0: reliability and validity of the Pediatric Quality of Life Inventory version 4.0 generic core scales in healthy and patient populations. Med Care. 2001;39:800–812. doi: 10.1097/00005650-200108000-00006. [DOI] [PubMed] [Google Scholar]
  • 17.Franco RA, Jr, Rosenfeld RM, Rao M. First place — resident clinical science award 1999: quality of life for children with obstructive sleep apnea. Otolaryngol Head Neck Surg. 2000;123:9–16. doi: 10.1067/mhn.2000.105254. [DOI] [PubMed] [Google Scholar]
  • 18.Elliott CD. Differential abilities scale: introductory and technical handbook. San Antonio: Harcourt Brace Jovanovich; 1990. [Google Scholar]
  • 19.Gottlieb DJ, Chase C, Vezina RM, et al. Sleep-disordered breathing symptoms are associated with poorer cognitive function in 5-year-old children. J Pediatr. 2004;145:458–464. doi: 10.1016/j.jpeds.2004.05.039. [DOI] [PubMed] [Google Scholar]
  • 20.Proc MI. version 9.3. Cary NC: SAS Institute; 2011. [Google Scholar]
  • 21.Varni JW, Burwinkle TM, Seid M, Skarr D. The PedsQL 4.0 as a pediatric population health measure: feasibility, reliability, and validity. Ambul Pediatr. 2003;3:329–341. doi: 10.1367/1539-4409(2003)003<0329:tpaapp>2.0.co;2. [DOI] [PubMed] [Google Scholar]
  • 22.Brouillette RT, Fernbach SK, Hunt CE. Obstructive sleep apnea in infants and children. J Pediatr. 1982;100:31–40. doi: 10.1016/s0022-3476(82)80231-x. [DOI] [PubMed] [Google Scholar]
  • 23.Guilleminault C, Korobkin R, Winkle R. A review of 50 children with obstructive sleep apnea syndrome. Lung. 1981;159:275–287. doi: 10.1007/BF02713925. [DOI] [PubMed] [Google Scholar]
  • 24.Gozal D. Sleep-disordered breathing and school performance in children. Pediatrics. 1998;102:616–620. doi: 10.1542/peds.102.3.616. [DOI] [PubMed] [Google Scholar]
  • 25.Chervin RD, Ruzicka DL, Giordani BJ, et al. Sleep-disordered breathing, behavior, and cognition in children before and after adenotonsillectomy. Pediatrics. 2006;117(4):e769–e778. doi: 10.1542/peds.2005-1837. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Montgomery-Downs HE, Crabtree VM, Gozal D. Cognition, sleep and respiration in at-risk children treated for obstructive sleep apnoea. Eur Respir J. 2005;25:336–342. doi: 10.1183/09031936.05.00082904. [DOI] [PubMed] [Google Scholar]
  • 27.O’Brien LM, Mervis CB, Holbrook CR, et al. Neurobehavioral correlates of sleep-disordered breathing in children. J Sleep Res. 2004;13:165–172. doi: 10.1111/j.1365-2869.2004.00395.x. [DOI] [PubMed] [Google Scholar]
  • 28.Friedman BC, Hendeles-Amitai A, Kozminsky E, et al. Adenotonsillectomy improves neurocognitive function in children with obstructive sleep apnea syndrome. Sleep. 2003;26:999–1005. doi: 10.1093/sleep/26.8.999. [DOI] [PubMed] [Google Scholar]
  • 29.Kohler MJ, Lushington K, van den Heuvel CJ, Martin J, Pamula Y, Kennedy D. Adenotonsillectomy and neurocognitive deficits in children with sleep disordered breathing. PLoS One. 2009;4(10):e7343. doi: 10.1371/journal.pone.0007343. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Halbower AC, Degaonkar M, Barker PB, et al. Childhood obstructive sleep apnea associates with neuropsychological deficits and neuronal brain injury. PLoS Med. 2006;3(8):e301. doi: 10.1371/journal.pmed.0030301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.O’Brien LM, Mervis CB, Holbrook CR, et al. Neurobehavioral implications of habitual snoring in children. Pediatrics. 2004;114:44–49. doi: 10.1542/peds.114.1.44. [DOI] [PubMed] [Google Scholar]
  • 32.Cortese S, Faraone SV, Konofal E, Lecendreux M. Sleep in children with attention-deficit/hyperactivity disorder: meta-analysis of subjective and objective studies. J Am Acad Child Adolesc Psychiatry. 2009;48:894–908. doi: 10.1097/CHI.0b013e3181ac09c9. [DOI] [PubMed] [Google Scholar]
  • 33.Emancipator JL, Storfer-Isser A, Taylor HG, et al. Variation of cognition and achievement with sleep-disordered breathing in full-term and preterm children. Arch Pediatr Adolesc Med. 2006;160:203–210. doi: 10.1001/archpedi.160.2.203. [DOI] [PubMed] [Google Scholar]
  • 34.Apostolidou MT, Alexopoulos EI, Chaidas K, et al. Obesity and persisting sleep apnea after adenotonsillectomy in Greek children. Chest. 2008;134:1149–1155. doi: 10.1378/chest.08-1056. [DOI] [PubMed] [Google Scholar]
  • 35.Brietzke SE, Gallagher D. The effectiveness of tonsillectomy and adenoidectomy in the treatment of pediatric obstructive sleep apnea/hypopnea syndrome: a meta-analysis. Otolaryngol Head Neck Surg. 2006;134:979–984. doi: 10.1016/j.otohns.2006.02.033. [DOI] [PubMed] [Google Scholar]
  • 36.Bhattacharjee R, Kheirandish-Gozal L, Spruyt K, et al. Adenotonsillectomy outcomes in treatment of obstructive sleep apnea in children: a multicenter retrospective study. Am J Respir Crit Care Med. 2010;182:676–683. doi: 10.1164/rccm.200912-1930OC. [DOI] [PubMed] [Google Scholar]
  • 37.Tauman R, Gulliver TE, Krishna J, et al. Persistence of obstructive sleep apnea syndrome in children after adenotonsillectomy. J Pediatr. 2006;149:803–808. doi: 10.1016/j.jpeds.2006.08.067. [DOI] [PubMed] [Google Scholar]
  • 38.Costa DJ, Mitchell R. Adenotonsillectomy for obstructive sleep apnea in obese children: a meta-analysis. Otolaryngol Head Neck Surg. 2009;140:455–460. doi: 10.1016/j.otohns.2008.12.038. [DOI] [PubMed] [Google Scholar]
  • 39.Mitchell RB, Kelly J. Adenotonsillectomy for obstructive sleep apnea in obese children. Otolaryngol Head Neck Surg. 2004;131:104–108. doi: 10.1016/j.otohns.2004.02.024. [DOI] [PubMed] [Google Scholar]
  • 40.Cohen J. Statistical power analysis for the behavioral sciences. 2nd ed. Hillsdale, NJ: Lawrence Erlbaum; 1988. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement1

RESOURCES