Abstract
Purpose
This review addresses the importance of the prospective cohort design in large, unselected populations starting early in life for understanding the origins of childhood health disorders.
Recent findings
Cohort studies originating in healthy populations have contributed to great advances in health, especially in cardiovascular diseases, but have only recently been applied systematically to the study the origins of childhood disorders. Several large population-based pregnancy and/or birth cohorts have been developed in different parts of the world, and these are beginning to contribute to better understanding of the underlying causes of rare but important childhood disorders, such as autism. The ECHO program is distinct in leveraging and building upon 84 existing cohorts to prospectively investigate the role of early life exposures and underlying biological mechanisms in childhood health and disease, specifically perinatal conditions, obesity, neurodevelopmental disorders, asthma and related pulmonary disorders as well as optimum child health. ECHO is expected to comprise approximately 50,000 children. It is the first US study of this size and scope since the US Collaborative Perinatal Project of 1959–1966.
Summary
The ECHO project represents a new approach to cohort studies in childhood, efficiently making use of extant cohorts while adding new data collection elements that should permit novel insights into the underlying causes of several important pediatric conditions.
Keywords: birth cohort studies, prospective studies, disease etiology
1. Introduction
The study of disease, not surprisingly, has historically focused largely on the patient, the individual already affected by disease. In the search for causes, this approach was suitable for diseases whose onset closely followed the inciting exposure, as with many infectious diseases. But for chronic diseases, whose origins lie in exposures and behaviors operating years or decades before the onset of disease, study of the patient is unlikely to fully reveal the underlying causes.
Medicine was slow to come around to the idea that to fully investigate the origins of chronic diseases, or more precisely, diseases whose origins are remote in time from the onset of clinically recognizable symptoms, it is necessary to initiate studies in the pre-diseased population and track exposures as they happen, so as to observe the evolution of disease. That understanding was not to come to full fruition until nearly the second half of the 20th century. Cohort studies of disease that tracked healthy individuals over time until the onset of disease were extremely rare before the founding of the Framingham heart study in 1948.
It is perhaps not surprising that such studies were not mounted earlier, inasmuch as the size of the assembled study population and the duration of time required to study the evolution of disease are formidable obstacles, requiring substantial investment of resources. The essential ingredients for a successful cohort study of the origins of any disease are first, the commitment to assembling a population sufficiently large so that enough instances of the disease under investigation will appear and so that any important differences in risk of disease between groups ranging in hypothesized causal factors are unlikely to be due to chance. The second commitment is to have sufficient patience, in the face of publish-or-perish tenure committees in universities, to wait the requisite number of years before disease emerges and relevant findings can be published.
But that is not all that is required. All successful cohort studies of the origins of disease are embedded in well-defined populations from which results can be generalized. These need not be statistically ideal probability samples of the general population; while many of the best known large cohort studies are from defined geographical regions, several important cohort studies have been mounted in medical professionals, such as the Nurse’s Health Study,1 or the British Doctor’s Study,2 or other professional or industry groupings, such as civil servants.3 But cohort studies should not be based on participants selected in ways that make generalization risky —volunteers, for example, or — participants identified in medical settings, with some important exceptions, such as prenatal care settings that cover most pregnant women. Cohort studies need clear hypotheses in advance, as well as an a priori plan to collect the appropriate exposure information, whether through biospecimens, questionnaires, or clinical assessments, and to prospectively define the features of the diseases or conditions whose etiology and preventability motivates the study. Without specification of the condition or conditions of interest, power calculations are not possible; and without specification of the exposures of interest, one cannot obtain the key ingredient in prospective cohort studies, which is real-time assessment of exposures of interest. In studies starting with extant disease, (i.e. case control studies) exposure information can only be ascertained from what happened to be recorded earlier or remembered later. The prospective cohort study allows the investigators to carefully measure the hypothesized exposures long before their effects are manifest in the disease state.
2. Notable successes of prospective studies
The establishment of the Framingham Heart Study in 1948,4 a prospective cohort study of some 5,000 generally healthy citizens of Framingham, MA, with follow-up that continues to this day, was the first of a large number of such studies across the US and overseas. The contributions of these studies to public health cannot be overstated. This cohort study securely established the key risk factors for the West’s leading cause of death, cardiovascular disease, as elevated blood pressure, abnormalities of cholesterol and its fractions, cigarette smoking and glucose intolerance. Primary prevention of heart disease through reductions in smoking and adoption of less atherogenic diets, and secondary prevention through management of high blood pressure, diabetes and elevated cholesterol levels has been the major factor behind the gratifying reduction in US death rates from ischemic heart disease and stroke, which are now at levels 75% lower than when the Framingham Study was initiated.
The British Doctor’s study (cited above) was the first prospective study to clearly delineate the risks of cigarette smoking for lung cancer and heart disease and established a durable tradition of using medical professionals as cohorts to be followed, such as the contemporary Nurses’ Health Study. Medical professionals provide efficiencies in cohort assembly and follow-up, as do civil servants, who have contributed to several important new forms of understanding. The British Whitehall Study3 demonstrated the important role of social stratification, income and job stress on risk of cardiovascular disease. A cohort study initiated in Taiwanese civil servants 5 demonstrated that the small fraction of the population who were carriers of the Hepatitis B antigen accounted for nearly all cases of primary liver cancer, then one of the commonest cancers in Taiwan, Japan and China, and set the stage for primary prevention of cancer via immunization of newborns against Hepatitis B.
3. Origins of the cohort study starting at birth or in pregnancy
The first of the British National Birth Cohort studies was mounted in 1946,6 sampling one week of births in March of that year in England and Wales. Initially designed to study the medical and other costs of having a baby and the distribution of maternity services in the nation,7 subsets of the 1946 cohort (which numbered nearly 14,000 at birth) have been followed throughout life. This study was followed by two additional British birth cohorts, using the same one-week-of-births sampling frame in 1958 8 and 1970.9
The focus of the 1946 study was more social than biological, examining economic circumstances of parents in relation to a range of perinatal and child health outcomes particularly growth, school performance and cognitive functioning.10 The later cohorts focused on the impact of pregnancy medical conditions, largely based on information collected at birth. As the cohorts have aged, they have increasingly been used to study the early origins of adult disorders such as cardiovascular disease and cancer.11
In the US at about the same time as the second British National Birth Cohort study, two large cohort studies intended to study child health were established that had an advantage over the British birth cohorts in that they were initiated in pregnancy, but the disadvantage that they were selected from the prenatal clinics of large medical centers. The Child Health and Development Study (CHES)12 enrolled some 10,000 pregnancies in the Kaiser Permanente System in California, while the National Collaborative Perinatal Project (NCPP) enrolled some 50,000 pregnancies in 12 major urban medical centers, largely in the North East United States.13 Both were initiated in 1959. CHES is not diverse; its population is entirely Caucasian. The NCPP, by contrast, had many African-American and Latino participants. CHES, taking advantage of its single location, has been the source of a large number of spinoff studies, particularly focusing on adult disease.14 The NCPP followed children to age seven, and its most notable contributions were in child neurology. The benign nature of most uncomplicated febrile seizures was demonstrated by the NCPP,15 leading to a sharp decrease in the number of affected children treated with anticonvulsants, some of which had been demonstrated to impair learning capacity. The NCPP also showed that the etiology of cerebral palsy (CP) was more complicated than initially thought.16 Contrary to popular belief, the majority of children with CP had not been brain injured by the birth process, nor had they suffered from birth asphyxia.
4. Contemporary large pregnancy cohorts
Studies of the major sources of childhood morbidity require large sample sizes because important causes of childhood disability and handicap such as autism (1–2% of the population; severe intellectual disability (0.5% of the population) and CP (about 0.3% of live births). Birth defects are individually rare, and very early preterm birth is found in fewer than 1% of births. Such health outcomes are too few to be studied in cohorts of the size of Framingham. For this reason, recent child health studies have been much larger than most adult cohort studies of the past.
The Danish National Birth Cohort (DNBC) and the Norwegian Mother and Child Cohort study (MoBa) are the largest of several European efforts to mount meaningfully-scaled cohort studies beginning in pregnancy. Each enrolled about 100,000 relatively (but not completely) unselected pregnancies from routine prenatal care settings and followed them primarily in a passive manner, making use of linkage to the extensive population-based disease registries in both countries. Data from the two cohorts are now being combined to study CP, using record linkage to national registries, in a cohort of 200,000 births.17 Because the two national studies required surveying in the national language, recent immigrants were not included. Among important results emerging from MOBA and DNCB are the findings that peri-conceptional folate supplementation may prevent autism,18 that mobile phone use in pregnancy is unrelated to later child development,19 and that low maternal iodine intake is associated with delayed development.20
5. A pediatric cohort for 21st century research and prevention
In late 2015, when the National Institutes of Health announced the establishment of the seven-year ECHO program (https://www.nih.gov/echo) through various Funding Announcement Opportunities (FOAs), existing pediatric cohorts were envisioned as a central component. The overall aim was to prospectively investigate the role of early life exposures and underlying biological mechanisms in childhood health and disease, and the approach was novel — to form a consortium of established pediatric research cohorts to leverage already existing longitudinal data so that the collective efforts would, scientifically, have far more impact than the isolated studies on their own. The cohorts were, however, given the opportunity to expand as well as to extend their period of follow-up.
The ECHO Pediatric Cohort FOA plan outlined supporting “multiple, synergistic, longitudinal studies using existing study populations, called cohorts, to investigate environmental exposures — including physical, chemical, biological, social, behavioral, natural and built environments — on child health and development.” The exposure period was established as from before conception to age 5 years, with an explicit inclusion of the prenatal period, and in some cases, the preconceptional period. The outcomes of interest were identified as the following disorders of high public health impact: 1) Upper and lower airway disease, principally asthma; 2) Obesity; 3) Pre-, peri-, and postnatal problems (e.g., low birthweight); and 4) Neurodevelopmental disorders. A fifth outcome, positive health, was added later.
The Pediatric Centers and their cohorts were charged with investigating these outcomes using two parallel approaches: the development of new aims specific to each pediatric cohort for the local extant cohorts and their expansions; and the development of an ECHO-wide uniform data collection protocol to which all cohorts would adhere through new data collection and/or harmonization of data already collected. One year after ECHO was launched in the fall of 2016, the Pediatric Cohort component consists of awards to 35 centers that have 74 principal investigators (many centers have multiple principal investigators) and who follow 84 cohorts, with some centers aggregating as many as ten different cohorts. The ultimate sample size, including new recruitments to expand extant cohorts, is expected to reach about 50,000.
In the first year, cohorts have launched their individual cohort-specific research projects (ranging from analyses of breast milk for environmental toxins to task-based neuro-imaging to self-report indicators of positive health) while simultaneously developing the ECHO-wide protocol and related scientific policies (such as for publications and biospecimen utilization). The sample size, geographic, racial/ethnic diversity, the range of exposures, and the commitment to new methodology for data collection that characterize the ECHO Pediatric Cohorts are such that ECHO is poised to significantly add to information generated by earlier birth cohort studies: 36 states are represented in ECHO as well as two territories and districts (Washington, DC and Puerto Rico) and includes rural, urban, and suburban populations.
Numerous known or suspected categories of contributors to children’s health outcomes are being, or will be measured, including nutrition, physical activity, environmental toxicants, psychosocial stressors, and an array of omic technologies, with a particular focus on exposures that are remediable or susceptible to change, so that discoveries may be used to lead to improved child health. It is anticipated that the depth of measurement and the large sample size will permit meaningful studies of gene-environment interaction. Other goals that are expected to be reached include:
Clear clinical characterization of incident cases and pre-clinical stages of disease and the frequency of these disorders in the population.
Assessment of the influence of multi-level environmental exposures on the risk of childhood health outcomes.
Identification of the measurable normal human variation (of key exposures and key outcomes) in the population and across the lifespan studied.
A degree of generalizability of findings to the US population not found in single cohort studies.
Moreover, in line with newly promulgated NIH policy, data from ECHO will be publically available closely following collection and upload to central repositories (6–12 months, including results from bioassays).
Another key, as well as novel, aspect of the ECHO Pediatric Cohorts comes from the program-wide structure that integrates observational research with a clinical trials network. Specifically, as part of the ECHO program, established clinical investigators and their support teams working in academic research settings in Institutional Development Awards Program (IDeA) (this NIH program provides resources to 23 states and Puerto Rico with historically low levels of NIH funding) states will conduct pediatric clinical trials research, with priority given to ECHO’s outcome areas. This integration is consistent with recent initiatives in so called ‘translational epidemiology’, 21,22 which inverts medicine’s traditional bench-to-bedside information flow from bedside (i.e. from population health observations)-to-bench (i.e. to clinical trials and basic science research).
With ECHO’s integrated structure, cohort observations of contributors to poor health trajectories, as well as protections from them, will suggest interventions that can efficiently be tested in clinical trials with two useful outcomes: 1) underserved and rural children will benefit from potentially useful and new health interventions and 2) the rigor and methodology of the Randomized Control Trial can be leveraged to provide experimental control that tests the associations between removal of observed risk factors identified in ECHO and changes in health outcomes. In addition, since the Pediatric Cohorts will test, using the many disciplines of its investigators, an array of biomarkers and biological pathways for exposure effects on health (including, for example epigenetic and microbiome work) it is likely that findings will spark laboratory research into fundamental mechanisms.
Conclusion
Pregnancy and birth cohorts are expensive undertakings, justified, ultimately, if they produce scientific knowledge that can advance child health. The ECHO program of research uses extant cohorts, harmonizes data already collected across cohorts to the extent possible, encourages cohort investigators to collaborate in special studies of interest, and plans to collect new data in expanded cohorts in uniform ways. These efficiencies in study design, combined with the broad and diverse set of scientific interests found among study investigators and the plans to make the data available to all serious scientific researchers suggest that ECHO will be a rich source of useful, actionable information about child health for decades to come.
Bullet points.
The essential ingredients for a successful cohort study of the origins of any disease are large enough sample size to provide enough cases of the disease of interest, time for disease progression, and a cohort that reflects the population to which the findings are expected to apply.
Longitudinal cohort studies that track healthy individuals over time until the onset of disease, such as the U.S. Framingham Heart Study, were transformative in securely establishing, as key risk factors cardiovascular disease, elevated blood pressure, abnormalities of cholesterol and its fractions, cigarette smoking and glucose intolerance.
Pediatric cohort studies initiated in pregnancy have identified the benign nature of most uncomplicated febrile seizures, leading to reduction in anticonvulsant use, and showing that CP is not solely a result of brain injury via the birth process.
The ECHO program is distinct in leveraging and building upon 84 existing cohorts to comprise approximately 50,000 children and prospectively investigate the role of early life exposures and underlying biological mechanisms in childhood health and disease, specifically perinatal conditions, obesity, neurodevelopmental disorders, asthma and related pulmonary disorders as well as optimum child health.
In line with newly promulgated NIH policy, data from ECHO will be publically available closely following collection and upload to central repositories thus ensuring that ECHO will be a rich source of useful, actionable information about child health for decades to come.
Acknowledgments
We would like to thank the PIs and co-PIs of our ECHO projects in New York, Puerto Rico and Michigan: Cristiane Duarte, Glorisa Canino, Jonathan Posner, John Vena, Ronald Wapner, Kelly Hunt, Douglas Ruden, Michael Elliott, Glenn Copeland and Charles Barone.
Financial support and sponsorship: This work was supported by the National Institutes of Health
Footnotes
Conflicts of Interest: None
Contributor Information
Nigel Paneth, Departments of Epidemiology & Biostatistics and Pediatrics & Human Development, Michigan State University, College of Human Medicine, 909 Fee Road, East Lansing, MI 48824.
Catherine Monk, Departments of Psychiatry & Obstetrics & Gynecology, Columbia University Medical Center, New York State Psychiatric Institute, 630 West 168th Street, PH1540H (for mail), New York, NY 10032.
References
- 1.Belanger CF, Hennekens CH, Rosner B, et al. The nurses’ health study. Am J Nurs. 1978;78:1039–40. [PubMed] [Google Scholar]
- 2.Doll R, Hill AB. Mortality in relation to smoking: ten years’ observations of British doctors. Br Med J. 1964 May 30;:1399–410. doi: 10.1136/bmj.1.5395.1399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Marmot M, Brunner E. Cohort Profile: The Whitehall II study. Int J Epidemiol. 2005;34:251–6. doi: 10.1093/ije/dyh372. [DOI] [PubMed] [Google Scholar]
- 4.Dawber TR, Meadors GF, Moore FE. Epidemiological approaches to heart disease: the Framingham Study. Am J Pub Health. 1951;41:279–81. doi: 10.2105/ajph.41.3.279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Beasley RP. Hepatitis B virus. The major etiology of hepatocellular carcinoma. Cancer. 1988;61(10):1942–56. doi: 10.1002/1097-0142(19880515)61:10<1942::aid-cncr2820611003>3.0.co;2-j. [DOI] [PubMed] [Google Scholar]
- 6.Wadsworth M, Kuh D, Richards M, et al. Cohort Profile: The 1946 National Birth Cohort (MRC National Survey of Health and Development) Int J Epidemiol. 2006;35:49–54. doi: 10.1093/ije/dyi201. [DOI] [PubMed] [Google Scholar]
- 7.Douglas JWB. Social and economic problems of child-bearing; a proposed inquiry. Mother Child. 1946;16:160–3. [PubMed] [Google Scholar]
- 8.Power C, Elliott J. Cohort profile: 1958 British birth cohort (National Child Development Study) Int J Epidemiol. 2006;35:34–41. doi: 10.1093/ije/dyi183. [DOI] [PubMed] [Google Scholar]
- 9.Elliott J, Shepherd P. Cohort Profile: 1970 British Birth Cohort (BCS70) Int J Epidemiol. 2006;35:836–843. doi: 10.1093/ije/dyl174. [DOI] [PubMed] [Google Scholar]
- 10.Douglas JWB. The Home and the School. London: MacGibbon and Kee; 1964. [Google Scholar]
- 11.Hardy R, Wadsworth MEJ, Langenberg C, et al. Birth weight, childhood growth and blood pressure at 43 years in a British birth cohort. Int J Epidemiol. 2004;33:121–29. doi: 10.1093/ije/dyh027. [DOI] [PubMed] [Google Scholar]
- 12.Van den Berg BJ, Yerushalmy J. The relationship of the rate of intrauterine growth of infants of low birth weight to mortality, morbidity, and congenital anomalies. J Pediatr. 1966;69:531–45. doi: 10.1016/s0022-3476(66)80038-0. [DOI] [PubMed] [Google Scholar]
- 13.Klebanoff ML. The Collaborative Perinatal Project: A 50-year retrospective. Paed Perinat Epidemiol. 2009;23:2–8. doi: 10.1111/j.1365-3016.2008.00984.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Cohn BA, Cirillo PM, Hopper BR, et al. Third Trimester Estrogens and Maternal Breast Cancer: Prospective Evidence. J Clin Endocrinol Metab. 2017;102:3739–3748. doi: 10.1210/jc.2016-3476. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Nelson KB, Ellenberg JH. Predictors of epilepsy in children who have experienced febrile seizures. N Engl J Med. 1976;295:1029–33. doi: 10.1056/NEJM197611042951901. [DOI] [PubMed] [Google Scholar]
- 16.Nelson KB, Ellenberg JH. Antecedents of cerebral palsy. Multivariate analysis of risk. N Engl J Med. 1986;315:81–6. doi: 10.1056/NEJM198607103150202. [DOI] [PubMed] [Google Scholar]
- 17•.Tollånes MC, Strandberg-Larsen K, Forthun I, et al. Cohort profile: cerebral palsy in the Norwegian and Danish birth cohorts (MOBAND-CP) BMJ Open. 2016;6:e012777. doi: 10.1136/bmjopen-2016-012777. This paper describes the creation of the MOBAND-CP cohort, a pregnancy cohort of more than 210,000 children, among whom 438 children with CP were located in the Danish and Norwegian CP registries. Biological samples and survey data were collected in pregnancy and can now be examined for a wide range of potential etiologies in the largest prospective cohort study of CP ever mounted. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Surén P, Roth C, Bresnahan M, et al. Association between maternal use of folic acid supplements and risk of autism spectrum disorders in children. JAMA. 2013;309:570–7. doi: 10.1001/jama.2012.155925. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Divan HA, Kheifets L, Olsen J, et al. Prenatal cell phone use and developmental milestone delays among infants. Scand J Work Environ Health. 2011;37:341–8. doi: 10.5271/sjweh.3157. [DOI] [PubMed] [Google Scholar]
- 20•.Abel MH, Caspersen IH, Meltzer HM, et al. Suboptimal Maternal Iodine Intake Is Associated with Impaired Child Neurodevelopment at 3 Years of Age in the Norwegian Mother and Child Cohort Study. J Nutr. 2017;147:1314–1324. doi: 10.3945/jn.117.250456. This study of nearly 50,000 mother-child pairs recruited during pregnancy found that low maternal iodine intake in pregnancy, assessed by a validated food-frequency questionnaire, was associated with child language delay, externalizing and internalizing behavior problems and fine motor skills. The importance of this observation is underlined by the finding, from NHANES data, that iodine levels in women of childbearing age have dropped significantly in the US in recent years. [DOI] [PubMed] [Google Scholar]
- 21.Khoury MJ, Gwinn M, Ioannidis JP. The emergence of translational epidemiology: from scientific discovery to population health impact. Am J Epidemiol. 2010;172:517–24. doi: 10.1093/aje/kwq211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Weissman MM, Brown AS, Talati A. Translational epidemiology in psychiatry: linking population to clinical and basic sciences. Arch Gen Psychiatry. 2011;68:600–8. doi: 10.1001/archgenpsychiatry.2011.47. [DOI] [PMC free article] [PubMed] [Google Scholar]