The emergence of coronavirus disease 2019 (COVID-19) prompted a surge in research activity. Funding bodies swiftly allocated resources to establish research infrastructures and partnerships to study the novel virus. The scientific community realigned existing research and launched new studies to define the clinical course of COVID-19 and identify therapeutic candidates. Overall, comparatively fewer studies were initiated in children relative to adults,1 in part due to the lower prevalence and disease morbidity recorded in pediatric populations. However, characterizing the disease in pediatric patients is critical to elucidate transmission dynamics, inform public health measures, and generate evidence on best practices for clinical care and therapeutic interventions. The life-threatening multisystem inflammatory syndrome further underscores the need for natural history studies and drug development in pediatric populations.2
Numerous observational studies have been established to collect information on pediatric patients with laboratory-confirmed COVID-19, including children in inpatient and outpatient settings, and those with specific underlying conditions (Table 1). These efforts have been driven largely by single institutions and research organizations. As would be expected during a pandemic response, there was initially limited opportunity to establish extensive regional or international collaborations, and there has been little coordination across existing initiatives. However, as we approach a steady state in morbidity and gain a better view of the long-term trajectory of the pandemic, it is important that we re-assess our approaches and take a deliberate and strategic approach to further resource investments. If the current fragmented approach is maintained, many studies will not yield the sample sizes needed to support robust analyses—especially as infection rates decline in certain areas—and resources will be wasted on inefficient enrollment and duplicative efforts. For example, as illustrated in Table 1, we identified at least three unique non-interventional studies focused on pediatric patients requiring critical care in hospitals in North America.
Table 1.
Study | Type | Scopea | Inclusion criteria | Capability for data sharing and reuse |
---|---|---|---|---|
Australian Neonatal COVID-19 Study | Registry | Reporting by designated clinicians in Australia | Mother/infant dyad hospitalized in perinatal period with suspected or laboratory-confirmed COVID-19 | Specified procedures for data sharing and reuse |
BPSU Study: Neonatal complications of COVID-19 (British Paediatric Surveillance Unit Study) | Surveillance study | Hospitals in the United Kingdom and Northern Ireland | Neonate (aged <29 days) hospitalized with confirmed COVID-19 or born to mother with confirmed infection | Limited access to data via application |
COVID-19 and Childhood Cancer Registry | Registry | Healthcare providers globally | Patients aged <19 years with malignancy or prior hematopoietic stem cell transplant | Not possible at this time |
COVID-19 Data: North American Pediatric ICUs | Dashboard | Hospitals in North America | Patients with confirmed COVID-19 and admitted to pediatric ICU | Not specified |
COVID-19 Pediatric Rheumatology Database | Registry | Sites globally | Patients aged <19 years with any type of pediatric rheumatic disease | Limited access to data via application |
COVID-19—service evaluation and audit | Surveillance study | Hospitals in England | Patients aged 29 days to <18 years hospitalized with confirmed COVID-19 | Not specified |
EPICENTRE | Registry | Hospitals globally with an active pediatric or neonatal ICU | Neonate (up to 30 days) or child (aged 30 days to <18 years) hospitalized with confirmed COVID-19 and requiring critical care or born to mother with confirmed infection | Limited access to data via application |
International COVID-19 PICU Collaborative | Research collaborative | Set of pediatric hospitals in North America | Patients aged <21 years with confirmed COVID-19 and admitted to pediatric ICU | Not specified |
National Registry for Surveillance and Epidemiology of Perinatal COVID-19 Infections | Registry | Hospitals in the US with newborn delivery services | Mother/infant dyad where mother has confirmed COVID-19 in the perinatal period | Not possible at this time |
Overcoming COVID-19 | Surveillance study | Pre-specified set of hospitals in the US | Patients aged <26 years hospitalized with confirmed COVID-19 | Not possible at this time |
PAN-COVID (Pregnancy And Neonatal outcomes for women with COVID-19) | Registry | Hospitals globally where clinician has registered to contribute data | Neonate of mother with suspected or confirmed COVID-19 | Not specified |
Pediatric COVID-19 Convalescent Plasma Use Registry Study | Registry | Hospitals in the US | Patients aged <18 years of age who received COVID-19 convalescent plasma transfusion | Not specified |
Pediatric COVID-19 US Registry | Registry | Sites in the US | Patients aged <21 years with confirmed COVID-19 | Limited access to data via application |
PERN COVID-19 study (Pediatric Emergency Research Network COVID-19 study) |
Prospective observational study | Pre-specified set of 50 emergency departments across 11 countries | Patients aged <18 years tested for COVID-19 in the emergency department | Not possible at this time |
Pediatric Intensive Care Audit Network (PICANet) | Surveillance | Pediatric ICUs within the UK | Patients aged <18 years with suspected or confirmed COVID-19 admitted to pediatric ICU | Limited access to data via application |
The PAWS-COVID-19 Registry (Pediatric AirWays ComplicationS COVID-19 Registry) |
Registry | Hospitals globally | Patients aged <19 years undergoing anesthesia during the pandemic, including cohorts with confirmed and suspected COVID-19 | Not specified |
WHO Global COVID-19 Clinical Platform: multisystem inflammatory syndrome (MIS) in children and adolescents | Registry | Hospitals globally | Patients aged <20 years suspected to have multisystem inflammatory syndrome temporally related to COVID-19 | Not specified |
COVID-19 coronavirus disease 2019, ICU intensive care unit.
aReflects eligibility or participation and data contribution based on available reports as of November 9, 2020.
Building on what has been accomplished to date, pediatric clinical data collected in studies could be aggregated to develop a larger, representative dataset from single countries as well as global patient populations. Sharing pediatric COVID-19 data and resources across existing studies would strengthen our research capacity during the current pandemic, including long-term study of natural history and identification of rare complications. This resource would not replace any individual studies but provide a collaborative entity to complement local efforts. As an illustration of this approach, we have adopted this method with the launch of the Repository of Aggregated Pediatric International Data on COVID-19 (RAPID-19).3 This infrastructure leverages contemporary informatics methodologies that efficiently pool data, as has been previously accomplished with routine electronic health record and patient cohort data.4 RAPID-19 aggregates de-identified clinical data on pediatric patients with laboratory-confirmed severe acute respiratory syndrome coronavirus 2 infection, curated across existing research activities. All data contributors are able to access the integrated dataset, which is maintained on a secure HIPAA and HITRUST compliant, cloud-based, Apache 2.0 open source platform (https://github.com/hms-dbmi/pic-sure-documentation). The infrastructure includes embedded analytic tools and export functionality, which are managed by an API-enabled user interface with access controls for individual investigations. A steering group with representation from contributing sites oversees access and export of patient-level data. To ensure efficient and equitable use of the dataset, a number of mechanisms are in place to provide full transparency on site participation, research activities, data use, and opportunities for collaboration and authorship.
One fundamental challenge inherent to collaborative networks is the need for data standardization across sites. This is a limitation we have encountered, as certain studies participating in the Repository collect different variables and use distinct definitions for data elements. This limits options for combining datasets or variables across studies. While not all variables will be amenable to harmonization, a core set of data elements can be standardized to support queries and analyses across healthcare sites and study networks. We aim to also promote standardization of new data collection and have made study documents and tools available at www.RAPID-19.org to support COVID-19 research by other investigator teams and to facilitate collaborative work.
Our experiences building the RAPID-19 Repository highlight key principles the scientific community should consider in the study of any pediatric condition or population.
First, as a research community, we must build consensus on the importance of sharing pediatric data and apply multi-institutional approaches as the default design. Research infrastructures and networks must implement contemporary informatics methodologies that support systematic computational approaches across research entities. At a minimum, all data should be FAIR: Findable, Accessible, Interoperable, and Reusable.5 Whenever possible, new studies should deploy standard definitions that have been established for many pediatric specialties, such as the Pediatric Terminology sets developed by the National Institute of Child Health and Human Development.6 Further, harmonized approaches to structuring datasets are also essential to data sharing and reuse, including common formats and representations (e.g., terminologies and coding schemes). Pediatric observational studies need to use existing data models, such as the Observational Medical Outcomes Partnership, the i2b2 framework, or the PCORNet Common Data Model to support transformation to a shared format amenable to reproducible analyses.
Second, pediatric studies must be developed with data sharing and combined analyses in mind, even if a prospective network is not possible or all relevant collaborations are not yet established at the outset of a study. Specifically, collaborative work must be considered in the study organization, design, and documentation to enable future partnerships. This includes using informed consent documents that specify sharing of de-identified patient information with external collaborators and allow secondary data use beyond primary research questions.7 Data use agreements for registries and other multi-institutional studies should be written to allow for additional sharing and research analyses outside of the original study, building in measures for embargo periods, country-specific laws for protection of personal and medical information, and appropriate authorship attributions. Study governance must also allow for flexibility to enable onboarding of new co-investigators and expansion of study infrastructure and oversight.
Third, participation in collaborative studies should be incentivized. This can be achieved, for example, by providing access to integrated datasets for approved investigators and by offering authorship opportunities, such as group authorship by default for all data contributors. Study committees and working groups should include representation across sites and networks, including steering committees governing policies and procedures for research activities. More broadly, academic institutions need to recognize data sharing and input to patient-level datasets as an important contribution to the research enterprise and endorse these, for example, in promotion and tenure criteria.8 Finally, resources for data sharing need to be built into grants and institutional research infrastructure to make them feasible and sustainable.
The COVID-19 pandemic has highlighted the fragmentation of pediatric research. Ideally, research involves prospective formation of multi-institutional networks, but absent such infrastructure, it is essential that pediatric data be collected with the potential for collaboration and reuse. The principles suggested in this Commentary are being applied already in other scientific domains, placing pediatric research at risk of being left behind. Insights gained from the pandemic should serve as a stimulus to advance pediatric research programs involving other conditions as well and pave the way for routine use of coordinated approaches and application of sophisticated methodologies for efficient and scalable data integration.
Acknowledgements
F.T.B.’s work was supported by a grant from the Burroughs Wellcome Fund and by the Harvard-MIT Center for Regulatory Science.
Author contributions
All authors contributed to the concepts and content of the manuscript, helped draft the manuscript, and approved the final version for publication.
Competing interests
The authors declare no competing interests.
Patient consent
Patient consent was not required for this work.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Hwang TJ, Randolph AG, Bourgeois FT. Inclusion of children in clinical trials of treatments for coronavirus disease 2019 (COVID-19) JAMA Pediatr. 2020;174:825–826. doi: 10.1001/jamapediatrics.2020.1888. [DOI] [PubMed] [Google Scholar]
- 2.Feldstein LR, et al. Multisystem inflammatory syndrome in U.S. children and adolescents. N. Engl. J. Med. 2020 doi: 10.1056/nejmoa2021680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Repository of Aggregated Pediatric International Data on COVID-19 (RAPID-19). https://www.rapid-19.org (2020). Accessed 23 Oct 2020.
- 4.Kothari C, et al. Phelan-McDermid syndrome data network: Integrating patient reported outcomes with clinical notes and curated genetic reports. Am. J. Med. Genet. B Neuropsychiatr. Genet. 2018 doi: 10.1002/ajmg.b.32579. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.GO FAIR. FAIR principles. https://www.go-fair.org/fair-principles/ (2020). Accessed 23 Oct 2020.
- 6.National Cancer Institute. Pediatric terminology. https://www.cancer.gov/research/resources/terminology/pediatric (2020). Accessed 23 Oct 2020.
- 7.Bourgeois FT, et al. Development of the precision link biobank at Boston Children’s Hospital: challenges and opportunities. J. Pers. Med. 2017;7:21. doi: 10.3390/jpm7040021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Rice DB, Raffoul H, Ioannidis JPA, Moher D. Academic criteria for promotion and tenure in biomedical sciences faculties: cross sectional analysis of international sample of universities. BMJ. 2020 doi: 10.1136/bmj.m2081. [DOI] [PMC free article] [PubMed] [Google Scholar]