Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2026 Feb 1.
Published in final edited form as: Curr Opin Pediatr. 2024 Nov 4;37(1):27–33. doi: 10.1097/MOP.0000000000001415

Linking household members and defining relational networks using routine health data

Jeffrey I Campbell 1, Ana Poblacion 2, Richard Sheward 3
PMCID: PMC11659028  NIHMSID: NIHMS2031757  PMID: 39509188

Abstract

Purpose of review:

The growth of rich electronic health record (EHR) data and large health databases has introduced new opportunities to link individuals together into households and relational networks. These “linked relational networks” hold promise for providing family-level care and studying intergenerational epidemiology and clinical outcomes. However, as linked relational networks become more commonly available in EHRs and research databases, it is critical to understand their challenges and limitations.

Recent findings:

Matching algorithms are being used to create linked relational networks in EHR and health databases. Clinically, these algorithms have been most useful to provide dyadic maternal-newborn care. In research, studies using these algorithms investigate topics ranging from the pharmacoepidemiology of parental drug exposure on childhood health outcomes, to heritability of chronic conditions, to associations between parental and child healthcare access and service delivery. However, ethical and technical challenges continue to limit use of these algorithms. There is also a critical research gap in the external validity of these matching algorithms.

Summary:

Linked relational networks are in widespread use in pediatric clinical care and research. More research is needed to understand the scope, limitations, and biases inherent in existing matching strategies.

Keywords: Household, Family, Caregiver, Relational network, Electronic health record, Pharmacoepidemiology

Introduction

Delivering family-centered care has long been a goal of clinicians and health organizations. As increasing volumes of health data have been electronically recorded, health information technology (HIT) strategies emerged to create linked databases. These linked databases achieve matching by using sets of unique identifier data to relate individuals or groups across multiple databases or clinical records.

An extension of database linking is the ability to connect records and data of one individual to the records and/or data of other individuals. The network of linked individuals—what we call a “linked relational network”—can exist within electronic health records (EHR), claims databases, and epidemiologic datasets. Generating linked relational networks has introduced new opportunities and challenges in pediatric and family-centered care. As linking strategies become more prevalent and sophisticated, it is important for clinicians and researchers to understand what these tools are, the expanding ways in which they are used, and their limitations.

For this review we adopt a definition of a linked relational network as the set of individuals who can be linked by shared identifying attributes that: 1) are routinely collected for the provision of healthcare, 2) can specify a unique network of two or more individuals, and 3) that plausibly mirror established real-world relationships. Surveying seminal and recent literature, we highlight algorithms that identify linked relational networks, and scientific advances that these algorithms have enabled. We discuss how these algorithms have been and likely will be used in clinical care, and their uses in epidemiology and health policy research (Table 1). Finally, we discuss important ethical considerations surrounding the use of these algorithms.

Table 1.

Applications of linked relational networks using health and administrative data in clinical care, policy research, and clinical and epidemiological research.

Clinical care
 • EHR maternal-neonatal dyad hyperlinks and data piping between clinical records enables newborn care
Policy research
 • Family insurance numbers enable evaluation of multiplier effect of enrolling individuals in insurance
Clinical and epidemiology research
 • Parent-child links enable evaluation of:
  ⚬ Intergenerational prevalence and incidence of chronic conditions
  ⚬ Association between parental preventive care and child well child care
  ⚬ Evaluation of co-location of clinical care services
  ⚬ Associations between parent and child vaccine uptake
  ⚬ Effects of maternal and paternal antenatal medication exposure and neonatal/childhood outcomes
 • Multigenerational links among large family networks enable study of heritable conditions in clinical databases
 • Family insurance identifiers facilitate study of effects of surgery and medication prescribing on spousal mental health and medication receipt
 • Linkages between health and multi-generational demographic/administrative databases enables sibling comparison quasi-experimental cohort designs

Generating linked relational networks

Methods to link individuals within health records and databases can be categorized as deterministic—requiring exact data matches to establish links—and probabilistic—using a computed probability that fully- or partially-matching data represent a link (Figure 1).[1] To date, most linking algorithms have used deterministic matches. In clinical practice, the certainty and uniqueness of deterministic matching is critical so that accurate information can be correctly routed to clinicians. For example, in the EHR deterministic matching ensures that neonates are accurately linked to their mothers to facilitate newborn care. In administrative and research databases, deterministic matching similarly ensures linking accuracy by using specific identifiers such as phone numbers, addresses,[2, 3] emergency contact, and/or payment guarantor information.[4] Within claims databases, these identifiers include shared insurance numbers and family identification numbers.[2, 47] Computed data can also be deterministically matched to establish linked relational networks. For instance, address geocodes can be matched to ascertain linked relational networks based on residence. Recent literature has described how combinations of data elements (e.g., phone number plus address plus insurance information) can increase sensitivity of matching algorithms.[2, 8, 9]

Figure 1.

Figure 1.

Established and future relational network linkages using health information technology and data collected in health records and databases. Created in BioRender. Campbell, J. (2024) BioRender.com/k51o237.

A limitation of deterministic matching is that relational ties between individuals in a database or clinical record may be overlooked due to discrepancies in data, such as misspellings of addresses or names. Thus, there has been increasing interest in using probabilistic matching algorithms to infer linked relational networks. A recent study of pregnancy-offspring outcomes conducted within the OCHIN Accelerating Data Value Across a National Community Health Center Network (ADVANCE) database leveraged free text data to establish probabilistic links between children’s and mothers’ records.[9] Here, the investigators matched free text emergency contact names within a database of candidate contacts, and added specificity by matching phone numbers, street addresses, and zip codes. Each step required only a partial data match, but the overall strategy generated precise mother-child linkages that could be used for epidemiologic research.

The types of relationships that HIT-based linking strategies create are dependent upon available data. Algorithms using individuals’ addresses will generate networks that mirror the classical definition of a “household”. Meanwhile, links based on insurance guarantor or emergency contact phone numbers will resemble networks of social support, which do not need to be (and often are not) based on cohabitation.

Linked relational networks in clinical care

To date, clinical applications of linking technologies have been primarily developed for maternal-neonatal care settings. Several large EHR systems contain perinatal care applications in which maternal and delivery data from maternal records are automatically piped to a neonate’s record. Integrated clinical platforms that include linked maternal-neonatal records have shown to improve neonatal care documentation and billing.[10, 11] Meanwhile, the absence of linked maternal-neonatal records has been identified as a barrier to ensuring accurate postnatal diagnosis of congenital illnesses.[12]

The routine linkage of maternal-neonatal records expands the amount and variety of data available for novel clinical tools in newborn care. For example, Gerard and colleagues used linked maternal-infant records and machine learning to develop predictive models of neonatal hypoglycemia.[13] In their analysis, an identifier was used to specify maternal-neonatal dyads, enabling incorporation of maternal medical history and comorbidity variables in candidate predictive models. These linked data will likely be valuable as machine learning and similar methods are used to identify patterns in large clinical datasets that can be translated to the bedside.

Scant literature describes implementation of linkage techniques for clinical care beyond the neonatal period. Clinical tools that link caregiver-child dyads as well as siblings in the EHR have several potential applications. One potential application is a “family health record”, which would share clinical information of family members between multiple clinicians and locations, thereby improving family-centered care.[14] In public health, linked relational networks could facilitate provision of infection control and prophylaxis to at risk household members after exposure to transmissible infectious diseases. Finally, if embedded widely in the EHR, linked relational networks might decrease patient and care navigator burden when households, not individuals, are screened for household-level social needs. Despite benefits, technical and ethical hurdles may hinder use of health data-defined relational networks beyond the newborn period, as it will be discussed below. To mitigate challenges, it will be important for patients and clinicians to provide input and ideally co-create clinical linked relational network tools to ensure acceptability.

Linked relational networks in policy research

The use of family identifier numbers in claims databases has enabled evaluation of the effects of health policy interventions within linked relational networks. In a seminal study, DeVoe and colleagues used household identification numbers within the Oregon Health Plan (Oregon’s Medicaid program) and emergency contact and guarantor linkages within the OCHIN EHR database[4] to generate a cohort of caregiver and linked children.[15] Then, investigators examined the association between caregiver insurance enrollment and subsequent enrollment of their children in the health plan, finding increased enrollment of children when their parents had been selected to enroll in the health plan. This finding indicates that policies targeting adult coverage can have ripple effects on children’s access to healthcare, while illustrating how linked relational networks can be used to study health policy effects within families.

Innovative uses of linked relational networks in clinical research and epidemiology

The most common use of linked relational networks in research databases has been to study clinical and epidemiologic relationships within network members. Many of these studies use data elements intentionally designed to specify family units—in particular, family identification numbers within insurance databases. Other studies have used innovative linkages of data elements that were not initially designed to specify relational networks, such as emergency contact information, address geocodes, and insurance guarantor data.

Research using unique, linked identification numbers in administrative databases

Studies have used established family identifiers in insurance and administrative databases to examine clinical outcomes and epidemiology within relational networks. For instance, several insurance database include a “family identifier number” that can be used to link family members who share insurance plans (notably, Medicaid removed this identifier number in 2014).[9] The archetypical use of these unique identifiers are studies linking mothers to neonates.[1618] Recent research has sought to expand the sample sizes of these studies to enable population-level assessments. A key example is a recent population-wide study on the effects of maternal disability on children’s receipt of well child care using linked clinical data from dyads mother-child born in-hospital in the Ontario (Canada) MOMBABY dataset.[19] Likewise, strategies to create maternal-neonatal relational network links within the widely-used Pediatric Health Information System database have recently been published.[20]

Another key advance in this field has been the use of a broad array of relational ties that can be defined using family identification numbers, expanding linked relational networks beyond maternal-neonatal dyads. In a retrospective, claims-based analysis using the OptumLabs® Data Warehouse, Meserve and colleagues used an insurance family identifier number to link expectant fathers to neonates.[21] The investigators examined whether paternal receipt of immunomodulators for immune mediated inflammatory diseases was associated with adverse birth outcomes, finding no association. Other studies have linked adults to other adults,[22, 23] illustrating how family identifier numbers in insurance databases can be used to understand family-level clinical and epidemiologic outcomes.

Unique identifier numbers have also enabled creation of linked relational networks across multiple health-related, administrative, and demographic databases. In a systematic review of the association between paternal medication prescriptions and linked neonatal outcomes, Gaitonde and colleagues used national patient registries that enabled deterministic linkages with birth records and other administrative databases.[24] Similarly, Chen and colleagues analyzed the relationship between experiencing a traumatic event and subsequent development of a psychiatric disorder, using a comparison cohort comprised of siblings who had not experienced a traumatic event.[25] This sibling network enabled the investigators to control for potential confounding by unmeasurable familial factors. To create the sibling comparison cohort, the authors linked individual and family-level identifiers across the Swedish Patient Register and Swedish Multi-Generation Register. This study illustrates how a sibling comparison quasi-experimental study design can be implemented at a population level when data in registries can be linked.[26]

Novel linkage strategies to define relational networks in clinical research and epidemiology

While unique identifier numbers facilitate creation of relational networks across clinical and research databases, many databases do not contain such numbers. This data limitation has led investigators to explore the use of other deterministic and probabilistic linking methods, often using more than one identifier common to members of the linked relational network.

One of the first research databases to implement these linkages for clinical and epidemiologic studies was the OCHIN research database.[4, 8] Within the database, investigators determined caregiver-child relational networks using common emergency contact records and/or guarantor fields of patients who were labeled in the relevant field as the children’s mother or father and who were 12–55 years older than the child to examine health services delivery within the specified networks.[2729] Notably, in one study in the OCHIN database, researchers were only able to link 33% of children with a parent.[8] Of these, 82% were linked to a mother only, 14% to a father only, and 4% to both. Most families consisted of only one linked child (61%). These statistics indicate potential limitations and gaps in using health record data to define relational networks that comprehensively reflect family or household membership.

Others have added data elements to matching algorithms to improve the yield of linkage generation. The ongoing PReventing Obesity through healthy Maternal gestational weight gain In the Safety nEt (PROMISE) Study uses a combination of deterministic and probabilistic matching to capture maternal-neonatal dyads, which it leverages to examine relationships between antenatal weight gain and neonatal outcomes in the OCHIN database.[9] This study uses deterministic guarantor, emergency contact, geocode, and phone number matching, plus probabilistic “fuzzy matching” of a variety of data elements to establish linkages. Similarly, to investigate the impact of prenatal drug exposure on neonatal and pediatric outcomes, Weaver and colleagues developed an algorithm using mother’s sex, age, pregnancy with a live birth, children’s age, child’s birthdate, insurance identifier number, and insurance coverage time to link mothers and neonates in two large claims datasets.[30] The use of multiple variables resulted in a high linkage rate. Linked mothers and infants comprised 73.6% of all mothers and 49.1% of all infants in these databases, respectively. Heerman and colleagues used insurance (common plan identification), child’s identification (listed on maternal records), address (common geocoded location), home phone, and emergency contact information variables to link mother-child dyads while investigating the intergenerational impact of antibiotic treatment during pregnancy on child obesity at 5 years.[31]

While most relational networks in this literature have been used to study clinical outcomes and epidemiology, investigators in New York City leveraged multigenerational networks established with health record linking to study disease heritability and simulate large-scale observational genetics studies.[32] Investigators developed a method create linked relational networks for a subset of patients within two large healthcare systems. This algorithm—named relationship inference from the electronic health record (RIFTEHR)—used emergency contact name, ZIP code, and phone number matching, along with relationships specified in emergency contact fields in the EHR. This method was used to infer 7.4 million relationships in this clinical database. This approach has been subsequently adapted to other healthcare systems to understand multigenerational patterns of obesity.[33]

Ethical challenges in generating and using linked relational networks

The principal challenge that linkage strategies raise is potential breaches of privacy. In 2020, passage of the 21st Century Cures Act required health systems to provide patients with access to their EHR information.[34] When data from linked relational network members are piped between clinical charts, private health information may be unintentionally shared between patients. Lamar and colleagues report an instance in which a history of intimate partner violence, legal information, and a safety plan documented in a mother’s chart was automatically transferred to the chart of her newborn and thereby became visible to the newborn’s father, who was also the perpetrator.[35] EHR systems may contain private information-blocking features, but in this case these systems had not been activated. Other potential protections could include restricting access to relational network data to clinicians only. Several questions surrounding consent have also not been addressed because clinical applications to date have focused on maternal-newborn dyads. If clinical applications expand to other relationships (e.g., connecting adults to other adults), should individuals who could be linked be informed and given the option to opt in or out? Should consent be reviewed periodically in case relationships have changed? Or should these linkages be automatic or covered under blanket privacy clauses signed by patients when they enroll in clinical care? Addressing these questions will be important to ensure ethical standards are maintained and that potential clinical tools are acceptable to patients.

When used in research, linked relational networks may exacerbate disparities in cases of incomplete or inaccurate data. English and colleagues developed a linkage algorithm based on insurance identification numbers, phone numbers, and addresses, and found that patients with lower socioeconomic status had lower chances of having a linkage.[2] This difference may have been due to higher absence of family identifier numbers in Medicaid insurance plans than commercial insurance, due to frequent changes in phone numbers, multiple moves, or due to residence in multi-unit buildings where a shared addresses were flagged as false-positive linkages.

To date, the sensitivity and specificity of health data-linked relational networks has been validated using other health record data (e.g., using maternal-child dyadic links in an EHR to understand performance of multiple-variable linkage strategies). However, no external gold standard, such as direct patient interviews, has been developed to understand potential limitations or systemic errors in these algorithms. Performing external validations will be critical for understanding the scope and limitations of linked relational networks. External validation will also be important for uncovering systemic biases in these algorithms’ abilities to define clinically and scientifically relevant networks. In pediatric care and research, it is important to perform these external validations to understand how well linked relational networks in health data map to real-world relationships that are important for child health and development, such as caregivers, parents, and siblings.

Conclusions and implications for practice

Linked relational networks are widely used in pediatric and family-centered research and clinical care. They demonstrate how health information technology and data science have leveraged large and complex health databases, offering new opportunities for family-centered care, epidemiological research, and policy recommendations. As these technologies evolve, there is potential for broader applications in family-centered care, such as streamlining social needs screening or enabling household-level infectious disease control. Linked relational also enable novel research designs, such as sibling comparison studies and multi-generational analyses of health outcomes. Key challenges include potential privacy concerns, questions surrounding consent, and lack of external validation of linkage algorithms. Those utilizing these networks should be aware of their strengths and limitations, and more research is needed to understand the scope, limitations, and biases inherent in existing matching strategies.

KEY POINTS.

  • Matching algorithms are widely used to link individuals to each other in electronic health records, and in research and administrative databases, forming what we call “linked relational networks”.

  • Linked relational networks enable novel opportunities for family-level care, and to understand disease epidemiology and health service delivery within interpersonal networks at a large scale.

  • As new applications of these algorithms emerge, it will be important for clinicians and researchers to understand the current limitations of linking, and the potential ethical challenges that use of these algorithms create.

  • More research is needed to demonstrate the external validity of these matching algorithms.

Funding:

Dr. Campbell was supported by NIH/NIAID K23AI186597 and by the Boston University Clinical and Translational Sciences Institute (NIH/NCATS UL1TR001430).

Footnotes

Conflicts of Interest: All authors declare no conflicts.

Contributor Information

Jeffrey I. Campbell, Boston University Chobanian & Avedisian School of Medicine.

Ana Poblacion, Boston University Chobanian & Avedisian School of Medicine.

Richard Sheward, Boston University Chobanian & Avedisian School of Medicine.

REFERENCES

  • 1.Cuccaro-Alamin S, Eastman AL, Foust R, McCroskey J, Nghiem HT, Putnam-Hornstein E. Strategies for constructing household and family units with linked administrative records. Children and Youth Services Review. 2021;120:105706. [Google Scholar]
  • 2.English TM, Kinney RL, Davis MJ, et al. Identification of Relationships Between Patients Through Elements in a Data Warehouse Using the Familial, Associational, and Incidental Relationship (FAIR) Initiative: A Pilot Study. JMIR Med Inform. 2015;3(1):e9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Brady KJS, Legler A, Adams WG. Exploring Opportunities for Household-Level Chronic Care Management Using Linked Electronic Health Records of Adults and Children: A Retrospective Cohort Study. Maternal and child health journal. 2020;24(7):829–36. Epub 2020/05/11. [DOI] [PubMed] [Google Scholar]
  • 4.Angier H, Gold R, Crawford C, et al. Linkage methods for connecting children with parents in electronic health record and state public health insurance data. Maternal and child health journal. 2014;18(9):2025–33. Epub 2014/02/25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Bertoia ML, Phiri K, Clifford CR, et al. Identification of pregnancies and infants within a US commercial healthcare administrative claims database. Pharmacoepidemiology and Drug Safety. 2022;31(8):863–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Chung MK, Hart B, Santillana M, Patel CJ. Pediatric and Young Adult Household Transmission of the Initial Waves of SARS-CoV-2 in the United States: Administrative Claims Study. Journal of medical Internet research. 2024;26:e44249. ** Chung 2024: This study used household linked relational networks to examine spread of an infectious disease (SARS-CoV-2) through households when children were index patients. This study demonstrates the potential for linkage strategies to study household-level infectious disease epidemiology within large health databases.
  • 7.Smith HC, Petersen I, Schartau P. Association of Recent Fatherhood With Antidepressant Treatment Initiation Among Men in the United Kingdom. JAMA Network Open. 2023;6(5):e2316105–e. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Angier H, Giebultowicz S, Kaufmann J, et al. Creation of a linked cohort of children and their parents in a large, national electronic health record dataset. Medicine (Baltimore). 2021;100(32):e26950. Epub 2021/08/17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Boone-Heinonen J, Lyon-Scott K, Springer R, et al. Pregnancy health in a multi-state U.S. population of systemically underserved patients and their children: PROMISE cohort design and baseline characteristics. BMC public health. 2024;24(1):886. ** Boone-Heinonen 2024: This preliminary analysis of the PROMISE study describes implementation of a multiple-variable linking strategy within the OCHIN database. This strategy enables robust linking of mothers and children to study effects of antenatal weight gain on neonatal outcomes.
  • 10.Liu W, Walsh T. The Impact of Implementation of a Clinically Integrated Problem-Based Neonatal Electronic Health Record on Documentation Metrics, Provider Satisfaction, and Hospital Reimbursement: A Quality Improvement Project. JMIR Med Inform. 2018;6(2):e40. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Rehman Durrani NU, Cowsill C, Krishnappa SB. Improving the Quality and Timeliness of Neonatal Intensive Care Unit Discharge Note: A Quality Improvement Project. Journal of Clinical Neonatology. 2022;11(2). [Google Scholar]
  • 12.Combs CA, Hameed AB, Friedman AM, Hoskins IA. Special statement: Proposed quality metrics to assess accuracy of prenatal detection of congenital heart defects. American Journal of Obstetrics and Gynecology. 2020;222(6):B2–B9. [DOI] [PubMed] [Google Scholar]
  • 13. Gerard J, Stuebe A, Sweeney A, et al. Using Machine Learning to Develop a Predictive Model of Infant Hypoglycemia Based on Maternal and Infant Variables in an Electronic Health Record. Proceedings of the International Symposium on Human Factors and Ergonomics in Health Care. 2023;12(1):94–100. * Gerard 2023: In this study, the authors used machine learning to examine the role of mutliple infant and maternal variables to predict neonatal hypoglycemia, leveraging health record maternal-neonatal network links.
  • 14.Angier H, Garvey B, DeVoe JE. Focus on Families to Improve Child Health During the COVID-19 Pandemic and Beyond. JAMA Health Forum. 2021;2(3). Epub 2021/12/17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.DeVoe JE, Marino M, Angier H, et al. Effect of Expanding Medicaid for Parents on Children’s Health Insurance Coverage: Lessons From the Oregon Experiment. JAMA Pediatrics. 2015;169(1):e143145–e. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Figueroa R Use of antidepressants during pregnancy and risk of attention-deficit/hyperactivity disorder in the offspring. J Dev Behav Pediatr. 2010;31(8):641–8. Epub 2010/07/09. [DOI] [PubMed] [Google Scholar]
  • 17.Whitley R, Davis EA, Suppapanya N. Incidence of neonatal herpes simplex virus infections in a managed-care population. Sex Transm Dis. 2007;34(9):704–8. Epub 2007/04/07. [DOI] [PubMed] [Google Scholar]
  • 18.Chang S, Begier EM, Schech SD, et al. Perinatal hepatitis B transmission and vaccination timing in a managed care cohort: assessment of the temporary delay in newborn hepatitis B vaccination due to thimerosal content. Pediatr Infect Dis J. 2007;26(4):329–33. Epub 2007/04/07. [DOI] [PubMed] [Google Scholar]
  • 19.Brown HK, Taylor C, Camden A, et al. Maternal Disability and Early Child Preventive Care. Pediatrics. 2024. [DOI] [PubMed] [Google Scholar]
  • 20.Hahn PD, Melvin P, Graham DA, Milliren CE. A Methodology to Create Mother-Baby Dyads Using Data From the Pediatric Health Information System. Hospital Pediatrics. 2022;12(10):884–92. [DOI] [PubMed] [Google Scholar]
  • 21.Meserve J, Luo J, Zhu W, et al. Paternal Exposure to Immunosuppressive and/or Biologic Agents and Birth Outcomes in Patients With Immune-Mediated Inflammatory Diseases. Gastroenterology. 2021;161(1):107–15.e3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Varady NH, Chen AF, Niu R, Chung M, Freccero DM, Smith EL. Association Between Surgical Opioid Prescriptions and Opioid Initiation by Opioid-naïve Spouses. Annals of Surgery. 2023;277(6):e1218–e24. [DOI] [PubMed] [Google Scholar]
  • 23.Khalil M, Woldesenbet S, Iyer S, et al. Impact of Spousal Mental Illness on Healthcare Utilization Among Patients With Gastrointestinal Cancer. Journal of Surgical Oncology.n/a(n/a). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Gaitonde S, Moride Y, Suarez E, Lopez-Leon S. Where are the data to assess the safety of paternal drug exposure? A systematic review of secondary databases: A contribution from IMI concePTION. Pharmacoepidemiology and Drug Safety. 2024;33(2):e5764. * Gaitonde 2024: In this study, the authors review literature on paternal medication prescriptions and neonatal outcomes, highlighting several databases from Nordic countries that enable linkage of fathers to newborns at the population level.
  • 25. Chen Y, Shen Q, Lichtenstein P, et al. Incidence Trajectories of Psychiatric Disorders After Assault, Injury, and Bereavement. JAMA Psychiatry. 2024;81(4):374–85. ** Chen 2024: This study demonstrates how linked relational networks can be used to enable quasi-experimental study designs that use sibling controls at the population level. In doing so, this study is able to control for family-level confounding in the incidence of mental health diagnoses following traumatic events.
  • 26.D’Onofrio BM, Lahey BB, Turkheimer E, Lichtenstein P. Critical Need for Family-Based, Quasi-Experimental Designs in Integrating Genetic and Social Science Research. American Journal of Public Health. 2013;103(S1):S46–S55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Angier H, Kaufmann J, Heintzman J, et al. Association of Parent Preventive Care with their Child’s Recommended Well-Child Visits. Acad Pediatr. 2022. Epub 2022/04/05. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Martwick J, Kaufmann J, Bailey S, et al. Impact of Healthcare Location Concordance on Receipt of Preventive Care Among Children Whose Parents have a Substance Use and/or Mental Health Diagnosis. Journal of Primary Care & Community Health. 2024;15:21501319241229925. * Martwick 2024: This paper demonstrates the utility of linked relational networks to study how healthcare delivery location affects well child care adherence among a population of children whose parents have mental health and/or substance use diagnoses.
  • 29.Kaufmann J, DeVoe JE, Angier H, Moreno L, Cahen V, Marino M. Association of parent influenza vaccination and early childhood vaccinations using linked electronic health record data. Vaccine. 2022;40(49):7097–107. Epub 2022/11/22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Weaver J, Hardin JH, Blacketer C, Krumme AA, Jacobson MH, Ryan PB. Development and evaluation of an algorithm to link mothers and infants in two US commercial healthcare claims databases for pharmacoepidemiology research. BMC medical research methodology. 2023;23(1):246. ** Weaver 2023: This paper validates a methodology to link parents and children in administrative healthcare claims databases using IBM® Marketscan® Commercial Database (CCAE) and Optum’s de-identified Clinformatics® Data Mart Database (Clinformatics®) of almost 3.5 million mothers to more than 4.1 million children. This methodology is used in several subsequent papers due its utility for pharmacoepidemiologic research within these large databases.
  • 31.Heerman WJ, Daley MF, Boone-Heinonen J, et al. Maternal antibiotic use during pregnancy and childhood obesity at age 5 years. International Journal of Obesity. 2019;43(6):1202–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Polubriaginof FCG, Vanguri R, Quinnies K, et al. Disease Heritability Inferred from Familial Relationships Reported in Medical Records. Cell. 2018;173(7):1692–704.e11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Krefman AE, Ghamsari F, Turner DR, et al. Using electronic health record data to link families: an illustrative example using intergenerational patterns of obesity. Journal of the American Medical Informatics Association. 2023;30(5):915–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Department of Health and Human Services. 21st Century Cures Act: Interoperability, Information Blocking, and the ONC Health IT Certification Program. Accessed online at: https://wwwfederalregistergov/documents/2020/05/01/2020-07419/21st-century-cures-act-interoperability-information-blocking-and-the-onc-health-it-certification Last accessed October 7, 2024 2020.
  • 35. Lamar CE, Jain D, Schmitz KH. Perils of Auto-Population of Newborn EHR With Maternal Intimate Partner Violence History. Pediatrics. 2023;152(4 ).* Lamar 2023: This case report identifies a potential harm arising from automatically sharing information between linked network members. It proposes potential safeguards against breach of privacy in the setting of linked records and data sharing mandated in the 21st Century Cures Act.

RESOURCES