Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2024 May 31.
Published in final edited form as: Genet Med. 2021 Jul 6;23(10):1830–1832. doi: 10.1038/s41436-021-01179-w

One is the loneliest number: genotypic matchmaking using the electronic health record

Elly Brokamp 1,, Mary E Koziura 1, John A Phillips III 1, Leigh Anne Tang 3, Joy D Cogan 1, Lynette C Rives 1, Amy K Robertson 1, Laura Duncan 1, Anna Bican 1, Josh F Peterson 3, John H Newman 2, Rizwan Hamid 1, Lisa Bastarache 3; Undiagnosed Diseases Network*
PMCID: PMC11140587  NIHMSID: NIHMS1993043  PMID: 34230636

When genetic testing detects a candidate variant (CV) in a gene that is not known to cause disease in humans, clinicians face a dilemma. Where can they find the additional evidence needed to make a diagnosis? One action a clinician can take is to search for additional patients with a deleterious variant in the same gene. Here we present a case study where the electronic health record (EHR) was used to facilitate genotypic matchmaking to help diagnose a patient at Vanderbilt’s Undiagnosed Disease Network (UDN)1 clinical site.

A 26-year-old female with unexplained mild intellectual disability (ID), autism spectrum disorder (ASD), obsessive compulsive disorder (OCD), high myopia, and joint hypermobility had exome sequencing (ES) as part of her work-up for the UDN. Interestingly, a de novo MSL2 p.S232Tfs*10 variant was included in the ES report along with its possible association with ASD.2

The MSL2 protein had been reported to be part of a complex that plays a major role in chromatin regulation and structure through histone H4 acetylation and H2B ubiquitylation.3,4 Due to the participant’s CV being a frameshift, de novo, and the function of the MSL2 protein, the Vanderbilt University Medical Center (VUMC) UDN team considered it an attractive candidate, but the lack of a previously reported role in Mendelian disease precluded it from being diagnostic.

The UDN’s process to find additional cases is to upload CVs to PhenomeCentral, a founding member of Matchmaker Exchange.5 A widely used, similar strategy is for the provider to post the candidate gene in GeneMatcher, MyGene2, and/or Matchmaker Exchange.68 These online connection tools rely on other providers using these platforms, require some length of waiting time, and need communication between providers to share additional information to possibly solidify a match. To date, the VUMC team has not had any MSL2 matches through PhenomeCentral.

Having failed to find a match through available matchmaking platforms, the Vanderbilt UDN team searched for “MSL2” in VUMC’s Synthetic Derivative (SD), a continuously updated, de-identified database of EHRs for more than 3 million patients.9 This revealed two patients whose charts were subsequently reviewed using SD Discover, a browser-based application developed at Vanderbilt to facilitate chart review. Both patients were found to have de novo MSL2 variants identified through clinical trio ES and had strikingly similar phenotypes to the proband.

The first patient identified through SD was a 15-year-old male with a history of ASD, mild ID, obsessive–compulsive disorder (OCD), hypermobile Ehlers–Danlos syndrome (EDS), and high myopia. He was heterozygous for a de novo p.Pro25Ser MSL2 CV, a well conserved variant that is predicted to be damaging based on in silico models. The second patient was a 13-year-old female with a history of global developmental delay, ASD, attention deficit disorder (ADD), visual and language processing disorder, OCD, high myopia, and hypermobile EDS. She was heterozygous for a de novo p.Thr217AspfsX2 MSL2 CV, a variant that created a premature stop codon and was predicted to cause loss of normal protein function. The variants found in both patients were absent from large population cohorts.

In both cases, clinical genetics providers were suspicious of the MSL2 CV as a cause of their patients’ ASD, behavioral problems, and possibly their hypermobility. They documented that there was not enough evidence to consider the MSL2 CV to be diagnostic.

All three individuals had a de novo, likely gene disrupting, MSL2 CV. Since all three individuals also had similar phenotypes (ASD, OCD, mild ID or learning disabilities, and an EDS type III or joint hypermobility), this suggests its causative role in a new disorder. These additional patients gave the team the confidence to initiate contact with clinical laboratories to find more individuals with MSL2 as a candidate gene and connect with a team doing functional studies. Finding these two additional individuals was an integral part of giving the UDN participant a molecular diagnosis for a new disease.

The American College of Medical Genetics and Genomics (ACMG) recently published a statement encouraging providers to optimize their use of genomic testing results in the EHR.10 Based on the experience with MSL2, VUMC’s team proposes another use for genomic testing results in the EHR: a full text search for a gene name to quickly ascertain additional individuals with shared phenotype(s) and candidate gene(s). We have termed this strategy “EHR-based genotype matchmaking” to indicate the process of finding a genotype match that has been clinically tested and might provide evidence for causality.

EHR-based genotype matchmaking offers several benefits beyond what is realized by current methods. EHRs contain dense and longitudinal medical histories that are not always available to clinical laboratories; also, EHRs contain a large and growing population of patients who have received genetic testing compared with patient cohorts whose data have been uploaded to matchmaking platforms. Motivated by the successful resolution of the case above, the Vanderbilt UDN team has added EHR-based genotype matchmaking to its variant analysis pipeline and is in the process of reanalyzing past cases.

The VUMC UDN team recognizes that this powerful tool has potential barriers that complicate implementation. Like many medical centers, VUMC does not currently store genetic data in an easily accessible format. Genetic testing reports are mainly uploaded to the EHR as PDF files that are currently not searchable in the text-based SD. Our geneticists and genetic counselors record genetic results as text in the EHR, but this is a timeconsuming process that is subject to typos and errors, and not all VUMC providers record genetic test results in this way. Fortunately, many genes have fairly distinctive names—the search for “MSL2” easily found the two records above. However, searching for gene names that are short (e.g., TR), overlap with common terms, or that are frequently reported as somatic variants (e.g., EGFR) can yield thousands of irrelevant records.

Major EHR vendors are beginning to develop and market ancillary systems and modules to store structured genetic test results. If adopted, these new features, along with standard reporting of results from laboratories, will greatly simplify the process of identifying patients with matching genotypes. In the meantime, the UDN team plans to continue to index medical records based on a string-matching approach for new cases, and to reanalyze previous undiagnosed cases. The database that stores these string matches has become an invaluable resource for rapidly assessing new UDN cases, and the growth illustrates the rapid growth over the last decade of genetic data in the EHR.

More work is needed to streamline the dissemination of knowledge gained from EHR-based patient matching. The Vanderbilt UDN team is working on an institutional review board (IRB) application that would allow them to inform the ordering provider of the new connections regarding their patients’ genetic testing results. While the research team is not the clinical laboratory who performed the testing, there is still an ethical duty to reinterpret unsolved exomes and inform the ordering provider/patients of new conclusions.11,12 This IRB permission would allow the research team to also ask clarifying phenotype questions that are unanswered in the EHR. Ideally, VUMC’s search process, which is done on a research basis, could potentially be done as part of routine clinical care in the future.

Further work may be directed toward more automated phenotyping methods that use the EHR. The combination of structured genetic data and automated phenotyping methods may facilitate more automated approaches to scaling EHR-based matchmaking, similar to approaches used by matchmaking platforms.

The problem with candidate genes is not restricted to a research setting. Clinical ES, which is increasing in popularity, often reports CVs to clinicians. Although it may not be standard for all clinical laboratories to report variants in candidate genes, two large clinical laboratories have reported that ~8% of their diagnostic ES testing reports a candidate gene.13,14 One laboratory shared that a candidate gene was included on a total of 24% of their reports.14 Linking candidate genes found in these tests with the medical phenome described in the EHR represents an opportunity to improve interpretation of exomes, which will in turn improve patient care, increase the utility of clinical genetic testing, and further the discovery of new Mendelian diseases. The institutional and technological changes necessary to enable this function in other institutions will take extensive effort and funding, but we hope that this report creates additional incentive for change.

ACKNOWLEDGEMENTS

This research was supported by the National Institutes of Health (NIH) Common Fund, through the Office of Strategic Coordination/Office of the NIH Director under award number U01HG007672. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

UNDIAGNOSED DISEASES NETWORK

Mercedes E. Alejandro4, Mahshid S. Azamian4, Carlos A. Bacino4, Ashok Balasubramanyam4, Bret L. Bostwick4, Lindsay C. Burrage4, Hsiao-Tuan Chao4, Gary D. Clark4, William J. Craigen4, Shweta U. Dhar4, Lisa T. Emrick4, Alica M. Goldman4, Neil A. Hanchard4, Fariha Jamal4, Lefkothea Karaviti4, Seema R. Lalani4, Brendan H. Lee4, Richard A. Lewis4, Ronit Marom4, Paolo M. Moretti4, David R. Murdock4, Sarah K. Nicholas4, James P. Orengo4, Jennifer E. Posey4, Lorraine Potocki4, Jill A. Rosenfeld4, Susan L. Samson4, Daryl A. Scott4, Alyssa A. Tran4, Tiphanie P. Vogel4, Hugo J. Bellen5, Michael F. Wangler5, Shinya Yamamoto5, Christine M. Eng6, Pengfei Liu6, Patricia A. Ward6, David B. Goldstein7, Nicholas Stong7, Heidi Cope8, Yong-hui Jiang8, Allyn McConkie-Rosell8, Kelly Schoch8, Vandana Shashi8, Rebecca C. Spillmann8, Jennifer A. Sullivan8, Queenie K.-G. Tan8, Nicole M. Walley8, Pankaj Agrawal9, Alan H. Beggs9, Gerard T. Berry9, Lauren C. Briere9, Cynthia M. Cooper9, Laurel A. Cobban9, Elizabeth L. Fieg9, Frances High9, Ingrid A. Holm9, Susan Korrick9, Joel B. Krier9, Sharyn A. Lincoln9, Joseph Loscalzo9, Richard L. Maas9, Calum A. MacRae9, J. Carl Pallais9, Lance H. Rodan9, Edwin K. Silverman9, Joan M. Stoler9, David A. Sweetser9, Melissa Walker9, Chris A. Walsh9, Cecilia Esteves10, Emily G. Kelley10, Isaac S. Kohane10, Kimberly LeBlanc10, Alexa T. McCray10, Matthew Might10, David P. Bick11, Camille L. Birch11, Braden E. Boone11, Donna M. Brown11, Daniel C. Dorset11, Angela L. Jones11, Jozef Lazar11, Shawn E. Levy11, Thomas May11, J. Scott Newberry11, Elizabeth A. Worthey11, Ian R. Lanza12, Devin Oglesbee12, Eva Morava-Kozicz12, Brendan C. Lanpher12, Surendra Dasari12, Guney Bademci13, Deborah Barbouth13, Stephanie Bivona13, Olveen Carrasquillo13, Ta Chen Peter Chang13, Irman Forghani13, Alana Grajewski13, Rosario Isasi13, Byron Lam13, Roy Levitt13, Xue Zhong Liu13, Jacob McCauley13, Ralph Sacco13, Mario Saporta13, Judy Schaechter13, Mustafa Tekin13, Fred Telischi13, Willa Thorson13, Stephan Zuchner13, Gabriel F. Batzli14, Heather A. Colley14, Jyoti G. Dayal14, David J. Eckstein14, Donna M. Krasnewich14, Laura A. Mamounas14, Teri A. Manolio14, John J. Mulvihill14, Robb K. Rowley14, Catherine H. Sillari14, Cecelia P. Tamburro14, Tiina K. Urv14, Anastasia L. Wise14, Maria T. Acosta15, Carsten Bonnenmann15, Elizabeth A. Burke15, Precilla D’Souza15, Mariska Davids15, David D. Draper15, Tyra Estwick15, Carlos Ferreira15, Rena A. Godfrey15, Catherine A. Groden15, Jean M. Johnston15, C. Christopher Lau15, Ellen F. Macnamara15, Valerie V. Maduro15, Thomas C. Markello15, Marie Morimoto15, Avi Nath15, Donna Novacic15, Barbara N. Pusey15, Prashant Sharma15, Camilo Toro15, Colleen E. Wahl15, Guoyun Yu15, Andrea L. Gropman16, Eva Baker17, David R. Adams18, William A. Gahl18, May Christine V. Malicdan18, Cynthia J. Tifft18, Lynne A. Wolfe18, John Yang18, Matthew Brush19, Jean-Philippe F. Gourdine19, Melissa Haendel19, David M. Koeller19, Jennifer E. Kyle20, Thomas O. Metz20, Katrina M. Waters20, Bobbie-Jo M. Webb-Robertson20, Euan A. Ashley21, Jonathan A. Bernstein21, Devon Bonner21, Terra R. Coakley21, Liliana Fernandez21, Paul G. Fisher21, Jason Hom21, Yong Huang21, Jennefer N. Kohler21, Marta M. Majcherska21, Shruti Marwaha21, Colleen E. McCormack21, Chloe M. Reuter21, Jacinda B. Sampson21, Kevin S. Smith21, Matthew T. Wheeler21, Diane B. Zastrow21, Chunli Zhao21, Maura Ruzhnikov21, Laure Fresard21, Holly K. Tabor21, Archana N. Raja21, Gill Bejerano21, Beth A. Martin21, Shirley Sutton21, Patrick Allard22, Manish J. Butte22, Esteban C. Dell’Angelica22, Naghmeh Dorrani22, Emilie D. Douine22, Brent L. Fogel22, Alden Huang22, Deborah Krakow22, Hane Lee22, Sandra K. Loo22, Martin G. Martin22, Julian A. Martínez-Agosto22, Stanley F. Nelson22, Christina G. S. Palmer22, Jeanette C. Papp22, Neil H. Parker22, Genecee Renteria22, Rebecca H. Signer22, Janet S. Sinsheimer22, Jijun Wan22, Lee-kai Wang22, Jeremy D. Woods22, Amanda J. Yoon22, Lorenzo Botto23, Ashley Andrews23, Nicola Longo23, John Carey23, John Bohnsack23, Dave Viskochil23, Moretti Paolo23, Pace Laura23, Justin Alvey23, Jim Bale23, Gabor Marth23, Matt Velinder23, Aaron Quinlan23, Rong Mao24, Pinar Bayrak-Toydemir24, John H. Postlethwait25, Monte Westerfield25, Lisa Shakachite26, F. Sessions Cole27, Nichole Hayes27, Dana Kiley27, Kathleen Shields27, Kathy Sisco27, Jennifer Wambach27, Daniel Wegner27, Dustin Baldridge27, Timothy Schedl27, Lilianna Solnica-Krezel27, Stephen Pak27 and Jimann Shin27

4BCM Clinical, Houston, TX, USA. 5BCM MOSC, Houston, TX, USA. 6BCM Sequencing, Houston, TX, USA. 7Columbia, New York City, NY, USA. 8Duke Nashville, Durham, NC, USA. 9Harvard, Cambridge, MA, USA. 10Harvard CC, Cambridge, MA, USA. 11HudsonAlpha, Huntsville, AL, USA. 12Mayo Clinic, Rochester, MN, USA. 13University of Miami, Miami, FL, USA. 14NIH, Bethesda, MD, USA. 15NIH UDP, Bethesda, MD, USA. 16NIH UDP, CNMC, Bethesda, MD, USA. 17NIH UDP, DRM, Bethesda, MD, USA. 18NIH UDP, NHGRI, Bethesda, MD, USA. 19OHSU Metabolomics, Portland, OR, USA. 20PNNL Metabolomics, Richland, WA, USA. 21Stanford University, Stanford, CA, USA. 22UCLA, Los Angeles, CA, USA. 23University of Utah, Salt Lake City, UT, USA. 24University of Utah/ARUP, Salt Lake City, UT, USA. 25UO MOSC, Eugene, OR, USA. 26Vanderbilt University Medical Center, Nashville, TN, USA. 27Washington University Clinical, St. Louis, MO, USA.

Footnotes

COMPETING INTERESTS

The authors declare no competing interests.

Reprints and permission information is available at http://www.nature.com/reprints

REFERENCES

  • 1.Gahl W, Wise A & Ashley E The Undiagnosed Diseases Network of the National Institutes of Health: a national extension. JAMA. 314, 1797–1798 (2015). [DOI] [PubMed] [Google Scholar]
  • 2.Iossifov I et al. The contribution of de novo coding mutations to autism spectrum disorder. Nature. 515, 216–221 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Smith ER et al. A human protein complex homologous to the Drosophila MSL complex is responsible for the majority of histone H4 acetylation at lysine 16. Mol. Cell. Biol. 25, 9175–9188 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Wu L, Zee BM, Wang Y, Garcia BA & Dou Y The RING finger protein MSL2 in the MOF complex is an E3 ubiquitin ligase for H2B K34 and is involved in crosstalk with H3 K4 and K79 methylation. Mol. Cell. 43, 132–144 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Buske OK et al. PhenomeCentral: a portal for phenotypic and genotypic matchmaking of patients with rare genetic diseases. Hum. Mutat. 36, 931–940 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Philippakis AA et al. The Matchmaker Exchange: a platform for rare disease gene discovery. Hum. Mutat. 36, 915–921 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Sobreira N, Schiettecatte F, Valle D & Hamosh A GeneMatcher: a matching tool for connecting investigators with an interest in the same gene. Hum. Mutat. 36, 928–930 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.NHGRI/NHLBI University of Washington-Center for Mendelian Genomics (UW-CMG). MyGene2. http://www.mygene2.org/ (2020).
  • 9.Roden DM et al. Development of a large-scale de-identified DNA biobank to Enable Personalized Medicine. Clin Pharmacol Ther 84, 362–369 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Grebe T et al. The interface of genomic information with electronic health record: a points to consider statement of the American College of Medical Genetics and Genomics (ACMG). Genet. Med. 22, 1431–1436. 10.1038/s41436-020-0841-2 (2020). [DOI] [PubMed] [Google Scholar]
  • 11.Appelbaum PS et al. Is there a duty to reinterpret genetic data? The ethical dimensions. Genet. Med. 22, 633–639 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.David KL et al. Patient re-contact after revision of genomic test results: points to consider—a statement of the American College of Medical Genetics and Genomics (ACMG). Genet. Med. 21, 769–771 (2019). [DOI] [PubMed] [Google Scholar]
  • 13.Farwell-Hagman K et al. Candidate-gene criteria for clinical reporting: diagnostic exome sequencing identifies altered candidate genes among 8% of patients with undiagnosed diseases. Genet. Med. 19, 224–235 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Retterer M et al. Clinical application of whole-exome sequencing across clinical indications. Genet. Med. 18, 696–704 (2015). [DOI] [PubMed] [Google Scholar]

RESOURCES