Next-generation phenotyping integrated in a national framework for patients with ultrarare disorders improves genetic diagnostics and yields new molecular findings

Axel Schmidt; Magdalena Danyel; Kathrin Grundmann; Theresa Brunet; Hannah Klinkhammer; Tzung-Chien Hsieh; Hartmut Engels; Sophia Peters; Alexej Knaus; Shahida Moosa; Luisa Averdunk; Felix Boschann; Henrike Lisa Sczakiel; Sarina Schwartzmann; Martin Atta Mensah; Jean Tori Pantel; Manuel Holtgrewe; Annemarie Bösch; Claudia Weiß; Natalie Weinhold; Aude-Annick Suter; Corinna Stoltenburg; Julia Neugebauer; Tillmann Kallinich; Angela M Kaindl; Susanne Holzhauer; Christoph Bührer; Philip Bufler; Uwe Kornak; Claus-Eric Ott; Markus Schülke; Hoa Huu Phuc Nguyen; Sabine Hoffjan; Corinna Grasemann; Tobias Rothoeft; Folke Brinkmann; Nora Matar; Sugirthan Sivalingam; Claudia Perne; Elisabeth Mangold; Martina Kreiss; Kirsten Cremer; Regina C Betz; Martin Mücke; Lorenz Grigull; Thomas Klockgether; Isabel Spier; André Heimbach; Tim Bender; Fabian Brand; Christiane Stieber; Alexandra Marzena Morawiec; Pantelis Karakostas; Valentin S Schäfer; Sarah Bernsen; Patrick Weydt; Sergio Castro-Gomez; Ahmad Aziz; Marcus Grobe-Einsler; Okka Kimmich; Xenia Kobeleva; Demet Önder; Hellen Lesmann; Sheetal Kumar; Pawel Tacik; Meghna Ahuja Basin; Pietro Incardona; Min Ae Lee-Kirsch; Reinhard Berner; Catharina Schuetz; Julia Körholz; Tanita Kretschmer; Nataliya Di Donato; Evelin Schröck; André Heinen; Ulrike Reuner; Amalia-Mihaela Hanßke; Frank J Kaiser; Eva Manka; Martin Munteanu; Alma Kuechler; Kiewert Cordula; Raphael Hirtz; Elena Schlapakow; Christian Schlein; Jasmin Lisfeld; Christian Kubisch; Theresia Herget; Maja Hempel; Christina Weiler-Normann; Kurt Ullrich; Christoph Schramm; Cornelia Rudolph; Franziska Rillig; Maximilian Groffmann; Ania Muntau; Alexandra Tibelius; Eva M C Schwaibold; Christian P Schaaf; Michal Zawada

doi:10.1038/s41588-024-01836-1

. 2024 Jul 22;56(8):1644–1653. doi: 10.1038/s41588-024-01836-1

Next-generation phenotyping integrated in a national framework for patients with ultrarare disorders improves genetic diagnostics and yields new molecular findings

Axel Schmidt ^1,^#, Magdalena Danyel ^2,^3,^#, Kathrin Grundmann ^4,^#, Theresa Brunet ^5,^#, Hannah Klinkhammer ^6,⁷, Tzung-Chien Hsieh ⁶, Hartmut Engels ¹, Sophia Peters ¹, Alexej Knaus ⁶, Shahida Moosa ⁸, Luisa Averdunk ⁹, Felix Boschann ^2,³, Henrike Lisa Sczakiel ^2,³, Sarina Schwartzmann ², Martin Atta Mensah ^2,³, Jean Tori Pantel ^2,¹⁰, Manuel Holtgrewe ¹¹, Annemarie Bösch ¹², Claudia Weiß ¹², Natalie Weinhold ¹², Aude-Annick Suter ¹², Corinna Stoltenburg ¹², Julia Neugebauer ¹², Tillmann Kallinich ¹², Angela M Kaindl ^13,^14,¹⁵, Susanne Holzhauer ¹², Christoph Bührer ¹², Philip Bufler ¹², Uwe Kornak ², Claus-Eric Ott ², Markus Schülke ², Hoa Huu Phuc Nguyen ¹⁶, Sabine Hoffjan ¹⁶, Corinna Grasemann ¹⁷, Tobias Rothoeft ¹⁷, Folke Brinkmann ¹⁷, Nora Matar ¹⁷, Sugirthan Sivalingam ¹, Claudia Perne ¹, Elisabeth Mangold ¹, Martina Kreiss ¹, Kirsten Cremer ¹, Regina C Betz ¹, Martin Mücke ¹⁸, Lorenz Grigull ¹⁸, Thomas Klockgether ¹⁹, Isabel Spier ¹, André Heimbach ¹, Tim Bender ¹⁸, Fabian Brand ⁶, Christiane Stieber ¹⁸, Alexandra Marzena Morawiec ¹⁸, Pantelis Karakostas ²⁰, Valentin S Schäfer ²⁰, Sarah Bernsen ¹⁸, Patrick Weydt ¹⁹, Sergio Castro-Gomez ¹⁹, Ahmad Aziz ¹⁹, Marcus Grobe-Einsler ¹⁹, Okka Kimmich ¹⁹, Xenia Kobeleva ¹⁹, Demet Önder ¹⁹, Hellen Lesmann ¹, Sheetal Kumar ¹, Pawel Tacik ¹⁹, Meghna Ahuja Basin ⁶, Pietro Incardona ⁶, Min Ae Lee-Kirsch ^21,²², Reinhard Berner ^21,²², Catharina Schuetz ^21,²², Julia Körholz ^21,²², Tanita Kretschmer ^21,²², Nataliya Di Donato ^21,²³, Evelin Schröck ^21,²³, André Heinen ^21,²², Ulrike Reuner ^21,²⁴, Amalia-Mihaela Hanßke ²¹, Frank J Kaiser ²⁵, Eva Manka ²⁶, Martin Munteanu ²⁵, Alma Kuechler ²⁵, Kiewert Cordula ²⁶, Raphael Hirtz ²⁶, Elena Schlapakow ²⁷, Christian Schlein ²⁸, Jasmin Lisfeld ²⁸, Christian Kubisch ^28,²⁹, Theresia Herget ²⁸, Maja Hempel ^28,^29,³⁰, Christina Weiler-Normann ^29,³¹, Kurt Ullrich ²⁹, Christoph Schramm ^29,³¹, Cornelia Rudolph ²⁹, Franziska Rillig ²⁹, Maximilian Groffmann ²⁹, Ania Muntau ³², Alexandra Tibelius ³⁰, Eva M C Schwaibold ³⁰, Christian P Schaaf ³⁰, Michal Zawada ³⁰, Lilian Kaufmann ³⁰, Katrin Hinderhofer ³⁰, Pamela M Okun ³³, Urania Kotzaeridou ³³, Georg F Hoffmann ³³, Daniela Choukair ³³, Markus Bettendorf ³³, Malte Spielmann ³⁴, Annekatrin Ripke ³⁵, Martje Pauly ^36,³⁷, Alexander Münchau ^35,³⁸, Katja Lohmann ³⁹, Irina Hüning ³⁴, Britta Hanker ⁴⁰, Tobias Bäumer ^35,³⁸, Rebecca Herzog ^35,³⁶, Yorck Hellenbroich ⁴¹, Dominik S Westphal ⁵, Tim Strom ⁵, Reka Kovacs ⁵, Korbinian M Riedhammer ^5,⁴², Katharina Mayerhanser ⁵, Elisabeth Graf ⁵, Melanie Brugger ⁵, Julia Hoefele ⁵, Konrad Oexle ⁴³, Nazanin Mirza-Schreiber ⁴³, Riccardo Berutti ⁴³, Ulrich Schatz ⁵, Martin Krenn ^5,⁴⁴, Christine Makowski ⁴⁵, Heike Weigand ⁴⁶, Sebastian Schröder ⁴⁶, Meino Rohlfs ⁴⁶, Katharina Vill ⁴⁶, Fabian Hauck ⁴⁶, Ingo Borggraefe ⁴⁶, Wolfgang Müller-Felber ⁴⁶, Ingo Kurth ¹⁰, Miriam Elbracht ¹⁰, Cordula Knopp ¹⁰, Matthias Begemann ¹⁰, Florian Kraft ¹⁰, Johannes R Lemke ^47,⁴⁸, Julia Hentschel ⁴⁷, Konrad Platzer ⁴⁷, Vincent Strehlow ⁴⁷, Rami Abou Jamra ⁴⁷, Martin Kehrer ⁴, German Demidov ⁴, Stefanie Beck-Wödl ⁴, Holm Graessner ⁴⁹, Marc Sturm ⁴, Lena Zeltner ⁴⁹, Ludger J Schöls ⁵⁰, Janine Magg ⁴⁹, Andrea Bevot ⁵¹, Christiane Kehrer ⁵¹, Nadja Kaiser ⁵¹, Ernest Turro ⁵², Denise Horn ², Annette Grüters-Kieslich ⁵³, Christoph Klein ⁴⁶, Stefan Mundlos ², Markus Nöthen ¹, Olaf Riess ⁴, Thomas Meitinger ⁵, Heiko Krude ⁵³, Peter M Krawitz ^6,^✉, Tobias Haack ⁴, Nadja Ehmke ^2,³, Matias Wagner ^5,^43,⁴⁶

¹Institute of Human Genetics, University of Bonn, Medical Faculty and University Hospital Bonn, Bonn, Germany

²Institute for Medical Genetics and Human Genetics, Charité – Universitätsmedizin Berlin, Berlin, Germany

³BIH Charité Clinician Scientist Program, Berlin Institute of Health at Charité – Universitätsmedizin Berlin, Berlin, Germany

⁴Institute for Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany

⁵Institute of Human Genetics, Klinikum rechts der Isar, School of Medicine, Technical University of Munich, München, Germany

⁶Institute for Genomic Statistics and Bioinformatics, University of Bonn, Medical Faculty and University Hospital Bonn, Bonn, Germany

⁷Institut für Medizinische Biometrie, Informatik und Epidemiologie, University of Bonn, Medical Faculty and University Hospital Bonn, Bonn, Germany

⁸Institute for Medical Genetics, Stellenbosch University, Cape Town, South Africa

⁹Department of Pediatrics, University Hospital Düsseldorf, Düsseldorf, Germany

¹⁰Institute for Human Genetics and Genomic Medicine, Medical Faculty, Uniklinik RWTH Aachen University, Aachen, Germany

¹¹Core Uni Bioinformatics, Berlin Institute of Health at Charité – Universitätsmedizin Berlin, Berlin, Germany

¹²Department of Pediatrics, Charité – Universitätsmedizin Berlin, Berlin, Germany

¹³Department of Pediatric Neurology, Charité – Universitätsmedizin Berlin, Berlin, Germany

¹⁴Center for Chronically Sick Children, Charité – Universitätsmedizin Berlin, Berlin, Germany

¹⁵Institute of Cell and Neurobiology, Charité – Universitätsmedizin Berlin, Berlin, Germany

¹⁶Department of Human Genetics, Ruhr University Bochum, Bochum, Germany

¹⁷Department of Pediatrics Bochum and CeSER, Ruhr University Bochum, Bochum, Germany

¹⁸Center for Rare Diseases, University of Bonn, Medical Faculty and University Hospital Bonn, Bonn, Germany

¹⁹Department of Neurology, University of Bonn, Medical Faculty and University Hospital Bonn, Bonn, Germany

²⁰Clinic for Internal Medicine III, University of Bonn, Medical Faculty and University Hospital Bonn, Bonn, Germany

²¹University Center for Rare Diseases, University Hospital Carl Gustav Carus, Dresden, Germany

²²Department of Pediatrics, University Hospital Carl Gustav Carus, Dresden, Germany

²³Institute for Clinical Genetics, University Hospital Carl Gustav Carus, Dresden, Germany

²⁴Department of Neurology, University Hospital Carl Gustav Carus, Dresden, Germany

²⁵Institute of Human Genetics, University Hospital Essen, Essen, Germany

²⁶Department of Pediatrics II, University Hospital Essen, Essen, Germany

²⁷Department of Neurology, University Hospital Halle, Halle, Germany

²⁸Institute of Human Genetics, University Hospital Hamburg-Eppendorf, Hamburg, Germany

²⁹Martin Zeitz Center for Rare Diseases, University Hospital Hamburg-Eppendorf, Hamburg, Germany

³⁰Institute of Human Genetics, Heidelberg University, Heidelberg, Germany

³¹I. Department of Medicine, University Hospital Hamburg-Eppendorf, Hamburg, Germany

³²Department of Pediatrics, University Hospital Hamburg-Eppendorf, Hamburg, Germany

³³Center for Child and Adolescent Medicine, University Hospital Heidelberg, Heidelberg, Germany

³⁴Institute of Human Genetics, University Hospital Schleswig-Holstein, Lübeck, Germany

³⁵Center for Rare Diseases, University Hospital Schleswig-Holstein, Lübeck, Germany

³⁶Department of Neurology, University Hospital Schleswig-Holstein, Lübeck, Germany

³⁷Institute for Neurogenetics, University Hospital Schleswig-Holstein, Lübeck, Germany

³⁸Institute of Systems Motor Science, University of Lübeck, Lübeck, Germany

³⁹Institute of Neurogenetics, University of Lübeck, Lübeck, Germany

⁴⁰Institute of Human Genetics, University of Lübeck, Lübeck, Germany

⁴¹Department of Human Genetics, University Hospital Schleswig-Holstein, Lübeck, Germany

⁴²Department of Nephrology, Klinikum rechts der Isar, School of Medicine, Technical University of Munich, München, Germany

⁴³Institute of Neurogenomics, Helmholtz Zentrum München, München, Germany

⁴⁴Department of Neurology, Medical University of Vienna, Wien, Austria

⁴⁵Department of Paediatrics, Adolescent Medicine and Neonatology, München, Germany

⁴⁶Dr. von Hauner Children’s Hospital, University Hospital Munich, München, Germany

⁴⁷Institute of Human Genetics, University of Leipzig Medical Center, Leipzig, Germany

⁴⁸Center for Rare Diseases, University of Leipzig Medical Center, Leipzig, Germany

⁴⁹Center for Rare Diseases, University of Tübingen, Tübingen, Germany

⁵⁰Department of Neurology, University of Tübingen, Tübingen, Germany

⁵¹Department of Pediatric Neurology and Developmental Medicine, University of Tübingen, Tübingen, Germany

⁵²Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY USA

⁵³Berlin Centre for Rare Diseases, Charité – Universitätsmedizin Berlin, Berlin, Germany

^✉

Corresponding author.

Contributed equally.

PMCID: PMC11319204 PMID: 39039281

Abstract

Individuals with ultrarare disorders pose a structural challenge for healthcare systems since expert clinical knowledge is required to establish diagnoses. In TRANSLATE NAMSE, a 3-year prospective study, we evaluated a novel diagnostic concept based on multidisciplinary expertise in Germany. Here we present the systematic investigation of the phenotypic and molecular genetic data of 1,577 patients who had undergone exome sequencing and were partially analyzed with next-generation phenotyping approaches. Molecular genetic diagnoses were established in 32% of the patients totaling 370 distinct molecular genetic causes, most with prevalence below 1:50,000. During the diagnostic process, 34 novel and 23 candidate genotype–phenotype associations were identified, mainly in individuals with neurodevelopmental disorders. Sequencing data of the subcohort that consented to computer-assisted analysis of their facial images with GestaltMatcher could be prioritized more efficiently compared with approaches based solely on clinical features and molecular scores. Our study demonstrates the synergy of using next-generation sequencing and phenotyping for diagnosing ultrarare diseases in routine healthcare and discovering novel etiologies by multidisciplinary teams.

Subject terms: Genetics research, Genetic testing

Exome sequencing within a structured diagnostic process for rare diseases in Germany shows how facial image analysis and machine learning can guide variant prioritization and uncover many ultrarare diseases.

Main

A recent analysis of the Orphanet database showed that around 3–6% of the global population have a rare disease (that is, a disease with a prevalence of <1 in 2,000) and that 72% of such cases may have a genetic cause¹. Rare diseases thus represent a substantial global health burden. However, only a minority of patients suspected to have a rare disease receive both a definite clinical diagnosis and a confirmatory molecular test result^2,3. This concerns in particular the subset of patients with ultrarare disorders that are defined in the European Union as affecting no more than one person in 50,000 and that follow a long tail distribution with respect to their frequency (Regulation (EU) No. 536/2014). It is estimated that roughly 80% of the more than 5,000 rare genetic diseases have a prevalence below one in a million¹.

The International Rare Disease Research Consortium therefore stated that, by 2027, all patients who come to medical attention with a suspected rare or ultrarare disease should be diagnosed within 1 year if the respective disorder has been described in the medical literature⁴. Since many rare diseases are Mendelian in nature, comprehensive genetic testing is a key element to achieve that goal.

In Germany, around 90% of the population has statutory health insurance, and the current reimbursement scheme allows physicians to request chromosome analyses, molecular karyotyping and sequencing of single genes or gene panels. For example, high-resolution genome-wide array-based segmental aneusomy profiling detects a pathogenic aberration in around 19% of patients with developmental delay⁵. Besides contiguous gene syndromes, most of the remaining rare disorders are monogenic and are caused by single nucleotide variants or small insertions or deletions (indels). However, single gene analyses or small gene panels are only likely to detect a pathogenic aberration if the phenotype is highly predictive of the molecular cause, for example, hemoglobinopathies⁶.

For phenotypes with high genetic heterogeneity, such as neurodevelopmental disorders, genetic investigation is more challenging. For intellectual disability, for example, studies so far have identified disease associations for more than a thousand genes⁷. For these disorders, research has shown that exome sequencing can be more cost-effective than sequencing potentially multiple gene panels⁸. However, this is also accompanied by more genetic variants that have to be assessed. Therefore, a clear indication for exome sequencing and efficient data analysis strategies are crucial. Between 2018 and 2020, a novel diagnostic concept within the German healthcare system was evaluated in the prospective study TRANSLATE NAMSE⁹.

This involved standardized structures and procedures and multidisciplinary teams (MDTs) at ten university hospital-based centers for rare diseases (CRDs). The MDTs conducted a three-step diagnostic process: (1) primary review of patient records; (2) selection of diagnostic procedures, including a possible recommendation for exome sequencing; and (3) evaluation of all findings, including genetic variants. A key goal was to investigate whether exome sequencing would facilitate the diagnosis of ultrarare disorders or even the delineation of novel monogenic disorders. In this work, we report the molecular findings of this study.

Furthermore, we investigated how phenotypic features can be used to estimate the probability that a molecular diagnosis can be established with exome sequencing (YieldPred). In a companion study, we also assessed the extent to which the results from computer-assisted pattern recognition in facial dysmorphism contribute to variant interpretation (prioritization of exome data by image analysis, PEDIA). The present analyses demonstrated that exome sequencing facilitated the diagnosis of ultrarare genetic diseases and novel gene–disease associations and that artificial intelligence (AI)-driven technologies improved the diagnostic yield for ultrarare genetic disorders.

Results

Phenotypic characteristics of the study cohort

Between 2018 and 2020, a total of 5,652 individuals (2,033 adults and 3,619 children) with a suspected rare disorder were enrolled in TRANSLATE NAMSE by CRDs at ten German university hospitals (Fig. 1a)⁹. The present analyses were performed using the data from a total of 1,577 of these 5,652 patients (268 adults, 1,309 children). In these individuals, the MDT at the respective CRD considered a genetic cause as plausible and exome sequencing as the most suitable test (exome sequencing cohort, Supplementary Table 1). Each of these 1,577 individuals was assigned to one of six major disease categories by the respective CRD physician (Fig. 1b). The majority of children were assigned to the disease category ‘neurodevelopmental disorders’ (n = 702, 54%), and the largest proportion of adults were assigned to the disease category ‘neurological or neuromuscular disorders’ (n = 117, 44%). Smaller proportions of adult and pediatric cases were assigned to the groups ‘organ malformation’, ‘endocrine/metabolic disorders’, ‘immune/hematologic disorders’ and ‘cardiovascular disorders’. Patient phenotypes were also annotated with terms of the Human Phenotype Ontology (HPO) by the respective CRD physicians. On average, five HPO terms were specified per individual (Supplementary Fig. 1a). The phenotypes within the present cohort were visualized by projecting the patient-specific HPO terms into a two-dimensional space. While most patients from the same disease group were in close proximity, the clusters showed a partial overlap (Fig. 1c). For example, many patients categorized within ‘neurological or neuromuscular disorders’ also showed HPO terms typically associated with ‘neurodevelopmental disorders’ and vice versa (Supplementary Fig. 1b). This suggests that grouping patients into single disease groups may be overly simplistic.

Fig. 1 — a, Patients with a suspected rare disease were referred to a MDT and deeply phenotyped using HPO terminology. If a genetic etiology was considered likely, exome sequencing was performed. The MDT then evaluated the molecular findings and could order additional analyses for variants of uncertain significance or variants in potentially novel disease candidate genes (created with BioRender.com). b, Exome sequencing was performed predominantly in children. The main indications for exome sequencing in children were neurodevelopmental disorders. In adults, the main indications were neurological/neuromuscular disorders. In both children and adults, the least common disease categories were ‘cardiovascular’, ‘endocrine, metabolic, mitochondrial, nutritional’ (emmn) and ‘hematopoiesis/immune system’ (his). c, Phenotypic similarities between patients, as encoded according to their HPO terms, were visualized with UMAP. As reference, all OMIM diseases were included using their HPO annotations (gray background dots). For each patient, color coding indicates allocation to disease groups, in accordance with the leading clinical feature. An overlap is evident for patients in the neurodevelopmental and neuromuscular groups (aquamarine and blue clusters), which indicates high phenotypic similarity. This precludes the unequivocal assignment of these patients to a diagnostic group. The triangles indicate patients who contributed to the identification of a novel, high-evidence gene–phenotype association.

Diagnostic yield of exome sequencing

A molecular diagnosis was established in a total of 499 of the 1,577 patients (32%), that is, in these cases, exome sequencing identified variants that fully or partially explained the phenotype. The diagnostic yield was slightly higher in children (32%) than in adults (28%, P = 0.13, Fisher’s exact test; Fig. 2a) and twofold higher in patients assigned to the category ‘neurodevelopmental disorder’ than for all other disease categories (42% versus 22%, P < 0.001, Fisher’s exact test with Bonferroni correction; for single comparisons between disorder groups, see Fig. 2b). Furthermore, exome sequencing found variants of uncertain significance. Specifically, these variants were enriched for missense variants (80% versus 45%, P < 0.001; Supplementary Fig. 2), due to lower support for pathogenicity according to the guidelines of the American College of Medical Genetics (ACMG) and the Association for Molecular Pathologists for interpretation of sequence variants.

Fig. 2 — a,b, The diagnostic yield differed according to age group (adult/child) (a) and disease category (b). For all disease categories, with the exception of cardiovascular, the diagnostic yield was increased by novel DGGs and high-evidence candidate genes (dark-colored tip of the bar). The absolute number of solved cases in which a variant was found in an established disease gene is given at the bottom of each bar, and the number of solved cases attributable to a novel DGG or high-evidence candidate gene is given at the top of each bar. The entire TRANSLATE NAMSE exome sequencing cohort was considered for a and b (n = 1,577). Diagnostic yield between disease categories were compared using two-sided Fisher’s exact test. P values were adjusted by Bonferroni correction. ***P < 0.001; exact corrected P values: neurodevelopmental (ndd) versus neurologic neuromuscular P = 5.4 × 10⁻⁵, ndd versus organ abnormality P = 5.2 × 10⁻⁵, ndd versus emmn P = 5.9 × 10⁻⁴, ndd versus his P = 1.1 × 10⁻¹¹. emnn, endocrine, metabolic, mitochondrial, nutritional; his, hematopoiesis/immune system.

De novo variants and parental mosaicism

A total of 228 diagnoses (45% of 510 diagnoses including dual diagnoses) were attributable to de novo variants, making them the most common cause of disease in families with an autozygosity below 0.02 and the second most common cause in families with consanguinity (Fig. 3). In three families with variants that were initially classified as de novo, evidence for probable or certain parental mosaicism was found (Supplementary Note). In one of these families, the same likely pathogenic variant in PUF60 was identified as the cause of developmental delay in two affected brothers. Since the variant was not detectable in the exome data of either parent, gonadal mosaicism could not be confirmed and was instead presumed on the basis of the family history. The detection in the exome sequencing analysis of three probable parental mosaics among 228 patients corresponds to a frequency of 1.3%, which is within the estimated interval of clinically relevant parental mosaicism^10–12.

Fig. 3 — a, Pie chart showing the distribution of modes of inheritance (MOI) for all diagnoses (n = 510). Most disease-causing variants occurred de novo and on an autosome. At least 75% of all autosomal recessive diagnoses could have been identified by expanded carrier screening (slice). b, Box plots of autozygosity for each MOI (n = 375). Individuals are indicated by gray dots. Autozygosity was substantially increased in individuals with autosomal recessive disorders due to homozygous variants. In the box plots, the center lines indicate the median values, and the bottom and top edges of the boxes are the first (25%) and the third (75%) quartiles. The whiskers extend to the minimal and maximal data points with a maximum distance of 1.5 interquartile ranges from the edges of the box. c, Bar graphs illustrating MOI in individuals with low (<2%, n = 313) and high (>2%, n = 62) autozygosity. On the right, the autosomal dominant de novo rate has been used for normalization. Individuals with high autozygosity had a higher relative burden of recessive diseases, mainly due to the presence of homozygous pathogenic variants. The box plots present the median as the center line, the upper and lower quartiles as box limits, and 1.5× the interquartile range as the whisker length (in the style of Tukey). AD, autosomal dominant inheritance, variant inherited or of unknown origin; AD (de novo), autosomal dominant inheritance with de novo variant; AR (comp het), autosomal recessive inheritance with compound heterozygous variants; AR (hom), autosomal recessive inheritance with homozygous variant; mt, mitochondrial inheritance; XL, X-linked inheritance.

Recessive disease burden

The second-largest proportion of solved cases involved an autosomal recessive (AR) mode of inheritance (125 solved cases, 14.5% of all diagnoses; Fig. 3a). In total, 94 of the causative variants in the 125 recessive diagnoses in the present cohort would also have been classified as pathogenic if identified in healthy individuals¹³. The diagnostic yield was considerably higher in patients with presumed consanguinity (low autozygosity 31%, n = 1,014 versus high autozygosity 41%, n = 144, P = 0.01, Fisher’s exact test), and the composition of the modes of inheritance also differed significantly between the high- and low-autozygosity groups (Fig. 3b). The relative contribution of homozygous variants was significantly higher in the high-autozygosity group (73% of n = 62 diagnoses) than in the low-autozygosity group (2% of n = 313 diagnoses) (odds ratio (OR) 111.5, P < 0.001, Fisher’s exact test). In contrast, the contribution to disease of de novo variants was 13% (n = 62 diagnoses) in the high-autozygosity group compared with 54% (n = 313 diagnoses) in the low-autozygosity group (OR 0.2, P < 0.001, Fisher’s exact test). Since the de novo mutation count is dependent on parental age but not on autozygosity, the disease prevalence that is attributable to de novo variants should be comparable between both groups and can be used for normalization (Fig. 3c). For an inbreeding coefficient of >2%, this suggests a recessive disease burden that is sevenfold higher than for those with lower inbreeding coefficients, which is consistent with previous reports^14–16. However, it also has to be acknowledged that population expansion results in a drop in the prevalence of recessive disorders in random mating populations and that the lower recessive disease burden might be only a transient effect¹⁷.

Dual molecular diagnoses and secondary findings

For 11 individuals, who represented approximately 2% of all solved cases, molecular diagnoses for two distinct or overlapping disease phenotypes were established (Supplementary Table 2). This group showed a tendency for high autozygosity (43%, n = 7 versus 16%, n = 361, P = 0.09, Fisher’s exact test) and recessive disorders (41%, n = 22 diagnoses versus 24%, n = 488 diagnoses, P = 0.08, Fisher’s exact test). The detected percentage of dual diagnoses (2%, 11 of 499 solved cases) is consistent with both the enrichment of high autozygosity and recessive disorders in this group, and earlier reports^18,19.

In 17 individuals who had consented to being informed about secondary findings, we identified medically actionable variants that were unrelated to the present phenotype. The list of 59 actionable genes was based on the ACMG recommendations; however, secondary findings in 7 additional genes were reported following discussions within the respective MDTs (Supplementary Note).

Enrichment of ultrarare diagnoses

For the 499 individuals in whom exome sequencing led to a molecular diagnosis, a total of 549 disease-causing variants were identified in 362 different disease-associated genes as well as structural variants affecting 14 genomic regions (Supplementary Table 1). This plethora of diagnoses suggests that each specific genetic disorder had a very low prevalence. To clarify this, the results were compared with the total number of (likely) pathogenic ClinVar submissions for the respective genes (Fig. 4a). The first quartile of ClinVar variants corresponds to the more frequently identified rare diseases and contains 40,078 variants assigned to 47 genes. In the group of 499 individuals with a molecular diagnosis in the present cohort, only 33 patients and 14 different disease-associated genes fell into this first quartile. In contrast, the majority of the present 499 patients (corresponding to 192 different disorders) were assigned to the fourth quartile, which contains disease genes with the least ClinVar submissions (Fig. 4b). Notably, almost half of the diagnoses assigned in the present cohort were only established in the past decade (Fig. 4c). A comparison with a cohort of comparable size²⁰ revealed a significantly different distribution with respect to the years in which the phenotype was first associated with the respective disease-causing gene (Kolmogorov–Smirnoff test, P < 0.001; Supplementary Figs. 3 and 4).

Fig. 4 — a, Comparison of the number of (likely) pathogenic variants per gene in TRANSLATE NAMSE relative to the frequency of submission of (likely) pathogenic variants to ClinVar. Genes are ordered from left to right according to a decreasing frequency of ClinVar submissions. The black line corresponds to the complementary cumulative distribution (1 − CDF; cumulative distribution function) of ClinVar submissions. Diagnostic variants in TRANSLATE NAMSE (counts displayed on the right axis) were plotted as dots above their respective gene and in the color corresponding to the year in which the gene was first described as being associated with the respective disease. b, Variant counts in TRANSLATE NAMSE in genes with high (first quartile, Q1) to low (Q4) counts of submissions per gene in ClinVar. The genes in Q1–Q4 each cover approximately 1/4 of the submissions of likely or confirmed pathogenic variants to ClinVar, as shown on the x axis in a. Variants in the same gene are grouped in horizontal blocks. c, Bar graph showing the number of variants relative to the time interval in which the gene was first described as being associated with the respective disease. Note that 59 genes listed in the recommendations for reporting of secondary findings (version 2) of the ACMG were excluded from the analyses to counteract potential biases in ClinVar due to submissions of secondary findings⁶⁷. TNAMSE, TRANSLATE NAMSE; vars, variants.

Novel DGGs and candidates

In cases for which no molecular diagnosis could be established due to variants in the known clinical exome, all potentially deleterious variants in the remaining exome were assessed for plausible novel disease etiologies (see detailed scoring for 57 candidate genes in 65 cases in Methods, Supplementary Note and Supplementary Table 3). Moderate evidence was generated for 23 of 57 candidate genes, and high evidence was generated for the remaining 34. A total of 17 candidate genes with high evidence are currently undergoing further investigation, mostly within the framework of international projects. A total of 17 genes (12 with autosomal dominant inheritance, 5 with autosomal recessive inheritance) have acquired diagnostic-grade gene (DGG) status during the first three years through international cooperation^21–33. After the end of the study, two more candidate genes transitioned to the group of DGGs due to additional phenotypic, functional and statistical evidence became available^32,34.

In comparison with pathogenic variants in previously known disease-associated genes, the present candidate gene set showed a higher proportion of missense variants. This is probably attributable to the fact that the classification of missense variants is more challenging (Supplementary Table 3).

Functional assays

For 18 cases that were classified as uncertain or unsolved after initial exome sequencing, multi-omic assays were performed, that is, an analysis of the methylome (n = 4), proteome (n = 3) or transcriptome (n = 14). Epigenetic signatures, as derived from methylome analyses, clarified the status of de novo missense variants as likely benign in one case and as pathogenic in three. This is exemplified by a case with a missense variant in KMT2D (Supplementary Note)^35,36. Variants in MDH2 were reclassified to pathogenic, on the basis of a proteome analysis of patient-derived fibroblasts (Supplementary Note), while results were inconclusive in two unsolved cases. In 13 unsolved cases, RNA sequencing was performed but could not identify transcriptome alterations that lead to the identification of causative variants. Thus, in 5/18 cases, complementary assays facilitated variant reclassification and highlighted the importance of variant validation strategies in diagnostics for suspected rare genetic diseases (Supplementary Note)^37–39.

Predicting the diagnostic yield using machine learning

Analyses were then conducted to investigate whether the phenotype predicted the diagnostic yield of exome sequencing. For this purpose, a least absolute shrinkage and selection operator (LASSO) analysis for binary outcomes was performed. To reduce the phenotypic dimension and to increase interpretability, HPO terms were first aggregated into 49 nonoverlapping phenotypic groups. These phenotypic groups were used as predictors in the LASSO analysis. The resulting model was able to discriminate between solved and unsolved cases (Supplementary Fig. 5a; area under the curve (AUC) 0.67, 95% confidence interval (CI) 0.61–0.74, on a held-out test set of the exome sequencing cohort, n = 321) and yielded the HPO groups ‘dysfunction of higher cognitive abilities’, ‘hematological abnormalities’ and ‘ataxia’ as very influential predictors in terms of the establishment of a molecular diagnosis via exome sequencing (Fig. 5a). To improve the predictions for a wider variety of phenotypic features, we trained on samples of additional cohorts and made the model available as a web service (https://translate-namse.de). YieldPred can now be used to estimate the diagnostic yield of exome sequencing on the basis of the phenotypic features of a given patient and might therefore help in expectation management (Methods and Supplementary Figs. 3, 5 and 6).

Fig. 5 — a, The coefficient paths of regression analysis using the LASSO are shown. Only features that are included in the final model and are present in at least 5% of the cases that were used for training are depicted. The more to the left [lower ln(λ)] a coefficient path starts to deviate from the x axis, the more informative the corresponding feature is in terms of predicting the diagnostic yield. Features with positive coefficients increase the diagnostic yield. In contrast, features with negative coefficients render a monogenic cause less likely. For example, dysfunction of higher cognitive abilities and ataxia are associated with a higher diagnostic yield (clinical features are colored according to their higher-order HPO groups; for details, see Supplementary Note). An algorithm to predict the diagnostic yield (YieldPred) was developed on the basis of these data and can be found online (https://translate-namse.de). b, The performances of variant prioritization approaches were compared. All disease-associated genes were ranked using the respective variant prioritization method. Subsequently, the proportion of cases detected with the correct disease-associated gene (sensitivity) was shown as a function of the number of disease-associated genes considered, beginning at the top score. The following four approaches for variant prioritization were tested in solved cases from the PEDIA cohort (n = 94): (1) only a molecular pathogenicity score (CADD⁶⁸) with top-10 accuracy of 48%; (2) feature-based score (CADA⁶⁹) in addition to CADD with top-10 accuracy of 68%; and (3 and 4) a gestalt score from facial image analysis (GestaltMatcher⁴⁰) alone or in addition to both CADD and CADA referred to as PEDIA score⁴¹ with top-10 accuracy of 82%. Note that the bold lines indicate the observed top-k accuracy and bootstrapped 95% CIs are indicated by the lighter shading around the lines. MRI, magnetic resonance imaging; abn., abnormality; con., congenital; dysf., dysfunction; psych., psychiatric; sym., symptoms; sec, secondary.

Variant prioritization using facial image analysis (PEDIA)

A total of 224 of the 1,577 patients had also provided written informed consent for the evaluation of their facial images with the AI tool GestaltMatcher⁴⁰ and the use of the results (gestalt scores) in exome variant interpretation (PEDIA)⁴¹. In 94 of these PEDIA subcohort cases, a molecular diagnosis was established. For 81 of these 94 cases, the gestalt scores improved prioritization results, that is, the correct diagnosis was ranked higher. In general, the PEDIA approach (that is, a combined scoring approach involving genotype-, phenotype- and facial gestalt-based prioritization tools) can contribute to prioritization efficiency, provided that (1) the clinical features of the underlying disorder include facial dysmorphism and (2) molecularly solved cases are already part of the GestaltMatcher Database⁴⁰ (https://db.gestaltmatcher.org/). In the present PEDIA subcohort, for 81 cases, representing 68 different disorders, one or more previously solved cases were phenotypically so similar that the gestalt score for the associated disease gene resulted in a higher ranking for the pathogenic variant than prioritization approaches that do not make use of image analysis.

Four different variant prioritization approaches involving genotype-based and/or phenotype-based scores were analyzed and their respective accuracy rates compared. For the PEDIA approach, the correct disease-associated gene was listed among the top ten suggestions in 82% of the cases. The PEDIA approach outperformed prioritization by either a molecular score (combined annotation-dependent depletion, CADD⁴²) or GestaltMatcher only, as well as the combined molecular and feature score (CADD + case annotation and disorder annotation (CADA)) (Fig. 5b). As the latter can be considered routine in exome sequencing analysis, additional gestalt scores help to improve variant interpretation in diagnostics.

Based on these results and the extension of the TRANSLATE NAMSE study beyond the initial 3 years, the PEDIA workflow was implemented at further sites. The exome sequencing data of another 149 patients were then analyzed. In this additional cohort, a molecular diagnosis was established in 69 patients, and a top-10 accuracy of 83% was achieved using the PEDIA score (Supplementary Fig. 7).

The PEDIA approach is highly modular, and the GestaltMatcher score for image analysis can also be combined with other prioritization tools such as Exomiser⁴³, Xrare⁴⁴, LIRICAL⁴⁵ or Amelie⁴⁶, which use different molecular scores or HPO-based scores. All tested combinations showed improvements in the top-k accuracies and are discussed in Supplementary Note and Supplementary Fig. 8.

In some cases, the gestalt scores were particularly suggestive and facilitated the identification of otherwise challenging pathogenic variants. For instance, in a patient with a very high gestalt score for Koolen de Vries syndrome, a 4.7-kb de novo deletion affecting KANSL1 was detected⁴⁷. Other case reports of particular interest are described in Supplementary Note and Supplementary Fig. 9.

Exemplary diagnoses with targeted therapy

Implications of diagnoses on clinical management were not assessed in a structured way. However, for five patients in the TRANSLATE NAMSE cohort with a molecular diagnosis (1%), individualized treatments or therapies directed against the mechanism of the disease could be initiated⁴⁸. A patient with metachromatic leukodystrophy due to pathogenic variants in arylsulfatase alpha was treated with autologous CD34⁺ cells that were transduced ex vivo using a lentiviral vector encoding arylsulfatase alpha⁴⁹. The gene therapeutic approach with atidarsagene autotemcel has been authorized by European Medicines Agency (EMA) in the European Union since 17 December 2020. A patient with pyruvate dehydrogenase E1-α deficiency due to a de novo variant in PDHA1 and another patient with GLUT1-deficiency due to pathogenic variants in SLC2A1 were treated with a ketogenic diet. In a patient with cerebral creatine deficiency syndrome 1, due to a missense substitution in SLC6A8, supplementation with creatine was started. In a patient with congenital disorder of glycosylation of type IIc, due to a homozygous missense variant in SLC35C1, the fucosylation deficiency was treated by oral fucose supplementation⁵⁰.

Discussion

Reducing the time to diagnosis from several years to less than 1 year is highly relevant in terms of both prognosis and the targeted use of healthcare resources, since the number of approved therapies for rare diseases in which early treatment is associated with better outcomes is now increasing⁵¹. Establishing a molecular diagnosis quickly will require the implementation of frameworks within healthcare systems that are dedicated to patients with rare diseases. The novel diagnostic approach evaluated in TRANSLATE NAMSE was the practical realization of such a concept. The present investigation suggests that a combination of a structured clinical assessment by an MDT, an advanced sequencing test, such as exome sequencing, and a comprehensive discussion of the results reduces diagnostic delay and may improve therapy. These findings are consistent with reports from other healthcare systems and other disorders that benefit from interdisciplinary structures^20,52–56. On the basis of the present data, in 2021, exome sequencing was included in the list of standard medical services offered to patients with suspected rare diseases who were referred to German CRDs. For all the patients that are still awaiting a molecular diagnosis, new multi-omics approaches are promising but also costly. Therefore, in a complex healthcare system, these tests compete with other analyses, and their efficiency and efficacy in establishing a diagnosis should be evaluated in the future. However, it will be crucial within the German healthcare system that the inclusion of MDTs in the diagnostic process does not delay or even hinder genetic testing for patients with rare diseases. With exome sequencing being incorporated into an increasing number of guidelines, we also anticipate that the focus of the MDT will shift from test selection toward variant interpretation and identifying therapeutic options. By these means, MDTs operating in CRDs would fulfill a similar purpose for patients with rare disorders as molecular tumor boards in centers for personalized medicine already do for cancer patients⁵⁷.

Two notable findings of the present analyses were that, in comparison with ClinVar and a previously reported rare disease cohort of similar size²⁰, the TRANSLATE NAMSE cohort was significantly enriched for ultrarare disorders (Fig. 4a and Supplementary Fig. 4) and that a large number of recently described gene–disease associations were found^1,8,20,58. In our opinion, this accumulation of ultrarare diagnoses and the relative absence of more common conditions is explained by the study protocol, which required consideration of different test options, including gene panels. Furthermore, the fact that a large number of the established diagnoses have only become possible in recent years as a result of increasing medical genetic knowledge (Fig. 4c) highlights the importance of reanalysis of exome data^59,60. Indeed, the present analyses identified a large number of individuals who carried variants that indicated a novel disease–gene association (12% of solved cases), which highlights the fact that the analysis of exome sequencing data should not be limited to known disease genes. Establishing novel gene–disease associations and conducting functional analyses for the reclassification of variants of uncertain significance are time-consuming and highly complex endeavors⁶¹. Hence, from the present logistical perspective, such analyses are easier to perform in a research context than within the routine diagnostic context of clinical practice. However, these findings are of crucial importance for affected individuals and their families. Thus, from a teleological perspective, in some rare disease cases, boundaries separating diagnostics and research are somewhat blurred. Therefore, in the tertiary, academic setting, collaboration between experts from diagnostics and research is highly relevant for patients with suspected ultrarare diseases and a lack of definitive diagnostic findings.

In several patients from the present cohort, molecular diagnoses also resulted in a change of clinical management to a causal or even curative approach to therapy as described above. These cases emphasize the fact that molecular genetic diagnoses are essential in terms of the development of personalized treatments or therapies that are directed against the underlying disease mechanism. The systematic, consortium-based collection of molecular and clinical data represents the first necessary milestone toward achieving this goal. Particularly in the case of ultrarare disorders, the collection of these data requires additional international collaborative efforts.

Besides the ability to select the appropriate genetic test for diagnosing a disease, a core competence of a clinical geneticist is to estimate disease risk in the offspring of healthy individuals.¹⁷ In addition to the relatedness of the partners, the burden of heterozygous pathogenic variants in recessive genes, which can vary considerably depending on demographics^62–65, could play an increasingly important role in family planning. In a total of 94 of the 125 cases with recessive molecular diagnoses, the causal variants would also have been classified as (likely) pathogenic if they had been identified in healthy individuals¹³. This also means that, if the parents of pediatric patients with a recessive disorder in the present cohort had undergone exome sequencing to determine their carrier status, three out of four of these couples could have received appropriate genetic counseling concerning disease risk in future offspring, which supports the argument for extended screening⁶⁶.

Another aim of the present study was to determine whether complementary AI and machine learning approaches would facilitate diagnostic effectiveness and efficiency in the exome sequencing cohort. The PEDIA analyses showed that AI-powered next-generation phenotyping increased the efficiency of exome sequencing data analysis. However, not every case in the present cohort was solved via exome sequencing. Therefore, the machine learning model YieldPred was developed to identify features that had a major impact on the diagnostic yield in our and other study cohorts. Prospectively, this approach can also be used for two purposes. First, it can be used to estimate the probability that exome sequencing will result in a molecular diagnosis in each patient with a suspected rare disease and can by these means help to manage expectations. Second, as YieldPred in its current form provides an estimation of the diagnostic yield of exome sequencing and not of an underlying monogenic condition of a certain individual, it can be used to stratify individuals for more comprehensive genetic testing, that is, a low YieldPred score despite a high likelihood of a monogenic disease indicates that transcriptomics, proteomics or genome sequencing could be promising.

It would be desirable for all individuals with a suspected monogenic disorder for whom no definitive diagnosis can yet be established to have the option of participating in large-scale genomic diagnostic and research initiatives. We present TRANSLATE NAMSE as the German framework that organizes diagnostics for patients with ultrarare diseases with a backbone of case conferences in MDTs in academic CRDs. TRANSLATE NAMSE represents the first national-level project for undiagnosed patients in Germany, and the future expansion of the network on both the national and international level is planned.

In summary, the results of the present study demonstrate that our novel, structured diagnostic concept facilitates the identification of ultrarare disorders on a national level, provides undiagnosed patients with the opportunity to participate in international research, and represents a platform for data sharing that facilitates the development of machine learning and AI tools to improve the diagnostic yield.

Methods

Enrollment, research ethics and consent

A detailed description of the TRANSLATE NAMSE project is provided elsewhere^9,70. In brief, participants for TRANSLATE NAMSE were recruited between January 2018 and December 2020 from a total of ten German CRDs (Berlin, Bochum, Bonn, Dresden, Duisburg/Essen, Hamburg, Heidelberg, Kiel/Lübeck, München and Tübingen). Overall coordination of the recruitment process was performed by the Institute of Public Health Berlin. This study is governed by the approval of the following institutional review boards: Charité – Universitätsmedizin Berlin, Germany (EA2/140/17); UKB Universitätsklinikum Bonn, Germany (Lfd.Nr.386/17); Universitätsklinikum Essen, University Duisburg-Essen, Germany (17-7774-BO); Universitätsklinikum Heidelberg, Germany (S-499/2017); Universitätsklinikum Tübingen, Germany (643/2017BO1); Universität zu Lübeck, Germany (17-272); Ludwig-Maximilians-Universität München, Germany (17-640); Ärztekammer Hamburg, Germany (MC-316/17); Technische Universität Dresden, Germany (AK 464122017). All patients or their legal guardians provided written informed consent before inclusion. The inclusion criteria for TRANSLATE NAMSE were the lack of a definitive diagnosis and the clinical suspicion of a rare disease. The medical records and family history of each individual were evaluated by a MDT, which comprised at least board-certified physicians of two specialities with domain-specific expertise. For each individual, the respective MDT then made recommendations concerning diagnostics and further clinical management. To make the recommendation of exome sequencing, a board-certified human geneticist was additionally required within the MDT. For example, strong criteria for the indication of exome sequencing were congenital malformations, a syndromic phenotype, a positive family history suggestive of a monogenic disease and lack of absence of an alternative test with a comparable suspected diagnostic yield. A total of 1,577 patients (268 adult and 1,309 pediatric) from the TRANSLATE NAMSE cohort were referred for exome sequencing on the recommendation of the MDT at the respective CRD (exome sequencing cohort). The phenotypic and molecular genetic data of these 1,577 patients were evaluated in the present analyses.

Clinical and laboratory phenotype data

Clinical and laboratory phenotype data were transferred to the sequencing laboratory in the form of hard-copy case report forms or as online data capture applications (Face2Gene Clinic). Online data capture allowed the free entry of HPO terms. Data from hard-copy report forms and free-text entries were transformed into HPO terms. The phenotypes reported in the present study are those that were reported to the sequencing laboratories. On the basis of the leading presenting clinical feature, each case was assigned to one of six major disease groups (Supplementary Fig. 1b). This allowed a more definitive statement on diagnostic yield in relation to the clinical features of the patient. In the subsequent analyses, all assigned HPO terms (n = 1,649) were compiled and divided into higher-order groups (n = 12) and subcategories (n = 49) by expert clinicians. Therefore, patients were additionally assigned to at least one higher-order group as well as at least one subgroup. To assign a patient to an HPO-defined group, the patient had to have at least one of the HPO terms belonging to the respective group. The following higher-order groups were defined: 1, neurodevelopmental; 2, neuromuscular; 3, seizures; 4, growth disorders; 5, facial dysmorphism; 6, abnormality of connective tissue; 7, congenital malformations; 8, endocrine and metabolic abnormalities; 9, immune and hematological abnormalities; 10, sensory organ alterations; 11, abnormal findings on brain magnetic resonance imaging; 12, others. Within the respective higher-order groups, HPO terms were further assigned to subcategories (n = 49) (https://github.com/Ax-Sch/TNAMSE_geno_pheno/blob/main/resources/hpo_categorization_19_12_2022.tsv).

DNA sequencing

Details on DNA sequencing for each sequencing laboratory are given in Supplementary Table 4. Trio sequencing was conducted for 58% of the cases. When additional informative relatives were available, these were also included in the analysis as permitted by German law (healthy minors were not analyzed). EDTA-treated whole-blood samples or saliva kits were delivered to one of the five participating sequencing centers (Berlin, Bonn, LMU Munich, Munich or Tuebingen) for further processing. After DNA extraction, fragment size and purity were assessed. If the DNA fulfilled all quality criteria, the sample was submitted for sequencing. Exome sequencing was performed on exon targets that were isolated using capture and either Agilent SureSelect Human All Exon kits v6 or v7 (Agilent Technologies), or the Human Core Exome Kit (Twist Bioscience). One microgram of DNA was sheared into 350–400-bp fragments, which were then repaired, ligated to adaptors and purified for subsequent polymerase chain reaction amplification. Amplified products were then captured by biotinylated RNA library baits in solution, in accordance with the manufacturer’s instructions. Bound DNA was isolated with streptavidin-coated beads and reamplified. The final isolated products were sequenced using the Illumina NextSeq 500, NextSeq 550, HiSeq 2500 or NovaSeq 6000 sequencing system and 2 × 100-bp paired-end reads (Illumina). All five sequencing centers ensured a coverage of over 20× in over 95% of the RefSeq target region.

Exome sequencing data-processing pipeline

Details on exome sequencing data processing for each sequencing laboratory are given in Supplementary Table 4. At each of the five sequencing centers, exome sequencing processing pipelines were established according to best practice guidelines. The DNA sequence was mapped to the published human genome build GRCh37 reference sequence using Burrows–Wheeler Aligner (BWA). The most up-to-date version at the time of sequencing was used, progressing from BWA v0.7.11 through to BWA-Mem v0.7.17^71,72. Single nucleotide variants and small indels were detected with HaplotypeCaller (v3.7, v3.8 or v4.1; three laboratories, 40.0% of cases), Freebayes (v1.2.0, one laboratory, 16.6% of cases) or HaplotypeCaller as well as SAMtools v.0.1.7 (one laboratory, 43.4% of cases)^73,74. Mitochondrial DNA variants were assessed using data from exome sequencing in three laboratories (80% of cases)⁷⁵. Copy number variations were detected using ExomeDepth or ClinCNN on short-read data (two laboratories, 60.0% of cases), before exome sequencing by array CGH (two laboratories, 30.0% of cases) or not evaluated (one laboratory, 10.1%)^76,77. Additionally, analysis for structural variants was only conducted by one laboratory (16.6% of cases). Analysis for uniparental disomy was performed in two sequencing laboratories (60.0% of cases) using the UpdHunter function of ngs-bits v2019_09 (https://github.com/imgag/ngs-bits) or custom scripts. Finally, analyses for mosaic variants were conducted by four laboratories (90% of cases).

Variants were annotated using VEP (four laboratories, 80.2% of cases)⁷⁸ or Jannovar (one laboratory, 19.8% of cases)⁷⁹ and analyzed in VarFish⁸⁰, megaSAP (https://github.com/imgag/megSAP) or EVAdb (https://github.com/mri-ihg/EVAdb) or in tabular format depending on the center. Virtual gene panels were used in four out of five sequencing sites (56.7% of cases). In the sequencing site where no virtual panels were used, a similar approach (HPO-based and Online Mendelian Inheritance in Man (OMIM) full-text search) was used. Additionally, filter parameters specific for assumed modes of inheritances were applied (all laboratories; mainly cutoffs of allele frequencies or counts in the population database gnomAD).

The population background of each individual was estimated with peddy⁸¹. This revealed that the cohort was of predominantly European origin (Supplementary Table 1 and Supplementary Fig. 10).

Autozygosity was estimated using RohHunter, bcftools/roh or a sliding-window framework^82–84. A small subset of samples was run on all three tools, and this yielded comparable results for autozygosity. A threshold of 2% was used to assign patients to a high- or a low-autozygosity group¹⁴ (Supplementary Fig. 11).

The variants identified in exome sequencing were assessed in accordance with the standards and guidelines of the ACMG for the interpretation of sequence variants⁸⁵. At least two physicians or experts in molecular genetics participated in the assessment of the variants. Finally, all variants that were potentially disease-causing (pathogenicity class 3–5) and actionable secondary findings were reported to the respective patients.

Cases in which no diagnosis could be established in a known disease-associated gene were included in national and international studies for the discovery of novel disease etiologies for example, via the MatchMaker Exchange network^86,87. Variants with a high likelihood of being disease-causing, for example, those with loss of function or high pathogenicity scores, or those that had arisen de novo, were shared through MatchMaker Exchange or a similar network in order to identify similar patients^88,89.

Statistical analyses

All statistical analyses were conducted in R (version 4.2.2)⁹⁰. Proportions were tested using a two-sided Fisher’s exact test. The significance level was set to α = 0.05, and P values were corrected via Bonferroni correction if necessary.

Visualization of phenotype space using UMAP

First, data on known diseases and their clinical features were downloaded from the HPO website (https://hpo.jax.org/app/download/annotation, file: genes_to_phenotype.txt, downloaded on 10 April 2021). The disease data were merged with the data of the 1,577 individuals from TRANSLATE NAMSE by treating each disease–ID as one individual. Similarities in HPO terms between all pairs of individuals were then calculated using the R package ontologySimilarity (version 2.5). The similarities were then converted to a distance matrix and projected into a four-dimensional space using uniform manifold approximation and projection (UMAP). Subsequently, the first two dimensions of this projection were plotted using ggplot2 (version 3.3.4).

Variants amenable to carrier screening

In cases with autosomal recessive inheritance, disease-causing variants in ClinVar were queried in January 2017 (beginning of the project) to take into account the state of knowledge available at the time of analysis. Variants were classified as amenable to carrier screening if they were classified as pathogenic or likely pathogenic in ClinVar or if they were predicted loss-of-function variants that were not predicted to escape nonsense-mediated messenger RNA decay. In compound-heterozygous inheritance, both variants were required to be (likely) pathogenic.

Comparison of disease-associated genes reported in TRANSLATE NAMSE with those reported in other cohorts

In the German healthcare system, genetic testing of the more frequent rare disorders, for example, retinitis pigmentosa or hearing impairment, is performed using gene panels.

For a comparison with the cohort from the NIHR BioResource described in Turro et al.²⁰, all disease-associated genes were first ranked according to the frequency of submissions of pathogenic and likely pathogenic variants to ClinVar. Disorders caused by genes in the first quartile of the ClinVar gene distribution, such as USH2A, ABCA4 and BMPR2, are more prevalent than phenotypes associated with genes in the fourth quartile. In addition, the year in which phenotype–gene associations had first been reported was determined to assess when a diagnosis could first have been established. The characteristics of the variants identified in the TRANSLATE NAMSE exome sequencing cohort were then compared with those identified in a cohort reported by Turro et al. in 2020.

Turro et al. subjected DNA from 9,802 individuals with a suspected rare disease to genome sequencing and reported pathogenic or likely pathogenic variants in 1,138 cases²⁰. Around a quarter of these variants were assigned to genes with a high disease prevalence (Supplementary Fig. 4). In contrast, most disease-associated genes identified in the TRANSLATE NAMSE cohort were ultrarare, and more frequent diagnoses were underrepresented.

Novel disease candidate genes

Sequence data from the unsolved cases were analyzed for variants in potential novel disease candidate genes. The following mandatory criteria for novel disease candidate genes were defined: (1) the gene had shown no previous robust association with any human phenotype; (2) no other clearly causative disease explanation was found; (3) the allele frequency of the respective variant was below the minor allele frequency cutoff or the variant was absent in controls; (4) inheritance was in accordance with the phenotype in the family and/or the variant co-segregated with the disease in multiple affected family members. As in the ClinGen approach and as suggested by others, characteristics, including gnomAD constraint metrics, inheritance and functional data, by which the level of evidence for the manually identified candidate genes could be assessed were defined^61,91,92 (Supplementary Table 3). An evidence score was then calculated, which could reach a maximum value of 8. Three of the nine criteria can only be applied to genes with an autosomal dominant mode of inheritance (de novo status and gnomAD constraint metrics), rendering the score less informative for autosomal recessive inheritance. For autosomal dominant inheritance, a score of 1–3 was ranked as medium evidence and a score of 4 and above as high evidence. For recessive inheritance, a score of 3 or above was ranked as high evidence and a score of below 3 was ranked as medium evidence. Genes first published as disease-associated during the course of TRANSLATE NAMSE were classified as novel DGG.

Diagnostic yield prediction (YieldPred)

The TRANSLATE NAMSE exome sequencing cohort (n = 1,577) was randomly divided into a training set comprising 1,256 cases (399 solved, 32%) and a test set comprising 321 cases (99 solved, 31%). The binary status of a case (1, solved; 0, unsolved) was regressed on the 49 HPO-defined subcategories (cf. clinical and laboratory phenotype data) using LASSO for binary outcomes with the logit function as a link function (R package glmnet, version 4.1-4) and by controlling for age (adult/child), sex (male/female), sequencing laboratory and the use of the PEDIA workflow. Variable selection was applied on the 49 HPO-defined subcategories only. The model was fitted on the training set, and the penalty parameter was tuned via tenfold cross-validation. The resulting model was then applied to the test set, and its predictive performance was evaluated using the receiver operator characteristics curve.

We further validated the influence of the separate HPO terms on the model. Figure 5 shows the resulting coefficient plot and was checked for plausibility. We found a positive correlation between the number of HPO terms and the predicted probability on the complete TRANSLATE NAMSE exome sequencing cohort (n = 1,577; Supplementary Fig. 6). Since the approach of HPO-defined subcategories ensures that multiple lower-order terms are only counted once, this finding indicates that a monogenic cause and diagnosis via exome sequencing is more likely if a patient exhibits a diverse set of clinical features. Furthermore, we investigated the discriminatory power of all 1,649 unique HPO terms that were annotated in the TRANSLATE NAMSE cohort. Considering each HPO term separately to discriminate between solved and unsolved patients led to an average AUC of 0.5 (s.d. 0.003), that is, no discriminatory power. The maximum achieved AUC of a single HPO term, namely HP:0001263 (global developmental delay), was 0.58. As a sensitivity analysis, we then fitted a logistic regression on the complete TNAMSE cohort with the top five HPO terms, namely HP:0001263 (global developmental delay), HP:0000252 (microcephaly), HP:0001252 (hypotonia), HP:0001250 (seizure) and HP:0001251 (ataxia), and achieved an AUC of 0.64 (95% CI 0.61–0.67). On the complete TNAMSE set (that is, training and test set combined) our YieldPred model yielded an AUC of 0.72 (95% CI 0.69–0.74). In summary, there are some HPO terms that have higher discriminatory power than the majority of the HPO terms. However, the signal of YieldPred is additionally driven by the combination of multiple phenotypic features that are present in a patient.

To increase the portability and applicability of the Lasso model, two additional external and independent cohorts were included. This first external cohort (n = 753, 545 solved, 72%; Supplementary Table 5) was recruited by the Technical University of Munich, and all individuals consented in the scientific use of their phenotype and genotype data. As a second external cohort, we used the NIHR BioResource cohort described by Turro et al. (n = 5,510, 1,059 solved, 19%). The Lasso model was then retrained on cases of all three cohorts and 20% of the cases of each cohort were kept as hold-out test set. The AUCs of the final model ranged from 0.64 for the TRANSLATE NAMSE cases of the test set and 0.65 for the Munich cases of the test set to 0.71 for the cases of the test set from the cohort of Turro et al. (Supplementary Fig. 5). The final model was provided as the tool YieldPred as a web service, where users can specify the age, sex and assigned HPO terms of their patient, while the remaining confounders are estimated via the mean confounder values of the training cohort.

PEDIA analysis

PEDIA integrated the facial image and clinical feature analysis with exome data analysis⁴¹. For each patient, a frontal facial image, clinical features encoded in HPO terminology, and exome sequencing data were available for analysis.

The PEDIA approach was used, in which the facial image analysis was analyzed by GestaltMatcher⁴⁰. GestaltMatcher was trained on 6,354 frontal images with 204 different disorders to learn the respective facial dysmorphic features, and it further encoded each image into a 512-dimensional facial phenotype descriptor. The model ensembles and test-time augmentation were later used to generate 12 512-dimensional facial phenotype descriptors for each image⁹³. The similarity between two patients can be quantified by averaging 12 cosine distances of the facial phenotype descriptors. For each test image, a list of similarity scores for 816 disease-causing genes were obtained. To convert HPO terms of individual patients into feature scores for each gene, the CADA approach was used⁶⁹. For the exome data, each variant was annotated with a version 1.6 CADD score⁴². After filtering out the common variants, the highest CADD score for each gene was taken.

In this analysis, benchmarking was performed on two cohorts: the PEDIA subcohort and the validation cohort. The PEDIA subcohort consisted of a subset of 224 of the 1,577 exome sequencing patients (194 pediatric, 30 adult). Of these, 94 had a molecular genetic diagnosis (86 pediatric, 8 adult). After the end of the 3-year TRANSLATE NAMSE recruitment period, a further 149 patients were enrolled and used as a validation cohort. In the validation cohort, 69 out of 149 patients were solved cases. All facial images analyzed in the present study can be accessed in GestaltMatcher Database (https://db.gestaltmatcher.org/) by the GMDB ID in Supplementary Tables 1 and 6. For each patient, each gene had a GestaltMatcher score, a CADA score and a CADD score. These three scores were the input of the PEDIA approach. The output for each patient was a list of genes, and each gene had a PEDIA score. The genes were then prioritized by ranking the PEDIA scores in descending order. To benchmark the performance, top-k accuracy was used, as calculated by the percentage of the patients with the disease-causing gene ranked in the top-k position. Finally, the top-1 to top-100 accuracies of the two cohorts (the PEDIA subcohort of the exome sequencing cohort and validation cohort) were reported.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Online content

Any methods, additional references, Nature Portfolio reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41588-024-01836-1.

Supplementary information

Supplementary Information^{(6.1MB, pdf)}

Supplementary note and Figs. 1–14 and legends for Supplementary Tables 1–7.

Reporting Summary^{(1.9MB, pdf)}

Peer Review File^{(2.1MB, pdf)}

Supplementary Tables^{(326.6KB, xlsx)}

Supplementary Tables 1–7.

Acknowledgements

We thank all patients and families from TRANSLATE NAMSE and NIHR BioResource for their cooperation. We thank C. Schmael for proofreading of the manuscript. M.D., H.L.S. and M.A.M. are participants in the BIH Charité (Digital/Junior) Clinician Scientist Program, which is funded by Charité – Universitätsmedizin Berlin and the Berlin Institute of Health (BIH). F. Boschann is a participant in the Clinician Scientist Program (CS4RARE) funded by the Alliance4Rare and associated to the BIH Charité Clinician Scientist Program. A.S. was supported by the BONFOR program of the Medical Faculty, University of Bonn (O-149.0134). M.A.L.-K. received funding from DFG (CRC237 369799452/B21 and CRC237 369799452/A11). C. Schlein received funding from DFG (SCHL2276/2-1; 450149205-TRR333/1). E.T. was funded by NIH awards R01HL161365 and R03HD111492.

Author contributions

Study conceptualization and design: N.E., T. Haack, P.M.K. and M.W. Sample and data acquisition: R.A.J., L.A., A.A., S.B.-W., M. Begemann, T. Bender, R. Berner, S.B., R. Berutti, M. Bettendorf, R.C.B., A. Bevot, I.B., F. Boschann, F. Brand, F. Brinkmann, M. Brugger, T. Brunet, P.B., T. Bäumer, A. Bösch, C.B., S.C.-G., D.C., K.C., M.D., G.D., N.D.D., N.E., M.E., H.E., H.G., E.G., C.G., L.G., M.G.-E., M.G., K.G., A.G.-K., T. Haack, B.H., A.-M.H., F.H., A. Heimbach, A. Heinen, Y.H., M.H., J. Hentschel, T. Herget, R. Herzog, K.H., R. Hirtz, J. Hoefele, S. Hoffjan, G.F.H., S. Holzhauer, D.H., I.H., A.M.K., F.J.K., N.K., T. Kallinich, P.K., V.K., L.T.K., C. Kehrer, M. Kehrer, C. Kiewart, O.K., C. Klein, T. Klockgether, A. Knaus, C. Knopp, X.K., U. Kornak, U. Kotzaeridou, R.K., F.K., P.M.K., M. Kreiss, M. Krenn, T. Kretschmer, H.K., C. Kubisch, A. Kuechler, S.K., I.K., J.K., M.A.L.-K., J.R.L., H.L., J.L., K.L., J.M., C.M., E. Mangold, E. Manka, N.M., K.M., T.M., M.A.M., N.M.-S., A.M.M., S. Mundlos, A.C.M., M. Munteanu, M. Mücke, W.M.-F., A.M., J.N., H.H.P.N., M.N., K.O., P.M.O., C.-E.O., J.T.P., M.P., C.P., S.P., K.P., U.R., K.M.R., O.R., F.R., A.R., M.R., T.R., C.R., C.P.S., U.A.S., E. Schlapakow, C. Schlein, A.S., C. Schramm, E. Schröck, S. Schröder, M. Schuelke, C. Schuetz, E.M.C.S., S. Schwartzmann, V.S.S., L.J.S., H.L.S., S. Sivalingam, M. Spielmann, I.S., C. Stieber, C. Stoltenburg, V.S., T.S., M. Sturm, A.-A.S., P.T., A.T., E.T., K.U., M.W., H.W., C.W.-N., N.W., C.W., D.S.W., P.W., M.Z., L.Z. and D.Ö. Analysis and interpretation: M.A.B., L.A., T. Brunet, M.D., G.D., N.E., H.E., K.G., T. Haack, M.H., T.-C.H., P.I., H. Klinkhammer, A. Knaus, P.M.K., S. Moosa, A.S., S. Sivalingam, E.T. and M.W. Manuscript writing: T. Brunet, M.D., N.E., H.E., K.G., T. Haack, T.-C.H., H. Klinkhammer, P.M.K., A.S. and M.W. Coordination and funding acquisition: N.E., T. Haack, P.M.K., H. Krude, T.M., S. Mundlos, M.N., O.R. and M.W.

Peer review

Peer review information

Nature Genetics thanks Zornitza Stark, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Data availability

The corresponding author agrees to fulfill any requests for materials not included in the article, subject to verification that the request adheres to the consent provided by the research participants. Patient-related data not included in the article may be subject to patient confidentiality. Raw sequencing data were not consented for sharing, except for the PEDIA subset, which is available upon request. Reported alleles and their clinical interpretation have been deposited in ClinVar using the following submitters: Institute for Genomic Statistics and Bioinformatics (University Hospital Bonn) (https://www.ncbi.nlm.nih.gov/clinvar/submitters/507028/, https://www.ncbi.nlm.nih.gov/clinvar/submitters/508040/); Institute of Human Genetics, Klinikum rechts der Isar (Technical University Munich) (https://www.ncbi.nlm.nih.gov/clinvar/submitters/500240/); Institute for Medical Genetics and Human Genetics (Charité – Universitätsmedizin Berlin) (https://www.ncbi.nlm.nih.gov/clinvar/submitters/505735/); Institute of Medical Genetics and Applied Genomics (University Hospital Tübingen) (https://www.ncbi.nlm.nih.gov/clinvar/submitters/506385/); and Genomics Facility (Ludwig-Maximilians-Universität München) (https://www.ncbi.nlm.nih.gov/clinvar/submitters/507363/).

Code availability

The study’s landing page (https://www.translate-namse.de) redirects to a web service for the prediction of the diagnostic yield and the code repository at GitHub (https://github.com/Ax-Sch/TNAMSE_geno_pheno). Code is also available via Zenodo at 10.5281/zenodo.10964188 (ref. ⁹⁴). All source codes are available under a creative commons license.

Competing interests

V.S.S. has received consultant fees from Novartis, Chugai, AbbVie, Celgene, Sanofi, Lilly, Hexal, Pfizer, Amgen, BMS, Roche, Gilead, Medac, Boehringer-Ingelheim and Alexion and speaker’s bureau fees from AbbVie, Novartis, BMS, Chugai, Celgene, Medac, Sanofi, Lilly, Hexal, Pfizer, Janssen, Roche, Schire, Onkowissen, Royal College London, Boehringer-Ingelheim and UCB Fresenius. M.G.-E. has received research support from the German Ministry of Education and Research (BMBF) within the European Joint Program for Rare Diseases (EJP-RD) 2021 Transnational Call for Rare Disease Research Projects (funding number 01GM2110), from the National Ataxia Foundation (NAF) and from Ataxia UK and received consulting fees from Healthcare Manufaktur, Germany, all unrelated to this study. All other authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Axel Schmidt, Magdalena Danyel, Kathrin Grundmann, Theresa Brunet.

These authors jointly supervised this work: Peter M. Krawitz, Tobias Haack, Nadja Ehmke, Matias Wagner.

Supplementary information

The online version contains supplementary material available at 10.1038/s41588-024-01836-1.

References

1.Nguengang Wakap, S. et al. Estimating cumulative point prevalence of rare diseases: analysis of the Orphanet database. Eur. J. Hum. Genet.28, 165–173 (2020). 10.1038/s41431-019-0508-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Blöß, S. et al. Diagnostic needs for rare diseases and shared prediagnostic phenomena: results of a German-wide expert Delphi survey. PLoS ONE12, e0172532 (2017). 10.1371/journal.pone.0172532 [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Boycott, K. M. et al. International cooperation to enable the diagnosis of all rare genetic diseases. Am. J. Hum. Genet.100, 695–705 (2017). 10.1016/j.ajhg.2017.04.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Austin, C. P. et al. Future of rare diseases eesearch 2017–2027: an IRDiRC Perspective. Clin. Transl. Sci.11, 21–27 (2018). 10.1111/cts.12500 [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Hochstenbach, R. et al. Array analysis and karyotyping: workflow consequences based on a retrospective study of 36,325 patients with idiopathic developmental delay in the Netherlands. Eur. J. Med. Genet.52, 161–169 (2009). 10.1016/j.ejmg.2009.03.015 [DOI] [PubMed] [Google Scholar]
6.Choi, H. S. et al. Molecular diagnosis of hereditary spherocytosis by multi-gene target sequencing in Korea: matching with osmotic fragility test and presence of spherocyte. Orphanet J. Rare Dis.14, 114 (2019). 10.1186/s13023-019-1070-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Kochinke, K. et al. Systematic phenomics analysis deconvolutes genes mutated in intellectual disability into biologically coherent modules. Am. J. Hum. Genet.98, 149–164 (2016). 10.1016/j.ajhg.2015.11.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
8.100,000 Genomes Project Pilot Investigatorset al. 100,000 Genomes pilot on rare-disease diagnosis in health care—preliminary report. N. Engl. J. Med.385, 1868–1880 (2021). 10.1056/NEJMoa2035790 [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Rillig, F., Grüters, A., Schramm, C. & Krude, H. The interdisciplinary diagnosis of rare diseases: results of the TRANSLATE-NAMSE project. Dtsch. Arztebl. Int.119, 469–475 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Cao, Y. et al. A clinical survey of mosaic single nucleotide variants in disease-causing genes detected by exome sequencing. Genome Med.11, 48 (2019). 10.1186/s13073-019-0658-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Gambin, T. et al. Low-level parental somatic mosaic SNVs in exomes from a large cohort of trios with diverse suspected Mendelian conditions. Genet. Med.22, 1768–1776 (2020). 10.1038/s41436-020-0897-z [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Wright, C. F. et al. Clinically-relevant postzygotic mosaicism in parents and children with developmental disorders in trio exome sequencing data. Nat. Commun.10, 2985 (2019). 10.1038/s41467-019-11059-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Landrum, M. J. et al. ClinVar: improvements to accessing data. Nucleic Acids Res.48, D835–D844 (2020). 10.1093/nar/gkz972 [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Martin, H. C. et al. Quantifying the contribution of recessive coding variation to developmental disorders. Science362, 1161–1164 (2018). 10.1126/science.aar6731 [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Fridman, H. et al. The landscape of autosomal-recessive pathogenic variants in European populations reveals phenotype-specific effects. Am. J. Hum. Genet.108, 608–619 (2021). 10.1016/j.ajhg.2021.03.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Hu, H. et al. Genetics of intellectual disability in consanguineous families. Mol. Psychiatry24, 1027–1039 (2019). 10.1038/s41380-017-0012-2 [DOI] [PubMed] [Google Scholar]
17.La Rocca, L. A. et al. Understanding recessive disease risk in multi-ethnic populations with different degrees of consanguinity. Am. J. Med. Genet. A194, e63452 (2024). 10.1002/ajmg.a.63452 [DOI] [PubMed] [Google Scholar]
18.Posey, J. E. et al. Resolution of disease phenotypes resulting from multilocus genomic variation. N. Engl. J. Med.376, 21–31 (2017). 10.1056/NEJMoa1516767 [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Mitani, T. et al. High prevalence of multilocus pathogenic variation in neurodevelopmental disorders in the Turkish population. Am. J. Hum. Genet.108, 1981–2005 (2021). 10.1016/j.ajhg.2021.08.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Turro, E. et al. Whole-genome sequencing of patients with rare diseases in a national health system. Nature583, 96–102 (2020). 10.1038/s41586-020-2434-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Körholz, J. et al. Novel mutation and expanding phenotype in IRF2BP2 deficiency. Rheumatology62, 1699–1705 (2023). 10.1093/rheumatology/keac575 [DOI] [PubMed] [Google Scholar]
22.Mochel, F. et al. Variants in the SK2 channel gene (KCNN2) lead to dominant neurodevelopmental movement disorders. Brain143, 3564–3573 (2020). 10.1093/brain/awaa346 [DOI] [PubMed] [Google Scholar]
23.Magg, T. et al. Heterozygous OAS1 gain-of-function variants cause an autoinflammatory immunodeficiency. Sci. Immunol.6, eabf9564 (2021). 10.1126/sciimmunol.abf9564 [DOI] [PMC free article] [PubMed] [Google Scholar]
24.den Hoed, J. et al. Mutation-specific pathophysiological mechanisms define different neurodevelopmental disorders associated with SATB1 dysfunction. Am. J. Hum. Genet.108, 346–356 (2021). 10.1016/j.ajhg.2021.01.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Li, D. et al. Pathogenic variants in SMARCA5, a chromatin remodeler, cause a range of syndromic neurodevelopmental features. Sci. Adv.7, eabf2066 (2021). 10.1126/sciadv.abf2066 [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Thaventhiran, J. E. D. et al. Whole-genome sequencing of a sporadic primary immunodeficiency cohort. Nature583, 90–95 (2020). 10.1038/s41586-020-2265-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Vogt, G. et al. Biallelic truncating variants in ATP9A cause a novel neurodevelopmental disorder involving postnatal microcephaly and failure to thrive. J. Med. Genet.59, 662–668 (2022). 10.1136/jmedgenet-2021-107843 [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Stenton, S. L. et al. Impaired complex I repair causes recessive Leber’s hereditary optic neuropathy. J. Clin. Invest.131, e138267 (2021). 10.1172/JCI138267 [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Horn, D. et al. Biallelic truncating variants in MAPKAPK5 cause a new developmental disorder involving neurological, cardiac, and facial anomalies combined with synpolydactyly. Genet. Med.23, 679–688 (2021). 10.1038/s41436-020-01052-2 [DOI] [PubMed] [Google Scholar]
30.Brugger, M. et al. A homozygous truncating variant in CCDC186 in an individual with epileptic encephalopathy. Ann. Clin. Transl. Neurol.8, 278–283 (2021). 10.1002/acn3.51260 [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Marafi, D. et al. A reverse genetics and genomics approach to gene paralog function and disease: Myokymia and the juxtaparanode. Am. J. Hum. Genet.109, 1713–1723 (2022). 10.1016/j.ajhg.2022.07.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Ebstein, F. et al. PSMC3 proteasome subunit variants are associated with neurodevelopmental delay and type I interferon production. Sci. Transl. Med.15, eabo3189 (2023). 10.1126/scitranslmed.abo3189 [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Richard, E. M. et al. Bi-allelic variants in SPATA5L1 lead to intellectual disability, spastic-dystonic cerebral palsy, epilepsy, and hearing loss. Am. J. Hum. Genet.108, 2006–2016 (2021). 10.1016/j.ajhg.2021.08.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Liu, Z. et al. Hemizygous variants in protein phosphatase 1 regulatory subunit 3F (PPP1R3F) are associated with a neurodevelopmental disorder characterized by developmental delay, intellectual disability and autistic features. Hum. Mol. Genet.32, 2981–2995 (2023). 10.1093/hmg/ddad124 [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Aref-Eshghi, E. et al. Genomic DNA methylation signatures enable concurrent diagnosis and clinical genetic variant classification in neurodevelopmental syndromes. Am. J. Hum. Genet.102, 156–174 (2018). 10.1016/j.ajhg.2017.12.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Mirza-Schreiber, N. et al. Blood DNA methylation provides an accurate biomarker of KMT2B-related dystonia and predicts onset. Brain145, 644–654 (2022). 10.1093/brain/awab360 [DOI] [PubMed] [Google Scholar]
37.Cummings, B. B. et al. Improving genetic diagnosis in Mendelian disease with transcriptome sequencing. Sci. Transl. Med.9, eaal5209 (2017). 10.1126/scitranslmed.aal5209 [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Murdock, D. R. et al. Transcriptome-directed analysis for Mendelian disease diagnosis overcomes limitations of conventional genomic testing. J. Clin. Invest.131, e141500 (2021). 10.1172/JCI141500 [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Frésard, L. et al. Identification of rare-disease genes using blood transcriptome sequencing and large control cohorts. Nat. Med.25, 911–919 (2019). 10.1038/s41591-019-0457-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Hsieh, T.-C. et al. GestaltMatcher facilitates rare disease matching using facial phenotype descriptors. Nat. Genet.54, 349–357 (2022). 10.1038/s41588-021-01010-x [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Hsieh, T.-C. et al. PEDIA: prioritization of exome data by image analysis. Genet. Med.21, 2807–2814 (2019). 10.1038/s41436-019-0566-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
42.Kircher, M. et al. A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet.46, 310–315 (2014). 10.1038/ng.2892 [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Robinson, P. N. et al. Improved exome prioritization of disease genes through cross-species phenotype comparison. Genome Res.24, 340–348 (2014). 10.1101/gr.160325.113 [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Li, Q., Zhao, K., Bustamante, C. D., Ma, X. & Wong, W. H. Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis. Genet. Med.21, 2126–2134 (2019). 10.1038/s41436-019-0439-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Robinson, P. N. et al. Interpretable clinical genomics with a likelihood ratio paradigm. Am. J. Hum. Genet.107, 403–417 (2020). 10.1016/j.ajhg.2020.06.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Birgmeier, J. et al. AMELIE speeds Mendelian diagnosis by matching patient phenotype and genotype to primary literature. Sci. Transl. Med.12, eaau9113 (2020). 10.1126/scitranslmed.aau9113 [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Brand, F. et al. Next-generation phenotyping contributing to the identification of a 4.7 kb deletion in KANSL1 causing Koolen-de Vries syndrome. Hum. Mutat.43, 1659–1665 (2022). 10.1002/humu.24467 [DOI] [PubMed] [Google Scholar]
48.Bick, D. et al. An online compendium of treatable genetic disorders. Am. J. Med. Genet. C187, 48–54 (2021). 10.1002/ajmg.c.31874 [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Capotondo, A. et al. Safety of arylsulfatase A overexpression for gene therapy of metachromatic leukodystrophy. Hum. Gene Ther.18, 821–836 (2007). 10.1089/hum.2007.048 [DOI] [PubMed] [Google Scholar]
50.Feichtinger, R. G. et al. A spoonful of L-fucose-an efficient therapy for GFUS-CDG, a new glycosylation disorder. EMBO Mol. Med.13, e14332 (2021). 10.15252/emmm.202114332 [DOI] [PMC free article] [PubMed] [Google Scholar]
51.Tambuyzer, E. et al. Therapies for rare diseases: therapeutic modalities, progress and challenges ahead. Nat. Rev. Drug Discov.19, 93–111 (2020). 10.1038/s41573-019-0049-9 [DOI] [PubMed] [Google Scholar]
52.Stark, Z. et al. Prospective comparison of the cost-effectiveness of clinical whole-exome sequencing with that of usual care overwhelmingly supports early use and reimbursement. Genet. Med.19, 867–874 (2017). 10.1038/gim.2016.221 [DOI] [PubMed] [Google Scholar]
53.Retterer, K. et al. Clinical application of whole-exome sequencing across clinical indications. Genet. Med.18, 696–704 (2016). 10.1038/gim.2015.148 [DOI] [PubMed] [Google Scholar]
54.Kingsmore, S. F. et al. A randomized, controlled trial of the analytic and diagnostic performance of singleton and trio, rapid genome and exome sequencing in ill infants. Am. J. Hum. Genet.105, 719–733 (2019). 10.1016/j.ajhg.2019.08.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
55.Benito-Lozano, J. et al. Diagnostic process in rare diseases: determinants associated with diagnostic delay. Int. J. Environ. Res. Public Health19, 6456 (2022). 10.3390/ijerph19116456 [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Benito-Lozano, J., López-Villalba, B., Arias-Merino, G., Posada de la Paz, M. & Alonso-Ferreira, V. Diagnostic delay in rare diseases: data from the Spanish rare diseases patient registry. Orphanet J. Rare Dis.17, 418 (2022). 10.1186/s13023-022-02530-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
57.Illert, A. L. et al. The german network for personalized medicine to enhance patient care and translational research. Nat. Med.29, 1298–1301 (2023). 10.1038/s41591-023-02354-z [DOI] [PubMed] [Google Scholar]
58.Kaplanis, J. et al. Evidence for 28 genetic disorders discovered by combining healthcare and research data. Nature586, 757–762 (2020). 10.1038/s41586-020-2832-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
59.Wright, C. F. et al. Evaluating variants classified as pathogenic in ClinVar in the DDD Study. Genet. Med.23, 571–575 (2021). 10.1038/s41436-020-01021-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
60.Wright, C. F. et al. Making new genetic diagnoses with old data: iterative reanalysis and reporting from genome-wide data in 1,133 families with developmental disorders. Genet. Med.20, 1216–1223 (2018). 10.1038/gim.2017.246 [DOI] [PMC free article] [PubMed] [Google Scholar]
61.MacArthur, D. G. et al. Guidelines for investigating causality of sequence variants in human disease. Nature508, 469–476 (2014). 10.1038/nature13127 [DOI] [PMC free article] [PubMed] [Google Scholar]
62.Gao, Z., Waggoner, D., Stephens, M., Ober, C. & Przeworski, M. An estimate of the average number of recessive lethal mutations carried by humans. Genetics199, 1243–1254 (2015). 10.1534/genetics.114.173351 [DOI] [PMC free article] [PubMed] [Google Scholar]
63.Narasimhan, V. M. et al. Health and population effects of rare gene knockouts in adult humans with related parents. Science352, 474–477 (2016). 10.1126/science.aac8624 [DOI] [PMC free article] [PubMed] [Google Scholar]
64.Chakraborty, R. & Chakravarti, A. On consanguineous marriages and the genetic load. Hum. Genet.36, 47–54 (1977). 10.1007/BF00390435 [DOI] [PubMed] [Google Scholar]
65.La Rocca, L. A. et al. Understanding recessive disease risk in multi-ethnic populations with different degrees of consanguinity. Am. J. Med. Genet. A194, e63452 (2024). 10.1002/ajmg.a.63452 [DOI] [PubMed] [Google Scholar]
66.Antonarakis, S. E. Carrier screening for recessive disorders. Nat. Rev. Genet.20, 549–561 (2019). 10.1038/s41576-019-0134-2 [DOI] [PubMed] [Google Scholar]
67.Kalia, S. S. et al. Recommendations for reporting of secondary findings in clinical exome and genome sequencing, 2016 update (ACMG SF v2.0): a policy statement of the American College of Medical Genetics and Genomics. Genet. Med.19, 249–255 (2017). 10.1038/gim.2016.190 [DOI] [PubMed] [Google Scholar]
68.Rentzsch, P., Schubach, M., Shendure, J. & Kircher, M. CADD-splice-improving genome-wide variant effect prediction using deep learning-derived splice scores. Genome Med.13, 31 (2021). 10.1186/s13073-021-00835-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
69.Peng, C. et al. CADA: phenotype-driven gene prioritization based on a case-enriched knowledge graph. NAR Genom. Bioinform.3, lqab078 (2021). 10.1093/nargab/lqab078 [DOI] [PMC free article] [PubMed] [Google Scholar]
70.Choukair, D. et al. An Integrated clinical pathway for diagnosis, treatment and care of rare diseases: model, operating procedures, and results of the project TRANSLATE-NAMSE funded by the German Federal Joint Committee. Orphanet J. Rare Dis.16, 474 (2021). 10.1186/s13023-021-02092-w [DOI] [PMC free article] [PubMed] [Google Scholar]
71.Vasimuddin, M., Misra, S., Li, H. & Aluru, S. Efficient Architecture-Aware Acceleration of BWA-MEM for Multicore Systems. In 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 314–324 (IPDPS, 2019).
72.Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics25, 1754–1760 (2009). 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
73.DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet.43, 491–498 (2011). 10.1038/ng.806 [DOI] [PMC free article] [PubMed] [Google Scholar]
74.Li, H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics27, 2987–2993 (2011). 10.1093/bioinformatics/btr509 [DOI] [PMC free article] [PubMed] [Google Scholar]
75.Wagner, M. et al. Mitochondrial DNA mutation analysis from exome sequencing—a more holistic approach in diagnostics of suspected mitochondrial disease. J. Inherit. Metab. Dis.42, 909–917 (2019). 10.1002/jimd.12109 [DOI] [PubMed] [Google Scholar]
76.Ye, K. et al. Split-read indel and structural variant calling using PINDEL. Methods Mol. Biol.1833, 95–105 (2018). 10.1007/978-1-4939-8666-8_7 [DOI] [PubMed] [Google Scholar]
77.Plagnol, V. et al. A robust model for read count data in exome sequencing experiments and implications for copy number variant calling. Bioinformatics28, 2747–2754 (2012). 10.1093/bioinformatics/bts526 [DOI] [PMC free article] [PubMed] [Google Scholar]
78.McLaren, W. et al. The ensembl variant effect predictor. Genome Biol.17, 122 (2016). 10.1186/s13059-016-0974-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
79.Jäger, M. et al. Jannovar: a java library for exome annotation. Hum. Mutat.35, 548–555 (2014). 10.1002/humu.22531 [DOI] [PubMed] [Google Scholar]
80.Holtgrewe, M. et al. VarFish: comprehensive DNA variant analysis for diagnostics and research. Nucleic Acids Res.48, W162–W169 (2020). 10.1093/nar/gkaa241 [DOI] [PMC free article] [PubMed] [Google Scholar]
81.Pedersen, B. S. & Quinlan, A. R. Who’s who? Detecting and resolving sample anomalies in human DNA sequencing studies with Peddy. Am. J. Hum. Genet.100, 406–413 (2017). 10.1016/j.ajhg.2017.01.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
82.Pemberton, T. J. et al. Genomic patterns of homozygosity in worldwide human populations. Am. J. Hum. Genet.91, 275–292 (2012). 10.1016/j.ajhg.2012.06.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
83.Wang, S., Haynes, C., Barany, F. & Ott, J. Genome-wide autozygosity mapping in human populations. Genet. Epidemiol.33, 172–180 (2009). 10.1002/gepi.20344 [DOI] [PMC free article] [PubMed] [Google Scholar]
84.Narasimhan, V. et al. BCFtools/RoH: a hidden Markov model approach for detecting autozygosity from next-generation sequencing data. Bioinformatics32, 1749–1751 (2016). 10.1093/bioinformatics/btw044 [DOI] [PMC free article] [PubMed] [Google Scholar]
85.Richards, S. et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med.17, 405–424 (2015). 10.1038/gim.2015.30 [DOI] [PMC free article] [PubMed] [Google Scholar]
86.Philippakis, A. A. et al. The MatchMaker Exchange: a platform for rare disease gene discovery. Hum. Mutat.36, 915–921 (2015). 10.1002/humu.22858 [DOI] [PMC free article] [PubMed] [Google Scholar]
87.Sobreira, N. L. M. et al. MatchMaker Exchange. Curr. Protoc. Hum. Genet.95, 9.31.1–9.31.15 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
88.Karczewski, K. J. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature581, 434–443 (2020). 10.1038/s41586-020-2308-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
89.Samocha, K. E. et al. A framework for the interpretation of de novo mutation in human disease. Nat. Genet.46, 944–950 (2014). 10.1038/ng.3050 [DOI] [PMC free article] [PubMed] [Google Scholar]
90.R Core Team. R: a language and environment for statistical computing. R Projecthttps://www.R-project.org/ (2021).
91.Lieberwirth, J. et al. AutoCaSc: prioritizing candidate genes for neurodevelopmental disorders. Hum. Mutat.43, 1795–1807 (2022). 10.1002/humu.24451 [DOI] [PubMed] [Google Scholar]
92.Strande, N. T. et al. Evaluating the clinical validity of gene–disease associations: an evidence-based framework developed by the Clinical Genome Resource. Am. J. Hum. Genet.100, 895–906 (2017). 10.1016/j.ajhg.2017.04.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
93.Hustinx, A. et al. Improving deep facial phenotyping for ultra-rare disorder verification using model ensembles. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (IEEE, 2023).
94.Schmidt, A. Code used for the analysis of the TRANSLATE-NAMSE data. Zenodo10.5281/zenodo.10964188 (2024).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information^{(6.1MB, pdf)}

Supplementary note and Figs. 1–14 and legends for Supplementary Tables 1–7.

Reporting Summary^{(1.9MB, pdf)}

Peer Review File^{(2.1MB, pdf)}

Supplementary Tables^{(326.6KB, xlsx)}

Supplementary Tables 1–7.

Data Availability Statement

[CR1] 1.Nguengang Wakap, S. et al. Estimating cumulative point prevalence of rare diseases: analysis of the Orphanet database. Eur. J. Hum. Genet.28, 165–173 (2020). 10.1038/s41431-019-0508-0 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR2] 2.Blöß, S. et al. Diagnostic needs for rare diseases and shared prediagnostic phenomena: results of a German-wide expert Delphi survey. PLoS ONE12, e0172532 (2017). 10.1371/journal.pone.0172532 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.Boycott, K. M. et al. International cooperation to enable the diagnosis of all rare genetic diseases. Am. J. Hum. Genet.100, 695–705 (2017). 10.1016/j.ajhg.2017.04.003 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR4] 4.Austin, C. P. et al. Future of rare diseases eesearch 2017–2027: an IRDiRC Perspective. Clin. Transl. Sci.11, 21–27 (2018). 10.1111/cts.12500 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR5] 5.Hochstenbach, R. et al. Array analysis and karyotyping: workflow consequences based on a retrospective study of 36,325 patients with idiopathic developmental delay in the Netherlands. Eur. J. Med. Genet.52, 161–169 (2009). 10.1016/j.ejmg.2009.03.015 [DOI] [PubMed] [Google Scholar]

[CR6] 6.Choi, H. S. et al. Molecular diagnosis of hereditary spherocytosis by multi-gene target sequencing in Korea: matching with osmotic fragility test and presence of spherocyte. Orphanet J. Rare Dis.14, 114 (2019). 10.1186/s13023-019-1070-0 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR7] 7.Kochinke, K. et al. Systematic phenomics analysis deconvolutes genes mutated in intellectual disability into biologically coherent modules. Am. J. Hum. Genet.98, 149–164 (2016). 10.1016/j.ajhg.2015.11.024 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] 8.100,000 Genomes Project Pilot Investigatorset al. 100,000 Genomes pilot on rare-disease diagnosis in health care—preliminary report. N. Engl. J. Med.385, 1868–1880 (2021). 10.1056/NEJMoa2035790 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR9] 9.Rillig, F., Grüters, A., Schramm, C. & Krude, H. The interdisciplinary diagnosis of rare diseases: results of the TRANSLATE-NAMSE project. Dtsch. Arztebl. Int.119, 469–475 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR10] 10.Cao, Y. et al. A clinical survey of mosaic single nucleotide variants in disease-causing genes detected by exome sequencing. Genome Med.11, 48 (2019). 10.1186/s13073-019-0658-2 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR11] 11.Gambin, T. et al. Low-level parental somatic mosaic SNVs in exomes from a large cohort of trios with diverse suspected Mendelian conditions. Genet. Med.22, 1768–1776 (2020). 10.1038/s41436-020-0897-z [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Wright, C. F. et al. Clinically-relevant postzygotic mosaicism in parents and children with developmental disorders in trio exome sequencing data. Nat. Commun.10, 2985 (2019). 10.1038/s41467-019-11059-2 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR13] 13.Landrum, M. J. et al. ClinVar: improvements to accessing data. Nucleic Acids Res.48, D835–D844 (2020). 10.1093/nar/gkz972 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Martin, H. C. et al. Quantifying the contribution of recessive coding variation to developmental disorders. Science362, 1161–1164 (2018). 10.1126/science.aar6731 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Fridman, H. et al. The landscape of autosomal-recessive pathogenic variants in European populations reveals phenotype-specific effects. Am. J. Hum. Genet.108, 608–619 (2021). 10.1016/j.ajhg.2021.03.004 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Hu, H. et al. Genetics of intellectual disability in consanguineous families. Mol. Psychiatry24, 1027–1039 (2019). 10.1038/s41380-017-0012-2 [DOI] [PubMed] [Google Scholar]

[CR17] 17.La Rocca, L. A. et al. Understanding recessive disease risk in multi-ethnic populations with different degrees of consanguinity. Am. J. Med. Genet. A194, e63452 (2024). 10.1002/ajmg.a.63452 [DOI] [PubMed] [Google Scholar]

[CR18] 18.Posey, J. E. et al. Resolution of disease phenotypes resulting from multilocus genomic variation. N. Engl. J. Med.376, 21–31 (2017). 10.1056/NEJMoa1516767 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR19] 19.Mitani, T. et al. High prevalence of multilocus pathogenic variation in neurodevelopmental disorders in the Turkish population. Am. J. Hum. Genet.108, 1981–2005 (2021). 10.1016/j.ajhg.2021.08.009 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Turro, E. et al. Whole-genome sequencing of patients with rare diseases in a national health system. Nature583, 96–102 (2020). 10.1038/s41586-020-2434-2 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR21] 21.Körholz, J. et al. Novel mutation and expanding phenotype in IRF2BP2 deficiency. Rheumatology62, 1699–1705 (2023). 10.1093/rheumatology/keac575 [DOI] [PubMed] [Google Scholar]

[CR22] 22.Mochel, F. et al. Variants in the SK2 channel gene (KCNN2) lead to dominant neurodevelopmental movement disorders. Brain143, 3564–3573 (2020). 10.1093/brain/awaa346 [DOI] [PubMed] [Google Scholar]

[CR23] 23.Magg, T. et al. Heterozygous OAS1 gain-of-function variants cause an autoinflammatory immunodeficiency. Sci. Immunol.6, eabf9564 (2021). 10.1126/sciimmunol.abf9564 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] 24.den Hoed, J. et al. Mutation-specific pathophysiological mechanisms define different neurodevelopmental disorders associated with SATB1 dysfunction. Am. J. Hum. Genet.108, 346–356 (2021). 10.1016/j.ajhg.2021.01.007 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR25] 25.Li, D. et al. Pathogenic variants in SMARCA5, a chromatin remodeler, cause a range of syndromic neurodevelopmental features. Sci. Adv.7, eabf2066 (2021). 10.1126/sciadv.abf2066 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR26] 26.Thaventhiran, J. E. D. et al. Whole-genome sequencing of a sporadic primary immunodeficiency cohort. Nature583, 90–95 (2020). 10.1038/s41586-020-2265-1 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR27] 27.Vogt, G. et al. Biallelic truncating variants in ATP9A cause a novel neurodevelopmental disorder involving postnatal microcephaly and failure to thrive. J. Med. Genet.59, 662–668 (2022). 10.1136/jmedgenet-2021-107843 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR28] 28.Stenton, S. L. et al. Impaired complex I repair causes recessive Leber’s hereditary optic neuropathy. J. Clin. Invest.131, e138267 (2021). 10.1172/JCI138267 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR29] 29.Horn, D. et al. Biallelic truncating variants in MAPKAPK5 cause a new developmental disorder involving neurological, cardiac, and facial anomalies combined with synpolydactyly. Genet. Med.23, 679–688 (2021). 10.1038/s41436-020-01052-2 [DOI] [PubMed] [Google Scholar]

[CR30] 30.Brugger, M. et al. A homozygous truncating variant in CCDC186 in an individual with epileptic encephalopathy. Ann. Clin. Transl. Neurol.8, 278–283 (2021). 10.1002/acn3.51260 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR31] 31.Marafi, D. et al. A reverse genetics and genomics approach to gene paralog function and disease: Myokymia and the juxtaparanode. Am. J. Hum. Genet.109, 1713–1723 (2022). 10.1016/j.ajhg.2022.07.006 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR32] 32.Ebstein, F. et al. PSMC3 proteasome subunit variants are associated with neurodevelopmental delay and type I interferon production. Sci. Transl. Med.15, eabo3189 (2023). 10.1126/scitranslmed.abo3189 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR33] 33.Richard, E. M. et al. Bi-allelic variants in SPATA5L1 lead to intellectual disability, spastic-dystonic cerebral palsy, epilepsy, and hearing loss. Am. J. Hum. Genet.108, 2006–2016 (2021). 10.1016/j.ajhg.2021.08.003 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR34] 34.Liu, Z. et al. Hemizygous variants in protein phosphatase 1 regulatory subunit 3F (PPP1R3F) are associated with a neurodevelopmental disorder characterized by developmental delay, intellectual disability and autistic features. Hum. Mol. Genet.32, 2981–2995 (2023). 10.1093/hmg/ddad124 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR35] 35.Aref-Eshghi, E. et al. Genomic DNA methylation signatures enable concurrent diagnosis and clinical genetic variant classification in neurodevelopmental syndromes. Am. J. Hum. Genet.102, 156–174 (2018). 10.1016/j.ajhg.2017.12.008 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR36] 36.Mirza-Schreiber, N. et al. Blood DNA methylation provides an accurate biomarker of KMT2B-related dystonia and predicts onset. Brain145, 644–654 (2022). 10.1093/brain/awab360 [DOI] [PubMed] [Google Scholar]

[CR37] 37.Cummings, B. B. et al. Improving genetic diagnosis in Mendelian disease with transcriptome sequencing. Sci. Transl. Med.9, eaal5209 (2017). 10.1126/scitranslmed.aal5209 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR38] 38.Murdock, D. R. et al. Transcriptome-directed analysis for Mendelian disease diagnosis overcomes limitations of conventional genomic testing. J. Clin. Invest.131, e141500 (2021). 10.1172/JCI141500 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR39] 39.Frésard, L. et al. Identification of rare-disease genes using blood transcriptome sequencing and large control cohorts. Nat. Med.25, 911–919 (2019). 10.1038/s41591-019-0457-8 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR40] 40.Hsieh, T.-C. et al. GestaltMatcher facilitates rare disease matching using facial phenotype descriptors. Nat. Genet.54, 349–357 (2022). 10.1038/s41588-021-01010-x [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR41] 41.Hsieh, T.-C. et al. PEDIA: prioritization of exome data by image analysis. Genet. Med.21, 2807–2814 (2019). 10.1038/s41436-019-0566-2 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR42] 42.Kircher, M. et al. A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet.46, 310–315 (2014). 10.1038/ng.2892 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR43] 43.Robinson, P. N. et al. Improved exome prioritization of disease genes through cross-species phenotype comparison. Genome Res.24, 340–348 (2014). 10.1101/gr.160325.113 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR44] 44.Li, Q., Zhao, K., Bustamante, C. D., Ma, X. & Wong, W. H. Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis. Genet. Med.21, 2126–2134 (2019). 10.1038/s41436-019-0439-8 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR45] 45.Robinson, P. N. et al. Interpretable clinical genomics with a likelihood ratio paradigm. Am. J. Hum. Genet.107, 403–417 (2020). 10.1016/j.ajhg.2020.06.021 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR46] 46.Birgmeier, J. et al. AMELIE speeds Mendelian diagnosis by matching patient phenotype and genotype to primary literature. Sci. Transl. Med.12, eaau9113 (2020). 10.1126/scitranslmed.aau9113 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR47] 47.Brand, F. et al. Next-generation phenotyping contributing to the identification of a 4.7 kb deletion in KANSL1 causing Koolen-de Vries syndrome. Hum. Mutat.43, 1659–1665 (2022). 10.1002/humu.24467 [DOI] [PubMed] [Google Scholar]

[CR48] 48.Bick, D. et al. An online compendium of treatable genetic disorders. Am. J. Med. Genet. C187, 48–54 (2021). 10.1002/ajmg.c.31874 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR49] 49.Capotondo, A. et al. Safety of arylsulfatase A overexpression for gene therapy of metachromatic leukodystrophy. Hum. Gene Ther.18, 821–836 (2007). 10.1089/hum.2007.048 [DOI] [PubMed] [Google Scholar]

[CR50] 50.Feichtinger, R. G. et al. A spoonful of L-fucose-an efficient therapy for GFUS-CDG, a new glycosylation disorder. EMBO Mol. Med.13, e14332 (2021). 10.15252/emmm.202114332 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR51] 51.Tambuyzer, E. et al. Therapies for rare diseases: therapeutic modalities, progress and challenges ahead. Nat. Rev. Drug Discov.19, 93–111 (2020). 10.1038/s41573-019-0049-9 [DOI] [PubMed] [Google Scholar]

[CR52] 52.Stark, Z. et al. Prospective comparison of the cost-effectiveness of clinical whole-exome sequencing with that of usual care overwhelmingly supports early use and reimbursement. Genet. Med.19, 867–874 (2017). 10.1038/gim.2016.221 [DOI] [PubMed] [Google Scholar]

[CR53] 53.Retterer, K. et al. Clinical application of whole-exome sequencing across clinical indications. Genet. Med.18, 696–704 (2016). 10.1038/gim.2015.148 [DOI] [PubMed] [Google Scholar]

[CR54] 54.Kingsmore, S. F. et al. A randomized, controlled trial of the analytic and diagnostic performance of singleton and trio, rapid genome and exome sequencing in ill infants. Am. J. Hum. Genet.105, 719–733 (2019). 10.1016/j.ajhg.2019.08.009 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR55] 55.Benito-Lozano, J. et al. Diagnostic process in rare diseases: determinants associated with diagnostic delay. Int. J. Environ. Res. Public Health19, 6456 (2022). 10.3390/ijerph19116456 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR56] 56.Benito-Lozano, J., López-Villalba, B., Arias-Merino, G., Posada de la Paz, M. & Alonso-Ferreira, V. Diagnostic delay in rare diseases: data from the Spanish rare diseases patient registry. Orphanet J. Rare Dis.17, 418 (2022). 10.1186/s13023-022-02530-3 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR57] 57.Illert, A. L. et al. The german network for personalized medicine to enhance patient care and translational research. Nat. Med.29, 1298–1301 (2023). 10.1038/s41591-023-02354-z [DOI] [PubMed] [Google Scholar]

[CR58] 58.Kaplanis, J. et al. Evidence for 28 genetic disorders discovered by combining healthcare and research data. Nature586, 757–762 (2020). 10.1038/s41586-020-2832-5 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR59] 59.Wright, C. F. et al. Evaluating variants classified as pathogenic in ClinVar in the DDD Study. Genet. Med.23, 571–575 (2021). 10.1038/s41436-020-01021-9 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR60] 60.Wright, C. F. et al. Making new genetic diagnoses with old data: iterative reanalysis and reporting from genome-wide data in 1,133 families with developmental disorders. Genet. Med.20, 1216–1223 (2018). 10.1038/gim.2017.246 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR61] 61.MacArthur, D. G. et al. Guidelines for investigating causality of sequence variants in human disease. Nature508, 469–476 (2014). 10.1038/nature13127 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR62] 62.Gao, Z., Waggoner, D., Stephens, M., Ober, C. & Przeworski, M. An estimate of the average number of recessive lethal mutations carried by humans. Genetics199, 1243–1254 (2015). 10.1534/genetics.114.173351 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR63] 63.Narasimhan, V. M. et al. Health and population effects of rare gene knockouts in adult humans with related parents. Science352, 474–477 (2016). 10.1126/science.aac8624 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR64] 64.Chakraborty, R. & Chakravarti, A. On consanguineous marriages and the genetic load. Hum. Genet.36, 47–54 (1977). 10.1007/BF00390435 [DOI] [PubMed] [Google Scholar]

[CR65] 65.La Rocca, L. A. et al. Understanding recessive disease risk in multi-ethnic populations with different degrees of consanguinity. Am. J. Med. Genet. A194, e63452 (2024). 10.1002/ajmg.a.63452 [DOI] [PubMed] [Google Scholar]

[CR66] 66.Antonarakis, S. E. Carrier screening for recessive disorders. Nat. Rev. Genet.20, 549–561 (2019). 10.1038/s41576-019-0134-2 [DOI] [PubMed] [Google Scholar]

[CR67] 67.Kalia, S. S. et al. Recommendations for reporting of secondary findings in clinical exome and genome sequencing, 2016 update (ACMG SF v2.0): a policy statement of the American College of Medical Genetics and Genomics. Genet. Med.19, 249–255 (2017). 10.1038/gim.2016.190 [DOI] [PubMed] [Google Scholar]

[CR68] 68.Rentzsch, P., Schubach, M., Shendure, J. & Kircher, M. CADD-splice-improving genome-wide variant effect prediction using deep learning-derived splice scores. Genome Med.13, 31 (2021). 10.1186/s13073-021-00835-9 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR69] 69.Peng, C. et al. CADA: phenotype-driven gene prioritization based on a case-enriched knowledge graph. NAR Genom. Bioinform.3, lqab078 (2021). 10.1093/nargab/lqab078 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR70] 70.Choukair, D. et al. An Integrated clinical pathway for diagnosis, treatment and care of rare diseases: model, operating procedures, and results of the project TRANSLATE-NAMSE funded by the German Federal Joint Committee. Orphanet J. Rare Dis.16, 474 (2021). 10.1186/s13023-021-02092-w [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR71] 71.Vasimuddin, M., Misra, S., Li, H. & Aluru, S. Efficient Architecture-Aware Acceleration of BWA-MEM for Multicore Systems. In 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 314–324 (IPDPS, 2019).

[CR72] 72.Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics25, 1754–1760 (2009). 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR73] 73.DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet.43, 491–498 (2011). 10.1038/ng.806 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR74] 74.Li, H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics27, 2987–2993 (2011). 10.1093/bioinformatics/btr509 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR75] 75.Wagner, M. et al. Mitochondrial DNA mutation analysis from exome sequencing—a more holistic approach in diagnostics of suspected mitochondrial disease. J. Inherit. Metab. Dis.42, 909–917 (2019). 10.1002/jimd.12109 [DOI] [PubMed] [Google Scholar]

[CR76] 76.Ye, K. et al. Split-read indel and structural variant calling using PINDEL. Methods Mol. Biol.1833, 95–105 (2018). 10.1007/978-1-4939-8666-8_7 [DOI] [PubMed] [Google Scholar]

[CR77] 77.Plagnol, V. et al. A robust model for read count data in exome sequencing experiments and implications for copy number variant calling. Bioinformatics28, 2747–2754 (2012). 10.1093/bioinformatics/bts526 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR78] 78.McLaren, W. et al. The ensembl variant effect predictor. Genome Biol.17, 122 (2016). 10.1186/s13059-016-0974-4 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR79] 79.Jäger, M. et al. Jannovar: a java library for exome annotation. Hum. Mutat.35, 548–555 (2014). 10.1002/humu.22531 [DOI] [PubMed] [Google Scholar]

[CR80] 80.Holtgrewe, M. et al. VarFish: comprehensive DNA variant analysis for diagnostics and research. Nucleic Acids Res.48, W162–W169 (2020). 10.1093/nar/gkaa241 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR81] 81.Pedersen, B. S. & Quinlan, A. R. Who’s who? Detecting and resolving sample anomalies in human DNA sequencing studies with Peddy. Am. J. Hum. Genet.100, 406–413 (2017). 10.1016/j.ajhg.2017.01.017 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR82] 82.Pemberton, T. J. et al. Genomic patterns of homozygosity in worldwide human populations. Am. J. Hum. Genet.91, 275–292 (2012). 10.1016/j.ajhg.2012.06.014 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR83] 83.Wang, S., Haynes, C., Barany, F. & Ott, J. Genome-wide autozygosity mapping in human populations. Genet. Epidemiol.33, 172–180 (2009). 10.1002/gepi.20344 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR84] 84.Narasimhan, V. et al. BCFtools/RoH: a hidden Markov model approach for detecting autozygosity from next-generation sequencing data. Bioinformatics32, 1749–1751 (2016). 10.1093/bioinformatics/btw044 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR85] 85.Richards, S. et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med.17, 405–424 (2015). 10.1038/gim.2015.30 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR86] 86.Philippakis, A. A. et al. The MatchMaker Exchange: a platform for rare disease gene discovery. Hum. Mutat.36, 915–921 (2015). 10.1002/humu.22858 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR87] 87.Sobreira, N. L. M. et al. MatchMaker Exchange. Curr. Protoc. Hum. Genet.95, 9.31.1–9.31.15 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR88] 88.Karczewski, K. J. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature581, 434–443 (2020). 10.1038/s41586-020-2308-7 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR89] 89.Samocha, K. E. et al. A framework for the interpretation of de novo mutation in human disease. Nat. Genet.46, 944–950 (2014). 10.1038/ng.3050 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR90] 90.R Core Team. R: a language and environment for statistical computing. R Projecthttps://www.R-project.org/ (2021).

[CR91] 91.Lieberwirth, J. et al. AutoCaSc: prioritizing candidate genes for neurodevelopmental disorders. Hum. Mutat.43, 1795–1807 (2022). 10.1002/humu.24451 [DOI] [PubMed] [Google Scholar]

[CR92] 92.Strande, N. T. et al. Evaluating the clinical validity of gene–disease associations: an evidence-based framework developed by the Clinical Genome Resource. Am. J. Hum. Genet.100, 895–906 (2017). 10.1016/j.ajhg.2017.04.015 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR93] 93.Hustinx, A. et al. Improving deep facial phenotyping for ultra-rare disorder verification using model ensembles. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (IEEE, 2023).

[CR94] 94.Schmidt, A. Code used for the analysis of the TRANSLATE-NAMSE data. Zenodo10.5281/zenodo.10964188 (2024).

PERMALINK

Next-generation phenotyping integrated in a national framework for patients with ultrarare disorders improves genetic diagnostics and yields new molecular findings

Axel Schmidt

Magdalena Danyel

Kathrin Grundmann

Theresa Brunet

Hannah Klinkhammer

Tzung-Chien Hsieh

Hartmut Engels

Sophia Peters

Alexej Knaus

Shahida Moosa

Luisa Averdunk

Felix Boschann

Henrike Lisa Sczakiel

Sarina Schwartzmann

Martin Atta Mensah

Jean Tori Pantel

Manuel Holtgrewe

Annemarie Bösch

Claudia Weiß

Natalie Weinhold

Aude-Annick Suter

Corinna Stoltenburg

Julia Neugebauer

Tillmann Kallinich

Angela M Kaindl

Susanne Holzhauer

Christoph Bührer

Philip Bufler

Uwe Kornak

Claus-Eric Ott

Markus Schülke

Hoa Huu Phuc Nguyen

Sabine Hoffjan

Corinna Grasemann

Tobias Rothoeft

Folke Brinkmann

Nora Matar

Sugirthan Sivalingam

Claudia Perne

Elisabeth Mangold

Martina Kreiss

Kirsten Cremer

Regina C Betz

Martin Mücke

Lorenz Grigull

Thomas Klockgether

Isabel Spier

André Heimbach

Tim Bender

Fabian Brand

Christiane Stieber

Alexandra Marzena Morawiec

Pantelis Karakostas

Valentin S Schäfer

Sarah Bernsen

Patrick Weydt

Sergio Castro-Gomez

Ahmad Aziz

Marcus Grobe-Einsler

Okka Kimmich

Xenia Kobeleva

Demet Önder

Hellen Lesmann

Sheetal Kumar

Pawel Tacik

Meghna Ahuja Basin

Pietro Incardona

Min Ae Lee-Kirsch

Reinhard Berner

Catharina Schuetz

Julia Körholz

Tanita Kretschmer

Nataliya Di Donato

Evelin Schröck

André Heinen

Ulrike Reuner

Amalia-Mihaela Hanßke

Frank J Kaiser