Abstract
Background:
Early-onset scoliosis (EOS), defined by an onset age of scoliosis less than 10 years, conveys significant health risk to affected children. Identification of the molecular etiology underlying EOS patients could provide valuable information for both clinical management and prenatal screening.
Methods:
In this study, we consecutively recruited a cohort of 447 Chinese patients with operative EOS. We performed exome sequencing (ES) screening on these individuals and their available family members (totaling 670 subjects). Another cohort of 13 idiopathic EOS patients from the United States (US) who underwent ES was also recruited.
Results:
After ES data processing and variant interpretation, we detected molecular diagnostic variants in 92 out of 447 (20.6%) Chinese EOS patients, including 8 patients with molecular confirmation of their clinical diagnosis and 84 patients with molecular diagnosis of previously unrecognized diseases underlying scoliosis. One out of thirteen idiopathic EOS patients from the US cohort was molecularly diagnosed. The age at presentation, the number of organ systems involved, and the Cobb angle were the three top features predictive of a molecular diagnosis.
Conclusion:
ES enabled the molecular diagnosis/classification of patients with EOS. Specific clinical features/feature pairs are able to indicate the likelihood of gaining a molecular diagnosis through ES.
Keywords: Exome sequencing (ES), Early-onset scoliosis (EOS), Molecular diagnosis, Molecular classification
Introduction
Early-onset scoliosis (EOS), defined as clinical presentation of scoliosis before 10 years of age, can be associated with progressive pulmonary compromise and severe dysmorphic gross skeletal appearance if left untreated. The deformed spine results in impaired pulmonary function and impairment or inability to perform activities of daily life, which is an extremely debilitating type of disease in the pediatric age group [1].
The etiology of EOS includes congenital scoliosis (CS) due to structural spine defects, neuromuscular scoliosis (NMS), syndromic condition, and idiopathic EOS (IS) [2 3]. However, it is difficult to provide a definite etiological classification or diagnosis solely based on clinical evaluation, especially for syndromic EOS with variable expressivity, such as Ehlers–Danlos syndrome [4] and neurofibromatosis [5], and other neuromuscular conditions like Charcot-Marie-Tooth disease that can have an age-dependent penetrance[6]. Identification of the molecular etiology underlying EOS patients could provide valuable information for both clinical management and prenatal screening.
In previous studies, we identified that TBX6 compound variants, a rare loss-of-function (LoF) allele and a common variant noncoding haplotype, could explain around 9.6% of CS cases [7–9]. In addition to TBX6, variants in DDR2 [10] (Spondylometaepiphyseal dysplasia, MIM: 271665), FLNB [11] (Spondylocarpotarsal synostosis syndrome, MIM: 272460), RUNX2 [12] (Cleidocranial dysplasia, MIM:119600) and numerous other genes are thus far known to cause early-onset scoliotic phenotypes. Moreover, for other diseases and loci, only certain alleles or genotypes may be more likely to cause the EOS phenotype (e.g. homozygous CMT1A duplication and heterozygous CMT1A triplication)[6].
In this study, we aim to investigate the diagnostic performance of ES in EOS patients, and to dissect the genetic architecture of the clinical entity of EOS. We report the molecular diagnostic rate by ES among two cohorts: 1) a Chinese cohort ascertained and studies in Beijing and consisting of 447 clinically diagnosed EOS patients (encompassing 670 subjects) who underwent orthopedic surgery and 2) a Texas (United States) cohort that consisted of 13 patients with idiopathic EOS. The potential association between clinical phenotypic features and molecular diagnostic status of the patients was also investigated.
Methods
Cohort collections
Chinese cohort-
In the Chinese cohort, we consecutively recruited 447 EOS patients of Chinese Han ethnicity who underwent spinal surgery at Peking Union Medical College Hospital (PUMCH) from 2009 to 2016, as a part of the Deciphering Disorders Involving Scoliosis and COmorbidities (DISCO) study (http://www.discostudy.org/). Samples on unrelated probands and closest relatives (kinships) for 105 patients were available, including six parents (male and female) also affected by scoliosis, presenting a potential dominant disease trait pattern (Figure S1). The remaining patients/research subjects, who denied a family history, were counted as sporadic cases. Physical examination, X-ray, computed tomography (CT), and magnetic resonance imaging (MRI) were performed on each patient to give a prior clinical diagnosis [2]: (1) CS (CS I, vertebral malformations, CS II, segmentation defects, CS III, mixed type); (2) NMS; (3) Syndromic EOS; and (4) IS (Supplementary Methods).
United States Texas cohort –
A US population-based cohort of 13 patients specifically diagnosed as idiopathic EOS (either operated or non-operated) from Texas Scottish Rite Hospital for Children in Dallas (TSRH) was also recruited [13]. Clinical details for each subject are provided in Table S1.
Exome sequencing, data processing and analysis
Chinese cohort-
For the Chinese cohort, ES was performed on DNA extracted from peripheral blood from 342 assumed sporadic singleton cases and 105 cases along with their family members, totaling 670 subjects (Table S2). The detailed sequencing process and in-house developed Peking Union Medical College Hospital Pipeline (PUMP) [14] are described in the Supplementary Methods.
US Texas cohort-
For the US cohort, ES of 42 samples from 13 EOS families were performed at the Human Genome Sequencing Center (HGSC) at Baylor College of Medicine through the Baylor-Hopkins Center for Mendelian Genomics initiative. Detailed sequencing methods are described in the Supplementary Methods.
Variant interpretation
For both cohorts, interpretation of single nucleotide variants (SNVs) and insertions/deletions (indel variant alleles) was adapted from the American College of Medical Genetics and Genomics (ACMG) guidelines [15] (Supplementary Methods) and demonstrated in Figure S2. Variant pathogenicity, expected mode of disease inheritance (e.g. autosomal dominant [AD] for monoallelic and autosomal recessive [AR] for bi-allelic variants), and patient phenotype were taken into consideration. Potential clinically actionable secondary findings from 59 ACMG-recommended genes were analyzed [16].
Copy number variant (CNV) allele calling from ES data
For both cohorts, CNVs were computed from ES data using two software packages: XHMM [17] and Copy Number Inference from Exome Reads (CoNIFER) [18]. Detailed calling and analysis methods are described in Supplementary Methods and Figure S3. Identified Pathogenic CNVs were validated using comparative genomic hybridization microarray (aCGH), as described in Supplementary Methods.
Statistics
Phenotypic analysis among different diagnostic groups was performed based on the random forest algorithm [19] (Supplementary Methods). Mean comparison of relevant features was conducted using the Student’s t-test or Wilcoxon Signed Rank Test.
Results
Diagnostic findings in the Chinese cohort of operative EOS
In the Chinese cohort, 447 unrelated EOS patients of Chinese Han ethnicity were recruited, including 342 singletons and 105 cases with first-degree relative samples (Table 1). Six families have more than one generation affected by scoliosis, presenting a potential dominant AD or XL) pattern (Figure S1). Among the recruited patients, most have congenital (i.e. structural) scoliosis; NMS, syndromic conditions, and presumed IS are less common. The average age at presentation is 3.1y, and 217/447 (48.5%) are male (Table 1).
Table 1.
Constitution and overall molecular diagnostic yield of two cohorts
| Chinese cohort | US cohort | |||||
|---|---|---|---|---|---|---|
| Subgroup | CS | NMS | Syndromic | IS | Overall | IS |
| No. patients | 424 | 5 | 8 | 10 | 447 | 13 |
| Age of onset, mean | 3.1y | 3.2y | 2.8ỳ | 3.1y | 3.1y | 1.5y |
| Male (%) | 202/424 (47.6%) | 2/5 | 6/8 | 6/10 | 217/447 (48.5%) | 7/13 |
| TACS rate (%) | 41/424 (9.7%) | 0 | 0 | 0 | 41/447 (9.2%) | 0 |
| SNV/indel positive rate (%) | 35/424 (8.2%) | 3/5 | 7/8 | 2/10 | 47/447 (10.5%) | 1 |
| CNV positive rate (%) | 4/424 (0.9%) | 0 | 1/7 | 0 | 5/447 (1.1%) | 0 |
| Overall diagnostic rate (%) | 79/424 (18.6%) | 3/5 | 8/8 | 2/10 | 92/447 (20.6%) | 1/13 |
Abbreviations: CS, congenital scoliosis; NMS, neuromuscular scoliosis; NFS, neurofibromatosis; ACH, achondroplasia; IS, idiopathic scoliosis; TACS, TBX6-associated scoliosis; SNV, single nucleotide variant; CNV, copy number variation.
After ES data processing and variant interpretation, we detected molecular diagnostic variants in 92/447 (20.6%) Chinese EOS patients (Table 1), encompassing 33 disease-causing genes and 5 genomic regions (Table 2).
Table 2.
Genetic architecture and disease trait of EOS revealed by exome sequencing.
| Disease trait | CS | NMS | Syndromic EOS | IS | Overall | ||
|---|---|---|---|---|---|---|---|
| CS I | CS II | CS III | |||||
| Autosomal dominant | RUNX2 (2)*, JAG1 (1), FLNB (1), COMP (1) | RYR1 (2)*, MYH3 (1), POGZ (1), FBN1 (1), SOX9 (1), MYH7 (1) | TCF12 (1), PTPN11 (1), COL11A1 (1), SUFU (1), MYH3 (1), CELSR1 (1), TRPV4 (2), MMP13 (1), COL5A2 (1), BMP2 (1), CHD7 (2)* | LMNA (1), RYR1 (1) | NF1 (3)*, FGFR3 (3)* | COL5A2 (1), COMP (1) | 34 |
| Autosomal recessive | PLOD1 (1), DDR2 (1), DCHS1 (1) | FLNB (1), B3GALT6 (1) | FLNB (1), HERC1 (1), COL27A1 (1) | TTN (1) | PLOD1 (1) | 0 | 10 |
| X-linked | FLNA (1), EBP (1) | 0 | ZC4H2 (1) | 0 | 0 | 0 | 3 |
Abbreviations: CS I, congenital scoliosis type I, vertebral malformations; CS II, congenital scoliosis type II, segmentation defects); CS III, congenital scoliosis type II, mixed type); NMS, neuromuscular scoliosis; NFS, neurofibromatosis; ACH, achondroplasia; IS, idiopathic scoliosis. Bold: de novo changes.
One of two RUNX2 variants arose de novo; one of two CHD7 variants arose de novo, one of two RYR1 variants arose de novo, two of three NF1 variants arose de novo, two of three FGFR3 variants arose de novo.
In the CS group, 79/424 (18.6%) patients had identified a molecular diagnosis and potential molecular etiology, including 41 (9.7%) patients with TBX6-associated congenital scoliosis (TACS) and 38 (9.0%) patients with other pathogenic variant alleles/CNVs (Table 1). The molecular diagnostic rate was higher in the CS I (45/130, 34.6%) patient population than that of either CS II (7/41, 17%) or CS III (27/253 ,10.7%), due to the fact that TACS is predominantly observed in CS I cases (35/130, 26.9%) (Table S3). For NMS, pathogenic variants in TTN, LMNA, and RYR1 were found in three of five patients. As anticipated, eight patients with clinically diagnosed syndromic EOS, including four patients diagnosed with neurofibromatosis type 1 (NF1) three patients diagnosed with achondroplasia (ACH) and one patient with Ehlers-Danlos syndrome (EDS), were all solved by the identification of a specific variant allele in the given disease genes (NF1, FGFR3 and PLOD1) which could explain their complete phenotype (Table 2). Interestingly, one of the NF1 patients was found to have a 573 kb deletion CNV spanning the NF1 gene (Table 1), which would not be detected using a single-gene test and might be missed by clinical exome sequencing (cES) that can sometimes focus exclusively on SNV alleles. In the IS subgroup, causative variants in 2 out of 10 patients were detected (Table 1).
TBX6-associated scoliosis (TACS)
TACS, defined by a combination of a heterozygous 16p11.2 deletion/TBX6 null variant and a hypomorph allele in trans [7 8], accounts for 41/422 (9.7%) CS patients in this study, which is consistent with the result of our previous study [7–9] and recent studies of multiple cohorts [20 21].
Of the 41 patients with TACS, 31 harbored a 16p11.2 deletion CNV identified by computational CNV analysis and validated by additional experiments (Table S4), while the other 10 were carrying TBX6 truncating alleles (nonsense, frameshift, or canonical splicing site variants) (Table S4). All 41 patients carried the T-C-A (rs2289292, rs3809624, rs3809627) haplotype (frequency = 44.4%, Chinese Han population) in trans with the 16p11.2 deletion or null allele (Table S4), consistent with our previous findings regarding the compound inheritance of TACS [7–9].
From the clinical phenotypic perspective, all (41/41) of the patients presented with hemivertebra or butterfly vertebra at the lower half of the spine, which are in accordance with the reported clinical phenotypic characteristics of TACS and the animal model with Tbx6 compound inheritance [7 8 22].
Pathogenic SNVs/indels identified by ES
In addition to the contribution of TACS to diagnostics, 36 pathogenic and 21 likely pathogenic SNVs/indels in 33 genes were identified in 47 patients in the Chinese cohort (Table S5).
Variant types include missense (32/57), nonsense (6/57), frameshift (12/57), in-frame-indel (1/57) and splice site (6/57) alleles. Disease traits in the 47 patients included autosomal dominant (34/47), autosomal recessive (10/47), and X-linked (3/47) (Table 2, Figure S5). Of the 34 monoallelic autosomal dominant conditions, 12 variants arose as de novo variants as confirmed by Sanger sequencing of family trios (Figure S6); 8 variants were inherited, including 5 inherited from affected parents and 3 from unaffected asymptomatic parents without apparent parental mosaicism, implicating incomplete penetrance. This conclusion could not be made for sure due to the lack of radiographs from parents; the remaining 16 were identified in proband-only cases with unknown inheritance. In addition, dual diagnoses resulting in blended phenotypes [23] were found in 2/93 (2%) patients with pathogenic variants and molecular diagnostic findings (Table S8).
Of the 34 disease-causing genes, only 11 genes (TBX6, NF1, RUNX2, RYR1, COL5A2, PLOD1, MYH3, TRPV4, FGFR3, FLNB, CHD7) were implicated in more than one unrelated patient (Table 2), suggesting that the genetic heterogeneity underlying EOS is rather substantial. Critical biological processes such as skeletal system development, extracellular matrix organization, and ossification were identified by Gene Ontology enrichment from analysis of the 34 causative genes (Figure S7).
ES-based CNV analysis
ES-based CNV prediction was performed by computational tools and confirmed by array comparative genomic hybridization (aCGH, Supplementary Methods). Besides 16p11.2 deletions, pathogenic CNVs were identified in five patients, including two 16p13.1 duplications, one 22q11.2 deletion, one 5q35.3 deletion, and one 17q11.2 deletion (Table S6, Figure S8). NF1 likely acts as a key driver gene within the 17q11.2 deletion CNV, given the type 1 neurofibromatosis phenotype in this patient (XH821). XH821 had cafe-au-lait spots at birth, left inguinal hernia, and mild intellectual disability, a more severe phenotype compared with the other three patients with type 1 neurofibromatosis solved by NF1 single nucleotide variant alleles (Table S6). The patient with the 5q35.3 deletion, the leading cause of Sotos syndrome-1 (MIM: 117550) in reported Japanese patients [24], had congenital heart disease, macrocephaly and rapid growth, consistent with the driver gene being NSD1 (Table S6). We are unable to localize the causative gene(s) for 16p13.1 duplication and 22q11.2 deletion, but their pathogenic roles and causal relationship with scoliosis are supported by other cases from the Database of Chromosomal Imbalance and Phenotype in Humans using Ensembl Resources (DECIPHER) [25].
Clinically relevant secondary findings
Secondary findings involving the 59 ACMG-recommended genes were identified in 9 patients [16]. Relevant variants were identified in 7 genes, including BRCA1, BRCA2, APC, RET, TSC2, COL3A1, and LDLR (Table S7). Notably, RYR1 is one of the 59 ACMG-recommended genes for secondary findings, but in the present cohort was identified as a primary finding (i.e. related to the scoliosis phenotype) in three cases (Table S5). In addition, one patient (XH984) with hyperglycemia was found to carry a pathogenic GCK variant that could lead to Glucokinase-maturity-onset diabetes of the young (GCK-MODY, MIM: 125851) (Table S7).
Diagnostic yield in the US cohort of IS
For the US cohort, one (non-Hispanic white) of the thirteen patients received a molecular diagnosis of Sotos syndrome (MIM: 117550), caused by a de novo heterozygous frameshift variant: c.2051_2055TAAAGdel in NSD1 (Figure S4). This patient presented premature birth, mild developmental delay, atrial septal defect (ASD) and ventricular septal defect (VSD). Interestingly, one patient in the Chinese cohort who was firstly diagnosed with CS was also found to carry a 5q35 deletion encompassing NSD1, thus molecularly diagnosed with Sotos syndrome.
Molecular classification of EOS patients
Through identification of the pathogenic variants/CNVs and their corresponding Mendelian diseases in patients with a primary clinical diagnosis of CS/NMS/IS, 84/447 (18.8%) patients from the Chinese cohort (Figure 1) and 1/13 patients from the US IS cohort were molecularly classified as either TACS or a syndromic form of EOS. The secondary identification of TACS/syndromic EOS by molecular diagnosis indicates the efficacy of genetic testing when patients only exhibit mild phenotypes rather than the whole phenotypic spectrum of specific Mendelian syndromes, and when regular methods to ascertain a diagnosis have been exhausted. Our results also exemplify the cohort-level changes in EOS classification enabled by ES.
Figure 1. Molecular classification of early-onset scoliosis after exome sequencing.

Molecular classification for the 92 patients with molecular diagnosis in the Chinese cohort after exome sequencing. Abbreviations: ES, exome sequencing; IS idiopathic scoliosis; TACS: TBX6-associated congenital scoliosis; NMS, neuromuscular scoliosis; CS, congenital scoliosis.
Clinical features of cases with different diagnostic status/categories
In an effort to explore the potential medical indications for genetic testing in EOS patients, we selected 12 clinical features of significance (Supplementary Methods). We hypothesized that the observed pattern of phenotypic features might be used to identify individuals more likely to benefit from genomic testing. All patients were divided into three groups according to their diagnostic status and categories: the group without a current molecular diagnosis, the TACS group, and the syndromic EOS group. The ability of the 12 clinical features mentioned above to distinguish these three patient groups was analyzed using the random forest algorithm (Supplementary Methods). A mean reciprocal rank (MRR, 0–1) was determined for each clinical feature, demonstrating the degree of relevance between that feature and each of the patient groups (the more relevant, the higher the rank score) (Table S9).
As a result, the age of presentation (MRR = 0.53), the number of organ systems involved (MRR = 0.50), and the Cobb angle (MRR = 0.49) were ranked as the most distinct features among the three groups. By independently analyzing each of the three features, we found that TACS patients are associated with younger age at presentation (1.56±2.17 y, P = 0.001, Wilcoxon Signed Rank Test), involvement of fewer organ systems (0.07±0.26, P = 0.002, Wilcoxon Signed Rank Test), and smaller Cobb angle (51.41±19.85°, P = 0.032, Student’s t-test) than patients without a molecular diagnosis (Table S10, Figure 2A/2B/2C). Whereas, patients with syndromic EOS are associated with a younger age at presentation (2.50±2.52, P = 0.038, Wilcoxon Signed Rank Test) and involvement of more organ systems (0.54±0.68, P = 0.026, Wilcoxon Signed Rank Test) than the undiagnosed group (Table S10, Figure 2A/2B).
Figure 2. Phenotypic analysis of patients diagnosed by TBX6 or other genes.

(A-C) The distribution of three clinical features: number of organ systems involved, age of presentation and Cobb angle in three diagnostic groups. Y-axis indicates the probability density. Gray curve/column indicates undiagnosed patient group (n = 355); Orange curve/column indicates TBX6-associated scoliosis (TACS) patient group (n = 41); Blue curve/column indicates the syndromic EOS group (n = 51).
Discussion
Hereby, we provide evidence of the important utility of genetic testing in molecularly diagnosing and classifying a large cohort of EOS patients. By performing ES on Chinese operative EOS patients (N=447), followed by rare variant family-based genomics, variant interpretation and computational CNV analysis, a molecular diagnostic rate of 20.6% was achieved. As the major subgroup of EOS, CS patients received a diagnostic rate of 18.6%, which is higher compared to a recent genetic study on CS using both ES (2/28, 8%) and panel-based sequencing of 5 genes (10/73, 13.7%) [26]. The increased diagnostic rate observed in CS patients by ES is possibly due to the more expansive gene coverage in the exome analysis and the utility of computational CNV analysis. The positive rate for neuromuscular EOS (3/5) is consistent with a previous study on a cohort of neuromuscular disease (18/38, 47.4%) [27]. As anticipated, ES confirmed the diagnosis of syndromic EOS in 8/8 patients. In our study, we also identified the molecular basis of idiopathic EOS in 3/23 patients from both the Chinese and the US cohorts, which demonstrated the power of ES in cases that have eluded a clinical diagnosis.
One of the main obstacles for seeking a molecular diagnosis in EOS patients is the heterogeneous genetic predisposition within this broad clinical description. Indeed, 93 molecularly diagnosed patients were explained by variants in 34 genes and five distinct CNV regions (Table 2/Table S6). Because our study is driven by a core clinical endophenotype of EOS, patients in our cohort may arguably tend toward the milder end of the clinical spectrum of each molecularly diagnosed syndrome. Strong genetic heterogeneity and limited clinical phenotype expressivity brought challenges to locating the disease-causing genes and characterizing gene-specific phenotypes in the patients.
In addition to the molecular diagnosis made for individuals, ES on a large sample size in our study also provided insights into the genetic and genomic architecture of EOS. Several highly relevant biological processes were identified (Figure S7), which are instructive for both clinical ES analysis and future discovery of novel disease-associated genes.
By phenotype analysis, we identified that the Cobb angle, the number of multi-systemic defects, and the age of onset as the three top associated clinical phenotypic features that directly correlate with the likelihood of identifying a genetic etiology for CS. Thus, these may help to stratify which EOS patients are suggested to receive genetic testing and what testing should be done. Patients with younger age at presentation, fewer multi-systemic defects, and a smaller Cobb angle are more likely to be affected with TBX6 compound mutations, i.e., TACS [7–9 22]. For these patients, an aCGH analysis or clinical chromosomal microarray (CMA) CNV analysis supplemented by targeted next-generation sequencing (NGS) would be beneficial and cost-effective. In contrast, patients with younger age at presentation and more multi-systemic defects are more likely to have a molecular diagnosis in genes other than TBX6. Given the genetic heterogeneity in this patient group, exome or genome sequencing would be the first-line diagnostic tool for patients with more complex phenotypes.
The identification of a molecular diagnosis for certain genetic disorders from this study could inform clinical management and provide new clinical insights. Three pathogenic RYR1 variants were identified in three EOS patients (Table S5), resulting in a molecular diagnosis of central core disease (MIM: 117000), which increased their potential risk of developing malignant hyperthermia (MH) upon general anesthesia [28]. One of the three patients (XH696) indeed developed malignant hyperthermia during his surgery, and fortunately survived after immediate clinical response to dantrolene salvage. Under such a condition, the risk of MH could be readily avoided by alternative anesthesia if the molecular diagnosis is revealed prior to the surgery [29].
Even in patients with clinically diagnosed syndromic disorders, such as ACH and NF1, the exact molecular diagnosis, including presumed pathogenic variant allele contribution to the patient’s disease process that was revealed by ES could inform clinical management. Of the four patients with NF1, three were molecularly diagnosed with single nucleotide variants (SNV) alleles in the NF1 gene and the other was found to carry a 573 kb genomic deletion CNV spanning NF1; different alleles with potentially different clinical consequences. NF1 caused by 17q11.2 deletion is associated with early-onset of neurofibromatosis[30] accompanied with congenital heart malformation and cognitive dysfunction[31], i.e., a more severe clinical phenotype is potentially anticipated than that caused by NF1 variant alleles that were SNV mutations. Therefore, the precise molecular diagnosis has important management implications and potentially lifelong clinical (cardiovascular, skeletal, and malignancy) surveillance issues for this patient with NF. Moreover, the variant information may help contribute to family management and recurrence risk information that they seek. Clinical experience suggests up to half of NF cases may be due to new mutations. Moreover, the clinically observed phenotype of ‘segmental NF’ may potentially reflect mosaic mutations[32].
There are several limitations of our study. First, we enrolled more CS cases in the Chinese cohort, this patient selection hinders us from exploring the real diagnostic rate of other forms in EOS patients. Secondly, the use of multiple capture and sequencing platforms for ES resulted in different capture regions and exome coverage across the cohort, which might affect the identification of some small fraction of pathogenic SNVs and CNVs. Due to the small number of individual genetic loci (34 genes of about 20,000 interrogated by ES) involved in the molecular diagnoses, subtle differences in capture design, genomic sequencing or sequence analytical pipelines are not likely to affect our molecular diagnostic rate. Nevertheless, due to the small number of molecularly diagnosed Mendelian conditions by each specific gene, we are unable to perform gene-based phenotypic analysis, and thus cannot explain the mechanism of phenotypic characteristics identified from molecularly diagnosed patients.
To conclude, ES enabled the molecular classification of EOS in 84 out of 447 (18.8%) patients from the Chinese cohort and 1/13 patients from the US cohort. Specific clinical features/feature pairs are able to indicate the likelihood of gaining a molecular diagnosis through ES.
Supplementary Material
Acknowledgement
We appreciate all of the patients, their families, and clinical research coordinators who participated in this project. We thank GeneSeeq Inc. for exome sequencing technical support.
Funding
This research was funded in part by the National Natural Science Foundation of China (81822030 to N.W., 81930068 and 81772299 to Z.W., 81672123 and 81972037 to J.Z., 31625015, 31571297 and 31771396 to F.Z., 81871746 to Y.W. and 81772301 to G.Q.), Beijing Natural Science Foundation (7191007 to Z.W.), 2016 Milstein Medical Asian American Partnership Foundation Fellowship Award in Translational Medicine (to N.W.), CAMS Initiative Fund for Medical Sciences (2016-I2M-3-003 to G.Q. and N.W., 2016-I2M-2-006 and 2017-I2M-2-001 to Z.W.), Tsinghua University-Peking Union Medical College Hospital Initiative Scientific Research Program (to N.W.), the National Key Research and Development Program of China (No. 2016YFC0901501 to Shuyang.Z.), and the National Undergraduates Innovation and Training Program of Peking Union Medical College (2019zlgc0627 to Sen.Z.), CAMS Innovation Fund for Graduates (2018-1002-01-09 to Yuan.Z.). Also supported by the US National Institutes of Health, National Institute of Neurological Disorders and Stroke (NINDS R35 NS105078 to J.R.L), National Human Genome Research Institute/National Heart, Lung, and Blood Institute (NHGRI/NHLBI UM1 HG006542 to Baylor-Hopkins Center for Mendelian Genomics), the National Human Genome Research Institute (NHGRI K08 HG00 8986 to J.E.P), TX Scottish Rite Hospital Research Fund (to C.A.W), Foundation Cotrel (to C.A.W), and P01 HD084387 (to C.A.W).
Footnotes
Competing interest
J.R.L has stock ownership in 23andMe, is a paid consultant for Regeneron Pharmaceuticals and Novartis, and is a co-inventor on multiple United States and European patents related to molecular diagnostics for inherited neuropathies, eye diseases and bacterial genomic fingerprinting. The Department of Molecular and Human Genetics at Baylor College of Medicine derives revenue from the chromosomal microarray analysis (CMA by aCGH and/or SNP arrays), clinical exome sequencing (cES) and whole-genome sequencing (WGS) offered in the Baylor Genetics (BG) Laboratory (http://bmgl.com).
Ethics approval and consent to participate
Written informed consent was provided by each participant in the two cohorts. Approval for the study was obtained from the ethics committee at Peking Union Medical College Hospital (JS-098), the Institutional Review Board of the University of Texas Southwestern Medical Center (STU 112010–150) and Baylor College of Medicine (H-29697).
Data sharing statement
The datasets analyzed during the current study are available from the corresponding author on reasonable request.
Reference
- 1.Corona J, Miller DJ, Downs J, Akbarnia BA, Betz RR, Blakemore LC, Campbell RM Jr., Flynn JM, Johnston CE, McCarthy RE, Roye DP Jr., Skaggs DL, Smith JT, Snyder BD, Sponseller PD, Sturm PF, Thompson GH, Yazici M, Vitale MG. Evaluating the extent of clinical uncertainty among treatment options for patients with early-onset scoliosis. J Bone Joint Surg Am 2013;95(10):e67 doi: 10.2106/JBJS.K.00805. [DOI] [PubMed] [Google Scholar]
- 2.Williams BA, Matsumoto H, McCalla DJ, Akbarnia BA, Blakemore LC, Betz RR, Flynn JM, Johnston CE, McCarthy RE, Roye DP Jr., Skaggs DL, Smith JT, Snyder BD, Sponseller PD, Sturm PF, Thompson GH, Yazici M, Vitale MG. Development and initial validation of the Classification of Early-Onset Scoliosis (C-EOS). J Bone Joint Surg Am 2014;96(16):1359–67 doi: 10.2106/JBJS.M.00253. [DOI] [PubMed] [Google Scholar]
- 3.Gillingham BL, Fan RA, Akbarnia BA. Early onset idiopathic scoliosis. J Am Acad Orthop Surg 2006;14(2):101–12 [DOI] [PubMed] [Google Scholar]
- 4.De Paepe A, Malfait F. The Ehlers-Danlos syndrome, a disorder with many faces. Clin Genet 2012;82(1):1–11 doi: 10.1111/j.1399-0004.2012.01858.x. [DOI] [PubMed] [Google Scholar]
- 5.Carey JC, Viskochil DH. Neurofibromatosis type 1: A model condition for the study of the molecular basis of variable expressivity in human disorders. Am J Med Genet 1999;89(1):7–13 [PubMed] [Google Scholar]
- 6.Matsunami N, Smith B, Ballard L, Lensch MW, Robertson M, Albertsen H, Hanemann CO, Muller HW, Bird TD, White R. Peripheral myelin protein-22 gene maps in the duplication in chromosome 17p11.2 associated with Charcot-Marie-Tooth 1A. Nat Genet 1992;1(3):176–9 doi: 10.1038/ng0692-176. [DOI] [PubMed] [Google Scholar]
- 7.Wu N, Ming X, Xiao J, Wu Z, Chen X, Shinawi M, Shen Y, Yu G, Liu J, Xie H, Gucev ZS, Liu S, Yang N, Al-Kateb H, Chen J, Zhang J, Hauser N, Zhang T, Tasic V, Liu P, Su X, Pan X, Liu C, Wang L, Shen J, Shen J, Chen Y, Zhang T, Zhang J, Choy KW, Wang J, Wang Q, Li S, Zhou W, Guo J, Wang Y, Zhang C, Zhao H, An Y, Zhao Y, Wang J, Liu Z, Zuo Y, Tian Y, Weng X, Sutton VR, Wang H, Ming Y, Kulkarni S, Zhong TP, Giampietro PF, Dunwoodie SL, Cheung SW, Zhang X, Jin L, Lupski JR, Qiu G, Zhang F. TBX6 null variants and a common hypomorphic allele in congenital scoliosis. N Engl J Med 2015;372(4):341–50 doi: 10.1056/NEJMoa1406829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Liu J, Wu N, Deciphering Disorders Involving S, study CO, Yang N, Takeda K, Chen W, Li W, Du R, Liu S, Zhou Y, Zhang L, Liu Z, Zuo Y, Zhao S, Blank R, Pehlivan D, Dong S, Zhang J, Shen J, Si N, Wang Y, Liu G, Li S, Zhao Y, Zhao H, Chen Y, Zhao Y, Song X, Hu J, Lin M, Tian Y, Yuan B, Yu K, Niu Y, Yu B, Li X, Chen J, Yan Z, Zhu Q, Meng X, Chen X, Su J, Zhao X, Wang X, Ming Y, Li X, Raggio CL, Zhang B, Weng X, Zhang S, Zhang X, Watanabe K, Matsumoto M, Japan Early Onset Scoliosis Research G, Jin L, Shen Y, Sobreira NL, Posey JE, Giampietro PF, Valle D, Baylor-Hopkins Center for Mendelian G, Liu P, Wu Z, Ikegawa S, Lupski JR, Zhang F, Qiu G. TBX6-associated congenital scoliosis (TACS) as a clinically distinguishable subtype of congenital scoliosis: further evidence supporting the compound inheritance and TBX6 gene dosage model. Genet Med 2019;21(7):1548–58 doi: 10.1038/s41436-018-0377-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Chen W, Lin J, Wang L, Li X, Zhao S, Liu J, Akdemir ZC, Zhao Y, Du R, Ye Y, Song X, Zhang Y, Yan Z, Yang X, Lin M, Shen J, Wang S, Gao N, Yang Y, Liu Y, Li W, Liu J, Zhang N, Yang X, Xu Y, Zhang J, Delgado MR, Posey JE, Qiu G, Rios JJ, Liu P, Wise CA, Zhang F, Wu Z, Lupski JR, Wu N. TBX6 missense variants expand the mutational spectrum in a non-Mendelian inheritance disease. Hum Mutat 2020;41(1):182–95 doi: 10.1002/humu.23907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Bargal R, Cormier-Daire V, Ben-Neriah Z, Le Merrer M, Sosna J, Melki J, Zangen DH, Smithson SF, Borochowitz Z, Belostotsky R, Raas-Rothschild A. Mutations in DDR2 gene cause SMED with short limbs and abnormal calcifications. Am J Hum Genet 2009;84(1):80–4 doi: 10.1016/j.ajhg.2008.12.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Krakow D, Robertson SP, King LM, Morgan T, Sebald ET, Bertolotto C, Wachsmann-Hogiu S, Acuna D, Shapiro SS, Takafuta T, Aftimos S, Kim CA, Firth H, Steiner CE, Cormier-Daire V, Superti-Furga A, Bonafe L, Graham JM Jr., Grix A, Bacino CA, Allanson J, Bialer MG, Lachman RS, Rimoin DL, Cohn DH. Mutations in the gene encoding filamin B disrupt vertebral segmentation, joint formation and skeletogenesis. Nat Genet 2004;36(4):405–10 doi: 10.1038/ng1319. [DOI] [PubMed] [Google Scholar]
- 12.Mundlos S, Otto F, Mundlos C, Mulliken JB, Aylsworth AS, Albright S, Lindhout D, Cole WG, Henn W, Knoll JH, Owen MJ, Mertelsmann R, Zabel BU, Olsen BR. Mutations involving the transcription factor CBFA1 cause cleidocranial dysplasia. Cell 1997;89(5):773–9 [DOI] [PubMed] [Google Scholar]
- 13.Gao X, Gotway G, Rathjen K, Johnston C, Sparagana S, Wise CA. Genomic Analyses of Patients With Unexplained Early-Onset Scoliosis. Spine Deform 2014;2(5):324–32 doi: 10.1016/j.jspd.2014.04.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Wang K, Zhao S, Liu B, Zhang Q, Li Y, Liu J, Shen Y, Ding X, Lin J, Wu Y, Yan Z, Chen J, Li X, Song X, Niu Y, Liu J, Chen W, Ming Y, Du R, Chen C, Long B, Zhang Y, Tong X, Zhang S, Posey JE, Zhang B, Wu Z, Wythe JD, Liu P, Lupski JR, Yang X, Wu N. Perturbations of BMP/TGF-beta and VEGF/VEGFR signalling pathways in non-syndromic sporadic brain arteriovenous malformations (BAVM). J Med Genet 2018;55(10):675–84 doi: 10.1136/jmedgenet-2017-105224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, Grody WW, Hegde M, Lyon E, Spector E, Voelkerding K, Rehm HL, Committee ALQA. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med 2015;17(5):405–24 doi: 10.1038/gim.2015.30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Kalia SS, Adelman K, Bale SJ, Chung WK, Eng C, Evans JP, Herman GE, Hufnagel SB, Klein TE, Korf BR, McKelvey KD, Ormond KE, Richards CS, Vlangos CN, Watson M, Martin CL, Miller DT. Recommendations for reporting of secondary findings in clinical exome and genome sequencing, 2016 update (ACMG SF v2.0): a policy statement of the American College of Medical Genetics and Genomics. Genet Med 2017;19(2):249–55 doi: 10.1038/gim.2016.190. [DOI] [PubMed] [Google Scholar]
- 17.Fromer M, Purcell SM. Using XHMM Software to Detect Copy Number Variation in Whole-Exome Sequencing Data. Curr Protoc Hum Genet 2014;81:7 23 1–21 doi: 10.1002/0471142905.hg0723s81. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Krumm N, Sudmant PH, Ko A, O’Roak BJ, Malig M, Coe BP, Project NES, Quinlan AR, Nickerson DA, Eichler EE. Copy number variation detection and genotyping from exome sequence data. Genome Res 2012;22(8):1525–32 doi: 10.1101/gr.138115.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Breiman L Random Forests. Machine Learning 2001;45(1):5–32 doi: 10.1023/a:1010933404324. [DOI] [Google Scholar]
- 20.Takeda K, Kou I, Kawakami N, Iida A, Nakajima M, Ogura Y, Imagawa E, Miyake N, Matsumoto N, Yasuhiko Y, Sudo H, Kotani T, Japan Early Onset Scoliosis Research G, Nakamura M, Matsumoto M, Watanabe K, Ikegawa S. Compound Heterozygosity for Null Mutations and a Common Hypomorphic Risk Haplotype in TBX6 Causes Congenital Scoliosis. Hum Mutat 2017;38(3):317–23 doi: 10.1002/humu.23168. [DOI] [PubMed] [Google Scholar]
- 21.Lefebvre M, Duffourd Y, Jouan T, Poe C, Jean-Marcais N, Verloes A, St-Onge J, Riviere JB, Petit F, Pierquin G, Demeer B, Callier P, Thauvin-Robinet C, Faivre L, Thevenon J. Autosomal recessive variations of TBX6, from congenital scoliosis to spondylocostal dysostosis. Clin Genet 2017;91(6):908–12 doi: 10.1111/cge.12918. [DOI] [PubMed] [Google Scholar]
- 22.Yang N, Wu N, Zhang L, Zhao Y, Liu J, Liang X, Ren X, Li W, Chen W, Dong S, Zhao S, Lin J, Xiang H, Xue H, Chen L, Sun H, Zhang J, Shi J, Zhang S, Lu D, Wu X, Jin L, Ding J, Qiu G, Wu Z, Lupski JR, Zhang F. TBX6 compound inheritance leads to congenital vertebral malformations in humans and mice. Hum Mol Genet 2019;28(4):539–47 doi: 10.1093/hmg/ddy358. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Posey JE, Harel T, Liu P, Rosenfeld JA, James RA, Coban Akdemir ZH, Walkiewicz M, Bi W, Xiao R, Ding Y, Xia F, Beaudet AL, Muzny DM, Gibbs RA, Boerwinkle E, Eng CM, Sutton VR, Shaw CA, Plon SE, Yang Y, Lupski JR. Resolution of Disease Phenotypes Resulting from Multilocus Genomic Variation. N Engl J Med 2017;376(1):21–31 doi: 10.1056/NEJMoa1516767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kurotaki N, Imaizumi K, Harada N, Masuno M, Kondoh T, Nagai T, Ohashi H, Naritomi K, Tsukahara M, Makita Y, Sugimoto T, Sonoda T, Hasegawa T, Chinen Y, Tomita Ha HA, Kinoshita A, Mizuguchi T, Yoshiura Ki K, Ohta T, Kishino T, Fukushima Y, Niikawa N, Matsumoto N. Haploinsufficiency of NSD1 causes Sotos syndrome. Nat Genet 2002;30(4):365–6 doi: 10.1038/ng863. [DOI] [PubMed] [Google Scholar]
- 25.Firth HV, Richards SM, Bevan AP, Clayton S, Corpas M, Rajan D, Van Vooren S, Moreau Y, Pettett RM, Carter NP. DECIPHER: Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources. Am J Hum Genet 2009;84(4):524–33 doi: 10.1016/j.ajhg.2009.03.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Lefebvre M, Dieux-Coeslier A, Baujat G, Schaefer E, Judith SO, Bazin A, Pinson L, Attie-Bitach T, Baumann C, Fradin M, Pierquin G, Julia S, Quelin C, Doray B, Berg S, Vincent-Delorme C, Lambert L, Bachmann N, Lacombe D, Isidor B, Laurent N, Joelle R, Blanchet P, Odent S, Kervran D, Leporrier N, Abel C, Segers K, Guiliano F, Ginglinger-Fabre E, Selicorni A, Goldenberg A, El Chehadeh S, Francannet C, Demeer B, Duffourd Y, Thauvin-Robinet C, Verloes A, Cormier-Daire V, Riviere JB, Faivre L, Thevenon J. Diagnostic strategy in segmentation defect of the vertebrae: a retrospective study of 73 patients. J Med Genet 2018;55(6):422–29 doi: 10.1136/jmedgenet-2017-104939. [DOI] [PubMed] [Google Scholar]
- 27.Todd EJ, Yau KS, Ong R, Slee J, McGillivray G, Barnett CP, Haliloglu G, Talim B, Akcoren Z, Kariminejad A, Cairns A, Clarke NF, Freckmann ML, Romero NB, Williams D, Sewry CA, Colley A, Ryan MM, Kiraly-Borri C, Sivadorai P, Allcock RJ, Beeson D, Maxwell S, Davis MR, Laing NG, Ravenscroft G. Next generation sequencing in a large cohort of patients presenting with neuromuscular disease before or at birth. Orphanet J Rare Dis 2015;10:148 doi: 10.1186/s13023-015-0364-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Manning BM, Quane KA, Ording H, Urwyler A, Tegazzin V, Lehane M, O’Halloran J, Hartung E, Giblin LM, Lynch PJ, Vaughan P, Censier K, Bendixen D, Comi G, Heytens L, Monsieurs K, Fagerlund T, Wolz W, Heffron JJ, Muller CR, McCarthy TV. Identification of novel mutations in the ryanodine-receptor gene (RYR1) in malignant hyperthermia: genotype-phenotype correlation. Am J Hum Genet 1998;62(3):599–609 doi: 10.1086/301748. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Urwyler A, Deufel T, McCarthy T, West S, European Malignant Hyperthermia G. Guidelines for molecular genetic detection of susceptibility to malignant hyperthermia. Br J Anaesth 2001;86(2):283–7 [DOI] [PubMed] [Google Scholar]
- 30.Riva P, Corrado L, Natacci F, Castorina P, Wu BL, Schneider GH, Clementi M, Tenconi R, Korf BR, Larizza L. NF1 microdeletion syndrome: refined FISH characterization of sporadic and familial deletions with locus-specific probes. Am J Hum Genet 2000;66(1):100–9 doi: 10.1086/302709. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Venturin M, Guarnieri P, Natacci F, Stabile M, Tenconi R, Clementi M, Hernandez C, Thompson P, Upadhyaya M, Larizza L, Riva P. Mental retardation and cardiovascular malformations in NF1 microdeleted patients point to candidate genes in 17q11.2. J Med Genet 2004;41(1):35–41 doi: 10.1136/jmg.2003.014761. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Tinschert S, Naumann I, Stegmann E, Buske A, Kaufmann D, Thiel G, Jenne DE. Segmental neurofibromatosis is caused by somatic mutation of the neurofibromatosis type 1 (NF1) gene. Eur J Hum Genet 2000;8(6):455–9 doi: 10.1038/sj.ejhg.5200493. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
