MotorPlex provides accurate variant detection across large muscle genes both in single myopathic patients and in pools of DNA samples

Marco Savarese; Giuseppina Di Fruscio; Margherita Mutarelli; Annalaura Torella; Francesca Magri; Filippo Maria Santorelli; Giacomo Pietro Comi; Claudio Bruno; Vincenzo Nigro

doi:10.1186/s40478-014-0100-3

. 2014 Sep 11;2:100. doi: 10.1186/s40478-014-0100-3

MotorPlex provides accurate variant detection across large muscle genes both in single myopathic patients and in pools of DNA samples

Marco Savarese ^1,², Giuseppina Di Fruscio ^1,², Margherita Mutarelli ², Annalaura Torella ¹, Francesca Magri ³, Filippo Maria Santorelli ⁴, Giacomo Pietro Comi ³, Claudio Bruno ⁵, Vincenzo Nigro ^1,^2,^✉

PMCID: PMC4172906 PMID: 25214167

Abstract

Mutations in ~100 genes cause muscle diseases with complex and often unexplained genotype/phenotype correlations. Next-generation sequencing studies identify a greater-than-expected number of genetic variations in the human genome. This suggests that existing clinical monogenic testing systematically miss very relevant information.

We have created a core panel of genes that cause all known forms of nonsyndromic muscle disorders (MotorPlex). It comprises 93 loci, among which are the largest and most complex human genes, such as TTN, RYR1, NEB and DMD. MotorPlex captures at least 99.2% of 2,544 exons with a very accurate and uniform coverage. This quality is highlighted by the discovery of 20-30% more variations in comparison with whole exome sequencing. The coverage homogeneity has also made feasible to apply a cost-effective pooled sequencing strategy while maintaining optimal sensitivity and specificity.

We studied 177 unresolved cases of myopathies for which the best candidate genes were previously excluded. We have identified known pathogenic variants in 52 patients and potential causative ones in further 56 patients. We have also discovered 23 patients showing multiple true disease-associated variants suggesting complex inheritance. Moreover, we frequently detected other nonsynonymous variants of unknown significance in the largest muscle genes. Cost-effective combinatorial pools of DNA samples were similarly accurate (97-99%).

MotorPlex is a very robust platform that overcomes for power, costs, speed, sensitivity and specificity the gene-by-gene strategy. The applicability of pooling makes this tool affordable for the screening of genetic variability of muscle genes also in a larger population. We consider that our strategy can have much broader applications.

Electronic supplementary material

The online version of this article (doi:10.1186/s40478-014-0100-3) contains supplementary material, which is available to authorized users.

Keywords: Next generation sequencing, Myopathies, Target sequencing, Pooling, Muscular dystrophies

Introduction

Muscle genetic disorders comprise about 100 different genetic conditions [1],[2], characterized by a clinical, genetic and biochemical heterogeneity. The molecular diagnosis for myopathic patients is crucial for genetic counseling, for prognosis and for available and forthcoming mutation-specific treatments [3]-[5]. In addition, patients that share the same mutation may have a different type of muscle affection with the selective involvement of other muscle compartments or myocardial damage. Thus, the primary defect may be modified or not by additional and variable elements that may be genetic or not. The most severe cases of congenital or childhood-onset myopathies often result from mutations in genes encoding proteins belonging to common pathways [6]. To provide a clue to address genetic testing, a muscle biopsy is often required that may be useful, but not well accepted by patients. The single gene testing can be diagnostic only in patients with most recognizable disorders. In unspecific cases of muscular diseases, however, no effective methodology has been developed for the parallel testing of all disease genes identified so far [7].

Next-generation sequencing (NGS) is changing our view of biology and medicine allowing the large-scale calling of small variations in DNA sequences [8]. In the last few years, the whole-exome sequencing (WES) and whole-genome sequencing (WGS) have received widespread recognition as universal tests for the discovery of novel causes of Mendelian disorders in families [9]. The power to discover a novel Mendelian condition increases with the family size, even if successful studies, identifying novel disease genes from multiple small families with the same phenotype, have been published [10]. Structural and copy number variations are not well detected by NGS technologies [11]-[14]. However, the WES/WGS use for the clinical testing of isolated cases is still debated. First, there are ethical issues linked to the management of the incidental findings [15]. The second limitation is given by the practical problem that the coverage is usually too low for clinical diagnosis. Hence the cost-effectiveness is reduced, considering that WES/WGS may require either numerous validation procedures, mainly based on conventional PCR and Sanger sequencing reactions [16]. Innovative strategies of clinical exome sequencing at high coverage have been described [17], but the cost for a single patient is still too high for routine diagnosis. Thus, there is still space for targeted strategies [18] and the HaloPlex Target Enrichment System [19] represents an innovative technology for targeting, since it uses a combination of eight different enzyme restriction followed by probe capture. It permits a single-tube target amplification and one can accurately predict the precise sequence coverage in advance. We have developed a NGS targeting workflow as a single testing methodology for the diagnosis of genetic myopathies that we named Motorplex. Here we demonstrate the high sensitivity and specificity of Motorplex. We challenged our platform against complex DNA pools. Even with this complexity, Motorplex kept producing reliable data with high sensitivity and specificity values. Furthermore, pooling reduced the cost of the entire analysis at negligible values, implementing applications for large studies of populations [16],[20].

Materials and methods

Patients

Encrypted DNA samples from patients with clinical diagnosis of nonspecific myopathies, congenital myopathy, proximal muscle weakness or limb-girdle muscular dystrophy (LGMD) were included. The Italian Networks of Congenital Myopathies (coordinated by C.B. and F.M.S.) of LGMD (by F.M. and G.P.C.) were involved together with a large number of other single clinical centers. We asked all them the possibility to share more clinical and laboratory findings, when necessary. We also requested to provide information on familial segregation and previous negative genetic tests. Internal patients signed a written informed consent, according to the guidelines of Telethon Italy and approved by the Ethics Committee of the “Seconda Università degli Studi di Napoli”, Naples, Italy.

DNA samples were extracted using standard procedures. DNA quality and quantity were assessed using both spectrophotometric (Nanodrop ND 1000, Thermo Scientific Inc., Rockford, IL, USA) and fluorometry-based (Qubit 2.0 Fluorometer, Life Technologies, Carlsbad, CA, USA) methods.

In silico design of MotorPlex

We included in the design all the 93 genes that are universally considered as genetic causes of nonsyndromic myopathies (Additional file 1: Table S1). In particular, we only selected genes determining a primary skeletal muscle disease, such as underlying muscular dystrophies, congenital myopathies, metabolic myopathies, congenital muscular dystrophies, Emery-Dreifuss muscular dystrophy, etc. We therefore excluded loci associated with other neuromuscular and neurological disorders such as congenital myasthenias, myotonic dystrophy, spinal muscular atrophy, ataxias, neuropathies, or paraplegias for which differential diagnosis may be clinically possible. For each locus, all predicted exons and at least ten flanking nucleotides were always included in the electronic design by the custom NGS Agilent SureDesign webtool. Setting the sequence length at 100×2 nucleotides, the predicted target size amounted to 2,544 regions and 493.598kb. Around 20% of the target is represented by TTN coding regions.

NGS workflow

For library preparation of single samples, we followed the manufacturer’s instructions (HaloPlex Target Enrichment System For Illumina Sequencing, Protocol version D, August 2012, Agilent Technologies, Santa Clara, CA, USA). We started using 200ng of genomic DNA and strictly followed the protocol, with the exception that restricted fragments were hybridized for at least 16–24 hours to the specific probes. After the capture of biotinylated target DNA, using streptavidin beads, nicks in the circularized fragments were closed by a ligase. Finally, the captured target DNA was eluted by NaOH and amplified by PCR. Amplified target molecules were purified using Agencourt AMPure XP beads (Beckman Coulter Genomics, Bernried am Starnberger See, Germany).

The enriched target DNA in each library sample was validated and quantified by microfluidics analysis using the Bioanalyzer High Sensitivity DNA Assay kit (Agilent Technologies) and the 2100 Bioanalyzer with the 2100 Expert Software. Usually 20 individual samples were run in a single lane (250M reads), generating 100-bp paired end reads.

For Pool-Seq experiments, equimolar pools of 5 or 16 DNA samples (detector and scouting pools) were created and 200ng of each pool was used for the HaloPlex enrichment strategy. Sixteen detector and five scouting pools were usually run in a single HiSeq1000 lane.

Targeted sequencing analysis

The libraries were sequenced using the HiSeq1000 system (Illumina inc., San Diego, CA, USA). The generated sequences were analyzed using an in-house pipeline designed to automate the analysis workflow, composed by modules performing every step using the appropriate tools available to the scientific community or developed in-house [21]. Paired sequencing reads were aligned to the reference genome (UCSC, hg19 build) using BWA [22], sorted with Picard (http://picard.sourceforge.net) and locally realigned around insertions-deletions with Genome Analysis Toolkit (GATK) [23]. The UnifiedGenotyper algorithm of GATK was used for SNV and small insertions-deletions (ins-del) calling, with parameters adapted to the Haloplex-generated sequences. The analysis of pools was performed with UnifiedGenotyper as well, adapting the ploidy parameter to the number of chromosomes present in the samples (10 for the detector and 32 for the scout pools) and the minimal ins-del fraction parameter accordingly. The called SNV and ins-del variants produced with both platforms were annotated using ANNOVAR [24] with: the relative position in genes using RefSeq [25] gene model, amino acid change, presence in dbSNP v137 [26], frequency in NHLBI Exome Variant Server (http://evs.gs.washington.edu/EVS) and 1000 genomes large scale projects [27], multiple cross-species conservation [28],[29] and prediction scores of damaging on protein activity [30]-[33]. The annotated variants were then imported into the internal variation database, which stores all the variations found in the re-sequencing projects performed so far in our institute. The database was then queried to generate the filtered list of variations and the internal database frequency in samples with unrelated phenotype was used as further annotation and filtering criteria. The alignments at candidate positions were visually inspected using the Integrative genomics viewer (IGV) [34]. We selected from the database the non-synonymous SNVs and ins-del, with a frequency lower than 2%, which was followed by manual inspection and further filtering criteria based on the presence in unrelated samples of the database, on the presence in the other samples of the Motorplex experiment and on the conservation of the mutations, with a final selection of rare, possibly causative, variations per individual.

Results

Validation study of MotorPlex

To design MotorPlex we used a straightforward procedure. Briefly, disease genes causing a muscular phenotype, including the biggest genes of the human genome, like titin (TTN) or dystrophin (DMD), were selected. The target sequences, corresponding to 0.5Mbp were enriched by the HaloPlex system (see Materials and methods). To validate MotorPlex, we created a training set of twenty DNA samples belonging to patients (15 males and 5 females) affected by different forms of limb-girdle muscular dystrophy or congenital myopathy (Additional file 2: Table S2) and compared with data from whole exome sequencing (WES) (Figure 1). For each sample, about 98% of reads generated (Figure 1a and Additional file 3: Table S4) were on target (compared to 88% obtained by WES) and fewer than 0.5% of targeted regions were not covered (about 15% of human exons are not analyzed by WES, Additional file 4: Figure S1). Moreover, more than 95% of targeted nucleotides were read at a 100× depth and a 500× depth was obtained for 80% of these; on the contrary, by performing a WES analysis, fewer than 70% of exons were covered at 20× (Figure 1b). From previous amplicon Sanger sequencing from these samples, we knew about 84 variants in 17 different genes (Additional file 5: Table S3). All these known variants were correctly called and no additional change was seen within the sequenced target (100% sensitivity and specificity). Moreover, to assess the reproducibility of the targeted enrichment and the subsequent NGS workflow, the same sample (43U) was analyzed twice. After filtering, variants were always confirmed, including the putative causative one (Table 1). Outside the Sanger coverage, 4,991 additional variations were called (Additional file 6: Table S5).

**A comparison between MotorPlex and a Whole Exome strategy** (**WES**) **demonstrates the better performance of the targeted strategy. (a)** 97.75% of reads generated in a MotorPlex experiment fall in the regions of interest and only 0.67% of targeted regions are not sequenced. On the contrary, for WES 88.66% of reads are on target and 14.89% of targeted exons are not effectively covered. **(b)** The percentage of targeted regions covered at high depth by MotorPlex is higher than that obtained by WES. In particular, 96.01% and 81.6% of regions are, respectively, covered at 100x and 200x by using MotorPlex versus 35.49% and 1.90% by WES.

Table 1.

List of pathogenic variants

Sample ID	Sex	Clinical diagnosis	Inheritance	Histopathologic features	Variant(s)
Single1	M	CM	Sp	c.n.	DNM2	chr19:10934538*	c.1856 C>G	p.S619W	het	c.n.^sr1
Single3	M	LGMD	Sp	m.f.	CAPN3	chr15:42695076*	c.1621 C>T	p.R541W	het	LGMD^sr2
Single3	M	LGMD	Sp	m.f.	CAPN3	chr15:42682142*	c.802-9G>A	spl.	het	LGMD^sr3
Single6	M	LGMD	Rec	m.f.	FKRP	chr19:47259458	c.751G>T	p.A251S	het
Single6	M	LGMD	Rec	m.f.	FKRP	chr19:47259758	c.G1051C	p.A351P	het
Single8	M	LGMD	Sp	n.a.	DYSF	chr2:71838708	c.4119 C>A	p.N1373K	het
Single8	M	LGMD	Sp	n.a.	DYSF	chr2:71762413	c.1369G>A	p.E457K	het
Single15	F	LGMD/CM	Sp	d.f.	SYNE2	chr14:64688329	c.663G>A	p.W221X	het
Single16	M	LGMD/DCM	Sp	d.f.	SGCG	chr13:23869573*	c.525 delT	p.F175L fsX20	hom	LGMD^sr4
Single16	M	LGMD/DCM	Sp	d.f.	LDB3	chr10:88446830*	c.349G>A	p.D117N	het	DCM^sr5
Single19	M	LGMD	Sp	m.f.	RYR1	chr19:39062797*	c.13885G>A	p.V4629M	het	CM^sr6
Single20	M	LGMD/DCM	Rec	c.n.	RYR1	chr19:39009932*	c.10097G>A	p.R3366H	het	Multiminicore^sr7
					RYR1	chr19:38973933*	c.4711 A>G	p.I1571V	het	MH^sr8
					RYR1	chr19:39034191*	c.11798A>G	p.Y3933C	het	MH^sr9
					RYR1	chr19:38942453	c.G1172C	p.R391P	het
					DES	chr2:220284876*	c.638 C>T	p.A213V	het	DCM¹⁰
1/17s	F	CM	Sp	c.n.	TTN	chr2:179452695*	c.63439G>A	p.A21157T	het	ARVD^sr11
					TTN	chr2:179496025	c.G43750T	p.G14584X	het
					TTN	chr2:179392277*	c.107576T>C	p.M35859T	het	ARVD^sr11
1/21s	M	LGMD	n.a.	n.a.	SGCA	chr17:48246607*	c.739G>A	p.247V>M	het	LGMD^sr12
1/21s	M	LGMD	n.a.	n.a.	SGCA	chr17:48245758*	c.409G>A	p.E137K	het	LGMD^sr13
2/17s	F	CM	Sp	cftdm	MYH7	chr14:23886406	c.T4475C	p.L1492P	het
2/20s	M	LGMD	n.a.	n.a.	POMT2	chr14:77745129*	c.1975 C>T	p.659 R>W	het	CMD^sr14
2/20s	M	LGMD	n.a.	n.a.	POMT2	chr14:77769283*	c.551 C>T	p.T184M	het	LGMD^sr15
3/20s	F	LGMD	Sp	cftdm	TPM2	chr9:35689792*	c.20_22delAGA	p.7Kdel	het	CM^sr16
4/17s	M	LGMD	Rec	c.n.	ANO5	chr11:22242646*	ANO5:c.191dupA	p.64N>Kfs*15	hom	LGMD^sr17
4/18s	M	LGMD	Sp	vacuoles	DNAJB6	chr7:157175006	c.413G>A	p.G138E	het
5/17s	M	LGMD/DCM	Sp	m.f.	MYOT	chr5:137213267	c.591delTG	p.199F>S fsX3	het
5/21s	M	LGMD	Sp	c.n.	CAV3	chr3:8787288*	c.191C>G	p.T64S	het	HCM^sr18
6/20s	M	LGMD	Sp	d.f.	ACADVL	chr17:7127330*	c.G1376A	p.R459Q	het	VLCAD^sr19
6/20s	M	LGMD	Sp	d.f.	ACADVL	chr17:7128130	c.C754T	p.A585V	het
7/17s	M	LGMD	Sp	m.f.	CAPN3	chr15:42702843*	c.2242 C>T	p.R748X	het	LGMD^sr20
7/17s	M	LGMD	Sp	m.f.	CAPN3	chr15:42693952*	c.1468 C>T	p.R490W	het	LGMD^sr21
7/20s	F	LGMD	Sp	d.f.	LMNA	chr1:156100408*	c.357 C>T	p.R119R (spl.)	het	EDMD^sr22
8/19s	M	LGMD	n.a.	d.f.	DNAJB6	chr7:157155959	c.C170T	p.S57L	het
10/17s	F	CM	Sp	m.f.	MYH7	chr14:23886518	c.G4363T	p.E1455X	het
10/21s	M	LGMD/FSHD	Dom	d.f.	SMCHD1	chr18:2700849*	c.C1580T	p.T527M	het	FSHD^sr23
11/18s	M	CM	Sp	nemaline	NEB	chr2:152447860	c.6915+2T>C	spl.	het
11/18s	M	CM	Sp	nemaline	NEB	chr2:152553662	c.C1470T	p.D490D (spl.?)	het
12/18s	F	CM	Sp	cftdm	MYH7	chr14:23882063	c.G5808C	p.X1936Y	het
12/21s	F	LGMD	Sp	d.f.	PYGM	chr11:64519958	c.A1537G	p.I513V	het
12/21s	F	LGMD	Sp	d.f.	PYGM	chr11:64514809*	c.C2199G	p.Y733X	het	McArdle^sr24
13/20s	M	LGMD	Rec	n.a.	LAMA2	chr6:129722399*	c.C5476T	p.R1826X	het	LGMD^sr25
13/20s	M	LGMD	Rec	n.a.	LAMA2	chr6:129571264	c.1791_1793del AGT	p.598 del V	het
13/21s	M	LGMD	Sp	d.f.	SGCG	chr13:23898652*	c.848G>A	p.C283Y	hom	LGMD^sr26
14/20s	F	LGMD	n.a.	n.a.	CAPN3	chr15:42686485*	c.1061T>G	p.V354G	het	LGMD^sr21
14/20s	F	LGMD	n.a.	n.a.	CAPN3	chr15:42689077	c.1193+2T>C	spl.	het
14/18s	M	LGMD	n.a.	d.f.	DMD	chrX:32360366*	c.G5773T	p.E1925X	hem	Duchenne^sr27
15/19s	M	CM	Sp	multiminicores	MYH7	chr14:23885313*	c.4850_4852del	p.1617 del K	het	Distal^sr28
16/18s	M	LGMD	Sp	no alterations	CAPN3	chr15:42691746*	c.1250 C>T	p.T417M	hom	LGMD^sr29
16/20s	M	CM	Sp	cftdm	TTN	chr2:179431175	c.C79684T	p.R26562X	het
16/20s	M	CM	Sp	cftdm	TTN	chr2:179526510	c.A39019T	p.K13007X	het
16/21s	F	CM	Dom	n.a.	TPM2	chr9:35685541*	c.A382G	p.K128E	het	CFTD^sr30
23/38s	M	CM	Sp	cftdm	RYR1	chr19:38959672	c.3449delG	p.C1150fs	het
					RYR1	chr19:38985186	c.6469G>A	p.E2157K	het
					RYR1	chr19:39003108*	c.9457G>A	p.G3153R	het	MH^sr31
23/41s	M	CM	Sp	m.f.	RYR1	chr19:38990637*	c.G7304T	p.R2435L	hom	CCD^sr32
24/42s	F	CM	n.a.	n.a.	ACTA1	chr1:229567867*	c.G682C	p.E228Q	het	Nemaline^sr33
25/38s	M	CM	Sp	cftdm	CRYAB	chr11:111779520	c.A496T	p.K166X	het
25/39s	F	CM	Dom	c.n.	RYR1	chr19:39075614*	c.14678G>A	p.R4893Q	het	CCD^sr34
25/41s	F	CM	n.a.	n.a.	MYH7	chr14:23886750	c.G4315C	p.A1439P	het
28/39s	F	CM	Dom	minicore	MYH7	chr14:23885313*	c.4850_4852del	p.1617del K	het	Distal^sr28
28/41s	M	CM	Sp	c.n.	MTM1	chrX:149831996*	c.C1558T	p.R520X	hem	Myotubular^sr35
29/41s	F	CM	Rec	n.a.	NEB	chr2:152387617	c.21628-2A>T	spl.	het
29/41s	F	CM	Rec	n.a.	NEB	chr2:152541300	c.C2827T	p.Q943X	het
30/42s	F	CM	Rec	cftdm	RYR1	chr19:38948185*	c.C1840T	p.R614C	het	MH^sr36
30/42s	F	CM	Rec	cftdm	RYR1	chr19:38959747	c.G3523A	p.E1175K	het
31/42s	F	CM	Rec	nemaline	NEB	chr2:152471093	c.11298_11300delTAC	p.Y3766del	hom
32/41s	M	CM	Dom	c.n.	MTM1	chrX:149826390	c.1150 C>T	p.Q384X	het
32/42s	F	CM	Dom	minicore	DNM2	chr19:10939917	c.C2252A	p.T751N	het
33/41s	M	CM	Rec	nemaline	NEB	chr2:152370944	c.23122-2A>G	spl.	het
33/41s	M	CM	Rec	nemaline	NEB	chr2:152544037	c.A2533G	p.K845E	het
36/42s	M	CM	Dom	n.a.	RYR1	chr19:39075629*	c.T14693C	p.I4898T	het	CCD^sr37
37/39s	M	LGMD	Sp	d.f.	DMD	chrX:32841417*	c.T328C	p.W110R	hem	Becker^sr38
37/40s	F	LGMD	Sp	n.a.	SYNE2	chr14:64676751*	c.C18632T	p.T6211M	het	EDMD^sr39
37/41s	F	CM	Dom	m.f.	MTM1	chrX:149826390	c.1150 C>T	p.Q384X	het

Open in a new tab

*Already reported. For references, see Additional file 10.

Validation study of double-check pooling

To challenge MotorPlex to be applied to large studies on thousands of patients and/or to detect mosaic mutations, we designed a combinatorial pooling strategy. After some initial attempts with pools of identical sizes, we changed our strategy. The general arrangement was to have the same sample in two different independent pools, composed of two exclusive combinations of samples (Figure 2). This permitted us to identify both the rare variations and the sample mutated. In particular, the pools were organized in two groups: the “detector pool” only containing five samples (10 alleles) that had the purpose of detecting variations with the optimal sensitivity and the “scout pool” composed of 16 samples (32 alleles) that confirmed the variation(s) and attribute them univocally to distinct DNA samples (Additional file 7: Figure S2; Additional file 8, Table S6). We paid attention each time to include the index cases alone, excluding related family members.

**NGS targeting workflow.** Ninety-three disease genes causing a muscular phenotype were selected. To cover all their exons and the ten flanking bases, an enrichment strategy, based on HaloPlex system, was designed. DNA samples of 80 patients were analyzed twice in an independent manner, using a combinatorial pooling scheme. As requested by HaloPlex protocol, DNA samples were digested, barcoded and amplified. The 80 samples were run at the same time in a single lane of the flow cell of HiSeq 1000. The following data analysis allowed us to detect putative causative variants validated by Sanger sequencing.

To validate this arrangement, we selected five samples that we previously sequenced individually and called 1,235 variations. We pooled them in the same detector pool (P9) and then reanalyzed in different scout pools. Impressively, in pool P9 we called 1,232/1,235 variations belonging to the individual samples, calculating the sensitivity value at 99.8%. The three missing variations (an insertion in RRM2B and two point variants in TTN) were located in regions with lower coverage. On the contrary, no variation was called in pool 9 in addition to those of individual samples, demonstrating the absence of false positives and artefacts due to the pooling strategy. Another two samples from the training set were inserted in another two detector pools, showing similar results.

We then confirmed 223/230 (97%) variations tested by Sanger sequencing, thus providing the specificity value of the method. Moreover, the combined use of detector pools and scout pools allowed us to “clean” the results. 50% of off target variations (n=1,291), in fact, were not called in the scout pools and were easily filtered off during bionformatic analysis. In addition, about 25% of variants in low covered regions (<500 total reads), representing in a large percentage false positive calls, were similarly filtered off because they were not detected in the scout pools (Additional file 9: Figure S3).

Variants and interpretation

The targeted analysis of 93 genes showed a total of 23,109 rare variants (<0.01 frequency) in 173 patients (1.4 variants/gene/patient). To provide a preliminary interpretation in relationship with the clinical suspicion, we set bioinformatic filters that weigh the variant class (missense, indel, stopgain or stoploss), the calculated frequency in public and internal databases and the annotation as causative variants. Finally, we reconsidered critically the correspondence with the clinical presentation, the age at onset and the segregation in familial cases.

In detail, we identified 52 patients (52/177=29%) with variants of likely pathogenicity or predicted to affect function (Table 1 and Additional file 10): most of them (38/52=73%) had known or truncating variants (indel, stopgtain or stoploss). Five patients (5/52=9.6%) showed a novel variant in addition to a pathogenic allele in a recessive gene. The remaining samples (9/52=17%) had novel variants that are predicted to affect function in genes fitting with the clinical suspicion.

In other 56 samples (56/177=32%), we identified potential causative variants (Table 2 and Additional file 10). In these cases, there was only a partial correspondence with the clinical phenotype. For example, a number of variants had been previously associated with cardiomyopathy, but their pathogenic role in congenital myopathy or in LGMDs was not yet established. To the group belong patients having two rare variants in TTN gene or at least one variant in COL6A1, COL6A2, COL6A3, SYNE1, SYNE2 and FLNC genes. These molecular findings in these 56 samples were not considered strictly disease-causing and further tests are required.

Table 2.

Variants of unknown significance (Vous)

Sample ID	Sex	Clinical diagnosis	Inheritance	Histopathologic features	Variant(s)
Single7	M	LGMD/EDMD	Rec	d.f.	NEB	chr2:152468776	c.A11729G	p.D3910G	het
					NEB	chr2:152495898	c.C8890T spl.	p.R2964C	het
					COL6A2	chr21:47552071	c.2665 C>T	p.Q889X	het
Single9	M	LGMD	n.a.	m.f.	RYR1	chr19:38986923*	c.6617 C>T	p.T2206M	het	MH^sr40
Single13	M	CM	Sp	n.a.	LAMA2	chr6:129687396*	c.G4750G>A	p.G1584S	het	LGMD^sr41
					LAMA2	chr6:129775423	c.6697G>A	p.V2233I	het
					NEB	chr2:152506812	c.C7309T	p.R2437W	het
					NEB	chr2:152512781	c.T6381A	p.D2127E	het
Single14	F	LGMD	Sp	d.f.	COL6A3	chr2:238249316	c.C8243T	p.P2748L	het
Single14	F	LGMD	Sp	d.f.	COL6A3	chr2:238289767	c.A1688G	p.D563G	het
Single18	M	CM	n.a.	n.a.	HSPG2	chr1:22176684	c.7296 A>T	spl.	het
Single18	M	CM	n.a.	n.a.	HSPG2	chr1:22200473	c.3688G>A	p.G1230S	het
1/18s	M	CM	Sp	c.n.	RYR1	chr19:38990340	c.G7093A	p.G2365R	het
1/18s	M	CM	Sp	c.n.	RYR1	chr19:39018347*	c.G10747C	p.E3583Q	het	MH^sr42
2/19s	M	LGMD/DCM	Sp	d.f.	NEB	chr2:152404851	c.G20128A	p.V6710I	het
2/19s	M	LGMD/DCM	Sp	d.f.	NEB	chr2:152534216	c.C3637T	p.T1213M	het
3/17s	F	LGMD	Sp	cftdm	SYNE2	chr14:64407373	c.A121G	p.I41V	het
4/21s	M	LGMD	Sp	d.f.	MYH7	chr14:23882979*	c.A5779T	p.I1927F	het	HCM^sr43
4/21s	M	LGMD	Sp	d.f.	FLNC	chr7:128487762	c.C4300T	p.R1434C	het
5/18s	M	LGMD	n.a.	n.a.	TTN	chr2:179393000	c.107377+1G>A	spl.	het
5/18s	M	LGMD	n.a.	n.a.	TTN	chr2:179441932	c.C69130T	p.P23044S	het
5/19s	F	CM	n.a.	n.a.	TTN	chr2:179439491	c.C71368T	p.R23790C	het
5/19s	F	CM	n.a.	n.a.	TTN	chr2:179596569	c.G17033A	p.R5678Q	het
5/20s	M	LGMD	Sp	d.f.	COL6a3	chr2:238283289*	c.C3445T	p.R1149W	het	AVSD^sr44
					COL6a3	chr2:238296516	c.C1021T	p.R341C	het
					NEB	chr2:152476125	c.G10712C	p.R3571P	het
					NEB	chr2:152580847	c.A539G	p.K180R	het
6/21s	M	CM	Dom	cftdm	SYNE1	chr6:152776709	c.C2744T	p.T915I	het
6/21s	M	CM	Dom	cftdm	SYNE2	chr14:64468677	c.C3664T	p.R1222W	het
7/19s	M	CM	Sp	cftdm	COL6A3	chr2:238287746*	c.G2030A	p.R677H	het	Bethlem^sr45
7/21s	M	LGMD	Sp	normal	TTN	chr2:179500777	c.G41521A	p.D13841N	het
7/21s	M	LGMD	Sp	normal	TTN	chr2:179615278	c.T11849C	p.I3950T	het
8/20s	F	LGMD	Sp	d.f.	COL6A3	chr2:238253701	c.C7162T	p.P2388S (spl.)	het
8/21s	M	LGMD	Sp	d.f.	SMCHD1	chr18:2740713	c.C3527T	p.T1176I	het
10/18s	F	LGMD	n.a.	n.a.	RYR	chr19:39034191*	c.A11798G	p.Y3933C	het	MH^sr9
10/19s	F	LGMD	Sp	d.f.	RYR	chr19:38990359*	c.A7112G	p.E2371G	het	MH^sr31
10/21s	M	LGMD	Sp	d.f.	SMCHD1	chr18:2700849	c.C1580T	p.T527M	het
11/17s	M	LGMD	Sp	T1FP	FHL1	chrX:135278980	c.T19C	p.S7P	het
11/19s	M	LGMD	Dom	m.f.	MYH2	chr17:10446451	c.A769G	p.T257A	het
11/20s	M	LGMD	Sp	normal	FLNC	chr7:128482964	c.C2506T	p.P836S	het
12/19s	M	LGMD	Sp	d.f.	COL6A2	chr21:47545454	c.T1892C	p.F631S	het
13/18s	M	CM	Sp	cftdm and multiminicore	MYBPC2	chr11:47356715*	c.C2783T	p.S928L	het	HCM^sr46
13/18s	M	CM	Sp	cftdm and multiminicore	SYNE2	chr14:64447727	c.A1672C	p.K558Q	het
14/21s	M	LGMD	Sp	d.f.	RYR1	chr19:39076763	c.C14901G	p.D4967E	het
14/21s	M	LGMD	Sp	d.f.	RYR1	chr19:39076777	c.C14915T	p.T4972I	het
15/20s	M	LGMD	Sp	normal	LDB3	chr10:88492723	c.T2174A	p.I725N	het
15/21s	F	CM	Sp	central core	PHKA1	chrX:71840734	c.G1978A	p.V660I	het
					SYNE1	chr6:152746618	c.C5165T	p.S1722L	het
					SYNE2	chr14:64548224	c.A11410G	p.T3804A	het
23/40s	M	CM	n.a.	c.n.	TMEM43	chr3:14175304	c.C578T	p.S193L	het
23/40s	M	CM	n.a.	c.n.	MYBPC3	chr11:47364189*	c.G1564A	p.A522T	het	HCM^sr47
24/38s	M	CM	Sp	cftdm	TTN	chr2:179559591	c.G31313A	p.R10438Q	het
					TTN	chr2:179586762	c.C22628T	p.P7543L	het
					FLNC	chr7:128475627	c.C600T	p.P200P spl.	het
24/39s	M	CM	n.a.	n.a.	FLNC	chr7:128492888	c.C6011T	p.S2004F	het
24/41s	F	CM	n.a.	n.a.	TTN	chr2:179495045	c.A44204G	p.N14735S	het
24/41s	F	CM	n.a.	n.a.	TTN	chr2:179586756	c.G22634A	p.R7545Q	het
25/40s	M	CM	Sp	nemaline	FLNC	chr7:128494538	c.G6799A	p.V2267I	het
25/42s	M	CM	n.a.	cftdm	RYR1	chr19:38986890	c.C6584T	p.P2195L	het
26/39s	M	CM	Sp	core miopathy	TTN	chr2:179431924	c.T78935C	p.L26312P	het
26/39s	M	CM	Sp	core miopathy	TTN	chr2:179614124	c.A13003G	p.R4335G	het
26/41s	M	CM	n.a.	n.a.	DYSF	chr2:71740851*	c.G463A	p.G155R	het	LGMD^sr48
26/41s	M	CM	n.a.	n.a.	DYSF	chr2:71827853	c.C3724T	p.R1242C	het
26/42s	M	CM	n.a.	core miopathy	TTN	chr2:179522230	c.T38033C	p.V12678A	het
26/42s	M	CM	n.a.	core miopathy	TTN	chr2:179527095	c.C37009T	p.P12337S	het
27/39s	M	CM	Sp	cftdm	COL6A1	chr21:47406897	c.C628G	p.R210G	het
27/41s	F	CM	n.a.	cftdm	SYNE1	chr6:152746682	c.G5001T	p.A1701S (spl.)	het
27/41s	F	CM	n.a.	cftdm	SYNE2	chr14:64484328	c.G4903A	p.E1635K	het
27/42s	F	CM	n.a.	multiminicores	COL6A1	chr21:47406559	c.G548A	p.G183D	het
					MYH7	chr14:23885359	c.G4807C	p.A1603P	het
					DNM2	chr19:10909210	c.A1384G	p.T462A	het
28/40s	M	CM	n.a.	n.a.	TTN	chr2:179415978	c.G91280T	p.G30427V	het
28/40s	M	CM	n.a.	n.a.	TTN	chr2:179415952	c.C91306T	p.R30436W	het
28/41s	M	CM	Sp	d.f.	COL6A1	chr21:47410893	c.G1057A	p.G353S	het
29/38s	M	LGMD	Rec	d.f.	COL6A2	chr21:47539756	c.G1324T	p.G442W	het
29/38s	M	LGMD	Rec	d.f.	COL6A2	chr21:47551934*	c.G2528A	p.R843Q	het	AVSD^sr44
30/38s	F	CM	Sp	n.a.	TTN	chr2:179411904	c.C94348T	p.R31450C	het
30/38s	F	CM	Sp	n.a.	TTN	chr2:179428049	c.G82814A	p.G27604S	het
31/39s	M	CM	Sp	minicores	ATP7A	chrX:77301920	c.G4356C	p.L1452F	het
31/40s	F	CM	Sp	cftdm	PHKA1	chrX:71840734	c.G1978A	p.V660I	het
31/41s	M	CM	Sp	reducing body	KBTBD13	chr15:65369638	c.C485T	p.T162M	het
32/40s	M	CM	Sp	T1FP	TTN	chr2:179583104	c.C24729A	p.C8243X	het
32/40s	M	CM	Sp	T1FP	TTN	chr2:179589034	c.A21068C	p.Q7023P	het
33/38s	F	LGMD	Sp	d.f.	CNTN1	chr12:41337835	c.A1546G	p.I516V	het
34/38s	F	LGMD	Sp	d.f.	SMCHD1	chr18:2656250	c.G176T	p.C59F	het
34/41s	M	CM	n.a.	m.f.	COL6A2	chr21:47545473	c.C1911G	p.F637L	het
35/41s	M	CM	n.a.	c.n.	DYSF	chr2:71730384	c.277G>A	p.A93T	hom
35/41s	M	CM	n.a.	c.n.	TTN	chr2:179411050	c.C95008T	p.R31670X	het
36/38s	M	LGMD	Sp	d.f.	SYNE1	chr6:152651958	c.C15746T	p.T5249M	het
36/39s	F	CM	Sp	cftdm	COL6A2	chr21:47545885	c.G2156A	p.R719Q	het
36/39s	F	CM	Sp	cftdm	CPT1B	chr22:51012938	c.G767A	p.R256H	het
36/40s	M	LGMD and DCM	Sp	m.f.	SYNE2	chr14:64447788	c.A1733G	p.K578R	het
37/38s	M	LGMD	Sp	m.f.	COL6a3	chr2:238277282	c.A4824T	p.R1608S	het

Open in a new tab

* Already reported. For references, see Additional file 10.

The most surprising finding was, however, the presence of additional damaging or potential damaging variants in 16 patients of the first two groups (23/108=21%) in whom other pathogenic variants or variants of uncertain significance had already been identified. These variants, if they had been detected alone in the context of a single gene testing, would have been considered as causative.

The third group includes 26 patients (26/177=15%) in which we discovered a single truncating variant (or a known disease-associated variant) in a recessive gene that is compatible with the phenotype. The second allele may carry a RNA splicing defect that is generally not predictable by DNA sequencing or, also, a variation in not investigated promoters or regulatory regions.

Discussion

In the last decade, a remarkable progress has been made in discovering new disease genes and differentiating similar muscle disorders [1],[2]. This growing genetic heterogeneity highlights the problem of a very complex diagnosis [35]. Furthermore, genome sequencing studies suggest that the clinical genetic test may be incomplete not only when the causative mutation is missing, but also when the genotype/phenotype correlation appears weak. This is particularly true when the familial recurrence is unclear, with some relatives that only share minor affections. In families with patients who are more severely affected, this “grey area” is problematic for both genetic counselling and forthcoming mutation-specific treatments. However, this represents the proper challenge for the new genomic, high-throughput technologies: the power of discovery has been dramatically boosted by the introduction of the next-generation sequencing (NGS) techniques [13],[36]-[38]. In the NGS era, the genetic testing is going to move from few candidate genes to broader panels of genes [39] or, ultimately, to the entire genome. This will have consequences on the diagnostic flowchart: NGS tests may represent the first tier test, preceding biopsy and other invasive procedures.

We have applied both WES and targeted approaches to the diagnosis of genetic disorders of muscle and collected DNA samples of patients without diagnosis and realized that NGS technology can be helpful for clinical diagnostics, provided that a suitable tool is created. We traced an ideal profile of it. This tool should fulfil the following requirements [16],[20]: 1) to be cost-effective and thus applicable to a large number of patients and normal individuals, 2) to be robust in the terms of target reproducibility, 3) to be specific and sensitive with a limited need for further validation steps, 4) to be large enough to include all relevant genes and, finally, 5) to be easily upgradable in view of novel discoveries. Here we demonstrate the ability to generate this complex targeting and to fulfil all these requirements. We decided to use Haloplex as the enrichment technology. Haloplex first digests DNA using eight different combinations of endonucleases. Our experience suggests that this approach is more reproducible and accurate than the random mechanical DNA fragmentation. In addition, the capture is independent of the target base composition and is predictable from the probe design phase. As a proof of specificity and efficiency, we show that less than 2% of reads generated by Motorplex are off-target, in comparison with >12% of WES. This factor further improves the cost-effectiveness of the approach. This platform, based on eight different digestions and hybridization, is more accurate, reproducible and sensitive in comparison with other published methods [34]. We have designed the MotorPlex to detect variations in 93 muscle-disease genes and assayed 177 pre-screened DNA samples from myopathic patients. It is important to consider that these are all patients with zero mutations so far detected, even if most of them have been lengthily studied using a gene-by-gene sequencing approach. The high coverage and depth obtained permitted us to detect variations in most genes with sensitivity comparable with Sanger sequencing. According to our conservative NGS data interpretation, in 52 patients (29%) the diagnosis is complete. However, the detection rate will grow after a further molecular characterization of putative pathogenic variations in a second group of 56 patients. In addition, there are 26 samples (15%) that have defects in one single allele associated with a recessive condition. We predict that most of these can carry an elusive hit on the other allele such as splicing defects or copy number mutation(s). A percentage of 15%, in fact, is a usual value for disease-causing variants not detectable by sequencing.

The most interesting and quite surprising finding is, however, the very high number of rare damaging variants identified and first the cases (26/177) with more damaging variants in other genes in addition to those classified as causative. These additional variants may have a potential modifier effect. This percentage of these genetically complex patients may be higher, if we consider that many other important muscular genes (even if not disease-causing) can also carry damaging alleles. We can easily predict that a broader NGS approach could strengthen this observation. We hypothesize that the intrafamilial and interfamilial phenotypic differences may be frequently related to the combinations of multiple disease-causing alleles, more than to SNPs or CNVs. The so-called “modifier gene variants” could be individually rare, but collectively common. A comprehensive view of all the genes involved in a pathological process helps to point out these alleles having a minor but probably not negligible role in the disease aetiology.

The ultimate goal of MotorPlex is given by the pooling performances. The specificity and sensitivity values are very high and quite similar to those obtained in singleton testing and, above all, the diagnostic rate is not affected. The potential applications of pooling are just in large studies of complex and non-Mendelian disorders when a large number of samples have to be analyzed to improve the statistical power [40]. Considering our finding of multiple damaging variants in disease genes, these large studies are just around the corner. In addition, MotorPlex may discover low-allelic fraction variants in single samples, as in somatic mosaicisms. The pooled MotorPlex is likewise the cheapest genetic test (Table 3) ever presented that is able to screen 93 complex conditions at the cost of a few PCR reactions.

Table 3.

Predicted enrichment costs and workload for single and pooled DNA samples

Technical step	Cost (€)
	Single	PoolSeq
Haloplex Kit (96 samples)	16240,83	4263,22
Polymerase	86	22,575
AMPure XP beads	400	105
Validation and quantification of enriched target DNA	386,8	101,5
Total (total per sample)	17113.63 (213.92)	4492.29 (56.15)
Run Time	Total Time (h)
	Single	PoolSeq
Enrichment procedure	4days	1day

Open in a new tab

In conclusion, we here demonstrate that MotorPlex can be used to identify accurately all DNA variants also in huge muscle genes: the platform overcomes for sensitivity and coverage the WES approach. In addition, Pool-Seq may be the first option to perform cost-effective population studies to understand polygenic conditions. We think that similar protocols could be designed to extend the NGS applications to other studies for human genetics, as well as for disease prevention, nutrition, forensics and many others.

Additional files

Electronic supplementary material

40478_2014_9100_MOESM1_ESM.xls^{(31.5KB, xls)}

Additional file 1: Table S1.: List of genes. (XLS 32 KB)

40478_2014_9100_MOESM2_ESM.xls^{(29.5KB, xls)}

Additional file 2: Table S2.: Training set samples. (XLS 30 KB)

40478_2014_9100_MOESM3_ESM.xls^{(23KB, xls)}

Additional file 3: Table S4.: Run Statistics of PoolSeq experiments. (XLS 23 KB)

40478_2014_9100_MOESM4_ESM.ppt^{(334.5KB, ppt)}

Additional file 4: Figure S1.: Coverage comparison. (PPT 334 KB)

40478_2014_9100_MOESM5_ESM.xls^{(32KB, xls)}

Additional file 5: Table S3.: Control variants. (XLS 32 KB)

40478_2014_9100_MOESM6_ESM.xls^{(21.5KB, xls)}

Additional file 6: Table S5.: Summary of variants identified in training set samples. (XLS 22 KB)

40478_2014_9100_MOESM7_ESM.ppt^{(631KB, ppt)}

Additional file 7: Figure S2.: Pooling strategy. (PPT 631 KB)

40478_2014_9100_MOESM8_ESM.xls^{(39.5KB, xls)}

Additional file 8: Table S6.: Summary of rare variants identified in PoolSeq experiments. (XLS 40 KB)

40478_2014_9100_MOESM9_ESM.ppt^{(144.5KB, ppt)}

Additional file 9: Figure S3.: Scout pools help filtering results. (PPT 144 KB)

40478_2014_9100_MOESM10_ESM.doc^{(27KB, doc)}

Additional file 10: List of references for Table 1 and Table 2. (DOC 27 KB)

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1^{(25.4KB, gif)}

Authors’ original file for figure 2^{(31.8KB, gif)}

Acknowledgements

We are grateful to Manuela Dionisi for the NGS, Anna Cuomo and Rosalba Erpice for the Sanger sequence analyses and Mario Guarracino for helpful discussion and suggestions. This study was entirely supported by grants from Telethon, Italy (TGM11Z06 to V.N.) and Telethon-UILDM (Unione Italiana Lotta alla Distrofia Muscolare) (GUP 10006 to G.P.C. and V.N., GUP11006 to V.N. and GUP08005 to C.B.). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The authors have declared that no competing interests exist.

Abbreviations

NGS: Next-generation sequencing
WES: Whole-exome sequencing
WGS: Whole-genome sequencing
IGV: Integrative genomics viewer
CM: Congenital myopathies
LGMD: Limb-girdle muscular dystrophy
FSHD: Facioscapulohumeral muscular dystrophy
EDMD: Emery-Dreifuss muscular dystrophy
DCM: Dilated cardiomyopathy
MH: Malignant hyperthermia
ARVD: Arrhythmogenic right ventricular cardiomyopathy
CMD: Congenital Muscular Dystrophy
HCM: Hypertrophic cardiomyopathy
VLCAD: Very long chain acyl-CoA dehydrogenase deficiency
CCD: Central core disease
AVSD: Atrioventricular septal defect
n.a.: not available
Sp: Sporadic
Rec: Recessive
Dom: Dominant
c.n.: Central nuclei
m.f.: Myopathic features
d.f.: Dystrophic features
cftdm: Congenital Myopathy with Fiber-Type Disproportion
T1FP: Type 1 fiber predominance

Footnotes

Competing interests

The authors declare that they have no competing interests.

Contributor Information

Marco Savarese, Email: msavarese@tigem.it.

Giuseppina Di Fruscio, Email: giuseppinadifruscio@virgilio.it.

Margherita Mutarelli, Email: mutarelli@tigem.it.

Annalaura Torella, Email: annalaura.torella@unina2.it.

Francesca Magri, Email: francescam.magri@gmail.com.

Filippo Maria Santorelli, Email: filippo3364@gmail.com.

Giacomo Pietro Comi, Email: giacomo.comi@unimi.it.

Claudio Bruno, Email: claudio2246@gmail.com.

Vincenzo Nigro, Email: vinnigro@gmail.com.

References

1.Kaplan JC, Hamroun D. The 2014 version of the gene table of monogenic neuromuscular disorders (nuclear genome) Neuromuscul Disord. 2013;23:1081–1111. doi: 10.1016/j.nmd.2013.10.006. [DOI] [PubMed] [Google Scholar]
2.Nigro V, Savarese M. Genetic basis of limb-girdle muscular dystrophies: the 2014 update. Acta Myol. 2014;33:1–12. [PMC free article] [PubMed] [Google Scholar]
3.Nigro V, Aurino S, Piluso G. Limb girdle muscular dystrophies: update on genetic diagnosis and therapeutic approaches. Curr Opin Neurol. 2011;24:429–436. doi: 10.1097/WCO.0b013e32834aa38d. [DOI] [PubMed] [Google Scholar]
4.Mercuri E, Muntoni F. Muscular dystrophies. Lancet. 2013;381:845–860. doi: 10.1016/S0140-6736(12)61897-2. [DOI] [PubMed] [Google Scholar]
5.Leung DG, Wagner KR. Therapeutic advances in muscular dystrophy. Ann Neurol. 2013;74:404–411. doi: 10.1002/ana.23989. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Sacconi S, Lemmers RJ, Balog J, van der Vliet PJ, Lahaut P, van Nieuwenhuizen MP, Straasheijm KR, Debipersad RD, Vos-Versteeg M, Salviati L, Casarin A, Pegoraro E, Tawil R, Bakker E, Tapscott SJ, Desnuelle C, van der Maarel SM. The FSHD2 gene SMCHD1 is a modifier of disease severity in families affected by FSHD1. Am J Hum Genet. 2013;93:744–751. doi: 10.1016/j.ajhg.2013.08.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Nigro V, Piluso G. Next generation sequencing (NGS) strategies for the genetic testing of myopathies. Acta Myol. 2012;31:196–200. [PMC free article] [PubMed] [Google Scholar]
8.Metzker ML. Sequencing technologies - the next generation. Nat Rev Genet. 2009;11:31–46. doi: 10.1038/nrg2626. [DOI] [PubMed] [Google Scholar]
9.Yu Y, Wu BL, Wu J, Shen Y. Exome and whole-genome sequencing as clinical tests: a transformative practice in molecular diagnostics. Clin Chem. 2012;58:1507–1509. doi: 10.1373/clinchem.2012.193128. [DOI] [PubMed] [Google Scholar]
10.Beaulieu CL, Majewski J, Schwartzentruber J, Samuels ME, Fernandez BA, Bernier FP, Brudno M, Knoppers B, Marcadier J, Dyment D, Adam S, Bulman DE, Jones SJ, Avard D, Nguyen MT, Rousseau F, Marshall C, Wintle RF, Shen Y, Scherer SW, Canada Consortium FORGE, Friedman JM, Michaud JL, Boycott KM. FORGE Canada Consortium: outcomes of a 2-year national rare-disease gene-discovery project. Am J Hum Genet. 2014;94:809–817. doi: 10.1016/j.ajhg.2014.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Gilissen C, Hoischen A, Brunner HG, Veltman JA. Disease gene identification strategies for exome sequencing. Eur J Hum Genet. 2012;20:490–497. doi: 10.1038/ejhg.2011.258. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Roscioli T, Kamsteeg EJ, Buysse K, Maystadt I, van Reeuwijk J, van den Elzen C, van Beusekom E, Riemersma M, Pfundt R, Vissers LE, Schraders M, Altunoglu U, Buckley MF, Brunner HG, Grisart B, Zhou H, Veltman JA, Gilissen C, Mancini GM, Delrée P, Willemsen MA, Ramadža DP, Chitayat D, Bennett C, Sheridan E, Peeters EA, Tan-Sindhunata GM, de Die-Smulders CE, Devriendt K, Kayserili H, El-Hashash OA. Mutations in ISPD cause Walker-Warburg syndrome and defective glycosylation of alpha-dystroglycan. Nat Genet. 2012;44:581–585. doi: 10.1038/ng.2253. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Torella A, Fanin M, Mutarelli M, Peterle E, Del Vecchio Blanco F, Rispoli R, Savarese M, Garofalo A, Piluso G, Morandi L, Ricci G, Siciliano G, Angelini C, Nigro V. Next-generation sequencing identifies transportin 3 as the causative gene for LGMD1F. PLoS One. 2013;8:e63536. doi: 10.1371/journal.pone.0063536. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Harms MB, Sommerville RB, Allred P, Bell S, Ma D, Cooper P, Lopate G, Pestronk A, Weihl CC, Baloh RH. Exome sequencing reveals DNAJB6 mutations in dominantly-inherited myopathy. Ann Neurol. 2012;71:407–416. doi: 10.1002/ana.22683. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Johnston JJ, Rubinstein WS, Facio FM, Ng D, Singh LN, Teer JK, Mullikin JC, Biesecker LG. Secondary variants in individuals undergoing exome sequencing: screening of 572 individuals identifies high-penetrance mutations in cancer-susceptibility genes. Am J Hum Genet. 2012;91:97–108. doi: 10.1016/j.ajhg.2012.05.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Rehm HL, Bale SJ, Bayrak-Toydemir P, Berg JS, Brown KK, Deignan JL, Friez MJ, Funke BH, Hegde MR, Lyon E. ACMG clinical laboratory standards for next-generation sequencing. Genet Med. 2013;15:733–747. doi: 10.1038/gim.2013.92. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Yang Y, Muzny DM, Reid JG, Bainbridge MN, Willis A, Ward PA, Braxton A, Beuten J, Xia F, Niu Z, Hardison M, Person R, Bekheirnia MR, Leduc MS, Kirby A, Pham P, Scull J, Wang M, Ding Y, Plon SE, Lupski JR, Beaudet AL, Gibbs RA, Eng CM. Clinical whole-exome sequencing for the diagnosis of mendelian disorders. N Engl J Med. 2013;369:1502–1511. doi: 10.1056/NEJMoa1306555. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Rehm HL. Disease-targeted sequencing: a cornerstone in the clinic. Nat Rev Genet. 2013;14:295–300. doi: 10.1038/nrg3463. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Berglund EC, Lindqvist CM, Hayat S, Övernäs E, Henriksson N, Nordlund J, Wahlberg P, Forestier E, Lönnerholm G, Syvänen AC. Accurate detection of subclonal single nucleotide variants in whole genome amplified and pooled cancer samples using HaloPlex target enrichment. BMC Genomics. 2013;14:856. doi: 10.1186/1471-2164-14-856. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Desai AN, Jere A. Next-generation sequencing: ready for the clinics? Clin Genet. 2012;81:503–510. doi: 10.1111/j.1399-0004.2012.01865.x. [DOI] [PubMed] [Google Scholar]
21.Mutarelli M, Marwah VS, R R, Carrella D, Dharmalingam G, Oliva G, di Bernardo D (2014) A community-based resource for automatic exome variant-calling and annotation in Mendelian disorders. BMC Genomics 15: doi: 10.1186/1471–2164–15-s3-s5 [DOI] [PMC free article] [PubMed]
22.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.DePristo MA, Banks E, Poplin R, Poplin R, Garimella KV, Maguire JR, Hartl C, Philippakis AA, del Angel G, Rivas MA, Hanna M, McKenna A, Fennell TJ, Kernytsky AM, Sivachenko AY, Cibulskis K, Gabriel SB, Altshuler D, Daly MJ. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011;43:491–498. doi: 10.1038/ng.806. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38:e164. doi: 10.1093/nar/gkq603. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Pruitt KD, Tatusova T, Brown GR, Maglott DR. NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy. Nucleic Acids Res. 2012;40:D130–135. doi: 10.1093/nar/gkr1079. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, Sirotkin K. dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 2001;29:308–311. doi: 10.1093/nar/29.1.308. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, Gibbs RA, Hurles ME, McVean GA. A map of human genome variation from population-scale sequencing. Nature. 2010;467:1061–1073. doi: 10.1038/nature09534. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Pollard KS, Hubisz MJ, Rosenbloom KR, Siepel A. Detection of nonneutral substitution rates on mammalian phylogenies. Genome Res. 2010;20:110–121. doi: 10.1101/gr.097857.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Goode DL, Cooper GM, Schmutz J, Dickson M, Gonzales E, Tsai M, Karra K, Davydov E, Batzoglou S, Myers RM, Sidow A. Evolutionary constraint facilitates interpretation of genetic variation in resequenced human genomes. Genome Res. 2010;20:301–310. doi: 10.1101/gr.102210.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Liu X, Jian X, Boerwinkle E. dbNSFP: a lightweight database of human nonsynonymous SNPs and their functional predictions. Hum Mutat. 2011;32:894–899. doi: 10.1002/humu.21517. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Schwarz JM, Rodelsperger C, Schuelke M, Seelow D. MutationTaster evaluates disease-causing potential of sequence alterations. Nat Methods. 2010;7:575–576. doi: 10.1038/nmeth0810-575. [DOI] [PubMed] [Google Scholar]
32.Adzhubei I, Jordan DM, Sunyaev SR (2013) Predicting functional effect of human missense mutations using PolyPhen-2. Curr Protoc Hum Genet, Chapter 7: Unit7 20 [DOI] [PMC free article] [PubMed]
33.Kumar P, Henikoff S, Ng PC. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat Protoc. 2009;4:1073–1081. doi: 10.1038/nprot.2009.86. [DOI] [PubMed] [Google Scholar]
34.Thorvaldsdottir H, Robinson JT, Mesirov JP. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform. 2012;14:178–192. doi: 10.1093/bib/bbs017. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Laing NG. Genetics of neuromuscular disorders. Crit Rev Clin Lab Sci. 2012;49:33–48. doi: 10.3109/10408363.2012.658906. [DOI] [PubMed] [Google Scholar]
36.Cirak S, Foley AR, Herrmann R, Willer T, Yau S, Stevens E, Torelli S, Brodd L, Kamynina A, Vondracek P, Roper H, Longman C, Korinthenberg R, Marrosu G, Nürnberg P, Michele DE, Plagnol V, Hurles M, Moore SA, Sewry CA, Campbell KP, Voit T, Muntoni F. ISPD gene mutations are a common cause of congenital and limb-girdle muscular dystrophies. Brain. 2013;136:269–281. doi: 10.1093/brain/aws312. [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Logan CV, Lucke B, Pottinger C, Abdelhamed ZA, Parry DA, Szymanska K, Diggle CP, van Riesen A, Morgan JE, Markham G, Ellis I, Manzur AY, Markham AF, Shires M, Helliwell T, Scoto M, Hübner C, Bonthron DT, Taylor GR, Sheridan E, Muntoni F, Carr IM, Schuelke M, Johnson CA. Mutations in MEGF10, a regulator of satellite cell myogenesis, cause early onset myopathy, areflexia, respiratory distress and dysphagia (EMARDD) Nat Genet. 2011;43:1189–1192. doi: 10.1038/ng.995. [DOI] [PubMed] [Google Scholar]
38.Sarparanta J, Jonson PH, Golzio C, Sandell S, Luque H, Screen M, McDonald K, Stajich JM, Mahjneh I, Vihola A, Raheem O, Penttilä S, Lehtinen S, Huovinen S, Palmio J, Tasca G, Ricci E, Hackman P, Hauser M, Katsanis N, Udd B. Mutations affecting the cytoplasmic functions of the co-chaperone DNAJB6 cause limb-girdle muscular dystrophy. Nat Genet. 2012;44:450–455. doi: 10.1038/ng.1103. [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Vasli N, Böhm J, Le Gras S, Muller J, Pizot C, Jost B, Echaniz-Laguna A, Laugel V, Tranchant C, Bernard R, Plewniak F, Vicaire S, Levy N, Chelly J, Mandel JL, Biancalana V, Laporte J. Next generation sequencing for molecular diagnosis of neuromuscular diseases. Acta Neuropathol. 2012;124:273–283. doi: 10.1007/s00401-012-0982-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Chen X, Listman JB, Slack FJ, Gelernter J, Zhao H. Biases and errors on allele frequency estimation and disease association tests of next-generation sequencing of pooled samples. Genet Epidemiol. 2012;36:549–560. doi: 10.1002/gepi.21648. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

40478_2014_9100_MOESM1_ESM.xls^{(31.5KB, xls)}

Additional file 1: Table S1.: List of genes. (XLS 32 KB)

40478_2014_9100_MOESM2_ESM.xls^{(29.5KB, xls)}

Additional file 2: Table S2.: Training set samples. (XLS 30 KB)

40478_2014_9100_MOESM3_ESM.xls^{(23KB, xls)}

Additional file 3: Table S4.: Run Statistics of PoolSeq experiments. (XLS 23 KB)

40478_2014_9100_MOESM4_ESM.ppt^{(334.5KB, ppt)}

Additional file 4: Figure S1.: Coverage comparison. (PPT 334 KB)

40478_2014_9100_MOESM5_ESM.xls^{(32KB, xls)}

Additional file 5: Table S3.: Control variants. (XLS 32 KB)

40478_2014_9100_MOESM6_ESM.xls^{(21.5KB, xls)}

Additional file 6: Table S5.: Summary of variants identified in training set samples. (XLS 22 KB)

40478_2014_9100_MOESM7_ESM.ppt^{(631KB, ppt)}

Additional file 7: Figure S2.: Pooling strategy. (PPT 631 KB)

40478_2014_9100_MOESM8_ESM.xls^{(39.5KB, xls)}

Additional file 8: Table S6.: Summary of rare variants identified in PoolSeq experiments. (XLS 40 KB)

40478_2014_9100_MOESM9_ESM.ppt^{(144.5KB, ppt)}

Additional file 9: Figure S3.: Scout pools help filtering results. (PPT 144 KB)

40478_2014_9100_MOESM10_ESM.doc^{(27KB, doc)}

Additional file 10: List of references for Table 1 and Table 2. (DOC 27 KB)

Authors’ original file for figure 1^{(25.4KB, gif)}

Authors’ original file for figure 2^{(31.8KB, gif)}

[CR1] 1.Kaplan JC, Hamroun D. The 2014 version of the gene table of monogenic neuromuscular disorders (nuclear genome) Neuromuscul Disord. 2013;23:1081–1111. doi: 10.1016/j.nmd.2013.10.006. [DOI] [PubMed] [Google Scholar]

[CR2] 2.Nigro V, Savarese M. Genetic basis of limb-girdle muscular dystrophies: the 2014 update. Acta Myol. 2014;33:1–12. [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.Nigro V, Aurino S, Piluso G. Limb girdle muscular dystrophies: update on genetic diagnosis and therapeutic approaches. Curr Opin Neurol. 2011;24:429–436. doi: 10.1097/WCO.0b013e32834aa38d. [DOI] [PubMed] [Google Scholar]

[CR4] 4.Mercuri E, Muntoni F. Muscular dystrophies. Lancet. 2013;381:845–860. doi: 10.1016/S0140-6736(12)61897-2. [DOI] [PubMed] [Google Scholar]

[CR5] 5.Leung DG, Wagner KR. Therapeutic advances in muscular dystrophy. Ann Neurol. 2013;74:404–411. doi: 10.1002/ana.23989. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] 6.Sacconi S, Lemmers RJ, Balog J, van der Vliet PJ, Lahaut P, van Nieuwenhuizen MP, Straasheijm KR, Debipersad RD, Vos-Versteeg M, Salviati L, Casarin A, Pegoraro E, Tawil R, Bakker E, Tapscott SJ, Desnuelle C, van der Maarel SM. The FSHD2 gene SMCHD1 is a modifier of disease severity in families affected by FSHD1. Am J Hum Genet. 2013;93:744–751. doi: 10.1016/j.ajhg.2013.08.004. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR7] 7.Nigro V, Piluso G. Next generation sequencing (NGS) strategies for the genetic testing of myopathies. Acta Myol. 2012;31:196–200. [PMC free article] [PubMed] [Google Scholar]

[CR8] 8.Metzker ML. Sequencing technologies - the next generation. Nat Rev Genet. 2009;11:31–46. doi: 10.1038/nrg2626. [DOI] [PubMed] [Google Scholar]

[CR9] 9.Yu Y, Wu BL, Wu J, Shen Y. Exome and whole-genome sequencing as clinical tests: a transformative practice in molecular diagnostics. Clin Chem. 2012;58:1507–1509. doi: 10.1373/clinchem.2012.193128. [DOI] [PubMed] [Google Scholar]

[CR10] 10.Beaulieu CL, Majewski J, Schwartzentruber J, Samuels ME, Fernandez BA, Bernier FP, Brudno M, Knoppers B, Marcadier J, Dyment D, Adam S, Bulman DE, Jones SJ, Avard D, Nguyen MT, Rousseau F, Marshall C, Wintle RF, Shen Y, Scherer SW, Canada Consortium FORGE, Friedman JM, Michaud JL, Boycott KM. FORGE Canada Consortium: outcomes of a 2-year national rare-disease gene-discovery project. Am J Hum Genet. 2014;94:809–817. doi: 10.1016/j.ajhg.2014.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR11] 11.Gilissen C, Hoischen A, Brunner HG, Veltman JA. Disease gene identification strategies for exome sequencing. Eur J Hum Genet. 2012;20:490–497. doi: 10.1038/ejhg.2011.258. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Roscioli T, Kamsteeg EJ, Buysse K, Maystadt I, van Reeuwijk J, van den Elzen C, van Beusekom E, Riemersma M, Pfundt R, Vissers LE, Schraders M, Altunoglu U, Buckley MF, Brunner HG, Grisart B, Zhou H, Veltman JA, Gilissen C, Mancini GM, Delrée P, Willemsen MA, Ramadža DP, Chitayat D, Bennett C, Sheridan E, Peeters EA, Tan-Sindhunata GM, de Die-Smulders CE, Devriendt K, Kayserili H, El-Hashash OA. Mutations in ISPD cause Walker-Warburg syndrome and defective glycosylation of alpha-dystroglycan. Nat Genet. 2012;44:581–585. doi: 10.1038/ng.2253. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR13] 13.Torella A, Fanin M, Mutarelli M, Peterle E, Del Vecchio Blanco F, Rispoli R, Savarese M, Garofalo A, Piluso G, Morandi L, Ricci G, Siciliano G, Angelini C, Nigro V. Next-generation sequencing identifies transportin 3 as the causative gene for LGMD1F. PLoS One. 2013;8:e63536. doi: 10.1371/journal.pone.0063536. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Harms MB, Sommerville RB, Allred P, Bell S, Ma D, Cooper P, Lopate G, Pestronk A, Weihl CC, Baloh RH. Exome sequencing reveals DNAJB6 mutations in dominantly-inherited myopathy. Ann Neurol. 2012;71:407–416. doi: 10.1002/ana.22683. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Johnston JJ, Rubinstein WS, Facio FM, Ng D, Singh LN, Teer JK, Mullikin JC, Biesecker LG. Secondary variants in individuals undergoing exome sequencing: screening of 572 individuals identifies high-penetrance mutations in cancer-susceptibility genes. Am J Hum Genet. 2012;91:97–108. doi: 10.1016/j.ajhg.2012.05.021. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Rehm HL, Bale SJ, Bayrak-Toydemir P, Berg JS, Brown KK, Deignan JL, Friez MJ, Funke BH, Hegde MR, Lyon E. ACMG clinical laboratory standards for next-generation sequencing. Genet Med. 2013;15:733–747. doi: 10.1038/gim.2013.92. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.Yang Y, Muzny DM, Reid JG, Bainbridge MN, Willis A, Ward PA, Braxton A, Beuten J, Xia F, Niu Z, Hardison M, Person R, Bekheirnia MR, Leduc MS, Kirby A, Pham P, Scull J, Wang M, Ding Y, Plon SE, Lupski JR, Beaudet AL, Gibbs RA, Eng CM. Clinical whole-exome sequencing for the diagnosis of mendelian disorders. N Engl J Med. 2013;369:1502–1511. doi: 10.1056/NEJMoa1306555. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR18] 18.Rehm HL. Disease-targeted sequencing: a cornerstone in the clinic. Nat Rev Genet. 2013;14:295–300. doi: 10.1038/nrg3463. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR19] 19.Berglund EC, Lindqvist CM, Hayat S, Övernäs E, Henriksson N, Nordlund J, Wahlberg P, Forestier E, Lönnerholm G, Syvänen AC. Accurate detection of subclonal single nucleotide variants in whole genome amplified and pooled cancer samples using HaloPlex target enrichment. BMC Genomics. 2013;14:856. doi: 10.1186/1471-2164-14-856. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Desai AN, Jere A. Next-generation sequencing: ready for the clinics? Clin Genet. 2012;81:503–510. doi: 10.1111/j.1399-0004.2012.01865.x. [DOI] [PubMed] [Google Scholar]

[CR21] 21.Mutarelli M, Marwah VS, R R, Carrella D, Dharmalingam G, Oliva G, di Bernardo D (2014) A community-based resource for automatic exome variant-calling and annotation in Mendelian disorders. BMC Genomics 15: doi: 10.1186/1471–2164–15-s3-s5 [DOI] [PMC free article] [PubMed]

[CR22] 22.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.DePristo MA, Banks E, Poplin R, Poplin R, Garimella KV, Maguire JR, Hartl C, Philippakis AA, del Angel G, Rivas MA, Hanna M, McKenna A, Fennell TJ, Kernytsky AM, Sivachenko AY, Cibulskis K, Gabriel SB, Altshuler D, Daly MJ. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011;43:491–498. doi: 10.1038/ng.806. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] 24.Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38:e164. doi: 10.1093/nar/gkq603. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR25] 25.Pruitt KD, Tatusova T, Brown GR, Maglott DR. NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy. Nucleic Acids Res. 2012;40:D130–135. doi: 10.1093/nar/gkr1079. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR26] 26.Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, Sirotkin K. dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 2001;29:308–311. doi: 10.1093/nar/29.1.308. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR27] 27.Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, Gibbs RA, Hurles ME, McVean GA. A map of human genome variation from population-scale sequencing. Nature. 2010;467:1061–1073. doi: 10.1038/nature09534. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR28] 28.Pollard KS, Hubisz MJ, Rosenbloom KR, Siepel A. Detection of nonneutral substitution rates on mammalian phylogenies. Genome Res. 2010;20:110–121. doi: 10.1101/gr.097857.109. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR29] 29.Goode DL, Cooper GM, Schmutz J, Dickson M, Gonzales E, Tsai M, Karra K, Davydov E, Batzoglou S, Myers RM, Sidow A. Evolutionary constraint facilitates interpretation of genetic variation in resequenced human genomes. Genome Res. 2010;20:301–310. doi: 10.1101/gr.102210.109. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR30] 30.Liu X, Jian X, Boerwinkle E. dbNSFP: a lightweight database of human nonsynonymous SNPs and their functional predictions. Hum Mutat. 2011;32:894–899. doi: 10.1002/humu.21517. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR31] 31.Schwarz JM, Rodelsperger C, Schuelke M, Seelow D. MutationTaster evaluates disease-causing potential of sequence alterations. Nat Methods. 2010;7:575–576. doi: 10.1038/nmeth0810-575. [DOI] [PubMed] [Google Scholar]

[CR32] 32.Adzhubei I, Jordan DM, Sunyaev SR (2013) Predicting functional effect of human missense mutations using PolyPhen-2. Curr Protoc Hum Genet, Chapter 7: Unit7 20 [DOI] [PMC free article] [PubMed]

[CR33] 33.Kumar P, Henikoff S, Ng PC. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat Protoc. 2009;4:1073–1081. doi: 10.1038/nprot.2009.86. [DOI] [PubMed] [Google Scholar]

[CR34] 34.Thorvaldsdottir H, Robinson JT, Mesirov JP. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform. 2012;14:178–192. doi: 10.1093/bib/bbs017. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR35] 35.Laing NG. Genetics of neuromuscular disorders. Crit Rev Clin Lab Sci. 2012;49:33–48. doi: 10.3109/10408363.2012.658906. [DOI] [PubMed] [Google Scholar]

[CR36] 36.Cirak S, Foley AR, Herrmann R, Willer T, Yau S, Stevens E, Torelli S, Brodd L, Kamynina A, Vondracek P, Roper H, Longman C, Korinthenberg R, Marrosu G, Nürnberg P, Michele DE, Plagnol V, Hurles M, Moore SA, Sewry CA, Campbell KP, Voit T, Muntoni F. ISPD gene mutations are a common cause of congenital and limb-girdle muscular dystrophies. Brain. 2013;136:269–281. doi: 10.1093/brain/aws312. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR37] 37.Logan CV, Lucke B, Pottinger C, Abdelhamed ZA, Parry DA, Szymanska K, Diggle CP, van Riesen A, Morgan JE, Markham G, Ellis I, Manzur AY, Markham AF, Shires M, Helliwell T, Scoto M, Hübner C, Bonthron DT, Taylor GR, Sheridan E, Muntoni F, Carr IM, Schuelke M, Johnson CA. Mutations in MEGF10, a regulator of satellite cell myogenesis, cause early onset myopathy, areflexia, respiratory distress and dysphagia (EMARDD) Nat Genet. 2011;43:1189–1192. doi: 10.1038/ng.995. [DOI] [PubMed] [Google Scholar]

[CR38] 38.Sarparanta J, Jonson PH, Golzio C, Sandell S, Luque H, Screen M, McDonald K, Stajich JM, Mahjneh I, Vihola A, Raheem O, Penttilä S, Lehtinen S, Huovinen S, Palmio J, Tasca G, Ricci E, Hackman P, Hauser M, Katsanis N, Udd B. Mutations affecting the cytoplasmic functions of the co-chaperone DNAJB6 cause limb-girdle muscular dystrophy. Nat Genet. 2012;44:450–455. doi: 10.1038/ng.1103. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR39] 39.Vasli N, Böhm J, Le Gras S, Muller J, Pizot C, Jost B, Echaniz-Laguna A, Laugel V, Tranchant C, Bernard R, Plewniak F, Vicaire S, Levy N, Chelly J, Mandel JL, Biancalana V, Laporte J. Next generation sequencing for molecular diagnosis of neuromuscular diseases. Acta Neuropathol. 2012;124:273–283. doi: 10.1007/s00401-012-0982-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR40] 40.Chen X, Listman JB, Slack FJ, Gelernter J, Zhao H. Biases and errors on allele frequency estimation and disease association tests of next-generation sequencing of pooled samples. Genet Epidemiol. 2012;36:549–560. doi: 10.1002/gepi.21648. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

MotorPlex provides accurate variant detection across large muscle genes both in single myopathic patients and in pools of DNA samples

Marco Savarese

Giuseppina Di Fruscio

Margherita Mutarelli

Annalaura Torella

Francesca Magri

Filippo Maria Santorelli

Giacomo Pietro Comi

Claudio Bruno

Vincenzo Nigro

Abstract

Electronic supplementary material

Introduction

Materials and methods

Patients

In silico design of MotorPlex

NGS workflow

Targeted sequencing analysis

Results

Validation study of MotorPlex

Figure 1.

Table 1.

Validation study of double-check pooling

Figure 2.

Variants and interpretation

Table 2.

Discussion

Table 3.

Additional files

Electronic supplementary material

Acknowledgements

Abbreviations

Footnotes

Contributor Information

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases