Skip to main content
iScience logoLink to iScience
. 2023 Oct 30;26(12):108365. doi: 10.1016/j.isci.2023.108365

Uncovering the true features of dystrophin gene rearrangement and improving the molecular diagnosis of Duchenne and Becker muscular dystrophies

Chao Ling 1,3,5, Yi Dai 2,5,6,, Chang Geng 2, Shirang Pan 4, Weipeng Quan 4, Qingyun Ding 2, Xunzhe Yang 2, Dongchao Shen 2, Qing Tao 4, Jingjing Li 4, Jia Li 2, Yinbing Wang 2, Shan Jiang 2, Yang Wang 4, Lin Chen 2, Liying Cui 2, Depeng Wang 4
PMCID: PMC10690541  PMID: 38047063

Summary

Duchenne and Becker muscular dystrophies (DMD/BMD) are caused by complex mutations in the dystrophin gene (DMD). Currently, there is no integrative method for the precise detection of all potential DMD variants, a gap which we aimed to address using long-read sequencing. The captured long-read sequencing panel developed in this study was applied to 129 subjects, including 11 who had previously unsolved cases. The results showed that this method accurately detected DMD mutations, ranging from single-nucleotide variations to structural variations. Furthermore, our findings revealed that continuous exon duplication/deletion in the DMD/BMD cohort may be attributed to complex segmental rearrangements and that noncontiguous duplication/deletion is generally attributed to intragenic inversion or interchromosome translocation. Mutations in the deep introns were confirmed to produce a pseudoexon. Moreover, variations in female carriers were precisely identified. The integrated and precise DMD gene screening method proposed in this study could improve the molecular diagnosis of DMD/BMD.

Subject areas: Health sciences, Clinical genetics, Pediatrics

Graphical abstract

graphic file with name fx1.jpg

Highlights

  • Long-read sequencing can detect DMD mutations that involve both SNVs and complex SVs

  • Non-contiguous changes due to intragenic inversion or interchromosome translocation

  • Deep intron mutations generate pseudoexons, advancing DMD/BMD genetics

  • The findings could enhance diagnoses and inform clinicians and researchers


Health sciences; Clinical genetics; Pediatrics

Introduction

Molecular genetic studies have indicated that both Duchenne (OMIM: 310200) muscular dystrophy (DMD) and Becker (OMIM: 300376) muscular dystrophy (BMD) are caused by mutations in the dystrophin gene (DMD).1 In male probands, hemizygous pathogenic variants are frequently identified in DMD. In contrast, females are usually asymptomatic carriers; however, some female carriers show mild skeletal muscle and/or cardiac symptoms of DMD/BMD and are referred to as manifesting carriers. In patients with DMD/BMD, 68–70% of mutations are single or multiple exon deletions, and 8–11% are duplications of the DMD gene. Additionally, 18–20% were small mutations, including insertions, deletions, and single-nucleotide substitutions in exons and introns.2,3,4 In the case of patients with DMD/BMD, diverse methods for causative mutation investigation have been employed, of which multiplex ligation-dependent probe amplification (MLPA) is commonly and preferentially applied to simultaneously screen all 79 exons of DMD for deletions/duplications.5 However, single exon deletion still requires additional validation owing to the probe binding problem. Moreover, MLPA is not suitable for detecting single nucleotide variants (SNVs) and small indels.6 High-throughput sequencing (short-read sequencing) technologies are widely applied to call SNVs, microdeletions, and microduplications. Depending on the read count, breakpoints for most of the deletions can be identified; however, the generation of accurate results for duplications or complex structural variants is difficult owing to the limitation of read length (100–300 bp).7,8,9 Nanochannel-based next-generation mapping enables the sensitive detection of pathogenic structural variations (SVs), which can be missed by PCR-based techniques or chromosomal microarrays; however, the resolution of breakpoints is limited to endonuclease nicking site density.10 Approximately 2–7% of patients with DMD/BMD cannot be diagnosed using MLPA or high-throughput sequencing/mapping methods, which may be attributed to deep intronic variations, including microindels, substitutions, and large-scale deletions and duplications.3,11,12,13 RNA studies have shown that deep intronic mutations can create new splicing sites and generate pseudoexons.13,14,15 Thus, RNA analysis of muscle tissue using RT-PCR or short-read sequencing could serve as a genetic diagnostic tool for these rare mutations; however, this method requires a highly invasive muscle biopsy procedure. Moreover, clinicians usually require further investigations of DNA mutations for gene therapy and prenatal diagnosis. Therefore, we believe that a method that can obtain an accurate and full spectrum of mutation types of the DMD gene based on genomic DNA resources could benefit the molecular diagnosis and treatment of patients with DMD/BMD.

Long-read sequencing, represented by two mainstream platforms: Oxford Nanopore Technologies (ONT) and Pacific Biosciences (PB) single-molecule real-time sequencing, continues to rapidly improve throughput and reduce costs.16,17 Compared to conventional methods, long-read sequencing technologies offer more possibilities, especially in the detection of SVs and complex rearrangements, which involve duplications,18,19 whereas SNVs and small indels can also be accurately identified with adequate read depth (>30×) and HiFi sequencing technology.20 Additionally, targeted long-read sequencing is considered to be an efficient and cost-effective method for complex variant detection.21 Therefore, long-read sequencing can be used as a unified methodology to detect all heterogeneous sets of pathogenic mutations in the DMD gene. Accordingly, we designed a complete DMD gene panel based on long-read sequencing technology. Furthermore, to uncover the true features of DMD mutations, we performed genetic sequencing and analyzed 129 patients with DMD/BMD using ONT and PB platforms.

Results

Patient information

Among the 129 subjects included in this study, there were 76 males diagnosed with DMD, 16 males with BMD, seven males with intermediate muscular dystrophy (IMD), and two symptomatic female carriers. Additionally, 24 asymptomatic female carriers and four normal subjects (one male and three females) were included. Of the subjects, 76 were identified by the MLPA method, 33 had mutations that could be detected by conventional short-read sequencing methods, such as whole exome sequencing (WES) or DMD gene panel, and 11 were specifically sampled for this study due to the failure of genetic diagnosis using MPLA or NGS-based methods. Additionally, nine relatives who had not been previously diagnosed using MLPA or short-read sequencing were enrolled. In this cohort, 70 subjects were sequenced using the ONT platform, 51 using the PB platform, and eight using both the ONT and PB platforms for parallel validation (Figure 1; Tables S1 and S2).

Figure 1.

Figure 1

Flow chart of captured long-read sequencing

The capture and library preparation of the entire genomic DMD gene are shown step-by-step for both PB and ONT platforms, respectively.

Sequencing data overview

A total of 137 independent long-read datasets were generated for the 129 subjects. An average of 118,433 and 1,261,799 reads were collected from the PB and ONT datasets, respectively. The PB platform had an average read length of 2,502 bp with a coverage of 96.92% of the target region, while the ONT platform had an average read length of 1,945 bp with a coverage of 99.99% of the target region. The average sequencing depth for PB and ONT was 168 × and 382 ×, respectively.

Captured long-read sequencing provided a more efficient method for Duchenne muscular dystrophies gene mutation detection

In this study, accurate variations of the DMD gene were identified and corrected using the designed long-read sequencing panel, including mutations that had previously failed to be detected. Captured long-read sequencing provides an advantage for large-scale inversion and translocation detection, with SV fragment sizes ranging from 11 kb to 40 Mb. The results showed that translocations, duplications, and inversions had a wider range of alterations than deletions in the DMD gene (Tables 1 and 2). In addition, targeted long-read sequencing is capable of detecting microindels and SNVs, including deep intron variations, which is more accurate than short-read sequencing at the same 30× sequencing depth. In this study, 41 patients had SNVs and small indels within the DMD gene. Of these, five were deep intron mutations predicted to affect transcription by generating splicing sites and pseudoexons. Moreover, transcriptional alteration in subject D116, who had an ∼11-kb deletion in intron 48 (chrX: 31881062–31892406), was confirmed to generate a 161-bp transcription by muscle biopsy RNA-Seq (Figure S1). All SNV mutations were correctly identified using long-read sequencing platforms. Moreover, the software and analytic process mentioned in the STAR methods section accurately detected SNVs located within the homologous sequence (subject D130) in both the ONT and PB datasets, even in areas where short-read WGS failed. Nonsense and splice-site mutations were more frequently detected in SNVs (Table 1).

Table 1.

Long-read sequencing allows for efficient DMD gene mutation detection

MP Total count MF(%) Detection rate (a/b#)
SV size (bp)
MLPA NGS-panelα/WESβ/WGSγ LRSP
L-Del 34 26.4 33/34 NA 34/34 11,345–652,383
L-Dup 32 25.6 32/32 NA 32/32 12,735–1,260,562
Inversion 9 7.2 0/8 0/1 α 9/9 85,898–1,298,407
Translocation 2 1.6 0/1 0/2 1α/1β 2/2 13,394,796–40,251,284
LDD 7 5.6 4/4 NA 7/7 30,576–8,653,099
Micro-indel 11 8.8 0/9 9/9 α 11/11 1–4
Stop codon 15 12.0 0/15 15/15 α 15/15 0
Missense 2 1.6 0/2 2/2 α 2/2 0
Splicing 8 6.4 0/8 8/8 α 8/8 0
Deep-intron 5 4.0 0/5 0/5 4β/1γ 5/5 0

MP, mutation patterns; L-Del, large sequential deletion; L-Dup, large sequential duplication; LDD, large discontinuous deletion and duplication; MF, mutation frequency; a∗, number of positive detections; b#, total number of samples in the detection method; α, NGS-panel; β, WES; γ, WGS; NA, the detection method is NOT used for this pattern; LRSP, long-read sequencing panel; SV, structural variation.

Table 2.

Captured long-read sequencing corrected MLPA and short-read sequencing results

ID MLPA (NM_004006.3) WES/NGS-panel Long-read sequencing panel (NC_000023.10)
D54/D59 E51del; E64-79dup NA g.30866278_31256392dup; g.30866278_31766900inv; g.31766901_31804205del
D78 E30-43dup NA g.32235128_32266641dup; g.32235128_32453034inv; g.32266659_32453034dup
D84/D85 E54-59del; E61-75del NA g.311648843_31396554del; g.31396556-31482449inv; g.31482452_31692080del
D89/D90 E2dup; E45-49dup NA g.31849725_32161440dup; g.31849725_33148131inv; g.32904520_33148131dup
D133 NA None NC_000022.10:g.1_37909829delins[NC_000023.10:g.1_33085040]
NC_000023.10:g.1_33085051delins[NC_000022.10:g.1_37909760]
D134 NA None NC_000005.9:g.1_140663979delins[NC_000023.10:g.1_32953142]
NC_000023.10:g.1_32953139delins[NC_000005.9:g.1_140663975]
D135 NA None g.33077035_33127081dup; g.33077033_33166191inv; g.33128480_33166191dup

NA, the detection method is NOT used for the subject; SV, structural variation; None, no variant was detected; inv, invention; delins, the HGVS recommendation for the nomenclature of translocation.

Captured long-read sequencing revealed the precise characteristics of structural variation ligation in Duchenne muscular dystrophies

Using the target long-read sequencing method, a total of 84 patients were found with duplications and/or deletions in the DMD gene, of which 32 had duplications, 34 had deletions, nine had inversions, two had translocations, and seven had large discontinuous deletions and duplications (Table S2). Based on the captured long-read sequencing results, we confirmed that all breakpoints of the SVs fell within the intron region and that most contained microinsertions. Additionally, a few subjects had a large fragment (600–1000 bp) insertion between the breakpoints (Table S3); however, sequential ligation was not affected. Moreover, although most of the subjects were found to have head-to-tail ligation, we found a rare subject (D127) with continuous exon 19 to exon 44 duplications that resulted from an upstream to exon 18 deletion (NC_000023.10:g.32521926_40387205del) and an upstream to exon 44 duplication (NC_000023.10:g.32119087_40772185dup). Meanwhile, a nonsense mutation (NM_004006.3:c.5404C>T) was identified. Therefore, the above findings showed that not all sequential duplications were as simple as they seemed in the MLPA results (Figure 2). In addition, in nine of the subjects exhibiting DMD intragenic inversion, seven had previously been detected as having discontinuous deletions or duplications using the MLPA method (Figure 3; Table 2). The junction reads produced by inversion and translocation were verified with Sanger sequencing (Figure S2). The captured long-read sequencing panel revealed the characteristics of SV ligation in DMD.

Figure 2.

Figure 2

Captured long-read sequencing panel precisely uncovered the SVs of DMD

(A–G) Long-read sequencing revealed that the non-contiguous variants and the minority of duplications detected by MLPA methods are definitively attributed to inversion (A–D); translocation between autosomal and X chromosomes involving the DMD gene is shown (E and F); an inversion occurring in the intron region is indicated (G).

Figure 3.

Figure 3

Rare continuous duplication results from deletion and duplication

The same colored reads represent the same junction reads, which cross two separate regions of DMD.

Captured long-read sequencing exhibited higher accuracy for structural variation haplotype detection in female carriers

For SV haplotype detection, samples from 29 female participants were analyzed. We found that the SV haplotypes could be identified by long-read sequencing and were independent of sex. In addition, translocation occurred in two female patients (Figure 3; Table 2; Table S1). Thus, compared with MLPA and short-read sequencing methods, this one-step captured long-read sequencing method could attain higher accuracy in screening female carriers (Tables S1 and S2).

Discussion

In this study, we investigated the efficacy of a captured long-read sequencing panel and validated its performance in a cohort of 129 individuals. The results demonstrated that, compared with MLPA and short-read sequencing, the captured long-read sequencing method has significant advantages in uncovering the underlying large-scale rearrangements of DMD, especially in the detection of deep intron SVs, inversions, and translocations. In addition, this designed panel could capture the homologous region of DMD, which the short-read sequencing method could not achieve. Furthermore, at a 30× sequencing depth, this long-read sequencing panel precisely detected microindel mutations and deep intronic SNVs. We also observed that this method provided a more precise identification of SV breakpoint positions in the DMD gene. Both the ONT and PB sequencing methods were able to determine the exact breakpoints for deletions, duplications, and other complex rearrangements; however, PB showed slightly better consensus calling than ONT due to a more congruent sequence error pattern. Thus, for SNV detection, PB is more accurate than ONT. However, the accuracy of ONT could be improved by increasing the sequencing depth. Thus, ONT sequencing has the potential to meet the requirements for DMD mutation detection.

Although previous studies have characterized the deletion and duplication patterns of DMD, accurately identifying the precise junctions of breakpoint sequences, especially those involving duplicates, remains challenging. This highlights the need for further analyses of duplicates to achieve an accurate diagnosis and facilitate antisense-mediated exon-skipping therapy.22,23 Here, long-read sequencing demonstrated robust proficiency in decoding the entire sequence of the DMD rearrangement region, surpassing conventional methodologies. In particular, we found that not all duplications were tandem repeats, which directly revealed the nature of exon duplication in DMD.

MLPA is an effective method for detecting exon rearrangements in patients with DMD or BMD. These variations include the concurrence of exon deletion and duplication as well as the non-contiguous deletion or duplication of DMD exons. However, few studies have provided insight into the underlying sequence of complex SVs in DMD.24,25,26 Our long-read sequencing study suggests that complex SVs are mostly induced by inversion or translocation in unstable regions. Previous studies using chromosomal inversion and translocation detection methods have shown that an inverted duplication occurring next to a terminal deletion is a common rearrangement pattern in cancer and constitutional genomes.27,28 Our junction sequencing analysis revealed that non-contiguous duplications were interconnected with duplications in either the direct or inverted orientation. Complex deletions and duplication rearrangements may be the result of replication fork collapse.29

In addition, captured long-read sequencing significantly improves the detection of SVs in numerous genetic diseases for repeat expansion diseases in both male and female carriers.30,31 In our study, all female carriers were sequenced, and SVs were accurately identified in single alleles. Unlike the short-read sequencing-based method, which identifies duplications or deletions (copy number variations) using read-depth information, the long-read-based method takes advantage of split-read mapping. Hence, it can more accurately identify copy number variations in the heterozygous state. Therefore, captured long-read sequencing is a considerably more appropriate molecular diagnostic method than short-read sequencing for female carriers.

Both DMD and BMD are primarily caused by DMD mutations that lead to DMD genes of abnormal size, quantity, or function.32 However, occasionally, muscle biopsies may be necessary to accurately identify cases that cannot be resolved through mRNA and dystrophin protein expression studies alone.33 Specifically, an SNV in the exon or intron of DMD produces a premature termination codon or pseudoexon, resulting in DMD/BMD.34,35 However, these mutations are generally considered when SVs are not found in clinical gene detection, which extends the time of diagnosis. Our one-step mutation detection platform will accelerate the process of DMD gene detection, making the diagnosis much more efficient. But, the exception is the variation occurs during transcription only.

Furthermore, in emerging treatments for patients with DMD/BMD, antisense oligonucleotide-mediated exon skipping is one of the most promising therapeutic approaches and is expected to be applicable to most mutations, including deletions, duplications, and nonsense mutations of in-frame exons.36 However, it is difficult to design an antisense oligonucleotide for certain duplications and complex variations when the actual sequences are unclear. Similarly, a gene-editing therapeutic strategy also requires accurate and detailed DMD defect information to select the appropriate candidate. Therefore, the new era of gene therapy for DMD/BMD requires more accurate mutation detection. The long-read sequencing panel used in this study offers a highly efficient and accurate approach for the molecular diagnosis of DMD/BMD. Additionally, these findings have expanded our understanding of DMD gene rearrangements and provided insights into the treatment of DMD/BMD.

Limitations of the study

In this study, we explored the captured long-read sequencing method for DMD mutation detection. However, some rare patients with DMD/BMD could be caused by transcription variation alone. We did not investigate the possible mechanisms underlying DMD rearrangement and the mRNA variation, and further research in this area is essential for our future studies.

STAR★Methods

Key resources table

REAGENT or RESOURCE SOURCE IDENTIFIER
Biological samples

Genomic DNA. DMD/BMD patients peripheral blood.
The raw sequence data have been deposited in the Genome Sequence Archive. The accession number is HRA004622.
Peking Union Medical College Hospital (Beijing, China).

Oligonucleotides

Primers. Please see Table S5 for detail. Sangon Biotech (Shanghai Co., Ltd.).

Software and algorithms

Guppy (high acc) https://pypistats.org/packages/guppy3 https://pypistats.org/packages/guppy3
Nanoplexer (0.1) https://anaconda.org/bioconda/nanoplexer/files https://anaconda.org/bioconda/nanoplexer/files
minimap2 (2.17-r941) https://github.com/lh3/minimap2/releases https://github.com/lh3/minimap2/releases
mosdepth (0.2.9) https://github.com/brentp/mosdepth/releases https://github.com/brentp/mosdepth/releases
sniffles (1.0.12) https://github.com/fritzsedlazeck/Sniffles https://github.com/fritzsedlazeck/Sniffles
pepper_deepvariant (0.10.0) https://github.com/kishwarshafin/pepper https://github.com/kishwarshafin/pepper
clair (2.1.0) https://github.com/quay/clair https://github.com/quay/clair
Annovar (2017-07-17 01:17:05–0400) https://annovar.openbioinformatics.org/en/latest/ https://annovar.openbioinformatics.org/en/latest/
AnnotSV(version 2.1) https://github.com/lgmgeo/AnnotSV https://github.com/lgmgeo/AnnotSV
spliceAI (1.3) https://github.com/Illumina/SpliceAI https://github.com/Illumina/SpliceAI
ccs (v4.2.0-1-g450908e4) https://github.com/PacificBiosciences/ccs https://github.com/PacificBiosciences/ccs
lima (v1.11.0-1-gec618c9) https://lima1.readthedocs.io/en/latest/ https://lima1.readthedocs.io/en/latest/
bam2fastq (1.3.0) https://github.com/jts/bam2fastq https://github.com/jts/bam2fastq
deepvariant (0.10.0) https://github.com/google/deepvariant https://github.com/google/deepvariant

Resource availability

Lead contact

Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, Yi Dai (pumchdy@pumch.cn).

Materials availability

This study did not generate new unique reagents.

Data and code availability

  • The raw sequence data have been deposited in the Genome Sequence Archive (Genomics, Proteomics & Bioinformatics 2021) at the National Genomics Data Center (Nucleic Acids Res 2022), China National Center for Bioinformation/Beijing Institute of Genomics, Chinese Academy of Sciences. and are publicly available as of the date of publication.37,38 Accession numbers are listed in the key resources table.

  • This paper does not report original code.

  • Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.

Experimental model and study participant details

Ethics declaration

This study was approved by the Ethics Committee of Peking Union Medical College Hospital in Beijing, China. Written informed consent was obtained from all participants.

Human participants

A total of 129 Han Chinese individuals were included in this study. Among them, there were 100 males: 76 diagnosed with DMD with an average age of 6.55 years, 16 with BMD and an average age of 14.0 years, and seven with IMD and an average age of 10.29 years. Additionally, there was one normal subject aged 61.0 years. Furthermore, there were also 29 females: 24 asymptomatic carriers with an average age of 34.58 years, three normal subjects with an average age of 10.0 years, and two symptomatic carriers with an average age of 3.50 years. All of the subjects admitted to the Peking Union Medical College Hospital (Beijing, China), were enrolled for a single-blind, in-depth investigation and validation of DMD mutations using the long-read sequencing method Table S1. This study was approved by the Ethics Committee of Peking Union Medical College Hospital. Written informed consent was obtained from each participant, and all participants were anonymised for DNA sequencing and data analysis.

Method details

DMD gene panel design

We designed a set of DNA probes with a length of 100 bp to cover the entire DMD gene, including all exons and introns, and the 20 kb upstream and downstream regions (GRCh37/hg19 chrX: 31115345–33377726). Probes corresponding to repetitive sequences in the human genome were excluded from the analysis.

DNA isolation and assessment

Genomic DNA was extracted from the peripheral blood using a genomic DNA extraction kit (Sangon Bioengineer Co., Shanghai, China) according to the manufacturer’s protocol. The quantity and quality of DNA were assessed using Qubit 3.0 (Thermo Fisher Scientific Inc., Carlsbad, CA, USA) and agarose gel electrophoresis, respectively.

Library preparation and sequencing

Sequencing libraries were constructed as follows: 3 μg of genomic DNA per sample was sheared to 1–6 kb fragments by g-Tube (Cat. No. 520079, Covaris, Woburn, MA) centrifugation (1,5000 × g, 2 min, twice). DNA was purified after fragment size characterization, after which end repair, A-tailing at the 3′ ends, and adapter ligation were performed through pre-capture amplification. After performing end repair using the NEBNext Ultra II End Repair/dA-Tailing Module (New England Biolabs, MA, USA), the adapter was attached using a NEBNext Ultra II Ligation Module (New England Biolabs). Pre-capture amplification was carried out using PrimeSTAR GXL DNA Polymerase (Takara Bio, Shiga, Japan). The PCR cycling conditions consisted of an initial denaturation at 98°C for 1 min, followed by 4 cycles of amplification (denaturation 98°C for 10 s, annealing 65°C for 15 s, and extension 68°C for 5 min). Targeted sequence capture was conducted by pooling indexed PCR products and hybridization with custom capture probes. Briefly, 1 μg of pre-capture library DNA was hybridized with solution probes and blockers (custom, IDT), along with 5 μg human Cot-1 DNA (#15279011, Thermo Fisher Scientific), at 65°C for 14 h. Biotinylated probes were captured using M-270 Streptavidin beads (#65306, Thermo Fisher Scientific). Hybridization and washing were performed according to the protocols recommended by Boke Biotechnologies. Captured fragments were amplified using PrimeSTAR GXL DNA Polymerase in a 50 μL reaction volume. The cycling conditions were as follows: initial denaturation at 98°C for 1 min, followed by 18 cycles of amplification (denaturation 98°C for 10 s, annealing 65°C for 15 s, and extension 68°C for 6 min) and a final extension at 68°C for 5 min. Finally, the PCR product was size selected using a 0.4× volume of AmpureXP beads (Beckman Coulter, USA). Purified DNA fragments were amplified by PCR, quantified, and subjected to sequencing on the long-read sequencing platforms of Oxford Nanopore PromethION using an R9.4.1 flow cell and Pacific Biosciences Sequel II, according to the manufacturer’s standard protocols (Figure 1).

Sequencing data analysis

For the ONT sequencing data, Guppy (version 3.0.5 + 45c3543) was used to perform base-calling in high-accuracy mode, and FASTQ reads were generated from the raw electrical signals in FAST5 format. Reads with a quality >7 were retained and demultiplexed into their corresponding samples using a Nanoplexer (version 0.1). For PB sequencing data, HiFi reads with more than five subreads were kept in the Circular Consensus Sequencing step using ccs (version v4.2.0-1-g450908e4) and further demultiplexed using lima (version v1.11.0-1-gec618c9). The long reads were then mapped to the reference genome hg19 using minimap2 (version 2.15-r906-dirty or 2.17-r941). Next, Sniffles (version 1.0.12) was used for SV calling, and Pepper_deepvariant (version 0.10.0) and deepvariant (0.10.0) were used for SNV calling for the ONT and PB platforms, respectively. SpliceAI was used for novel splice-site prediction. Consensus sequences of SV reads were generated using spoa (version v4.0.5), and insertion bases between breakpoints were manually checked using UCSC BLAT (https://genome.ucsc.edu/cgi-bin/hgBlat) (Table S4; Figure S3). Finally, the SVs in each sample were annotated with AnnotSV (version 2.1), while the SNVs and splicing events were annotated using Annovar (2017-07-17 01:17:05–0400) and spliceAI (1.3), respectively. These annotations were performed for downstream analysis.

Sanger sequencing

Breakpoint validation was performed using Sanger sequencing. Using Primer3web (version 4.0.0) software, the forward primer was designed using the upstream sequence of the 5′ breakpoint, and the reverse primer was designed using the downstream sequence of the 3' breakpoint (Table S5). Primers were synthesized by Sangon Biotech (Shanghai Co., Ltd.) The PCR reaction was performed with KOD FX Neo polymerase (TOYOBO, Osaka, Japan) using a three-step PCR protocol in a 50 μL reaction volume. The reaction mixture contained 25 μL of 2× PCR buffer, 0.4 mM of each dNTP, 0.15 μM of each primer, 1 U of polymerase, and 100 ng of genomic DNA. The PCR amplification consisted of an initial denaturation step at 94°C for 2 min, followed by denaturation at 98°C for 10 s, annealing at 60°C for 30 s, extension at 72°C for 2 min for 35 cycles, and a final elongation step of 10 min at 68°C. The presence of the expected products was confirmed by 1% agarose gel electrophoresis. The amplified products were subsequently purified using a TIANgel midi purification kit (Tiangen, Beijing) and sequenced at the Sangon Biotech facility (Shanghai, China).

Quantification and statistical analysis

No statistical analysis was involved in this study.

Additional resource

There are no other resources to declare.

Acknowledgments

This study was supported by the National High Level Hospital Clinical Research Funding, Grant Number: 2022-PUMCH-D-002; and the National Key Research and Development Program of China, Grant Number: 2022YFC2703104.

Author contributions

Conceptualization, Y.D. and C.L.; methodology, D.P.W., Y.W., Y.D., and C.L.; investigation, C.G., S.R.P., W.P.Q., Q.Y.D., T.Q., J.J.L., J.L., S.J., and Y.B.W.; writing – original draft, C.L. and Y.D.; writing – review and editing, C.L.; funding acquisition, Y.D.; resources, L.Y.C., L.C., X.Z.Y., and D.C.S.; Supervision, Y.D.

Declaration of interests

The authors declare no competing interests.

Inclusion and diversity

We worked to ensure ethnic or other types of diversity in the recruitment of human subjects.

Published: October 30, 2023

Footnotes

Supplemental information can be found online at https://doi.org/10.1016/j.isci.2023.108365.

Supplemental information

Document S1. Figures S1‒S3
mmc1.pdf (743.8KB, pdf)
Table S1. Profile of the 129 subjects, related to Table 1, Table 2 and Figure 3
mmc2.xlsx (16.5KB, xlsx)
Table S2. Pathogenic mutation results of the 129 subjects across different platforms

The variants were described in accordance with HGVS nomenclature and verified using the Mutalyzer program (version: 3.0.5; https://mutalyzer.nl/), related to Table 1, Table 2, Figure 1, Figure 2 and Figure 3.

mmc3.xlsx (21.1KB, xlsx)
Table S3. Random base insertion statistics in SV breakpoints, related to Table 1
mmc4.xlsx (15.8KB, xlsx)
Table S4. Software used for sequencing data analysis, related to Figure 3
mmc5.xlsx (11.3KB, xlsx)
Table S5. Primers used for breakpoints validation, related to Table 2
mmc6.xlsx (11.3KB, xlsx)

References

  • 1.Kunkel L.M., Hejtmancik J.F., Caskey C.T., Speer A., Monaco A.P., Middlesworth W., Colletti C.A., Bertelson C., Müller U., Bresnan M., et al. Analysis of Deletions in DNA from Patients with Becker and Duchenne Muscular-Dystrophy. Nature. 1986;322:73–77. doi: 10.1038/322073a0. [DOI] [PubMed] [Google Scholar]
  • 2.Okubo M., Minami N., Goto K., Noguchi S., Mitsuhashi S., Nishino I. Genetic diagnosis of Duchenne/Becker muscular dystrophy using next-generation sequencing: Validation analysis of DMD mutations. Neuromuscular Disord. 2016;26:S96. doi: 10.1016/j.nmd.2016.06.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Kong X., Zhong X., Liu L., Cui S., Yang Y., Kong L. Genetic analysis of 1051 Chinese families with Duchenne/Becker Muscular Dystrophy. BMC Med. Genet. 2019;20:139. doi: 10.1186/s12881-019-0873-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Bladen C.L., Salgado D., Monges S., Foncuberta M.E., Kekou K., Kosma K., Dawkins H., Lamont L., Roy A.J., Chamova T., et al. The TREAT-NMD DMD Global Database: analysis of more than 7,000 Duchenne muscular dystrophy mutations. Hum. Mutat. 2015;36:395–402. doi: 10.1002/humu.22758. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Lalic T., Vossen R.H.A.M., Coffa J., Schouten J.P., Guc-Scekic M., Radivojevic D., Djurisic M., Breuning M.H., White S.J., den Dunnen J.T. Deletion and duplication screening in the DMD gene using MLPA. Eur. J. Hum. Genet. 2005;13:1231–1234. doi: 10.1038/sj.ejhg.5201465. [DOI] [PubMed] [Google Scholar]
  • 6.Koczok K., Merő G., Szabó G.P., Madar L., Gombos É., Ajzner É., Mótyán J.A., Hortobágyi T., Balogh I. A novel point mutation affecting Asn76 of dystrophin protein leads to dystrophinopathy. Neuromuscul. Disord. 2018;28:129–136. doi: 10.1016/j.nmd.2017.12.003. [DOI] [PubMed] [Google Scholar]
  • 7.Nallamilli B.R.R., Chaubey A., Valencia C.A., Stansberry L., Behlmann A.M., Ma Z., Mathur A., Shenoy S., Ganapathy V., Jagannathan L., et al. A single NGS-based assay covering the entire genomic sequence of the DMD gene facilitates diagnostic and newborn screening confirmatory testing. Hum. Mutat. 2021;42:626–638. doi: 10.1002/humu.24191. [DOI] [PubMed] [Google Scholar]
  • 8.Ling C., Dai Y., Fang L., Yao F., Liu Z., Qiu Z., Cui L., Xia F., Zhao C., Zhang S., et al. Exonic rearrangements in DMD in Chinese Han individuals affected with Duchenne and Becker muscular dystrophies. Hum. Mutat. 2020;41:668–677. doi: 10.1002/humu.23953. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Verhaart I.E.C., Aartsma-Rus A. Therapeutic developments for Duchenne muscular dystrophy. Nat. Rev. Neurol. 2019;15:373–386. doi: 10.1038/s41582-019-0203-3. [DOI] [PubMed] [Google Scholar]
  • 10.Béroud C., Carrié A., Beldjord C., Deburgrave N., Llense S., Carelle N., Peccate C., Cuisset J.M., Pandit F., Carré-Pigeon F., et al. Dystrophinopathy caused by mid-intronic substitutions activating cryptic exons in the DMD gene. Neuromuscul. Disord. 2004;14:10–18. doi: 10.1016/s0960-8966(03)00169-x. [DOI] [PubMed] [Google Scholar]
  • 11.Baskin B., Gibson W.T., Ray P.N. Duchenne muscular dystrophy caused by a complex rearrangement between intron 43 of the DMD gene and chromosome 4. Neuromuscul. Disord. 2011;21:178–182. doi: 10.1016/j.nmd.2010.11.008. [DOI] [PubMed] [Google Scholar]
  • 12.Gurvich O.L., Tuohy T.M., Howard M.T., Finkel R.S., Medne L., Anderson C.B., Weiss R.B., Wilton S.D., Flanigan K.M. DMD pseudoexon mutations: splicing efficiency, phenotype, and potential therapy. Ann. Neurol. 2008;63:81–89. doi: 10.1002/ana.21290. [DOI] [PubMed] [Google Scholar]
  • 13.Segarra-Casas A., Domínguez-González C., Hernández-Laín A., Sanchez-Calvin M.T., Camacho A., Rivas E., Campo-Barasoain A., Madruga M., Ortez C., Natera-de Benito D., et al. Genetic diagnosis of Duchenne and Becker muscular dystrophy through mRNA analysis: new splicing events. J. Med. Genet. 2023;60:615–619. doi: 10.1136/jmg-2022-108828. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Tuffery-Giraud S., Saquet C., Chambert S., Claustres M. Pseudoexon activation in the DMD gene as a novel mechanism for Becker muscular dystrophy. Hum. Mutat. 2003;21:608–614. doi: 10.1002/humu.10214. [DOI] [PubMed] [Google Scholar]
  • 15.Lu H., Giordano F., Ning Z. Oxford Nanopore MinION Sequencing and Genome Assembly. Dev. Reprod. Biol. 2016;14:265–279. doi: 10.1016/j.gpb.2016.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Rhoads A., Au K.F. PacBio Sequencing and Its Applications. Dev. Reprod. Biol. 2015;13:278–289. doi: 10.1016/j.gpb.2015.08.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Selvatici R., Fang M., Falzarano M., Gualandi F., Delin S., Bensemmane S., Shatillo A., Bello L., Pegoraro E., Ferlini A. Whole exome and whole genome sequencing for the genetic diagnosis of dystrophinopathies. Neuromuscular Disord. 2020;30:S142. doi: 10.1016/j.nmd.2020.08.323. [DOI] [Google Scholar]
  • 18.Logsdon G.A., Vollger M.R., Eichler E.E. Long-read human genome sequencing and its applications. Nat. Rev. Genet. 2020;21:597–614. doi: 10.1038/s41576-020-0236-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Bruels C.C., Littel H.R., Daugherty A.L., Stafki S., Estrella E.A., McGaughy E.S., Truong D., Badalamenti J.P., Pais L., Ganesh V.S., et al. Diagnostic capabilities of nanopore long-read sequencing in muscular dystrophy. Ann. Clin. Transl. Neurol. 2022;9:1302–1309. doi: 10.1002/acn3.51612. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Vollger M.R., Logsdon G.A., Audano P.A., Sulovari A., Porubsky D., Peluso P., Wenger A.M., Concepcion G.T., Kronenberg Z.N., Munson K.M., et al. Improved assembly and variant detection of a haploid human genome using single-molecule, high-fidelity long reads. Ann. Hum. Genet. 2020;84:125–140. doi: 10.1111/ahg.12364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Miller D.E., Sulovari A., Wang T., Loucks H., Hoekzema K., Munson K.M., Lewis A.P., Fuerte E.P.A., Paschal C.R., Walsh T., et al. Targeted long-read sequencing identifies missing disease-causing variation. Am. J. Hum. Genet. 2021;108:1436–1449. doi: 10.1016/j.ajhg.2021.06.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.White S.J., Aartsma-Rus A., Flanigan K.M., Weiss R.B., Kneppers A.L.J., Lalic T., Janson A.A.M., Ginjaar H.B., Breuning M.H., den Dunnen J.T. Duplications in the DMD gene. Hum. Mutat. 2006;27:938–945. doi: 10.1002/humu.20367. [DOI] [PubMed] [Google Scholar]
  • 23.Echigoya Y., Yokota T. Skipping multiple exons of dystrophin transcripts using cocktail antisense oligonucleotides. Nucleic Acid Ther. 2014;24:57–68. doi: 10.1089/nat.2013.0451. [DOI] [PubMed] [Google Scholar]
  • 24.Luce L., Abelleyro M.M., Carcione M., Mazzanti C., Rossetti L., Radic P., Szijan I., Menazzi S., Francipane L., Nevado J., et al. Analysis of complex structural variants in the DMD gene in one family. Neuromuscul. Disord. 2021;31:253–263. doi: 10.1016/j.nmd.2020.11.015. [DOI] [PubMed] [Google Scholar]
  • 25.Fenollar-Cortés M., Gallego-Merlo J., Trujillo-Tiebas M.J., Lorda-Sánchez I., Ayuso C. Two non-contiguous duplications in the DMD gene in a Spanish family. J. Neurogenet. 2008;22:93–101. doi: 10.1080/01677060701686184. [DOI] [PubMed] [Google Scholar]
  • 26.Janssen B., Hartmann C., Scholz V., Jauch A., Zschocke J. MLPA analysis for the detection of deletions, duplications and complex rearrangements in the dystrophin gene: potential and pitfalls. Neurogenetics. 2005;6:29–35. doi: 10.1007/s10048-004-0204-1. [DOI] [PubMed] [Google Scholar]
  • 27.Hermetz K.E., Newman S., Conneely K.N., Martin C.L., Ballif B.C., Shaffer L.G., Cody J.D., Rudd M.K. Large inverted duplications in the human genome form via a fold-back mechanism. PLoS Genet. 2014;10 doi: 10.1371/journal.pgen.1004139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Weckselblatt B., Rudd M.K. Human Structural Variation: Mechanisms of Chromosome Rearrangements. Trends Genet. 2015;31:587–599. doi: 10.1016/j.tig.2015.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Liu P., Erez A., Nagamani S.C.S., Dhar S.U., Kołodziejska K.E., Dharmadhikari A.V., Cooper M.L., Wiszniewska J., Zhang F., Withers M.A., et al. Chromosome catastrophes involve replication mechanisms generating complex genomic rearrangements. Cell. 2011;146:889–903. doi: 10.1016/j.cell.2011.07.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Mizuguchi T., Toyota T., Adachi H., Miyake N., Matsumoto N., Miyatake S. Detecting a long insertion variant in SAMD12 by SMRT sequencing: implications of long-read whole-genome sequencing for repeat expansion diseases. J. Hum. Genet. 2019;64:191–197. doi: 10.1038/s10038-018-0551-7. [DOI] [PubMed] [Google Scholar]
  • 31.Zeng S., Zhang M.Y., Wang X.J., Hu Z.M., Li J.C., Li N., Wang J.L., Liang F., Yang Q., Liu Q., et al. Long-read sequencing identified intronic repeat expansions in SAMD12 from Chinese pedigrees affected with familial cortical myoclonic tremor with epilepsy. J. Med. Genet. 2019;56:265–270. doi: 10.1136/jmedgenet-2018-105484. [DOI] [PubMed] [Google Scholar]
  • 32.Hoffman E.P., Fischbeck K.H., Brown R.H., Johnson M., Medori R., Loike J.D., Harris J.B., Waterston R., Brooke M., Specht L., et al. Characterization of dystrophin in muscle-biopsy specimens from patients with Duchenne's or Becker's muscular dystrophy. N. Engl. J. Med. 1988;318:1363–1368. doi: 10.1056/NEJM198805263182104. [DOI] [PubMed] [Google Scholar]
  • 33.Jones H.F., Bryen S.J., Waddell L.B., Bournazos A., Davis M., Farrar M.A., McLean C.A., Mowat D.R., Sampaio H., Woodcock I.R., et al. Importance of muscle biopsy to establish pathogenicity of DMD missense and splice variants. Neuromuscul. Disord. 2019;29:913–919. doi: 10.1016/j.nmd.2019.09.013. [DOI] [PubMed] [Google Scholar]
  • 34.García-Rodríguez R., Hiller M., Jiménez-Gracia L., van der Pal Z., Balog J., Adamzek K., Aartsma-Rus A., Spitali P. Premature termination codons in the DMD gene cause reduced local mRNA synthesis. Proc. Natl. Acad. Sci. USA. 2020;117:16456–16464. doi: 10.1073/pnas.1910456117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Keegan N.P. Pseudoexons of the DMD Gene. J. Neuromuscul. Dis. 2020;7:77–95. doi: 10.3233/JND-190431. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Sheikh O., Yokota T. Developing DMD therapeutics: a review of the effectiveness of small molecules, stop-codon readthrough, dystrophin gene replacement, and exon-skipping therapies. Expert Opin. Investig. Drugs. 2021;30:167–176. doi: 10.1080/13543784.2021.1868434. [DOI] [PubMed] [Google Scholar]
  • 37.CNCB-NGDC Members and Partners, and Partners Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2023. Nucleic Acids Res. 2023;51:D18–D28. doi: 10.1093/nar/gkac1073. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Chen T., Chen X., Zhang S., Zhu J., Tang B., Wang A., Dong L., Zhang Z., Yu C., Sun Y., et al. The Genome Sequence Archive Family: Toward Explosive Data Growth and Diverse Data Types. Dev. Reprod. Biol. 2021;19:578–583. doi: 10.1016/j.gpb.2021.08.001. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Figures S1‒S3
mmc1.pdf (743.8KB, pdf)
Table S1. Profile of the 129 subjects, related to Table 1, Table 2 and Figure 3
mmc2.xlsx (16.5KB, xlsx)
Table S2. Pathogenic mutation results of the 129 subjects across different platforms

The variants were described in accordance with HGVS nomenclature and verified using the Mutalyzer program (version: 3.0.5; https://mutalyzer.nl/), related to Table 1, Table 2, Figure 1, Figure 2 and Figure 3.

mmc3.xlsx (21.1KB, xlsx)
Table S3. Random base insertion statistics in SV breakpoints, related to Table 1
mmc4.xlsx (15.8KB, xlsx)
Table S4. Software used for sequencing data analysis, related to Figure 3
mmc5.xlsx (11.3KB, xlsx)
Table S5. Primers used for breakpoints validation, related to Table 2
mmc6.xlsx (11.3KB, xlsx)

Data Availability Statement

  • The raw sequence data have been deposited in the Genome Sequence Archive (Genomics, Proteomics & Bioinformatics 2021) at the National Genomics Data Center (Nucleic Acids Res 2022), China National Center for Bioinformation/Beijing Institute of Genomics, Chinese Academy of Sciences. and are publicly available as of the date of publication.37,38 Accession numbers are listed in the key resources table.

  • This paper does not report original code.

  • Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.


Articles from iScience are provided here courtesy of Elsevier

RESOURCES