Skip to main content
Advanced Science logoLink to Advanced Science
. 2024 Dec 31;12(9):2410992. doi: 10.1002/advs.202410992

Haplotype‐Resolved Genotyping and Association Analysis of 1,020 β‐Thalassemia Patients by Targeted Long‐Read Sequencing

Yuhua Ye 1,2, Chao Niu 1,2, Aiping Mao 3, Lang Qin 1,2, Jiahan Zhan 3, Weijie Chen 1,2, Zhentian Liu 3, Tiantian Xie 3, Qianqian Zhang 1,2, Jiaqi Li 3, Li Huang 1,2, Wanli Meng 3, Yumeng Liu 1,2, Liuhua Liao 4, Junqin Cai 4, Riyang Liu 4, Xinhua Zhang 5, Lihong Zeng 5, Yaoyun Li 6, Bin Lin 7, Kui Li 7, Xiaoyun Hua 7, Binbin Huang 8, Honggui Qin 8, Yueyan Huang 9, Zhijing Huang 9, Jinquan Lao 10, Xiang Qu 10, Juanjuan Chen 11, Xiaoqin Feng 12, Qiujun Liu 12, Wanying Lin 13, Xiaoman Zhou 14, Yidan Liang 1,2, Xingjiang Long 15, Jiaofeng Qin 15, Lixiang Yan 16, Weijian Zhu 17, Lian Yu 18, Chengwu Fan 19, Deguo Tang 20, Tianyu Zhong 21, Jufang Tan 22, Zhilin Ren 3, Xiangmin Xu 1,2,
PMCID: PMC11884621  PMID: 39737841

Abstract

Despite the well‐documented mutation spectra of β‐thalassemia, the genetic variants and haplotypes of globin gene clusters modulating its clinical heterogeneity remain incompletely illustrated. Here, a targeted long‐read sequencing (T‐LRS) is demonstrated to capture 20 genes/loci in 1,020 β‐thalassemia patients. This panel permits not only identification of thalassemia mutations at 100% of sensitivity and specificity, but also detection of rare structural variants (SVs) and single nucleotide variants (SNVs) in modifier genes/loci. The highly homologous regions of α‐/β‐globin gene clusters are then phased and 3 novel haplotypes in HBG1/HBG2 region are reported in this population of β‐thalassemia patients. Furthermore, one of the haplotypes is associated with ameliorated symptoms of β‐thalassemia. Similarly, 5 major haplotypes are identified in HBA1/HBA2 homologous region while one of them is found highly linked with deletional α‐thalassemia mutations. Finally, rare mutations in erythroid transcription factors in DNMT1 and KLF1 associated with increased expression of fetal hemoglobin and reduced transfusion dependencies are identified. This study presents the largest T‐LRS study for β‐thalassemia patients to date, facilitating precise clinical diagnosis and haplotype phasing of globin gene clusters.

Keywords: fetal hemoglobin, thalassemia, third‐generation sequencing


A targeted long‐read sequencing (T‐LRS) panel for accurate genotyping of β‐hemoglobinopathies is developed and its performance is tested in 1,020 β‐thalassemia patients. Novel haplotypes in the globin gene clusters are detected and functional rare variants are found by association analysis. This study presents the largest T‐LRS study for β‐thalassemia patients to date, facilitating precise clinical diagnosis and haplotype phasing of globin gene clusters.

graphic file with name ADVS-12-2410992-g002.jpg

1. Introduction

Thalassemia is one of the most common monogenic disorders affecting 4.4 out of every 10 000 live births worldwide.[ 1 ] The two clinical forms, α‐ and β‐thalassemia are characterized by imbalanced synthesis of α‐ and β‐globin due to the genetic defects majorly in HBA1/HBA2 and HBB,[ 2 ] which trigger the pathogenesis of ineffective erythropoiesis, increased hemolysis, and iron overload.[ 3 , 4 , 5 ] Patients with severe forms of β‐thalassemia mostly require life‐long blood transfusion though emerging cases have been reported cured by allogeneic stem cell transplantation or gene‐editing of hematological stem cells.[ 6 ] Population screening and molecular diagnosis is thus considered as an effective strategy for the prevention of the birth defect.

Traditional routine strategy for the screening of thalassemia carriers relies primarily on the thalassemic traits as evaluated by their hematological indices and hemoglobin components prior to molecular diagnosis.[ 7 , 8 ] However, limitations also arise regarding its time‐effectiveness and the potential risk of missed detections in cases with “silent” or “rare” mutations or complex structural variants.[ 7 , 9 ] To tackle these challenges, significant efforts have been dedicated to developing a high‐throughput system for the comprehensive detection of thalassemia mutations using next‐generation sequencing (NGS) and long‐read sequencing (LRS), independent of prior knowledge of hematological indices.[ 10 , 11 , 12 , 13 ] Up to date, haplotype phasing of these genomic regions involving the disease‐causing mutations of thalassemia is mainly based on linkage disequilibrium (LD) test among the population of thalassemia patients or carriers.[ 14 , 15 ] Apart from detecting disease‐causing mutations of thalassemia, comprehensive analysis of modifier variants in β‐thalassemia patients might advance precise diagnosis and genetic therapies for treating these monogenic diseases.[ 16 ]

To date, robust evidence from population‐based studies has shown a wide range of clinical severity in β‐thalassemia patients, even among those with identical disease‐causing mutations.[ 2 , 17 ] This enabled population‐based genome‐wide association studies and the identification of key modulator genes and loci, including BCL11A, KLF1, GATA1, MYB‐HBS1L, and HBG promoters associated with the levels of fetal hemoglobin (HbF).[ 18 , 19 , 20 , 21 ] Moreover, in vitro validations have led to the identification of key regulatory role of trans‐acting complexes and members on hemoglobin switching.[ 22 , 23 , 24 ] Some of these targets, such as the erythroid‐specific enhancer in BCL11A and promoters of HBG genes, were applied as targets for gene‐editing of autologous stem cells as a genetic therapy of β‐thalassemia and sickle cell anemia.[ 25 , 26 , 27 , 28 ] These findings highlighted the critical role of population‐based screening of naturally occurring variants modifying the clinical symptoms of thalassemia patients. We previously developed a T‐LRS panel with the PacBio Sequel II platform to identify single‐nucleotide variants (SNVs) and structural variants (SVs) within long‐range fragments, designed as targeted amplicons covering all known regions associated with disease‐causing mutations in thalassemia. This approach, termed Comprehensive Analysis of Thalassemia Alleles (CATSA), has demonstrated its advantages in precise prenatal diagnosis and genetic counseling compared to traditional PCR‐based methods.[ 29 , 30 ] In this study, we optimized the panel by additionally capturing all cis‐elements in globin gene clusters and modifier genes known to regulate hemoglobin switching. We next introduced a cohort of 1020 β‐thalassemia patients to perform T‐LRS with this panel to test its performance in the detection of thalassemia genotypes. Furthermore, the major haplotypes in the highly homologous HBG1/HBG2 and HBA1/HBA2 regions were characterized using the long‐sequencing reads, followed by haplotype‐based association studies. Finally, we provided the mutation spectra among the modifier genes captured in this panel and uncovered a total of 198 variants associated with the expression of HbF in the 1020 β‐thalassemia patients. Taken together, our study provided the general phasing profiles and spectra of modifier variants in a large cohort of β‐thalassemia patients through application of an optimized T‐LRS panel.

2. Experimental Section

2.1. The Design and Experimental Procedures of the LRS Panel

The T‐LRS panel contained four long‐range multiplex PCR reactions to target known disease‐causing variants of thalassemia, core regulatory elements in α‐ and β‐globin gene loci, known and potential novel modifier genes of β‐hemoglobinopathies (Figure 1A–F; Figure S1 and Table S1, Supporting Information). Specifically, reaction 1 was an upgraded LRS approach modified from previously published CATSA assay,[ 31 ] to incorporate new primers for the detection of HBD gene and large deletions in the β‐globin locus (Figure 1A,B). Reaction 2 contained 10 pairs of primers to target other core genes and regulatory elements in the α‐globin locus, and five modifier genes including BCL11A, GATAD2A, DNMT1, GATA1, and ZBTB7A (Figure 1A,D; Figure S1, Supporting Information). Reaction 3 contained eight pairs of primers to target other core genes and regulatory elements in the β‐globin locus, and two modifier genes including HBS1L‐MYB and KLF1 (Figure 1B,C,E). Reaction 4 contained 11 pairs of primers to target four potential modifier genes including CHD4, KLF3, KLF8, and SIRT1 (Figure 1F; Figure S1, Supporting Information). To achieve accurate genotyping of thalassemia alleles and modifier variants in β‐hemoglobinopathies, primers for multiplex LR‐PCR for a T‐LRS gene panel were designed to cover known disease‐causing variants in the α‐/β‐globin gene clusters and modifier genes/loci mentioned above (Figure 1; Figure S1, Supporting Information). Vigorous tests of primer sequences and concentration were performed for all the four reactions to achieve optimized balanced amplification among fragments (Figure S2, Supporting Information).

Figure 1.

Figure 1

Design of the PCR primers for the T‐LRS panel. A,B) Primers designed in the α‐globin (A) and β‐globin (B) gene loci. The purple primers were included in reaction 1 (upgraded CATSA assay) to cover SNVs/indels in HBA1, HBA2, HBB, and HBD genes, as well as common deletions in the gene loci. The black primers were designed to cover other core genes and regulatory elements in the α‐globin (A) and β‐globin (B) gene loci. C) Primer pair included in reaction 3 for the HBS1LMYB amplicon to cover HBS1LMYB intergenic polymorphism. D) Primer pair included in reaction 2 for the BCL11A amplicon located in the intron 2 of BCL11A. E) Primer pair included in reaction 3 for the KLF1 amplicon to cover the full‐length gene. F) Four primer pairs included in reaction 4 for the four CHD4 amplicons to cover the core exons and the majority of introns for CHD4.

PCR amplification and long‐read library preparation were performed similarly as previously described.[ 31 , 32 , 33 , 34 ] Multiplex long‐range PCR (LR‐PCR) was performed in 50‐µL reactions containing 10 to 100 ng of genomic DNA, 1xPCR buffer for KOD FX Neo, 0.4 mm of each dNTP, 1 µm of primer mixture, and 1 µL of KOD FX Neo (TOYOBO). PCR cycling conditions for optimal fragment amplification were 94 °C for 5 min (1 cycle), 98 °C for 10 s, and 68 °C for 12 min (30 cycles), and 68 °C for 10 min (1 cycle). Barcoded adaptors were ligated to the PCR products to construct individual sequencing library. Then each library was quantified and pooled together by equal molarity. After purification and quantification, the pooled library was converted to single‐molecule real‐time dumbbell (SMRTbell) library with Binding kit 3.2 and cleanup beads (Pacific Biosciences) and sequenced for 30 h on the Sequel II platform (Pacific Biosciences) under circular consensus sequencing (CCS) mode.

2.2. Sample Recruitment

A total of 1020 β‐thalassemia patients were recruited from five provinces (Guangxi, Guangdong, Fujian, Jiangxi, and Hunan) in southern China from 2019 to 2022. Patients under 3 years of age and pregnant women were excluded. Basic information including age, sex, body weight, and height of each patient were recorded. By reviewing the electronic medical records of the patients, the medical history including the age at first transfusion, blood transfusion history was collected. Pre‐transfusion peripheral blood test was conducted to obtain the hematological indices of each patient. β‐Thalassemia genotypes were primarily determined by traditional PCR‐based methods, listed in Table S1 (Supporting Information). This study was approved by the ethics committee of Nanfang Hospital, Southern Medical University, and the ethics committees at each local hospital (Approval ID: NFEC‐2019‐039). All subjects and/or their guardians provided signed informed consent for participation and this study adhered to the Declaration of Helsinki.

2.3. Sequencing Data Processing and Association Analysis

A customized bioinformatics pipeline was developed and optimized for detection of structure variations and SNVs/indels, construction of haplotypes for each fragment, and correlation between variants/haplotypes and phenotype of β‐hemoglobinopathies (Figures S3S4, Supporting Information). The raw subreads in the sequencing bam files were processed to high‐fidelity CCS reads, divided by unique barcodes for different samples, and aligned to genome build hg38 in the SMRTlink software suite (Pacific Biosciences). For all the CCS reads from the four reactions, structural variations were called based on the utilization of primers on each end and the read length of CCS reads using blastn.[ 35 ] SNVs and indels of all the targeted reads were called by FreeBayes 1.3.4 with read depth ≥20 (Biomatters). The pathogenicity of variants identified in α‐ and β‐globin loci was interpreted according to the general guidelines and from information provided in hemoglobin variant databases.[ 36 , 37 , 38 ] For all the variants identified from the t‐LRS assay, the correlation between each variant and phenotype of β‐hemoglobinopathies was analyzed with Plink 1.9. For each of the 31 fragments from T‐LRS, the structural variants and variant calling files from FreeBayes were combined to generate individual variant matrix, which was then subjected to haplotype phasing using WhatsHap.[ 39 ] The haplotypes of all the samples for specific fragments were clustered and displayed with ggplot2. To generally visualize the major haplotypes in all patients of the cohort, the alleles of each locus were marked as either 0 or 1 to represent wild type or mutant allele in each patient and formed a matrix. The matrix was then deduplicated by row and clustered using pheatmap (v1.0.12) package with Pearson correlation and complete linkage method. The genotype‐phenotype association studies were conducted by generalized linear model method and visualized using the R software package ggstatsplot (v0.12.2). Survival curves of haplogroups were calculated using the Kaplan–Meier method and compared using the log‐rank test to evaluate the impact of candidate variants on clinical symptoms of β‐thalassemia patients.

For phylogenetic analysis, the haplotypes were aligned using the MAFFT (v7.520) program with default parameters, and a phylogenetic tree was generated using FastTree (v2.1.11). The resulting tree was visualized using FigTree (v1.4.4). Haplogroups were identified using a heatmap and classified using hierarchical clustering, with different colors for distinguishment. LD and haplotype blocks were constructed using the LDBlockShow software (V1.40). Transcription factor binding sites were identified by SnapGene Viewer based on the motifs of five key erythroid regulators: KLF1, BCL11A, GATA1, NFY, and TAL1. The variants potentially altered the TFBS on the globin gene clusters were plotted using the package ggplot2.

2.4. Homology Modeling

To evaluate the potential effects of the missense mutations in KLF1 and DNMT1 on protein structures, the homology models of the wild‐type and polypeptide chains were constructed using SWISS MODEL Server.[ 40 ] The one with the highest Global Model Quality Estimation (GMQE) values were selected as the optimal model for further analysis. The models were displayed using the PyMOL Molecular Graphics System, Version 1.3, Schrödinger, LLC.

2.5. Statistical Analysis

The false discovery rate (FDR) to reflect the statistical significance of genetic variants called by T‐LRS in the phenotype‐genotype association analysis was determined by Benjamini‐Hochberg FDR correction for multiple testing, implemented by Plink 1.9. The impact of rare mutations identified in KLF1 and DNMT1 on the clinical symptoms and hematological indices was evaluated by a one‐sample t‐test while the inter‐group comparisons of 1020 β‐thalassemia patients with different haplotypes in the HBG1/2 and HBA1/2 fragments were implemented by unpaired t‐test.

3. Results

3.1. Establishing and Optimizing a System for a T‐LRS Gene Panel

This panel is designed to capture 8 globin genes (HBA1, HBA2, HBB, HBD, HBG1, HBG2, HBZ, HBE), 10 modifier genes (BCL11A, KLF1, GATA1, GATAD2A, ZBTB7A, DNMT1, CHD4, KLF3, KLF8, and SIRT1) and 3 cis‐elements (HS‐40 in α‐globin gene cluster, LCR in β‐globin gene cluster and the intergenic polymorphisms in MYB‐HBS1L). We then tested the practicability of this panel by recruiting 100 samples with known genotypes of thalassemia. Upon optimization of the panel, a cohort of 1020 β‐thalassemia patients were subjected to large‐scale T‐LRS, in which single‐blind test of thalassemia mutations were performed to evaluate the capability performance, followed by detection of modifier variants, haplotypic phasing, and association studies to report variants or haplotypes associated with phenotypic changes in β‐thalassemia patients. The flowchart of this study described above is shown in Figure 2 .

Figure 2.

Figure 2

The overall flowchart of this study. The target region is composed of 8 globin genes (HBA1, HBA2, HBB, HBD, HBG1, HBG2, HBZ, HBE), 10 modifier genes (BCL11A, KLF1, GATA1, GATAD2A, ZBTB7A, DNMT1, CHD4, KLF3, KLF8, and SIRT1) and 3 cis‐elements (HS‐40 in α‐globin gene cluster, LCR in β‐globin gene cluster and the intergenic polymorphisms in MYB‐HBS1L). 100 samples with pre‐typed genotypes of thalassemia were recruited to test this panel with their detailed genotypes presented in Supplemental File S3. 1020 β‐thalassemia patients were then recruited for the evaluation of capability performance of this panel, from which the long‐sequencing read data were applied for variant detection, haplotypic phasing, and association studies.

3.2. Detection of Disease‐Causing Variants of Thalassemia and Modifier Variants

To evaluate the accuracy of the T‐LRS panel for detecting thalassemia variants, validation of the T‐LRS panel for detecting thalassemia variants was performed on genomic DNA samples with known genotypes, including SNVs/indels in HBA1 and HBA2 (Figure 3A), structural variations such as αααanti3.7, αααanti4.2, and HKαα (Figure 3B), large deletions in the α‐globin locus including ‐α3.7, ‐α4.2, –SEA, –THAI, and –FIL (Figure 3C), SNVs/indels in HBB and HBD (Figure 3D), large deletions in the β‐globin locus including Taiwanese deletion, Hb Lepore, SEA‐HPFH (South‐East Asia type hereditary persistence of fetal hemoglobin), and Chinese Gγ(Aγδβ)0 (Figure 3E). T‐LRS is also able to phase common variants in the β‐globin gene cluster, taking rs10128556 and rs2071348 in HBBP1 as an example (Figure 3F). Moreover, the long‐sequencing reads facilitate construction of the haplotypes of modifier genes (Figure 3G–J; Figure S5, Supporting Information) within one amplicon. Taken together, the established T‐LRS assay enabled comprehensive and accurate analysis of disease‐causing variants of thalassemia, variant identification and haplotype construction of modifier genes in the panel.

Figure 3.

Figure 3

Integrative Genomics Viewer (IGV) plots displaying the long CCS reads of T‐LRS for representative samples. A) IGV plots displaying the detection of variants in HBA2 (c.369C > G, c.377T > C, and c.427T > C) and HBA1 (c.84G > T, and c.364G > A). B) IGV plots displaying the detection of deletions (−α4.2 and −α3.7), duplications (ααα4.2 and ααα3.7), and structural rearrangements (HKαα) caused by unequal crossover in the α‐globin locus. The exact deletion regions of −α4.2 and −α3.7 were annotated according to IthaID 301 and 300, respectively. C) IGV plots displaying the detection of large deletions including –SEA, –THAI, and –FIL in the α‐globin locus. D) IGV plots displaying the detection of variants in HBB (c.52A > T, c.126_129del, c.165_177del, c.216_217insA, and c.316–197C > T) and HBD (c.−127T > C). E) IGV plots displaying the detection of large deletions including Taiwanese deletion, Hb Lepore, SEA‐HPFH, and Chinese Gγ + (Aγδβ)0 in the β‐globin locus. F) IGV plots displaying the detection the rs10128556 and rs2071348, as well as cis‐configuration of the two variants in HBBP1. G) IGV plot displaying the detection and cis‐configuration of reported modifying SNPs in HBS1LMYB intergenic region. H) IGV plot displaying the detection and cis‐configuration of reported modifying SNPs in intron 2 of BCL11A. I) IGV plot displaying the detection of the heterozygous variant KLF1: c.544T > C. J) IGV plot displaying the CCS reads of four CHD4 amplicons. LRS could determine the phasing inside one amplicon, but could not determine the phasing among different amplicons.

3.3. Distinguishing Highly Homologous Regions in the α‐ and γ‐Globin Genes

Genetic analysis of the two α‐globin genes is complex due to the presence of two highly homologous units spanning 4 kb, which are divided into X‐, Y‐, and Z‐homology boxes.[ 41 ] Therefore, we designed the primers to capture the two homologous units in one long fragment, in order to accurately align and distinguish variants in HBA1 and HBA2 (Figure 3A). CCS reads were subjected to variant calling and heterozygous variants were used to divide the CCS reads into two clusters. The combination of variants in each cluster constructed the specific haplotype (Figure 3A). As a representative case, T‐LRS could accurately determine whether the variant c.369C > G was located in HBA1, HBA2, or in both genes (Figures 4B, and S6, Supporting Information). Similarly, T‐LRS could distinguish the variants in highly homologous regions of γ‐globin genes allowing for haplotypic phasing (Figure 4C), such as the two known functional variants (c.−29G > A in HBG1 and g.−158C > T in HBG2) associated with HbF (Figure 4D). As recombination between the homologous HBG1 and HBG2 could cause gene conversion and deletion, the T‐LRS panel identified a conversion in this region spanning from HBG promoter to intron 2 (Figure 4D), as well as a novel 4.9 kb deletion, with the precise breakpoint detected using the split reads (Figure 4E). In summary, T‐LRS enabled precise variant calling and haplotype construction of the highly homologous regions in the α‐ and γ‐globin genes.

Figure 4.

Figure 4

T‐LRS enabled precise variant calling and haplotype construction of the highly homologous regions in the α‐ and γ‐globin genes. A) Diagram showing the two 4 kb homologous units, design of primers, variant calling and haplotype construction based on heterozygous variants in the α‐globin genes. B) IGV plots displaying representative samples that had the variant c.369C > G in both HBA1 and HBA2 genes. C) Diagram showing the highly homologous regions, design of primers, variant calling and haplotype construction based on heterozygous variants in the γ‐globin genes. D) IGV plots displaying the detection of HBG1: c.−29G > A (blue box) and HBG2: g.−158C > T (orange box), as well as cis‐configuration of the two variants in γ‐globin gene loci. The blue box highlighted the conversion of HBG1 to HBG2 in the region encompassing promoter to intron 2. E) IGV plots displaying the 4.9 kb deletion between HBG1 and HBG2, as well as the breakpoints identified by T‐LRS.

3.4. Capability Performance of Thalassemic Variant Detection in a Cohort of 1020 β‐Thalassemia Patients

After optimization of the T‐LRS panel and test with samples with known genotypes, we next recruited a cohort of 1020 β‐thalassemia patients to test the capability performance of this panel in precise diagnosis of the disease. First, we performed routine PCR‐based approaches including reverse dot blotting and gap‐PCR, which covered a total of 23 mutations including SNVs, InDels, and SVs causing α‐ or β‐thalassemia (Table S1, Supporting Information). After genotyping by the routine PCR‐based approaches, the 1020 samples were then subjected to the T‐LRS panel, which facilitates the sequencing of both disease‐causing mutations of thalassemia and variants within the modifier genes mentioned above. As a result, T‐LRS panel was able to accurately detect all the thalassemia mutations called by routine PCR‐based approaches with 100% consistency (Table S2, Supporting Information). Those additional genotyping results by T‐LRS were further validated by multiplex ligation dependent probe amplification (MLPA) and Sanger sequencing (The representative results are shown in Figure S7, Supporting Information). In general, we additionally detected 17 HPFH alleles, 6 α‐duplication alleles, 24 rare α‐thalassemia mutation alleles, and 22 rare β‐thalassemia mutation alleles (File S2 and Table S3, Supporting Information). Upon detection of the patients carrying HPFH or α‐thalassemia mutations, we next evaluated the impact of these variants on the expression of HbF and age at first transfusion (Figure S8, Supporting Information). Notably, we found that the co‐inheritance of deletional α‐thalassemia mutations is more beneficial to ameliorating the clinical severity of β‐thalassemia patients compared to those of non‐deletional α‐thalassemia mutations as evaluated by ordinary one‐way analysis of variance (ANOVA) and Dunnett's correction for multiple comparisons (Figure S8, Supporting Information). Similarly, the patients with HPFH mutations showed relatively mild symptoms due to the high extent of re‐activation of HbF analyzed by unpaired Student's t‐test (Figure S8, Supporting Information). These results highlighted the need for comprehensive analysis of disease‐causing mutations of thalassemia.

3.5. Genetic Variants and Their Phenotypic Effects Identified in the Modifier Genes/Loci

In terms of the modifier genes or cis‐elements captured in this panel, we provided mutation spectra of the modifier genes in a large cohort of patients from Chinese population (Table S4, Supporting Information). We then carried out association studies to evaluate the effect of those variants on the expression of HbF, aiming at identifying novel modifier variants associated with the extent of re‐activation of HbF, which was one of key factors modulating the clinical heterogeneity of β‐thalassemia and sickle cell anemia. Consequently, we identified 198 variants statistically associated with HbF (false discovery rate, FDR < 0.05) (File S1, Supporting Information) while the FDR was determined by Benjamini‐Hochberg FDR correction for multiple testing using Plink 1.9.

3.6. Clinical Precision Diagnosis of β‐Thalassemia Patients by Integrated Analysis of Both Common and Rare Modifier Variants

Among the 198 HbF‐associated variants, the majority of them were rare mutations with minor allele frequency (MAF) lower than 0.01 in this cohort. Notably, two rare missense variants were found with significant impact on the clinical severity of β‐thalassemia patients. The first one is NC_000019.10: g.12885335G > C (NP_006554.1:p.His299Asp) in KLF1 marked as rs137852688 and the second one is NC_000019.10:g.10151412C > T (NP_001370.1:p.Gly735Arg) in DNMT1 marked as rs1381758934. Of note, the first variant was known to be associated with HbF while the latter remains unknown of its correlation with HbF levels.[ 20 ] The carriers of GC genotype in rs137852688 are two β00 patients with abnormally high levels of HbF (48.50 and 118.87 g L−1, respectively). We therefore compared their key phenotypes with those patients with the same disease‐causing mutations of thalassemia. As a result, we found that the two patients exhibited higher levels of HbF, later age of onset reflected by the survival time without transfusion, but no significant changes in the levels of serum ferritin (Figure 5A–C). Given the limited number of cases, a one‐sample t‐test was employed for the inter‐group comparison. Similar analysis was conducted on the rs1381758934 in DNMT1 while we observed remarkable increase in the HbF levels (p = 0.038) and a trend of ameliorated clinical severity reflected by later age of onset and lower levels of serum ferritin with marginal p values (Figure 5D–F). We next explored the effects of these two functional mutations by homology modeling. Interestingly, we found that rs137852688 has led to an amino acid substitution of conserved residues in zinc finger domain 1 of KLF1 (Figure 5G), which has been reported to be associated with In (Lu) phenotype,[ 42 ] suggesting pleiotropy of this locus. Furthermore, the missense mutation in DNMT1 was predicted to alter the state of its flanking α‐helix though it was not located with any of known domains (Figure 5H).

Figure 5.

Figure 5

The effects of rare missense mutations in KLF1 and DNMT1 on the clinical severity of β‐thalassemia patients. A–C) The differences in the levels of HbF, survival time without transfusion and serum ferritin among the β‐thalassemia patients with different genotypes of rs137852688 in KLF1. D–F) The differences in the levels of HbF, survival time without transfusion and serum ferritin among the β‐thalassemia patients with different genotypes of rs1381758934 in DNMT1. G,H) The impact of the missense mutations in KLF1 and DNMT1 on protein structures modeled by Swiss‐prot and visualized in Pymol. The abbreviation of protein domains presented in Figure 5G,H were listed as below: DMAP: DMAP_binding; FRD: Cytosine specific DNA methyltransferase replication foci domain; CXXC: CXXC zinc finger domain; BAH1: Bromo adjacent homology (BAH) domain; BAH2: Bromo adjacent homology (BAH) domain; MeTfrase: C‐5 cytosine methyltransferase; EKLF1: Erythroid krueppel‐like transcription factor, transactivation 1; EKLF2: Erythroid krueppel‐like transcription factor, transactivation 2; Znf1: Zinc finger C2H2‐type domain; Znf2: Zinc finger C2H2‐type domain; Znf3: Zinc finger C2H2‐type domain. The levels of p values for the evaluation of statistical differences between the two groups were marked by asterisk. “*” means p < 0.05; “**” means p < 0.01; “***” means p < 0.001; “****” means p < 0.0001.

Upon identification of novel rare mutations associated with the expression of HbF, we next attempted to answer whether targeting and genotyping the modifier genes would better explain the clinical heterogeneity of β‐thalassemia patients. We thus evaluated the impact of three known modifier variants (rs4671393 in BCL11A, rs7776054 in MYB‐HBS1L intergenic regions, rs7482144 in HBG promoter and KLF1 mutations) on the HbF levels and clinical typing of the 1020 β‐thalassemia patients. Notably, we found accumulated effects of the known variants on the clinical phenotypes of β‐thalassemia (Table S5, Supporting Information), which means that with the increasing number of “beneficial” variants one carries, the more likely this β‐thalassemia patient might show milder clinical symptoms with higher expression of HbF. These results suggest the practicability of the T‐LRS panel in precise diagnosis and phenotyping of β‐thalassemia.

3.7. Haplotype Clustering and Haplotype‐Based Association Analysis in HBG1/HBG2 and HBA1/HBA2 Homologous Regions

By clustering the CCS reads for each β‐thalassemia sample, the two alleles of HBG1‐HBG2 region were distinguished and the variants called in each haplotype were marked. We thus yielded a matrix composed of 2040 haplotypes from the 1020 samples, which were then de‐duplicated according to the allele status and 249 unique haplotypes were finally obtained. These unique haplotypes were clustered into three groups, termed as Hap_s1, Hap_s2, and Hap_s3 (Figure 6A–C), indicating the major haplotypes in HBG1/HBG2 region existed in β‐thalassemia patients from southern China. The LD block was constructed to show the linkage of variants in this region (Figure 6D). Notably, we found that the patients carrying Hap_s1 showed significantly higher levels of HbF (mean value = 10.30 g L−1) than the patients with Hap_s2 and Hap_s3 (mean value of Hap_s2 = 3.39 g L−1, mean value of Hap_s2 = 6.10 g L−1) analyzed using unpaired t‐test (Figure 6E). We further confirmed that the elevated HbF levels associated with Hap_s1 are not due to a higher linkage with milder mutations of β‐thalassemia. This finding also suggests that Hap_s1 acts independently, driven by genetic variants within the HBG1/2 region that leads to increased levels of HbF (Figure S9, Supporting Information). The particularly high expression of HbF led to later age of onset for the patients in this group by survival analysis (Figure 6F) and lower levels of serum ferritin as well (Figure 6G). We next scanned all the variants identified from this cohort to identify 13 variants that potentially altered the binding motifs of key erythroid transcription factors (TFs) including BCL11A, GATA1, KLF1, NFY, and TAL1 (Figure 6H), among which the three variants highlighted in red were from Hap_s1. These data might facilitate identification of functional or tag SNPs from the HbF‐associated haplotype.

Figure 6.

Figure 6

The general haplotypes of HBG1/HBG2 regions and their phenotypic effects in the cohort of 1020 β‐thalassemia patients. (A) The diagram of seven fragments in the β‐globin gene cluster designed for long‐read sequencing in the T‐LRS panel. Among them, F3 covering the regions of HBG1 and HBG2 was highlighted by dotted purple lines to further show the unique haplotypes of the cohort in this fragment; (B) A heatmap displaying the haplotypes of the HBG1/HBG2 haplotypes identified from 1020 β‐thalassemia patients. A total of 249 SNVs were identified from the 1020 β‐thalassemia patients in this region. Each row represents one haplotype while the 249 grids in each row, marked in either blue or red, denote the allele information in corresponding position. The blue color stands for a reference allele in this locus while the red stands for alteration allele. A total of 209 unique haplotypes were identified and these haplotypes were clustered into 3 main groups as shown in this figure; (C) The overview of the tree clustering results of the redundant 2040 HBG1/HBG2 haplotypes in the patient population; (D) The LD block showing the linkage disequilibrium of the variants within the HBG1/HBG2 genomic regions. The deeper color in each cell indicated higher R2 values, which means higher linkage extent between the two variants of interest; (E‐G) The effects of the three major haplotypes in the β‐thalassemia patients on the expression levels of HbF (E), the transfusion‐free survival time (F) and the levels of serum ferritin (G); H) A diagram showing the SNVs in the HBG1‐HBG2 regions which potentially altered the transcription factor binding of 5 key erythroid regulators, namely KLF1, BCL11A, GATA1, NFY, TAL1. The consensus binding motifs of each TFs were shown on the right and their potential binding positions were highlighted in different colors on the horizontal bar standing for genomic region of HBG1 and HBG2. 13 SNVs marked on this bar, which were identified as statistically significant with the expression of HbF, were predicted to alter the transcription binding of one of the 5 specific regulators mentioned above.

Similar analysis on HBA1/HBA2 homologous region were then conducted and we found 5 major haplotypes from the population of 1020 β‐thalassemia patients (Figure S10, Supporting Information). Interestingly, we identified one out of the 5 haplotypes significantly associated with higher levels of HbF and later age of onset (Figure S10, Supporting Information). This unique haplotype was closely linked with the three deletional types of α‐thalassemia including –SEA, ‐α3.7, and ‐α4.2, which ameliorated the α/β‐globin imbalance and resulted in the milder symptoms of the patients (Figure S11, Supporting Information). The haplotypic phasing and association studies might allow for a better understanding toward thalassemic patient‐specific haplotypes and its impacts on the clinical severity of β‐thalassemia.

4. Discussion

In this study, we first designed a T‐LRS panel capturing the α/β‐globin gene clusters and modifier genes or loci known to regulate hemoglobin switching or hematopoiesis. A cohort of 1020 β‐thalassemia patients was then recruited and subjected to the T‐LRS panel. Though this is a cohort enriched with β‐thalassemia patients, 158 of these patients were also in co‐inheritance with α‐thalassemia mutations, allowing simultaneous evaluation of detection accuracy of both α‐ and β‐thalassemia mutations for this T‐LRS panel. We performed the design of recruiting the patient‐based cohort aiming not only at evaluating the accuracy of this panel in detecting thalassemia mutations, but also trying to assess its practicability in precise phenotyping of β‐thalassemia and dissect the haplotypes of globin gene clusters enriched with disease‐causing alleles. The combined evaluation of both thalassemia mutations and modifier variants might promote individualized prenatal diagnosis and genetic counselling. Our previous efforts on at‐risk couple screening by LRS have demonstrated that a comprehensive detection of disease‐causing SNVs and CNVs of both α‐ and β‐thalassemia is fundamental in avoiding missed diagnosis of thalassemia major fetus.[ 30 ]

After verification of capability performance of the T‐LRS panel in detecting thalassemia mutations, we focused on the genetic variants within the modifier genes or loci in this cohort. In this study, we designed to capture 10 modifier genes or 3 major cis‐elements, which enabled observation of mutation spectra within these target regions among 1020 β‐thalassemia patients. By introducing the transfusion dependencies and HbF levels of the patients as dependent variables, we were able to launch phenotype‐genotype association studies and identify 198 variants significantly correlated with the phenotypes mentioned above. The findings could be divided into “known variants in known genes” as Tier 1 results and “novel variants in known genes” as Tier 2 results. Further analysis on Tier 1 results suggested that the patients tended to have higher levels of HbF and milder symptoms with the accumulation of the variants previously reported in BCL11A, HMIP and HBG gene promoters, making this T‐LRS panel a promising tool in both precise diagnosis and phenotyping of β‐hemoglobinopathies. In terms of Tier 2 results, we aimed to provide the mutation spectra on the genes known to participate in hemoglobin switching or hematopoiesis such as CHD4, KLF3, KLF8 and DNMT1. The natural variants identified in a patient‐based population might serve as a good supplement in backing laboratorial findings by evaluating their impact on patients’ phenotypes despite the limitation that only a part of the modifier genes were included for economic reasons.[ 43 ]

Finally, we investigated the major haplotypes of the HBG1/HBG2 and HBA1/HBA2 in the β‐thalassemia cohort. The long sequencing reads spanning the homologous regions ensured unambiguous alignment of the reads as the putative length of amplification fragments reached 7122 bp for HBG1/HBG2 and 8135 bp for HBA1/HBA2, which made the haplotype phasing more accurate compared to next‐generation sequencing (NGS), an approach with sequencing reads generally ranging from 80 to 150 bp, resulting in false mapping and reduced amounts of effective reads. According to the findings of genotype‐phenotype association analysis in this study, we propose that a combined panel for the detection of α/β‐globin gene clusters and key modifier genes including BCL11A, MYB‐HBS1L, KLF1, HBG1/HBG2, and DNMT1 is capable for precise diagnosis of thalassemia while the average cost would be ≈50 US dollars per sample, which makes this method to have promising application prospects in the population screening of thalassemia. To our knowledge, this is the first population report on haplotypes of globin genes in a large‐scale β‐thalassemia patient cohort by T‐LRS, which also lead to the identification of the unique haplotypes associated with clinical phenotypes of β‐thalassemia. The population haplotype results might facilitate further studies on the identification of functional or tag SNPs among a series of highly linked variants.

Conflict of Interest

The authors declare no conflict of interest.

Author Contributions

Y.Y., C.N., and A.M. contributed equally to this work. Conceptualization was performed by Z.R. and X.X. Methodology was performed by Y.Y., C.N., A.M., J.Z., W.C., Q.Z., Y. Liu. Project administration was performed by B.L., K.L., and X.H. Investigation was performed by Q.Z., Y.Y., L.Q., L.L., J. Cai, R.L., X. Zhang, L.Z., Y. Liu, B.H., H.Q., Y. Huang, Z.H., J.L., X.Q., J. Chen, X.F., Q.L., W.L., X. Zhou, Y. Liang, X. Long, J.Q., L. Yan, W.Z., L. Yu, C.F., D.T., T.Z., J.T. Funding acquisition: XX. Original draft was written by Y.Y., C.N., and A.M. Review was written and editing was performed by Z.R. and X.X.

Supporting information

Supporting Information

Supplemental File 1

ADVS-12-2410992-s003.xlsx (26.3KB, xlsx)

Supplemental File 2

Supplemental File 3

ADVS-12-2410992-s002.xlsx (13.3KB, xlsx)

Acknowledgements

The authors thank all the participants in this research and the Thalassemia Associations (Liuzhou, Huizhou, Shenzhen, Fujian, Guangzhou) for the assistance in the recruitment of subjects. The authors also thank Zilin Li (Northeast Normal University); Xiang Guo (Sichuan Academy of Medical Sciences); Peng Xu (Soochow University); Xuan Shang, Xiaofeng Wei, Fu Xiong, Cunyou Zhao, Liang Li, Wanjun Zhou, Fei He, Zhen Zhao, Yuan Huang, Tongtong Chen, Mengyang Song, Yong Long, Congwen Shao, Shaomin Zou, Xingmin Wang, Jialong Li, Hailiang Liu, Peng Lin, Wei Zhang, Haoyang Huang, Tingfeng Zheng, Hualei Luo, Zongrui Shen, Dina Zhu, Yida Jiang (Southern Medical University); Zhenzhong Tao, Neng Cheng, Yongzhang Wu, Shaobin Chen, Shuang Gao, Songbai Zheng, Fei Yuan, Xiaoye Tang, Ziping Zhong, Fuxia Li, Jia Li, Zhiling Ling (Guangzhou Huayinkang Healthcare Group Co., Ltd.); Xiarong Li, Cuihua Hu, Ruixin Guo, Li Gao (GeneDock Co.Ltd.); Zhenyuan Xia, Shaoke Chen (The Second Affiliated Hospital of Guangxi Medical University); Zhirong Lu, Musheng Huang, Liping Huang (Sixth People's Hospital of Nanning); Xiuxia Chen, Xianjin Wu, Xinxia Li (Huizhou Central People's Hospital); Shengwen Huang (Guizhou Provincial People's Hospital); Chunchun Yuan, Yongjun Wang (Shanghai University of Traditional Chinese Medicine); Pingping Li, Bo Zhang, Li Wang, Fang Yang (923(rd) Hospital of the People's Liberation Army); Ken Huang, Huatuo Huang, Yuke Chen, Chengcai Chen, Jie Lu (Affiliated Hospital of Youjiang Medical University for Nationalities); Qi Wang, Ning Qiu, Liuyuan Wei, Xiangzi Xie, Jing Chen, Likui Wu, Libo Qin, Xiaoli Xu (Liuzhou Worker's Hospital); Limin Li, Yuchen Zhu, Lixue Cheng (Liuzhou Worker's Hospital); Xinping Yang, Zhi Liu, Zilong He, Bingquan Lin, Wanxia Tan, Jing Du, Jianhui Yue, Jing Bai, Anhong Huang, Xingyuan Hu (Nanfang Hospital); Zhixiang Liu, Ling Zeng (Heyuan Maternal and Child Health Care Hospital); Mufang Huang, Ying He, Panyan Zhou, Yanrong Hu, Dan Liu (Zhuhai People's Hospital); Shuiling Shi, Yaowei Wen, Jinhua Fu, Lili Zhang (Longyan First Hospital); Tao Huang, Bin Xu (Maternal and Child Health Hospital of Yongzhou City); Caiyun Li (Chenzhou First People's Hospital); Uet Yu, Changgang Li (Shenzhen Children's Hospital), Zhenzhou Li (Second People's Hospital of Shenzhen) for their direct and indirect help in this study. National Natural Science Foundation of China (to X.X.; grant U20A20353), National Key Research and Development Program of China (to X.X.; grant2018YFA0507800 and 2018YFA0507803).

Ye Y., Niu C., Mao A., Qin L., Zhan J., Chen W., Liu Z., Xie T., Zhang Q., Li J., Huang L., Meng W., Liu Y., Liao L., Cai J., Liu R., Zhang X., Zeng L., Li Y., Lin B., Li K., Hua X., Huang B., Qin H., Huang Y., Huang Z., Lao J., Qu X., Chen J., Feng X., Liu Q., Lin W., Zhou X., Liang Y., Long X., Qin J., Yan L., Zhu W., Yu L., Fan C., Tang D., Zhong T., Tan J., Ren Z., Xu X., Haplotype‐Resolved Genotyping and Association Analysis of 1,020 β‐Thalassemia Patients by Targeted Long‐Read Sequencing. Adv. Sci. 2025, 12, 2410992. 10.1002/advs.202410992

Data Availability Statement

The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to privacy or ethical restrictions.

References

  • 1. Muncie H. L. Jr., Campbell J., Am. Fam. Physician 2009, 80, 339. [PubMed] [Google Scholar]
  • 2. Kattamis A., Kwiatkowski J. L., Aydinok Y., Lancet 2022, 399, 2310. [DOI] [PubMed] [Google Scholar]
  • 3. Cazzalo M., Blood 2022, 139, 2460.34932791 [Google Scholar]
  • 4. Taher A. T., Saliba A. N., Hematology Am. Soc. Hematol. Educ. Program 2017, 2017, 265. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Muckenthaler M. U., Rivella S., Hentze M. W., Galy B., Cell 2017, 168, 344. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Motta I., Bou‐Fakhredin R., Taher A. T., Cappellini M. D., Drugs 2020, 80, 1053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Traeger‐Synodinos J., Harteveld C. L., Old J. M., Petrou M., Galanello R., Giordano P., Angastioniotis M., De la Salle B., Henderson S., May A., Eur. J. Hum. Genet. 2015, 23, 426. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Viprakasit V., Ekwattanakit S., Hematol. Oncol. Clin. North Am. 2018, 32, 193. [DOI] [PubMed] [Google Scholar]
  • 9. Piel F. B., Weatherall D. J., N. Engl. J. Med. 2014, 371, 1908. [DOI] [PubMed] [Google Scholar]
  • 10. Shang X., Peng Z., Ye Y., Asan, Zhang X., Chen Y., Zhu B., Cai W., Chen S., Cai R., Guo X., Zhang C., Zhou Y., Huang S., Liu Y., Chen B., Yan S., Chen Y., Ding H., Yin X., Wu L., He J., Huang D., He S., Yan T., Fan X., Zhou Y., Wei X., Zhao S., Cai D., Guo F., et al., E. Bio. Medicine 2017, 23, 150. [Google Scholar]
  • 11. Songdej D., Kadegasem P., Tangbubpha N., Sasanakul W., Deelertthaweesap B., Chuansumrit A., Sirachainan N., Br. J. Haematol. 2022, 198, 1051. [DOI] [PubMed] [Google Scholar]
  • 12. Huang W., Qu S., Qin Q., Yang X., Han W., Lai Y., Chen J., Zhou S., Yang X., Zhou W., Clin. Chem. 2023, 69, 1062. [DOI] [PubMed] [Google Scholar]
  • 13. Lunke S., Bouffler S. E., Patel C. V., Sandaradura S. A., Wilson M., Pinner J., Hunter M. F., Barnett C. P., Wallis M., Kamien B., Tan T. Y., Freckmann M. L., Chong B., Phelan D., Francis D., Kassahn K. S., Ha T., Gao S., Arts P., Jackson M. R., Scott H. S., Eggers S., Rowley S., Boggs K., Rakonjac A., Brett G. R., de Silva M. G., Springer A., Nat. Med. 2023, 29, 1681. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Chen C., Li R., Sun J., Zhu Y., Jiang L., Li J., Fu F., Wan J., Guo F., An X., Wang Y., Fan L., Sun Y., Guo X., Zhao S., Wang W., Zeng F., Yang Y., Ni P., Ding Y., Xiang B., Peng Z., Liao C., Genome Med. 2021, 13, 18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Karnpean R., Tepakhan W., Suankul P., Thingphom S., Poonsawat A., Thanunchaikunlanun N., Ruangsanngamsiri R., Jomoui W., Genes (Basel) 2022, 13, 1384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Porubsky D., Eichler E. E., Cell 2024, 187, 1024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Hariharan P., Nadkarni A., Blood Rev. 2021, 49, 100823. [DOI] [PubMed] [Google Scholar]
  • 18. Bae H. T., Baldwin C. T., Sebastiani P., Telen M. J., Ashley‐Koch A., Garrett M., Hooper W. C., Bean C. J., DeBaun M. R., Arking D. E., Bhatnagar P., Casella J. F., Keefer J. R., Barron‐Casella E., Gordeuk V., Kato G. J., Minniti C., Taylor J., Campbell A., Luchtman‐Jones L., Hoppe C., Gladwin M. T., Zhang Y., Steinberg M. H., Blood 2012, 120, 1961. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Makani J., Menzel S., Nkya S., Cox S. E., Drasar E., Soka D., Komba A. N., Mgaya J., Rooks H., Vasavda N., Fegan G., Newton C. R., Farrall M., Thein S. L., Blood 2011, 117, 1390. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Liu D., Zhang X., Yu L., Cai R., Ma X., Zheng C., Zhou Y., Liu Q., Wei X., Lin L., Yan T., Huang J., Mohandas N., An X., Xu X., Blood 2014, 124, 803. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Chen D., Zuo Y., Zhang X., Ye Y., Bao X., Huang H., Tepakhan W., Wang L., Ju J., Chen G., Zheng M., Liu D., Huang S., Zong L., Li C., Chen Y., Zheng C., Shi L., Zhao Q., Wu Q., Fucharoen S., Zhao C., Xu X., Am. J. Hum. Genet. 2017, 101, 130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Amaya M., Desai M., Gnanapragasam M. N., Wang S. Z., Zu Zhu S., Williams D. C. Jr., Ginder G. D., Blood 2013, 121, 3493. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Vinjamur D. S., Yao Q., Cole M. A., McGuckin C., Ren C., Zeng J., Hossain M., Luk K., Wolfe S. A., Pinello L., Bauer D. E., Nat. Genet. 2021, 53, 719. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Feng R., Mayuranathan T., Huang P., Doerfler P. A., Li Y., Yao Y., Zhang J., Palmer L. E., Mayberry K., Christakopoulos G. E., Xu P., Li C., Cheng Y., Blobel G. A., Simon M. C., Weiss M. J., Nature 2022, 610, 783. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Wu Y., Zeng J., Roscoe B. P., Liu P., Yao Q., Lazzarotto C. R., Clement K., Cole M. A., Luk K., Baricordi C., Shen A. H., Ren C., Esrick E. B., Manis J. P., Dorfman D. M., Williams D. A., Biffi A., Brugnara C., Biasco L., Brendel C., Pinello L., Tsai S. Q., Wolfe S. A., Bauer D. E., Nat. Med. 2019, 25, 776. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Antoniou P., Hardouin G., Martinucci P., Frati G., Felix T., Chalumeau A., Fontana L., Martin J., Masson C., Brusson M., Maule G., Rosello M., Giovannangeli C., Abramowski V., de Villartay J. P., Concordet J. P., Del Bene F., El Nemer W., Amendola M., Cavazzana M., Cereseto A., Romano O., Miccio A., Nat. Commun. 2022, 13, 6618. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Frangoul H., Altshuler D., Cappellini M. D., Chen Y. S., Domm J., Eustace B. K., Foell J., de la Fuente J., Grupp S., Handgretinger R., Ho T. W., Kattamis A., Kernytsky A., Lekstrom‐Himes J., Li A. M., Locatelli F., Mapara M. Y., de Montalembert M., Rondelli D., Sharma A., Sheth S., Soni S., Steinberg M. H., Wall D., Yen A., Corbacioglu S., N. Engl. J. Med. 2021, 384, 252. [DOI] [PubMed] [Google Scholar]
  • 28. Mayuranathan T., Newby G. A., Feng R., Yao Y., Mayberry K. D., Lazzarotto C. R., Li Y., Levine R. M., Nimmagadda N., Dempsey E., Kang G., Porter S. N., Doerfler P. A., Zhang J., Jang Y., Chen J., Bell H. W., Crossley M., Bhoopalan S. V., Sharma A., Tisdale J. F., Pruett‐Miller S. M., Cheng Y., Tsai S. Q., Liu D. R., Weiss M. J., Yen J. S., Nat. Genet. 2023, 55, 1210. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Xu L., Mao A., Liu H., Gui B., Choy K. W., Huang H., Yu Q., Zhang X., Chen M., Lin N., Chen L., Han J., Wang Y., Zhang M., Li X., He D., Lin Y., Zhang J., Cram D. S., Cao H., J. Mol. Diagn. 2020, 22, 1087. [DOI] [PubMed] [Google Scholar]
  • 30. Liang Q., He J., Li Q., Zhou Y., Liu Y., Li Y., Tang L., Huang S., Li R., Zeng F., Mao A., Liu Y., Liang D., Wu L., Clin. Chem. 2023, 69, 239. [DOI] [PubMed] [Google Scholar]
  • 31. Liang Q., Gu W., Chen P., Li Y., Liu Y., Tian M., Zhou Q., Qi H., Zhang Y., He J., Li Q., Tang L., Tang J., Teng Y., Zhou Y., Huang S., Lu Z., Xu M., Hou W., Huang T., Li Y., Li R., Hu L., Li S., Guo Q., Zhuo Z., Mou Y., Cram D. S., Wu L., J. Mol. Diagn. 2021, 23, 1195. [DOI] [PubMed] [Google Scholar]
  • 32. Li S., Han X., Xu Y., Chang C., Gao L., Li J., Lu Y., Mao A., Wang Y., J. Mol. Diagn. 2022, 24, 1009. [DOI] [PubMed] [Google Scholar]
  • 33. Liu Y., Chen M., Liu J., Mao A., Teng Y., Yan H., Zhu H., Li Z., Liang D., Wu L., Clin. Chem. 2022, 68, 927. [DOI] [PubMed] [Google Scholar]
  • 34. Liu Y., Li D., Yu D., Liang Q., Chen G., Li F., Gao L., Li Z., Xie T., Wu L., Mao A., Wu L., Liang D., Thromb. Haemost. 2023, 123, 1151. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Ye J., McGinnis S., Madden T. L., Nucleic Acids Res. 2006, 34, W6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Giardine B., Borg J., Viennas E., Pavlidis C., Moradkhani K., Joly P., Bartsakoulia M., Riemer C., Miller W., Tzimas G., Wajcman H., Hardison R. C., Patrinos G. P., Nucleic Acids Res. 2014, 42, D1063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Kountouris P., Lederer C. W., Fanis P., Feleki X., Old J., Kleanthous M., PLoS One 2014, 9, e103020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Richards S., Aziz N., Bale S., Bick D., Das S., Gastier‐Foster J., Grody W. W., Hegde M., Lyon E., Spector E., Voelkerding K., Rehm H. L., A. L. Q. A , Genet. Med. 2015, 17, 405. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Martin M., Ebert P., Marschall T., Methods Mol. Biol. 2023, 2590, 127. [DOI] [PubMed] [Google Scholar]
  • 40. Schwede T., Kopp J., Guex N., Peitsch M. C., Nucleic Acids Res. 2003, 31, 3381. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Farashi S., Harteveld C. L., Blood Cells Mol. Dis. 2018, 70, 43. [DOI] [PubMed] [Google Scholar]
  • 42. Singleton B. K., Burton N. M., Green C., Brady R. L., Anstee D. J., Blood 2008, 112, 2081. [DOI] [PubMed] [Google Scholar]
  • 43. Karczewski K. J., Francioli L. C., Tiao G., Cummings B. B., Alfoldi J., Wang Q., Collins R. L., Laricchia K. M., Ganna A., Birnbaum D. P., Gauthier L. D., Brand H., Solomonson M., Watts N. A., Rhodes D., Singer‐Berk M., England E. M., Seaby E. G., Kosmicki J. A., Walters R. K., Tashman K., Farjoun Y., Banks E., Poterba T., Wang A., Seed C., Whiffin N., Chong J. X., Samocha K. E., Pierce‐Hoffman E., et al., Nature 2020, 581, 434. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information

Supplemental File 1

ADVS-12-2410992-s003.xlsx (26.3KB, xlsx)

Supplemental File 2

Supplemental File 3

ADVS-12-2410992-s002.xlsx (13.3KB, xlsx)

Data Availability Statement

The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to privacy or ethical restrictions.


Articles from Advanced Science are provided here courtesy of Wiley

RESOURCES