Abstract
The molecular landscape of acute lymphoblastic leukemia (ALL) is highly heterogeneous, and genetic lesions are clinically relevant for diagnosis, risk stratification, and treatment guidance. Next-generation sequencing (NGS) has become an essential tool for clinical laboratories, where disease-targeted panels are able to capture the most relevant alterations in a cost-effective and fast way. However, comprehensive ALL panels assessing all relevant alterations are scarce. Here, we design and validate an NGS panel including single-nucleotide variants (SNVs), insertion–deletions (indels), copy number variations (CNVs), fusions, and gene expression (ALLseq). ALLseq sequencing metrics were acceptable for clinical use and showed 100% sensitivity and specificity for virtually all types of alterations. The limit of detection was established at a 2% variant allele frequency for SNVs and indels, and at a 0.5 copy number ratio for CNVs. Overall, ALLseq is able to provide clinically relevant information to more than 83% of pediatric patients, making it an attractive tool for the molecular characterization of ALL in clinical settings.
Keywords: next-generation sequencing, NGS, molecular characterization, childhood acute lymphoblastic leukemia
1. Introduction
Acute lymphoblastic leukemia (ALL) is the most common type of pediatric cancer. The current cure rate reaches 80–90%, but it decreases with age, and the prognosis of relapsed patients is very poor [1,2].
ALL is caused by the clonal proliferation of immature lymphocytes due to the accumulation of genetic alterations, which gives rise to different biological and clinical subtypes of leukemia. The presence of genetic lesions that drive distinct subtypes of leukemia has set the bases of ALL classification, in which novel categories defined by point mutations (such as PAX5 p.P80R or IKZF1 p.N159N) and additional gene fusions (i.e., ZNF384 or MEF2D rearrangements) have been recently acknowledged [3,4]. Some of these variants, as well as copy number variation (CNV) affecting other genes, are used for risk stratification, which drives treatment intensity [5,6]. Genetic lesions can further be utilized for targeted therapy selection, against either the affected gene or the altered signaling pathway [7,8]. Therefore, as molecular variants are used as diagnostic, prognostic, and predictive biomarkers, their identification is being increasingly demanded by pediatricians and hematologists.
Next-generation sequencing (NGS) allows the simultaneous assessment of a broad number of targets in several samples and has therefore become an indispensable tool for clinical laboratories. Different approaches have been developed in order to optimize resources and shorten the turnaround time, with pathology-directed panels being the preferred option in most centers [9]. However, the availability of commercial ALL NGS panels is scarce, as pan-hematological assays usually lack relevant genes or omit CNV identification.
The purpose of this work was to design and validate an ALL-targeted NGS panel (ALLseq), including clinically relevant point mutations, insertion–deletions (indels), CNVs, fusions, and gene expression. Our results show that ALLseq constitutes a useful tool for patient characterization, identifying driver and secondary alterations in a single experiment, thus allowing accurate diagnosis, patient risk stratification, and (in some cases) treatment selection.
2. Results
2.1. ALLseq Design
The ALLseq design included the targets listed in Table 1 and Supplemental Table S1, for which point mutations, indels, CNVs, fusions, and/or gene expression can be assessed.
Table 1.
DNA (2 Primer Pools): 96,7 kb; 1138 Amplicons, 97.91% Coverage | RNA (2 Primer Pools) | ||
---|---|---|---|
HotSpots (SNVs and Indels; 22 Genes) | Whole Coding Sequence (SNVs and Indels; 32 Genes) | Fusions (271 Fusions, 634 Isoforms) | Expression Quantitation (7 Genes) |
CREBBP *, CRLF2, DNMT3A, EP300, EZH2, FBXW7, FLT3, IDH1/2, IL7R, JAK1/2/3, NOTCH1, PAX5, PIK3CA, PTPN11, K/NRAS, SETD2, STAT5B, and SH2B3 * | AKT, BCL11B, BTG1, CDKN2A/B *, DNM2, EBF1, EED, ERG *, ETV6 *, GATA3, IKZF1 *, IL2RB, KDM6A, LEF1, NF1, NT5C, PAX5 *, PHF6, PTEN, PTPN2, RB1 *, RUNX1 *, STAG2, SUZ12, TET2, TP53, and WT1 | Main drivers: ABL1/2, KMT2A, TCF3, ETV6, EPOR, CSF1R, FLT3, JAK2, PDGFRA/B, LYN, NTRK1/2/3, TYK2, FGFR1, IL2RB, TSLP, PAX5, NOTCH1, MEF2D, ZNF384, and MYB/L1. Main fusions: P2RY8::CRLF2, SET::NUP214, PICALM::MLLT10, and STIL::TAL | CRLF2, HOXA, LMO2, NKX2, TAL1, TLX1, and TLX3 |
* CNV can be assessed.
2.2. ALLseq Set-Up and Sequencing Metrics
The maximum chip efficiency was obtained when 200 ng of the library pool (4:1 DNA:RNA) was used for template preparation. Under these conditions, the mean chip load was 88.3% (35.88% polyclonal; 55.6% usable reads), which yielded a mean of 18,451,079 total reads.
The mean quality metrics per sample showed a read depth of 1903×, on-target and uniformity percentages > 95%, and a median absolute pairwise difference (MAPD) mean value of 0.19. The mean read depth < 100× was found on 9/1138 amplicons (0.8%).
We compared the ALLseq main sequencing metrics with those obtained with the Oncomine Childhood Research Assay (OCCRA). The OCCRA is a commercial pan-pediatric cancer panel, where some ALL targets are present. No significant differences were found for the on-target % or the percentage of amplicons showing a low mean read depth (<100×) (Figure 1A,B). However, the median uniformity % and MAPD were lower in ALLseq (uniformity: 96.46 vs. 97.77, p < 0.05; MAPD 0.16 vs. 0.25) (Figure 1C,D).
Focusing on RNA results, the mean total number of RNA mapped reads per sample was 358,240. All of the analyzed samples passed the quality control test (>20,000 total mapped reads).
2.3. Technical Validation Strategy and ALLseq Performance
For technical validation, 25 molecularly characterized samples were selected. Nineteen carried SNVs or indels in DNM2, EP300, FBXW7, FLT3, H3F3A, IKZF1, JAK1, JAK3, KMT2D, NOTCH1, NRAS, PHF6, PTPN11, or TP53. Eighteen samples harbored CDKN2A, CDKN2B, IKZF1, JAK2, RB1, PAX5, and/or ETV6 deletions. Six samples were used as DNA negative controls for SNVs/indels, and seven for CNVs.
Seventeen RNA samples carried one of the following fusions: TCF3::ZNF384, KMT2A::MLLT10, BCR::ABL1, TCF3::PBX1, ARID1B::ZNF384, PICALM::MLLT10, TERF::JAK2, IGH::CRLF2 (does not generate a fusion transcript but overexpresses CRLF2), ETV6::RUNX1, KMT2A::AFF1, STIL::TAL1 (overexpresses TAL1), or EBF1::PDGFRB. Eight samples were used as negative RNA controls.
2.3.1. SNVs and Indels
ALLseq detected all of the expected SNVs and indels; KMT2D c.8743C>T and H3F3A c.82A>G were not detected, as these genes were not included in the ALLseq design. ALLseq identified two additional variants: DNM2 c.2080G>T (not included in OCCRA’s design) and KRAS c.34G>C (confirmed by direct sequencing and Minor Variant Finder analysis) (Supplemental Table S2). The VAF of the 34 overlapping variants showed a high correlation between ALLseq and OCCRA (R2 = 0.93), and the Bland-Altman plot showed that 93.9% of the VAF values were in agreement within the 95% confidence intervals (Figure 2).
2.3.2. CNVs
ALLseq detected at least one CNV in 18/25 (72%) samples and a total of 39 CNVs: CDKN2A—9/39 (23.1%), CDKN2B—9/39 (23.1%), ETV6—6/39 (15.4%), PAX5—6/39 (15.4%), IKZF1—4/39 (10.3%), JAK2—2/39 (5.1%), RB1—2/39 (5.1%), and EBF1—1/39 (2.6%). Eight discrepancies were observed between ALLseq and MLPA (three for IKZF1, two for PAX5, one for ETV6, one for EBF1, and one for RB1) (Supplemental Table S3). Altogether, the CNV Cohen’s kappa coefficient was 0.88.
2.3.3. Fusions and Gene Expression
ALLseq detected fusions and/or high gene expression in 18/25 samples (72%). No false positives were observed (Supplemental Table S4).
RNA 10 showed CRLF2 expression 229-fold higher than the median, consistent with the presence of a CRLF2 translocation at the IGH locus in this sample. Similarly, RNA 18, which harbored a STIL::TAL1 fusion (also identified by ALLseq), expressed TAL1 25-fold compared to the median (Supplemental Table S5). The expression values of CRLF2 and TAL1 measured by ALLseq and RT-qPCR showed a high correlation (R2 = 0.98 and 0.90, respectively) (Supplemental Figure S1). As expected, TLX1, TLX3, NKX2-1, and HOXAA expressions were undetectable in all patients, as these genes are not expressed in bone marrow or peripheral blood unless they are deregulated. LMO2 showed a stable basal expression in all samples. Overall, RNA results were in 100% agreement with the expected results.
A summary of ALLseq performance can be found in Table 2 and Supplemental Table S4.
Table 2.
Alteration Type | Sensitivity | Specificity | PPV | NPV | Precision |
---|---|---|---|---|---|
SNVs and indels | 100% | 100% | 100% | 100% | 100% |
CNVs (Cohen kappa coefficient = 0.88) |
88.87% | 97.92% | 92.85% | 95.91% | 96.45% |
Fusions | 100% | 100% | 100% | 100% | 100% |
Gene expression | 100% | 100% | 100% | 100% | 100% |
2.4. Limit of Detection, Reproducibility, Repeatability, and Linearity
The limit of detection (LoD) for SNVs and indels was established at 2% of VAF. Intra-experiment repeatability and inter-experiment reproducibility showed a 100% concordance above this VAF, and a high correlation (R2 ≥ 0.98) was observed between inter- and intra-sequencing runs (Figure 3, Supplemental Table S6).
The CNV LoD was established at a copy number ratio of 0.5, corresponding with a heterozygous deletion, when, at least, confidence > 20 and precision > 10 were reached (Table 3). Different analysis rounds showed 100% repeatability and reproducibility for CNV analysis.
Table 3.
Copy Number Ratio | ||||
---|---|---|---|---|
CDKN2A/B | IKZF1 | |||
Expected | Observed | Expected | Observed | |
Dil1 (3:1) | 0.5 | <1 | 0.25 | ND 1 |
Dil2 (1:1) | 1 | 1 | 0.5 | <1 |
Dil3 (1:2) | 0.66 | <1 | 0.66 | <1 |
1 ND = not detected.
Regarding fusion expression quantitation, serial dilutions yielded a linear range up to the 10−4 dilution (Supplemental Figure S2).
2.5. Clinical Validation: Prospective Sequencing
In total, 43 correlative patients were prospectively analyzed with ALLseq, whose main characteristics are shown in Supplemental Table S7.
DNA sequencing identified 54 SNVs or indels, resulting in a mean of 1.26 variants per sample. Twenty-five cases (58.1%) harbored at least one mutation. The genes with the highest mutational frequency were KRAS (18.52%) (10/54), NRAS (14.81%) (8/54), PTPN11 (14.81%) (8/54), and NOTCH1 (9.26%) (5/54). All of the SNVs and indels were confirmed by direct Sanger sequencing (Supplemental Table S8, Supplemental Figure S3A).
Sixty-eight CNVs were found in 28 out of 43 samples (65.12%). The mean number of affected genes was 1.58 (range 0–6) per patient. The most frequently deleted gene was CDKN2A (19.12%; 13/68), followed by CDKN2B (16.18%; 11/68) and ETV6 (8.82%; 6/68). Notably, ALLseq detected additional copies of RUNX1 and TP53 in patients carrying chromosome 17 and 21 gains (including a patient harboring intrachromosomic amplification of chromosome 21, iAMP21), respectively (Figure 4, Supplemental Figure S3B). Results from 5/43 (11.63%) patients were discordant with MLPA (Supplemental Table S8).
Overall, a total of 122 SNVs, indels, and CNVs were detected in the DNA. Genes affected by these alterations were grouped according to the signaling pathway in which they have a role. The most frequently altered were the TP53-cell cycle (30.33%; 37/122), followed by the RAS (18.03%; 22/122) and lymphoid differentiation (17.21%; 21/122) pathways. Interestingly, the RAS pathway was only affected by SNVs or indels and represented 41.07% (22/54) of these alterations (Supplemental Figure S4).
ALLseq detected fusions in 11/43 samples (25.6%). The most frequent fusion was ETV6::RUNX1, identified in 5/43 patients (11.6%), followed by KMT2A rearrangements and STIL::TAL1, each detected in 2/43 patients (4.6%) (Supplemental Table S8). All the fusions were confirmed by orthogonal methods.
Regarding gene expression, 3 out of 43 (6.9%) patients overexpressed CRLF2. One harbored the t(X;14)(p22;q32) translocation (detected by cytogenetics and FISH); another carried the CRLF2::CSF2RA fusion, also detected by ALLseq and FISH; and the third patient showed a pseudoautosomic region 1 (PAR1) amplification, which contains CRLF2. Furthermore, the two samples harboring STIL::TAL1 (detected by ALLseq and FISH) overexpressed TAL1 (Figure 5). Additionally, two patients with translocated TLX3 (confirmed by FISH) met the overexpression criteria for this gene (Supplemental Table S8).
Of note, we detected ectopic TLX1 and NKX2-1 expression in two patients. These genes are not usually expressed in hematopoietic tissue unless deregulated, but TLX1 was expressed in one T-cell patient harboring the t(7;17)(q31;q12) translocation (identified by conventional cytogenetics), and NKX2-1 was expressed in a T-cortical patient for which no molecular mechanism was found.
Overall, the combined DNA and RNA results showed a total of 142 alterations. Comprehensive molecular and basic clinical data are shown in Figure 6.
Next, we tested the ALLseq clinical yield by exclusively classifying prospective patients according to these NGS results. Only pathogenic variants complying with at least one of these criteria were considered as clinically relevant, i.e., (a) variants that define World Health Organization (WHO 2022) and/or the International Consensus Classification of myeloid neoplasms and acute leukemias (ICC 2022) categories [3,4] (of note, ALL classification was updated during the development of this project so we used the latest classifications, although the design was based on the previous versions); (b) variants defining genetic risk groups considered by the ALLTogether treatment protocol; and (c) variants allowing patient selection for targeted therapy according to the ALLTogether protocol or active clinical trials.
Under these premises, 63/142 (44.37%) of the pathogenic variants were considered as clinically relevant. Subsequently, ALL patients were allocated into three groups, depending on the clinical utility of the alteration(s) they carried: diagnosis, risk stratification, and/or targeted therapy. Moreover, 12/43 (27.91%) patients carried entity-defining alterations, 32/43 (74.42%) harbored risk-associated lesions, and 10/43 (23.26%) were suitable for targeted therapy (Figure 7A). Co-occurrence among these categories is shown in Figure 7B. Overall, 36/43 (83.72%) patients could benefit from molecular findings derived from ALLseq. Among the seven remaining patients, four harbored aneuploidies, and three did not show any additional molecular lesion.
Finally, we analyzed the clinical utility of ALLseq and cytogenetics as independent or combined techniques. When used as a stand-alone method, ALLseq provided information related to prognosis or treatment to more patients than cytogenetics, while slightly more patients benefited from cytogenetics than ALLseq for the identification of ALL entities. By combining both techniques, most patients (40/43; 93.02%) could be diagnosed, classified into risk groups, and/or benefit from targeted therapies (Figure 7C).
3. Discussion
In the present study, an ALL-targeted NGS panel was designed and validated. ALLseq allows clinically relevant SNVs, indels, CNVs, fusions, and gene expression alterations to be detected, which, to the best of our knowledge, makes our panel unique.
ALLseq was conceived specifically for somatic analysis. However, recent research points to germline variants as a driving mechanism in familiar cases [10]. In suspected cases, germline origin must be confirmed in culture skin fibroblasts according to current recommendations. If confirmed, the patient should be assigned to a specialized unit [11,12].
ALLseq sequencing metrics were equivalent to those reported by commercial panels such as the OCCRA, which were also in line with its Illumina counterpart [13]. The overall performance of ALLseq, as assessed by sensitivity, specificity, PPV, NPV, and accuracy, was 100% on SNVs, indels, and fusions, and only CNV identification was slightly poorer.
The reliable identification of CNVs using NGS continues to be a challenge due to unequal target coverage [14]; however, ALLseq CNV reliability was acceptable according to Cohen’s criterion.
The LoD was set at 2% VAF for SNVs and indels, which is acceptable in the clinical context, as most protocols establish 5% as the cut-off value for considering variants as clinically relevant [15,16]. Although the LoD is sufficient for SNV and indel detection, special attention to low blast % samples is required when analyzing CNV and gene expression with NGS, as recommended by Jennings et al. [17].
The main limitation of ALLseq, like most targeted panels, is its inability to identify aneuploidies, which are present in up to 30% of pediatric ALL cases. However, these alterations are easily identified by cytogenetic techniques, which are routinely performed in all laboratories. It cannot detect DUX4 deregulation (which represents 7% of B-ALL), which has been considered an independent entity by the WHO 2022 and ICC 2022 classifications and confers favorable prognosis [18]. DUX4 deregulation is technically difficult to detect, as it is located within a repetitive region on chromosome 4q, with an almost identical locus on 10q. Thus, primers can bind to multiple loci on both 4q and 10q. Additionally, DUX4 fusions show great variability in breakpoints [19].
ALLseq allows 634 fusions to be identified (including ABL-class translocations and virtually all of the class-defining fusions), as well as the aberrant expression of seven genes including CRLF2. Notably, up to 50% of Ph-like ALLs overexpress this gene due to fusions, mutations, and alterations in the JAK-STAT pathway [20]. Therefore, with the ALLseq design, most Ph-like patients can be diagnosed, allowing the identification of candidates for targeted therapies with tyrosine kinase or JAK-STAT inhibitors (NCT03571321). In fact, we were able to define the ALL subtype of around 30% of patients, among whom 7% were Ph-like cases overexpressing CRLF2.
The recent update of the WHO 2022 and new ICC ALL classifications included, for the first time in this disease, categories defined by point mutations. Moreover, the potential use of targeted therapies in patients carrying SNVs or indels highlighted the clinical relevance of these alterations in ALL. In fact, ALLseq identified point mutations in FLT3, the NOTCH1 pathway, or JAK family genes, for which targeted inhibitors have been developed [21], in 16% of patients.
Regarding CNV identification, IKZF1 deletions have been classically recognized as conferring poor prognosis [22]. More recently, several European groups have developed different CNV profiles that are significant for risk stratification. In particular, the COALL has proposed the IKZF1plus group, defined by the deletion of IKZF1 co-occurring with at least one additional deletion in CDKN2A/Bhomo, PAX5, or the pseudo autosomic region 1 (PAR1) in the absence of ERG deletion, which distinguishes high-risk ALL patients who benefit from treatment intensification [23]. Similarly, the British group proposes a CNV profile (UKALL-CNA) involving IKZF1, CDKN2A/B, PAR1, BTG1, EBF1, PAX5, ETV6, and RB1 to refine risk groups [24]. The ALLTogether (NCT04307576) treatment protocol incorporates the UKALL-CNA risk stratification, making CNV assessment mandatory. With ALLseq, we were able to correctly risk-stratify around 90% of patients.
Cytogenetic approaches have been the main diagnostic tool in ALL, given the exclusive importance of aneuploidies and a few translocations in this disease just a decade ago [25]. However, the development of “omic” technologies has substantially broadened the spectrum of molecular lesions that explain the onset of ALL. In this context, NGS has been incorporated as a complementary tool into most clinical laboratories [26]. Our results show that the combination of ALLseq and cytogenetics provide clinical information to virtually all patients, reducing the number and type of assays necessary for ALL characterization (RT-PCR, MLPA, SNP arrays, etc.). It is worth mentioning emerging technologies, such as optical genome mapping, which will surely be useful for the analysis of ALL patients. This technique, unlike NGS targeted panels, is not restricted to a list of genes and therefore is able to detect novel alterations [27]. Moreover, it can detect numerical and structural chromosome alterations as well as gene-level gains or losses, which makes it very useful for ALL characterization. However, it is unable to detect SNVs and indels which, as discussed above, are currently needed for ALL diagnoses [3,4]. The continuous availability of novel genomic methodologies makes it difficult to define the optimal technology(ies) for ALL characterization. The molecular knowledge of ALL is an evolving field; therefore, the best diagnostic approach has to be flexible and adapt to conform to the latest guidelines. In this context, interdisciplinary groups carrying out an integrated diagnosis become essential in configuring the diagnostic workflow of ALL [28].
In conclusion, ALLseq allows the most frequent alterations in ALL to be identified. Although there are certain limitations to be considered when interpreting the results, the panel constitutes a useful tool for patient characterization and management, as it allows the identification of driver and secondary alterations in a single experiment, thus permitting accurate diagnosis, patient risk stratification, and (in some cases) treatment selection.
4. Materials and Methods
4.1. Patient Samples and Inclusion Criteria
The study included pediatric and adolescent (≤18 years old) ALL patients diagnosed at Hospital Universitari i Politècnic La Fe (Valencia, Spain). Inclusion criteria were as follows: availability of high-quality DNA and RNA from bone marrow or peripheral blood, and written informed consent in accordance with the recommendations of the Declaration of Human Rights and the Conference of Helsinki. The Institutional Ethics Committee for Clinical Research approved this study (approval numbers 2021-045-1 and 2022-09-04).
4.2. ALLseq: An ALL-Targeted Custom NGS Panel
A custom panel targeting ALL (ALLseq) was designed using the White Gloves Service from Thermo Fisher Scientific. Target selection was based on its potential clinical utility according to 2 levels of evidence:
Level 1: clinical guidelines and clinical trials: (a) alterations included in the WHO classification of hematolymphoid tumors in force at the time of the start of the study [29]; (b) alterations defining genetic ALL subtypes [30]; (c) alterations used for risk stratification by international cooperative groups [31], NCT04307576]; (d) alterations used for potential targeted therapy [32].
Level 2: other pathogenic alterations described in large cohorts: (a) variants that cluster into specific subtypes of ALL [15,33]; (b) variants associated with good or bad prognosis but not currently used for patient risk stratification [34]; (c) variants that confer resistance to specific drugs in vitro/in vivo experiments [35].
The sequencing workflow was carried out on Ion Torrent platforms (Thermo Fisher Scientific, San Francisco, CA, USA). DNA libraries were generated from 10 ng of DNA, with an initial PCR consisting of 17 cycles and 4 min of extension time; for RNA libraries, cDNA was generated with the SuperScript™ IV VILO™ kit (Thermo Fisher Scientific) from 10 ng of total RNA, and PCR was performed with 20 cycles and 4 min of extension time. Library and template preparation was carried out automatically on the Ion Chef™ Instrument (Thermo Fisher Scientific) using the Ion AmpliSeq™ Kit for Chef DL8 (Thermo Fisher Scientific) and the Ion 510™ & Ion 520™ & Ion 530™ Kit-Chef (Thermo Fisher Scientific), respectively. Libraries from eight samples were loaded onto an Ion 530™ Chip (Thermo Fisher Scientific) and sequenced on an Ion S5 sequencer (Thermo Fisher Scientific).
4.3. Data Analysis
Human genome build 19 was used as the reference genome. Base calling was performed on Torrent Suite software version 5.10.0 (Thermo Fisher Scientific). Variant identification was accomplished with the Variant Caller Plugin (Thermo Fisher Scientific), and variant annotation was performed using Ion Reporter (IR) software version 5.10.3.0 (Thermo Fisher Scientific).
For CNV assessment, the normalized read depth of each sample was compared with that of the reference baseline (generated by sequencing 20 healthy controls). The IR software applied an algorithm based on a hidden Markov model, which predicts the copy number or the ploidy state. A copy number of 2 was considered as normal, values ≥ 3 were considered as amplifications, and a copy number ratio of one or zero suggested heterozygous or homozygous deletions, respectively.
4.4. ALLseq Validation Strategy
For technical validation, 25 retrospective patients harboring a >90% blast count in bone marrow or peripheral blood and a complete molecular characterization were selected. A second validation round was carried out by sequencing sequential unbiased ALL samples to assess the clinical utility of the panel.
Complementary Molecular and Cytogenetic Methods
All the ALL samples were analyzed at diagnosis by the following methods: conventional cytogenetics, an ALL FISH custom panel (Cytocell Ltd., Cambridge, UK), RT-PCR to asses ETV6::RUNX1 and BCR::ABL1 fusions [36], and MLPA SALSA P335 ALL-IKZF1 (MRC Holland, Amsterdam, NL).
The Oncomine Childhood Research Assay (OCCRA; Thermo Fisher Scientific) was used retrospectively to further characterize ALL samples following the manufacturer’s instructions. After sequencing, variant filtering was performed on IR software version 5.10.3.0 (Thermo Fisher Scientific). For ALLseq technical validation, intronic and synonym variants were filtered out, whereas pathogenic and likely pathogenic variants, as well as variants of unknown significance (VUS), were retained.
Variants detected by NGS (OCCRA and/or ALLseq) were confirmed by Sanger sequencing and Minor Variant Finder software (Thermofisher Scientific) (SNVs and indels) or qRT-PCR (fusions and gene expression alterations). Fusion characterization and gene expression were assessed by qRT-PCR on a LightCycler 480 II (Roche Diagnostics, Switzerland, AG) using Sybr green and ABL1 as the control gene. Primers and PCR parameters are described in Supplemental Tables S9 and S10. For selected samples, optical genome mapping (OGM) was used following the manufacturer’s instructions in order to confirm CNVs not included in the MLPA SALSA P335 ALL-IKZF1.
4.5. ALLseq Technical Validation
4.5.1. Run Metrics and Quality Criteria
A DNA result was considered evaluable if it met the following requirements: mean read depth ≥ 1500× per sample; uniformity and on-target reads ≥ 80%; and MAPD < 0.5, confidence > 20, and precision > 10 for CNV analysis. Regarding RNA, a minimum number of 20,000 mapped reads was established; gene and fusion expression levels were calculated as [(target reads × 1000)/total RNA reads].
The mean on-target and uniformity percentages, depth of coverage, and MAPD yielded by ALLseq were compared with those from the OCCRA.
4.5.2. Assessment of Analytical Performance
Sensitivity [true-positive (TP)/(TP + false-negative (FN))], specificity [true negative (TN)/(TN + false-positive (FP))], precision [(TP + TN)/n], positive predictive values (PPVs) [TP/(TP + FP)], and negative predictive values (NPVs) [FN/(FN + TN)] were assessed by comparing ALLseq results with data from orthogonal techniques described above. The Bland–Altman method was used to assess the variant allele frequency (VAF) agreement between ALLseq and OCCRA panels. CNV detection reliability was further evaluated with Cohen’s kappa coefficient, where a value > 0.8 indicates a high agreement with the gold-standard method.
Fusion expression linearity was assessed by diluting an ETV6::RUNX1-positive sample at 1:10, 1:100, and 1:1000 ratios into a negative control. Gene expression was quantified by ALLseq and qRT-PCR and compared. The overexpression cutoff was established at 105 expression units (target reads × 104/total mapped reads). In order to calculate analytical performance for gene expression, expression units were dichotomized (overexpression vs. no overexpression) according to the cutoff criteria.
4.5.3. Limit of Detection, Reproducibility, and Repeatability
To assess the LoD, repeatability, and reproducibility, a DNA pool from samples harboring NOTCH1 (p.Phe2509fs, VAF 6.55%; p.Phe1606_Lys1607insAspSerPro, VAF 7.25%), NRAS (p.Gly12Cys, VAF 17.47%), KRAS (p.Leu19Phe, VAF 16.37%), and/or DNM2 (p.Arg123Ter, VAF 21.16%) was created. Two serial VAF dilutions (ratios of 1:2 and 1:4) were prepared using a wild-type control sample. For each dilution, two independent libraries were sequenced twice in back-to-back experiments. In these experiments, a coefficient of variation (CV) ≤ 20% was considered acceptable for VAF values.
In order to obtain the CNV LoD, two samples harboring CDKN2A/B homozygous deletion (CDKN2A/Bhomo) and IKZF1 heterozygous deletion (IKZF1hetero), respectively, were combined at different ratios (CDKN2A/Bhomo:IKZF1hetero; 3:1, 1:1, and 1:2) creating different copy number ratios. A sample pool was analyzed in three consecutive experiments to test the reproducibility and repeatability.
4.6. ALLseq Clinical Validation
A total of 43 correlative patients were prospectively analyzed. In these patients, pathogenic and likely pathogenic variants were confirmed with complementary methods, as described above.
4.7. Statistical Analysis
Medians of quantitative variants were compared with Mann–Whitney’s U test. Qualitative parameters were compared with the chi-square’s test and Cohen’s kappa coefficient. A Bland–Altman plot was used to assess the agreement between VAF values obtained by ALLseq and OCCRA panels. Box plots were generated with BoxPlot R (http://shiny.chemgrid.org/boxplotr, accessed on 16 December 2022). A Circos plot was generated as described by Krzywinski et al. [37]. A landscape diagram was created using Oviz-bio, a free web-based platform for interactive data visualization [38].
Acknowledgments
The authors thank V. Latorre for medical writing support.
Supplementary Materials
The supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms24054440/s1.
Author Contributions
Conceptualization, M.L. and E.B.; methodology, J.V.G., E.S., C.S., J.S., A.M., G.P. and G.A.; validation, J.V.G., E.S., A.I.V. and S.R.; formal analysis, J.V.G., M.S., C.F., J.M.F. and M.L.; investigation, J.V.G. and M.L.; resources, M.L., I.d.J., S.P. and E.B.; writing—original draft preparation, J.V.G. and M.L.; writing—review and editing, M.L. and E.B.; supervision, E.B.; funding acquisition, M.L. and E.B. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
The study was conducted in accordance with the Declaration of Human Rights and the Conference of Helsinki and approved by the Institutional Ethics Committee of IIS LA FE (approval numbers 2021-045-1 and 2022-09-04).
Informed Consent Statement
Informed consent was obtained from all subjects involved in the study or the patients’ parents or legal guardians.
Data Availability Statement
All the data and materials are available upon reasonable request to llop_margar@gva.es.
Conflicts of Interest
The authors declare no conflict of interest.
Funding Statement
This research was funded by Generalidad Valenciana, Conselleria de Innovación, Universidades, Ciencia y Sociedad Digital (grant numbers GV/2019/077 and GV/2021/105), as well as Instituto de Investigación Carlos III (grant number PI19/00730).
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
References
- 1.Lejman M., Chałupnik A., Chilimoniuk Z., Dobosz M. Genetic Biomarkers and Their Clinical Implications in B-Cell Acute Lymphoblastic Leukemia in Children. Int. J. Mol. Sci. 2022;23:2755. doi: 10.3390/ijms23052755. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Advani A.S. Novel Strategies in the Treatment of Acute Lymphoblastic Leukaemia. Lancet Haematol. 2022;9:e240–e241. doi: 10.1016/S2352-3026(22)00054-0. [DOI] [PubMed] [Google Scholar]
- 3.Alaggio R., Amador C., Anagnostopoulos I., Attygalle A.D., Araujo I.B.D.O., Berti E., Bhagat G., Borges A.M., Boyer D., Calaminici M., et al. The 5th Edition of the World Health Organization Classification of Haematolymphoid Tumours: Lymphoid Neoplasms. Leukemia. 2022;36:1720–1748. doi: 10.1038/s41375-022-01620-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Arber D.A., Orazi A., Hasserjian R.P., Borowitz M.J., Calvo K.R., Kvasnicka H.-M., Wang S.A., Bagg A., Barbui T., Branford S., et al. International Consensus Classification of Myeloid Neoplasms and Acute Leukemias: Integrating Morphologic, Clinical, and Genomic Data. Blood. 2022;140:1200–1228. doi: 10.1182/blood.2022015850. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Hamadeh L., Enshaei A., Schwab C., Alonso C.N., Attarbaschi A., Barbany G., den Boer M.L., Boer J.M., Braun M., Dalla Pozza L., et al. Validation of the United Kingdom Copy-Number Alteration Classifier in 3239 Children with B-Cell Precursor ALL. Blood Adv. 2019;3:148–157. doi: 10.1182/bloodadvances.2018025718. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Braun M., Pastorczak A., Sędek Ł., Taha J., Madzio J., Jatczak-Pawlik I., Wypyszczak K., Matysiak M., Derwich K., Lejman M., et al. Prognostic Significance of IKZF1 Deletions and IKZF1plus Profile in Children with B-Cell Precursor Acute Lymphoblastic Leukemia Treated According to the ALL-IC BFM 2009 Protocol. Hematol. Oncol. 2022;40:430–441. doi: 10.1002/hon.2973. [DOI] [PubMed] [Google Scholar]
- 7.Lato M.W., Przysucha A., Grosman S., Zawitkowska J., Lejman M. The New Therapeutic Strategies in Pediatric T-Cell Acute Lymphoblastic Leukemia. Int. J. Mol. Sci. 2021;22:4502. doi: 10.3390/ijms22094502. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Malczewska M., Kośmider K., Bednarz K., Ostapińska K., Lejman M., Zawitkowska J. Recent Advances in Treatment Options for Childhood Acute Lymphoblastic Leukemia. Cancers. 2022;14:2021. doi: 10.3390/cancers14082021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Pervez M.T., Hasnain M.J.U., Abbas S.H., Moustafa M.F., Aslam N., Shah S.S.M. A Comprehensive Review of Performance of Next-Generation Sequencing Platforms. Biomed Res. Int. 2022;2022:3457806. doi: 10.1155/2022/3457806. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
- 10.Churchman M.L., Qian M., Te Kronnie G., Zhang R., Yang W., Zhang H., Lana T., Tedrick P., Baskin R., Verbist K., et al. Germline Genetic IKZF1 Variation and Predisposition to Childhood Acute Lymphoblastic Leukemia. Cancer Cell. 2018;33:937–948.e8. doi: 10.1016/j.ccell.2018.03.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Luo X., Feurstein S., Mohan S., Porter C.C., Jackson S.A., Keel S., Chicka M., Brown A.L., Kesserwan C., Agarwal A., et al. ClinGen Myeloid Malignancy Variant Curation Expert Panel Recommendations for Germline RUNX1 Variants. Blood Adv. 2019;3:2962–2979. doi: 10.1182/bloodadvances.2019000644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Zhang J., Walsh M.F., Wu G., Edmonson M.N., Gruber T.A., Easton J., Hedges D., Ma X., Zhou X., Yergeau D.A., et al. Germline Mutations in Predisposition Genes in Pediatric Cancer. N. Engl. J. Med. 2015;373:2336–2346. doi: 10.1056/NEJMoa1508054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Vicente-Garcés C., Esperanza-Cebollada E., Montesdeoca S., Torrebadell M., Rives S., Dapena J.L., Català A., Conde N., Camós M., Vega-García N. Technical Validation and Clinical Utility of an NGS Targeted Panel to Improve Molecular Characterization of Pediatric Acute Leukemia. Front. Mol. Biosci. 2022;9:854098. doi: 10.3389/fmolb.2022.854098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Teo S.M., Pawitan Y., Ku C.S., Chia K.S., Salim A. Statistical Challenges Associated with Detecting Copy Number Variations with Next-Generation Sequencing. Bioinformatics. 2012;28:2711–2718. doi: 10.1093/bioinformatics/bts535. [DOI] [PubMed] [Google Scholar]
- 15.Richards S., Aziz N., Bale S., Bick D., Das S., Gastier-Foster J., Grody W.W., Hegde M., Lyon E., Spector E., et al. Standards and Guidelines for the Interpretation of Sequence Variants: A Joint Consensus Recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. Off. J. Am. Coll. Med. Genet. 2015;17:405–424. doi: 10.1038/gim.2015.30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Pandzic T., Ladenvall C., Engvall M., Mattsson M., Hermanson M., Cavelier L., Ljungström V., Baliakas P. Five Percent Variant Allele Frequency Is a Reliable Reporting Threshold for TP53 Variants Detected by next Generation Sequencing in Chronic Lymphocytic Leukemia in the Clinical Setting. HemaSphere. 2022;6:e761. doi: 10.1097/HS9.0000000000000761. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Jennings L.J., Arcila M.E., Corless C., Kamel-Reid S., Lubin I.M., Pfeifer J., Temple-Smolkin R.L., Voelkerding K.V., Nikiforova M.N. Guidelines for Validation of Next-Generation Sequencing-Based Oncology Panels: A Joint Consensus Recommendation of the Association for Molecular Pathology and College of American Pathologists. J. Mol. Diagn. JMD. 2017;19:341–365. doi: 10.1016/j.jmoldx.2017.01.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Lilljebjörn H., Henningsson R., Hyrenius-Wittsten A., Olsson L., Orsmark-Pietras C., von Palffy S., Askmyr M., Rissler M., Schrappe M., Cario G., et al. Identification of ETV6-RUNX1-like and DUX4-Rearranged Subtypes in Paediatric B-Cell Precursor Acute Lymphoblastic Leukaemia. Nat. Commun. 2016;7:11790. doi: 10.1038/ncomms11790. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Rehn J.A., O’Connor M.J., White D.L., Yeung D.T. DUX Hunting—Clinical Features and Diagnostic Challenges Associated with DUX4-Rearranged Leukaemia. Cancers. 2020;12:2815. doi: 10.3390/cancers12102815. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Palmi C., Silvestri D., Bronzini I., Cario G., Savino A., Paganin M., Buldini B., Galbiati M., Muckenthaler M.U., Arico M., et al. CRLF2 Over-Expression Is a Poor Prognostic Marker in Children with High Risk T-Cell Acute Lymphoblastic Leukemia. Blood. 2014;124:1071. doi: 10.1182/blood.V124.21.1071.1071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Li D., Li T., Shang Z., Zhao L., Xu Q., Tan J., Qin Y., Zhang Y., Cao Y., Wang N., et al. Combined Inhibition of Notch and FLT3 Produces Synergistic Cytotoxic Effects in FLT3/ITD+ Acute Myeloid Leukemia. Signal Transduct. Target. Ther. 2020;5:21. doi: 10.1038/s41392-020-0108-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Felice M.S., Rubio P.L., Digiorge J., Barreda Frank M., Martínez C.S., Guitter M.R., Sajaroff E.O., Sánchez La Rosa C.G., Pennella C.L., Peruzzo L.B., et al. Impact of IKZF1 Deletions in the Prognosis of Childhood Acute Lymphoblastic Leukemia in Argentina. Cancers. 2022;14:3283. doi: 10.3390/cancers14133283. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Stanulla M., Dagdan E., Zaliova M., Möricke A., Palmi C., Cazzaniga G., Eckert C., te Kronnie G., Bourquin J.-P., Bornhauser B., et al. IKZF1plus Defines a New Minimal Residual Disease–Dependent Very-Poor Prognostic Profile in Pediatric B-Cell Precursor Acute Lymphoblastic Leukemia. J. Clin. Oncol. 2018;36:1240–1249. doi: 10.1200/JCO.2017.74.3617. [DOI] [PubMed] [Google Scholar]
- 24.Moorman A.V., Enshaei A., Schwab C., Wade R., Chilton L., Elliott A., Richardson S., Hancock J., Kinsey S.E., Mitchell C.D., et al. A Novel Integrated Cytogenetic and Genomic Classification Refines Risk Stratification in Pediatric Acute Lymphoblastic Leukemia. Blood. 2014;124:1434–1444. doi: 10.1182/blood-2014-03-562918. [DOI] [PubMed] [Google Scholar]
- 25.Grimwade D., Walker H., Oliver F., Wheatley K., Harrison C., Harrison G., Rees J., Hann I., Stevens R., Burnett A., et al. The Importance of Diagnostic Cytogenetics on Outcome in AML: Analysis of 1,612 Patients Entered into the MRC AML 10 Trial. Blood. 1998;92:2322–2333. doi: 10.1182/blood.V92.7.2322. [DOI] [PubMed] [Google Scholar]
- 26.Llop M., Sargas C., Barragán E. The Role of Next-Generation Sequencing in Acute Myeloid Leukemia. Curr. Opin. Oncol. 2022;34:723–728. doi: 10.1097/CCO.0000000000000899. [DOI] [PubMed] [Google Scholar]
- 27.Gulilat M., Lamb T., Teft W.A., Wang J., Dron J.S., Robinson J.F., Tirona R.G., Hegele R.A., Kim R.B., Schwarz U.I. Targeted next Generation Sequencing as a Tool for Precision Medicine. BMC Med. Genom. 2019;12:81. doi: 10.1186/s12920-019-0527-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Lippi G., Plebani M. Integrated Diagnostics. Biochem. Med. 2020;30:18–30. doi: 10.11613/BM.2020.010501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Arber D.A., Orazi A., Hasserjian R., Thiele J., Borowitz M.J., Le Beau M.M., Bloomfield C.D., Cazzola M., Vardiman J.W. The 2016 Revision to the World Health Organization Classification of Myeloid Neoplasms and Acute Leukemia. Blood. 2016;127:2391–2405. doi: 10.1182/blood-2016-03-643544. [DOI] [PubMed] [Google Scholar]
- 30.Hunger S.P., Mullighan C.G. Acute Lymphoblastic Leukemia in Children. N. Engl. J. Med. 2015;373:1541–1552. doi: 10.1056/NEJMra1400972. [DOI] [PubMed] [Google Scholar]
- 31.Inaba H., Mullighan C.G. Pediatric Acute Lymphoblastic Leukemia. Haematologica. 2020;105:2524–2539. doi: 10.3324/haematol.2020.247031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Mengxuan S., Fen Z., Runming J. Novel Treatments for Pediatric Relapsed or Refractory Acute B-Cell Lineage Lymphoblastic Leukemia: Precision Medicine Era. Front. Pediatr. 2022;10:923419. doi: 10.3389/fped.2022.923419. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Brady S.W., Roberts K.G., Gu Z., Shi L., Pounds S., Pei D., Cheng C., Dai Y., Devidas M., Qu C., et al. The Genomic Landscape of Pediatric Acute Lymphoblastic Leukemia. Nat. Genet. 2022;54:1376–1389. doi: 10.1038/s41588-022-01159-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Hunger S.P., Mullighan C.G. Redefining ALL Classification: Toward Detecting High-Risk ALL and Implementing Precision Medicine. Blood. 2015;125:3977–3987. doi: 10.1182/blood-2015-02-580043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Frismantas V., Dobay M.P., Rinaldi A., Tchinda J., Dunn S.H., Kunz J., Richter-Pechanska P., Marovca B., Pail O., Jenni S., et al. Ex Vivo Drug Response Profiling Detects Recurrent Sensitivity Patterns in Drug-Resistant Acute Lymphoblastic Leukemia. Blood. 2017;129:e26–e37. doi: 10.1182/blood-2016-09-738070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Gabert J., Beillard E., van der Velden V.H.J., Bi W., Grimwade D., Pallisgaard N., Barbany G., Cazzaniga G., Cayuela J.M., Cavé H., et al. Standardization and Quality Control Studies of “Real-Time” Quantitative Reverse Transcriptase Polymerase Chain Reaction of Fusion Gene Transcripts for Residual Disease Detection in Leukemia—A Europe against Cancer Program. Leukemia. 2003;17:2318–2357. doi: 10.1038/sj.leu.2403135. [DOI] [PubMed] [Google Scholar]
- 37.Krzywinski M., Schein J., Birol I., Connors J., Gascoyne R., Horsman D., Jones S.J., Marra M.A. Circos: An Information Aesthetic for Comparative Genomics. Genome Res. 2009;19:1639–1645. doi: 10.1101/gr.092759.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Jia W., Li H., Li S., Chen L., Li S.C. Oviz-Bio: A Web-Based Platform for Interactive Cancer Genomics Data Visualization. Nucleic Acids Res. 2020;48:8203–8204. doi: 10.1093/nar/gkaa553. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All the data and materials are available upon reasonable request to llop_margar@gva.es.