Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2024 Apr 1.
Published in final edited form as: Leuk Lymphoma. 2022 Dec 28;64(4):788–798. doi: 10.1080/10428194.2022.2157215

The Inherited Genetic Contribution and Polygenic Risk Score for Risk of CLL and MBL: A Narrative Review

Geffen Kleinstern 1,2, Susan L Slager 2,3
PMCID: PMC10121840  NIHMSID: NIHMS1871968  PMID: 36576061

Abstract

Chronic lymphocytic leukemia (CLL) is a neoplasm of B-cells in the blood and monoclonal B-cell lymphocytosis (MBL) is a precursor state to CLL. This narrative review provides an overview of the genetic studies that identified 43 common variants associated with risk of CLL among individuals of European ancestry. Emerging studies found that ~50% of these variants are associated with MBL risk. Moreover, the polygenic risk score (PRS) calculated from these CLL variants has been shown to be a robust predictor for both CLL and MBL risk among European ancestry individuals but a weak predictor among African ancestry individuals. By summarizing these genetic studies, we conclude that additional studies are needed in other race/ethnic populations to identify race specific susceptibility variants, that functional studies are needed to validate the biological mechanisms of the variants, and that the clinical utility of the PRS is limited until preventive strategies for CLL are developed.

Introduction

Chronic lymphocytic leukemia (CLL) is a neoplasm of mature B-cells, with >5×109 B-cells/L in the peripheral blood1. These CLL cells typically co-express CD5, CD19, CD20dim and CD23. Monoclonal B-cell lymphocytosis (MBL) is a precursor state to CLL2-4, with a clonal absolute B-cell count of <5×109/L in the peripheral blood and notable absence of lymphadenopathy, cytopenias, or organomegaly1. MBL can be classified by the size of the clone: low-count MBL (LC-MBL; clonal B-cell count <0.5x109/L) or high-count MBL (HC-MBL; clonal B-cell count between 0.5 and 5x109/L)1,5-7 and by the immunophenotype, with the most common being CLL-like MBL (i.e., co-expression of CD5 and CD20dim)3,8-10.

In this narrative review we provide an overview of the familial risk and the inherited genetic contribution to the risk of CLL and MBL as evident in the literature. We then summarize the genetic studies that have been done to identify common inherited variants for risk of CLL and MBL and briefly discuss the the emerging functional aspects of the CLL loci. We describe the polygenic risk score (PRS) and its association with risk of CLL and MBL, and we discuss the clinical utility of the PRS for risk prediction. Finally, we conclude with future directions.

Familial Risk and Heritability of CLL and MBL

CLL has a strong familial risk of disease11-13. The largest study to date demonstrating this was in the Swedish Family Cancer Study in which the authors identified 18,521 primary CLL patients and their first-degree relatives (N=43,075)13. Using the standardized incidence ratio, the authors compared the rate of CLL in first-degree relatives to that of the expected rate from the general population. The authors reported an estimated 5.64-fold familial risk of CLL (95% confidence interval [CI]: 5.04-6.03). This familial risk increased to 6.99 for relatives of CLL cases who were diagnosed <65 years of age and increased to 27.13 for relatives of CLL cases with two or more first-degree relatives with CLL.

There is some evidence of a familial component to the risk of MBL3,14-16. In a study of 72 MBL cases and 380 controls, the authors found a 2.1-fold increased risk of MBL among those with any family history of hematological cancer (CI:0.87-4.93)14. A larger study of 365 individuals with MBL and 2,512 controls known not to have MBL reported an association of 1.44 (CI: 0.99-2.09) for those with any first-degree relative with leukemia or lymphoma15. There is also evidence of a familial component observed by a higher prevalence of MBL (15-22%) in families with two or more relatives with CLL3,16,17 compared to the 5-12% prevalence in the general population18,19.

The genetic heritability of CLL is notably high, even when compared to other cancers20. Heritability is a measure of how genetics influences the variability of a trait such as CLL. A large study of 203,000 monozygotic and dizygotic twin pairs from the Norwegian Twin Cancer Study reported that up to 57% of leukemias may be attributed to genetic factors20. Another approach of estimating heritability is through using genotyping data from large genotyping arrays, e.g., genome wide association studies (GWAS). Using this alternative approach, the estimated heritability of CLL was reported to be 34% (±5%), with the potential to explain 57% of the overall familial risk, suggesting that more variants are yet to be identified21. Collectively, these findings support a strong inherited genetic contribution to the risk of CLL and MBL.

GWAS of CLL

GWASs are study designs used to identify common inherited variants associated with risk of disease. Over the past 15 years, GWASs have identified and validated 43 single nucleotide polymorphisms (SNPs) associated with CLL risk among individuals of European ancestry21-29. These susceptibility SNPs are common with minor allele frequencies >1%, have modest effect sizes ranging between 1.13-1.65 (Figures 1-2), and are near or in genes plausibly linked to CLL, including genes involved in apoptosis (including FAS, PMAIP1, BAK1, BCL2, BCL2L11, BMF, CASP8/CASP10), telomere function (POT1, TERT, TERC), transcription factors important in B-cell differentiation (IRF8, LEF1, PRKD3, SP140), and B-cell receptor activation (IRF3, HLA-DQA1). The putative biological mechanism of many of these established CLL SNPs is through epigenomic regulation30-32. Specifically, using bioinformatic in silico approaches, strong evidence suggests that these susceptibility SNPs regulate genes encoding lineage transcription factors, BCL-2 family proteins, or cyclin-dependent kinase inhibitors that play important roles in oncogenesis or apoptiosis30-32; however, no functional studies have been done to validate these findings. Moreover, there has been little evidence of interaction among these SNPs, compatible with them being independent effects, and none of the SNPs have individually shown a strong relationship with age at diagnosis of CLL or sex21-29.

Figure 1: List of the first 10 SNPs found to be associated with CLL risk and the association with CLL and MBL risk across five studies.

Figure 1:

InterLymph CLL study by Berndt et al. including 4,486 CLL and 13,183 controls (Black); AA CLL study by Kleinstern et al. including 173 AA CLL cases from the AA CLL cohort and 235 AA controls from the Mayo Clinic Biobank (red); MBL meta-analysis study by Crowther-Swanepoel et al. including 419 individuals with MBL and 1,753 controls obtained from three sites (blue); GEC MBL study by Kleinstern et al. including 95 familial MBL individuals and 1,267 Mayo controls (blue); Mayo Clinic MBL study by Kleinstern et al. including 560 individuals with MBL from the Mayo Clinic and 2,631 controls from the Mayo Clinic Biobank (blue); SNPs, single nucleotide polymorphisms; CLL, chronic lymphocytic leukemia; MBL, monoclonal B-cell lymphocytosis; InterLymph, International Lymphoma of Epidemiology Consortium; AA, African ancestry; GEC, genetic epidemiology of CLL Consortium; OR, odds ratio; CI, confidence interval;

Figure 2: List of additional 31 SNPs found to be associated with CLL risk and the association with CLL and MBL risk across four studies.

Figure 2:

InterLymph CLL study by Berndt et al. including 4,486 CLL and 13,183 controls (Black); AA CLL study by Kleinstern et al. including 173 AA CLL cases from the AA CLL cohort and 235 AA controls from the Mayo Clinic Biobank (red); GEC MBL study by Kleinstern et al. including 95 familial MBL individuals and 1,267 Mayo controls (blue); Mayo Clinic MBL study by Kleinstern et al. including 560 individuals with MBL from the Mayo Clinic and 2,631 controls from the Mayo Clinic Biobank (blue); SNPs, single nucleotide polymorphisms; CLL, chronic lymphocytic leukemia; MBL, monoclonal B-cell lymphocytosis; InterLymph, International Lymphoma of Epidemiology Consortium; AA, African ancestry; GEC, genetic epidemiology of CLL Consortium; OR, odds ratio; CI, confidence interval;

There does appear to be heterogeneity of the SNP associations by race. In an East Asian population of 71 CLL patients and 1,273 controls, authors evaluated 6 CLL susceptibility SNPs; the SNPs near the IRF4 (rs872071), SP140 (rs13397985), and ACOXL (rs17483466) genes were associated with CLL risk (nominal P<0.05), with a suggestive association with GRAMD1B (rs735665)33. Several studies have evaluated CLL susceptibility SNPs among individuals of African ancestry15,34. In the largest study to date, the authors evaluated 41 CLL susceptibility SNPs in 173 CLL cases and 235 controls (Figure 1-2)15. Of the 41 SNPs, only two were nominally significant: near LEF1 (rs7690934, odds ratio [OR]=1.41, CI:1.03-1.95, P=0.03) and CDKN2B genes (rs1679013, OR=1.56, CI:1.08-2.25, P=0.02). In general, the minor allele frequencies of these SNPs in the Asian and African ancestry populations were much lower than that in populations of European ancestry, supporting the hypothesis that the lower prevalence of CLL genetic risk factors might explain part of the lower incidence of CLL in East Asian and African ancestry populations.

Genetic Risk of MBL

Because MBL is a precursor condition to CLL, evaluating the genetic factors associated with MBL risk may provide insight into pathogenesis of CLL. It is important to note that the prevalence of MBL is significantly higher than that of CLL, yet not everyone who has MBL progresses to CLL. Individuals with HC-MBL progress to CLL requiring therapy about 1-2% per year35,36. Individuals with LC-MBL who have two or more family members with CLL progress to CLL about 1.1% per year, supporting the role of inherited genetics plays on risk of CLL3. The progression rate of LC-MBL to CLL among individuals in the general population is unknown but estimated to be <0.5% per year.

To date, no GWAS of MBL has been done. However, a number of studies have evaluated the known CLL susceptibility SNPs with risk of MBL (Table 1)15,24,37,38. One of the more robust studies (totaling 419 individuals with MBL and 1,753 controls ascertained from three case-control series) evaluated SNPs from ten CLL susceptibility loci37. Of note, the majority of the individuals with MBL in this study were HC-MBL. The authors found that six of the ten SNPs (2q37.1, 2q13, 2q37.3, 6p25.3, 8q24.21, 11q24.1) were statistically significant (P<0.05) (Figure 1). Another large study of genetic risk of MBL was from the Mayo Clinic that examined 41 CLL susceptibility SNPs in 560 individuals with MBL and 2,631 controls15. Unique to this Mayo Clinic study is that the controls were known not to have MBL through screening. The authors found that 21 (51%) of the 41 SNPs were nominally statistically significant (Table 1, Figure 1-2). The Mayo Clinic study was the first to stratify the MBLs by size of the B-cell clone with 396 individuals with LC-MBL and 164 individuals with HC-MBL. Although the authors found that the majority of the observed effect sizes of the SNPs were directionally consistent with those reported in CLL, only 12 SNPs were nominally associated with risk of LC-MBL and 10 SNPs were nominally associated with risk of HC-MBL. The lack of significance for risk of MBL overall or by subtype of MBL may be due to lack of statistical power. However, it is interesting to note that 13 SNPs from the Mayo Clinic study showed little evidence of an association with risk of MBL with OR<1.1 (P>0.05, Table 1, Figure 1-2) yet these SNPs are established SNPs for risk of CLL. Collectively these studies suggest that some of the CLL susceptibility SNPs may be associated with developing the B-clone at the MBL stage and other SNPs may be associated with progression of the B-cell clone reaching the HC-MBL or CLL stage. More work is needed to tease out this hypothesis.

Table 1:

List of 41 SNPs associated with CLL risk and their association with MBL risk across three studies

Locus Nearest Gene(s) Lead SNP in
Published
fine mapping
GWAS
Risk
Allele
Results from Law et al Nat Commun (2017). Effect sizes
from CLL GWAS used for the
PRS calculations)
Results from Kleinstern et al. Blood (2018). Association
between GEC MBL (n=95) vs
Mayo controls (n=1267)
Results from Kleinstern et al. Leukemia (2022). Mayo Clinic
Study. Association between
MBL (n=560) vs controls
(n=2631)
Results from Crowther et al. Blood (2010). Meta-analysis of three
MBL studies totaling 419 MBLs vs
1753 controls
OR (95% CI) P OR (95% CI) P OR§ (95% CI) P SNP OR (95% CI)
1p36.11 MDS2 rs34676223 C 1.19 (1.14-1.25) 5.04E-13 1.29 (0.91-1.83) 0.15 1.09 (0.93-1.27) 0.29
1q42.13 RHOU rs41271473 G 1.19 (1.13-1.26) 1.06E-10 0.98 (0.68-1.41) 0.9 0.97 (0.82-1.15) 0.72
2p22.2 QPCT, PRKD3 rs888096 A 1.15 (1.09-1.21) 5.20E-08 1.10 (0.81-1.50) 0.54 1.07 (0.93-1.23) 0.34
2q13 ACOXL, BCL2L11 rs1002015 C 1.30 (1.23-1.37) 2.23E-23 1.55 (1.12-2.15) 0.008 1.26 (1.10-1.45) 8.9x10 −4 rs17483466 1.27 (1.04-1.56)
rs58055674 C 1.41 (1.32-1.5) 2.02E-27 1.32 (0.94-1.86) 0.11 1.26 (1.06-1.50) 0.01
rs6708784 G 1.30 (1.24-1.37) 2.67E-25 1.14 (0.85-1.54) 0.39 1.15 (1.00-1.31) 0.05
2q33.1 CASP10/CASP8 rs7558911 A 1.18 (1.12-1.24) 5.05E-11 1.11 (0.82-1.49) 0.52 1.21 (1.05-1.38) 0.01
2q37.1 SP110, SP140 rs34004493 G 1.39 (1.31-1.47) 3.67E-32 1.44 (1.05-1.98) 0.03 1.45 (1.25-1.68) 1.1x10 −6 rs13397985 1.40 (1.13-1.72)
2q37.3 FARP2 rs757978 T 1.29 (1.2-1.39) 5.80E-11 1.45 (0.93-2.25) 0.1 1.17 (0.96-1.44) 0.12 1.38 (1.05-1.82)
3p24.1 EOMES rs9880772 A 1.16 (1.11-1.22) 1.91E-09 1.02 (0.76-1.39) 0.88 1.00 (0.87-1.15) 0.98
3q26.2 MYNN, TERC rs1317082 A 1.19 (1.12-1.26) 5.77E-09 1.24 (0.86-1.77) 0.25 1.24 (1.05-1.46) 0.01
4q24 BANK1 rs13107612* C 1.17 (1.11-1.22) 1.37E-10 1.25 (0.89-1.76) 0.2 1.08 (0.93-1.26) 0.28
4q25 LEF1 rs7690934 C 1.16 (1.11-1.22) 6.08E-09 1.20 (0.88-1.65) 0.24 1.07 (0.93-1.23) 0.33
4q35.1 LOC728175 rs57214277 T 1.13 (1.08-1.18) 3.69E-08 0.99 (0.73-1.34) 0.96 1.17 (1.02-1.34) 0.02
5p15.33 TERT rs7705526 A 1.18 (1.12-1.25) 5.90E-10 1.38 (1.02-1.87) 0.04 1.17 (1.02-1.35) 0.03
6p25.3 IRF4 rs9392504 A 1.33 (1.26-1.4) 9.81E-29 1.52 (1.12-2.07) 0.01 1.26 (1.10-1.44) 0.001 rs872071 1.27 (1.06-1.51)
6p21.32 HLA rs9271176 G 1.29 (1.22-1.36) 3.16E-20 1.20 (0.86-1.66) 0.29 1.12 (0.96-1.30) 0.14
6p21.31 BAK1 rs210143 C 1.26 (1.19-1.33) 5.77E-16 1.07 (0.77-1.48) 0.7 1.03 (0.89-1.20) 0.69
6p21.31 C6orf106 rs3800461 C 1.20 (1.13-1.28) 1.97E-08 1.16 (0.74-1.82) 0.52 1.11 (0.90-1.37) 0.31
6q25.2 IPCEF1 rs4869818 G 1.15 (1.09-1.21) 4.11E-08 1.02 (0.76-1.37) 0.9 1.00 (0.88-1.15) 0.97
7q31.33 POT1 rs2267708 T 1.16 (1.1-1.22) 8.55E-09 1.33 (0.99-1.79) 0.06 1.26 (1.10-1.44) 0.001
8q22.3 ODF1 rs2511713 G 1.17 (1.1-1.23) 6.04E-08 1.03 (0.74-1.44) 0.84 1.30 (1.12-1.51) 0.001
8q24.21 POU5F1B rs2466029 G 1.23 (1.17-1.3) 7.47E-16 1.12 (0.82-1.54) 0.47 1.15 (1.00-1.31) 0.05 rs2456449 1.31 (1.10-1.57)
9p21.3 CDKN2B-AS1 rs1679013 C 1.16 (1.1-1.22) 2.17E-08 1.17 (0.87-1.58) 0.3 1.16 (1.02-1.33) 0.03
10q23.31 ACTA, FAS rs6586163 A 1.23 (1.17-1.29) 1.14E-15 1.27 (0.94-1.71) 0.13 1.22 (1.06-1.40) 0.004
11p15.5 C11orf21, TSPAN32 rs2651823 A 1.18 (1.13-1.25) 5.24E-11 1.16 (0.86-1.56) 0.33 1.02 (0.89-1.17) 0.81
11q23.2 TMPRSS5 rs61904987 T 1.24 (1.16-1.32) 2.46E-11 1.00 (0.64-1.58) 0.98 1.08 (0.88-1.32) 0.48
11q24.1 SCN3B, GRAMD1B rs35923643 G 1.63 (1.53-1.72) 4.26E-58 1.73 (1.26-2.39) 0.001 1.48 (1.26-1.73) 9.1x10 −7 rs735665 1.63 (1.32-2.02)
12q24.13 OAS3 rs6489882 G 1.16 (1.1-1.22) 4.76E-08 1.24 (0.91-1.68) 0.17 1.14 (0.99-1.31) 0.07
15q15.1 BMF, BUB1B rs8024033 C 1.26 (1.2-1.32) 7.13E-19 1.33 (0.98-1.80) 0.07 1.10 (0.96-1.26) 0.16
15q21.3 RFX7, NEDD4 rs142215530 G 1.39 (1.29-1.5) 2.46E-18 0.92 (0.57-1.49) 0.74 1.20 (0.97-1.48) 0.09 rs7169431 1.30 (0.95-1.77)
15q23 RPLP1 rs11637565 G 1.35 (1.28-1.42) 1.96E-31 1.50 (1.11-2.02) 0.008 1.22 (1.06-1.41) 0.01 rs7176508 1.18 (0.99-1.41)
16q24.1 IRF8 rs391855 A 1.34 (1.27-1.41) 1.25E-28 1.75 (1.25-2.44) 0.001 1.36 (1.18-1.56) 1.9x10 −5
rs305065 C 1.16 (1.1-1.22) 7.57E-08 1.11 (0.82-1.52) 0.5 0.95 (0.82-1.10) 0.47 rs305061 1.16 (0.97-1.40)
18q21.1 CXXC1 rs1036935 A 1.15 (1.10-1.21) 3.27E-08 1.47 (1.07-2.04) 0.02 1.02 (0.87-1.20) 0.79
18q21.32 PMAIP1 rs4368253 C 1.17 (1.11-1.24) 1.26E-08 0.92 (0.70-1.27) 0.62 1.17 (1.01-1.35) 0.04
18q21.33 BCL2 rs77551289 A 1.37 (1.25-1.5) 1.83E-11 1.38 (0.76-2.52) 0.29 1.32 (1.03-1.69) 0.03
rs4987852 C 1.32 (1.2-1.44) 4.66E-09 2.33 (1.47-3.67) 2.9x10 −4 1.32 (1.03-1.69) 0.03
19p13.3 ZBTB7A rs7254272 A 1.17 (1.10-1.73) 4.67E-08 1.73 (1.22-2.46) 0.002 1.08 (0.90-1.29) 0.43
19q13.3 PRKD2, STRN4 rs874460 C 1.24 (1.15-1.34) 3.37E-08 1.18 (0.73-1.90) 0.49 1.12 (0.92-1.36) 0.27 rs11083846 0.99 (0.81-1.22)
22q13.33 NCAPH2 rs140522 T 1.15 (1.10-1.20) 2.70E-09 1.07 (0.78-1.47) 0.66 1.21 (1.05-1.40) 0.01

CLL, chronic lymphocytic leukemia; MBL, monoclonal B-cell lymphocytosis; GEC, genetic epidemiology of CLL Consortium; SNPs, single nucleotide polymorphisms; OR, Odds Ratio; CI, Confidence Interval

§

Adjusted for age, sex and PC3

*

Proxy for rs71597109; Bold, statistically significant P<0.05

Polygenic Risk Score (PRS) and Risk of CLL Among Individuals of European Ancestry

Since the effect sizes of the individual CLL susceptibility SNPs are small, <1.7 per variant allele, each SNP is a weak predictor of CLL risk. Studies have shown that when combining representative SNPs into a single summary measure, such as the PRS, one can show varying levels of risk stratification for CLL and observe a better risk prediction39. There are several approaches to computing the PRS40. One approach is to compute the sum of the number of risk alleles across the known susceptibility SNPs for each person. This approach assumes each SNP has equal weight on the PRS. The more common approach is to take a weighted average of the number of risk alleles across the susceptibility SNPs, where the weights are the corresponding effect sizes estimated from GWAS; these weights account for the impact of each SNP on the disease risk. In CLL, the weights were obtained from the largest GWAS of CLL to date21.

In Figure 3a, we show the previously published distributions of the weighted PRS across controls, individuals with MBL, and CLL15,38. There are several key points that one can observe from Figure 3a. First, the distributions of the PRS among three large control series are consistent, with median values ranging between 7.46 and 7.59. In contrast, the PRS distribution among controls from the Genetic Epidemiology of CLL (GEC) Consortium is significantly higher than the three control series, with a median PRS=8.09 (P<0.0001, Figure 3a), due to the fact that these GEC controls come from families with two or more members with CLL that have an enrichment of inherited CLL variants. Second, the PRS distribution among two of the three CLL cases series of European ancestry have also consistent distributions (median PRS of 8.24-8.26); yet the PRS distribution from the familial GEC CLL case series is significantly elevated (median PRS=8.51, P<0.001, Figure 3a). Third, the PRS distribution from cases and controls of African ancestry is lower than that of the respective cases and controls of European ancestry. Finally, the PRS distribution from MBL is in between those of controls and CLL cases. It has been previously shown that there is a significant trend of the median PRS as one goes from controls to LC-MBL, HC-MBL, and CLL (P=4.3x10−85)15.

Figure 3:

Figure 3:

A. Distribution of the CLL-PRS by controls and CLL/MBL cohorts from different studies.

Boxplots representing the CLL-PRS distribution based on 41 CLL susceptibility SNPs, across controls (light grey), AA controls (dark grey), individuals with MBL (blue), CLL cases (red), and AA CLL cases (dark red). The horizontal line and score in each box represent the median CLL-PRS for each cohort

B. Forest plot of the association between the CLL-PRS and risk of CLL and MBL across different studies.

*InterLymph study by Kleinstern et al. including 8 studies from InterLymph with a CLL-PRS based on 41 CLL-susceptibility SNPs; **InterLymph study by Berndt et al. (in Press, Leukemia) including 24 studies from InterLymph with a CLL-PRS based on 43 CLL-susceptibility SNPs; PRS, polygenic risk score; CLL, SNPs, single nucleotide polymorphisms; chronic lymphocytic leukemia; MBL, monoclonal B-cell lymphocytosis; LC, low-count; HC, high-count; AA, African ancestry; InterLymph, International Lymphoma of Epidemiology Consortium; GEC, genetic epidemiology of CLL Consortium; FH, family history; OR, odds ratio; CI, confidence interval;

The association between the risk of CLL and the PRS, regardless of the approach used to compute the PRS, has been found to be consistently associated15,28,38,41. One of the first studies computed an unweighted PRS of 30 CLL risk variants that were genome-wide significant at <5x10−8 and reported an association with risk of CLL (Table 2)28. Another study from the International Lymphoma Epidemiology (InterLymph) Consortium comprised of 8 individual studies reported that the weighted PRS of 41 SNPs that were genome-wide significant at <5x10−8 was associated with a 2.49-fold increased risk for CLL (Table 2, Figure 3b)38. The association of the PRS with CLL increased to 3.79 when stratifying the cases and controls by any family history of hematological malignancy (Figure 3b). The InterLymph study was recently updated to include 24 studies and found that the weighted PRS of 43 genome-wide significant SNPs was associated with a 2.17-fold increased risk for CLL (Table 2, Figure 3b)42. Finally, another study compared two methods to compute the PRS with risk of CLL using data from InterLymph41. The first method included 2,025 susceptibility SNPs at an optimized p-value threshold and the other method included 20 CLL susceptibility SNPs that were genome-wide significant at <5x10−8; both approaches weighted the SNPs by the corresponding effect size. The two approaches did not reveal significant differences in terms of relative risk or predictive performance according to the c-statistic38 given the current sample size of cases and controls found in GWASs. However, as the sample sizes increases in GWAS of CLL, the optimized p-value threshold approach may provide increased predictability over a PRS that uses just the CLL susceptibility SNPs that are genome-wide significant41. Because the CLL cases and controls from these PRS studies were used to identify the CLL susceptibility SNPs through GWAS, the PRS association may be inflated. Independent sample of cases and controls are needed.

Table 2:

Cohort and model description of the polygenic risk score studies

Study Cohort description Independent
cohorts
Analysis Race PRS c-stats
(95% CI)
Number of
SNPs in
PRS
Adjustments/
comments
Speedy et al.47 2014 CLL GWAS study Participants were used to discover the CLL SNPs CLL cases vs controls European 0.74 30 Unweighted PRS calculation; no adjustments
Kleinstern et al36 2018 CLL GWAS study from InterLymph Consortium of 8 studies that had both GWAS and exposure data Participants were used to discover the CLL SNPs InterLymph CLL vs InterLymph controls European 0.79 (0.78-0.80) 41 Age, sex, ses, study
  FH+ CLL vs FH+ controls European 0.86 (0.82-0.90) 41 Age, sex, ses, study
  FH CLL vs FH controls European 0.79 (0.77-0.81) 41 Age, sex, ses, study
GEC consortium study of CLL, MBL, and controls from CLL families and an additional set of Mayo controls agnostic to family history. Independent cohorts GEC CLL vs GEC controls European 0.80 (0.74-0.85) 41 Age, sex
GEC CLL vs Mayo controls European 0.78 (0.74-0.81) 41 Age, sex
GEC MBL vs GEC controls European 0.77 (0.70-0.85) 41 Age, sex
GEC MBL vs Mayo controls European 0.77 (0.73-0.82) 41 Age, sex
Zhang et al38 2020 InterLymph Consortium of 24 studies of CLL cases (N=3100) and controls (N=7667) Participants were used to discover the CLL SNPs InterLymph CLL vs InterLymph controls European 0.82 (se=0.03) 2,025 The c-stat represents the maximum c-stat theoretically achievable.
Kleinstern et al29 2022 Mayo Clinic Study of CLL cases and controls Independent cohorts CLL cases vs Mayo Clinic Biobank controls European 0.77 (0.75-0.79) 41 Age, sex
AA CLL cases and AA controls AA CLL cases vs AA controls African 0.62 (0.57-0.68) 41 Age, sex
MBLs screened from the Mayo Clinic Biobank, clinical MBL from the Mayo Clinic CLL Resource, and Mayo Clinic Biobank controls Mayo MBL vs Mayo Clinic Biobank controls European 0.72 (0.69-0.73) 41 Age, sex
  Mayo LC-MBL vs Mayo Clinic Biobank control European 0.72 (0.70-0.75) 41 Age, sex
  Mayo HC-MBL vs Mayo Clinic Biobank controls European 0.73 (0.69-0.77) 41 Age, sex
Berndt et al In Press Leukemia InterLymph Consortium of CLL cases and controls from 24 studies Participants were used to discover the CLL SNPs InterLymph CLL vs InterLymph controls European - 43 Age, sex, and principal components

CLL, chronic lymphocytic leukemia; MBL, monoclonal B-cell lymphocytosis; InterLymph, International Lymphoma of Epidemiology Consortium; AA, African ancestry; GEC, Genetic Epidemiology of CLL Consortium; FH, family history; PRS, polygenic risk score; SNPs, single nucleotide polymorphisms; CI, confidence interval; GWAS, genome wide association study

The first study to evaluate the weighted PRS of 41 SNPs in an independent sample of CLL cases and controls who were not used in any GWAS was from the GEC Consortium of families with more than two members with CLL38. The authors reported that the PRS was associated with 2.44-fold increased risk for CLL (Figure 3b). The risk of CLL increased to 3.51 among those who were in the top 20% of the PRS distribution and decreased to 0.2 among those who were in the bottom 20% of the PRS distribution relative to the middle quintile. The association of the weighted PRS of 41 SNPs was replicated in another independent cohort of 696 CLLs and 2,631 controls known not to have MBL from the Mayo Clinic with a 2.53-fold increased risk for CLL (Figure 3b)29. The risk of CLL increased to 3.49 among those who were in the top 20% of the PRS distribution and decreased to 0.31 among those who were in the bottom 20% of the PRS distribution relative to the middle quintile. Together, these studies demonstrate that the PRS is consistently significantly associated with risk of CLL, independent of age and sex (two other important risk factors for CLL), among individuals of European ancestry. Of note, those in the lower 20% of the PRS distribution have a protective effect against risk of CLL but that is only 5% of the CLL cases compared to 20% of the controls15.

Polygenic Risk Score and Risk of CLL Among Individuals of African Ancestry

Because the SNPs used to comprise the CLL-PRS were identified from GWASs of European ancestry, it is important to evaluate the generalizability of the PRS across other racial and ethnic populations. Only one study to date has evaluated the PRS in another racial or ethnic cohort; this study evaluated the weighted PRS of 41 susceptibility SNPs in a in a cohort of 173 CLL cases and 235 controls of African ancestry (Figure 3a)15. The authors found that the PRS had a 1.76-fold risk for CLL (Figure 3b), significantly lower effect size than that among individuals of European ancestry. This study also used an unweighted PRS to evaluate the risk of CLL in African ancestry, and still observed an attenuated effect size of 1.07 (CI:1.01-1.13, P=0.03). Clearly a PRS comprised of SNPs identified among European ancestry has reduced generalizability among individuals of African ancestry. This is not surprising per se given the allele frequencies and effect sizes of the CLL susceptibility SNPs among individuals of African ancestry were in generally lower than that of Europeans (Figures 1-2). Future GWAS are needed in other racial and ethnic populations to identify novel SNPs unique to the population and improve the risk prediction of the PRS.

Polygenic Risk Score and risk of MBL Among Individuals of European Ancestry

Two studies evaluated the association of the PRS with risk of MBL15,38. The first is in the GEC consortium consisting of a small cohort of 95 GEC MBLs (of whom 93% were LC-MBL) and 58 GEC controls38. The authors reported a 2.3-fold increased risk for MBL (95% CI: 1.44-3.67) for the weighted PRS of 41 SNPs (Figure 3b). Another study evaluated 560 MBLs and 2,631 controls known not to have MBL from the Mayo Clinic Biobank and found that the weighted PRS using 41 SNPs was associated with a 1.86-fold increased risk for MBL (Figure 3b)15. In subtype analyses by the size of the MBL clone, the association of the PRS with risk of LC-MBL and HC-MBL was 1.75-fold and 2.14-fold increased risk, respectively, with a significant positive trend (P<0.0001) between the PRS effect sizes and risk of LC-MBL, HC-MBL, and CLL, meaning that the association increased as the clonal size increased15. No studies to date have evaluated the PRS by the subtypes of MBL based on immunophenotype. Although the genetic studies of risk of MBL listed in Table 1 did not explicitly state the MBL immunophenotype, presumably most were CLL-like MBL since that immunophenotype is the most common among individuals of European ancestry 3,8-10.

Clinical utility of PRS for Risk Prediction of CLL and MBL

Given that PRS has been shown to be statistically significant with both risk of CLL and MBL, and given that the PRS has been validated in two independent studies of CLL and MBL, a question is whether the PRS may be utilized clinically for risk prediction. The studies of the PRS based on 41 SNPs reported good discriminative power for risk of CLL, with the c-statistic ranging between 0.77-0.8 among individuals of European ancestry (Table 2). Similarly, a good discriminative power was observed for risk of MBL (c-statistic ranged between 0.72-0.77, Table 2). Further validation of the PRS would be ideal using a cohort study design. Thus, there is evidence that PRS is a strong predictor for both risk of MBL and CLL. This finding, however, did not generalize to individuals of African ancestry, in which the PRS of 41 SNPs had poor discrimination (c-statistic = 0.62, Table 2). Although PRS is a strong biomarker among individuals of European ancestry, the clinical utility is limited at this time since there are no currently existing preventive strategies of risk of CLL and MBL, and no evidence of an improved overall survival among CLL patients treated at an early stage of CLL43. Furthermore, a prior study reported that indolent CLL patients in a watch-and-wait strategy for treatment experienced distress and worse quality-of-life compared to lymphoma patients who received treatment at time of diagnosis44. Thus the clinical utility of the PRS is on hold until further preventive and therapeutic strategies are developed.

CONCLUSIONS

CLL has a strong genetic component with a high heritability. Numerous studies have identified and robustly validated 43 common variants associated with risk of CLL among individuals of European ancestry, and emerging studies are finding that ~50% of these CLL susceptibility SNPs associated with risk of MBL. The PRS calculated from these known CLL-susceptibility SNPs has been shown to be a robust predictor for both risk of CLL and MBL among individuals of European ancestry but a weak predictor of risk among African ancestry individuals. Clinical utility of the PRS is limited at this time until preventive strategies for CLL are developed.

Future studies are needed in four key areas. First, there is a clear need for GWAS of CLL in other race and ethnicities to identify SNPs which may be unique to each population. This will allow further understanding of the heterogeneity of risk across populations, gain more insight to the pathogenesis of CLL, and improve the development of PRS that are more tailored to each population. The current PRS are disproportionately applicable to cases of European ancestry. Second, there is evidence that genetic studies of MBL may help provide more insight into the biological mechanisms of the SNPs in that some SNPs may be important for developing the B-cell clone while others may be important for clonal expansion. Third, functional studies are needed to validate the in silico putative reported biological mechanisms of the established CLL SNPs. Finally, the clinical utility of the PRS may be limited at this time until future studies develop preventive strategies for CLL.

Acknowledgements

This work was supported by the National Institutes of Health (NIH) grants R01 AG58266, R01 CA235026, R01 CA254951, and R21 CA256648.

References

  • 1.Hallek M, Cheson BD, Catovsky D, et al. iwCLL guidelines for diagnosis, indications for treatment, response assessment, and supportive management of CLL. Blood. 2018;131(25):2745–2760. [DOI] [PubMed] [Google Scholar]
  • 2.Landgren O, Albitar M, Ma W, et al. B-cell clones as early markers for chronic lymphocytic leukemia. N Engl J Med. 2009;360(7):659–667. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Slager SL, Lanasa MC, Marti GE, et al. Natural history of monoclonal B-cell lymphocytosis among relatives in CLL families. Blood. 2021;137(15):2046–2056. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Kolijn PM, Hosnijeh FS, Spath F, et al. High-risk subtypes of chronic lymphocytic leukemia are detectable as early as 16 years prior to diagnosis. Blood. 2022;139(10):1557–1563. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Marti GE, Rawstron AC, Ghia P, et al. Diagnostic criteria for monoclonal B-cell lymphocytosis. Br J Haematol. 2005;130(3):325–332. [DOI] [PubMed] [Google Scholar]
  • 6.Shanafelt TD, Ghia P, Lanasa MC, Landgren O, Rawstron AC. Monoclonal B-cell lymphocytosis (MBL): biology, natural history and clinical management. Leukemia. 2010;24(3):512–520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Rawstron AC, Shanafelt T, Lanasa MC, et al. Different biology and clinical outcome according to the absolute numbers of clonal B-cells in monoclonal B-cell lymphocytosis (MBL). Cytometry B Clin Cytom. 2010;78 Suppl 1:S19–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Dagklis A, Fazi C, Sala C, et al. The immunoglobulin gene repertoire of low-count chronic lymphocytic leukemia (CLL)-like monoclonal B lymphocytosis is different from CLL: diagnostic implications for clinical monitoring. Blood. 2009;114(1):26–32. [DOI] [PubMed] [Google Scholar]
  • 9.Slager SL, Parikh SA, Achenbach SJ, et al. Prevalence and Overall Survival of Low Count Monoclonal B-Cell Lymphocytosis (LC-MBL): A Screening Study of 8,297 Individuals from the Mayo Clinic Biobank. Blood. 2021;138(Supplement 1):2632–2632.34521113 [Google Scholar]
  • 10.Fazi C, Scarfo L, Pecciarini L, et al. General population low-count CLL-like MBL persists over time without clinical progression, although carrying the same cytogenetic abnormalities of CLL. Blood. 2011;118(25):6618–6625. [DOI] [PubMed] [Google Scholar]
  • 11.Goldin LR, Bjorkholm M, Kristinsson SY, Turesson I, Landgren O. Elevated risk of chronic lymphocytic leukemia and other indolent non-Hodgkin’s lymphomas among relatives of patients with chronic lymphocytic leukemia. Haematologica. 2009;94(5):647–653. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Slager SL, Benavente Y, Blair A, et al. Medical history, lifestyle, family history, and occupational risk factors for chronic lymphocytic leukemia/small lymphocytic lymphoma: the InterLymph Non-Hodgkin Lymphoma Subtypes Project. J Natl Cancer Inst Monogr. 2014;2014(48):41–51. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Sud A, Chattopadhyay S, Thomsen H, et al. Analysis of 153 115 patients with hematological malignancies refines the spectrum of familial risk. Blood. 2019;132(12):960–969. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Casabonne D, Almeida J, Nieto WG, et al. Common infectious agents and monoclonal B-cell lymphocytosis: a cross-sectional epidemiological study among healthy adults. PLoS One. 2012;7(12):e52808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Kleinstern G, Weinberg JB, Parikh SA, et al. Polygenic risk score and risk of monoclonal B-cell lymphocytosis in caucasians and risk of chronic lymphocytic leukemia (CLL) in African Americans. Leukemia. 2022;36(1):119–125. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Marti GE, Carter P, Abbasi F, et al. B-cell monoclonal lymphocytosis and B-cell abnormalities in the setting of familial B-cell chronic lymphocytic leukemia. Cytometry B Clin Cytom. 2003;52(1):1–12. [DOI] [PubMed] [Google Scholar]
  • 17.Rawstron AC, Yuille MR, Fuller J, et al. Inherited predisposition to CLL is detectable as subclinical monoclonal B-lymphocyte expansion. Blood. 2002;100(7):2289–2290. [DOI] [PubMed] [Google Scholar]
  • 18.Rawstron AC, Green MJ, Kuzmicki A, et al. Monoclonal B lymphocytes with the characteristics of "indolent" chronic lymphocytic leukemia are present in 3.5% of adults with normal blood counts. Blood. 2002;100(2):635–639. [DOI] [PubMed] [Google Scholar]
  • 19.Ghia P, Prato G, Scielzo C, et al. Monoclonal CD5+ and CD5- B-lymphocyte expansions are frequent in the peripheral blood of the elderly. Blood. 2004;103(6):2337–2342. [DOI] [PubMed] [Google Scholar]
  • 20.Mucci LA, Hjelmborg JB, Harris JR, et al. Familial Risk and Heritability of Cancer Among Twins in Nordic Countries. Jama-Journal of the American Medical Association. 2016;315(1):68–76. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Law PJ, Berndt SI, Speedy HE, et al. Genome-wide association analysis implicates dysregulation of immunity genes in chronic lymphocytic leukaemia. Nat Commun. 2017;8:14175. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Di Bernardo MC, Crowther-Swanepoel D, Broderick P, et al. A genome-wide association study identifies six susceptibility loci for chronic lymphocytic leukemia. Nature Genetics. 2008;40(10):1204–1210. [DOI] [PubMed] [Google Scholar]
  • 23.Crowther-Swanepoel D, Broderick P, Di Bernardo MC, et al. Common variants at 2q37.3, 8q24.21, 15q21.3 and 16q24.1 influence chronic lymphocytic leukemia risk. Nature Genetics. 2010;42(2):132–136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Slager SL, Rabe KG, Achenbach SJ, et al. Genome-wide association study identifies a novel susceptibility locus at 6p21.3 among familial CLL. Blood. 2011;117(6):1911–1916. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Slager SL, Skibola CF, Di Bernardo MC, et al. Common variation at 6p21.31 (BAK1) influences the risk of chronic lymphocytic leukemia. Blood. 2012;120(4):843–846. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Crowther-Swanepoel D, Di Bernardo MC, Jamroziak K, et al. Common genetic variation at 15q25.2 impacts on chronic lymphocytic leukaemia risk. British journal of haematology. 2011;154(2):229–233. [DOI] [PubMed] [Google Scholar]
  • 27.Berndt SI, Skibola CF, Joseph V, et al. Genome-wide association study identifies multiple risk loci for chronic lymphocytic leukemia. Nature Genetics. 2013;45(8):868–876. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Speedy HE, Di Bernardo MC, Sava GP, et al. A genome-wide association study identifies multiple susceptibility loci for chronic lymphocytic leukemia. Nature Genetics. 2014;46(1):56–60. [DOI] [PubMed] [Google Scholar]
  • 29.Berndt SI, Camp NJ, Skibola CF, et al. Meta-analysis of genome-wide association studies discovers multiple loci for chronic lymphocytic leukemia. Nat Commun. 2016;7:10933. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Kandaswamy R, Sava GP, Speedy HE, et al. Genetic Predisposition to Chronic Lymphocytic Leukemia Is Mediated by a BMF Super-Enhancer Polymorphism. Cell Rep. 2016;16(8):2061–2067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Speedy HE, Beekman R, Chapaprieta V, et al. Insight into genetic predisposition to chronic lymphocytic leukemia from integrative epigenomics. Nat Commun. 2019;10(1):3615. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Yan H, Tian S, Kleinstern G, et al. Chronic lymphocytic leukemia (CLL) risk is mediated by multiple enhancer variants within CLL risk loci. Hum Mol Genet. 2020;29(16):2761–2774. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Lan Q, Au WY, Chanock S, et al. Genetic susceptibility for chronic lymphocytic leukemia among Chinese in Hong Kong. Eur J Haematol. 2010;85(6):492–495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Coombs CC, Rassenti LZ, Falchi L, et al. Single nucleotide polymorphisms and inherited risk of chronic lymphocytic leukemia among African Americans. Blood. 2012;120(8):1687–1690. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Rawstron AC, Bennett FL, O’Connor SJ, et al. Monoclonal B-cell lymphocytosis and chronic lymphocytic leukemia. N Engl J Med. 2008;359(6):575–583. [DOI] [PubMed] [Google Scholar]
  • 36.Parikh SA, Chaffee KG, Larson MC, et al. Outcomes of a large cohort of individuals with clinically ascertained high-count monoclonal B-cell lymphocytosis. Haematologica. 2018;103(6):e237–e240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Crowther-Swanepoel D, Corre T, Lloyd A, et al. Inherited genetic susceptibility to monoclonal B-cell lymphocytosis. Blood. 2010;116(26):5957–5960. [DOI] [PubMed] [Google Scholar]
  • 38.Kleinstern G, Camp NJ, Goldin LR, et al. Association of polygenic risk score with the risk of chronic lymphocytic leukemia and monoclonal B-cell lymphocytosis. Blood. 2018;131(23):2541–2551. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Torkamani A, Wineinger NE, Topol EJ. The personal and clinical utility of polygenic risk scores. Nature Reviews Genetics. 2018;19(9):581–590. [DOI] [PubMed] [Google Scholar]
  • 40.Chatterjee N, Shi J, García-Closas M. Developing and evaluating polygenic risk prediction models for stratified disease prevention. Nature Reviews Genetics. 2016;17(7):392–406. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Zhang YD, Hurson AN, Zhang H, et al. Assessment of polygenic architecture and risk prediction based on common variants across fourteen cancers. Nat Commun. 2020;11(1):3353. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Berndt SI, Vijai J, Benavente Y, et al. Distinct germline genetic susceptibility profiles identified for common non-Hodgkin lymphoma subtypes. Leukemia. 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Muchtar E, Kay NE, Parikh SA. Early intervention in asymptomatic chronic lymphocytic leukemia. Clin Adv Hematol Oncol. 2021;19(2):92–103. [PubMed] [Google Scholar]
  • 44.Trevino KM, Martin P, Chen Z, Leonard JP. Worsening Quality of Life in Indolent Non-Hodgkin Lymphoma and Chronic Lymphocytic Leukemia Patients in Active Surveillance: A 12-Month Longitudinal Study. Clin Lymphoma Myeloma Leuk. 2022;22(2):82–88. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES