Skip to main content
The Breast : Official Journal of the European Society of Mastology logoLink to The Breast : Official Journal of the European Society of Mastology
. 2023 Jan 10;67:71–77. doi: 10.1016/j.breast.2023.01.003

Polygenic risk scores and breast cancer risk prediction

Eleanor Roberts a, Sacha Howell a,d,e, D Gareth Evans b,c,d,e,
PMCID: PMC9982311  PMID: 36646003

Abstract

Polygenic Risk Scores (PRS) are a major component of accurate breast cancer risk prediction and have the potential to improve screening and prevention strategies. PRS combine the risk from Single nucleotide polymorphisms (SNPs) associated with breast cancer in Genome Wide Association Studies (GWAS) and explain over 30% of breast cancer heritability. When incorporated into risk models, the more personalised risk assessment derived from PRS, help identify women at higher risk of breast cancer development and enables the implementation of stratified screening and prevention approaches. This review describes the role of PRS in breast cancer risk prediction including the development of PRS and their clinical application. We have also examined the role of PRS within more well-established risk prediction models which incorporate known classic risk factors and discuss the interaction of PRS with these factors and their capacity to predict breast cancer subtypes. Before PRS can be implemented on a population-wide scale, there are several challenges that must be addressed. Perhaps the most pressing of these is the use of PRS in women of non-White European origin, where PRS have been shown to have attenuated risk prediction both in discrimination and calibration. We discuss progress in developing and applying PRS in non-white European populations. PRS represent a significant advance in breast cancer risk prediction and their further development will undoubtedly enhance personalisation.

keywords: Polygenic, Risk, Scores, Breast, Cancer, Prediction, screening, Prevention

1. Introduction: prognosis and diagnosis

Breast cancer is the world's most prevalent cancer with 7.8 million women alive who have been diagnosed with breast cancer between 2015 and 2020. In 2020, there were 2.3 million women diagnosed globally with 685,000 of these women dying from the disease [1]. Breast cancer occurs in every country of the world and can affect any woman who is post-pubertal with prevalence increasing with age. The earlier breast cancer is detected, the greater the chances of long-term survival.

Women with breast cancer have much higher chances of survival the earlier the cancer is diagnosed (Fig. 1) and early detection is the key to improving mortality figures. Most developed countries have, therefore, adopted population based mammographic screening programmes in an attempt to reduce tumour stage and improve survival. Although there are small numbers of women, particularly those with a family history and carriers of pathogenic variants in high-risk genes such as BRCA1, BRCA2 and TP53, who benefit from high-risk screening (often including annual MRI), most of the population receives screening based upon age alone. The use of risk models including standard risk factors, such as family history and hormonal/reproductive factors, integrated with mammographic density and DNA testing now offers the opportunity to more accurately discriminate risk to introduce true population-based risk stratification [3].

Fig. 1.

Fig. 1

5-Year net survival (%) of women diagnosed with BC by the stage at which the BC was diagnosed. All data: adults diagnosed 2013–2017, followed up to 2018. (CRUK 2018.) [2].

1.1. Genetic predisposition

Breast cancer is a heritable disease with epidemiological studies indicating around 4–5% is due to highly penetrant autosomal dominant predisposition [4]. Twin studies suggest around 27% is likely to have a strong heritable component [5]. There are many high and moderate risk genes that have been associated with BC development and these are detailed in Table 1.

Table 1.

High and moderate penetrance BC risk genes and the lifetime BC risk that they confer. Adapted from [[6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17], [18], [19], [20]].

Penetrance Gene Lifetime BC Risk (%) Rate of variant within outbred populations (%)
High BRCA1 55–85 0.12–0.20
High BRCA2 45–69 0.20–0.50
High TP53 56–90 0.02
High PTEN 60 <0.01
High STK11 32–54 <0.01
High CDH1 60 <0.01
Moderate ATM 25 0.2
Moderate CHEK2 40 0.3–0.5
Moderate PALB2 25–40 0.1
Moderate BARD1 20 0.1–0.2
Moderate RAD51C 20 0.1
Moderate RAD51D 20 0.1

Since the discovery of well-validated high and moderate risk genes, recent research has focused on more common, lower penetrance gene variants that, in combination, increase breast cancer risk significantly. Single Nucleotide Polymorphisms (SNPs) are the most common type of variant within a DNA sequence, with over 10 million SNPs estimated to occur within the human genome [21]. SNPs often explain normal variation between individuals and frequently have minimal known functional impact [22] However, when a SNP occurs within a gene or within the regulatory region near a gene, the gene's function can be affected, and this can contribute to disease development. Many of these SNPs have been singularly associated with a very small increase in BC risk. These BC-associated SNPs occur much more frequently in the population than the pathogenic variants found in high and moderate risk genes, but an individual SNP confers a much lower risk of BC development (Fig. 2). However, with over 300 such SNPs identified to date and the risks from each acting multiplicatively, ∼30% of familial heritability can be attributed to the known SNPs [23,24].

Fig. 2.

Fig. 2

The role of allele variants and the size of the effect they can have on disease penetrance. Generally, rare pathogenic variants have a large impact on disease development and common variants have a smaller impact on disease development [25].

1.2. GWAS – Genome Wide Association Studies

Genome Wide Association Studies (GWAS) have genotyped between hundreds of thousands to millions of SNPs with the purpose of discovering variants that occur at a significantly higher frequency (usually with a P-value of <5 × 10−7) in disease-affected vs unaffected (control) individuals [26]. Using the frequencies of these variants, an odds ratio (OR) can be calculated which indicates the probability of an outcome based on an exposure; in this case, the development of breast cancer based on specific genetic variants [27]. GWAS have consistently linked certain SNPs within the genome to breast cancer development, having identified over 170 genomic regions that are associated with breast cancer risk and pinpointing over 190 likely target genes, with new studies and increasing sample sizes continually identifying new target genes and genomic loci associated with breast cancer risk [28,29]. Within these genomic regions, GWAS have identified over 300 SNPs associated with BC development. In addition to finding loci associated with breast cancer, these studies have also extended our understanding of the heritability of breast cancer with SNPs located in regulatory zones, many strongly enriched for transcription factor binding sites [30].

There is a need to expand GWAS studies to include more participants with substantial focus on detection of SNPs in subtype-specific variants (specifically ER-negative and Triple Negative (TN) tumours which cannot be prevented by endocrine therapies) as well as including individuals of diverse ethnicities beyond those of white European origin.

1.3. Polygenic risk scores

Polygenic Risk Scores (PRS) describe the combined result of numerous risk-inducing susceptibility variants, identified in GWAS [31]. PRS considerably improve risk prediction even though most of the susceptibility variants confer only a small risk individually. However, in conjunction, the multiplicative effect of the variants can dramatically influence risk. Generally, PRS are calculated by multiplying the published per-allele ORs for each risk SNP and these scores are used to improve the stratification of breast cancer risk on a population level [32,33]. It is vital that the mean PRS is calibrated to risk within the specific population being assessed.

The expected allele frequency (EAF) and OR for each SNP is used to calculate the risk score for each genotype using: p2 + 2pq + q2 = 1 where p is the risk allele and q is the non-risk allele.

P2 is the homozygous risk allele frequency, 2pq is the heterozygous risk allele frequency q2 is the homozygous non-risk allele frequency.

The average population risk for the non-risk allele genotype is then calculated using the equation: (p2* published OR2) + (pq*published OR) + (q2*1)

The population adjusted risks for each of the three genotypes is calculated by:

(published OR2/average population risk) for two risk alleles (published OR/average population risk) for one risk allele (1/average population risk) for no risk alleles.

[34,35].

1.4. Discrimination and calibration

For a PRS to have clinical utility it is vital to have accurate discrimination and calibration. Discrimination is defined by correctly classifying whether an individual is either a case (BC affected) or control (BC non-affected). Calibration describes the agreement between observed and estimated incidences in the population [36,37].

Traditionally, many studies have used the discriminatory ability to assess the validity of a PRS. The area under the curve (AUC), defined as the probability that a randomly selected individual with a disease will be assigned a higher risk than a randomly selected individual without the disease, is the most used statistic for assessing discrimination. AUCs of 50% and 100% indicate that a model has no or perfect discrimination, respectively. Typically, PRS alone have modest discrimination at ∼0.60 however when considering PRS use in breast cancer, modest discrimination still has the potential to identify a large portion of the population at higher risk of disease development [38].

Calibration is typically assessed by calculating an observed and estimated odds ratio so that an O/E ratio of 1.0 demonstrates perfect calibration [39]. It is well known that calibration is often a problem when implementing PRS for breast cancer in populations outside of that which the PRS was designed in. Whilst the PRS often discriminates well between cases and controls, there is often a significant over or under prediction that needs to be corrected to accurately report an individual's risk [40,41].

1.5. Developing breast cancer PRS

Discovery of novel SNPs associated with breast cancer development in the larger GWAS, have resulted in expansion of the number of these SNPs incorporated in PRS to improve risk stratification. The history of the development of breast cancer PRS has been summarised in Table 2.

Table 2.

Discrimination statistics for breast cancer PRS with increasing number of SNPs in populations of white European origin [3,33,[42], [43], [44], [45], [46], [47], [48], [49], [50], [51], [52], [53], [54]].

PRS AUC Cases/Controls Year Study Reference
SNP18 0.590 364/1605 2016 Evans
SNP22 0.650 1143/892 2012 Sawyer
SNP24 0.590 1496/2869 2017 Li
SNP32 0.580 6009/7827 2012 Hüsing
SNP77 0.620 33,673/33,381 2015 Mavaddat
SNP77 0.610 750/405 2016 Dite
SNP77 0.603 11,428/18,323 2019 Mavaddat
SNP83 0.590 387/387 2016 Shieh
SNP86 N/A 4291/19,968 2020 Hughes
SNP143 0.650 405/1668 2020 Brentnall
SNP313 0.630 11,428/18,323 2019 Mavaddat
SNP313 0.653 667/332 2022 Saloustros
PRS beyond SNPs AUC Cases/Controls Year Study Reference

SNP18 (+TC/MD) 0.670 466/8897 2018 Van Veen
SNP18 (+TC/DR) 0.634 525/1410 2022 Evans
SNP77 + (PD) N/A 3628/5126 2019 Vachon
SNP143 (+TC/DR) 0.677 525/1410 2022 Evans
SNP143* 0.650 405/1668 2020 Brentnall
SNP313 (+TC/DR) 0.665 525/1410 2022 Evans
SNP5218 (LDpredict) 0.690 6586/157,895 2019 Khera

TC, Tyrer-Cuzick (classic risk factors); MD, Mammographic Density; DR, Density Residual (mammographic density); PD, Percent Density; LDpredict, Linkage Disequilibrium (predicted SNPs based on linkage disequilibrium). There are no reported discrimination statistics for SNP86. *AUC reported for PRS only.

Evans et al. conducted a case-control study where SNP18 was used to discover the role of this PRS in predicting risk of women who have no pathogenic variant in either of the BRCA1/2 genes [42]. SNP18 was found to be predictive of BC (Interquartile range (IQR) OR 1.55, 95% CI 1.29 to 1.87, O/E 96%). There is currently an unmet clinical need to identify women who are at increased BC risk but without such high risk gene variants this study demonstrated that a PRS can help to identify these women. The same group also showed in a cohort of women recruited from the UK screening population that SNP18 PRS was higher in case patients (median, 1.12; IQR, 0.87–1.33) than controls (median, 1.01; IQR, 0.77–1.19), with near perfect calibration (unadjusted O/E OR, 1.03; 95% CI, 0.74–1.32) [53].

Beyond SNP18, SNP77 was one of the next well-validated breast cancer PRS’. In a study of ∼33,000 breast cancer cases and ∼33,000 controls of European origin, SNP77 was shown to stratify breast cancer risk in women with family history as well as those without [46]. Lifetime breast cancer risk was stratified as 8.6% and 24.4% in the lowest and highest quintiles respectively, for women with a first-degree relative with a history of breast cancer and 5.2% and 16.6% respectively, for women with no family history. Mavaddat et al. showed that using this PRS, women in the highest 1% were at 3-fold increased risk of breast cancer development when compared with women in the middle quintile. Other studies have confirmed similar ORs with the 77-SNP PRS (OR = 1.52 per SD, 95% CI 1.45–1.59) [54] and suggested that it explains ∼12.6% of the familial relative risk of breast cancer [55].

As more BC associated SNPs were identified a panel of 94 SNPs were examined with the best discriminatory accuracy coming from 86 of these 94 SNPs. These 86 SNPs thus made the SNP86 PRS [52]. This PRS was reported to be highly predictive of overall breast cancer status with P values of P = 6.4 × 10−66 and P < 10−325 in the two validation cohorts used. Odds ratios per SD were also highly consistent between validation sets (OR = 1.45, 95% CI 1.39–1.52 and OR = 1.47, 95% CI 1.45–1.49).

SNP143 is the next largest well-validated PRS for breast cancer risk and was described by Brentnall et al. when they developed this PRS from a panel of 172 SNPs linked to breast cancer by GWAS [49]. This study found that SNP143 was well calibrated with an O/E ratio of 1.10 (95% CI 0.86–1.34). SNP143 has been further validated by Evans et al., 2022 in combination with mammographic density and a gene panel (Table 2) [56].

SNP313, developed from studies conducted by the Breast Cancer Association Consortium, is predictive of breast cancer risk with an OR of 1.61 (95% CI 1.57–1.65) and an AUC of 0.630 (95% CI 0.628–0.651), demonstrating a good discriminatory ability in white European populations [33].

1.6. PRS with other risk factors

In breast cancer risk prediction models, the use of other known risk factors in combination with PRS has been shown to enhance prediction (Table 2). Well-validated PRS are used in joint association with mammographic density among other classic risk factors to improve risk stratification. Van Veen et al. demonstrated that SNP18 combined with risk data from the Tyrer-Cuzick model and mammographic density identified 16% of cases and 9.5% of controls that moved into the increased risk category (>5% 10-year risk) compared with using only the Tyrer-Cuzick model. They also found that 5% of cases and 4% of controls moved out of the high-risk category demonstrating the ability of PRS to not just place someone at higher risk but also to stratify those women who are at medium or low risk [53]. Van Veen et al. also found that the multiplicative effect of incorporating risk factors improved the models’ discriminatory ability with AUC increasing from 0.58 with Tyrer-Cuzick alone, to 0.64 with the addition of mammographic density to a high of 0.67 when the SNP18 PRS was included as well.

It has been shown that total risk stratification is improved for mammographic density when a SNP77 PRS is also taken into consideration, as shown by Vachon et al. [54] This study found that when considering mammographic percent density (PD), OR fell slightly from 1.45 (95% CI 1.38–1.52) to 1.42 (95% CI 1.36–1.50) when PD was adjusted for the SNP77 PRS.

Brentnall et al. showed that SNP143 used in combination with mammographic density and classic risk factors improved risk stratification [49]. Adding mammographic density and the SNP143 PRS to the Tyrer-Cuzick model classic risk factors, increased the percentage of the population in the lowest (<1.4% 10-year risk) and highest (8% + 10-year risk) risk categories. For cases, using Tyrer-Cuzick alone, 1.2% were in the lowest and 4.2% were in the highest risk groups. After including the SNP143 PRS and mammographic density, this increased to 9.4% in the lowest and 14.6% in the highest risk groups respectively. 64.4% of cases were placed in the low-moderate risk group (1.4–3.5% 10-year risk) using Tyrer-Cuzick alone with this decreasing to 33.1% using the combination of all available risk factors. The percentage decrease in the low-moderate risk group demonstrates that individuals are more appropriately stratified and placed in the lowest and highest risk groups when the PRS is used in combination with classic risk factors and mammographic density. For controls, using only the Tyrer-Cuzick model places 1.2% at the lowest risk and 2.2% at the highest risk whereas this increases to 22.3% in the lowest risk group and 7.4% in the highest risk group after inclusion of mammographic density and the SNP143 PRS. Again, this demonstrates the ability of PRS beyond classic risk factors to place an individual at low risk as well as identifying those at higher risk, much improving risk stratification across the whole population.

A comprehensive study by Evans et al., in 2022 showed the incremental effects of adding mammographic density and different PRS’ comprising increasing number of SNPs on risk stratification in a cohort of ∼500 cases and ∼1500 controls [3]. This study used SNP18, SNP143 and SNP313 to assess the role of PRS using increasing numbers of SNPs in the same population and how risk stratification changes with each PRS. The study showed an incremental improvement in stratification with increasing numbers of SNPs with the PRS being a much greater factor in risk discrimination than pathogenic variants in moderate or high-risk genes. The best performing combination gave an AUC of 0.684 (0.652–0.715). Overall, it was possible to identify 20.9% of controls and 42.5% of breast cancer cases at actionable NICE defined moderate-(10year risk 5–7.99%) or high (≥8%) risk categories.

1.7. PRS and moderate/high risk gene pathogenic variants

PRS have been shown to stratify risk beyond the ability of single high-risk gene pathogenic variants. Kuchenbacker et al. demonstrated large changes in the absolute risk of breast cancer development in a cohort of women (7797 BRCA1 cases; 4330 BRCA2 cases) from the Consortium of Investigators of Modifiers of BRCA1/2 (CIMBA) with the use of a PRS [57]. One study found that SNP86 modifies the risk of women who are carriers of high and moderate penetrance gene variants for breast cancer. Gallagher et al. found that whilst risk stratification was better for non-carriers and women with a CHEK2 pathogenic variant (OR = 1.47, 95% CI 1.45–1.49; OR = 1.49, 95% CI 1.36–1.64, respectively), SNP86 was still able to modify risk prediction in carriers of ATM, PALB2, BRCA1 and BRCA2 with odds ratios ranging from 1.20 for BRCA1 to 1.37 in ATM [58].

Another study examining 26,798 cases and 26,127 controls from the Cancer Risk Estimates Related to Susceptibility consortium evaluated each participant for pathogenic variants in various high and moderate risk genes including BRCA1, BRCA2, ATM, CHEK2, PALB2, BARD1, BRIP1, CDH1 and NF2. [59] In this study, Gao et al. found that PRS’ may be able to reclassify more than 30% of CHEK2 and around 50% of ATM pathogenic variant carriers as having a lower-than-expected lifetime breast cancer risk (<20%). On a clinical level, this may provide significant reassurance and help to plan the most appropriate screening and preventive measures.

1.8. PRS and age

Most genetic studies linking risk prediction to breast cancer focus on generalised breast cancer risk with no specified age group. A large portion of breast cancer occurs in women aged >50 years, however, women affected by breast cancer at younger ages stand to have a greater number of life-years saved with successful early detection and treatment than their older counterparts. One study estimated that there is a 77% 10-year survival if diagnosed aged <40 years vs an 87% 10-year survival if diagnosed >50 years) [60]. Young-onset breast cancers often are phenotypically more aggressive and have some differences in aetiological factors to breast cancers seen in older women. In a study of 185 premenopausal women with invasive breast cancer at the European Institute of Oncology, the youngest women, who were less than 35 years old, had a higher proportion of triple negative cancers (P < 0.001), a higher percentage of cancers with lymphovascular invasion (48.6% vs 37.3%, P = 0.006) and a higher proportion of higher-grade tumours (P < 0.0001) compared with the cohort of women between the ages of 35–50 [61].

There is a need to expand GWAS to include more young women and potentially identify SNPs that are linked to an early-onset diagnosis of breast cancer. There is also the need to enrich for ER-negative SNPs in a PRS used in a young population due to the higher proportion of triple negative cancers seen in this cohort.

1.9. PRS and subtype specificity

To improve the ability to construct a subtype-specific PRS, there is a need to carry out GWAS in populations of patients that develop ER-negative and TN tumours. Alternatively, to increase the power for identification of ER-negative variants, association results for BRCA1 pathogenic variant carriers (BRCA1 predisposes predominantly to ER-negative tumours in younger women) within databases such as CIMBA, as well as BCAC ER-negative association results could be combined to perform a meta-analysis which enriches the ability for GWAS to discover ER-negative SNPs. Until then, research has shown some evidence of breast cancer PRS having some subtype specific predictive ability.

The SNP143 PRS has been shown to be a risk factor for both ER+ and ER-negative breast cancers [49]. Using participants from the CIMBA study, Kuchenbaecker et al. found that the PRS for ER-negative breast cancer showed the strongest association with breast cancer risk in BRCA1 carriers (per SD hazard ratio = 1.27, 9%% CI 1.23–1.31, p = 8.2 × 10−53) with reduced but still significant associations with ER-positive and overall breast cancers in both BRCA1 and BRCA2 carriers [57].

The most comprehensive study investigating the use of PRS for prediction of specific breast cancer subtypes was carried out by Mavaddat et al., in 2019 and used both the SNP77 and SNP313 PRS [33]. In a sample of 11,428 cases and 18,323 controls, they found that the SNP77 PRS and SNP313 were both predictive for ER-positive and ER-negative breast cancer in addition to overall BC (Table 3). SNP313 provided better discrimination than SNP77 as expected with higher AUCs for overall BC in both the prospective test set and validation cohort. However, when considering subtype-specific disease, in both the 77-SNP and 313-SNP PRS, the odds ratios for ER-positive breast cancer, and overall breast cancer, were higher than for the ER-negative cancers.

Table 3.

Reproduced from Mavaddat et al., 2019 [33].

Prospective test set
Validation set
OR 95% CI AUC OR 95% CI AUC
SNP77 PRS Overall BC 1.46 1.42–1.49 0.603 1.49 1.44–1.56 0.612
ER-positive 1.52 1.48–1.56 0.615 1.56 1.49–1.63 0.623
ER-negative 1.35 1.27–1.43 0.584 1.40 1.30–1.50 0.596
SNP313 PRS Overall BC 1.61 1.57–1.65 0.630 1.65 1.59–1.72 0.639
ER-positive 1.68 1.63–1.73 0.641 1.74 1.66–1.82 0.651
ER-negative 1.45 1.37–1.53 0.601 1.47 1.37–1.58 0.611

ER-negative risk prediction was still attenuated when compared to ER-positive breast cancer risk, most likely reflecting the fact that ER-negative disease is less common than ER-positive disease with correspondingly fewer cases in GWAS. However, a study by Michailidou et al. estimated the heritability of ER-negative disease is like that of overall breast cancer, indicating that increasing sample sizes should improve the power of ER-negative PRS [30,62].

Mavaddat et al. further assessed the associations between SNP313 PRS and breast cancer risk by first-degree family history of breast cancer. They found a smaller ER-positive PRS OR (0.91, p = 0.004) for women with a family history, indicating that PRS ORs for this subtype are higher in women without a family history (OR = 1.71 (95% CI 1.65–1.78)) than with a family history (OR = 1.55 (95% CI 1.48–1.65)). When they compared ER-negative breast cancers, there was also a higher OR for women without a family history (OR = 1.45 (95% CI 1.36–1.57)) than with a family history (OR = 1.40 (95% CI 1.27–1.55)) however the disparity was not as large as in ER-positive breast cancers [33].

1.10. PRS and ethnicity

A major challenge within the formulation and use of PRS is ensuring that they are equally suitable across patients of all ethnicities. Should PRS not be appropriately recalculated, their use in these populations is limited, further exacerbating existing ethnic disparities within healthcare systems [63]. This is an obstacle for people of ethnic minorities across the world, whether in a higher income country with a more well-established healthcare system in which they are inadequately represented or in a lower income country where healthcare and research are often limited due to financial constraints.

To date most BC variants have been discovered in GWAS’ of white European populations. However, many of these genetic risks are not transferrable to other populations, with some variants causing risk in one population and being protective in another. One study found that out of approximately 100 variants identified to increase risk in European and Asian individuals, 30–40% were protective in a population of African heritage [64]. These studies provide evidence that GWAS studies should be carried out in a population specific manner, particularly in non-European individuals, with all heritages considered as risk stratification is not automatically transferable from one population to another.

Evans et al. carried out a retrospective case-control study where they evaluated the role of the SNP18 and SNP143 PRS’ in populations of ethnic minorities having been historically well-validated in white European populations [56]. They found that both PRS overestimated risk of breast cancer in all ethnicities with the overestimation being highest in women of African heritage. When combining women of all ethnicities (black, Asian, mixed, and Jewish) into one non-white group, SNP143 was estimated to cause a potential mean 40% overestimation in breast cancer risk. For SNP143, they estimated a breast cancer overprediction for controls of 26% in Jewish, 91% in black, 29% in Asian and 15% in mixed ethnicity populations.

Some commercially available PRS include women of Ashkenazi Jewish ancestry in an ethnic group with white European women, although we have shown that risk is overpredicted in this group. In a study examining an Ashkenazi group from Manchester with a larger validation cohort from Israel, the Ashkenazi population from Manchester showed ∼20% overprediction for breast cancer risk when using SNP142. When the per-allele odds ratios were adjusted using published Ashkenazi Jewish effect allele frequencies and published odds ratios, risk prediction was successfully recalibrated with controls’ mean PRS correcting back to unity (unpublished data).

GWAS research in non-white Europeans needs to be improved with one 2019 study demonstrating that >78% of participants in published GWAS are of white European descent with 71.8% of this population being from only three countries, namely the United States, the United Kingdom and Iceland [65,66]. Whilst there has been some improvement since 2009, when an analysis documented that 96% of GWAS participants were of white European descent, there is still a need to widen participation in GWAS. By 2016, the proportion of non-white European individuals with published GWAS data had risen to nearly 20%, however most of this increase is attributable to Asian populations with representation for African, Latin American, Hispanic, and indigenous populations still very low.

When checking the GWAS diversity monitor, launched in Nature Genetics in 2020, with the purpose to track GWAS participants in real time, by August 2022 over 95% of participants in all known GWAS were white European, demonstrating the continuing lack of diversity and unmet clinical need to perform GWAS in non-white European populations. Of the remaining ∼4%, only 0.30% are of African descent, 0.25% Hispanic/Latin American, 0.65% mixed with the extra ∼3% constituting Asian participants [67].

It is vital that an individual's PRS is compared to population specific distribution, often done using Principal Component Analysis, to ensure that the right adjustments can be made depending on which ethnicity the individual has [68], as in most studies, ethnicity is self-declared. With increasing intermixing across populations of all ancestries, one approach may be to develop an assay which uses genetic markers to determine an individual's ethnicity. Integrating ethnicity associated SNPs into breast cancer PRS design may be a potential solution to this developing issue.

1.11. Are PRS suitable for clinical use?

Research studies continue to demonstrate the capabilities of PRS for breast cancer risk prediction, but PRS have not yet been implemented fully in any clinical setting [69,70]. Whilst PRS have mild-moderate discrimination (lower AUCs), this would be expected for a single risk factor. PRS become clinically useful when used as an addition to existing risk models which consider clinical and lifestyle risk factors, and mammographic density as evidenced previously [71].

However, to implement PRS at a clinical setting, there must be an understanding as to how a PRS would affect a patient on an individual level. Should a breast cancer PRS stratify a woman at higher risk of breast cancer development, this could lead them to make informed decisions regarding preventative measures such as taking medication, undergoing bilateral mastectomies and/or lifestyle modifications [72]. Nevertheless, there is a difference between relative risk and translating this into absolute risk that facilitates clearer understanding for an individual. For example, if a woman has a 16% chance of developing breast cancer in her lifetime compared with an 11% population risk, that patient has nearly 50% increased risk of developing breast cancer although in absolute terms this is only a 5% increased risk from population level. It is vital that patients are not needlessly exposed to anxiety regarding their genetic risk nor unnecessary radiation via increased mammographic screening, which is why PRS must be appropriately used by a clinician/healthcare worker and effectively communicated to their patient.

As PRS are relatively novel in breast cancer risk prediction, it is yet to be determined about the best approach to communicate a personalised risk score to a patient and if patients want to know this information compared with standard screening. PROCAS (Predicting the Risk of Cancer at Screening), WISDOM (Women Informed to Screen Depending on Measures of Risk) and MyPeBS (My Personalised Breast Screening) are all large-scale studies that investigate the viability of breast cancer PRS’ and how their clinical implementation could be facilitated [[73], [74], [75]].

2. Summary

Breast cancer PRS’ remain a keystone in personalised risk prediction medicine and have been shown to refine a patient's risk beyond classic risk factors. PRS have the potential to prevent many cancer deaths via early detection and risk-reducing measures. However, further work needs to be done to improve the use of PRS in women of non-white European ancestry as currently these groups are having their risk significantly overpredicted. To implement PRS on a clinical scale, research must devise how to effectively communicate this risk on an individual scale.

Ethical approval

Ethical approval not required.

Funding

DGE and SJH are supported by the Manchester National Institute for Health Research (NIHR) Biomedical Research Centre (IS-BRC-1215-20007). ER is funded by Cancer Research UK ACED International Alliance for Cancer Early Detection C19941/A27859.

Declaration of competing interest

No authors have conflicts of interest to disclose.

References

  • 1.WHO. https://www.who.int/news-room/fact-sheets/detail/breast-cancer Accessed 25/04/22.
  • 2.Cruk. https://www.cancerresearchuk.org/health-professional/cancer-statistics/statistics-by-cancer-type/breast-cancer/survival#heading-Three Accessed 10/10/22.
  • 3.Evans D.G.R., et al. Breast cancer risk stratification in women of screening age: incremental effects of adding mammographic density, polygenic risk, and a gene panel. Genet Med. 2022;24(7):1485–1494. doi: 10.1016/j.gim.2022.03.009. [DOI] [PubMed] [Google Scholar]
  • 4.Claus E.B., Risch N., Thompson W.D. Autosomal dominant inheritance of early-onset breast cancer. Implications for risk prediction. Cancer. 1994;73(3):643–651. doi: 10.1002/1097-0142(19940201)73:3<643::aid-cncr2820730323>3.0.co;2-5. [DOI] [PubMed] [Google Scholar]
  • 5.Peto J., Mack T.M. High constant incidence in twins and other relatives of women with breast cancer. Nat Genet. 2000;26(4):411–414. doi: 10.1038/82533. [DOI] [PubMed] [Google Scholar]
  • 6.Apostolou P., Fostira F. Hereditary breast cancer: the era of new susceptibility genes. BioMed Res Int. 2013;2013 doi: 10.1155/2013/747318. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.National cancer Institute. https://www.cancer.gov/about-cancer/causes-prevention/genetics/brca-fact-sheet Accessed 29/04/22.
  • 8.Foulkes W.D., et al. Estrogen receptor status in BRCA1- and BRCA2-related breast cancer: the influence of age, grade, and histological type. Clin Cancer Res. 2004;10(6):2029–2034. doi: 10.1158/1078-0432.ccr-03-1061. [DOI] [PubMed] [Google Scholar]
  • 9.Abubakar M., et al. Clinicopathological and epidemiological significance of breast cancer subtype reclassification based on p53 immunohistochemical expression. NPJ Breast Cancer. 2019;5:20. doi: 10.1038/s41523-019-0117-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Li S., et al. Loss of PTEN expression in breast cancer: association with clinicopathological characteristics and prognosis. Oncotarget. 2017;8(19):32043–32054. doi: 10.18632/oncotarget.16761. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Lipsa A., Kowtal P., Sarin R. Novel germline STK11 variants and breast cancer phenotype identified in an Indian cohort of Peutz-Jeghers syndrome. Hum Mol Genet. 2019;28(11):1885–1893. doi: 10.1093/hmg/ddz027. [DOI] [PubMed] [Google Scholar]
  • 12.Horne H.N., et al. E-cadherin breast tumor expression, risk factors and survival: pooled analysis of 5,933 cases from 12 studies in the Breast Cancer Association Consortium. Sci Rep. 2018;8(1):6574. doi: 10.1038/s41598-018-23733-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Konde A.S., et al. Heterozygous germline ATM mutations in breast cancer: a single academic center experience. J Clin Oncol. 2020;38(15_suppl) 1537-1537. [Google Scholar]
  • 14.Cybulski C., et al. Estrogen receptor status in CHEK2-positive breast cancers: implications for chemoprevention. Clin Genet. 2009;75(1):72–78. doi: 10.1111/j.1399-0004.2008.01111.x. [DOI] [PubMed] [Google Scholar]
  • 15.Antoniou A.C., et al. Breast-cancer risk in families with mutations in PALB2. N Engl J Med. 2014;371(6):497–506. doi: 10.1056/NEJMoa1400382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Yang X., et al. Ovarian and breast cancer risks associated with pathogenic Variants in RAD51C and RAD51D. JNCI. J Natl Cancer Inst. 2020;112(12):1242–1250. doi: 10.1093/jnci/djaa030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Suszynska M., et al. BARD1 is a low/moderate breast cancer risk gene: evidence based on an association study of the central European p.Q564X recurrent mutation. Cancers. 2019;11(6) doi: 10.3390/cancers11060740. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Śniadecki M., et al. BARD1 and breast cancer: the possibility of creating screening tests and new preventive and therapeutic pathways for predisposed women. Genes. 2020;11(11) doi: 10.3390/genes11111251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Breast Cancer Association C., et al. Breast cancer risk genes - association analysis in more than 113,000 women. N Engl J Med. 2021;384(5):428–439. doi: 10.1056/NEJMoa1913948. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Woodward Emma R., van Veen Elke M., Evans D.G. FromBRCA1to polygenic risk scores: mutation-associated Risks in breast cancer-related genes. Breast Care. 2021;16(3):202–213. doi: 10.1159/000515319. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Norrgard K. Genetic variation and disease: GWAS. Nature Education. 2008;1(1):87. [Google Scholar]
  • 22.Genomes Project C., et al. A map of human genome variation from population-scale sequencing. Nature. 2010;467(7319):1061–1073. doi: 10.1038/nature09534. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Skol A.D., Sasaki M.M., Onel K. The genetics of breast cancer risk in the post-genome era: thoughts on study design to move past BRCA and towards clinical relevance. Breast Cancer Res. 2016;18(1):99. doi: 10.1186/s13058-016-0759-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Collins A., Politopoulos I. The genetics of breast cancer: risk factors for disease. Appl Clin Genet. 2011;4:11–19. doi: 10.2147/TACG.S13139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Manolio T.A., et al. Finding the missing heritability of complex diseases. Nature. 2009;461(7265):747–753. doi: 10.1038/nature08494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Visscher P.M., et al. Five years of GWAS discovery. Am J Hum Genet. 2012;90(1):7–24. doi: 10.1016/j.ajhg.2011.11.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Waltoft B.L., et al. The importance of distinguishing between the odds ratio and the incidence rate ratio in GWAS. BMC Med Genet. 2015;16:71. doi: 10.1186/s12881-015-0210-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Zhang H., et al. Genome-wide association study identifies 32 novel breast cancer susceptibility loci from overall and subtype-specific analyses. Nat Genet. 2020;52(6):572–581. doi: 10.1038/s41588-020-0609-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Ferreira M.A., et al. Genome-wide association and transcriptome studies identify target genes and risk loci for breast cancer. Nat Commun. 2019;10(1):1741. doi: 10.1038/s41467-018-08053-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Michailidou K., et al. Association analysis identifies 65 new breast cancer risk loci. Nature. 2017;551(7678):92–94. doi: 10.1038/nature24284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Pashayan N., et al. Polygenic susceptibility to prostate and breast cancer: implications for personalised screening. Br J Cancer. 2011;104(10):1656–1663. doi: 10.1038/bjc.2011.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Evans D.G.R., et al. Breast cancer pathology and stage are better predicted by risk stratification models that include mammographic density and common genetic variants. Breast Cancer Res Treat. 2019;176(1):141–148. doi: 10.1007/s10549-019-05210-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Mavaddat N., et al. Polygenic risk scores for prediction of breast cancer and breast cancer subtypes. Am J Hum Genet. 2019;104(1):21–34. doi: 10.1016/j.ajhg.2018.11.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Evans D.G., et al. Assessing individual breast cancer risk within the U.K. National Health Service Breast Screening Program: a new paradigm for cancer prevention. Cancer Prev Res. 2012;5(7):943–951. doi: 10.1158/1940-6207.CAPR-11-0458. [DOI] [PubMed] [Google Scholar]
  • 35.Lewis C.M. Genetic association studies: design, analysis and interpretation. Briefings Bioinf. 2002;3(2):146–153. doi: 10.1093/bib/3.2.146. [DOI] [PubMed] [Google Scholar]
  • 36.Olsen M., et al. Evaluating the prognostic performance of a polygenic risk score for breast cancer risk stratification. BMC Cancer. 2021;21(1):1351. doi: 10.1186/s12885-021-08937-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Wei J., et al. Calibration of polygenic risk scores is required prior to clinical implementation: results of three common cancers in UKB. J Med Genet. 2022;59(3):243–247. doi: 10.1136/jmedgenet-2020-107286. [DOI] [PubMed] [Google Scholar]
  • 38.Chatterjee N., Shi J., Garcia-Closas M. Developing and evaluating polygenic risk prediction models for stratified disease prevention. Nat Rev Genet. 2016;17(7):392–406. doi: 10.1038/nrg.2016.27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Elliott J., et al. Predictive accuracy of a polygenic risk score-enhanced prediction model vs a clinical risk score for coronary artery disease. JAMA. 2020;323(7):636–645. doi: 10.1001/jama.2019.22241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Martin A.R., et al. Predicting polygenic risk of psychiatric disorders. Biol Psychiatr. 2019;86(2):97–109. doi: 10.1016/j.biopsych.2018.12.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Martin A.R., et al. Human demographic history impacts genetic risk prediction across diverse populations. Am J Hum Genet. 2017;100(4):635–649. doi: 10.1016/j.ajhg.2017.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Evans D.G., et al. The impact of a panel of 18 SNPs on breast cancer risk in women attending a UK familial screening clinic: a case-control study. J Med Genet. 2017;54(2):111–113. doi: 10.1136/jmedgenet-2016-104125. [DOI] [PubMed] [Google Scholar]
  • 43.Sawyer S., et al. A role for common genomic variants in the assessment of familial breast cancer. J Clin Oncol. 2012;30(35):4330–4336. doi: 10.1200/JCO.2012.41.7469. [DOI] [PubMed] [Google Scholar]
  • 44.Li H., et al. Breast cancer risk prediction using a polygenic risk score in the familial setting: a prospective study from the Breast Cancer Family Registry and kConFab. Genet Med. 2017;19(1):30–35. doi: 10.1038/gim.2016.43. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Husing A., et al. Prediction of breast cancer risk by genetic risk factors, overall and by hormone receptor status. J Med Genet. 2012;49(9):601–608. doi: 10.1136/jmedgenet-2011-100716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Mavaddat N., et al. Prediction of breast cancer risk based on profiling with common genetic variants. J Natl Cancer Inst. 2015;107(5) doi: 10.1093/jnci/djv036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Dite G.S., et al. Breast cancer risk prediction using clinical models and 77 independent risk-associated SNPs for women aged under 50 Years: Australian breast cancer family registry. Cancer Epidemiol Biomarkers Prev. 2016;25(2):359–365. doi: 10.1158/1055-9965.EPI-15-0838. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Shieh Y., et al. Breast cancer risk prediction using a clinical risk model and polygenic risk score. Breast Cancer Res Treat. 2016;159(3):513–525. doi: 10.1007/s10549-016-3953-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Brentnall A.R., et al. A case-control evaluation of 143 single nucleotide polymorphisms for breast cancer risk stratification with classical factors and mammographic density. Int J Cancer. 2020;146(8):2122–2129. doi: 10.1002/ijc.32541. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Saloustros E. Associations between the 313-SNP polygenic risk score and breast cancer risk in women from crete. Ann Oncol. 2022;33(suppl_3):S184–S191. [Google Scholar]
  • 51.Khera A.V., et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat Genet. 2018;50(9):1219–1224. doi: 10.1038/s41588-018-0183-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Hughes E., et al. Development and validation of a clinical polygenic risk score to predict breast cancer risk. JCO Precision Oncology. 2020;(4):585–592. doi: 10.1200/PO.19.00360. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.van Veen E.M., et al. Use of single-nucleotide polymorphisms and mammographic density plus classic risk factors for breast cancer risk prediction. JAMA Oncol. 2018;4(4):476–482. doi: 10.1001/jamaoncol.2017.4881. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Vachon C.M., et al. Joint association of mammographic density adjusted for age and body mass index and polygenic risk score with breast cancer risk. Breast Cancer Res. 2019;21(1):68. doi: 10.1186/s13058-019-1138-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Rudolph A., et al. Joint associations of a polygenic risk score and environmental risk factors for breast cancer in the Breast Cancer Association Consortium. Int J Epidemiol. 2018;47(2):526–536. doi: 10.1093/ije/dyx242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Evans D.G., et al. The importance of ethnicity: are breast cancer polygenic risk scores ready for women who are not of White European origin? Int J Cancer. 2022;150(1):73–79. doi: 10.1002/ijc.33782. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Kuchenbaecker K.B., et al. Risks of breast, ovarian, and contralateral breast cancer for BRCA1 and BRCA2 mutation carriers. JAMA. 2017;317(23):2402–2416. doi: 10.1001/jama.2017.7112. [DOI] [PubMed] [Google Scholar]
  • 58.Gallagher S., et al. Association of a polygenic risk score with breast cancer among women carriers of high- and moderate-risk breast cancer genes. JAMA Netw Open. 2020;3(7):e208501. doi: 10.1001/jamanetworkopen.2020.8501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Gao C., et al. Risk of breast cancer among carriers of pathogenic variants in breast cancer predisposition genes varies by polygenic risk score. J Clin Oncol. 2021;39(23):2564–2573. doi: 10.1200/JCO.20.01992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.teams F.c. Mammographic surveillance in women younger than 50 years who have a family history of breast cancer: tumour characteristics and projected effect on mortality in the prospective, single-arm, FH01 study. Lancet Oncol. 2010;11(12):1127–1134. doi: 10.1016/S1470-2045(10)70263-1. [DOI] [PubMed] [Google Scholar]
  • 61.Colleoni M., et al. Very young women (<35 years) with operable breast cancer: features of disease at presentation. Ann Oncol. 2002;13(2):273–279. doi: 10.1093/annonc/mdf039. [DOI] [PubMed] [Google Scholar]
  • 62.Milne R.L., et al. Identification of ten variants associated with risk of estrogen-receptor-negative breast cancer. Nat Genet. 2017;49(12):1767–1778. doi: 10.1038/ng.3785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Martin A.R., et al. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat Genet. 2019;51(4):584–591. doi: 10.1038/s41588-019-0379-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Wang S., et al. Genetic variants demonstrating flip-flop phenomenon and breast cancer risk prediction among women of African ancestry. Breast Cancer Res Treat. 2018;168(3):703–712. doi: 10.1007/s10549-017-4638-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Popejoy A.B., Fullerton S.M. Genomics is failing on diversity. Nature. 2016;538(7624):161–164. doi: 10.1038/538161a. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Sirugo G., Williams S.M., Tishkoff S.A. The missing diversity in human genetic studies. Cell. 2019;177(1):26–31. doi: 10.1016/j.cell.2019.02.048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.GWAS diversity monitor. https://gwasdiversitymonitor.com Accessed 11/10/22.
  • 68.Duncan L., et al. Analysis of polygenic risk score usage and performance in diverse human populations. Nat Commun. 2019;10(1):3328. doi: 10.1038/s41467-019-11112-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Zeinomar N., Chung W.K. Cases in precision medicine: the role of polygenic risk scores in breast cancer risk assessment. Ann Intern Med. 2021;174(3):408–412. doi: 10.7326/M20-5874. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Yanes T., et al. The emerging field of polygenic risk scores and perspective for use in clinical care. Hum Mol Genet. 2020;29(R2):R165–R176. doi: 10.1093/hmg/ddaa136. [DOI] [PubMed] [Google Scholar]
  • 71.Sud A., Turnbull C., Houlston R. Will polygenic risk scores for cancer ever be clinically useful? NPJ Precis Oncol. 2021;5(1):40. doi: 10.1038/s41698-021-00176-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Shah P.D. Polygenic risk scores for breast cancer-can they deliver on the promise of precision medicine? JAMA Netw Open. 2021;4(8):e2119333. doi: 10.1001/jamanetworkopen.2021.19333. [DOI] [PubMed] [Google Scholar]
  • 73.Evans D.G., et al. Breast cancer risk feedback to women in the UK NHS breast screening population. Br J Cancer. 2016;114(9):1045–1052. doi: 10.1038/bjc.2016.56. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Eklund M., et al. The WISDOM personalized breast cancer screening trial: simulation study to assess potential bias and analytic approaches. JNCI Cancer Spectr. 2018;2(4):pky067. doi: 10.1093/jncics/pky067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Rouge-Bugat M.-E., et al. vol. 3. La Presse Médicale Open; 2022. (MyPeBS International randomized study comparing personalised, risk-stratified to standard breast cancer screening in women aged 40–70: focus on recruitment strategy in France). [Google Scholar]

Articles from The Breast : Official Journal of the European Society of Mastology are provided here courtesy of Elsevier

RESOURCES