Skip to main content
Nature Portfolio logoLink to Nature Portfolio
. 2022 Feb 12;25(4):755–761. doi: 10.1038/s41391-022-00497-7

Prostate cancer risk stratification improvement across multiple ancestries with new polygenic hazard score

Minh-Phuong Huynh-Le 1,2,3, Roshan Karunamuni 2,3, Chun Chieh Fan 3, Lui Asona 2, Wesley K Thompson 4,5, Maria Elena Martinez 6, Rosalind A Eeles 7,8, Zsofia Kote-Jarai 7, Kenneth R Muir 9, Artitaya Lophatananon 9, Johanna Schleutker 10,11, Nora Pashayan 12,13, Jyotsna Batra 14,15, Henrik Grönberg 16, David E Neal 17,18,19, Børge G Nordestgaard 20,21, Catherine M Tangen 22, Robert J MacInnis 23,24, Alicja Wolk 25, Demetrius Albanes 26, Christopher A Haiman 27, Ruth C Travis 28, William J Blot 29,30, Janet L Stanford 31,32, Lorelei A Mucci 33, Catharine M L West 34, Sune F Nielsen 21,20, Adam S Kibel 35, Olivier Cussenot 36,37, Sonja I Berndt 26, Stella Koutros 26, Karina Dalsgaard Sørensen 38,39, Cezary Cybulski 40, Eli Marie Grindedal 41, Florence Menegaux 42, Jong Y Park 43, Sue A Ingles 44, Christiane Maier 45, Robert J Hamilton 46,47, Barry S Rosenstein 48, Yong-Jie Lu 49, Stephen Watya 50, Ana Vega 51,52,53, Manolis Kogevinas 54,55,56, Fredrik Wiklund 16, Kathryn L Penney 57, Chad D Huff 58, Manuel R Teixeira 59,60,61, Luc Multigner 62, Robin J Leach 63, Hermann Brenner 64,65,66, Esther M John 67, Radka Kaneva 68, Christopher J Logothetis 69, Susan L Neuhausen 70, Kim De Ruyck 71, Piet Ost 72, Azad Razack 73, Lisa F Newcomb 31,74, Jay H Fowke 75, Marija Gamulin 76, Aswin Abraham 77, Frank Claessens 78, Jose Esteban Castelao 79, Paul A Townsend 80,81, Dana C Crawford 82, Gyorgy Petrovics 83,84, Ron H N van Schaik 85, Marie-Élise Parent 86,87, Jennifer J Hu 88, Wei Zheng 29; UKGPCS collaborators93; APCB (Australian Prostate Cancer BioResource)94,95; NC-LA PCaP Investigators96; The IMPACT Study Steering Committee and Collaborators97; Canary PASS Investigators31,74; The Profile Study Steering Committee98; The PRACTICAL Consortium, Ian G Mills 89, Ole A Andreassen 90, Anders M Dale 91,3,90, Tyler M Seibert 2,3,90,91,92,
PMCID: PMC9372232  NIHMSID: NIHMS1771503  PMID: 35152271

Abstract

Background

Prostate cancer risk stratification using single-nucleotide polymorphisms (SNPs) demonstrates considerable promise in men of European, Asian, and African genetic ancestries, but there is still need for increased accuracy. We evaluated whether including additional SNPs in a prostate cancer polygenic hazard score (PHS) would improve associations with clinically significant prostate cancer in multi-ancestry datasets.

Methods

In total, 299 SNPs previously associated with prostate cancer were evaluated for inclusion in a new PHS, using a LASSO-regularized Cox proportional hazards model in a training dataset of 72,181 men from the PRACTICAL Consortium. The PHS model was evaluated in four testing datasets: African ancestry, Asian ancestry, and two of European Ancestry—the Cohort of Swedish Men (COSM) and the ProtecT study. Hazard ratios (HRs) were estimated to compare men with high versus low PHS for association with clinically significant, with any, and with fatal prostate cancer. The impact of genetic risk stratification on the positive predictive value (PPV) of PSA testing for clinically significant prostate cancer was also measured.

Results

The final model (PHS290) had 290 SNPs with non-zero coefficients. Comparing, for example, the highest and lowest quintiles of PHS290, the hazard ratios (HRs) for clinically significant prostate cancer were 13.73 [95% CI: 12.43–15.16] in ProtecT, 7.07 [6.58–7.60] in African ancestry, 10.31 [9.58–11.11] in Asian ancestry, and 11.18 [10.34–12.09] in COSM. Similar results were seen for association with any and fatal prostate cancer. Without PHS stratification, the PPV of PSA testing for clinically significant prostate cancer in ProtecT was 0.12 (0.11–0.14). For the top 20% and top 5% of PHS290, the PPV of PSA testing was 0.19 (0.15–0.22) and 0.26 (0.19–0.33), respectively.

Conclusions

We demonstrate better genetic risk stratification for clinically significant prostate cancer than prior versions of PHS in multi-ancestry datasets. This is promising for implementing precision-medicine approaches to prostate cancer screening decisions in diverse populations.

Subject terms: Cancer epidemiology, Cancer screening, Cancer genetics

Introduction

Identification of men at greatest risk of developing prostate cancer remains an important challenge. Risk stratification using common genetic markers, such as single-nucleotide polymorphisms (SNPs), shows promise toward more effectively identifying men at greatest risk of developing aggressive or fatal prostate cancer [1, 2]. Analyses of benefit, harm, and cost-effectiveness support use of genomic risk stratification to guide prostate cancer screening [3, 4]. Ensuring these tools perform well in diverse populations is important to ensure risk stratification is optimal for all men and avoid exacerbating existing health disparities [5, 6].

We previously developed a polygenic hazard score (PHS) and demonstrated in an independent European dataset that the PHS was associated with age at diagnosis of clinically significant prostate cancer [1]. Risk stratification with this score also improved the accuracy of PSA testing [7, 8]. We then validated the PHS model (with 46 SNPs) for association with age at diagnosis and with prostate-cancer specific mortality in a dataset that included men of diverse descent, including European, African, and Asian ancestry [2]. We have also improved performance in men of African ancestry by searching for SNPs within that subpopulation [9, 10].

Recent meta-analyses have identified over 200 SNPs associated with prostate cancer, including some identified through subset analyses in men of non-European ancestry [11, 12]. Given the increasing number of SNPs associated with prostate cancer, we evaluated whether including more SNPs in a prostate cancer PHS would improve associations with clinically significant prostate cancer in multi-ancestry datasets.

Methods

Participants

Genotype and phenotype data, all de-identified, were obtained from the PRACTICAL consortium. Participants had previously been genotyped via the OncoArray [13] or the iCOGs [14] chips; 90,638 men were available for this analysis.

The available data were split into a training dataset and four testing datasets, taking into account prior power analyses and PHS association results [7, 15, 16]. The training dataset for the model included 72,181 men of European genetic ancestry genotyped via OncoArray (24,010 controls and 48,171 cases). The four testing datasets included: (1) men of African ancestry (n = 6253: 3013 controls and 3240 cases), (2) men of Asian ancestry (n = 2378: 1184 controls and 1194 cases), (3) the Cohort of Swedish Men (COSM) population-based cohort with long-term outcomes [17] (n = 3279: 1116 controls, 2163 cases, and 278 prostate cancer deaths), and (4) the ProtecT population-based prospective trial with screening (prostate-specific antigen, PSA) and biopsy outcomes for both cases and controls (n = 6411: 4828 controls and 1583 cases) [18].

Polygenic hazard score model development using LASSO regularization

We sought to develop an optimal, integrated PHS model with candidate SNPs chosen from those used in the prior PHS and from those identified as susceptibility loci for prostate cancer in a genome-wide trans-ancestry meta-analysis [11]. Candidate SNPs included the 46 from the original PHS development and 269 from the meta-analysis. A machine-learning, LASSO-regularized Cox proportional hazards model approach was used to objectively select SNPs and estimate weights, as described previously [8].

There were 299 unique candidate SNPs associated with prostate cancer consistent across the training and testing datasets used in the present work. We first identified SNP pairs (among the 299 candidates) that were highly correlated (r2 > 0.95). Each of these paired, correlated SNPs was tested in a univariable Cox proportional hazards model for association with age at prostate cancer diagnosis; the SNP with the larger p-value was eliminated from inclusion in the model. All other (unpaired) SNPs were included as candidates for the present PHS model.

The R (version 4.0.1) “glmnet” package was used to estimate the LASSO-regularized Cox proportional hazards model [19, 20]. Age at prostate cancer diagnosis was the time to event, and the predictor variables included the genotype allele counts of candidate SNPs and first four European ancestry principal components. Controls were censored at age of last follow-up. The LASSO-regularized model’s hyper-parameter (lambda) was selected using 10-fold cross-validation [19, 20]. The final form of the LASSO model was estimated using the lambda value that minimized the mean cross-validated error.

Association with prostate cancer

We evaluated the association of the adapted PHS with clinically significant prostate cancer, as well as any prostate cancer, via Cox proportional hazards models in each of the four testing datasets. Clinically significant prostate cancer was defined to be a prostate cancer case with Gleason score ≥7, PSA ≥ 10 ng/mL, T3-T4 stage, nodal metastases, or distant metastases [21].

As the COSM dataset had long-term follow-up data available [17], we additionally evaluated the adapted PHS for association with age at prostate cancer death [16]. There were 278 deaths from prostate cancer in the COSM dataset.

Hazard ratio performance

Hazard ratios between the top 5% and middle 40% (HR95/50), top 20% and middle 40% (HR80/50), bottom 20% and middle 40% (HR20/50), and top and bottom 20% (HR80/20) were estimated for any, for clinically significant, and for fatal prostate cancer. Percentiles of genetic risk were determined within the controls the training set with age less than 70 years [1, 2, 7, 8].

Family history

Given that family history of prostate cancer is currently one of the most useful clinical risk factors for the development of prostate cancer, we used Cox proportional hazards models to assess family history for association with any, with clinically significant, or with fatal prostate cancer. Family history of prostate cancer was defined as presence or absence of a first-degree relative diagnosed with prostate cancer. Multivariable models using both family history and the adapted PHS were compared to using family history alone via a log-likelihood test with α = 0.01. HRs were calculated for each variable: HRs for PHS in the multivariable models were estimated as the HR80/20 in each testing dataset (e.g., men in the highest vs. lowest quintile of genetic risk by PHS). HRs for family history of prostate cancer were estimated as the exponent of the beta from the multivariable Cox regression. As done previously [1, 7], p-values were truncated at <10−16.

Positive predictive value performance

Positive predictive value (PPV) performance of PSA testing was calculated using data from the population-based ProtecT screening study [18] (prostate biopsy results were available for both cases and controls with a positive PSA [≥3 ng/mL]). To estimate the PPV and confidence intervals, we generated 1000 bootstrap samples using ProtecT participants with positive PSA, while maintaining the 1:2 case:control ratio in the ProtecT dataset. PPV was calculated as the proportion of PSA-positive participants who were diagnosed with clinically significant prostate cancer on biopsy, looking at those participants in the top 5 (PPV95) or top 20 percentiles (PPV80) of PHS genetic risk.

Cumulative incidence curves for PHS

Genetic-risk-stratified cumulative incidence curves for prostate cancer were derived using previously described methods [7, 8]. Briefly, age-specific population data from the United Kingdom (Cancer Research UK [7]) were used to estimate prostate cancer incidence for men aged 40–70 years. Data from the population-based Cluster Randomized Trial of PSA Testing for Prostate Cancer (CAP) trial [7] were used to adjust this population incidence curve to reflect the age-specific cumulative incidence of clinically significant and non-clinically-significant prostate cancer. Genetic-risk-stratified cumulative incidence curves were then calculated for men in the upper 5 and 20 percentiles of PHS genetic risk by multiplying the prostate-cancer-specific cumulative incidence by the mean value of HR95/50 and HR80/50 in the testing dataset, respectively.

Results

A total of 290 SNPs returned non-zero SNP coefficients using the regularization-weight selection and were included in the final model, called PHS290 (Supplementary Material).

HR performance of PHS290 demonstrates risk stratification across percentiles of genetic risk (Table 1). Comparing the top and bottom quintiles of genetic risk for clinically significant prostate cancer, men with high PHS had HRs of 13.73 [12.43–15.16], 7.07 [6.58–7.60], 10.31 [9.58–11.11], and 11.18 [10.34–12.09] in the ProtecT, African, Asian, and COSM datasets, respectively. Similar risk stratification was seen when evaluating risk of any prostate cancer. Finally, when comparing the top and bottom quintiles of genetic risk in the COSM dataset, men with high PHS had a HR of 7.73 [6.45–9.27] for prostate cancer death.

Table 1.

Hazard ratio (HR) performance in the four testing datasets.

Dataset Prostate cancer type   HRs and 95% CI
HR20/50 HR80/50 HR95/50 HR80/20
ProtecT n = 6411 Any 0.31 [0.30–0.32] 3.47 [3.36–3.58] 5.93 [5.67–6.21] 11.16 [10.48–11.88]
Clinically significant 0.28 [0.27–0.30] 3.86 [3.67–4.06] 6.91 [6.42–7.44] 13.73 [12.43–15.16]
African n = 6253 Any 0.43 [0.42–0.45] 2.58 [2.50–2.67] 3.69 [3.52–3.86] 5.95 [5.59–6.34]
Clinically significant 0.40 [0.39–0.41] 2.82 [2.72–2.93] 4.33 [4.10–4.57] 7.07 [6.58–7.60]
Asian n = 2378 Any 0.34 [0.33–0.35] 2.99 [2.89–3.08] 5.08 [4.85–5.33] 8.75 [8.21–9.32]
Clinically significant 0.31 [0.30–0.33] 3.24 [3.12–3.36] 5.66 [5.46–5.98] 10.31 [9.58–11.11]
COSM n = 3279 Any 0.33 [0.32–0.34] 3.54 [3.42–3.65] 5.77 [5.51–6.04] 10.87 [10.21–11.57]
Clinically significant 0.32 [0.31–0.33] 3.59 [3.45–3.75] 5.91 [5.58–6.26] 11.18 [10.34–12.09]
Fatal 0.38 [0.35–0.42] 2.95 [2.68–3.25] 4.49 [3.93–5.13] 7.73 [6.45–9.27]

HRs are shown with mean sample-weight-corrected values and 95% confidence intervals. Calculations were done using age at diagnosis of any or of clinically significant prostate cancer across all datasets, respectively, and also with age at prostate cancer death for the COSM dataset.

Family history and PHS290

Family history data based on self-report and were available for 89%, 89%, 43%, and 75% of individuals in the ProtecT, African, Asian, and COSM datasets, respectively; 7%, 18%, 9%, and 14% of individuals, respectively, reported having a first-degree relative diagnosed with prostate cancer. The combination of family history and PHS performed better than family history, alone, for clinically significant prostate cancer (and for any prostate cancer) in each of the four testing datasets (log-likelihood p < 10−16; Table 2). Additionally, family history and PHS together performed better than family history, alone, for fatal prostate cancer in the COSM dataset (log-likelihood p < 10−16).

Table 2.

Multivariable Cox models with both PHS and family history of prostate cancer (defined as ≥1 first-degree relative affected) for association with any prostate cancer, with clinically significant prostate cancer, and with fatal prostate cancer.

Dataset Variable   Any prostate cancer
beta z-score p-value HR
ProtecT (iCOGs), n = 5703 PHS 2.11 72.1 <10−16 8.13
Family history 0.06 1.7 0.1 1.07
African (OncoArray), n = 5557 PHS 1.64 53.8 <1016 5.03
Family history 1.13 46.6 <1016 3.09
Asian (OncoArray); n = 1028 PHS 1.99 68.5 <1016 7.64
Family history 0.45 13.1 <10−16 1.56
COSM (OncoArray), n = 2453 PHS 2.13 71.2 <1016 8.71
Family history 0.53 19.0 <1016 1.70
Dataset Variable   Clinically significant prostate cancer
beta z-score p-value HR
ProtecT (iCOGs), n = 5703 PHS 2.32 49.8 <1016 10.01
Family history −0.01 −0.2 0.82 0.99
African (OncoArray), n = 5557 PHS 1.67 37.4 <1016 5.19
Family history 1.17 33.2 <1016 3.22
Asian (OncoArray), n = 1028 PHS 1.89 50.4 <1016 6.90
Family history 0.13 2.5 0.012 1.14
COSM (OncoArray), n = 2453 PHS 2.13 56.9 <1016 6.90
Family history 0.45 12.6 <1016 1.57
Dataset Variable   Fatal prostate cancer
beta z-score p-value HR
COSM (OncoArray), n = 2453 PHS 1.68 19.8 <1016 5.51
Family history 0.43 5.0 4.7 × 107 1.53

Analyses were limited to participants with known family history. Beta and z-scores refer to the overall association (within the multivariable Cox regression) with the endpoint of interest, within the corresponding testing dataset. The p-values reported are two-tailed from the multivariable Cox models.

Positive predictive value performance of PHS290

The PPV of PSA testing for clinically significant prostate cancer was 0.19 (0.15–0.22) for the top 20% of genetic risk (PPV80) and 0.26 (0.19–0.33) for the top 5% of genetic risk (PPV80; Fig. 1). Both were greater than the overall PPV of PSA alone, which was 0.12 (0.11–0.14).

Fig. 1. PPV performance in the ProtecT dataset for clinically significant prostate cancer, estimated using 3 approaches: standard (not using PHS), top 20% of PHS values (PPV80), and top 5% of PHS values (PPV95).

Fig. 1

Error bars are 95% bootstrap confidence intervals.

Cumulative incidence curves for PHS290

Genetic-risk-stratified cumulative incidence curves for clinically significant and non-clinically significant prostate cancer demonstrate greater prostate cancer incidence with higher genetic-adjusted risk (Fig. 2).

Fig. 2. Genetic-risk-adjusted cumulative incidence curves for PHS290.

Fig. 2

Curves are shown for the upper 5th (>95th) and upper 20th (>80th) percentile of PHS290 for clinically significant and non-clinically-significant prostate cancer. The reference curves represent the overall population average from the UK.

Discussion

The improved PHS (PHS290) demonstrates excellent genetic risk stratification, including for clinically significant prostate cancer. This was true in four separate testing sets of varied genetic ancestry (Asian, African, European). Additionally, PHS290 was associated with lifetime prostate-cancer-specific mortality in a population-based cohort [16]. Hazard ratios with PHS290 are larger for each of these associations than those reported for previous versions of PHS [2, 8], demonstrating the value of incorporating SNPs from genome-wide meta-analysis and fine-mapping. The improvements demonstrated here are promising for implementing personalized approaches to prostate cancer screening decisions in diverse populations.

Health disparities are a major problem in prostate cancer. Given the exclusion of non-European data in most genome studies [5, 2224], it is important that the pool of candidate SNPs here included those identified from a recent trans-ancestry meta-analysis [11]. Testing and improving performance of genomic risk scores in diverse populations is critical to equitable implementation of these new tools and important for avoiding exacerbation of existing disparities. PHS290 still performs better in men of European genetic ancestry—an expected result, given the much greater data availability in that population. Further genomic studies in diverse populations are essential, as diversity in model development improves performance in diverse populations [9, 10].

The intersection of social constructs like race/ethnicity and genomics also raises interesting and entangled challenges. Even availability of genomic data is only part of the problem, as disparities in health outcomes are rooted in systemic racism and inequities in access to healthcare [25, 26]. Genotypic ancestry may be a step toward biology, but the continental groups still represent an oversimplification of genetic diversity and a pre-determined assumption that socially defined categories have biological meaning in all contexts. We have previously shown that agnostic genetic clusters are informative for subgroup analyses [2], and this approach may be a better way forward, provided the genomic diversity of the whole population is represented in the available data. Here, we have used genotypic ancestry to evaluate the potential differential performance in groups historically excluded from large-scale genomic studies. We also note that local ancestry may be a critical consideration in admixed populations. We have found previously that PHS performance can vary by the makeup of a region of the genome, beyond what is explained by global ancestry categories assigned for an individual’s entire genome [10]. Despite these challenges and opportunities for future improvement, the current results demonstrate PHS290 does provide meaningful risk stratification in diverse datasets.

The cumulative incidence of clinically significant prostate cancer is heavily influenced by age-specific genetic risk, as demonstrated by genetic-risk-stratified cumulative incidence curves (Fig. 2). As men with high PHS290 age, the incidence curve for clinically significant prostate cancer increases dramatically, prominently separating from the incidence for non-clinically-significant prostate cancer. This effect is driven by the high HR for clinically significant prostate cancer in these men, combined with increasing incidence specifically of more clinically significant cancers as men age [7, 27]. Furthermore, we found that risk stratification with PHS290 improved accuracy of PSA testing, as assessed by probability of a positive PSA test leading to a diagnosis of clinically significant cancer on biopsy. Consistent with a prior study [8], this improvement in PPV of PSA testing was not better when using PHS290 than when using PHS46 in this dataset [2]. PPV analyses in larger datasets could permit finer granularity for age-specific genetic risk to assess whether the increased HRs of PHS290 might translate to better performance of PSA testing than that achieved already with PHS46.

The HRs reported here suggest clinical relevance for PHS290. Predictive tools in routine clinical use for other diseases (e.g., breast cancer, diabetes, and cardiovascular disease) have reported HRs of ~1–3 for endpoints of interest [2831]. Current guidelines recommend earlier and more frequent consideration of prostate cancer screening for men with a family history of prostate cancer or African ancestry, citing an elevated risk 28–80% above that of men without these risk factors [3234]. Guidelines more strongly recommend earlier and more frequent screening in men with germline mutations in BRCA2, which are rare but are estimated to infer up to 3-fold increased risk [3335]. In the present study, men in the top 20% for PHS290 (compared to men with average risk) had HRs for clinically significant prostate cancer of ~2.8–3.9. For men in the top 5% for PHS290, those HRs increase to 4.3–6.9, depending on ancestry. While individuals with high polygenic risk may also develop low-grade prostate cancer in their lifetime, the time-to-event analysis applied here shows that high genetic risk confers a greater hazard for prostate cancer death. This finding is consistent with prior reports, though the effect size is larger with PHS290 [2, 16, 33].

Family history data were not uniformly available across source studies or testing datasets and were notably less available in the Asian dataset (43%, compared to ≥75% in the other testing datasets). The proportion of individuals with positive family history—among those with available family history data—also varied across datasets, ranging from 7% (ProtecT) to 18% (African). The interplay of family history and polygenic risk warrants further investigation in more complete cohorts. It is also worth noting that family history availability is not always available (or reliable) in clinical practice. Nonetheless, both family history and polygenic risk appear important for assessing individual risk of prostate cancer. Possibly, family history represents not only inherited genetic risk but also shared environmental exposures.

If this technology is to be implemented in clinical practice, it is important to consider reliability. Genotype arrays are known to have excellent reproducibility, with concordance of 99.40–99.87% on repeat testing with the same array [36]. Concordance across genotyping platforms is also excellent at 98.80% [36].

Our work has some limitations. First, the weights were calculated in men of European genetic ancestry alone, although SNP candidate selection was performed in multi-ancestry analyses. Future studies evaluating PHS for prostate cancer risk stratification will include non-Europeans in SNP weight calculations. The available data did not permit testing of PPV or association with fatal disease in non-European populations. Moreover, the cumulative incidence curves here are specific to the UK, where we had the most robust population age-specific incidence data for clinically significant prostate cancer. The testing sets used in the present study did represent a very small proportion of the data used for candidate SNP identification in the prior genome-wide association meta-analysis (as opposed to the development of PHS46, which was performed in a dataset completely independent of the validation dataset) [2, 11]. However, the training and testing sets were kept separate in the present study, and the use of the LASSO-regularized Cox model reduces over-fitting [37].

The PHS290 described here has the strongest reported association with prostate cancer in men of European, African, and Asian genetic ancestry. The score was also associated with lifetime prostate-cancer-specific mortality in a population-based cohort. A performance gap remains between genetic ancestry groups that might be closed through development using more data from men of Asian or African ancestry. Nonetheless, the results here suggest PHS290 may improve prostate cancer risk-stratification efforts in multi-ancestry populations.

Supplementary information

Supplementary Material (401.3KB, pdf)

Author contributions

All authors contributed to the study design and data collection. Analysis and manuscript preparation was performed by M-PH-L, RK, CCF, WKT, IGM, OAA, AMD, and TMS. All authors read and approved the final manuscript.

Funding

This study was funded in part by grants from the University of California (#C21CR2060), the United States National Institute of Health/National Institute of Biomedical Imaging and Bioengineering (#K08EB026503), the Prostate Cancer Foundation, the Research Council of Norway (#223273), KG Jebsen Stiftelsen, and South East Norway Health Authority. Funding for the PRACTICAL consortium member studies is detailed in the Supplementary Material. The content is solely the responsibility of the authors and does not necessarily represent the official views of any of the funding agencies, who had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.

Data availability

Prostate Cancer Association Group to Investigate Cancer Associated Alterations in the Genome (PRACTICAL) Consortium data are available upon request to the Data Access Committee (http://practical.icr.ac.uk/blog). Questions and requests for further information may be directed to PRACTICAL@icr.ac.uk. Code used for this work is stored in an institutional repository and will be shared upon request to the corresponding author.

Competing interests

All authors declare no support from any organization for the submitted work except as follows: AMD and TMS report a past research grant from the US Department of Defense. OAA reports research grants from K.G Jebsen Stiftelsen, Research Council of Norway, and South East Norway Health Authority. Authors declare no financial relationships with any organizations that might have an interest in the submitted work in the previous 3 years except as follows, with all of these relationships outside the present study: TMS reports honoraria from Multimodal Imaging Services Corporation, Varian Medical Systems, and WebMD; he has an equity interest in CorTechs Labs and also serves on its Scientific Advisory Board. These companies might potentially benefit from the research results. The terms of this arrangement have been reviewed and approved by the University of California San Diego in accordance with its conflict-of-interest policies. OAA reports speaker honoraria from Lundbeck. Authors declare no other relationships or activities that could appear to have influenced the submitted work except as follows: OAA has a patent application # U. S. 20150356243 pending; AMD also applied for this patent application and assigned it to UC San Diego. AMD has additional disclosures outside the present work: founder, equity holder, and advisory board member for CorTechs Labs, Inc.; founder and equity holder in HealthLytix, Inc., advisory board member of Human Longevity, Inc.; recipient of nonfinancial research support from General Electric Healthcare. OAA is a consultant for HealthLytix, Inc. Additional acknowledgments for the PRACTICAL consortium and contributing studies are described in the Supplementary Information.

Ethics statement

The present analyses used de-identified data from the PRACTICAL consortium and have been approved by the Institutional Review Board at the University of California San Diego. All contributing studies were approved by the relevant ethics committees and performed in accordance with the Declaration of Helsinki.

Footnotes

Members of the UKGPCS collaborators, APCB (Australian Prostate Cancer BioResource), NC-LA PCaP Investigators, The IMPACT Study Steering Committee and Collaborators, Canary PASS Investigators, The Profile Study Steering Committee, The PRACTICAL Consortium are listed in the Supplementary Information.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

The online version contains supplementary material available at 10.1038/s41391-022-00497-7.

References

  • 1.Seibert TM, Fan CC, Wang Y, Zuber V, Karunamuni R, Parsons JK, et al. Polygenic hazard score to guide screening for aggressive prostate cancer: Development and validation in large scale cohorts. BMJ. 2018;360:1–7. doi: 10.1136/bmj.j5757. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Huynh-Le M-P, Fan CC, Karunamuni R, Thompson WK, Martinez ME, Eeles RA, et al. Polygenic hazard score is associated with prostate cancer in multi-ethnic populations. Nat Commun. 2021;12:1236. doi: 10.1038/s41467-021-21287-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Callender T, Emberton M, Morris S, Pharoah PDP, Pashayan N. Benefit, Harm, and Cost-effectiveness Associated with Magnetic Resonance Imaging before Biopsy in Age-based and Risk-stratified Screening for Prostate Cancer. JAMA Network Open. 2021;4:2037657. doi: 10.1001/jamanetworkopen.2020.37657. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Callender T, Emberton M, Morris S, Eeles R, Kote-Jarai Z, Pharoah PDP, et al. Polygenic risk-tailored screening for prostate cancer: A benefit-harm and cost-effectiveness modelling study. PLoS Med. 2019;16:e1002998. doi: 10.1371/journal.pmed.1002998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Petrovski S, Goldstein DB. Unequal representation of genetic variation across ancestry groups creates healthcare inequality in the application of precision medicine. Genome Biol. 2016;17. 10.1186/s13059-016-1016-y. [DOI] [PMC free article] [PubMed]
  • 6.Duncan L, Shen H, Gelaye B, Meijsen J, Ressler K, Feldman M, et al. Analysis of polygenic risk score usage and performance in diverse human populations. Nat Commun. 2019;10. 10.1038/s41467-019-11112-0. [DOI] [PMC free article] [PubMed]
  • 7.Huynh-Le MP, Fan CC, Karunamuni R, Walsh EI, Turner EL, Athene Lane J, et al. A genetic risk score to personalize prostate cancer screening, applied to population data. Cancer Epidemiol Biomark Preven. 2020;29:1731–8. doi: 10.1158/1055-9965.EPI-19-1527. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Karunamuni RA, Huynh-Le M-P, Fan CC, Thompson W, Eeles RA, Kote-Jarai Z, et al. Additional SNPs improve risk stratification of a polygenic hazard score for prostate cancer. Prostate Cancer Prostatic Dis. 2021. 10.1038/s41391-020-00311-2. [DOI] [PMC free article] [PubMed]
  • 9.Karunamuni RA, Huynh‐Le M, Fan CC, Thompson W, Eeles RA, Kote‐Jarai Z, et al. African‐specific improvement of a polygenic hazard score for age at diagnosis of prostate cancer. International Journal of Cancer. 2021;148:99–105. doi: 10.1002/ijc.33282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Karunamuni RA, Huynh-Le M-P, Fan CC, Thompson W, Lui A, Martinez ME, et al. Performance of African-ancestry-specific polygenic hazard score varies according to local ancestry in 8q24. Prostate Cancer Prostatic Dis. 2021;1–9. [DOI] [PMC free article] [PubMed]
  • 11.Conti DV, Darst BF, Moss LC, Saunders EJ, Sheng X, Chou A, et al. Trans-ancestry genome-wide association meta-analysis of prostate cancer identifies new susceptibility loci and informs genetic risk prediction. Nat Genet. 2021;53:65–75. doi: 10.1038/s41588-020-00748-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Schumacher FR, Al Olama AA, Berndt SI, Benlloch S, Ahmed M, Saunders EJ, et al. Association analyses of more than 140,000 men identify 63 new prostate cancer susceptibility loci. Nat Genet. 2018;50:928–36. doi: 10.1038/s41588-018-0142-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Amos CI, Dennis J, Wang Z, Byun J, Schumacher FR, Gayther SA, et al. The OncoArray consortium: A network for understanding the genetic architecture of common cancers. Cancer Epidemiol Biomark Preven. 2017;26:126–35. doi: 10.1158/1055-9965.EPI-16-0106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Eeles RA, Olama AAAl, Benlloch S, Saunders EJ, Leongamornlert DA, Tymrakiewicz M, et al. Identification of 23 new prostate cancer susceptibility loci using the iCOGS custom genotyping array. Nat Genet. 2013;45:385–91. doi: 10.1038/ng.2560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Karunamuni RA, Huynh-Le M-P, Fan CC, Eeles RA, Easton DF, Kote-Jarai ZS, et al. The effect of sample size on polygenic hazard models for prostate cancer. Eur J Hum Genet. 2020. 10.1038/s41431-020-0664-2. [DOI] [PMC free article] [PubMed]
  • 16.Huynh-Le M-P, Karunamuni R, Fan CC, Thompson WK, Muir K, Lophatananon A, et al. Common genetic and clinical risk factors: association with fatal prostate cancer in the Cohort of Swedish Men. Prostate Cancer Prostatic Dis. 2021;24:845–51. doi: 10.1038/s41391-021-00341-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Discacciati A, Orsini N, Andersson S-O, Andren O, Johansson J-E, Mantzoros CS, et al. Coffee consumption and risk of localized, advanced and fatal prostate cancer: a population-based prospective study. Annals of Oncology. 2013;24:1912–8. doi: 10.1093/annonc/mdt105. [DOI] [PubMed] [Google Scholar]
  • 18.Hamdy FC, Donovan JL, Lane JA, Mason M, Metcalfe C, Holding P, et al. 10-year outcomes after monitoring, surgery, or radiotherapy for localized prostate cancer. New England Journal of Medicine. 2016;375:1415–24. doi: 10.1056/NEJMoa1606220. [DOI] [PubMed] [Google Scholar]
  • 19.Tibshirani R. The lasso method for variable selection in the cox model. Stat Med. 1997;16:385–95. doi: 10.1002/(SICI)1097-0258(19970228)16:4&#x0003c;385::AID-SIM380&#x0003e;3.0.CO;2-3. [DOI] [PubMed] [Google Scholar]
  • 20.Tibshirani R. Regression Shrinkage and Selection via the Lasso. J R Stat Soc B. 1996;58:267–88. [Google Scholar]
  • 21.Schaeffer E, Srinivas S, Antonarakis ES, Armstrong AJ, Bekelman JE, Cheng H, et al. Prostate cancer, version 1.2021: Featured updates to the nccn guidelines. JNCCN. 2021;19:134–43. doi: 10.6004/jnccn.2021.0008. [DOI] [PubMed] [Google Scholar]
  • 22.Martin AR, Kanai M, Kamatani Y, Okada Y, Neale BM, Daly MJ. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat Genet. 2019;51:584–91. doi: 10.1038/s41588-019-0379-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Grinde KE, Qi Q, Thornton TA, Liu S, Shadyab AH, Chan KHK, et al. Generalizing polygenic risk scores from Europeans to Hispanics/Latinos. Genet Epidemiol. 2019;43:50–62. doi: 10.1002/gepi.22166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Popejoy AB, Fullerton SM. Genomics is failing on diversity. Nature. 2016;538:161–4. doi: 10.1038/538161a. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Riviere P, Luterstein E, Kumar A, Vitzthum LK, Deka R, Sarkar RR, et al. Survival of African American and non-Hispanic white men with prostate cancer in an equal-access health care system. Cancer. 2020;126:1683–90. doi: 10.1002/cncr.32666. [DOI] [PubMed] [Google Scholar]
  • 26.Dess RT, Hartman HE, Mahal BA, Soni PD, Jackson WC, Cooperberg MR, et al. Association of Black Race with Prostate Cancer-Specific and Other-Cause Mortality. JAMA Oncol. 2019;5:975–83. doi: 10.1001/jamaoncol.2019.0826. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Huynh-Le MP, Myklebust TÅ, Feng CH, Karunamuni R, Johannesen TB, Dale AM, et al. Age dependence of modern clinical risk groups for localized prostate cancer—A population-based study. Cancer. 2020;126:1691–9. doi: 10.1002/cncr.32702. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Brentnall AR, Cuzick J, Buist DSM, Bowles EJA. Long-term Accuracy of Breast Cancer Risk Assessment Combining Classic Risk Factors and Breast Density. JAMA Oncol. 2018;4:e180174. doi: 10.1001/jamaoncol.2018.0174. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Torkamani A, Wineinger NE, Topol EJ. The personal and clinical utility of polygenic risk scores. Nat Rev Genet. 2018;1. [DOI] [PubMed]
  • 30.Yeh H-C, Duncan BB, Schmidt MI, Wang N-Y, Brancati FL. Smoking, smoking cessation, and risk for type 2 diabetes mellitus: a cohort study. Ann Intern Med. 2010;152:10–17. doi: 10.7326/0003-4819-152-1-201001050-00005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Wang TJ, Larson MG, Levy D, Benjamin EJ, Leip EP, Omland T, et al. Plasma natriuretic peptide levels and the risk of cardiovascular events and death. N Engl J Med. 2004;350:655–63. doi: 10.1056/NEJMoa031994. [DOI] [PubMed] [Google Scholar]
  • 32.Tsodikov A, Gulati R, Carvalho TM, de, Heijnsdijk EAM, Hunter‐Merrill RA, Mariotto AB, et al. Is prostate cancer different in black men? Answers from 3 natural history models. Cancer. 2017;123:2312–9. doi: 10.1002/cncr.30687. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Shi Z, Platz EA, Wei J, Na R, Fantus RJ, Wang C-H, et al. Performance of Three Inherited Risk Measures for Predicting Prostate Cancer Incidence and Mortality: A Population-based Prospective Analysis. Eur Urol. 2020. 10.1016/j.eururo.2020.11.014. [DOI] [PubMed]
  • 34.Carroll PR, Parsons JK, Box G, Carlsson S, Catalona WJ, Drake BF. National Comprehensive Cancer Network. Prostate Cancer Early Detection (Version 2.2021). https://www.nccn.org/professionals/physician_gls/pdf/prostate_detection.pdf Accessed October 10, 2021.
  • 35.Giri VN, Knudsen KE, Kelly WK, Cheng HH, Cooney KA, Cookson MS, et al. Implementation of Germline Testing for Prostate Cancer: Philadelphia Prostate Cancer Consensus Conference 2019. JCO. 2020. 10.1200/JCO.20.00046. [DOI] [PMC free article] [PubMed]
  • 36.Hong H, Xu L, Liu J, Jones WD, Su Z, Ning B, et al. Technical Reproducibility of Genotyping SNP Arrays Used in Genome-Wide Association Studies. PLOS ONE. 2012;7:e44483. doi: 10.1371/journal.pone.0044483. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.McNeish DM. Using Lasso for Predictor Selection and to Assuage Overfitting: A Method Long Overlooked in Behavioral Sciences. Multivariate Behav Res. 2015;50:471–84. doi: 10.1080/00273171.2015.1036965. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material (401.3KB, pdf)

Data Availability Statement

Prostate Cancer Association Group to Investigate Cancer Associated Alterations in the Genome (PRACTICAL) Consortium data are available upon request to the Data Access Committee (http://practical.icr.ac.uk/blog). Questions and requests for further information may be directed to PRACTICAL@icr.ac.uk. Code used for this work is stored in an institutional repository and will be shared upon request to the corresponding author.


Articles from Prostate Cancer and Prostatic Diseases are provided here courtesy of Nature Publishing Group

RESOURCES