Skip to main content
UKPMC Funders Author Manuscripts logoLink to UKPMC Funders Author Manuscripts
. Author manuscript; available in PMC: 2020 Mar 27.
Published in final edited form as: Stat Sci. 2020 Mar 3;35(1):14–30. doi: 10.1214/19-STS729

Risk Models for Breast Cancer and Their Validation

Adam R Brentnall 1, Jack Cuzick 2
PMCID: PMC7100774  EMSID: EMS85958  PMID: 32226220

Abstract

Strategies to prevent cancer and diagnose it early when it is most treatable are needed to reduce the public health burden from rising disease incidence. Risk assessment is playing an increasingly important role in targeting individuals in need of such interventions. For breast cancer many individual risk factors have been well understood for a long time, but the development of a fully comprehensive risk model has not been straightforward, in part because there have been limited data where joint effects of an extensive set of risk factors may be estimated with precision. In this article we first review the approach taken to develop the IBIS (Tyrer–Cuzick) model, and describe recent updates. We then review and develop methods to assess calibration of models such as this one, where the risk of disease allowing for competing mortality over a long follow-up time or lifetime is estimated. The breast cancer risk model model and calibration assessment methods are demonstrated using a cohort of 132,139 women attending mammography screening in the State of Washington, USA.

Keywords: Breast cancer, calibration, risk assessment, breast density, Tyrer–Cuzick model, IBIS model

1. Introduction

Unlike lung cancer (tobacco smoking) and cervix cancer (persistent infection with the human papilloma virus) where a single factor explains the majority of the cases, a large number of factors have been found to be important for determining the risk of breast cancer. An increased risk in women with a family history appears to have been known in ancient Roman times [59], and in 1842 Rigoni-Stern reported that nuns had an increased risk of breast cancer [53, 26], paving the way for further research which established that not having a child or having a first childbirth at an older age increased the risk of this disease. Much of the modern work on risk factors was done by Macmahon and colleagues, mostly at Harvard [43]. Pike and colleagues emphasized the link with oestrogens and proposed that much of the population risk could explained by computing the cumulative lifetime exposure to oestrogens based mostly on age at menarche, childbearing history and age at menopause [49]; an important meta-analysis and update on these and other factors was provided by Beral and colleagues [47].

In 1976, Wolfe discovered that breast density was another key risk factor that was broadly unrelated to the known classical factors, but of roughly equal importance in their combined predictive value [69]. This has been further developed by Boyd [11, 10], McCormack and others [45]. In the same paper, Wolfe also suggested [69] using breast density for risk assessment to determine how best to use mammography for breast cancer screening. Several decades later, advances in the ability to accurately stratify women into higher and lower-risk groups are likely to move early detection strategies towards this vision, and replace ‘one-size-fits-all’ screening with so-called ‘risk-adapted’ programs in which both the frequency with which a woman is screened and the modality are chosen based on the risk of breast cancer. Motivations include the identification of women at extremely high risk, who are potential candidates for risk-reducing surgery or preventive therapy [29], delineation of a group at moderately enhanced risk who might benefit from enhanced screening [60], and identification of populations at sufficiently low risk so as to require less frequency or even no screening.

A number of risk models have been developed, mostly for Caucasian women living in North America and Western Europe [32, 18, 4]. In this article, we focus on risk assessment using the Tyrer–Cuzick model [63]. This is a hybrid of two popular sub-models often used for breast cancer risk assessment: a genetic segregation model for familial risk that is combined with a proportional-hazards regression model for other risk factors.

Breast cancer risk factors in the model broadly fit into five general categories: (1) family history and highly penetrant dominant genetic mutations; (2) factors associated with oestrogen exposure, including age at first childbirth, age at menopause and menarche (beginning of periods), use of hormone replacement therapy, height and weight; (3) certain types of prior benign breast disease, (4) breast imaging features seen on the mammogram—notably the amount of dense tissue (opaque areas on a mammogram or breast x-ray); and (5) common but individually less penetrant genetic differences (single nucleotide polymorphisms, SNPs), where several hundred relatively common genetic variants that individually have a small impact on disease have been identified, and that jointly make an important contribution to overall risk assessment via a ‘polygenic risk score’ [44]. There are also some other apparent risk factors which are harder to quantify, but which may improve the performance of the model.

In the rest of this paper, we outline the statistical basis of our model, review and develop methods to assess model calibration that include competing mortality (which has been handled differently in different papers), and apply our model and calibration assessment methods to data from a large cohort of women attending mammographic screening in the State of Washington, USA.

2. Breast Cancer Risk Assessment

2.1. Background

To provide an assessment of the risk of disease occurring within a person’s residual lifetime it is important to take into account competing risks that could lead to death from other causes. In a classical ‘latent lifetimes’ framework [20] each individual is subject to m potential causes of death with times Tj (j = 1, ... , m) and we observe their actual time of death T˜=min(T1,,Tm) and the specific cause J = 1, ... , m. This framework assumes that each death is attributable to a single cause, or a defined group of causes. We extend this approach to also include the incidence of breast cancer, which is our principle interest, and do not consider death after breast cancer occurs.

Multiple modes of failure are conveniently characterised using functions for the rate at which each cause J occurs at each follow-up time, given that the person has not yet died, or experienced a specific event of interest. More precisely, this cause-specific hazard for j = 1, ... , m is defined as

hj(t)=limΔ0P(Tj<t+Δ|T˜t)Δ. (1)

Equation (1) is estimable due to conditioning on (T˜t). Another measure of risk that is sometimes considered is the proportion of the population who have event J when followed up to time t. This cumulative incidence function is defined by

P(J=j,T˜t)=0thj(u)S(u)du, (2)

where

S(t)=P(T˜>t)=exp{0tk=1mhk(u)du}

is the marginal (all-cause) survivor function for T˜. When there is just one cause (death, say) then (2) is the cumulative distribution function (for time of death).

Breast cancer risk models within this framework typically consider two causes (m = 2): T1, the time to diagnosis of breast cancer and T2, the time to death from other causes (excluding breast cancer mortality, which can’t occur before the time of diagnosis), so that T˜=min(T1,T2). Risk assessment for a women usually takes place when current age t0 > 20 years, and the aim is to assess the absolute risk of breast cancer between age t0 to age t, conditional on q risk factors x = (x1, ... , xq). This may be done by extending (2) to condition on risk factors x and age at risk assessment t0 through

P(J=1,T˜t|x,T˜>t0)=t0th1(u|x)exp[t0u{h1(υ|x)+h2(υ|x)}dυ]duPx(t0,t), (3)

where h1(t | x) is the conditional hazard of breast cancer at age t (cf. equation 1) and h2(t | x) is the conditional hazard for competing mortality. In practice, and following [32], competing mortality has often been taken to depend only on age and has been calibrated using national mortality statistics excluding breast cancer. This is reasonable for breast cancer, as it is not strongly linked other causes of death, but is more problematic, for example, for lung cancer, where tobacco smoking is a major factor for it and other causes of death such as cardiovascular disease.

Breast cancer specific hazards have been estimated in two ways:

1. Regression models. Risk is frequently calculated by combination of a regression model derived from case-control or cohort studies of specific relative risks combined with absolute population-based incidence rates from cancer registries. An early and widely used example of this is the Gail model [32], which has been further developed in the Breast Cancer Surveillance Consortium (BCSC) model [16, 61, 46].

2. Estimation from family pedigree data of the probability of carrying one or more high-risk mutations using segregation analysis, and then using the penetrance of a mutation to alter age-specific risk [18]. Two examples are BRCAPRO and BOADICEA [48, 4].

A novel aspect of the Tyrer–Cuzick model is that it employs both methods [63]. The segregation model estimates the hazard function hG(t | x1) at age t due to genetic factors, conditional on information (denoted x1) about a woman’s family history (family tree) of breast and/or ovarian cancer, and results of any tests in the family from known highly penetrant breast cancer genes BRCA1 and BRCA2. This is then combined with a relative hazard regression function r(x2) > 0 based on other risk factors (x2) through

h1(t|x1,x2)=hG(t|x1)r(x2). (4)

We divide family history into two factors x1={x1(a),x1(b)} corresponding to (a) breast and (b) ovarian cancer. The vector x1() has components for event time, censoring information and family relationships, and x2 is based on the other risk factors shown in Table 1. Risk models that only account for family history based on a segregation model do not use x2. Regression-only models developed are often proportional-hazards models of the same form as (4), but with age-specific population rates instead of hG(·); partial information on family history of breast cancer can be included in the regression function, for example, by including a covariate for the number of affected first-degree relatives.

Table 1. Summary of risk factor parameters in model.

Category Hazard ratio Mean risk Reference category
(a) Classic risk factors
   Menopause age (y) per 5y      1.14      1.08 45–49y
   Menarche age (y) <11      1.16      1 13y
11      1.07
12      1.07
13 1
14      0.98
15      0.93
16      0.88
17 or older      0.81
   Height (m) <1.6 1      1.1 <1.6 m
1.6–1.7 1.05 + 2 * (height – 1.6)
1.7 or taller      1.24
   Body mass index
(kg/m2)
(post-menopausal only)
<21 1      1.24 <21
21 to <23      1.14
23 to <25      1.15
25 to <27      1.26
27 or more      1.32
   Age at 1st childbirth (y) Nulliparous 1      1 Nulliparous
<17–19      0.74
20–24      0.77
25–29      0.87
30–34      1.01
35+      1.11
   Menopausal hormone therapy Not current 1      1 Not current
Estrogen-only (current)    1.4
Combined (current) 2
   Benign disease Nonproliferative/none 1      1 None
Proliferative (usual type) 2
Atypical hyperplasia 4
Lobular carcinoma in situ 8
(b) New in v8
   Breast density residual (age 40y+) Visual asssessment scale 1.4 per SD      1 Average density
(age and BMI adjusted)
BI-RADS density 1.4 per SD
Volumetric percentage 1.4 per SD
SNPs Continuous Input      1 Average woman

SD, standard deviation; SNP, single nucleotide polymorphism risk score.

2.2. Tyrer–Cuzick Segregation Model

2.2.1. Statistical model

The genetic risk of breast cancer is modelled through estimates of mutations in BRCA1 and BRCA2 genes and an unknown dominant gene. Some of the genes which make up the unknown component have been discovered [28], but most have not, and are only inferred. Even the known ones are rare and individually do not make a major contribution, and are not so commonly tested for even in women with a family history. We use the variable c1 for BRCA1/2 status (0 if not a BRCA1/2 carrier, 1 if a BRCA1 carrier, and 2 if BRCA2 carrier; joint carriers are extremely rare and are modelled as BRCA1 carriers). The unknown dominant gene is denoted c2 (0, 1 if respectively, not a carrier or a carrier). The model makes the following assumptions.

A.i Breast x1(a) and ovarian cancer x1(b) family histories are conditionally independent given c1 and c2:

p(x1|c1,c2)=p{x1(a)|c1,c2}p{x1(b)|c1,c2},

where p(·) is general notation to denote a probability density or mass function.

A.ii c1 and c2 are independent: {p(c1, c2) = p(c1) × p(c2)}.

A.iii The unknown gene c2 is not associated with ovarian cancer [p{x1(b)|c1,c2}=p{x1(b)|c1}].

A.iv The penetrance SG(t | c1), that is, the probability of developing disease by age t, is known for breast and ovarian cancer in BRCA1/2 carriers. The prevalence p(c1) of BRCA1, BRCA2 and the unknown gene are known in the population.

A.v A proportional-hazards model is assumed for the incidence associated with the unknown gene using two parameters θ = (β, γ) through

SG(t|c1,c2;θ)=S0(t|c1)exp(γc2), (5)

where γ quantifies the log relative hazard attributable to the unknown gene, β is its prevalence (the per-allele proportion is 11β), and S0(t | c1) is a baseline survivor function. From assumption (A.iv) and equation (5) the baseline survivor function is obtained by solving

SG(t|c1;θ)=(1β)S0(t|c1)+βS0(t|c1)exp(γc2), (6)

for example, via a Newton-Raphson iteration [63].

A.vi Mendelian inheritance, that is, there is a 50% chance that one of the two alleles from each parent is inherited by their offspring.

From these assumptions the breast cancer risk conditional on x1 is obtained as

SG(t|x1;θ)=c1=02c2=01SG(t|c1,c2;θ)×p(c1,c2|x1;θ), (7)

where the weights p(c1, c2 | x1; θ) in (7) are obtained from the following application of Bayes’ rule. Denoting d = (c1,c2) then

p(d|x1;θ)=p(x1|d;θ)p(d;θ)p(x1;θ). (8)

p(d; θ) uses assumptions given in (A.ii, A.iv, A.v), and p(x1 | d; θ) is obtained following A.vi (described in more detail in [63]), first separately for breast x1(a) and ovarian cancer x1(b) family histories and then they are combined through (A.i). When gene testing has been done for BRCA1/2 the values for c1 are taken from this, and the familial contribution only applied to the unknown gene c2. Most of the computer code in the Tyrer–Cuzick algorithm is involved in calculating the likelihood p(x1 | d; θ).

2.2.2. Parameter estimates

Some changes to the segregation model have been made since the original model [63], starting from version 7 onwards. BRCA1/2 prevalence and penetrance are now from the 1950+ birth cohort estimates reported in [4]. The penetrance estimate is now lower than used in original model [63] (previously taken from [31]), which was attributed in [4] to bias in the original estimate [31] resulting from a focus on families with multiple cases of breast cancer. Prevalence of BRCA1 is taken to be 0.06% and for BRCA2 is 0.10% (previously 0.11% and 0.12%, respectively). Smoothed first breast cancer rates from the Thames Registry 2005–2009 were used to calibrate overall risk, based on data reported by 5y age group and smoothed by loess (with the smoother ‘span’ parameter 0.2 chosen by eye). The unknown gene calibration has not been altered since the model was first introduced (β = 11.4% and exp(γ) = 13.04, fitted to data from [3]; see [63]). The impact of the unknown gene depends mostly on the product β exp(γ) and it is difficult to accurately estimate these two parameters separately. The best fit gave a very large value for exp(γ) but almost equally good fits would arise if γ was substantially reduced and β increased accordingly. Attributing risk to a single gene is a simplification, as risk is likely to reflect a combination of genes. However it is clear that the contribution of this unknown gene is greater than that for BRCA1 or BRCA2, any other known gene or combination of single nucleotide polymorphisms, and treating this either as a single unknown gene or a polygenic risk score has minimal impact on its predictive value.

2.3. Regression Model

2.3.1. Model

The relative hazard r(x2) in model (4) is normalised by the mean risk in the population. This requires information on the relative hazard associated with other nonfamilial (personal, hormonal and lifestyle) risk factors {denote this function ϕ(x2) > 0} and their population prevalence {f (x2) where f (x2) dx2 = 1} in order to obtain the normalised relative hazard

r(x2)=ϕ(x2)ϕ(x2)f(x2)dx2. (9)

In the Tyrer–Cuzick model this is approximated by treating each risk factor as independent (breast cancer risk factors in the model are largely independent of each other) and taking

r(x2)=j=1q{ϕ(x2j)ϕ(x2j)f(x2j)dx2j}. (10)

When the value of a risk factor is unknown it can be left blank and the normalised relative hazard associated with that factor is taken to be 1 (woman assumed to have population risk). The mean risk constants used in the model are shown in Table 1.

2.3.2. Parameters

The relative hazards and prevalence of risk factors in the original model were reported in [63], and included in Table 1. There have been changes and the current values are described below.

Atypical hyperplasia or LCIS The original model treated atypical hyperplasia or lobular carcinoma insitu (LCIS) as an independent risk factor. Evidence in [25] and [9] indicated that other risk factors should not be included when a women has been diagnosed with atypical hyperplasia as it is more of an intermediate endpoint. As a result, we modified the model to take the maximum risk of atypical hyperplasia (only), LCIS (only) and family history risk combined with other risk factors (based on the 10y risk). For example, a young BRCA1/2 carrier with atypical hyperplasia would not have her risk assessment modified by atypical hyperplasia. This is likely to be conservative, but currently there is inadequate data to model the joint effects of atypia and other risk factors accurately.

Unknown benign disease In general, the risk of subsequent breast cancer associated with a benign lesion depends on the histology, with no increased risk associated with nonproliferative lesions, about a two-fold risk associated with hyperplasia of the usual type and about a 4-fold risk associated with atypical hyperplasia [27]. Based on prevalence data reported by [27], the relative hazard from benign disease when a women has had a biopsy but pathology is unknown is taken to be 1.3; this option was not available originally. Nonbiopsied lesions are not considered in the model.

Menopause hormone therapy The model assumes a maximum relative hazard of 2.0 for combined therapy with an oestrogen and progestin and 1.4 for oestrogen-only hormone therapy. The relative hazard begins at unity in year 1, is half the maximum excess relative hazard in year 2 and then the maximum relative hazard therafter until stopping therapy. Risk is then ramped down following cessation to be 2/3 of the maximum excess in the first year after stopping, then one third in the next year, and no increase thereafter. The relative risk is adjusted if BMI (body mass index) is known, being decreased by 10% of the average excess risk if obese (BMI ≥ 30 kg/m2), and increased by 10% if BMI <25 kg/m2 (not overweight or obese). These assumptions are based on results from the Women’s Health Initiative trial [17]) and the Million Women Study [51].

2.4. New Additions to the Model

2.4.1. Mammographic density

Breast density is correlated with some of the other risk factors in the model, most strongly age and body mass index. It is also measured in different ways that require different calibration. We allow three different ways to input density: a visual assessment in which (ideally) two readers estimate the percentage of the breast covered by dense (opaque) tissue and the average is used, the BI-RADS system which corresponds to four categories, and an automated volumetric system (Volpara) that estimates the percentage volume of dense tissue based on the radiographic intensity of each pixel. In order to incorporate breast density into the model without changing the effect sizes of the other factors, we first developed a measure of breast density which is independent from age and BMI at mammogram, by taking the difference between observed and expected density given age and BMI. This was developed for the different measures in case-control studies, where expected density was modelled in controls by fitting a generalized additive model [70] of different breast density measures against splines for age and BMI (see Figure 1 for a fit to volumetric percent density). The risk associated with density was then calibrated by estimating the effect of the density residual, adjusted for other factors in the Tyrer–Cuzick model using case-control and cohort studies [68, 15, 14].

Fig. 1.

Fig. 1

Volumetric percentage density vs age and body mass index (BMI), with joint nonparametric smooths (line —). Taken from [12].

2.4.2. Polygenic risk scores

Numerous studies have demonstrated the utility of polygenic risk scores. These use common gene variants which individually carry small risks but as a group provide useful information. Their utility has been observed in average risk women [66, 46] and in high-risk women [22, 30], and they combine effectively independently with classical risk factors and breast density [64, 65]. Currently, the breast cancer association consortium has validated more than 170 risk-modifying SNPs at genome-wide significance [44], and this is likely to increase still further.

To provide an overall relative risk estimate from a collection of independent SNPs polygenic risk scores may be obtained by using published data on their per-allele risks, and allele frequencies. First, the odds ratio for each of the three SNP genotypes is determined (no risk alleles, 1 risk allele, and 2 risk alleles) and these are normalised to obtain an average of unity using reported risk allele frequencies. In our model, we assume the odds of two copies of the abnormal gene is the square of that for one copy, and their distribution follows the Hardy–Weinberg law. Finally, to obtain an overall polygenic risk score the odds ratios for each of the genotypes are multiplied together assuming independence [46].

Data show very small correlations between polygenic risk scores and classical risk factors. We have observed that an accurate fit is obtained by treating the score as independent of the other factors in the Tyrer–Cuzick model [22, 30, 65]. However, theoretically, a polygenic risk score will explain some of the genetic aggregation modelled through the segregation model, and so this should be down weighted in the risk assessment. Work to refine this component is ongoing.

3. Comparison with Other Risk Models

It is important to understand differences between risk models, both for use of the models and to refine the scientific process of improving risk assessment. Here we note some differences between the Tyrer–Cuzick and other models.

3.1. Segregation Models

Some segregation models, including BRCAPRO [48], calculate breast cancer risk only based on estimating the likelihood of carrying major genes based on a personal and family history of breast, ovarian or other cancers related to these genes. The main aim of such models is to estimate risk of being a BRCA1/2 mutation carrier. They are mostly used for women with a strong family history in order to determine eligibility for BRCA1/2 testing. They incorporate both pedigree information including age at onset and results of genetic testing for BRCA1 and BRCA2 in the women and their relatives. Such models are less suitable for risk assessment of breast cancer in the wider population, partly because they do not allow for other unknown genetic factors in the model.

The BOADICEA model accounts for other rare but highly penetrant genes (PALB2, CHEK2, ATM), and uses a polygenic model to allow for (unobserved) risk due to other genetic factors than those measured [42]. Define c1=0 if not positive for one of the genes, c1=1 if test positive for BRCA1, c1=2 if test positive for BRCA2 (but not BRCA1), c1=3 if test positive for PALB2 (but not BRCA1/2) and so on for next CHEK2 and last ATM. Thus, very rare individuals who test positive for more than one gene are assigned a group c1=0,,5 based on a hierarchy. The model has the following form for the breast cancer specific hazard:

h1{t|c1,u(t)}=h1B(t)exp{j=15βj(t)I(c1=j)+u(t)},

where t is age, h1B(t) is a baseline hazard function, I(·) the indicator function, βj(t) are time-dependent relative hazards for each genotype, and u(t) is a time-dependent random effect from a normal distribution with mean zero and standard deviation σt. When the woman has not been personally tested, the probability of being a carrier is estimated based on the pedigree and testing results (if any) in family members. The random (polygenic) effects arise from a combined effect of several unmeasured and largely unknown genes, each of which contributes a small effect. Large scale genome-wide association studies have identified that the polygenic model for genetic susceptibility is partly correct, but a large proportion of familial aggregation of risk is still not explained by known major genes or polygenic risk [44].

For most pedigrees arising in the general population one might expect only small differences for risk estimates from polygenic vs. single unknown gene models. Qualitatively, the unknown single dominant gene model is a good approximation to the polygenic model: the genotype is never observed and so gives rise to a range of effects through equations (7) and (8). Indeed, one study that compared the fit between the unknown dominant gene and polygenic models did not show an appreciable difference between them when considering additional genetic factors other than BRCA1/2 [5].

Another difference between BOADICEA and Tyrer–Cuzick is that the hazard for the unknown genetic effect γ is assumed to not vary with age in the Tyrer–Cuzick model, whereas the polygenic variance varies with age in the BOADICEA model. Thus, the BOADICEA model leads to a larger decline in the relative risk as the age of the affected relative increases and appears to fit better with the literature. We will update this aspect of the model in the next version.

3.2. Regression Function Models

Several models have been developed based on a regression function only. These include the Gail (or BCRAT) [32], the Breast Cancer Surveillance Consortium (BCSC) [61, 62] and Rosner–Colditz models [52, 72]. Neither the Gail nor BCSC model include all the risk factors included in the Tyrer–Cuzick model, but they do account for ethnic differences. The Rosner–Colditz model includes the terms for the same broad risk factors as in the Tyrer–Cuzick model, and some additional factors including alcohol consumption, adolescent body somatotype (shape/size) and hormone levels. Assumptions on the risks and prevalence of risk factors differ between the models, as do the use of interaction terms.

The Tyrer–Cuzick model includes an interaction between menopausal status and BMI, assuming it is only a postmenopausal risk factor. Some data suggest that BMI may actually be a protective factor for premenopausal breast cancer but this is not included in the model [58]. Interactions are also included for BMI and HRT (as discussed above), and atypical hyperplasia (or LCIS) and all other risk factors in that it is treated as an intermediate endpoint, and the maximum risk for it alone or all other factors combined is used, following analysis from [9]. No other interaction terms are currently included.

The Gail model includes an interaction between (i) ethnicity and all other risk factors, and for certain ethnic groups an interaction between (ii) age and number of biopsies, and (iii) age at first child and number of affected first-degree relatives (negative interaction). For example, both latter interactions are included for white women, but they are not used for Asian women.

The BCSC model includes interactions between a woman’s age with (i) ethnicity, (ii) benign disease, (iii) family history (negative interaction) and (iv) breast density (negative interaction), by allowing for different regression coefficients in 10y intervals starting at age 40y [62].

The Rosner–Colditz model allows for interactions between: (i) BMI and duration of premenopausal period (time from menarche to start of menopause) (ii) BMI and current duration of menopausal interval, (ii) alcohol after menopause and hormone replacement therapy, (iii) benign disease with age at menarche, (iv) benign disease with duration of premenopause, (v) benign disease with duration of menopause, (vi) height with duration of premenopause (vii) height with duration of postmenopause.

All the models (except Tyrer–Cuzick) include interactions relating to benign disease, but their forms are quite different. In the Gail model, for white women the relative risk of previous biopsies is less in women older than 50y than for younger women. In the BCSC cohort used to fit their model [62] this pattern was not observed for proliferative disease, and for nonproliferative disease the direction was even reversed (a larger effect for women older than 50y). In the Rosner–Colditz model, there is a positive interaction with age at menarche and a negative interaction for the other terms [52], suggesting an older age at menarche is not protective for women with benign disease. These three models have been fitted in populations with individual-level data. Relative hazards for white women in the Gail model were estimated using a case-control study with 2842 cases and 3146 controls, including 179 cases with a prior biopsy aged <50y and 487 cases with a prior biopsy aged 50y+ [32]. The BCSC model was fitted to a cohort with more than 1 million women with up to 10y follow up, and included 6204 women with proliferative disease without atypia and 177 subsequent cancers [62]. A recent Rosner–Colditz model update included approximately 100,000 women of whom 5246 had developed breast cancer. Benign breast disease (nonproliferative and proliferative) was reported in approximately 18,000 women at entry [52].

Potential interactions between benign disease and other risk factors have also been investigated in other studies, but also with inconclusive findings. For example, in a cohort from Nashville a negative interaction between nonproliferative disease and age was observed, such that women diagnosed with nonproliferative disease at an older age had lower breast cancer risks than women in the cohort without proliferative disease [27]; this has not been evaluated in any of the four models considered here.

Overall, due to the current lack of strong evidence to support an interaction of benign disease with age, no such interaction is included in the Tyrer–Cuzick model, and it assumes nonproliferative benign disease does not confer any increased risk.

4. Absolute Cumulative Risk Assessment and Calibration

4.1. Framework

It is important to be aware of the difference between using a model to project risks and calibrating a model from a cohort where follow-up data already exists. In the former case routine data only provides risk assuming no intercurrent mortality, and a specific adjustment needs to be made for competing mortality (cf. equation (3)). In the latter case, inter-current deaths are known and can be treated as part of the censoring process. This has led to different methods for computing absolute cumulative risk, which are essential for interpreting the model predictions, and subsequently to assess how well the model is calibrated to the population under study. This has not been widely appreciated, and can lead to different estimated values of expected absolute risk from the same disease model. In this section, we review some different methods for computing expected cumulative risk under the following setup.

Assume there is a sample of i = 1, ... , n independent individuals with data on risk factors. The risk model evaluates each risk from current age t0 to age t, defined as Pxi(t0i, t) or Pi(t) using shorthand notation following equation (3). The time to breast cancer or death is subjected to right-censoring, so that one observes T=min(T1,T2,TC), where T1 is time to disease, T2 is the time to death from other causes and TC is a right-censoring time.

An issue to contend with for computing the expected probability of breast cancer is that the ‘at-risk’ interval should be from t0 to TC, whereas T is often used. TC is not observed when min(T1, T2) < TC and this presents difficulties as its (conditional) distribution can be hard to determine. Calculating the expected risk using probability of breast cancer over (t0, T) is liable to underestimate expected risk. A solution is to use lifetable methods which are based on the sub-hazard of breast cancer and not the probability of breast cancer, but only need to be computed over (t0, T). We next consider these different methods in more detail.

4.2. Expected Risk Based on the Cumulative Disease-Specific Hazard Function

The expected number of breast cancer cases in the cohort may be obtained by first integrating breast cancer specific hazards over the observed follow-up period, to obtain each individual’s cumulative hazard for breast cancer during the period she is at risk. Formally, let Yi(t)=I(Ti>t) denote the at-risk process for individual i = 1, ... , n, and h1(t | xi) = h1i(t) the disease-specific hazard function sub-model. Then the expected number of individuals with disease (J = 1) is

E(H)=i=1nt0ih1i(u)Yi(u)du=i=1nH1i(Ti), (11)

where

H1i(t)=t0ith1i(u)du.

Equation (11) may be adapted to compute the expected number of deaths from other causes by replacing h1 with h2, etc. Indeed, by letting h2 reflect all-cause mortality (including breast cancers) this is equivalent to the time honoured lifetable method to compute the expected number of deaths [40]. Thus, while it might seem surprising to base the expected number of women diagnosed with breast cancer on the disease-specific cumulative hazard rather than the smaller estimate of the probability of breast cancer, this method is firmly established in lifetable and other settings (see, e.g., the log-rank test).

4.3. Expected Risk Based on Cumulative Incidence

The previous section is the preferred method (see below) but another approach is to calculate expected risk using the model probability of disease allowing for death from other causes (cumulative incidence). This is most straightforward when the model is used in a cohort with no censoring from starting age t0 up to age t. In this case, the expected number is a summation of the conditional cumulative incidence in (3):

E(P)(t)=i=1nPi(t), (12)

where Pi(t) ≡ Px(t0, t) for the ith individual. It is extremely uncommon to have such a dataset, and the calculation is more complicated under the most common scenario of right censoring because we need an estimate of P{J = 1, min(T1, T2, TC) ≤ t | xi}, whereas Pi(t) estimates P{J = 1, min(T1, T2) ≤ t | xi} (see equation (3)). We consider three types of censoring processes for prospective risk calculation.

First, if censoring is fixed to be to a common follow-up time for each patient ti = t0i + A, such as A = 5 years, then

E(P)(A)=i=1nPi(ti0+A). (13)

Second, when the censoring time TC is variable between individuals but it is deterministic, such as being due to variable calendar time enrolment and a common follow-up for all i then

E(P)=i=1nPi(ti0+TCi). (14)

A third case is when the potential censoring time is unknown but is a stochastic process with known distribution. Then one possibility is to base expected risk on

PC(t)=t0t0+th1(u|x)×exp[t0t0+u{h1(υ|x)+h2(υ|x)}dυ]SC(u|x)du, (15)

where SC is the survivor function for the censoring distribution. Then

E(P)=i=1nPCi(ti0+Ti). (16)

If SC is unknown then one will need to estimate it, for example by assuming censoring is independent of risk factors x and using the Kaplan–Meier estimator [41].

4.4. Calibration

The above methods based on cumulative disease-specific hazards or cumulative incidence may be used to assess calibration by comparison of the observed number of women with disease (O) with expected (E), both overall and in sub-groups [38]. If it is assumed that disease is rare (this assumption is relaxed in Section 4.7), then a test may be constructed assuming that O is generated from a Poisson distribution, whose rate would be E if the model was perfectly calibrated. Exact Poisson confidence intervals on O may be used to compute a confidence interval for O/E (treating E as fixed) and determine if it covers unity [33].

4.5. Some Biased Estimates of Absolute Risk and Their Effect on Calibration

Some commonly-used methods are biased, but this can be overcome using hazard-based methods where the appropriate ‘at-risk’ interval is used. We next consider three methods that yield biased estimates of E, the expected number of cancers, which in turn leads to biased estimates of the observed to expected ratio O/E.

1. Under right censoring some studies [19, 24, 7, 55] have used cumulative incidence as the measure of expected risk, but estimated it via

i=1nPi(Ti). (17)

This measure of expected risk is biased towards zero because it only computes risk until the event time T For example, suppose we wish to compute the expected number of deaths in a sample of babies over the next 200y. If the mortality model is well calibrated but risk of death until the age at which each person actually dies is summated (as equation (17)), then fewer deaths will be expected than the number of babies in the sample. The correct analysis based on cumulative incidence would summate risk of death by age 200y for each individual (cf. equation (12)). In mitigation, the bias will be small, and will have a minimal impact on the conclusions drawn when the event is rare.

2. In some studies it has been reported [2] that the expected number with disease is computed as

i=1n[1exp{H1i(Ti)}], (18)

that is, summating 1 minus the net survival (the estimand for Kaplan–Meier estimation) rather than (11). In general, this is biased towards zero for the expected number of breast cancers because the term in the summation is less than or equal to the cumulative hazard H1i(Ti). However, this is a small bias if the cumulative hazard H is small since 1 − exp(−H) = H + O(H2). Also note that 1exp{H1i(Ti)}Pi(Ti), so is less biased than summating Pi(Ti) (i.e., equation (17)).

3. Finally, calibration of breast cancer risk has sometimes been assessed using a fixed follow-up time for all by only including those who could be followed up at least that long [61, 54]. This in inefficient because some data would be excluded, and it should be used with caution when there is censoring. For example, suppose we seek to assess calibration of 5y risk when there is right censoring in the data, and include only those without breast cancer who were not censored by 5y and all breast cancer cases diagnosed up to 5y. If the model is correct (and censoring is noninformative), then O/E based on (12) will be biased. E will be too small relative to O because some of the noncases will have been excluded from expected risk due to censoring, but they should have contributed to E as they would have been counted as cases if disease had occurred in them.

4.6. An Unbiased Estimate of Expected Risk

To avoid bias, we recommend using E(H) to obtain the expected number with disease. This is based on the cumulative hazard rather than probability of disease (cumulative incidence) and has several advantages compared to E(P) when censoring has been handled appropriately as described in Section 4.3.

First, comparing E(H) with O is a test of whether the disease-specific hazard model is correct. The same method may also be used to test the competing-mortality model. Comparing E(P) with O is a test of whether the model for cumulative incidence based on combining the disease risk and competing mortality is correct, that is, whether the marginal risk based on (3) is well calibrated. This might require accurate knowledge and data on the competing risk process, which is often not available. It is uncommon for the competing mortality model to be as well developed as the disease risk model, which is of primary interest to the epidemiologist. If the competing mortality model is incorrect, but the disease model is correct, then the expected risk will be incorrect due to an inadequate model for competing mortality. Conversely, even if the cumulative incidence function risk is well calibrated, it is possible that the competing mortality and disease model are both wrong in different ways, and just happen to cancel each other out. Comparing the hazard sub-models separately would identify such issues.

Second, when there is stochastic censoring equation (11) is easier to apply than (16) because it does not require an assumption or estimate for the conditional censoring distribution.

Third, it can be a more powerful test. For example, in an extreme example of 200y follow-up on mortality in babies, then one would not be able to show that a model that assigns a constant and excessively high mortality rate was incorrect; whereas analysis based on summating the cumulative hazard of that model to the time of each death would yield an expected number of deaths in excess of observed.

4.7. Extensions

We end this section by describing methods to assess calibration not only for the total population, but also in subgroups. For example, when the data are split into ten groups by decile of the predicted risk at baseline then a test of calibration (more generally model fit) across the groups would be akin to the Hosmer-Lemeshow test for binary outcomes [36, 38]. To extend this strategy to the cumulative incidence framework, we construct k = 1, ... , K distinct groups, and assess whether the observed risk matches expected in each of the groups. Then, the test statistic

χK12=K=1K(OkEk)2Ek,

where the expected Ek are obtained as above, will be approximately χ2 with K − 1 degrees of freedom under the null hypothesis of correct prediction and calibration.

This hypothesis testing approach has limitations. First, power to reject inaccurate calibration is limited by the large number of degrees of freedom. Second, with a large enough sample size we would expect to reject model calibration because all models are wrong, and the approach does not directly indicate where the inaccuracies lie.

An alternative approach is to view testing calibration as special case of Poisson regression. Let Oi denote whether individual i = 1, ... , n in the sample has disease, and Ei be the expected risk based on the model. Then consider a Poisson regression for

E(Oi|Ei)=exp{γ0+γ1log(Ei)}=θEiγ1,

where θ = exp(γ0) and γ1 are unknown parameters. Setting γ1 = 1 then the maximum likelihood estimate θ^=iOi/iE provides an overall ‘calibration-in-the-large’ parameter for the entire cohort [21]. When γ1 is also estimated then it can be used to test for calibration across the range of expected risk (on 1 df), as well as yielding an estimate of how closely the regression line matches observed (γ1 = 1 being ideal; 0 being not at all).

Finally, we consider when the disease is relatively common, perhaps because focus is on a high risk group with long follow up. In this case, the Poisson distribution no longer approximates a binomial distribution. However, Poisson regression may be valid if time is broken into shorter segments where the rare disease assumption holds, for example by year, with each segment treated as a ‘new’ observation in the Poisson regression analysis. This approach is also useful when assessing time-dependent calibration, and which we proceed to consider next.

5. Time-Dependent Calibration

The focus of many studies has been on whether the cumulative number with disease over a single follow-up period matches the expected number. While important, it does not assess differences through time, such as nonproportionality of the observed to expected risk. In this section, we consider techniques to assess calibration across follow-up time, and develop new graphical methods.

5.1. Methods

One approach is simply to look at the observed and expected number of events Nj(t) for each cause j as a counting process. The observed number is

N^j(t)=TitI(Ji=j). (19)

The expected number, based on the cumulative hazard approach (as Section 4.2), is

Nj(t)=i=1n0tYi(u)hji(u)du=TitHji(Ti)+Ti>tHij(t), (20)

so that we have extended (11) to be a function of t. One may compare observed N^j(t) vs. expected from (20) through follow-up time t, such as is sometimes done to check proportional hazards [6, 34]. Cumulative pointwise (such as from entry to time t), or interval (such as number in a year) confidence intervals may be constructed using exact Poisson confidence intervals, or used to undertake hypothesis tests of calibration up to or near time t.

An issue with the comparisons based on N(t) is that they depend on the sample size and censoring distribution, so cannot be readily used to compare between studies. To circumvent this, one might consider the comparing the mean cumulative hazard among those at risk, that is, for t<maxi(Ti) comparing

H^j(t)=TitI(Ji=j)Y+(Ti) (21)

with its expected value

Hj(t)=i=1n0tYi(u)hji(u)Y+(u)du, (22)

where the total number at risk at time t is Y+(t)=i=1nYi(t). Note that equation (21) is the Nelson–Aalen estimator of the cumulative hazard [41]. Equation (22) is the expected cumulative hazard based on the breast cancer risk model and observed at-risk process. Time-dependent assessment of the observed to expected risk can be based on Ĥj(t) vs. Hj(t) across follow-up time t. Here we treat Hj(t) as fixed and use the usual confidence intervals for the Nelson–Aalan estimator Ĥj(t) at different times, from which the hypothesis of calibration at a particular time t can also be assessed.

Other functionals of Ĥj(t) and Hj(t) might also be used for assessment of calibration across time. Arguably the most common functional used in applications is the ‘net’ risk

S(t)=1exp{H(t)},

which is often estimated as 1 minus the Kaplan–Meier estimator [41]. There are two options for calculating the expected risk. The first is a mean from the entire cohort at baseline

Sj(A)(t)=n1i=1n[1exp{Hji(t)}]. (23)

The second is to use the mean hazard in those still at risk at time t through

Sj(B)(t)=1exp{Hj(t)} (24)

and (22).

A disadvantage of the cumulative hazard (22) and net risk (23), (24) is that interpretation of the component terms is not straightforward in relation to the observable risk in equation (2). However, if we weight (21) by the proportion at risk using the Kaplan–Meier estimator Ŝ(t) for all-cause survival (i.e., the survival endpoint is disease or death), then we recover an estimator of the cumulative incidence function through

F^j(t)=TitI(Ji=j)Y+(Ti)S^(Ti), (25)

and inference may be based on estimates of its variance [41]. Expected risk may be obtained from

Fj(A)(t)=n1i=1nPi(t), (26)

or the mean expected risk at each t over the n individuals.

5.2. Summary

Our recommendation is to focus on comparisons only involving the disease-specific hazard to assess O vs E through follow-up time t. For exploratory plots, we prefer use of the cumulative hazard comparison of (22) with (21). There are several reasons for these recommendations.

First, as for overall calibration, the method may be used to assess the disease risk and competing mortality sub-models separately. The cumulative hazard analysis extends quite naturally a total number of events comparison, but avoids dependence on the censoring distribution.

Second, focusing on the hazard leads to using the observed at-risk process to calculate expected risk, which appears to be a more robust method to assess model calibration than only using a risk assessment estimate on everyone at baseline. For instance, if those censored or lost to follow-up are more likely to be at higher predicted risk at entry, then estimates of their expected risk will differ, and so comparison of expected risk between methods based on using the at-risk-process or everyone at baseline is a way to assess whether the censoring process was associated with predicted risk. In general terms this issue has parallels with relative survival, where one might consider the Ederer-I approach (mean risk) or the Ederer-II approach (use the at-risk process) [56].

Third, a statistical advantage of the cumulative hazard compared to the cumulative incidence function is that it may have a smaller variance because it does not require an assumption or estimation of all-cause survival (equation (25)).

Finally, in the exploratory plots when the aim is to assess calibration of a risk model, then differences between the observed to expected cumulative hazard seem easier to interpret than the other methods. In contrast, the cumulative incidence function weighs the cumulative hazard by all-cause survival. The component terms have a natural interpretation, but the observed to expected comparison is perhaps less easy to interpret for assessing calibration than cumulative cause-specific hazards, because changes could be due to either lack of calibration in the disease or competing mortality models. Kaplan–Meier estimation transforms the cumulative hazard through 1 − exp(−H), so that differences between large cumulative hazards are downweighted. This makes changes in the calibration plot with time less easy to relate to calibration at the hazard level. For example, it is easier to assess whether the hazard function is constant through time by plotting the Nelson–Aalan estimate of the cumulative hazard, than by plotting 1 minus the Kaplan–Meier estimator.

6. Example

6.1. Data

We have previously reported an evaluation using the Tyrer–Cuzick model in a cohort from the State of Washington, USA [14]. Between 1996 and 2013, 132,139 women aged 40–73y completed a risk questionnaire and had a measure of breast density taken at entry. They were followed up beginning 6-months after the entry mammogram to the earliest of diagnosis of invasive breast cancer or censoring. Women were censored due to disenrollment (n = 62,331, 48.2%), end of follow-up (n = 48,317, 37.3%), age 75y (n = 15,827, 12.2%), death from other causes (n = 2328, 1.8%) or ductal carcinoma in situ (n = 637, 0.5%). Only aggregate data on competing risk causes were made available for analysis, so we were unable to apply methods that require individual-level data about the cause of censoring.

6.2. Calibration

A total of 2699 breast cancers were observed (O), and based on our preferred method (equation 11) 2757 were expected (E), yielding O/E 0.98 (95%CI 0.94–1.02). We next consider application of incorrect methods discussed in Section 4.5.

A first biased assessment is cumulative incidence over a 5y fixed-time horizon by using all the cases that were diagnosed in the 5y period and all noncases at risk for 5y or more. It we calculate the expected 5y risk in those who are at risk at 5y then E = 877 against an observed O = 1157 (O/E = 1.32). A second biased method is to summate expected cumulative incidence to the last follow-up for each individual (equation 17), yielding E = 2605 and O/E = 1.04 (95%CI 1.00–1.08). A third biased method is to follow equation (18) and summate the absolute net risks, which gives expected number E = 2679 (O/E = 1.00).

It also instructive to repeat the above methods in a high-risk group (> 8% ten-year net risk) where greater differences between the methods to calculate E will occur. The Tyrer–Cuzick model with mammographic density identified 4645 women to be in this group, and 273 breast cancers were subsequently diagnosed. Based on the preferred method (11) we find E = 349, so that O/E = 0.78 (95%CI 0.69–0.88), indicating that the model over predicted the high risk group. If instead (18) was used then E = 324 and O/E = 0.84 (95%CI 0.75–0.95). Based on (17) we find E = 310 so that O/E = 0.88 (95%CI 0.78–1.00), where the 95%CI covers unity. Thus, the correct analysis showed a lack of calibration that would not be seen so clearly with the biased methods.

This analysis shows the method to calculate expected risk can cause practically important differences in interpretation of risk model calibration, particularly for highrisk groups.

6.3. Time-Dependent Calibration Plots

Time-dependent calibration was assessed using the methods in Section 5. We present results from overall model calibration (Figure 2) and in the highest and lowest risk deciles (Supplementary Figures S1 and S2 [13]) following [14].

Fig. 2.

Fig. 2

Overall calibration of the breast cancer risk model. (a) Observed (95%CI) vs expected number of breast cancers diagnosed for each year of follow-up, (b) cumulative observed (95%CI) vs expected number of breast cancers diagnosed; (c) observed (Nelson–Aalan, 95%CI) vs expected cumulative hazards; (d) observed (Kaplan–Meier, 95%CI) vs expected (obtained via two methods) net risks; (e) Observed divided by Expected cumulative hazard (95%CI); (f) Observed divided by expected net risk (obtained via two methods) with 95%CI only for the expected risk based on baseline risk assessment.

Overall the model appeared well calibrated throughout follow-up time. As follow-up started at six months, the expected hazard in the first year was much lower than the second year of followup. Figure 2(a) shows the model did not adequately take into account the effect of removal of a pool of cancers diagnosed at entry. However, subsequently the model tracked the observed number quite closely. Figures 2(c), (d) show respectively, the Nelson–Aalan and Kaplan–Meier curves. The corresponding observed to expected plots in Figures 2(e), (f) are virtually identical using the expected risk based on those still at risk. There is also very little difference between the two methods to obtain expected risk using the Kaplan–Meier approach, which indirectly confirms that censoring was unrelated to risk assessment.

The same plots were used to assess high and low risk groups. Supplementary Figure S1 considers women who were in the highest predicted risk decile at entry. As noted in Section 6.2, there is some evidence of over-estimation from the model for this group. However, the observed to expected plots in Supplementary Figure S1(e), (f) show a fairly constant pattern across time, with no discernible difference between Nelson–Aalan or Kaplan–Meier comparisons. Supplementary Figure S2 shows calibration of the bottom decile, where calibration appears reasonable through follow up time, albeit with a suggestion that the model had a tendency to under estimate risks.

6.4. Regression Analysis

The above analysis suggests that the risk model was calibrated overall in the cohort, but tended to overestimate risk in those predicted at highest risk; it over-estimated risk in the first year due to omitting screen-detected cases, but the risk predictions thereafter were stable with greater follow-up time. All these aspects may be jointly tested through a Poisson regression model. The model was fitted with an exponential link function. The log hazard per year (or until the event or censoring if in that year) was included as an offset term, and calibration coefficients were estimated for (i) overall lack of calibration in the first year, (ii) stability of calibration from year two onward (follow-up year), and (iii) 10y risk categories from the model as previously defined (<2%, 2 to <3%, 3 to < 5%, 5 to < 8%, ≥ 8%; with 2–3% group as the reference category) [14].

Results are shown in Table 2, taking the exponent of the estimated model parameter to give a calibration coefficient where unity indicates ideal calibration. They confirm the earlier analysis. The risk model over estimated risk in the first year by a factor of two (calibration coefficient 0.50, 95%CI 0.40–0.63), which is due to the expected short-term reduction in risk following a negative screen. There was no loss in calibration subsequently (calibration 1.00, 95%CI 0.99–1.01). The highest risk category (10y risk ≥ 8%) exhibited evidence of over-estimation of risk relative to the reference average-risk category (calibration overall 0.80, 95%CI 0.69–0.92, i.e., including the intercept), and likewise there was some under-estimation in the lowest-risk group (overall test for calibration across the five groups likelihood-ratio χ2 = 48.6, df = 4).

Table 2. Calibration assessment: univariate and Poisson regression (adjusted for variables listed) calibration coefficient estimates with 95% Wald confidence intervals.

Term n O E O/E [univariate] O/E (95%CI) [adjusted]
Overall (intercept) 132,139 2699 2757 0.98       1.05 (0.94–1.16)
Follow-up time
Year 1 132,139     87   178 0.49       0.50 (0.40–0.63)
Year 2+ (time) 123,830 2612 2579 1.01       1.00 (0.99–1.01)
10y Risk group
<2%   53,436   641   548 1.12       1.17 (1.00–1.25)
2 to <3%   33,269   627   603 1.04       1
3 to <5%   29,477   779   784 0.99       0.96 (0.86–1.07)
5 to <8%   11,312   379   473 0.80       0.78 (0.68–0.88)
8%+      4645   273   349 0.78       0.76 (0.66–0.88)

7. Conclusion

In this paper, we reviewed the statistical foundations of a model to evaluate breast cancer risk at different follow-up times. This model, and corresponding computer program, synthesise findings from many scientific studies by combining risk factors into a hybrid genetic segregation and regression model. The model has been used to guide entry criteria into prevention trials [23], and is recommended by several organisations to guide preventive and early detection strategies, including the American Cancer Society to help determine eligibility for supplemental breast magnetic resonance imaging [57].

We developed new graphical methods for looking at calibration over follow-up time that can be combined with Poisson regression analysis. The impact of using the different methods to assess the calibration of cumulative absolute risk was reviewed. Different methods to account for competing risks when assessing risk model calibration have been used. We note that, given the diversity of methods in use and potential impact on findings, it is important when presenting calibration findings that a detailed description of the method for expected risk is also given [1].

There are opportunities to refine and improve the current risk model. Other factors are known to affect the risk of breast cancer, but most have some difficulties for inclusion in a relatively simple model. Using weight change from age 20 or weight at a young age in addition to current weight does appear to improve risk assessment to some extent [37], but requires accurate recall of previous weight. Alcohol consumption is a well documented risk factor [35], but underestimation of consumption is well known and it is not clear how best to allow for this. The differing roles of weight in the pre- and postmenopausal periods also needs further investigation [58]. Physical activity and dietary factors are also known to affect risk [50, 71], but identification of the key factors and simple accurate measures of these currently present difficulties. Levels of oestrogens and testosterone may also be important especially for postmenopausal women, but these require a blood sample and are not routinely available [72]. Ethnic differences in breast cancer risk are also well documented—notably in the US lower risks in Hispanics and Asians [8]. However, it is unclear as to how these are related to differences in the known risk factors in the model, so would not need model recalibration, or whether there are intrinsic ethnic differences which require use of a separate baseline hazard function, and possibly separate risks for the known factors. Differences between recent migrants and established individuals are also known to exist and complicate any model adjustments [39]. Adding breast density has substantially improved the model, and might be extended by considering longitudinal measurements of density and features in the mammogram other than simply breast density, where more complex algorithms may be able to extract more risk information [67]. The polygenic (SNP) risk score is steadily being improved as more SNPs are added, but this is easily accommodated as all that is required is the relative risk. A bigger challenge is to accurately account for interactions between risk factors. Our model was developed by synthesizing risks for individual factors from large overviews of many studies, and large datasets which contain all the factors will be needed to fully explore interactions. These are not currently available but are likely to be so soon, as the risk factors used in our model are currently being collected in several large cohorts of patients.

In conclusion, we are gradually moving towards an era of precision medicine, where disease treatment, early detection and prevention strategies will depend on risk assessment. The accuracy of risk models will underpin this approach.

Supplementary Material

Supplement to “Risk Models for Breast Cancer and Their Validation” (DOI: 10.1214/19-STS729SUPP; .pdf). Supplementary information.

Supplementary figures

Acknowledgements

This study was supported by grant C569/A16891 from Cancer Research UK. We thank Diana Buist and Erin Bowles for substantial contributions to acquisition and interpretation of data. Our use in this paper of data from Kaiser Permanente Washington Health Research Institute was supported by research specialist award R50CA211115 from the National Cancer Institute (NCI) (Ms Bowles); grants HHSN261201100031C and P01CA154292 from the NCI-funded Breast Cancer Surveillance Consortium (Dr Buist); and contracts N01-CN-005230, N01-CN-67009, N01-PC-35142, HHSN261201000029C, and HHSN261201300012I from the Cancer Surveillance System of the Fred Hutchinson Cancer Research Center, funded by the Surveillance, Epidemiology and End Results Program of the NCI with additional support from the Fred Hutchinson Cancer Research Center and the State of Washington.

Contributor Information

Adam R. Brentnall, Email: a.brentnall@qmul.ac.uk, Centre for Cancer Prevention, Wolfson Institute of Preventive Medicine, Queen Mary University of London, Charterhouse square, London, EC1M 6BQ.

Jack Cuzick, Email: j.cuzick@qmul.ac.uk, Centre for Cancer Prevention, Wolfson Institute of Preventive Medicine, Queen Mary University of London, Charterhouse square, London, EC1M 6BQ.

References

  • [1].Altman DG, McShane LM, Sauerbrei W, Taube SE. Reporting recommendations for tumor marker prognostic studies (REMARK): Explanation and elaboration. PLoS Med. 2012;9 doi: 10.1371/journal.pmed.1001216. e1001216+ [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [2].Amir E, Evans DG, Shenton A, Lalloo F, Moran A, Boggis C, Wilson M, Howell A. Evaluation of breast cancer risk assessment packages in the family history evaluation and screening programme. J Med Genet. 2003;40:807–814. doi: 10.1136/jmg.40.11.807. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [3].Anderson H, Bladstrom A, Olsson H, Moller TR. Familial breast and ovarian cancer: A Swedish population-based register study. Am J Epidemiol. 2000;152:1154–1163. doi: 10.1093/aje/152.12.1154. [DOI] [PubMed] [Google Scholar]
  • [4].Antoniou AC, Cunningham AP, Peto J, Evans DG, Lalloo F, Narod SA, Risch HA, Eyfjord JE, Hopper JL, et al. The BOADICEA model of genetic susceptibility to breast and ovarian cancers: Updates and extensions. Br J Cancer. 2008;98:1457–1466. doi: 10.1038/sj.bjc.6604305. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [5].Antoniou AC, Pharoah PDP, McMullan G, Day NE, Stratton MR, Peto J, Ponder BJ, Easton DF. A comprehensive model for familial breast cancer incorporating BRCA1, BRCA2 and other genes. Br J Cancer. 2002;86:76–83. doi: 10.1038/sj.bjc.6600008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [6].Arjas E. A graphical method for assessing goodness of fit in Cox’s proportional hazards model. J Amer Statist Assoc. 1988;83:204–212. [Google Scholar]
  • [7].Banegas M, Gail M, Lacroix A, Thompson B, Martinez M, Wactawski-Wende J, John E, Hubbell, Yasmeen S, et al. Evaluating breast cancer risk projections for hispanic women. Breast Cancer Res Treat. 2012;132:347–353. doi: 10.1007/s10549-011-1900-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [8].Banegas MP, John EM, Slattery ML, Gomez SL, Yu M, Lacroix AZ, Pee D, Chlebowski RT, Hines LM, et al. Projecting individualized absolute invasive breast cancer risk in US hispanic women. J Natl Cancer Inst. 2017;109 doi: 10.1093/jnci/djw215. djw215. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [9].Boughey JC, Hartmann LC, Anderson SS, Degnim AC, Vierkant RA, Reynolds CA, Frost MH, Pankratz VS. Evaluation of the Tyrer–Cuzick (international breast cancer intervention study) model for breast cancer risk prediction in women with atypical hyperplasia. J Clin Oncol. 2010;28:3591–3596. doi: 10.1200/JCO.2010.28.0784. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [10].Boyd NF, Guo H, Martin LJ, Sun L, Stone J, Fishell E, Jong RA, Hislop G, Chiarelli A, et al. Mammographic density and the risk and detection of breast cancer. N Engl J Med. 2007;356:227–236. doi: 10.1056/nejmoa062790. [DOI] [PubMed] [Google Scholar]
  • [11].Boyd NF, O’Sullivan B, Campbell JE, Fishell E, Simor I, Cooke G, Germanson T. Mammographic signs as risk factors for breast cancer. Br J Cancer. 1982;45:185–193. doi: 10.1038/bjc.1982.32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [12].Brentnall AR, Cohn WF, Knaus WA, Yaffe MJ, Cuzick J, Harvey JA. A case-control study to add volumetric or clinical mammographic density into the Tyrer–Cuzick breast cancer risk model. J Breast Imaging. 2019;1:99–106. doi: 10.1093/jbi/wbz006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [13].Brentnall AR, Cuzick J. Supplement to “Risk models for breast cancer and their validation.”. 2020 doi: 10.1214/19-STS729SUPP. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [14].Brentnall AR, Cuzick J, Buist DSM, Bowles EJA. Long-term Accuracy of Breast Cancer Risk Assessment Combining Classic Risk Factors and Breast Density. JAMA Oncology. 2018;4:e180174. doi: 10.1001/jamaoncol.2018.0174. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [15].Brentnall AR, Harkness EF, Astley SM, Donnelly LS, Stavrinos P, Sampson S, Fox L, Sergeant JC, Harvie MN, et al. Mammographic density adds accuracy to both the Tyrer–Cuzick and Gail breast cancer risk models in a prospective UK screening cohort. Breast Cancer Res. 2015;17:147. doi: 10.1186/s13058-015-0653-5. + [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [16].Chen J, Pee D, Ayyagari R, Graubard B, Schairer C, Byrne C, Benichou J, Gail MH. Projecting absolute invasive breast cancer risk in white women with a model that includes mammographic density. J Natl Cancer Inst. 2006;98:1215–1226. doi: 10.1093/jnci/djj332. [DOI] [PubMed] [Google Scholar]
  • [17].Chlebowski RT, Anderson GL. The influence of time from menopause and mammography on hormone therapy-related breast cancer risk assessment. J Natl Cancer Inst. 2011;103:284–285. doi: 10.1093/jnci/djq561. [DOI] [PubMed] [Google Scholar]
  • [18].Claus EB, Risch N, Thompson WD, Claus EB, Risch N, Thompson WD. The calculation of breast cancer risk for women with a first degree family history of ovarian cancer. Breast Cancer Res Treat. 1993;28:115–120. doi: 10.1007/bf00666424. [DOI] [PubMed] [Google Scholar]
  • [19].Costantino JP, Gail MH, Pee D, Anderson S, Redmond CK, Benichou J, Wieand HS. Validation studies for models projecting the risk of invasive and total breast cancer incidence. J Natl Cancer Inst. 1999;91:1541–1548. doi: 10.1093/jnci/91.18.1541. [DOI] [PubMed] [Google Scholar]
  • [20].Crowder MJ. Classical Competing Risks. 1st ed. CRC Press/CRC; Boca Raton, FL: 2001. [Google Scholar]
  • [21].Crowson CS, Atkinson EJ, Therneau TM. Assessing calibration of prognostic risk scores. Stat Methods Med Res. 2016;25:1692–1706. doi: 10.1177/0962280213497434. MR3541121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [22].Cuzick J, Brentnall AR, Segal C, Byers H, Reuter C, Detre S, Lopez-Knowles E, Sestak I, Howell A, et al. Impact of a panel of 88 single nucleotide polymorphisms on the risk of breast cancer in high-risk women: Results from two randomized tamoxifen prevention trials. J Clin Oncol. 2017;35:743–750. doi: 10.1200/JCO.2016.69.8944. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [23].Cuzick J, Sestak I, Cawthorn S, Hamed H, Holli K, Howell A, Forbes JF. Tamoxifen for prevention of breast cancer: Extended long-term follow-up of the IBIS-I breast cancer prevention trial. Lancet Oncol. 2015;16:67–75. doi: 10.1016/s1470-2045(14)71171-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [24].Decarli A, Calza S, Masala G, Specchia C, Palli D, Gail MH. Gail model for prediction of absolute risk of invasive breast cancer: Independent evaluation in the Florence-European prospective investigation into cancer and nutrition cohort. J Natl Cancer Inst. 2006;98:1686–1693. doi: 10.1093/jnci/djj463. [DOI] [PubMed] [Google Scholar]
  • [25].Degnim AC, Visscher DW, Berman HK, Frost MH, Sellers TA, Vierkant RA, Maloney SD, Pankratz VS, de Groen PC, et al. Stratification of breast cancer risk in women with atypia: A mayo cohort study. J Clin Oncol. 2007;25:2671–2677. doi: 10.1200/JCO.2006.09.0217. [DOI] [PubMed] [Google Scholar]
  • [26].De Stavola B. Statistical facts about cancers on which doctor Rigoni–Stern based his contribution to the surgeons’ subgroup of the IV Congress of the Italian scientists on 23 September 1842. Stat Med. 1987;6:881–884. doi: 10.1002/sim.4780060803. [DOI] [PubMed] [Google Scholar]
  • [27].Dupont WD, Degnim AC, Sanders ME, Simpson JF, Hartmann LC. The Breast. Elsevier; Amsterdam: 2018. Risk Factors for Breast Carcinoma in Women With Proliferative Breast Disease; pp. 264–271.e2. [DOI] [Google Scholar]
  • [28].Easton DF, Pharoah PD, Antoniou AC, Tischkowitz M, Tavtigian SV, Nathanson KL, Devilee P, Meindl A, Couch FJ, et al. Gene-panel sequencing and the prediction of breast-cancer risk. N Engl J Med. 2015;372:2243–2257. doi: 10.1056/NEJMsr1501341. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [29].Easton DF, Pharoah PDP, Antoniou AC, Tischkowitz M, Tavtigian SV, Nathanson KL, Devilee P, Meindl A, Couch FJ, et al. Gene-panel sequencing and the prediction of breast-cancer risk. N Engl J Med. 2015;372:2243–2257. doi: 10.1056/NEJMsr1501341. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [30].Evans DG, Brentnall A, Byers H, Harkness E, Stavrinos P, Howell A, Newman WG, Cuzick J. The impact of a panel of 18 SNPs on breast cancer risk in women attending a UK familial screening clinic: A case-control study. J Med Genet. 2017;54:111–113. doi: 10.1136/jmedgenet-2016-104125. [DOI] [PubMed] [Google Scholar]
  • [31].Ford D, Easton DF, Stratton M, Narod S, Goldgar D, Devilee P, Bishop DT, Weber B, Lenoir G, et al. Genetic heterogeneity and penetrance analysis of the BRCA1 and BRCA2 genes in breast cancer families. Am J Hum Genet. 1998;62:676–689. doi: 10.1086/301749. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [32].Gail MH, Brinton LA, Byar DP, Corle DK, Green SB, Schairer C, Mulvihill JJ. Projecting individualized probabilities of developing breast cancer for white females who are being examined annually. J Natl Cancer Inst. 1989;81:1879–1886. doi: 10.1093/jnci/81.24.1879. [DOI] [PubMed] [Google Scholar]
  • [33].Gail MH, Pfeiffer RM. On criteria for evaluating models of absolute risk. Biostatistics. 2005;6:227–239. doi: 10.1093/biostatistics/kxi005. [DOI] [PubMed] [Google Scholar]
  • [34].Grønnesby JK, Borgan Ø. A method for checking regression models in survival analysis based on the risk score. Lifetime Data Anal. 1996;2:315–328. doi: 10.1007/BF00127305. [DOI] [PubMed] [Google Scholar]
  • [35].Collaborative Group on Hormonal Factors in Breast Cancer. Alcohol, tobacco and breast cancer—collaborative reanalysis of individual data from 53 epidemiological studies, including 58 515 women with breast cancer and 95 067 women without the disease. Br J Cancer. 2002;87:1234–1245. doi: 10.1038/sj.bjc.6600596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [36].Harrell FE, Lee KL, Mark DB. Multivariable prognostic models: Issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med. 15:361–387. doi: 10.1002/(SICI)1097-0258(19960229)15:4&#x0003c;361::AID-SIM168&#x0003e;3.0.CO;2-4. [DOI] [PubMed] [Google Scholar]
  • [37].Hidayat K, Yang CM, Shi BM. Body fatness at a young age, body fatness gain and risk of breast cancer: Systematic review and meta-analysis of cohort studies. Obes Rev. 2018;19:254–268. doi: 10.1111/obr.12627. [DOI] [PubMed] [Google Scholar]
  • [38].Hosmer DW, Lemeshow S, Sturdivant RX. Applied Logistic Regression. Wiley; Hoboken, NJ, USA: 2013. [DOI] [Google Scholar]
  • [39].John EM. Migration history, acculturation, and breast cancer risk in hispanic women. Cancer Epidemiol Biomark Prev. 2005;14:2905–2913. doi: 10.1158/1055-9965.EPI-05-0483. [DOI] [PubMed] [Google Scholar]
  • [40].Keiding N. The method of expected number of deaths, 1786–1886–1986. Int Stat Rev. 1987;55:1–20. doi: 10.2307/1403267. MR0962938. [DOI] [PubMed] [Google Scholar]
  • [41].Lawless JF. Statistical Models and Methods for Lifetime Data. Wiley Series in Probability and Statistics. 2nd ed. Wiley-Interscience; Hoboken, NJ: 2003. MR1940115. [Google Scholar]
  • [42].Lee AJ, Cunningham AP, Tischkowitz M, Simard J, Pharoah PD, Easton DF, Antoniou AC. Incorporating truncating variants in PALB2, CHEK2, and ATM into the BOADICEA breast cancer risk model. Genet Med. 2016;18:1190–1198. doi: 10.1038/gim.2016.31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [43].Macmahon B, Cole P. Endocrinology and epidemiology of breast cancer. Cancer. 1969;24:1146–1150. doi: 10.1002/1097-0142(196912)24:6&#x0003c;1146::aid-cncr2820240612&#x0003e;3.0.co;2-0. [DOI] [PubMed] [Google Scholar]
  • [44].Mavaddat N, Michailidou K, Dennis J, Lush M, Fachal L, Lee A, Tyrer JP, Chen T-H, Wang Q, et al. Polygenic risk scores for prediction of breast cancer and breast cancer subtypes. Am J Hum Genet. 2019;104:21–34. doi: 10.1016/j.ajhg.2018.11.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [45].Mccormack VA, Dos Santos Silva I. Breast density and parenchymal patterns as markers of breast cancer risk: A meta-analysis. Cancer Epidemiol Biomark Prev. 2006;15:1159–1169. doi: 10.1158/1055-9965.epi-06-0034. [DOI] [PubMed] [Google Scholar]
  • [46].Mealiffe ME, Stokowski RP, Rhees BK, Prentice RL, Pettinger M, Hinds DA. Assessment of clinical validity of a breast cancer risk model combining genetic and clinical information. J Natl Cancer Inst. 2010;102:1618–1627. doi: 10.1093/jnci/djq388. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [47].Collaborative Group on Hormonal Factors in Breast Cancer. Menarche, menopause, and breast cancer risk: Individual participant meta-analysis, including 118–964 women with breast cancer from 117 epidemiological studies. Lancet Oncol. 2012;13:1141–1151. doi: 10.1016/s1470-2045(12)70425-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [48].Parmigiani G, Berry DA, Aguilar O. Determining carrier probabilities for breast cancer-susceptibility genes BRCA1 and BRCA2. Am J Hum Genet. 1998;62:145–158. doi: 10.1086/301670. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [49].Pike MC, Krailo MD, Henderson BE, Casagrande JT, Hoel DG. ‘Hormonal’ risk factors, ‘breast tissue age’ and the age-incidence of breast cancer. Nature. 1983;303:767–770. doi: 10.1038/303767a0. [DOI] [PubMed] [Google Scholar]
  • [50].Prentice RL, Caan B, Chlebowski RT, Patterson R, Kuller LH, Ockene JK, Margolis KL, Limacher MC, Manson JE, et al. Low-fat dietary pattern and risk of invasive breast cancer. JAMA. 2006;295:629. doi: 10.1001/jama.295.6.629. [DOI] [PubMed] [Google Scholar]
  • [51].Reeves GK, Beral V, Green J, Gathani T, Bull D. Hormonal therapy for menopause and breast-cancer risk by histological type: A cohort study and meta-analysis. Lancet Oncol. 2006;7:910–918. doi: 10.1016/S1470-2045(06)70911-1. [DOI] [PubMed] [Google Scholar]
  • [52].Rice MS, Tworoger SS, Hankinson SE, Tamimi RM, Eliassen AH, Willett WC, Colditz G, Rosner B. Breast cancer risk prediction: An update to the Rosner–Colditz breast cancer incidence model. Breast Cancer Res Treat. 2017;166:227–240. doi: 10.1007/s10549-017-4391-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [53].Rigoni-Stern Fatti statistici relativi alle malattie cancerose. Giorn Prog Patol Terap. 1842;2:507–517. [Google Scholar]
  • [54].Rockhill B, Spiegelman D, Byrne C, Hunter DJ, Colditz GA. Validation of the Gail et al. Model of breast cancer risk prediction and implications for chemoprevention. J Natl Cancer Inst. 2001;93:358–366. doi: 10.1093/jnci/93.5.358. [DOI] [PubMed] [Google Scholar]
  • [55].Rosner BA, Colditz GA, Hankinson SE, Sullivan-Halley J, Lacey JV, Bernstein L. Validation of Rosner–Colditz breast cancer incidence model using an independent data set, the California Teachers Study. Breast Cancer Res Treat. 2013;142:187–202. doi: 10.1007/s10549-013-2719-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [56].Sasieni P, Brentnall AR. On standardized relative survival. Biometrics. 2017;73:473–482. doi: 10.1111/biom.12578. MR3665964. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [57].Saslow D, Boetes C, Burke W, Harms S, Leach MO, Lehman CD, Morris E, Pisano E, Schnall M, et al. American cancer society guidelines for breast screening with MRI as an adjunct to mammography. CA Cancer J Clin. 2007;57:75–89. doi: 10.3322/canjclin.57.2.75. [DOI] [PubMed] [Google Scholar]
  • [58].Schoemaker MJ, Nichols HB, Wright LB, Brook MN, Jones ME, O’Brien KM, Adami H-O, Baglietto L, Bernstein L, et al. Association of body mass index and age with subsequent breast cancer risk in premenopausal women. JAMA Oncol. 2018;4:e181771. doi: 10.1001/jamaoncol.2018.1771. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [59].Steel M, Thompson A, Clayton J. Genetic aspects of breast cancer. Br Med Bull. 1991;47:504–518. doi: 10.1093/oxfordjournals.bmb.a072488. [DOI] [PubMed] [Google Scholar]
  • [60].Teams FC. Mammographic surveillance in women younger than 50 years who have a family history of breast cancer: Tumour characteristics and projected effect on mortality in the prospective, single-arm, FH01 study. Lancet Oncol. 2010;11:1127–1134. doi: 10.1016/s1470-2045(10)70263-1. [DOI] [PubMed] [Google Scholar]
  • [61].Tice JA, Cummings SR, Smith-Bindman R, Ichikawa L, Barlow WE, Kerlikowske K. Using clinical factors and mammographic breast density to estimate breast cancer risk: Development and validation of a new predictive model. Ann Intern Med. 2008;148:337–347. doi: 10.7326/0003-4819-148-5-200803040-00004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [62].Tice JA, Miglioretti DL, Li C-S, Vachon CM, Gard CC, Kerlikowske K. Breast density and benign breast disease: Risk assessment to identify women at high risk of breast cancer. J Clin Oncol. 2015;33 doi: 10.1200/jco.2015.60.8869. JCO.2015.60.8869–3143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [63].Tyrer J, Duffy SW, Cuzick J. A breast cancer prediction model incorporating familial and personal risk factors. Stat Med. 2004;23:1111–1130. doi: 10.1002/sim.1668. [DOI] [PubMed] [Google Scholar]
  • [64].Vachon CM, Pankratz VS, Scott CG, Haeberle L, Ziv E, Jensen MR, Brandt KR, Whaley DH, Olson JE, et al. The Contributions of Breast Density and Common Genetic Variation to Breast Cancer Risk. J Natl Cancer Inst. 2015;107 doi: 10.1093/jnci/dju397. dju397+ [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [65].Van Veen EM, Brentnall AR, Byers H, Harkness EF, Astley SM, Sampson S, Howell A, Newman WG, Cuzick J, et al. Use of single-nucleotide polymorphisms and mammographic density plus classic risk factors for breast cancer risk prediction. JAMA Oncol. 2018;4:476. doi: 10.1001/jamaoncol.2017.4881. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [66].Wacholder S, Hartge P, Prentice R, Garcia-Closas M, Feigelson HS, Diver WR, Thun MJ, Cox DG, Hankinson SE, et al. Performance of common genetic variants in breast-cancer risk models. N Engl J Med. 2010;362:986–993. doi: 10.1056/nejmoa0907727. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [67].Wang C, Brentnall AR, Cuzick J, Harkness EF, Evans DG, Astley S. A novel and fully automated mammographic texture analysis for risk prediction: Results from two case-control studies. Breast Cancer Res. 2017;19:114. doi: 10.1186/s13058-017-0906-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [68].Warwick J, Birke H, Stone J, Warren RML, Pinney E, Brentnall AR, Duffy SW, Howell A, Cuzick J. Mammographic breast density refines Tyrer–Cuzick estimates of breast cancer risk in high-risk women: findings from the placebo arm of the International Breast Cancer Intervention Study I. Breast Cancer Research. 2014;16:451. doi: 10.1186/s13058-014-0451-5. + [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [69].Wolfe JN. Breast patterns as an index of risk for developing breast cancer. Am J Roentgenol. 1976;126:1130–1137. doi: 10.2214/ajr.126.6.1130. [DOI] [PubMed] [Google Scholar]
  • [70].Wood SN. Generalized Additive Models: An Introduction with R. Texts in Statistical Science Series. CRC Press/CRC; Boca Raton, FL: 2006. MR2206355. [Google Scholar]
  • [71].Wu Y, Zhang D, Kang S. Physical activity and risk of breast cancer: A meta-analysis of prospective studies. Breast Cancer Res Treat. 2013;137:869–882. doi: 10.1007/s10549-012-2396-7. [DOI] [PubMed] [Google Scholar]
  • [72].Zhang X, Rice M, Tworoger SS, Rosner BA, Eliassen AH, Tamimi RM, Joshi AD, Lindstrom S, Qian J, et al. Addition of a polygenic risk score, mammographic density, and endogenous hormones to existing breast cancer risk prediction models: A nested case-control study. PLoS Med. 2018;15:e1002644. doi: 10.1371/journal.pmed.1002644. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary figures

RESOURCES