Skip to main content
Journal of Clinical Oncology logoLink to Journal of Clinical Oncology
. 2015 Jan 26;33(9):1038–1044. doi: 10.1200/JCO.2014.57.2750

Prospective Approach to Breast Cancer Risk Prediction in African American Women: The Black Women's Health Study Model

Deborah A Boggs 1, Lynn Rosenberg 1, Lucile L Adams-Campbell 1, Julie R Palmer 1,
PMCID: PMC4356712  PMID: 25624428

Abstract

Purpose

Breast cancer risk prediction models have underestimated risk for African American women, contributing to lower recruitment rates in prevention trials. A model previously developed for African American women was found to underestimate risk in the Black Women's Health Study (BWHS).

Methods

We developed a breast cancer risk model for African American women using relative risks derived from 10 years of follow-up of BWHS participants age 30 to 69 years at baseline. Using the subsequent 5 years of follow-up data, we evaluated calibration as the ratio of expected to observed number of breast cancers and assessed discriminatory accuracy using the concordance statistic.

Results

The BWHS model included family history, previous biopsy, body mass index at age 18 years, age at menarche, age at first birth, oral contraceptive use, bilateral oophorectomy, estrogen plus progestin use, and height. There was good agreement between predicted and observed number of breast cancers overall (expected-to-observed ratio, 0.96; 95% CI, 0.88 to 1.05) and in most risk factor categories. Discriminatory accuracy was higher for women younger than age 50 years (area under the curve [AUC], 0.62; 95% CI, 0.58 to 0.65) than for women age ≥ 50 years (AUC, 0.56; 95% CI, 0.53 to 0.59). Using a 5-year predicted risk of 1.66% or greater as a cut point, 2.8% of women younger than 50 years old and 32.2% of women ≥ 50 years old were classified as being at elevated risk of invasive breast cancer.

Conclusion

The BWHS model was well calibrated overall, and the predictive ability was best for younger women. The proportion of women predicted to meet the 1.66% cut point commonly used to determine eligibility for breast cancer prevention trials was greatly increased relative to previous models.

INTRODUCTION

African American women are more likely than white women to develop breast cancer before age 40 years,1 to be diagnosed with estrogen receptor (ER) –negative tumors,2 and to have higher 5-year breast cancer mortality rates.3 The effectiveness of breast cancer prevention and early detection strategies depends in part on the ability to accurately identify individuals at increased risk of breast cancer. Models for predicting the absolute risk of breast cancer have been used for chemoprevention decision making and for determining eligibility for recruitment into prevention trials.

The Breast Cancer Risk Assessment Tool, commonly known as the Gail model,4 was validated in white women.5,6 However, it was found to underestimate risk in African American women and was modified based on data from African American women in the Women's Contraceptive and Reproductive Experiences (CARE) study.7 The CARE model, which uses information on age, age at menarche, family history of breast cancer, and number of breast biopsies, was well calibrated among African American women age 50 to 79 years in the Women's Health Initiative7 but underestimated risk among women age 30 to 69 years in the Black Women's Health Study (BWHS), particularly among women with a later age at first birth.8 Discriminatory accuracy of the CARE model was limited in both studies.7,8 Underestimation of risk for African American women has contributed to the under-representation of African American women in breast cancer prevention trials. The aim of the this study was to develop a breast cancer risk prediction model for African American women using prospective data from the BWHS in follow-up from 1995 to 2005 and to validate the model in subsequent follow-up from 2006 to 2010, overall and by age and ER status.

METHODS

Study Population

The BWHS, an ongoing follow-up study of African American women, was established in 1995 when 59,000 African American women from across the United States enrolled through mailed health questionnaires. Study participants, age 21 to 69 years at baseline, are followed through biennial mailed questionnaires to obtain updated information on exposures and identify incident disease. The follow-up rate of the baseline cohort was 80% through 2011. The present analysis was restricted to 55,879 women who were age 30 to 69 years at baseline or who reached age 30 years during follow-up, had no history of cancer at baseline, and had no missing data on variables in the final model. The Boston University Medical Campus Institutional Review Board approved the protocol.

Exposure Assessment

The baseline questionnaire collected information on age at menarche, height, weight at age 18 years, waist and hip circumference, educational attainment, and family history of breast cancer. Baseline and biennial follow-up questionnaires ascertained information on current weight, parity, age at first birth, duration of breastfeeding, oral contraceptive use, age at menopause, type of menopause, menopausal hormone use, previous diagnosis of benign breast disease and whether it was confirmed by biopsy, smoking history, alcohol intake, and vigorous physical activity. Body mass index (BMI) was calculated as weight in kilograms divided by the square of height in meters.

Breast Cancer Ascertainment

Incident diagnoses of breast cancer were ascertained through self-report on biennial follow-up questionnaires and, for nonrespondents, through linkage with state cancer registries. Breast cancer deaths were identified through linkage with the National Death Index. We were able to obtain records for 94% of incident breast cancers identified from 1995 to 2010, of which more than 99% were confirmed by medical record or cancer registry data. Only patients with confirmed invasive breast cancer were included in the analysis.

Relative Risk Model

On the basis of follow-up data from 1995 to 2005, we used Cox proportional hazards models, stratified by age in years, to estimate relative risks (RRs) and 95% CIs for risk of invasive breast cancer associated with known and suspected risk factors. Women contributed person-years from baseline in 1995 or age 30 years (if younger than age 30 at baseline) until diagnosis of invasive breast cancer, death, or age 70 years, whichever occurred first. Women diagnosed with ductal carcinoma in situ were censored at diagnosis. Risk factors that changed over time were updated as time-varying variables at the start of each 2-year follow-up cycle.

The final model was selected by backward elimination using the Akaike information criterion and likelihood ratio tests at the significance level of P < .10. We also evaluated first-order interactions of all variables included in the final model with each other and with all other variables considered; there were no significant interactions.

We evaluated the following potential predictors, initially coded as follows: age at menarche (< 11, 11, 12, 13, 14, or ≥ 15 years), BMI at age 18 years (< 20, 20 to 22, 23 to 24, or ≥ 25 kg/m2), current BMI (< 25, 25 to 29, 30 to 34, or 35 to 39 kg/m2), waist-to-hip ratio (< 0.74, 0.74 to 0.78, 0.79 to 0.84, or ≥ 0.85), parity (none or one, two, or ≥ three births), age at first birth (< 20, 20 to 24, 25 to 29, or ≥ 30 years), duration of breastfeeding (< 1, 1 to 5, or ≥ 6 months), duration of oral contraceptive use (< 1, 1 to 4, 5 to 9, or ≥ 10 years), recency of oral contraceptive use (never used or ≥ 15, 10 to 14, 5 to 9, or < 5 years ago), type of menopause (premenopausal, natural menopause, bilateral oophorectomy, or hysterectomy), age at menopause (< 40, 40 to 44, 45 to 49, or ≥ 50 years), duration of use of estrogen supplements (never used or < 2, 2, 3 to 4, or ≥ 5 years), recency of use of estrogen alone (never used or ≥ or < 5 years ago), duration of use of estrogen plus progestin supplements (never used or < 2, 2, 3 to 4, or ≥ 5 years), recency of use of estrogen plus progestin (never used or ≥ or < 5 years ago), history of benign breast disease or breast biopsy (no or yes), family history of breast cancer (none, first-degree relative diagnosed with breast cancer before age 50, first-degree relative diagnosed at age 50 or older, second-degree relative diagnosed before age 50, or second-degree relative diagnosed at age 50 or older), alcohol intake (< one, one to three, four to six, or ≥ seven drinks per week), smoking history (never, former < 15 cigarettes a day, former ≥ 15 cigarettes a day, current < 15 cigarettes a day, or current ≥ 15 cigarettes a day), and vigorous physical activity (< 1, 1 to 4, or ≥ 5 hours per week). We excluded women with missing values on any variable; the proportion of missing values was less than 1% for most variables.

Absolute Risk

To estimate absolute risk, we used RRs and attributable risks from the BWHS model, age-specific Surveillance, Epidemiology, and End Results breast cancer rates for African American women (1994 to 1998), and age-specific competing mortality rates for African American women (1996 to 2000; rates are listed in Appendix Table A1, online only), using the methods described by Gail et al.7 For women at risk of incident breast cancer in 2006, we estimated the 5-year predicted risk from 2006 to 2010. We calculated the expected number of breast cancers by summing the predicted absolute risk for each participant. To assess calibration of the model, we compared the expected with the observed number of breast cancers (E/O ratio). The 95% CIs for the E/O ratio were calculated using the Poisson distribution. An E/O ratio greater than 1.0 indicates that the model overestimates risk of breast cancer, whereas an E/O ratio less than 1.0 indicates underestimation. We compared the expected and observed numbers of breast cancers by age group and other risk factor strata. To evaluate the model across levels of risk, we stratified the data by 5-year age groups and created age-adjusted quintiles of 5-year predicted risk to assess the model as a function of risk factors apart from age.

To assess the discriminatory accuracy of the model, we used the concordance statistic (c-statistic), or area under the receiver operating characteristic curve (AUC), which corresponds to the probability that a randomly selected woman with breast cancer has a higher predicted risk than a randomly selected unaffected woman. Random classification of women with breast cancer and women without breast cancer results in a c-statistic of 0.5, whereas perfect discrimination provides a c-statistic of 1. To assess the discriminatory accuracy for women of a given age, we estimated age-specific c-statistics in 5-year intervals and calculated the weighted average of these estimates with weights equal to the inverse variance of the age-specific estimates. Statistical analyses were performed using SAS version 9.3 (SAS Institute, Cary, NC).

RESULTS

Among 55,093 women age 30 to 69 years observed from 1995 through 2005, 896 incident invasive breast cancers were identified. The median age at diagnosis was 50 years. Table 1 lists the RR estimates for variables included in the final model. The factor associated with the largest RR was a family history of breast cancer in a first-degree relative diagnosed before age 50 years (RR, 2.24; 95% CI, 1.76 to 2.84). A history of benign breast disease or biopsy, lower BMI at age 18 years, earlier age at menarche, later age at first birth, longer duration or more recent use of oral contraceptives, no history of bilateral oophorectomy, longer duration of estrogen plus progestin use, and greater height were also associated with increased risk of breast cancer in the final model. Current BMI, waist-to-hip ratio, parity, breastfeeding, age at menopause, alcohol intake, smoking history, and vigorous physical activity were not associated with risk of invasive breast cancer at the significance level of P < .10 and are not included in the prediction model (data not shown). On the basis of RRs from the BWHS model, the estimated attributable risks necessary for calculation of absolute risk were 0.73 for women age 30 to 49 years and 0.70 for women age 50 to 69 years.

Table 1.

Relative Risk Estimates for Breast Cancer in the Black Women's Health Study, 1995 to 2005

Risk Factor No. of Breast Cancers No. of Person-Years RR* 95% CI
Family history
    None 598 362,263 1.00 Reference
    First-degree relative diagnosed at age < 50 years 77 19,288 2.24 1.76 to 2.84
    First-degree relative diagnosed at age ≥ 50 years 76 24,421 1.53 1.20 to 1.94
    Second-degree relative 145 66,008 1.35 1.12 to 1.62
Benign breast disease/biopsy
    No 598 372,449 1.00 Reference
    Yes 298 99,530 1.45 1.25 to 1.67
Body mass index at age 18 years, kg/m2
    < 20 442 197,739 1.50 1.17 to 1.92
    20-24 378 210,802 1.31 1.02 to 1.68
    ≥ 25 76 63,438 1.00 Reference
Age at menarche, years
    < 12 273 132,548 1.42 1.16 to 1.73
    12-13 472 246,749 1.24 1.03 to 1.50
    ≥ 14 151 92,683 1.00 Reference
Age at first birth, years
    Nulliparous or < 25 635 356,859 1.00 Reference
    ≥ 25 261 115,120 1.32 1.14 to 1.53
Oral contraceptive use
    < 10 years in duration and ≥ 10 years ago 663 312,506 1.00 Reference
    ≥ 10 years in duration or < 10 years ago 233 159,474 1.19 1.01 to 1.41
Bilateral oophorectomy
    No 793 422,531 1.46 1.18 to 1.80
    Yes 103 49,449 1.00 Reference
Estrogen plus progestin use, years
    < 3 838 459,197 1.00 Reference
    ≥ 3 58 12,782 1.33 1.01 to 1.76
Height
    < 5 feet, 5 inches 384 213,182 1.00 Reference
    ≥ 5 feet, 5 inches 512 258,798 1.15 1.01 to 1.32

Abbreviation: RR, relative risk.

*

Adjusted for age and all risk factors in the table.

During 5 years of follow-up from 2006 through 2010 of 48,193 women who had not developed breast cancer before 2006, 506 invasive breast cancers were identified, with a median age at diagnosis of 54 years. The 48,193 women observed from 2006 to 2010 were older at the beginning of follow-up in 2006 than the 55,093 women for whom follow-up began in 1995 and had a higher prevalence of late age at first birth, long-term use of estrogen plus progestin, and history of benign breast disease, as expected as a result of the age difference (Appendix Table A2, online only). The two groups were similar with regard to the other risk factors included in the prediction model. Data on ER status were available for 469 breast cancers, with 305 (65%) classified as ER positive and 164 (35%) classified as ER negative. The model predicted 485.8 breast cancers overall, yielding an E/O ratio of 0.96 (95% CI, 0.88 to 1.05; Table 2). The model significantly underestimated risk among women without a family history of breast cancer (E/O ratio, 0.90; 95% CI, 0.81 to 0.99), nulliparous women (E/O ratio, 0.83; 95% CI, 0.70 to 0.99), and women who used oral contraceptives within the previous 10 years (E/O ratio, 0.75; 95% CI, 0.63 to 0.89). There was also a significant underestimation of risk in the second lowest age-adjusted quintile of risk (E/O ratio, 0.75; 95% CI, 0.61 to 0.91). There was good agreement between the predicted and observed number of breast cancers across categories of other risk factors.

Table 2.

Comparison of Number of Breast Cancers Predicted by the Model With the Number Observed in the Black Women's Health Study, 2006 to 2010

Factor Women
Predicted No. of Breast Cancers Observed No. of Breast Cancers E/O Ratio 95% CI
No. %
Overall 48,193 485.8 506 0.96 0.88 to 1.05
Age, years
    30-39 12,341 26 46.2 58 0.80 0.62 to 1.03
    40-49 16,572 34 144.4 152 0.95 0.81 to 1.11
    50-59 13,546 28 196.0 198 0.99 0.86 to 1.14
    60-69 5,734 12 99.2 98 1.01 0.83 to 1.23
Family history of breast cancer
    None 37,274 77 326.4 364 0.90 0.81 to 0.99
    First-degree relative age < 50 years 1,848 4 40.0 36 1.11 0.80 to 1.54
    First-degree relative age ≥ 50 years 2,271 5 37.8 33 1.14 0.81 to 1.61
    Second-degree relative 6,800 14 81.6 73 1.12 0.89 to 1.41
Benign breast disease/biopsy
    No 36,994 77 311.7 345 0.90 0.81 to 1.00
    Yes 11,199 23 174.1 161 1.08 0.93 to 1.26
Body mass index at age 18, kg/m2
    < 20 19,390 40 233.1 226 1.03 0.91 to 1.17
    20-24 21,772 45 207.9 227 0.92 0.80 to 1.04
    ≥ 25 7,031 15 44.8 53 0.85 0.65 to 1.11
Age at menarche, years
    < 12 13,858 29 149.8 160 0.94 0.80 to 1.09
    12-13 25,270 52 256.4 276 0.93 0.83 to 1.05
    ≥ 14 9,065 19 79.5 70 1.14 0.90 to 1.44
Parity, No. of births
    0 13,832 29 105.2 127 0.83 0.70 to 0.99
    1-2 24,396 50 266.8 270 0.99 0.88 to 1.11
    ≥ 3 9,965 21 113.7 109 1.04 0.86 to 1.26
Age at first birth, years*
    < 20 10,361 21 111.2 103 1.08 0.89 to 1.31
    20-24 11,089 23 116.6 133 0.88 0.74 to 1.04
    ≥ 25 12,911 27 152.7 143 1.07 0.91 to 1.26
Lactation*
    Never 18,910 39 215.8 206 1.05 0.91 to 1.20
    Ever 15,451 32 164.7 173 0.95 0.82 to 1.10
Duration of oral contraceptive use, years
    < 1 16,480 34 176.9 175 1.01 0.87 to 1.17
    1-9 23,368 49 218.5 229 0.95 0.84 to 1.09
    ≥ 10 8,345 17 90.4 102 0.89 0.73 to 1.08
Recency of oral contraceptive use
    ≥ 10 years ago 18,030 37 213.9 204 1.05 0.91 to 1.20
    < 10 years ago 13,683 28 95.0 127 0.75 0.63 to 0.89
Type of menopause
    Premenopausal 27,133 56 190.4 214 0.89 0.78 to 1.02
    Natural 8,502 18 141.3 128 1.10 0.93 to 1.31
    Bilateral oophorectomy 5,512 11 56.6 68 0.83 0.66 to 1.06
    Hysterectomy 7,046 15 97.4 95 1.03 0.84 to 1.25
Estrogen plus progestin use
    Never 44,543 92 421.6 441 0.96 0.87 to 1.05
    < 3 years 1,892 4 27.2 33 0.83 0.59 to 1.16
    ≥ 3 years 1,758 4 36.9 32 1.15 0.82 to 1.63
Height
    < 5 feet, 5 inches 21,441 44 204.1 219 0.93 0.82 to 1.06
    ≥ 5 feet, 5 inches 26,752 55 281.7 287 0.98 0.87 to 1.10
Current body mass index, kg/m2
    < 25 11,875 25 114.3 108 1.06 0.88 to 1.28
    25-29 15,722 33 171.4 177 0.97 0.84 to 1.12
    ≥ 30 20,586 43 200.0 221 0.90 0.79 to 1.03
Alcohol intake, No. of drinks per week
    < 1 36,283 75 359.0 368 0.98 0.88 to 1.08
    1-6 9,302 19 98.1 113 0.87 0.72 to 1.04
    ≥ 7 2,608 5 28.7 25 1.15 0.77 to 1.70
Smoking status
    Never smoked 31,524 65 291.6 297 0.98 0.88 to 1.10
    Past smoker 10,449 22 131.1 143 0.92 0.78 to 1.08
    Current smoker 6,220 13 63.0 66 0.95 0.75 to 1.21
Age-adjusted quintile of risk
    1 (low) 9,656 20 53.6 59 0.91 0.70 to 1.17
    2 9,591 20 71.7 96 0.75 0.61 to 0.91
    3 9,690 20 88.3 94 0.94 0.77 to 1.15
    4 9,628 20 108.6 108 1.01 0.83 to 1.21
    5 (high) 9,628 20 163.6 149 1.10 0.93 to 1.29

Abbreviation: E/O, expected/observed.

*

Parous women.

Women with ≥ 1 year of oral contraceptive use.

We compared the RR estimates derived in the first 10 years of follow-up with estimates based on data from subsequent follow-up (data not shown). RRs for a family history of breast cancer were lower in later follow-up, which could explain the underestimation of risk among women without a family history. Conversely, the RR for long duration or recent use of oral contraceptives was higher in later follow-up, which could explain the underestimation of risk among women with recent use.

The discriminatory accuracy of the model, as measured by age-specific c-statistics, is presented in Table 3. The age-adjusted AUC was 0.59 (95% CI, 0.56 to 0.61) overall and was significantly higher (P = .03) for women younger than 50 years old (AUC, 0.62; 95% CI, 0.58 to 0.65) than for women ≥ 50 years old (AUC, 0.56; 95% CI, 0.53 to 0.59). The AUC was higher for ER-negative breast cancer than for ER-positive breast cancer (AUC, 0.62 and 0.58, respectively), but the difference was not statistically significant (P = .10).

Table 3.

Discriminatory Accuracy of the Black Women's Health Study Model, 2006 to 2010

Group Overall
Estrogen Receptor Positive
Estrogen Receptor Negative
Concordance Statistic 95% CI Concordance Statistic 95% CI Concordance Statistic 95% CI
Age specific, years
    30-34 0.57 0.41 to 0.73 0.51 0.22 to 0.79 0.66 0.45 to 0.87
    35-39 0.60 0.52 to 0.67 0.54 0.44 to 0.64 0.70 0.57 to 0.82
    40-44 0.65 0.59 to 0.71 0.63 0.55 to 0.70 0.67 0.56 to 0.78
    45-49 0.60 0.53 to 0.66 0.62 0.52 to 0.71 0.59 0.50 to 0.69
    50-54 0.56 0.51 to 0.62 0.54 0.47 to 0.62 0.60 0.51 to 0.68
    55-59 0.56 0.50 to 0.62 0.54 0.47 to 0.62 0.59 0.46 to 0.72
    60-64 0.56 0.49 to 0.64 0.57 0.48 to 0.66 0.57 0.42 to 0.72
    65-69 0.56 0.46 to 0.65 0.64 0.53 to 0.75 0.59 0.40 to 0.79
Weighted average
    Age < 50 years 0.62 0.58 to 0.65 0.60 0.55 to 0.65 0.65 0.59 to 0.71
    Age ≥ 50 years 0.56 0.53 to 0.59 0.56 0.52 to 0.61 0.59 0.53 to 0.65
    Overall 0.59 0.56 to 0.61 0.58 0.55 to 0.61 0.62 0.58 to 0.66

Compared with the lowest age-adjusted quintile of 5-year predicted risk, the RR for the highest quintile of predicted risk was 2.56 (95% CI, 1.89 to 3.46) for breast cancer overall (Table 4). Results were similar for ER-positive and ER-negative breast cancer. The RR for comparison of extreme quintiles was particularly high among younger women (RR, 4.11; 95% CI, 2.42 to 7.00), but there were only 17 breast cancers in the lowest quintile.

Table 4.

RRs Comparing Age-Adjusted Quintiles of Predicted Risk in the Black Women's Health Study, 2006 to 2010

Quintile Overall
ER-Positive Breast Cancer
ER-Negative Breast Cancer
Age < 50 Years
Age ≥ 50 Years
No. of Cancers RR 95% CI No. of Cancers RR 95% CI No. of Cancers RR 95% CI No. of Cancers RR 95% CI No. of Cancers RR 95% CI
1 (low risk) 59 1.00 Reference 34 1.00 Reference 22 1.00 Reference 17 1.00 Reference 42 1.00 Reference
2 96 1.64 1.19 to 2.28 66 1.96 1.30 to 2.97 23 1.06 0.59 to 1.90 40 2.37 1.34 to 4.19 56 1.35 0.90 to 2.02
3 94 1.59 1.15 to 2.21 56 1.65 1.07 to 2.53 30 1.36 0.79 to 2.37 37 2.18 1.23 to 3.88 57 1.35 0.91 to 2.02
4 108 1.84 1.34 to 2.54 62 1.84 1.21 to 2.80 36 1.65 0.97 to 2.81 47 2.79 1.60 to 4.86 61 1.46 0.99 to 2.17
5 (high risk) 149 2.56 1.89 to 3.46 87 2.59 1.74 to 3.86 53 2.44 1.48 to 4.01 69 4.11 2.42 to 7.00 80 1.93 1.32 to 2.81

Abbreviations: ER, estrogen receptor; RR, relative risk.

Among participants at risk of breast cancer in 2006, mean 5-year risks predicted by the BWHS model were 0.57% (range, 0.07% to 4.60%) for women younger than 50 years old and 1.37% (range, 0.35% to 7.48%) for women ≥ 50 years old; 2.8% of women younger than 50 years old and 32.2% of women ≥ 50 years old had a predicted risk of 1.66% or greater, the cut point typically used for enrollment in prevention trials (Table 5). In contrast, on the basis of the CARE model, mean predicted 5-year risks were 0.50% (range, 0.09% to 2.45%) and 1.16% (range, 0.71% to 3.83%) for younger and older women, respectively, and only 0.1% of younger women and 7.3% of older women were predicted to have a 5-year risk of at least 1.66%.

Table 5.

Comparison of the BWHS and CARE Models for Prediction of 5-Year Risk Less Than or ≥ the Cut Point of 1.66%

BWHS Model CARE Model
< 1.66%
≥ 1.66%
Total
No. of Participants % No. of Participants % No. of Participants %
Overall
    < 1.66% 40,979 85.0 197 0.4 41,176 85.4
    ≥ 1.66% 5,780 12.0 1,237 2.6 7,017 14.6
    Total 46,759 97.0 1,434 3.0
Age < 50 years
    < 1.66% 28,102 97.2 7 < 0.1 28,109 97.2
    ≥ 1.66% 787 2.7 17 0.1 804 2.8
    Total 28,889 99.9 24 0.1
Age ≥ 50 years
    < 1.66% 12,877 66.8 190 1.0 13,067 67.8
    ≥ 1.66% 4,933 25.9 1,220 6.3 6,213 32.2
    Total 17,870 92.7 1,410 7.3

Abbreviations: BWHS, Black Women's Health Study; CARE, Women's Contraceptive and Reproductive Experiences.

DISCUSSION

We developed a risk prediction model for invasive breast cancer for African American women age 30 to 69 years based on 10 years of follow-up in the BWHS. The model includes family history, previous benign breast disease, BMI at age 18 years, age at menarche, age at first birth, oral contraceptive use, bilateral oophorectomy, estrogen plus progestin use, and height. In an internal validation during the subsequent 5 years of follow-up, the model was well calibrated in that it predicted 486 breast cancers in comparison to the 506 breast cancers that occurred (E/O ratio, 0.96). The model fit well in most risk factor strata but underestimated risk among women who had no family history of breast cancer, were nulliparous, had recently used oral contraceptives, or had low predicted risk. Previously, we evaluated the CARE model in the BWHS and found that it underestimated risk overall (E/O ratio, 0.88) and particularly among women with an older age at first birth.8 The CARE model included only the following three predictors: age at menarche, family history, and an interaction between number of breast biopsies and age.

The discriminatory accuracy for breast cancer overall was previously estimated to be 0.57 based on the CARE model,8 compared with 0.59 based on the BWHS model in the current analysis. Previously, discriminatory accuracy of the CARE model was higher for ER-positive breast cancer (AUC, 0.59) than for ER-negative breast cancer (AUC, 0.54).8 In contrast, the discrimination of the BWHS model was higher for ER-negative than for ER-positive breast cancer, although the difference was not statistically significant (P = .10). Reasons for the apparently greater discriminatory accuracy of the BWHS model for ER-negative breast cancer are unclear. Future work in the BWHS after accrual of more breast cancer diagnoses will include developing breast cancer risk prediction models separately by ER status.

Discriminatory accuracy was significantly better for younger women in the present analysis (AUC, 0.62 for age < 50 years and 0.56 for age ≥ 50 years). In 2009, the US Preventive Services Task Force recommended against routine mammography in women younger than age 50 years.9 However, African American women are more likely than other racial or ethnic groups to be diagnosed with breast cancer at a young age,1 and the BWHS model could be a clinically useful tool to identify women who may benefit from targeted screening recommendations before age 50 years. Women age ≥ 70 years were not included in the analysis because of limited numbers. In addition, discrimination has been found to be lower for women older than age 70 years, possibly as a result of weaker associations between risk factors and breast cancer incidence at older ages.10

Enrollment in breast cancer prevention trials has traditionally been restricted to women with a 5-year predicted risk of at least 1.66%. Because breast cancer risk prediction models have underestimated risk for African American women, recruitment has been disproportionately low. Two studies have demonstrated that a greater proportion of African American women would meet trial eligibility criteria for elevated risk by using the CARE model rather than the original Gail model.7,11 However, the CARE model underestimates risk for certain subgroups of African American women, as we previously found in the BWHS.8 In this study, we show that 14.6% of women age 30 to 69 years were predicted to have a 5-year risk of at least 1.66% using the BWHS model, compared with only 3.0% using the CARE model. Therefore, use of the BWHS model could result in increased eligibility of African Americans for prevention trials. It should be noted, however, that the model overestimated risk by approximately 10% (not statistically significant) in the highest quintile of risk, which could account for some of the discrepancy between the BWHS and CARE models in terms of the proportion of women predicted to have a 5-year risk of at least 1.66%.

The BWHS model underestimated risk among women without a family history of breast cancer, possibly because information on family history had not been reported since 1999. Thus, some of the women may have been misclassified as not having a family history.

To our knowledge, this is the first study to use prospective data to develop a breast cancer risk prediction model for black women. A key strength was the inclusion of both younger and older African American women. A limitation is that the model was developed and validated in the same cohort, which may have resulted in optimistic model performance. Although we developed the model with data from one time period and evaluated it with subsequent follow-up data, our model may not perform as well in other study populations. We did not have data on mammographic density, which has modestly improved the Gail model in some studies.12,13 This measurement is not readily available for many women, which makes it less feasible to include in a risk model in large-scale studies. We also lacked information on benign breast disease subtypes including atypical hyperplasia.

In summary, we developed a breast cancer risk prediction model for African American women that was well calibrated across a wide age range and across strata of risk factors. The discriminatory accuracy of the model remains modest but was higher for breast cancer overall and for ER-negative breast cancer than the discriminatory accuracy of a previous model with fewer risk factors.7 The predictive ability was better for younger women, and the model may be a useful tool to identify women who may benefit from screening before age 50 years. It is unclear how well the model will perform in study populations with different distributions of risk factors; the calibration and discrimination of the BWHS model should be evaluated in independent studies.

Acknowledgment

We gratefully acknowledge the continuing dedication of the Black Women's Health Study participants and staff. Data on breast cancer pathology were obtained from several state cancer registries (AZ, CA, CO, CT, DC, DE, FL, GA, IN, IL, KY, LA, MA, MD, MI, NC, NJ, NY, OK, PA, SC, TN, TX, and VA), and results reported do not necessarily represent their views.

Appendix

Table A1.

Rates Used to Estimate Absolute Risk of Breast Cancer in the BWHS (for African American women)

Age (years) Breast Cancer Incidence Rates (per 100,000 person-years; SEER, 1994-1998)* Competing Mortality Rates (per 100,000 person-years; NCHS, 1996-2000)
30-34 31.1 145.9
35-39 67.6 215.9
40-44 119.4 315.1
45-49 187.4 448.8
50-54 241.5 632.3
55-59 291.1 963.0
60-64 310.1 1,471.8
65-69 365.6 2,116.3

Abbreviations: BWHS, Black Women's Health Study; NCHS, National Center for Health Statistics.

*

SEER, 13 registries, 1994 to 1998.

Competing mortality rates from causes other than breast cancer, obtained from the NCHS (1996 to 2000).

Table A2.

Black Women's Health Study Cohort Characteristics in 1995 and 2006

Characteristic % of Women
1995 (n = 55,093) 2006 (n = 48,193)
Age, years
    Mean 39.9 47.2
    Standard deviation 9.3 9.5
Family history
    None 77.2 77.3
    First-degree relative diagnosed at < 50 years 4.0 3.8
    First-degree relative diagnosed at ≥ 50 years 4.9 4.7
    Second-degree relative 13.9 14.1
Benign breast disease/biopsy
    No 81.4 76.8
    Yes 18.6 23.2
Body mass index at age 18 years, kg/m2
    < 20 40.9 40.2
    20-24 45.0 45.2
    ≥ 25 14.1 14.6
Age at menarche, years
    < 12 28.4 28.8
    12-13 52.3 52.4
    ≥ 14 19.3 18.8
Age at first birth, years
    Nulliparous or < 25 78.2 73.2
    ≥ 25 21.8 26.8
Oral contraceptive use
    < 10 years in duration and ≥ 10 years ago 62.4 64.2
    ≥ 10 years in duration or < 10 years ago 37.6 35.8
Bilateral oophorectomy
    No 92.2 88.6
    Yes 7.8 11.4
Estrogen plus progestin use, years
    < 3 98.6 96.4
    ≥ 3 1.4 3.6
Height
    < 5 feet, 5 inches 45.0 44.5
    ≥ 5 feet, 5 inches 55.0 55.5

Footnotes

Supported by Susan G. Komen for the Cure (Grant No. KG111112) and the National Cancer Institute (Grants No. R01 CA058420 and UM1 CA164974).

The contents of this article are solely the responsibility of the authors and do not necessarily represent the official views of Susan G. Komen for the Cure, the National Cancer Institute, or the National Institutes of Health.

Authors' disclosures of potential conflicts of interest are found in the article online at www.jco.org. Author contributions are found at the end of this article.

AUTHORS' DISCLOSURES OF POTENTIAL CONFLICTS OF INTEREST

Disclosures provided by the authors are available with this article at www.jco.org.

AUTHOR CONTRIBUTIONS

Conception and design: All authors

Financial support: All authors

Administrative support: Lynn Rosenberg, Lucile L. Adams-Campbell, Julie R. Palmer

Provision of study materials or patients: Lynn Rosenberg, Lucile L. Adams-Campbell, Julie R. Palmer

Collection and assembly of data: Lynn Rosenberg, Lucile L. Adams-Campbell, Julie R. Palmer

Data analysis and interpretation: Deborah A. Boggs

Manuscript writing: All authors

Final approval of manuscript: All authors

AUTHORS' DISCLOSURES OF POTENTIAL CONFLICTS OF INTEREST

Prospective Approach to Breast Cancer Risk Prediction in African American Women: The Black Women's Health Study Model

The following represents disclosure information provided by authors of this manuscript. All relationships are considered compensated. Relationships are self-held unless noted. I = Immediate Family Member, Inst = My Institution. Relationships may not relate to the subject matter of this manuscript. For more information about ASCO's conflict of interest policy, please refer to www.asco.org/rwc or jco.ascopubs.org/site/ifc.

Deborah A. Boggs

Employment: Quintiles

Lynn Rosenberg

No relationship to disclose

Lucile L. Adams-Campbell

No relationship to disclose

Julie R. Palmer

No relationship to disclose

REFERENCES

  • 1.Anderson WF, Rosenberg PS, Menashe I, et al. Age-related crossover in breast cancer incidence rates between black and white ethnic groups. J Natl Cancer Inst. 2008;100:1804–1814. doi: 10.1093/jnci/djn411. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Clarke CA, Keegan TH, Yang J, et al. Age-specific incidence of breast cancer subtypes: Understanding the black-white crossover. J Natl Cancer Inst. 2012;104:1094–1101. doi: 10.1093/jnci/djs264. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Silber JH, Rosenbaum PR, Clark AS, et al. Characteristics associated with differences in survival among black and white women with breast cancer. JAMA. 2013;310:389–397. doi: 10.1001/jama.2013.8272. [DOI] [PubMed] [Google Scholar]
  • 4.Gail MH, Brinton LA, Byar DP, et al. Projecting individualized probabilities of developing breast cancer for white females who are being examined annually. J Natl Cancer Inst. 1989;81:1879–1886. doi: 10.1093/jnci/81.24.1879. [DOI] [PubMed] [Google Scholar]
  • 5.Costantino JP, Gail MH, Pee D, et al. Validation studies for models projecting the risk of invasive and total breast cancer incidence. J Natl Cancer Inst. 1999;91:1541–1548. doi: 10.1093/jnci/91.18.1541. [DOI] [PubMed] [Google Scholar]
  • 6.Rockhill B, Spiegelman D, Byrne C, et al. Validation of the Gail et al. model of breast cancer risk prediction and implications for chemoprevention. J Natl Cancer Inst. 2001;93:358–366. doi: 10.1093/jnci/93.5.358. [DOI] [PubMed] [Google Scholar]
  • 7.Gail MH, Costantino JP, Pee D, et al. Projecting individualized absolute invasive breast cancer risk in African American women. J Natl Cancer Inst. 2007;99:1782–1792. doi: 10.1093/jnci/djm223. [DOI] [PubMed] [Google Scholar]
  • 8.Boggs DA, Rosenberg L, Pencina MJ, et al. Validation of a breast cancer risk prediction model developed for Black women. J Natl Cancer Inst. 2013;105:361–367. doi: 10.1093/jnci/djt008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.US Preventive Services Task Force: Screening for breast cancer: U.S. Preventive Services Task Force recommendation statement. Ann Intern Med. 2009;151:716–726. W-236. doi: 10.7326/0003-4819-151-10-200911170-00008. [DOI] [PubMed] [Google Scholar]
  • 10.Vacek PM, Skelly JM, Geller BM. Breast cancer risk assessment in women aged 70 and older. Breast Cancer Res Treat. 2011;130:291–299. doi: 10.1007/s10549-011-1576-1. [DOI] [PubMed] [Google Scholar]
  • 11.Adams-Campbell LL, Makambi KH, Frederick WA, et al. Breast cancer risk assessments comparing Gail and CARE models in African-American women. Breast J. 2009;15(suppl 1):S72–S75. doi: 10.1111/j.1524-4741.2009.00824.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Barlow WE, White E, Ballard-Barbash R, et al. Prospective breast cancer risk prediction model for women undergoing screening mammography. J Natl Cancer Inst. 2006;98:1204–1214. doi: 10.1093/jnci/djj331. [DOI] [PubMed] [Google Scholar]
  • 13.Chen J, Pee D, Ayyagari R, et al. Projecting absolute invasive breast cancer risk in white women with a model that includes mammographic density. J Natl Cancer Inst. 2006;98:1215–1226. doi: 10.1093/jnci/djj332. [DOI] [PubMed] [Google Scholar]

Articles from Journal of Clinical Oncology are provided here courtesy of American Society of Clinical Oncology

RESOURCES