Skip to main content
Nature Communications logoLink to Nature Communications
. 2018 Sep 27;9:3957. doi: 10.1038/s41467-018-06467-9

A Mendelian randomization study of the effects of blood lipids on breast cancer risk

Christoph Nowak 1,, Johan Ärnlöv 1,2
PMCID: PMC6160471  PMID: 30262900

Abstract

Observational studies have reported inconsistent associations between circulating lipids and breast cancer risk. Using results from >400,000 participants in two-sample Mendelian randomization, we show that genetically raised LDL-cholesterol is associated with higher risk of breast cancer (odds ratio, OR, per standard deviation, 1.09, 95% confidence interval, 1.02–1.18, P = 0.020) and estrogen receptor (ER)-positive breast cancer (OR 1.14 [1.05–1.24] P = 0.004). Genetically raised HDL-cholesterol is associated with higher risk of ER-positive breast cancer (OR 1.13 [1.01–1.26] P = 0.037). HDL-cholesterol-raising variants in the gene encoding the target of CETP inhibitors are associated with higher risk of breast cancer (OR 1.07 [1.03–1.11] P = 0.001) and ER-positive breast cancer (OR 1.08 [1.03–1.13] P = 0.001). LDL-cholesterol-lowering variants mimicking PCSK9 inhibitors are associated (P = 0.014) with lower breast cancer risk. We find no effects related to the statin and ezetimibe target genes. The possible risk-promoting effects of raised LDL-cholesterol and CETP-mediated raised HDL-cholesterol have implications for breast cancer prevention and clinical trials.


The link between circulating lipids and breast cancer risk is complex. Here, the authors utilise data from more than 400,000 participants in two-sample Mendelian randomization to assess the link between blood lipids and breast cancer risk, and they find risk-promoting effects of raised LDL-cholesterol and CETP-mediated raised HDL-cholesterol.

Introduction

Breast cancer affects up to 1 in 8 women during their lifetime and is the second leading cause of death among women in the Western world1,2. Cardiovascular diseases and breast cancer share many metabolic risk factors, including diet, obesity and physical activity3. The role of lipids, particularly in estrogen receptor-positive (ER-positive) breast cancer, is well known, but causal pathways have been difficult to disentangle4. Observational studies on associations between blood lipids and breast cancer have yielded equivocal results, with suggestive associations between raised triglyceride and high-density lipoprotein-cholesterol (HDL-cholesterol) and lower risk of breast cancer5,6, that have, however, not been confirmed by other studies7 and may depend on menopausal status8. No evidence for an association between low-density lipoprotein-cholesterol (LDL-cholesterol)57,9 or use of statins (widely prescribed LDL-cholesterol-lowering drugs) and breast cancer has been detected1012, although in women with breast cancer, statins may be associated with lower recurrence risk13 and reduced breast cancer-specific mortality14.

The effects of other lipid-modifying drugs on breast cancer are less well studied. No association with cancer occurrence was reported in the largest trial of an HDL-cholesterol-increasing cholesteryl ester transfer protein (CETP) inhibitor15. Kobberø Lauridsen et al.16 found no association between genetic variants in the NPC1L1 gene (encoding the target of ezetimibe) and cancer in ~67,000 persons. No concerns have been raised about associations between proprotein convertase subtilisin/kexin type 9 (PCSK9) inhibitors and cancer risk17.

Limited follow-up time in clinical trials and confounding or reverse causation in observational studies render conclusions about causality uncertain. In epidemiologic settings, Mendelian randomization (MR) has been developed to assess causality18. Parental genetic variants are randomly inherited, and MR uses variants that are associated with an exposure as instruments to test for associations with an outcome. This concept is analogous to randomized designs and minimizes bias from confounding and reverse causation. It can also predict drug effects by using mutations in drug target genes as instruments19. MR makes several assumptions that are often difficult to ascertain, including the absence of genetic effects on the outcome that are independent of the exposure (absent horizontal pleiotropy)20. It can thus only provide preliminary evidence of causality that may inform subsequent intervention studies, drug monitoring and public health approaches21.

MR studies have demonstrated, for instance, an inverse association between genetically predicted obesity and risk of breast cancer22,23. Orho-Melander et al.24 studied ~16,000 women and found suggestive effects of raised HDL-cholesterol and reduced triglycerides on increased breast cancer risk that did not, however, reach nominal significance. In the same study, the LDL-lowering allele of a variant in the statin target gene HMGCR was associated with lower risk of breast cancer while an LDL-cholesterol genetic score was not, suggesting an LDL-independent mechanism. To our knowledge, MR has not been applied to study effects of lipids on breast cancer risk in the largest available genetic datasets from the Global Lipid Genetics Consortium (GLGC)25 and the 2017 release of the Breast Cancer Association Consortium (BCAC)26 of over 180,000 participants each.

In this study, we use two-sample MR in the largest available genetic datasets to assess causal associations between circulating LDL-cholesterol, HDL-cholesterol, triglycerides and variants in genes encoding lipid-modifying drug targets on the risk of total breast cancer, ER-positive and ER-negative breast cancer. We find possible risk-increasing effects of raised LDL-cholesterol and CETP-mediated raised HDL-cholesterol that may have implications for breast cancer prevention.

Results

Study overview

Figure 1 summarizes the study flow. Individual genetic variant associations are listed in Supplementary Tables 1 and 2. Full MR results are available in Supplementary Tables 36 for lipids and in Supplementary Table 7 for drug targets. Causal estimates are expressed as odds ratios (OR) and 95% confidence interval (CI) per standard deviation increment in plasma lipid level. Comprehensive MR refers to using all variants associated with the target lipid, whilst restrictive MR excludes variants associated with any of the other lipids (P < 0.001).

Fig. 1.

Fig. 1

Study overview. BCAC: Breast Cancer Association Consortium, Egger MR Egger method, EUR European reference sample, GLGC Global Lipids Genetics Consortium, HDL-C high-density lipoprotein-cholesterol, Intercept Egger regression intercept term, IVW inverse variance-weighted method, LD linkage disequilibrium, LDL-C low-density lipoprotein-cholesterol, MR-PRESSO MR pleiotropy residual sum and outlier, Q′ modified 2nd order weight heterogeneity test, SNP single nucleotide polymorphisms, TG triglycerides

Effect of lipid levels

In comprehensive MR (Supplementary Table 3), we detected suggestive associations between raised LDL-cholesterol and breast cancer risk (P = 0.055). Genetically raised HDL-cholesterol was associated with breast cancer risk (P = 0.003) and ER-positive disease risk (P = 0.002). Raised triglycerides were negatively associated with all three outcomes. There was evidence of heterogeneity in all analyses (QP-values < 10−5). Following exclusion of pleiotropic variants in restrictive MR (Supplementary Table 4), raised LDL-cholesterol was associated with higher risk of any breast cancer (OR 1.12, 95% CI, 1.02–1.23, P = 0.017) and ER-positive breast cancer (OR 1.17, 95% CI, 1.05–1.29, P = 0.004) with consistent estimates across the Egger and median methods but evidence of remaining heterogeneity (QP-values < 10−4). Raised HDL-cholesterol had no clear association with breast cancer risk (OR 1.08, 95% CI, 0.96–1.21, P = 0.198) with significant remaining heterogeneity (QP-value = 0.003). There was evidence of an effect of raised HDL-cholesterol on increased ER-positive breast cancer risk (OR 1.13, 95% CI, 1.01–1.26, P = 0.028, QP-value = 0.169). Triglycerides were not associated with any of the outcomes (P > 0.4).

We applied the MR-PRESSO method to the restrictive MR models to identify and remove outlier variants followed by retesting for heterogeneity (Fig. 2, Supplementary Table 5). In inverse variance-weighted MR following the removal of outliers, raised LDL-cholesterol had a risk-increasing effect on breast cancer (OR 1.09, 95% CI, 1.02–1.18, P = 0.020, QP-value = 0.102) and ER-positive breast cancer (OR 1.14, 95% CI, 1.05–1.24, P = 0.004, QP-value = 0.124) and no association with ER-negative disease (P = 0.577). Raised HDL-cholesterol had no nominally significant association with either breast cancer risk (OR 1.07, 95% CI, 0.97–1.19, P = 0.171, QP-value = 0.090) or ER-negative disease (OR 1.09, 95% CI, 0.91–1.30, P = 0.365, QP-value = 0.108), but appeared associated with increased risk of ER-positive disease (OR 1.13, 95% CI, 1.01–1.26, P = 0.037, QP-value = 0.169). Genetically predicted triglyceride levels were not associated with any of the outcomes. Application of MR-PRESSO to the comprehensive selection without exclusion of variants associated with other lipids produced causal estimates in the same direction, but there was significant (P < 0.05) remaining heterogeneity after exclusion of outliers in all cases (Supplementary Table 6).

Fig. 2.

Fig. 2

Causal estimates of blood lipid levels on risk of all, ER-positive and ER-negative breast cancer. Inverse variance-weighted instrumental variable analysis using genome-wide significantly associated independent variants as instrumental variables for each lipid. Results following exclusion of variants associated at P < 0.001 with any of the other lipids and following removal of outlier variants (P < 0.05 in MR-PRESSO) are shown. Causal estimates express the change in odds ratio (OR) per standard deviation (SD) increment in lipid concentration. Error bars indicate 95% confidence intervals

Effect of lipid-modifying drug targets

We implemented inverse variance-weighted MR and MR Egger with consideration of the correlation between genetic variants using seven LDL-cholesterol-associated variants in PCSK9, three LDL-associated variants each for NPC1L1 and HMGCR, and six variants for LDLR (Supplementary Table 2). For CETP, we selected 11 HDL-cholesterol-associated variants. LDL-raising variants in PCSK9 were associated with increased risk of breast cancer (OR 1.10, 95% CI, 1.02–1.19, P = 0.014) but not with ER-positive (OR 1.08, 95% CI, 0.99–1.18, P = 0.099) or ER-negative breast cancer (OR 1.13, 95% CI, 0.98–1.30, P = 0.089) at the nominal significance level (Fig. 3, Supplementary Table 7). HDL-raising variants in CETP were associated with raised breast cancer risk (OR 1.07, 95% CI, 1.03–1.11, P = 0.001) and ER-positive breast cancer risk (OR 1.08, 95% CI, 1.03–1.13, P = 0.001), although in both cases, MR Egger was suggestive of a null association (OR 1.00, 95% CI, 0.87–1.15, P = 0.972; and OR 1.01, 95% CI, 0.86–1.17, P = 0.948, respectively). The associations with ER-negative breast cancer risk did not reach nominal significance (OR 1.07, 95% CI, 0.99–1.15, P = 0.075). Estimates for NPC1L1 were suggestive of risk-increasing effects of raised LDL-cholesterol on breast cancer and ER-positive breast cancer risk in inverse variance-weighted analysis, but MR Egger estimates were in the opposite direction, casting doubt on the validity of MR assumptions in this case. LDL-cholesterol-raising variants in HGMCR had a suggestive association with breast cancer risk (OR 1.16, 95% CI, 0.98–1.37, P = 0.086; Fig. 3). The analysis for LDLR variants provided inconsistent estimates across methods and implied unaccounted-for pleiotropy in the Egger intercept test for all three outcomes, cautioning against an interpretation of the observed results in inverse variance-weighted MR.

Fig. 3.

Fig. 3

Causal estimates of blood lipid level-increase due to genetic variants in genes encoding drug targets. Inverse variance-weighted (black) and MR Egger (gray) instrumental variable estimates using genome-wide significantly associated variants within 100 b either side of the gene in low linkage disequilibrium (r2 < 0.4). Analyses take correlations between genetic instruments into account. Causal estimates express the change in odds ratio (OR) per standard deviation (SD) increment in lipid concentration. Error bars indicate 95% confidence intervals

Discussion

In two-sample summary-level MR, we found an association between genetically raised LDL-cholesterol and increased risk of breast cancer and ER-positive breast cancer. Instrumental variable analysis with HDL-cholesterol-raising variants in CETP suggested a small risk-increasing effect on breast cancer and ER-positive disease; however, possible bias from pleiotropy cannot be excluded. Lowered LDL-cholesterol due to variants in PCSK9 had a suggestive protective effect on breast cancer risk.

The risk-increasing effect of genetically raised LDL-cholesterol on total and ER-positive breast cancer contrasts with observational studies that reported inconsistent results generally suggesting a null association, with marked heterogeneity depending on study design, menopausal status and body mass index of participants57,9. In our study, we were able to minimize pleiotropic effects that bias observational studies and found evidence of an LDL-specific harmful effect on ER-positive and (to a lesser extent) total breast cancer risk. The inverse association between triglycerides and breast cancer risk in comprehensive MR was abolished after excluding variants associated with other lipids, implying that triglyceride levels do not affect breast cancer risk independently. This result broadly concurs with observational studies that reported either absent or weak inverse associations6,7,27.

Genetically raised HDL-cholesterol was associated with increased risk of ER-positive breast cancer. Observational studies have reported inconsistent results on associations between HDL and breast cancer, including null effects5,7,27,28, inverse associations in post-menopausal women6 and unidirectional associations in pre-menopausal women29. The effect of HDL-cholesterol on breast cancer risk in our study concurs with a non-significant association in an earlier genetic study24, and the absence of a stronger effect in our summary-level data may relate to lack of power and an inability to stratify participants by menopausal status. Another explanation could be different effects depending on the metabolic health of the individual, as in vivo studies found evidence that oxidation status of HDL-cholesterol in a normo- or hyperlipidaemic context may determine its effects on the promotion of breast cancer metastasis30.

Taken together, laboratory studies have demonstrated that lipoprotein fractions affect breast cancer growth both directly and as precursors for cholesterol metabolites30,31, and observational studies in women have hitherto not been able to consistently define potential effects. Our study provides genetic evidence of a harmful association between raised LDL-cholesterol and breast cancer occurrence, as well as a suggestive harmful effect of raised HDL-cholesterol. A deficit of our study is the inability to stratify women by menopausal status. The endocrine changes of the menopause likely affect plasma lipid composition and the interaction with breast tissue. For instance, a meta-analysis of observational studies found that an inverse association between HDL-cholesterol and breast cancer was only present in post- but not in premenopausal women6. Differing effects depending on menopausal status are further suggested by findings that genetically predicted obesity is inversely associated breast cancer risk, which contrasts with a positive association between obesity and postmenopausal breast cancer risk in observational studies22. Large biobanks such as the UK Biobank may in the future allow to further dissect the suggestive causal effects of HDL-cholesterol on breast cancer discovered in our study.

Raised HDL-cholesterol due to genetic variants in CETP was associated with raised total and ER-positive breast cancer risk, but only two of the 11 variants were individually associated with breast cancer at the nominal significance level and MR Egger implied absent effects (Supplementary Tables 2 and 6). Whether pharmacological CETP inhibition could affect breast cancer risk remains uncertain, as lifelong genetic effects and consequences of pharmacological intervention in mid-life may differ21. None of the four clinical trials of CETP inhibitors published by May 2018 have reported associations with breast cancer occurrence, although the proportion of women (19.2% across trials) and the short follow-up of up to 4.1 years pose limitations on the detection of potential cancer-related effects. Table 1 summarizes these trials.

Table 1.

Breast cancer outcomes in clinical trials of CETP inhibitors

Study name Description Reported outcomes related to breast cancer
ILLUMINATE47 Torcetrapib, N = 15,067 high cardiovascular risk, 1–2-year follow-up, 1,679 women in active and 1,673 women in comparator group No specific reporting on breast cancer. There was 1 cancer death in the torcetrapib group and 0 in the comparator group. Serious adverse events affecting the “reproductive system or breast” occurred in 27 active and 18 control persons.
dal-OUTCOMES48 Dalcetrapib, N = 15,871 post-acute coronary syndrome, 31-month follow-up, 1,573 women in active and 1,497 women in comparator group No specific reporting on breast cancer. Malignant or unspecific tumours occurred in 270 persons (48 fatal) in the active group and in 286 persons (47 deaths) in the comparator group.
ACCELERATE49 Evacetrapib, N = 12,092 high vascular risk, 26-month follow-up, 1,390 women in active and 1,394 women in comparator group No reporting of breast cancer.
REVEAL15 Anacetrapib, N = 30,449 high vascular risk, 4.1-year follow-up, 2,459 women in active and 2,456 women in comparator group Breast cancer occurred in 24 persons in the active, and 27 persons in the comparator group.

Lowering of LDL-cholesterol due to variants in PCSK9 was associated with risk-reducing effects on breast cancer occurrence in our study. A 2017 review of clinical studies comparing PCSK9 inhibitors to placebo found no association with risk of any cancer, although the direction of association (OR 0.91, 95% CI, 0.63–1.31) does not exclude a possibly protective effect19. A similar non-significant protective association with cancer risk was found in a phenome-wide association study of genetic variants in PCSK9 in the UK Biobank sample32.

The potential, but not nominally significant effect (P = 0.086) of variants in the gene encoding the target of statins could point to a true signal that our study was underpowered to detect or that may differ in subgroups (such as postmenopausal women) that we were not able to assess. The possible risk-reducing effect of statin mimicry in MR chimes with observational studies that reported either null or protective associations with breast cancer risk and mortality1014. Future studies with genetic and drug exposure data that allow analysis in subgroups should address any possible effects of statins on breast cancer.

An early clinical trial of ezetimibe raised concerns that combination therapy with statins might be associated with increased risk of cancer. Subsequent longer follow-up and comparisons across other clinical trials, however, found no association with raised cancer risk33,34. Our findings indicate a possible protective effect and agree with an earlier smaller genetic study16, but inconsistent estimates between Egger and inverse variance-weighted MR in our analysis imply violations of model assumptions that do not allow for conclusive interpretation.

Strengths of our study include the use of the largest available summary genetic datasets and extensive diagnostics to evaluate the validity of MR assumptions and limit the potential for bias from pleiotropy. Limitations include our inability to replicate results in independent datasets, concerns about pleiotropy from (un)measured confounders, possible weak instrument bias and lack of power for drug analyses. A bias toward the null because of Winner’s curse35, as genetic discovery had been implemented in the same dataset used to estimate exposure associations, cannot be excluded. MR assesses the life-long effects of genetic variation and cannot be directly compared to pharmacological inhibition in adult life. The analyses accounted for population stratification (genetic principle components and restriction to European ethnicity) and pleiotropy (MR Egger), but remaining sources of bias such as canalization cannot be ruled out. Finally, we could not assess the influence of menopausal status and our results only apply to women of European ethnicity.

Methods

Summary genetic association data

Genome-wide association study results in persons of mostly European ancestry were obtained from the GLGC (up to 188,577 persons) for plasma lipids25 and from BCAC for risk of breast cancer (up to 122,977 affected and 105,974 control women)26. Both studies included rigorous quality control, imputation to the 1000 Genomes Project panel and adjustments for age and population structure. The studies have existing ethical permissions from their respective institutional review boards and include participant informed consent. The outcomes in the present study were risk of any breast cancer, ER-positive and ER-negative breast cancer as defined in BCAC26.

Analyses in the Global Lipids Genetics Consortium

Persons of European ancestry from 47 studies genotyped with different genome-wide association study arrays (n = 94,595) or on the Metabochip array (n = 93,982) with imputation to the 1000 Genomes Project reference were studied. In most included studies, blood lipid concentrations had been measured after an >8 h fast. Participants on lipid-lowering medications were excluded. Traits were adjusted for age, age-squared, sex and principle components, as well as quantile-normalized within each cohort. For genetic association analysis by linear regression, lipid levels were inverse normal-transformed and cohort-wise results combined in fixed effect meta-analysis25.

Analyses in the Breast Cancer Association Consortium

The consortium genotyped on the OncoArray altogether 61,282 women with breast cancer and 45,494 control women without breast cancer of European ancestry who were enrolled in 68 studies in the BCAC and the Discovery, Biology and Risk of Inherited Variants in Breast Cancer Consortium (DRIVE). Genotypes were imputed to ~21 million variants using the 1000 Genomes Project (Phase 3) reference panel. Variants with minor allele frequency <0.5% and imputation quality score <0.3 were excluded resulting in ~11.8 million variants for logistic regression analysis adjusted for genetic principle components and country. Results were combined in fixed-effect meta-analysis with results from the Collaborative Oncological Gene-environment Study (iCOGS, 46,785 cases and 42,892 controls) and 11 other breast cancer genome-wide association studies (14,910 cases and 17,588 controls). The current study uses summary results from women of European ancestry26.

Genetic instruments for blood lipids

We extracted association statistics for LDL-cholesterol, HDL-cholesterol and triglycerides in GLGC for 185 genetic variants in 157 loci previously demonstrated to be associated with at least one lipid fraction36. We constructed two genetic instruments for each lipid. First, we selected all genome-wide significant (P < 5 × 10−8) variants associated with each lipid for comprehensive MR (76 variants for LDL-cholesterol, 85 for HDL-cholesterol, 51 for triglycerides). Second, to reduce possible pleiotropic effects we excluded in each selection those variants that were associated at P < 0.001 with any of the other two lipids for restrictive MR (44 variants for LDL-cholesterol, 28 for HDL-cholesterol, 4 for triglycerides).

Proxies for drug targets

To assess potential causal effects of changes in lipid levels due to pharmacological intervention, we selected polymorphisms within ±100 base pairs of genes encoding drug targets that were genome-wide significantly associated with the target lipid and in low linkage disequilibrium with each other. Variants were ranked by P-value for lipid association in GLGC and iteratively selected in the order of increasing P-value provided they were in low linkage disequilibrium (r2 < 0.4) with variants selected in preceding steps. We obtained pairwise linkage disequilibrium based on Phase 3 (Version 5) of the 1000 Genomes Project combined European reference sample via LDlink37. We used associations with HDL-cholesterol to construct the genetic instrument for CETP and associations with LDL-cholesterol to construct instruments for HMGCR (encodes the target of statins), PCSK9, NPC1L1 (encodes the target of ezetimibe) and LDLR. LDLR does not mimic a drug target but is a common site of mutations causing familial hypercholesterolemia (OMIM 606945) and was included to assess the role of the LDL receptor pathway.

Mendelian randomization

One variant (rs4332136) was not available and excluded as no proxy variant (r2 > 0.8) was available. Genetic effects were aligned to the lipid-increasing allele and alignment checked by comparing the minor allele frequencies reported by BCAC and GLGC. Palindromic variants that could not be unambiguously aligned and multi-allelic variants with different effect and reference alleles in BCAC and GLCL were removed.

We used inverse variance-weighted, Egger and weighted median MR to assess causal effects of lipid fractions. The inverse variance-weighted method regresses genetic associations with the outcome on associations with lipid levels and fixes the intercept at zero. In the absence of directional pleiotropy, it provides robust causal estimates38. MR Egger allows free estimation of the intercept, although further assumptions, such as the independence between instrument strength and direct effects, cannot be easily verified. A statistically significant intercept term implies the presence of unbalanced pleiotropy and causal estimates in MR Egger are less precise than those in inverse variance-weighted MR39. Weighted median MR allows some variants to be invalid instruments provided at least half are valid instruments. It uses inverse variance weights and bootstrapping to estimate CIs40. Figure 1 provides an overview of the methods used in this study. For drug target MR with variants in moderate linkage disequilibrium (r2 < 0.4), we implemented the inverse variance-weighted and Egger methods with explicit modelling of correlations between genetic variants according to the method suggested Burgess et al.41 as implemented in the MendelianRandomization software in R42. To assess heterogeneity between individual genetic variants’ estimates, we used the Egger intercept test43, the Q′ heterogeneity statistic44 and the MR pleiotropy residual sum and outlier (MR-PRESSO)45 test. The Q′ statistic uses modified 2nd order weights that are a derivation of a Taylor series expansion and take into account uncertainty in both numerator and denominator of the instrumental variable ratio (this eases the no-measurement-error, NOME, assumption)44. The MR-PRESSO framework relies on the regression of variant-outcome associations on variant-exposure associations and implements a global heterogeneity test by comparing the observed distance (residual sums of squares) of all variants to the regression line with the distance expected under the null hypothesis of no pleiotropy45. In case of evidence of horizontal pleiotropy, the test compares individual variants’ expected and observed distributions to identify outlier variants. We used an implementation of MR-PRESSO in R (https://github.com/rondolab/MR-PRESSO) with default parameters to (i) test for global heterogeneity; (ii) if significant at P < 0.05 identify and remove outliers; and (iii) retest to evaluate if outlier removal had resolved heterogeneity. We consider as results causal estimates that agree in direction and magnitude across MR methods, pass nominal significance in inverse variance-weighted MR, and do not show evidence of bias from horizontal pleiotropy in heterogeneity tests. Analyses were carried out with the MendelianRandomization42, TwoSampleMR and MR-PRESSO45 packages in R version 3.3.2 (2016-10-31).

Power

We used mRnd (http://cnsgenomics.com/shiny/mRnd/) for post-hoc power calculations. At an alpha level of 0.05, we estimated 80% power to detect causal effects on breast cancer risk per standard deviation increment in lipid level of OR 1.06 (LDL-cholesterol), OR 1.07 (HDL-cholesterol) and OR 1.07 (triglycerides). The corresponding estimates for ER-positive breast cancer were OR 1.06 (LDL-cholesterol), OR 1.07 (HDL-cholesterol) and OR 1.08 (triglycerides); and the estimates for ER-negative breast cancer were OR 1.10 (LDL-cholesterol), OR 1.11 (HDL-cholesterol) and OR 1.12 (triglycerides), respectively.

Sample overlap

Participant overlap between the samples used to estimate genetic associations with the exposure and the outcome, respectively, in two-sample MR can bias results46. A careful comparison of the samples included BCAC and GLGC showed one common cohort (EPIC), which accounted for 2.9% of cases and 3.3% of control persons in BCAC, and for 1.7% (1.0% if only considering women) of participants in GLGC. Based on a simulation study of the association between sample overlap and the degree of bias in instrumental variable analysis46, this degree of overlap (<5%) is unlikely to influence results in a meaningful way.

Code availability

The analysis code in R is available on request and all data displayed in figures are available in Supplementary Tables 17.

Electronic supplementary material

Supplementary Information (618.1KB, pdf)
Peer Review File (293.4KB, pdf)

Acknowledgements

The breast cancer genome-wide association analyses were supported by the Government of Canada through Genome Canada and the Canadian Institutes of Health Research, the ‘Ministère de l’Économie, de la Science et de l’Innovation du Québec’ through Genome Québec and Grant PSR-SIIRI-701, The National Institutes of Health (U19 CA148065, X01HG007492), Cancer Research UK (C1287/A10118, C1287/A16563, C1287/A10710) and The European Union (HEALTH-F2-2009-223175 and H2020 633784 and 634935). All studies and funders are listed in Michailidou et al.26. Key software packages and analysis code were sourced from https://cran.r-project.org/web/packages/MendelianRandomization/index.html; https://github.com/MRCIEU/TwoSampleMR; and https://github.com/rondolab/MR-PRESSO. J.Ä. was supported by a grant from the Swedish Research Council (2012-2215). C.N. was supported by EFSD/Lilly (European Foundation for the Study of Diabetes Young Investigator Programme).

Author contributions

C.N. conceived of and designed the study. C.N. analysed the data, wrote the first draft of the manuscript and is the guarantor of the study. J.Ä. contributed funding and critically revised the manuscript.

Data availability

All summary genetic association data used in this study are available online, GLGC (http://lipidgenetics.org/) and BCAC (http://bcac.ccge.medschl.cam.ac.uk/).

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information accompanies this paper at 10.1038/s41467-018-06467-9.

References

  • 1.Harbeck N, Gnant M. Breast cancer. Lancet. 2017;389:1134–1150. doi: 10.1016/S0140-6736(16)31891-8. [DOI] [PubMed] [Google Scholar]
  • 2.Global Burden of Disease Cancer Collaboration Global, regional, and national cancer incidence, mortality, years of life lost, years lived with disability, and disability-adjusted life-years for 32 cancer groups, 1990 to 2015: a systematic analysis for the global burden of disease study. JAMA Oncol.3, 524–548 (2017). [DOI] [PMC free article] [PubMed]
  • 3.Mehta LS, et al. Cardiovascular disease and breast cancer: where these entities intersect: a scientific statement from the American Heart Association. Circulation. 2018;137:e30–e66. doi: 10.1161/CIR.0000000000000556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Abramson HN. The lipogenesis pathway as a cancer target. J. Med. Chem. 2018;54:5615–5638. doi: 10.1021/jm2005805. [DOI] [PubMed] [Google Scholar]
  • 5.Touvier M, et al. Cholesterol and breast cancer risk: a systematic review and meta-analysis of prospective studies. Br. J. Nutr. 2015;114:347–357. doi: 10.1017/S000711451500183X. [DOI] [PubMed] [Google Scholar]
  • 6.Ni H, Liu H, Gao R. Serum lipids and breast cancer risk: a meta-analysis of prospective cohort studies. PloS One. 2015;10:e0142669. doi: 10.1371/journal.pone.0142669. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Chandler PD, et al. Lipid biomarkers and long-term risk of cancer in the Women’s Health Study. Am. J. Clin. Nutr. 2016;103:1397–1407. doi: 10.3945/ajcn.115.124321. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.His M, et al. Prospective associations between serum biomarkers of lipid metabolism and overall, breast and prostate cancer risk. Eur. J. Epidemiol. 2014;29:119–132. doi: 10.1007/s10654-014-9884-5. [DOI] [PubMed] [Google Scholar]
  • 9.Benn M, Tybjaerg-Hansen A, Stender S, Frikke-Schmidt R, Nordestgaard BG. Low-density lipoprotein cholesterol and the risk of cancer: a Mendelian randomization study. J. Natl. Cancer Inst. 2011;103:508–519. doi: 10.1093/jnci/djr008. [DOI] [PubMed] [Google Scholar]
  • 10.Islam MM, et al. Exploring association between statin use and breast cancer risk: an updated meta-analysis. Arch. Gynecol. Obstet. 2017;296:1043–1053. doi: 10.1007/s00404-017-4533-3. [DOI] [PubMed] [Google Scholar]
  • 11.Undela K, Srikanth V, Bansal D. Statin use and risk of breast cancer: a meta-analysis of observational studies. Breast Cancer Res. Treat. 2012;135:261–269. doi: 10.1007/s10549-012-2154-x. [DOI] [PubMed] [Google Scholar]
  • 12.Cholesterol Treatment Trialists' Collaboration Lack of effect of lowering LDL cholesterol on cancer: meta-analysis of individual data from 175,000 people in 27 randomised trials of statin therapy. PloS ONE7, e29849 (2012). [DOI] [PMC free article] [PubMed]
  • 13.Mansourian M, et al. Statins use and risk of breast cancer recurrence and death: a systematic review and meta-analysis of observational studies. J. Pharm. Pharm. Sci. 2016;19:72–81. doi: 10.18433/J3202B. [DOI] [PubMed] [Google Scholar]
  • 14.Wu QJ, et al. Statin use and breast cancer survival and risk: a systematic review and meta-analysis. Oncotarget. 2015;6:42988–43004. doi: 10.18632/oncotarget.5557. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.The HPS3/TIMI55-REVEAL Collaborative Group Effects of anacetrapib in patients with atherosclerotic vascular disease. N. Engl. J. Med. 2017;377:1217–1227. doi: 10.1056/NEJMoa1706444. [DOI] [PubMed] [Google Scholar]
  • 16.Kobberø Lauridsen B, Stender S, Frikke-Schmidt R, Nordestgaard BG, Tybjaerg-Hansen A. Using genetics to explore whether the cholesterol-lowering drug ezetimibe may cause an increased risk of cancer. Int. J. Epidemiol. 2017;46:1777–1785. doi: 10.1093/ije/dyx096. [DOI] [PubMed] [Google Scholar]
  • 17.Dullaart RPF. PCSK9 inhibition to reduce cardiovascular events. N. Engl. J. Med. 2017;376:1790–1791. doi: 10.1056/NEJMe1703138. [DOI] [PubMed] [Google Scholar]
  • 18.Smith GD, Ebrahim S. ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? Int. J. Epidemiol. 2003;32:1–22. doi: 10.1093/ije/dyg070. [DOI] [PubMed] [Google Scholar]
  • 19.Schmidt AF, et al. PCSK9 genetic variants and risk of type 2 diabetes: a Mendelian randomisation study. Lancet Diabetes Endocrinol. 2017;5:97–105. doi: 10.1016/S2213-8587(16)30396-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Burgess S, Bowden J, Fall T, Ingelsson E, Thompson SG. Sensitivity analyses for robust causal inference from Mendelian randomization analyses with multiple genetic variants. Epidemiology. 2017;28:30–42. doi: 10.1097/EDE.0000000000000559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Plenge RM, Scolnick EM, Altshuler D. Validating therapeutic targets through human genetics. Nat. Rev. Drug Discov. 2013;12:581–594. doi: 10.1038/nrd4051. [DOI] [PubMed] [Google Scholar]
  • 22.Guo Y, et al. Genetically predicted body mass index and breast cancer risk: Mendelian randomization analyses of data from 145,000 women of European descent. PLoS Med. 2016;13:e1002105. doi: 10.1371/journal.pmed.1002105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Gao C, et al. Mendelian randomization study of adiposity-related traits and risk of breast, ovarian, prostate, lung and colorectal cancer. Int. J. Epidemiol. 2016;45:896–908. doi: 10.1093/ije/dyw129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Orho-Melander M, et al. Blood lipid genetic scores, the HMGCR gene and cancer risk: a Mendelian randomization study. Int. J. Epidemiol. 2017;47:495–505. doi: 10.1093/ije/dyx237. [DOI] [PubMed] [Google Scholar]
  • 25.Willer CJ, et al. Discovery and refinement of loci associated with lipid levels. Nat. Genet. 2013;45:1274–1283. doi: 10.1038/ng.2797. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Michailidou K, et al. Association analysis identifies 65 new breast cancer risk loci. Nature. 2017;551:92–94. doi: 10.1038/nature24284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.His M, et al. Associations between serum lipids and breast cancer incidence and survival in the E3N prospective cohort study. Cancer Causes Control. 2017;28:77–88. doi: 10.1007/s10552-016-0832-4. [DOI] [PubMed] [Google Scholar]
  • 28.Borgquist S, et al. Apolipoproteins, lipids and risk of cancer. Int. J. Cancer. 2016;138:2648–2656. doi: 10.1002/ijc.30013. [DOI] [PubMed] [Google Scholar]
  • 29.Kucharska-Newton AM, et al. HDL-cholesterol and incidence of breast cancer in the ARIC cohort study. Ann. Epidemiol. 2008;18:671–677. doi: 10.1016/j.annepidem.2008.06.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Danilo C, Frank PG. Cholesterol and breast cancer development. Curr. Opin. Pharmacol. 2012;12:677–682. doi: 10.1016/j.coph.2012.07.009. [DOI] [PubMed] [Google Scholar]
  • 31.Nelson ER, Chang Cy, McDonnell DP. Cholesterol and breast cancer pathophysiology. Trends Endocrinol. Metab. 2014;25:649–655. doi: 10.1016/j.tem.2014.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Schmidt, A. F. et al. Phenome-wide association analysis of LDL-cholesterol lowering genetic variants in PCSK9. Preprint at bioRxiv10.1101/329052 (2018). [DOI] [PMC free article] [PubMed]
  • 33.Green A, et al. Incidence of cancer and mortality in patients from the Simvastatin and Ezetimibe in Aortic Stenosis (SEAS) trial. Am. J. Cardiol. 2014;114:1518–1522. doi: 10.1016/j.amjcard.2014.08.016. [DOI] [PubMed] [Google Scholar]
  • 34.Peto R, et al. Analyses of cancer data from three ezetimibe trials. N. Engl. J. Med. 2008;359:1357–1366. doi: 10.1056/NEJMsa0806603. [DOI] [PubMed] [Google Scholar]
  • 35.Taylor AE, et al. Mendelian randomization in health research: using appropriate genetic variants and avoiding biased estimates. Econ. Hum. Biol. 2014;13:99–106. doi: 10.1016/j.ehb.2013.12.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Do R, et al. Common variants associated with plasma triglycerides and risk for coronary artery disease. Nat. Genet. 2013;45:1345–1352. doi: 10.1038/ng.2795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Machiela MJ, Chanock SJ. LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants. Bioinformatics. 2015;31:3555–3557. doi: 10.1093/bioinformatics/btv402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Burgess S, Scott RA, Timpson NJ, Davey Smith G, Thompson SG. Using published data in Mendelian randomization: a blueprint for efficient identification of causal risk factors. Eur. J. Epidemiol. 2015;30:543–552. doi: 10.1007/s10654-015-0011-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Bowden J, Davey Smith G, Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int. J. Epidemiol. 2015;44:512–525. doi: 10.1093/ije/dyv080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Bowden J, Davey Smith G, Haycock PC, Burgess S. Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genet. Epidemiol. 2016;40:304–314. doi: 10.1002/gepi.21965. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Burgess S, Dudbridge F, Thompson SG. Combining information on multiple instrumental variables in Mendelian randomization: comparison of allele score and summarized data methods. Stat. Med. 2016;35:1880–1906. doi: 10.1002/sim.6835. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Yavorska OO, Burgess S. MendelianRandomization: an R package for performing Mendelian randomization analyses using summarized data. Int. J. Epidemiol. 2017;46:1734–1739. doi: 10.1093/ije/dyx034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Bowden J, et al. A framework for the investigation of pleiotropy in two-sample summary data Mendelian randomization. Stat. Med. 2017;36:1783–1802. doi: 10.1002/sim.7221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Bowden, J. et al. Improving the accuracy of two-sample summary data Mendelian randomization: moving beyond the NOME assumption. Preprint at bioRxiv10.1101/159442 (2018). [DOI] [PMC free article] [PubMed]
  • 45.Verbanck M, Chen CY, Neal B, Do R. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat. Genet. 2018;50:693–698. doi: 10.1038/s41588-018-0099-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Burgess S, Davies NM, Thompson SG. Bias due to participant overlap in two-sample Mendelian randomization. Genet. Epidemiol. 2016;40:597–608. doi: 10.1002/gepi.21998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Barter PJ, et al. Effects of torcetrapib in patients at high risk for coronary events. N. Engl. J. Med. 2007;357:2109–2122. doi: 10.1056/NEJMoa0706628. [DOI] [PubMed] [Google Scholar]
  • 48.Schwartz GG, et al. Effects of dalcetrapib in patients with a recent acute coronary syndrome. N. Engl. J. Med. 2012;367:2089–2099. doi: 10.1056/NEJMoa1206797. [DOI] [PubMed] [Google Scholar]
  • 49.Lincoff AM, et al. Evacetrapib and cardiovascular outcomes in high-risk vascular disease. N. Engl. J. Med. 2017;376:1933–1942. doi: 10.1056/NEJMoa1609581. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information (618.1KB, pdf)
Peer Review File (293.4KB, pdf)

Data Availability Statement

All summary genetic association data used in this study are available online, GLGC (http://lipidgenetics.org/) and BCAC (http://bcac.ccge.medschl.cam.ac.uk/).


Articles from Nature Communications are provided here courtesy of Nature Publishing Group

RESOURCES