Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Feb 1.
Published in final edited form as: Mol Cancer Res. 2018 Oct 24;17(2):457–468. doi: 10.1158/1541-7786.MCR-18-0946

Frequent ESR1 and CDK Pathway Copy Number Alterations in Metastatic Breast Cancer

Ahmed Basudan 1,2,3, Nolan Priedigkeit 2,4, Ryan J Hartmaier 2,*, Ethan S Sokol 5, Amir Bahreini 2,6, Rebecca J Watters 7,8, Michelle M Boisen 2,9,10, Rohit Bhargava 2,11, Kurt R Weiss 7,12, Maria M Karsten 13, Carsten Denkert 13, Jens-Uwe Blohmer 13, Jose P Leone 14, Ronald L Hamilton 11, Adam M Brufsky 2,15, Esther Elishaev 2,10,11, Peter C Lucas 11, Adrian V Lee 1,2,8, Steffi Oesterreich 2,8
PMCID: PMC6359977  NIHMSID: NIHMS1510051  PMID: 30355675

Abstract

DNA sequencing has identified a limited number of driver mutations in metastatic breast cancer beyond single base-pair mutations in the estrogen receptor (ESR1). However, our previous studies and others have observed that structural variants, such as ESR1 fusions, may also play a role. Therefore, we expanded upon these observations by performing a comprehensive and highly sensitive characterization of copy number (CN) alterations in a large clinical cohort of metastatic specimens. NanoString DNA hybridization was utilized to measure CN gains, amplifications and deletions of 67 genes in 108 breast cancer metastases, and in 26 cases, the patient-matched primary tumor. For ESR1, a copyshift algorithm was applied to identify CN imbalances at exon-specific resolution and queried large datasets (>15,000 tumors) that had previously undergone next-generation sequencing (NGS). Interestingly, a subset of ER+ tumors showed increased ESR1 CN (11/82, 13%); three had CN amplifications (4%) and eight had gains (10%). Increased ESR1 CN was enriched in metastatic specimens versus primary tumors and this was orthogonally confirmed in a large NGS dataset. ESR1-amplified tumors showed a site-specific enrichment for bone metastases and worse outcomes than non-amplified tumors. No ESR1 CN amplifications and only one gain was identified in ER− tumors. ESR1 copyshift was present in five out of the 11 ESR1-amplified tumors. Other frequent amplifications included ERBB2, GRB7 and cell cycle pathway members CCND1 and CDK4/6, which showed mutually exclusivity with deletions of CDKN2A, CDKN2B, and CDKN1B.

Keywords: metastases, breast cancer, copy number, estrogen receptor, amplification, cyclin dependent kinase

Background

Breast cancer is a genetic disease driven by accumulations of single nucleotide mutations and structural alterations [1]. The latter include gene fusions, deletions, tandem duplications, and copy number (CN) amplifications as seen in ERBB2 (HER2), CCND1, and MYC. Numerous studies have shown varying levels of estrogen receptor (ERalpha) gene (ESR1) CN amplifications in primary breast cancer. In 2007, Holst et al used fluorescence in situ hybridization (FISH) and identified ESR1 CN amplifications (20.6%) and gains (15.3%) in primary breast tumors [2]. Subsequently, multiple groups reported ESR1 CN amplifications at much lower frequency (0–10%), using FISH and array comparative genomic hybridization (aCGH) [38]. Analysis of The Cancer Genome Atlas (TCGA) and Molecular Taxonomy of Breast Cancer International Consortium (METABRIC) datasets using cBioPortal [911] revealed that ESR1 CN amplifications are present in only 2.5% and 2.3% of breast cancer samples heavily enriched for primary disease, respectively. More recently, Desmedt et al showed ESR1 CN gains and corresponding increased ESR1 mRNA levels in 25% of primary invasive lobular breast cancers (ILC) [12]. Thus, the true frequency as well as the relevance of these ESR1 CNV events, particularly in recurrent and metastatic disease, has yet to be determined.

The landscape of structural variants in metastatic breast cancer (MBC) is largely unknown, particularly after therapy resistance. Gene rearrangements and CN amplifications of ESR1, FOXA1, CYP19A1, and ERBB2 have been reported in relatively small cohorts of endocrine resistant breast cancers [1316]. Li et al [17] originally described an ESR1 fusion arising in a patient-derived xenograft (PDX) model and we have recently identified nine additional ESR1 fusions in advanced disease [18]. Importantly, the location between the N-terminal ESR1 gene and the C-terminal fusion partner is often conserved between ESR1 exons 6 and 7. Gene fusion events are often associated with CN alterations adjacent to the fusion junction that results in an imbalance of DNA CN between the breakpoint flanking exons [19,20]. We therefore designed and applied a computational method (copyshift) that identifies CN imbalances and found imbalances were significantly enriched in ER+ metastases compared to primary breast tumors [18].

Despite overwhelming evidence for a critical role of ESR1 mutations and increasing evidence for ESR1 fusions in endocrine resistant breast cancer, a comprehensive study of ESR1 CN variations in recurrent and metastatic disease is lacking. In this study, we set out to characterize ESR1 CN alterations with exon-level resolution in a large cohort of well curated metastatic lesions using a highly sensitive nanoString-based approach. Additionally, we assessed the CN alteration landscape of 66 genes with known roles in breast cancer progression and therapy resistance.

Materials and methods

Sample collection

We obtained 141 formalin-fixed paraffin-embedded (FFPE) and frozen sections from eligible metastatic cases from the University of Pittsburgh Health Sciences Tissue Bank (HSTB) and Charite Universitätsmedizin, Berlin. Collection and analysis of specimens was approved under the University of Pittsburgh IRB and Charite Universitätsmedizin Berlin guidelines. When available, patient-matched treatment naïve primary tumors were included in the study (n=26). Seven samples were excluded from the study due to unknown ER status. In total, we had 134 samples from 100 patients that met the study criteria (82 ER+ and 52 ER−), including 26 primary tumors (15 ER+ and 11 ER−) and 108 metastases (67 ER+ and 41 ER−). In some cases, multiple primary tumors or metastases were from the same patient. More than 85% of the samples had tumor cellularity > 40%, that was determined by surface area of tumor cells. If it was lower than 30–40%, we macrodissected the tumor area. Therefore, we circled an area of interest on the slide (i.e., a subregion of the slide that contains the highest concentration of invasive cancer), and from within this region, we estimated the percentage of nuclei that are tumor cell nuclei.

Clinicopathological characteristics are provided in Additional file 1: Supplementary Table S1. We collected information on ER status, source of tissue, site of metastasis, histologic subtype. A subset of the samples has recently been used in another, unrelated study [21] (indicated in Supplementary Figure S1).

Immunohistochemistry (IHC)

Hematoxylin and eosin (H&E) staining procedures were performed at the Histology and Micro-imaging Core (HMC) facility at Magee-Women’s Research Institute (MWRI). Analysis of histological classification was performed by two pathologists (PL and EE) (Additional file 1: Supplementary Table S1).

DNA isolation

DNA was isolated from 4–6 FFPE macrodisected sections (10 μm) per sample depending on tumor size and cellularity using the Qiagen AllPrep FFPE kit (cat# 80234) as per manufacturer’s instructions. DNA from frozen samples was extracted using the Qiagen DNeasy kit (cat# 69506) according to the manufacturer’s instructions. All DNA quantifications were done using Qubit dsDNA HS/BR assay kits (ThermoFisher).

Control samples for nanoString and Digital droplet PCR (ddPCR) comparison

We isolated DNA from MCF-7 long-term estrogen-deprived (LTED) cells, BT-474, and MCF10A cells for validation of CN calls. MCF-7 LTED cells have ESR1 amplifications [17] and BT-474 cells have been described to have a heterozygous ESR1 deletion (Cancer Cell Line Encyclopedia (CCLE)) [22]. MCF10A is a non-tumorigenic epithelial cell line with normal (2N) ESR1 CN [23].

Digital droplet PCR (ddPCR)

Primers and probes were designed and ordered through Integrated DNA Technologies (IDT) for ESR1 and Bio-Rad for two reference genes recommended by the manufacturer (EIF2C and AP3B1) (Additional file 1: Supplementary Table S2). 60 ng of control samples were processed for ddPCR analysis as previously described [24]. Briefly, DNA samples were combined with primers, probes, and supermix, and then added to cartridge. Droplets were generated using Biorad QX100 Droplet Generator and transferred into a 96-well plate for PCR amplification, and a droplet reader (Biorad QX100 Droplet Reader) was used to count PCR+ve and PCR-ve droplets. Data were analyzed using QuantaSoft™ software (Bio-Rad) where target concentration was normalized to reference concentration and multiplied by the number of reference loci in the genome (assumed to be 2) to generate copy number calls.

nanoString

We designed 100bp DNA hybridization probes for a total of 67 genes (Additional file 1: Supplementary Table S3), including ESR1 (n=10 probes) (Additional file 2: Supplementary Figure S1), and 66 genes with known roles in breast cancer development and progression (n=3 probes per gene). For the latter, we queried TCGA breast cancer dataset and previous breast cancer metastasis studies [2528], with a focus on potentially druggable genes as identified in drug gene interaction database (DGIdb) [29]. Processing of CN data was performed as per manufacturer recommendations. Briefly, DNA samples were fragmented at 37 ⁰C using Alu1 restriction enzyme, denatured at 95 ⁰C, and hybridized overnight with target probes. Post-hybridization sample processing was done using the automated nCounter Prep Station. Raw counts were then collected from the nCounter Digital Analyzer and transferred to the nSolver™ software (v 2.5) for data analysis [30]. Raw counts were normalized to 10 invariant reference probes (intra-sample count normalization) and to reference samples pools of normal breast FFPE (n=13) and frozen (n=4) DNA to generate CN calls (inter-sample normalization; Additional file 1: Supplementary Tables S3-S5). Average CN estimate value was calculated per gene based on all probes for that gene relative to the CN estimate in the normal sample pool. We used the 67 genes of the nanoString code set for comprehensive quality control assessment of copy number calls between FFPE and cell pellet DNA.

Preparation and sequencing of RNA-seq libraries

RNA sequencing data was available for a subset of 66 samples (46 ER+), and was used to associate correlation between ESR1 CN and mRNA expression. TruSeq RNA Access library preparation (illumina) and sequencing was performed at the sequencing core facility at Children’s Hospital of University of Pittsburgh Medical Center (UPMC) using the NextSeq500 platform that produced paired-end reads (2X75bp). Gene expression values for ESR1 were calculated using counts per million normalized to trimmed-means of M-values (TMM-normalized CPM) using DESeq2 and edgeR packages in R [31,32].

Genomic copy number (CN) analysis of RNAseq data

Detection of chromosomal aberrations was based on measuring allelic bias in RNAseq data using the package eSNP-Karyotyping in R [33]. Briefly, we first called variants using GATK package Haplotype caller [34]. Variants were then ordered according to their genomic coordinates and the ratio between major and minor alleles was calculated. To increase confidence, variants with coverage below 20 reads and frequency below 0.2 were excluded from the analysis. Statistical significance was calculated with a one tailed t-test comparing the SNPs major/minor ratio values in each window with the total SNP pool and false discovery rate (FDR) correcting for multiple testing. Moving median plots were generated using windows of 151 SNPs as suggested by the package developers.

ESR1 CN validation datasets

Comprehensive genomic profiling was performed on breast tumors in a CLIA-certified, New York State- and CAP-accredited laboratory (Foundation Medicine [FM], Cambridge, MA) [35,36]. In addition, tumor data from the American Association for Cancer Research (AACR) GENIE and Memorial Sloan Kettering Cancer Center (MSKCC) was accessed and analyzed through cBioPortal [37,38]. Statistical analyses with p value significance of 0.05 were performed in R and GraphPad, and figures were plotted using ggplot2.

Statistical and bioinformatics analysis

R environment was used for statistical computing and graphics [39]. Oncoprints visualizing multiple genomic alterations were generated using ComplexHeatmap package [40]. Given the important role of ESR1 in breast cancer development and progression, and prior evidence for low level amplifications, copy number changes of ESR1 were a major focus of our study. Thus, we used 10 probes to cover the ESR1, and only 2–3 probes for the other 66 genes. ESR1 CN increase by 35% (CN ≥2.7) and decrease by 50% (CN ≤ 1) were considered gains/amplifications and deletions, respectively. CN calls above ≥10 were considered high amplifications. Giving the lower resolution for other genes in the panel, we used one CN cutoff of ≥5 as amplification. Plots and heatmap of actual CN imbalance and shifts were generated using ggplot2 and heatmap.3 packages, respectively [41,42]. For CN imbalance, we reported only shifts of ≥ 30% difference in CN (ratio of ≥ 1.3) between ESR1 5’ and 3’ exons, which is similar to the 35% CN increase used to define gains. Multiple correlations between the different ESR1 exons and mRNA expression were clustered by first principal component (FPC) scores. Spearman and Pearson correlations, Fisher’s exact, and Wilcoxon rank sum tests were performed in R and GraphPad Prism with significance cutoff of 0.05.

Results

Measurement of DNA CN changes using nanoString technology

We set out to determine which platform (ddPCR or nanoString) is best suited for quantifying CN alterations. Using ddPCR, we determined ESR1 CN status in MCF7-LTED, BT-474, and MCF10A cells, with EIF2C and AP3B1 reference probes. Our CN calls were consistent between the two probes for the ESR1 CN normal (MCF10A) and deletion (BT-474) models, but not MCF7-LTED cells, that have a known ESR1 amplification (Figure 1A and Additional file 1: Supplementary Table S6). These data suggested that use of two reference probes was not sufficient for accurate measurement of CN. Next, we measured ESR1 CN using nanoString technology, where ten reference probes were included in the library. This analysis correctly identified CN in the three control cell lines (Figure 1C). We then performed quality control experiments to test sensitivity and reproducibility of the CN calls within and between the nanoString runs using DNA from fresh cell pellets and from processed FFPE sections. There was a high correlation between DNA isolated from fresh and fixed samples (Figure 1B-C, and Additional file 1: Supplementary Table S8; rho > 0.9, p < 2.2e-16) and excellent reproducibility between different nanoString runs using the same DNA (Figure 1D-E and Table S9; rho > 0.99, p < 2.2e-16). We therefore proceeded using nanoString technology for the characterization of CN alterations in a large set of well curated breast tumors.

Figure 1: nanoString and ddPCR CN analysis in control cell lines.

Figure 1:

A. ddPCR ESR1 copy number analysis of MCF7-LTED, BT-474, and MCF10A cell lines that represent amplification, deletion, and normal copy number models, respectively. The y-axis represents CN status for the ESR1 gene that was normalized separately using two reference genes normalizers (AP3B1 and EIF2C). Error bars of the droplet Poisson distribution for the 95% Confidence Interval are indicated for each data point. B-C. Comparison of nanoString CN calls from FFPE vs immediately processed high quality (HQ) samples of the same cell lines (left; BT474, right; MCF7LTED) showed very high correlation (Pearson’s rho > 0.9, p < 2.2e-16). Data points at extreme ends (i.e. very low or very high expressed genes, n=6) are not included in the graph for better visualization of data. Plots of 61 genes and invariant controls shows highly correlated absolute CN calls for HQ and FFPE DNA of each cell line (top; BT474, bottom; MCF7LTED). Y-axis represent CN calls for each gene. D-E. MCF7LTED HQ and FFPE DNA was analyzed twice within the same run (D) and in two separate runs (E) to assess reproducibility. Reproducibility was very high within and between the runs (Pearson’s rho > 0.99, p < 2.2e-16). CN data points shown as log2 normalized counts.

Frequent ESR1 amplifications in metastatic breast cancer

Our probe design targeted the ESR1 promoter region, untranslated exons (E1 and E2), and coding exons (E3-E10) (Additional file 2: Supplementary Figure S1). Given the high resolution of our coverage, we were able to call low level amplifications/gain (CN ≥ 2.7), in addition to amplification (CN ≥ 10). We measured ESR1 CN in a total of 134 tumor samples (82 ER+ and 52 ER−) (Figure 2). When averaging CN calls from all ESR1 probes, we detected ESR1 CN alterations in eleven (13.4%) out of 82 ER+ tumor samples (67 metastases and 15 primary) (Figure 2A and Table 1). Specifically, eight (9.8%) samples had CN gains and three (3.7%) samples had amplifications. There was a trend for enrichment of ESR1 CN alterations in metastatic samples (14.9%) compared to primary samples (6.6%). Bone metastases showed significant enrichment for ESR1 CN alterations vs primary breast cancer and ovarian metastases (Figure 2B and Table 1; p <0.05).

Figure 2: Frequent ESR1 amplifications in metastatic breast cancer.

Figure 2:

A. Oncoprint visualization of ESR1 copy number alterations in ER+ (top) and ER− (bottom) samples. Levels of amplification and deletion are color coded. Each column represents a single sample (sample IDs labelled as: grey; primary tumor, black; metastasis). Each row indicates copy number call of the correspondent single exon probe. Untranslated exons (E1 and E2) are annotated with ‘*’ symbol. P; Promoter probe. ESR1_ave: average copy number call of all probes. B. Distribution of ESR1 copy number amplifications by site (primary, brain, bone, GI, and ovaries as indicated by arrows) and ER status (color coded). Bone metastases showed significant enrichment for amplifications vs primary and ovaries (Fisher’s exact test, p <0.05). C. Single pair where the brain metastasis had higher level ESR1 amplifications compared to its primary tumor. D. Graphical representation (left; actual copy number) and heatmap (right; ratio) of 5’−3’ copy number imbalance among the ESR1 amplified samples. The dark red color indicates higher copy number amplification toward the 5’ side of ESR1 (Wilcoxon matched pairs signed-rank test p = 0.0024).

Table 1:

ESR1 copy number alterations by site in ER+ samples

Site N n (%)
AMP Gain Total
Primary 15 0.0 1 (6.7) 1 (6.7)
Brain 21 1 (4.8) 1 (4.8) 2 (9.5)
Bone 18 2 (11.1) 5 (27.8) 7 (38.9)*
Ovaries 22 0.0 0.0 0.0
GI 6 0.0 1 (16.7) 1 (16.7)

Total 82 3 (3.7) 8 (9.8) 11 (13.4)
*

significant enrichment for amplifications vs Primaries and ovaries (Fisher’s exact test, p <0.05)

N= number of samples included in the cohort; n and (%)= count and percentage of samples with alterations for the indicated site, respectively.

In ER− tumors (41 metastases and 11 primary), we did not detect any ESR1 amplifications, and found only one primary tumor with a CN gain (Figure 2A and Additional file 1: Supplementary Table S10). ESR1 CN alterations were significantly enriched in ER+ vs ER− tumors (Table 2; p = 0.0192).

Table 2:

ESR1 copy number alterations by ER status

ER status N n (%)
AMP Gain Total
ER+ 82 3 (3.7) 8 (9.8) 11 (13.4)*
ER− 52 0.00 1 (1.9) 1 (1.9)

Total 134 3 (2.2) 9 (6.7) 12 (9)
*

Fisher’s exact test, p =0.0192

N= number of samples included in the cohort; n and (%)= count and percentage of samples with alterations for the indicated site, respectively.

Since our cohort included 13 ER+ patient-matched primary-metastatic tumor pairs, we were able to explore if any metastatic lesions demonstrated an increase in CN as compared to their matched primary lesions. Indeed, we observed this for one pair, where the ER+ primary tumor BP51 had a ESR1 CN gain (CN 3.9), and the patient-matched brain metastases BM51 showed amplifications (CN 10.9) (Figure 2C and Additional file 2: Supplementary Figure S3).

We have recently described a novel algorithm that helps identify potential gene fusions from hybrid capture DNA sequencing based upon CN exon imbalances at fusion breakpoints [18]. The high resolution of nanoString exon-level CN detection within ESR1 allowed us to adapt this approach to determine ESR1 CN imbalance by calculating the ratio of CN signal of 5’ exon probes (3–6) to 3’ exon probes (7–10). Our analysis revealed that five out of 11 amplified samples (45.5%) exhibited ≥ 30% increase in CN (ratio of ≥ 1.3) in the 5’ exons (Figure 2D and Additional file 1: Supplementary Table s12, p= 0.0024), while no samples harbored an increase CN of 3’ exons relative to 5’ exons.

In summary, we observed enrichment of ESR1 amplifications in ER+, recurrent samples and bone metastases versus other metastatic sites. We also identified frequent ‘−3’ exonic imbalances indicating potential rearrangements in ESR1.

Correlation of ESR1 CN with ER mRNA expression

To determine whether ESR1 CN gains and amplifications were correlated with ER mRNA expression, we utilized expression data from RNAseq. As expected, we observed higher ESR1 mRNA expression in samples with ESR1 gains/amplifications (Figure 3A), although this association did not reach statistical significance, likely due to the limited number of samples in the CN gains/amplification group for which RNAseq data was available (n=3). An exploratory analysis within this group of ESR1 amplified samples showed a correlation between CN and expression, but this was also not significant (Additional file 2: Supplementary Figure S2). The first translated ESR1 exon E3 was most predictive for mRNA expression (Figure 3B and Additional file 1: Supplementary Table S13; rho=0.34, p= 0.0219) which supports the higher CN on the 5’ side.

Figure 3: Correlation of ESR1 copy number with mRNA expression.

Figure 3:

A. Box plot comparison of ESR1 mRNA expression between normal CN vs gain/amplification ER+ groups (Wilcoxon rank-sum test, p=0.233). Y-axis: TMM-normalized log2 of count per million (CPM). B. Correlation matrix plot of ESR1 copy number calls by different exons and mRNA expression. Multiple correlations were clustered by first principal component (FPC) scores. Bigger and darker blue circles represent higher correlation. Red outline represents significant p-value for correlation with mRNA expression (p < 0.05).

Validation of ESR1 CN amplifications enrichment in additional cohorts

To validate our findings of ESR1 CN amplifications, we queried three additional datasets (AACR-GENIE, MSKCC, and FM) that used sequencing to characterize breast cancer metastases [35,36,43,44]. This analysis revealed significant enrichment for ESR1 amplifications in metastatic versus primary tumors (Figure 4A). The frequency of ESR1 amplifications was 1.65% (n=27/1637), 2.28% (n=19/835), and 1.84% (n=122/6629) in the AACR-GENIE, MSKCC, and FM metastatic cohorts, respectively. Lower frequency of amplifications in these cohorts are likely due to the use of techniques less sensitive for measurement of amplifications.

Figure 4: Metastasis enrichment for ESR1 amplifications in validation datasets.

Figure 4:

A. Barplot for the percentage of ESR1 amplifications in three different genomic datasets (grey, primary; black, metastasis). Actual counts for amplified samples are indicated within each bar, with p values for metastatic enrichment on top. AACR, American Association for Cancer Research; MSKCC, Memorial Sloan Kettering Cancer Center; FM, Foundation Medicine. MSKCC is also a major contributor to the AACR_GENIE dataset. (For the FM dataset, samples denoted as “Primary” tumors can contain local recurrences since disease is defined by the site of biopsy). B. Kaplan-Meier curves for MSKCC survival data grouped by ESR1 CN status for patients with metastastic breast cancer.

In the FM cohort, where we have had access to ER status for a subset of the tumors, we observed a significant enrichment of ESR1 amplifications in ER+ (36/1272; 2.8%) compared to ER− samples (3/963; 0.3%) (p=1.9e-06). Analysis of survival data from MSKCC cohort showed significantly worse overall survival (OS) for patients with ESR1 amplified vs non-amplified tumors (p value <0.05, Figure 4B), pointing towards potential clinical relevance of the amplification.

CN alterations of known and potential breast cancer driver genes other than ESR1

In addition to ESR1, we extended our CN analysis to 66 genes with described roles in breast cancer progression and metastasis. For these genes, we had slightly lower resolution due to lower coverage (2–3 probes per gene) compared to the ESR1 study (10 probes), which did not allow for high confidence detection of low level amplifcations, and thus we limited the analysis to binary calls (yes or no CNV, defined as CN ≥ 5). Across all ER+ and ER− tumors, the genes with most frequent CN amplifications are shown in Table 3 and Additional file 2: Supplementary Figure S4. The most amplified genes were ERBB2 and GRB7, harboring amplifications of 44% and 21%, respectively, in brain and bone metastases, respectively. Previous studies have reported co-amplifications of ERBB2 and GRB7 on amplicon 17q12 [4547]. This was confirmed by our eSNP-karyotyping analysis using RNAseq data from a subset of our samples that predicts the amplification to cover a broader region rather than being limited to a focal event (Additional file 2: Supplementary Figure S5). Metastasic site-unique or enriched CN alterations were seen for a number of genes, including higher rates of FADD amplifications (17%) in bone metastasis compared to other metastatic sites, and higher rates of PTK2 and PKIA amplifications (~10–20%) in brain and GI metastases. Additionally, comparison of ER+ vs ER− brain metastases showed significant enrichment of FGFR1 amplifications in the ER+ group (Additional file 2: Supplementary Figure S6; p= 0.0221). The comparative analysis between ER+ and ER− tumors was only possible in the brain metastases cohort where we had balanced distribution of ER+ vs ER− tumors. The most frequently deleted gene was TP53 (10%) and this loss was mainly observed in brain metastases (15.4%) (Table 3 and Additional file 2: Supplementary Figure S4).

Table 3:

Most frequent copy number alterations by site

Alteration Gene n (%)

Primary All Mets Brain Bone Ovary GI
Amplifications ERBB2 9 (34.6) 31 (28.7) 23 (44.2) 5 (20.8) 2 (8.6) 1 (11.1)
GRB7 9 (34.6) 29 (26.8) 24 (46.1) 2 (8.3) 2 (8.6) 1 (11.1)
CCND1 1 (3.8) 13 (12) 6 (11.5) 5 (20.8) 2 (8.6) 0 (0)
MYC 4 (15.3) 13 (12) 8 (15.3) 4 (16.6) 0 (0) 1 (11.1)
FGF19 3 (11.5) 10 (9.2) 6 (11.5) 2 (8.3) 2 (8.6) 0 (0)
CTTN 0 (0) 9 (8.3) 5 (9.6) 2 (8.3) 2 (8.6) 0 (0)
FADD 1 (3.8) 8 (7.4) 3 (5.7) 4 (16.6) 1 (4.3) 0 (0)
PTK2 0 (0) 8 (7.4) 6 (11.5) 0 (0) 0 (0) 2 (22.2)
PKIA 0 (0) 7 (6.4) 5 (9.6) 1 (4.1) 0 (0) 1 (11.1)
FGFR1 4 (15.3) 6 (5.5) 4 (7.6) 0 (0) 2 (8.6) 0 (0)

Deletions TP53 0 (0) 11 (10.1) 8 (15.3) 1 (4.1) 1 (4.3) 1 (11.1)
AURKB 0 (0) 9 (8.3) 7 (13.4) 2 (8.3) 0 (0) 0 (0)
CDKN1B 3 (11.5) 9 (8.3) 9 (17.3) 0 (0) 0 (0) 0 (0)
CDKN2B 3 (11.5) 9 (8.3) 6 (11.5) 1 (4.1) 2 (8.6) 0 (0)
CDKN2A 4 (15.3) 7 (6.4) 5 (9.6) 1 (4.1) 1 (4.3) 0 (0)
MAP2K4 0 (0) 7 (6.4) 7 (13.4) 0 (0) 0 (0) 0 (0)
JUN 0 (0) 7 (6.4) 6 (11.5) 0 (0) 1 (4.3) 0 (0)
LSMD1 1 (3.8) 7 (6.4) 4 (7.6) 0 (0) 3 (13) 0 (0)
PTEN 2 (7.6) 6 (5.5) 6 (11.5) 0 (0) 0 (0) 0 (0)

n and (%)= count and percentage of samples with alterations for the indicated site, respectively.

Co-occurrence analysis identified gene combinations that are enriched in ER+ tumors (Figure 5A, top panel; p < 0.05). For example, we confirmed previously reported co-amplifications of genes (CCND1, CTTN, FADD, PAK1, AAMDC, and FGF19) at the 11q13 amplicon [4853], of MYC (8q24) and ERBB2 (17q12) [5456], and of NCOA3 (20q13) and MDM2 (12q15) [57] (Figure 5B). We also observed co-amplification of MDM2 and ERBB3 (12q13), which had not been previously described. Finally, there was an enrichment of MYC (45%) and CCND1 (36%) in tumors with ESR1 amplifications (Figure 5C; p < 0.05). We identified only one event of co-occurrence in ER− tumors, this involved CN amplifications of ERBB2 and GRB7 at the 17q12 amplicon (Figure 5A; bottom panel).

Figure 5: Co-occurrence and mutual exclusivity of CN alterations in metastatic breast cancer.

Figure 5:

A. Tile plots of ER+ (top) and ER− (bottom) samples for copy number amplifications co-occurrences. Gene pairs with significant co-occurrence are colored in red and marked with ‘*’ in the plot (Multiple Fisher’s exact test with FDR adjusted p <0.05). B. Circos plot for genomic co-occurrence events. Circular tracks from outside to inside: genome positions by chromosomes (black lines are cytobands); Inside arcs connects genes with co-occurrence (red; inter-chromosomal, blue; intra-chromosomal). C. Oncoprint shows enrichment of MYC (45%) and CCND1 (36%) amplifications in tumors with ESR1 amplifications (Fisher’s exact test; MYC p= 0.0083, CCND1, p= 0.0365). D. Top: Oncoprint of samples with CN alterations in the CDKs pathway and its inhibitors. Amplifications of the activators CCND1 and CDK4/6 of the pathway were mutually exclusive to the deletions of the inhibitory members CDKN1B, CDKN2A, and CDKN2B (Fisher’s exact test p < 0.0001). Bottom: RNAseq data for a subset of samples (n=46) shows significant correlation of mRNA expression with copy number status for genes of interest (p indicated for Wilcoxon rank-sum test).

Among the most frequent recurrent deletions, we identified mutually exclusive deletions of CDKN2A, CDKN2B, and CDKN1B with amplifications of CCND1 and CDK4/6 (Figure 5D; p < 0.0001). Expression analysis from RNAseq data showed significant correlation with these CN alterations (Figure 5D).

Discussion

Breast cancer is increasingly being recognized as a disease that can be driven by DNA structural variation, in addition to well described single base pair mutations. The majority of CN studies have been performed in primary tumors, at least in part due to difficulties in accessing metastatic and recurrent tissue. However, it is now well recognized that expression of key biomarkers such as ER and HER2 can change between a primary and metastatic breast cancer [58], and that this has clinical significance for therapeutic decision making.

Recent studies have documented ESR1 base pair mutations and fusions in breast cancer. ESR1 amplification has also been identified, although there have been conflicting reports on the frequency, ranging from 0 to 30% [38]. CN has been explored using FISH, aCGH, and qPCR, and sequencing is currently evolving as promising approach. Due to the potential clinical relevance, we performed a comprehensive measurement of ESR1 CN (by examining all exons) in a unique cohort of metastatic, and when available patient-matched primary tissues.

In total, we detected ESR1 gene gains or amplifications in 13% of ER+ metastatic breast tumors. Given the sensitivity and specificity of our method, and careful sampling of tumors with cellularity of at least 40%, we are confident that this is a representative number for ER+ metastatic tumors. Previously reported wide ranges in detected CN alterations are due, at least in part, to the use of technologies with limited sensitivities and specificity such as aCGH, FISH, and qPCR. Our study clearly shows PCR approaches using a limited number of reference genes need to be interpreted with caution. Additional reasons for variability include tumor cellularity and threshold for calling gain and amplification, as previously discussed [59].

We observed significant enrichment of ESR1 gains and amplification in ER+ metastatic lesions compared to the primary tumors, and we were able to validate this finding in three large datasets. In one out of 15 ER+ paired samples, ESR1 CN increased from 3.9 in the primary tumor to 10.9 in the patient-matched brain metastasis. This was not due to cellularity, since both the primary tumor and the metastatic lesion for sample #51 had similar tumor cellularities (~70%). In support of our findings, a recent study by Ullah et al showed an ESR1 amplifcation in an ER+ metastatic lesion, but not in the paired primary tumor [60]. This enrichment suggests selection of ESR1-amplified clones under endocrine treatment, similar to that described for ESR1 mutations [25,26] and HER2 amplifications in brain metastases [16]. Although somewhat limited by small numbers, we did observe a correlation between ESR1 amplifications and ESR1 mRNA expression, a finding that has previously been reported by others [2,61]. However, there are reports in which ESR1 amplification does not correlate with expression level [5,59,62], and ESR1 amplifications have been previously detected in ER− tumors with poor prognosis [63]. Of note, in our study, we observed an ESR1 gain (CN= 2.7) in only one ER− sample. Additional studies are necessary to decipher the functional consequences of low level ESR1 amplifications.

We [18] and others [17,64] have recently identified ESR1 gene fusions in metastatic breast cancer, with fusions generally maintaining the DNA binding and N-terminal transcriptional activation domains, but deleting the ligand-binding domain. We recently described a copyshift algorithm that determines imbalance in DNA CN of exons that are 5’ or 3’ to the break [18]. The high resolution of our nanoString assay allowed us to detect CN imbalances at the exon level, and we found that five out of 11 amplified samples (45.5%) showed at least 30% increase in CN at the 5’ side of ESR1. We did not have RNAseq data for four out of these five cases, and we were unable to detect an ESR1 fusion in the sample with available RNA seq data, likely due to inadequate sequencing coverage. Other mechanisms for generation of structural variants could be involved, such as fusions resulting from tandem duplications. An example is the ESR1-CCDC170 fusion [65] in which CCDC170 is truncated but the coding region of ESR1 is intact. Overall, these imbalanceles are indicative of genetic rearrangements, therefore future studies should determine whether tumors with significant exon imbalances harbor ESR1 fusion genes, and if so, which ones might be non-functional vs drivers of endocrine resistance.

The clinical relevance of ESR1 CN is unclear at this point in time. ESR1 amplifications have been associated with improved [2,59,61] as well as worse outcome [66] in patients treated with endocrine therapy. In endocrine resistant cell line models [67] and in a patient-derived xenografts (PDX) model [17] with ESR1 amplifications, estradiol treatment resulted in tumor regression. We recently described a metastatic breast cancer case with ESR1 amplifications that showed sustained partial response to high dose estradiol treatment as measured by CA 27–29 level and by decrease in liver metastasis burden [68]. These findings might deserve further exploration in a clinical trial setting including prospective measurement of ESR1 CN.

Analysis of 66 other genes revealed frequent ERBB2 and GRB7 amplifications in different metastatic sites, with an enrichment in brain metastases. These results support the increasingly growing need for testing HER2 in the metastatic settings [69,70]. Amplification of genes at the 11q13 locus has been reported in about 15% of primary breast cancer cases and is associated with poor prognosis [4853]. In our analysis, amplifications of multiple genes at this locus (CCND1, CTTN, FADD, PAK1, AAMDC, and FGF19) were also frequent in multiple metastatic sites. Intriguingly, we observed co-amplification of ERBB3 and MDM2, which has not been previously described. Given prior evidence for functional interaction between ERBB3 signaling and MDM2 complex formation, co-amplification may be positively selected in some tumors. Similar to ESR1, some amplifications also showed organ-specific enrichments suggesting CN alterations may be driving metastatic tropisms. For example, brain and GI metastases showed higher PTK2 and PKIA amplifications (~10–20%), while FADD amplifications were more frequent in bone metastases (17%). In our patient-matched samples, most of the CN alterations were maintained in the pairs except for two pairs with a slight increase in CN for PKIA, PTK2, and FGFR4 in the metastases (Additional file 2: Supplementary Figure S7). Moreover, FGFR1 amplifications were enriched in ER+ vs ER− brain metastases.

Cyclin-dependent kinases (CDKs), which control transitions through the different stages of cell cycle, have been considered as promising targets for cancer therapy. Amplifications/overexpression of CCND1 (CDK4/6 activator) and loss of CDKN2A (p16, CDK4/6 inhibitor) have been described in primary breast cancer patients [7177]. Preclinical and in vitro data across multiple cancers support that loss of CDK4/6 inhibitory members may serve as biomarkers for CDK4/6 inhibition sensitivity [7883]. On the other hand, clinical evidence in breast cancer trials performed thus far showed that CCND1 amplifications or p16 loss are unlikely to predict treatment benefit [84]. However, these trials assessing p16 as a biomarker were earlier phase with small sample size, and didn’t show if concurrent alterations in other genes existed, which can alter response to therapy. In our analysis, 32 out of 108 metastases (29%) showed aberrations in the CDKs pathway and 24 samples (22%) had alterations in the CCND1-CDK4/6 axis specifically. Among those, deletions of the CDK4/6 inhibitors CDKN2A/B were significantly mutually exclusive with CCND1 amplifications, and correlated with mRNA expression, suggesting functional consequences for the alterations. Those deletions were maintained in the paired metastases, which supports the concept of using CDK4/6 inhibitors in metastatic settings as previously shown [8588]. Moreover, it is important to note that these deletions also occur in ER− tumors and tumors with HER2 amplifications. Most clinical trials have been conducted in patients with ER+/HER2- breast cancer. Our data suggests that testing of CDK4/6 inhibitors in other subgroups (e.g. ER−) may deserve further investigation.

Conclusion

Our findings suggest that, in addition to ESR1 mutations, ESR1 amplifications and exon imbalances might play a role in endocrine resistance, but further studies are necessary to test this hypothesis. Our analysis also defines CDK pathway alterations as common in metastatic disease—which could be especially relevant to recently adopted CDK4/6 inhibitor therapy. Taken together, copy number alterations in druggable genes are common in metastatic breast cancer and could serve as biomarkers for precision therapies in advanced disease.

Supplementary Material

1
2

Implications: Copy number alterations of ESR1 and key CDK pathway genes are frequent in metastatic breast cancers, and their clinical relevance should be tested further.

Acknowledgements

This project used the UPMC HCC Tissue and Research Pathology/Health Sciences Tissue Bank shared resource, which is supported in part by award P30CA047904.

Funding

We kindly acknowledge the funding support from Breast Cancer Research Foundation (AVL, SO), National Cancer Institute of the National Institutes of Health (P30CA047904), Fashion Footwear Association of New York (FFANY), the Nicole Meloche Memorial Breast Cancer Research Foundations, the Penguins Alumni Foundation, the Shear Family Foundation, and Magee-Womens Research Institute and Foundation. AB is supported by full fellowship from KSU. AVL and SO are recipients of Scientific Advisory Council awards from Susan G. Komen for the Cure, and Hillman Foundation Fellows. NP was supported by a training grant from the NIH/NIGMS (2T32GM008424–21) and an individual fellowship from the NIH/NCI (5F30CA203095). Kurt Weiss is supported by K08 CA177927.

List of abbreviations

aCGH

array comparative genomic hybridization

CCLE

the cell cancer cell line encyclopedia

CN

copy number

ddPCR

digital droplet polymerase chain reaction

DGIdb

drug gene interaction database

ER

estrogen receptor

FDR

false discovery rate

FFPE

formalin-fixed paraffin-embedded

FISH

fluorescence in situ hybridization

IDC

invasive ductal breast cancer

IHC

immunohistochemistry

ILC

invasive lobular breast cancer

LTED

long-term estrogen-deprived

MBC

metastatic breast cancer

METABRIC

Molecular Taxonomy of Breast Cancer International Consortium

qPCR

quantitative polymerase chain reaction

TCGA

The Cancer Genome Atlas

Footnotes

Declarations

Ethics approval and consent to participate

Collection and analysis of specimens were approved under the University of Pittsburgh IRB # PRO 11100645, 14040193, 1500502, 16020301, and Charite Universitätsmedizin Berlin guidelines.

Consent for publication

Not applicable

Availability of data and materials

All data generated or analyzed during this study are included in this published article and its supplementary information files.

Competing interests

The authors declare that they have no competing interests.

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1
2

RESOURCES