Abstract
Multifocal breast cancer (MFBC), defined as multiple synchronous unilateral lesions of invasive breast cancer, is relatively frequent and has been associated with more aggressive features than unifocal cancer. Here, we aimed to investigate the genomic heterogeneity between MFBC lesions sharing similar histopathological parameters. Characterization of different lesions from 36 patients with ductal MFBC involved the identification of non‐silent coding mutations in 360 protein‐coding genes (171 tumour and 36 matched normal samples). We selected only patients with lesions presenting the same grade, ER, and HER2 status. Mutations were classified as ‘oncogenic’ in the case of recurrent substitutions reported in COSMIC or truncating mutations affecting tumour suppressor genes. All mutations identified in a given patient were further interrogated in all samples from that patient through deep resequencing using an orthogonal platform. Whole‐genome rearrangement screen was further conducted in 8/36 patients. Twenty‐four patients (67%) had substitutions/indels shared by all their lesions, of which 11 carried the same mutations in all lesions, and 13 had lesions with both common and private mutations. Three‐quarters of those 24 patients shared oncogenic variants. The remaining 12 patients (33%) did not share any substitution/indels, with inter‐lesion heterogeneity observed for oncogenic mutation(s) in genes such as PIK3CA, TP53, GATA3, and PTEN. Genomically heterogeneous lesions tended to be further apart in the mammary gland than homogeneous lesions. Genome‐wide analyses of a limited number of patients identified a common somatic background in all studied MFBCs, including those with no mutation in common between the lesions. To conclude, as the number of molecular targeted therapies increases and trials driven by genomic screening are ongoing, our findings highlight the presence of genomic inter‐lesion heterogeneity in one‐third, despite similar pathological features. This implies that deeper molecular characterization of all MFBC lesions is warranted for the adequate management of those cancers. © 2015 The Authors. Pathological Society of Great Britain and Ireland.
Keywords: breast cancer, multifocal, multicentric, targeted sequencing, genomic heterogeneity, oncogenic mutations
Introduction
Multifocal or multicentric breast cancers are traditionally defined by the presence of two or more neoplastic lesions within a single breast quadrant or within different quadrants of the same breast, respectively 1. Since these definitions do not follow any internal anatomical boundary, here we use the term multifocal breast cancer (MFBC) to denote any ipsilateral, synchronous tumours presenting with separate invasive lesions. The two largest studies so far investigating the incidence of MFBC reported multifocality in 21% and 24% of patients, respectively 2, 3. No differences between multifocal and unifocal breast cancers have been reported in the distribution of molecular subtypes defined by oestrogen receptor (ER), progesterone receptor (PR), HER2, and the basal‐like CK5/6, CK14, and EGFR markers 4, suggesting that multifocality is not enriched in a certain molecular subtype. MFBC has instead been associated with a potentially worse clinical outcome and with increased axillary nodal involvement than unifocal tumours in most but not all publications 3, 5, 6, 7, 8, 9, 10, 11, 12, 13.
Based on the rationale that lesions with similar histology and grade are expected to be biologically similar, the College of American Pathologists recommends further characterizing all MFBC foci only when histology and grade differ from one another 14. However, recent studies question this recommendation because of well‐documented intra‐patient, inter‐lesion discordances in ER and HER2 expression (up to 4.4% and 10% discrepancy, respectively) 15, 16, 17. We therefore aimed to go a step further and assessed the potential genomic differences between ductal MFBC lesions with concordant histological grade, ER, and HER2 status, which represent the majority of MFBCs. To this end, we sequenced multiple lesions from a cohort of 36 patients with MFBC using a panel of 360 cancer‐related genes and validated this analysis by ultra‐deep resequencing using an orthogonal sequencing platform. For a subset of patients, we further characterized genome‐wide structural variations to interrogate the phylogenetic relationship between the various lesions.
Materials and methods
Patient selection
Patients were retrospectively selected on the basis of the following criteria: (1) documented MFBC, defined in the pathology report as the presence of multiple synchronous ipsilateral invasive lesions in the surgical specimen separated by benign breast tissue; (2) ductal histology; (3) tumour cellularity greater than 40%; (4) availability of two or more MFBC lesions, from which a minimum amount of 700 ng of double‐stranded DNA (dsDNA) could be extracted; (5) similar histological grade, ER, and HER2 status following central pathology review of the different lesions of the same patient; (6) no neo‐adjuvant treatment; and (7) availability of germline DNA, derived from whole blood or a tumour‐free axillary lymph node.
All 36 patients were diagnosed and operated on at Institut Jules Bordet or Ospedale Morgagni–Pierantoni (Forli, Italy) between 2000 and 2013. Eight of the 36 patients were selected for a whole‐genome rearrangement screen based on the availability of a frozen sample of at least two invasive lesions, from which a minimum of 3 µg of dsDNA could be extracted (Supplementary Tables 1 and 2), the remaining tissue samples used for this project all being formalin‐fixed, paraffin‐embedded (FFPE). The project was approved by the internal ethics committees of the two contributing hospitals (internal numbers and dates: 1748‐19/08/2010 and 2028‐11/10/2012) and conducted according to the principles of the Declaration of Helsinki.
Targeted gene screen
The exonic regions of 360 cancer‐related genes (Supplementary Table 3) were enriched using in‐solution RNA baits (SureSelect, Agilent, UK) and sequenced on an Illumina HiSeq 2000 instrument at the Wellcome Trust Sanger Institute. Samples from eight patients were processed with an earlier version of the bait design, lacking 46 of those 360 genes (Supplementary Tables 2 and 3). Seven hundred nanograms to one microgram of DNA was fragmented to an average insert size of 145 bp (75–300) and subjected to Illumina DNA sequencing library preparation using the Bravo Automated Liquid Handling Platform. Individual samples were indexed using a unique DNA barcode via six cycles of PCR. Equimolar pools of 16 libraries were prepared and hybridized to custom RNA baits following the Agilent SureSelect protocol and sequenced on the Illumina HiSeq devices using the 75‐base pair paired‐end protocol. Somatic base substitutions and small insertions or deletions were identified by comparison with the matched normal sample using established bioinformatic algorithms 18, 19.
All mutations identified in at least one sample from a patient were further interrogated by deep resequencing using the Life Technologies Ion Torrent™ technology and confirmed to be present when the frequency of mutated reads was ≥ 5%, as in ref 20. Mutations with a frequency ≥ 1% but < 5% were defined as present only if there was at least one other sample in the same patient reporting the same mutation with a mutant allele fraction ≥ 5%. In brief, samples were analysed with heavily multiplexed primer pools, encompassing a total of 295 amplicons, designed with AmpliSeq® Designer v 3.0.1 (Life Technologies Inc, Carlsbad, CA, USA). Targeted resequencing was performed by barcoding samples on 318™ chips and running the analyses on a PGM™ device (Life Technologies Inc), using the Torrent Suite pipeline and default parameters for FFPE material.
The sequencing data are available through the EBI's European Genome–phenome Archive under accession number EGAD00001001041.
Identification of oncogenic mutations
Mutations were classified in one of the following categories according to the following definition, slightly modified from Papaemmanuil et al 21:
Oncogenic: non‐synonymous substitutions or in‐frame mutations in canonical oncogenes at recurrent hotspots; non‐synonymous substitutions recurrent in two or more confirmed samples in COSMIC; non‐synonymous substitutions recurrent in two or more samples in our own dataset; and nonsense, frameshifting insertions, and deletions in known tumour suppressors;
Putative oncogenic: previously unreported non‐synonymous substitutions in a known cancer gene within ±3 amino acids of a mutation recurrent in two or more samples in COSMIC; more than two non‐synonymous substitutions within three amino acids of each other (mutation clusters);
Possible oncogenic: non‐synonymous substitutions confirmed somatic in one sample in COSMIC; mutations close to mutation clusters in COSMIC;
Unknown significance: all remaining mutations.
For the sake of simplicity, all oncogenic, putative, and possible oncogenic mutations are referred to as ‘oncogenic’ here.
Whole‐genome sequencing
Tumour genomic DNA from frozen multifocal lesions and matched germline DNA from eight patients were prepared for Illumina paired‐end sequencing according to the standard protocol (Illumina Inc, San Diego, CA, USA). Short reads were mapped back to the reference genome (GRCh37) and discordantly mapping reads were identified as pairs that did not match with the expected insert size, mapped in the wrong orientation, or mapped to different genomic regions, as previously described 22, 23. Putative rearrangements were validated and breakpoint annotated using capillary sequencing. Copy number alterations were assessed by QDNAseq (version 1.0.5) 24 by first binning the reads in non‐overlapping 100 kb windows. Bins in problematic regions were blacklisted. Read counts were corrected for GC content and mappability using a LOESS regression and median normalized. Raw log2 ratios were smoothed to remove outlier points using the median absolute deviation and segmented using the multitrack PCF algorithm as implemented in the R package copynumber (version 1.6.0) 25. Absolute estimates of copy numbers were obtained using ABSOLUTE (version 1.0.6) 26 in ‘total’ copy number mode and possible solutions of ploidy and cancer cell fraction were manually reviewed as explained on the software's webpage. Distance measures between lesions of a given patient were computed using a one‐step minimum event distance metric as implemented in the package MEDICC (version 10/2014) 27. We further compared the copy number status for the breast cancer‐specific focal copy‐number aberrations previously described in ref 28.
Statistical analyses
Spearman's ρ was used to assess the correlation between allelic frequencies obtained with Illumina and Ion Torrent technologies. Wilcoxon's tests were used to test the significance of non‐zero shift for samples from two categories and Kruskal–Wallis tests were used to test whether samples for more than two categories had the same distribution. Fisher's exact tests were used for the analysis of contingency tables. All tests rejected the null hypothesis with a two‐tailed p value of 0.05.
Additional methods are reported in the Supplementary methods.
Results
Targeted sequencing of cancer genes in MFBC
To investigate the potential genomic heterogeneity between ductal MFBC lesions with centrally confirmed homogeneity in grade, ER, and HER2 status, we conducted targeted sequencing of cancer‐related genes in multifocal lesions from 36 patients (Table 1 and Supplementary Tables 1–3). Whenever lesion size allowed, we interrogated multiple geographically distinct samples per lesion, leading to a total of 171 investigated tumour samples. Sequencing was carried out at a median exonic coverage of 178×. Overall, 474 somatic mutations were identified, corresponding to 145 and 55 unique coding non‐silent somatic substitutions and indels, respectively, across all samples (Supplementary Table 4).
Table 1.
Patient characteristics
| Total No of patients | |
|---|---|
| Age, years | |
| <40 | 2 |
| 40–49 | 11 |
| 50–69 | 17 |
| >70 | 6 |
| Tumour size, cm | |
| 1–2 | 17 |
| 2–5 | 18 |
| >5 | 1 |
| No of positive nodes | |
| None | 16 |
| 1–3 | 16 |
| 4–9 | 1 |
| >10 | 3 |
| Tumour grade | |
| G1 | 8 |
| G2 | 7 |
| G3 | 21 |
| Molecular subtype | |
| ER+/HER2− | 26 |
| ER−/HER2− | 4 |
| HER2+ | 6 |
| DCIS | |
| Absent | 6 |
| Present | 30 |
| LVI | |
| Absent | 21 |
| Present | 14 |
| Unknown | 1 |
| Inter‐lesion distance*, cm | |
| <2 | 10 |
| ≥2 | 16 |
| Unknown | 10 |
| No of lesions | |
| 2 | 22 |
| >2 | 14 |
DCIS = ductal carcinoma in situ; LVI = lymphovascular invasion.
When there were more than two lesions, the largest inter‐lesion distance was taken into consideration for this table and for the downstream analyses.
To independently confirm the presence or absence of these mutations, we performed a second targeted sequencing experiment at greater sequencing depth (median coverage of 1344×) using an alternative platform (Supplementary Figure 1A). We could not confirm 35 mutations (7%), due to either design/technical failures (n = 19 mutations) or the lack of remaining DNA for four samples (n = 16 mutations). We validated 389 mutations among the remaining 439 (89%, referred to as ‘confirmed’ in Supplementary Table 4) and observed a strong correlation between the allelic frequencies reported by the two sequencing technologies (ρ = 0.81, Supplementary Figures 1B and 1C). The area under the ROC curve of Illumina and PGM sequencing, which can be considered as the probability to classify as positive a randomly chosen positive observation with the first method, had a value of 78.94% (95% CI = 71.25–86.62%). Fifty mutations could not be detected above the predefined cut‐off despite sufficient coverage at that specific genomic location. Of interest, this additional sequencing step also identified the presence of 52 additional mutations in samples where the mutation was previously undetected though present in other sample(s) from the same patient (referred to as ‘present’ in Supplementary Table 4).
We observed a median of three mutations (range 1–27) per patient. The number of mutations detected per patient correlated neither with the number of lesions nor with the number of samples sequenced (Supplementary Figures 2A and 2B). Of note, among the 141 validated unique mutations, 62 (44%) were identified as oncogenic (see the Materials and methods section) and thus susceptible to having contributed to the development of the cancer.
Identification of inter‐lesion heterogeneity
By comparing the mutations identified in the various samples and lesions of a single patient, we arbitrarily classified the patients into three groups: those for which all samples from all lesions carried the same mutations (11/36 patients, 31%, Figure 1); those with both common and private mutations (13/36 patients, 36%, Figure 2); and those with no single mutation in common among all samples from the investigated lesions (12/36 patients, 33%, Figure 3). These groups will further be referred to as ‘homogeneous’, ‘intermediate’, and ‘heterogeneous’, respectively. There was no significant association between the number of samples and lesions that were sequenced per patient and the group to which they were categorized (Supplementary Figures 2C and 2D). In contrast, these groups were significantly associated with the total number of mutations per patient: patients from the homogeneous group had fewer mutations than patients from the heterogeneous group (median number of mutations: 2.5 and 4.5, respectively; p = 0.030; Supplementary Figure 2E). Nevertheless, the number of mutations per lesion between those two groups of multifocal tumors was not significantly different (p = 0.568). Of note, the vast majority of the homogeneous and intermediate cancers (9/11 and 9/13) shared at least one oncogenic mutation, potentially representing common and early events during MFBC carcinogenesis.
Figure 1.

Distribution of non‐silent substitutions and indels in the ‘homogeneous’ MFBC group. Heat maps representing the non‐silent exonic substitutions and indels of the homogeneous multifocal lesions (L), which have all the mutations in common between all the lesions. Orange indicates the presence of an oncogenic mutation; grey the presence of a mutation of unknown significance; and white the absence of the mutation. When a sample name is grey, it means that it was not initially sequenced, due to a lack of sufficient DNA, but was included in the validation phase.
Figure 2.

Distribution of non‐silent substitutions and indels in the ‘intermediate’ MFBC group. Heat maps representing the non‐silent exonic substitutions and indels of the intermediate multifocal lesions, presenting both common and private mutations. The colour code, abbreviations, and meaning of the grey colour for sample names are the same as in Figure 1. When numbers are present after the gene symbol, it means that different mutations were observed in the same gene.
Figure 3.

Distribution of non‐silent substitutions and indels in the ‘heterogeneous’ MFBC group. (A) Heat maps representing the non‐silent exonic substitutions and indels of the heterogeneous multifocal lesions, which have no mutation in common between the lesions. The colour code and abbreviations are the same as in Figure 1. (B) Immunohistochemical staining illustrating the loss of PTEN at the protein level only in the lesion of patient PD13774 carrying the PTEN mutation.
In the intermediate group, the median allelic frequency of private mutations (ie those not shared by all samples of a given patient) was lower than that of common mutations (0.13 versus 0.26, p = 0.042 using a Wilcoxon's signed rank test for paired data). Although copy losses could account for this observation, we believe that the emergence of private, subclonal alterations provides a more general and biologically plausible explanation.
In the heterogeneous group, most cancers were characterized by the presence of different oncogenic mutations in the investigated lesions. In six patients, oncogenic PIK3CA mutations were only present in one of the lesions. A similar observation was found for oncogenic TP53 (n = 3), GATA3 (n = 3), and PTEN (n = 2) mutations. We further demonstrated that the PTEN mutation resulted in the loss of PTEN staining by immunohistochemistry only in the lesion carrying the mutation, as exemplified for patient PD13774 in Figure 3B.
Finally, we observed a case of possible convergent evolution for patient PD4877, with each lesion carrying a different TP53 oncogenic mutation (all three samples from the first lesion carried the R196* mutation, whereas one sample from the other lesion was characterized by the R273H variant).
Altogether, these results suggest that although multifocal tumours may look identical at pathological evaluation, for a substantial number of patients, their lesions differ in terms of oncogenic mutations.
Association of inter‐lesion heterogeneity with clinico‐pathological variables
We further sought to investigate whether inter‐lesion heterogeneity was associated with clinical or histopathological variables, such as age at diagnosis, axillary lymph node involvement, tumour size, molecular subtype of the tumour based on ER and HER2 status, histological grade, number of multifocal lesions, largest distance between lesions, presence of in situ component, and lymphovascular invasion. The only association found was that of inter‐lesion distance: the lesions from patients of the heterogeneous group were further apart from each other than those from patients of the homogeneous group, although the p value was of borderline significance (p = 0.072). A significant p value was, however, reached when comparing patients whose lesions had shared oncogenic mutations with those who had only private oncogenic mutations (p = 0.015, Supplementary Figure 3).
Genome‐wide comparison of multifocal lesions
Targeted mutation screening underestimates the number of common genetic alterations among samples because it only interrogates a tiny fraction of the genome. To investigate the clonal relationship between the lesions, we carried out a low‐coverage screen of somatic structural variation and copy number aberrations (SCNAs) for the eight patients with available frozen material (Figure 4, Supplementary Figure 4, and Supplementary Tables 5–8). Two patients belonged to the homogeneous, three to the intermediate, and three to the heterogeneous group of MFBC tumours, as defined by targeted sequencing. Interestingly, we observed common rearrangements and SCNAs between the lesions for all patients, even for those belonging to the heterogeneous groups, implying that different lesions from the same patient are genetically related. Nevertheless, we cannot exclude that some of the common rearrangements might have arisen during mammary gland development and/or ageing. Although the numbers are too small to draw any statistical conclusion, we further observed a higher proportion of private rearrangements and SCNAs for patients belonging to the heterogeneous group, suggesting an early divergent parallel evolution of the lesions. Of note, most rearrangements did not involve a known cancer gene. Nevertheless, we observed a tandem duplication involving the oncogene MYC in the second lesion of patient PD4877 (Supplementary Table 5). At the copy number level, inter‐lesion differences involving cancer‐related genes were present, such as, for example, PTEN loss in only one of the lesions of patients PD4877, PD4878, and PD11773, and MYC amplification in one lesion from PD11776 (Supplementary Tables 7 and 8).
Figure 4.

Genome‐wide alterations. Bar plots showing the distribution and the type of somatic rearrangements (A) and the percentage of private and common (present in all investigated lesions from that patient) rearrangements (B). (C) Genome‐wide Circos plots of somatic rearrangements of the three patients classified in the ‘heterogeneous’ group. The chromosomes are represented in the outer ring. Within the inner ring, each blue line represents a common rearrangement, whereas each green or red line represents a rearrangement private to lesion 1 or 2, respectively. Bar plots showing the distribution (D) and the percentage of private and common (present in all investigated lesions from that patient) SCNAs (E). (F, G) Log2‐based estimate (log2 ratio) of SCNAs, represented in light red, of two of the three patients classified as ‘heterogeneous’.
Discussion
The College of American Pathologists currently considers it sufficient for the immunohistochemical characterization of multifocal lesions with similar grade and histology to be based on the largest lesion 14. Previous studies have demonstrated that despite the similarity of pathological features, lesions might differ in some cases in terms of ER and HER2 receptor status, potentially impacting the treatment and management of these cancers 15, 16, 17. Here we sought to investigate more deeply the intra‐patient, inter‐lesion tumour heterogeneity of MFBCs, and the potential clinical implications. To this end, we used a targeted sequencing approach that specifically focused on cancer‐related genes, similar to those currently used by several molecular cancer profiling initiatives (reviewed in ref 29), on a series of 171 tumour samples from 36 clinically well‐annotated MFBC patients.
Our study showed that 67% (24 patients) of MFBC lesions share all or a fraction of the identified mutations by targeted sequencing. Most of these lesions (18/24) also shared oncogenic variants, suggesting a common origin in the tumour development. Surprisingly, in one‐third of the patients (12 patients), the lesions did not share any mutations despite similar histopathological features. The lesions in all but one patient actually differed in terms of oncogenic mutations. For example, four patients were heterogeneous in terms of oncogenic TP53 mutations, in line with a recent observation 30. These differences in the presence of oncogenic mutations suggest that different genetic alterations might have been causally involved in the development of the respective lesions in a substantial proportion of MFBCs.
Although the numbers in our study are too small to draw definitive conclusions, we did not observe any significant associations between inter‐lesion heterogeneity and commonly used clinico‐pathological features, with the exception of the inter‐lesion distance. Indeed, MFBCs whose lesions shared oncogenic variants were closer to each other than lesions not sharing any oncogenic variant. This observation supports the concept underlying the historical definition of multicentric and multifocal tumours – namely, that lesions in close proximity to each other are more likely to be biologically similar than lesions that are far apart 1.
The fact that lesions from one‐third of the MFBCs that we studied harboured distinct oncogenic mutations may have substantial therapeutic implications in the context of genotype‐driven trials. Although these trials are mainly running in the metastatic setting, most of them allow identification of the mutation(s) to be performed in the primary tumour 29. In the case of MFBC, the identification of the molecular targets, for example PIK3CA and PTEN, would differ depending on the lesion interrogated. Our results therefore suggest that ideally all lesions from patients with MFBC should be evaluated, in particular when the lesions are relatively distant from each other.
Another question, especially in the case of heterogeneous MFBCs, is whether MFBC represents clonally related outgrowths of a common ancestral cancer cell or completely separate, coincidental cancers. This has been a matter of debate in the literature during the last few decades 31, 32, 33, 34. Our targeted sequencing data support a common origin in at least two‐thirds of MFBCs. Our genome‐wide analyses, which are limited to a subset of the patients that we studied, suggest that an even greater proportion of MFBCs, including those without any common oncogenic mutations, are clonally related and consequently that the lesions arise through intra‐mammary spread of the tumour cells. The intrinsically invasive nature of the cells making up MFBCs might explain the worse prognostic features associated with multifocal compared with unifocal cancers.
Since the number of molecular alterations with potential clinical utility is rapidly growing in parallel with the increasing number of targeted therapies, our findings suggest that molecularly characterizing only the largest lesion is not sufficient to adequately manage MFBCs, especially when lesions are further apart from each other. The implications of inter‐lesion differences in terms of treatment response to various standard and targeted therapies, as well as in terms of disease progression, deserve further investigation. Recognizing the potential molecular heterogeneity of these cancers has great potential clinical relevance and already represents an important step towards the personalized treatment of this disease.
Author contributions
CD and CS conceived, designed, and directed the study. CD, GZ, AB, and CS were involved in funding acquisition. DF, EP, DV, MM, NK, IM, LS, DA, DL, MP, and CS provided study materials or patients. CD, DF, EP, FR, SM, MM, DV, NK, RS, RdW, and DL were involved in sample preparation and pathological characterization. CD, SNZ, GZ, and GG analysed and curated substitutions and indels, and generated the oncogenicity calls. GZ, AG, EC, TVB, AB, and DL conducted the resequencing experiments. CD, DF, and GZ analysed the resequencing experiments. CD, SNZ, LM, and SM carried out the whole‐genome rearrangement screen experiments. CD, DF, SNZ, and DB analysed the whole‐genome rearrangement data. BB and DB conducted the copy number analyses. GZ performed the statistical analyses. CD, DF, EP, GZ, DB, SNZ, GG, SL, TVB, DA, DL, MP, DL, PJC, and CS critically interpreted the data and results. CD prepared the manuscript. All authors were involved in reviewing and commenting on the manuscript, and gave their approval of the submitted version of the manuscript.
SUPPORTING INFORMATION ON THE INTERNET.
The following supporting information may be found in the online version of this article:
Supplementary methods.
Figure S1. Validation of the mutations using the alternative sequencing platform.
Figure S2. Relationship between the mutational burden, the number of samples and lesions interrogated per patient, and the group of MFBCs.
Figure S3. Inter‐lesion heterogeneity and inter‐lesion distance.
Figure S4. Genome‐wide copy number alterations.
Table S1. Patient and tumour's characteristics.
Table S2. List of all tumour samples and the applications they have been used for.
Table S3. List of genes interrogated by targeted sequencing.
Table S4. Somatic substitutions and indels.
Table S5. Physical and sequence coverage from the whole‐genome sequencing for the identification of the rearrangements.
Table S6. List of validated private and common rearrangements.
Table S7. Absolute copy number status of the focal regions reported to be specifically amplified in breast cancer according to ref 28, for the 16 samples with available data.
Table S8. Absolute copy number status of the focal regions reported to be specifically deleted in breast cancer according to ref 28, for the 16 samples with available data.
Supporting information
AppendixS1. SUPPORTING INFORMATION
FigureS1. Validation of the mutations using the alternative sequencing platform. (A) Histogram representing the distribution of the coverage of the interrogated mutations across all samples. (B‐C) Scatterplot and Receiver Operating Characteristic (ROC) analyses illustrating the concordance of the allelic frequencies estimated by the two sequencing technologies.
FigureS2. Relationship between the mutational burden, the number of samples and lesions interrogated per patient, and the group of MFBCs. (A‐B) Boxplots of mutational burden per patient in terms of the number of lesions and samples that have been interrogated per patient, respectively; (C‐D) Boxplots of the number of interrogated lesions and samples per patient in terms of the group of MFBC, respectively; and (E) Boxplot of the mutational burden per patient in terms of the group of MFBC.
FigureS3. Inter‐lesion heterogeneity and inter‐lesion distance. (A) Boxplot of inter‐lesion heterogeneity in terms of oncogenic mutations and largest inter‐lesion distance. Here patients were classified in two groups: those sharing oncogenic mutations between their lesions and those only having oncogenic mutations private to some of their lesions. Patients without identified oncogenic mutations were not considered here. (B) Boxplot of inter‐lesion heterogeneity in terms of the three groups identified according to the targeted sequencing data considering all mutations, and largest inter‐lesion distance.
FigureS4. Genome‐wide copy number alterations. Log2 based estimate of copy number (Log2 Ratio) aberrations, represented in red, across the patients with available whole genome sequencing data that were not represented in Figure 4.
TableS1. Patient and tumour's characteristics.
TableS2. List of all tumour samples and the applications they have been used for.
TableS3. List of genes interrogated by targeted sequencing.
TableS4. Somatic substitutions and indels. Abbreviations: WT = wild type; MT = mutated; AF = allelic fraction; NA = not available; sub = substitution; del = deletion; ins = insertion. Note concerning the ‘Validation. IonTorrent’ column: ‘Confirmed’ means that the particular mutation was detected by both technologies in that given sample. ‘Present’ means that the particular mutation was not detected by Illumina, but was detected in the validation phase in that particular sample.
TableS5. Physical and sequence coverage from the whole‐genome sequencing for the identification of the rearrangements.
TableS6. List of validated private and common rearrangements.
TableS7. Absolute copy number status of the focal regions reported to be specifically amplified in breast cancer according to ref 28, for the 16 samples with available copy number data.
TableS8. Absolute copy number status of the focal regions reported to be specifically deleted in breast cancer according to ref 28, for the 16 samples with available copy number data.
Acknowledgments
We thank the patients who agreed to participate in this study, as well as L Buisseret, G Rouas, PY Adnet, S Brohée, S Martin, L Stebbings, S O'Meara, S McLaren, J Marshall, and C Straehle for their logistical, technical, and administrative assistance. We further thank MEDIC; Les Amis de l'Institut Bordet; Fonds National de Recherche Scientifique; the Brussels Region (Impulse programme Life Sciences); Fonds Lambeau‐Marteaux; the Breast Cancer Research Foundation (BCRF); the Wellcome Trust Clinical Research Training Fellowship Programmes; San Paolo Foundation; CARIGE Foundation; and AIRC Foundation (ID MFAG10570) for their financial support.
Conflict of interest statement: Peter J Campbell is a founder, stock‐holder, and consultant for 14 M Genomics Ltd. The remaining authors do not have any conflicts of interest to declare.
References Note: References 35– 40 are cited in the Supporting information to this article
- 1. Salgado R, Aftimos P, Sotiriou C, et al. Evolving paradigms in multifocal breast cancer. Semin Cancer Biol 2015; 31: 111–118. [DOI] [PubMed] [Google Scholar]
- 2. Lynch SP, Lei X, Chavez‐Macgregor M, et al. Multifocality and multicentricity in breast cancer and survival outcomes. Ann Oncol 2012; 23: 3063–3069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Wolters R, Wöckel A, Janni W, et al. Comparing the outcome between multicentric and multifocal breast cancer: what is the impact on survival, and is there a role for guideline‐adherent adjuvant therapy? A retrospective multicenter cohort study of 8,935 patients. Breast Cancer Res Treat 2013; 142: 579–590. [DOI] [PubMed] [Google Scholar]
- 4. Tot T. Early and more advanced unifocal and multifocal breast carcinomas and their molecular phenotypes. Clin Breast Cancer 2011; 11: 258–263. [DOI] [PubMed] [Google Scholar]
- 5. Bendifallah S, Werkoff G, Borie‐Moutafoff C, et al. Multiple synchronous (multifocal and multicentric) breast cancer: clinical implications. Surg Oncol 2010; 19: e115‐123. [DOI] [PubMed] [Google Scholar]
- 6. Pedersen L, Gunnarsdottir KA, Rasmussen BB, et al. The prognostic influence of multifocality in breast cancer patients. Breast 2004; 13: 188–193. [DOI] [PubMed] [Google Scholar]
- 7. Cabioglu N, Ozmen V, Kaya H, et al. Increased lymph node positivity in multifocal and multicentric breast cancer. J Am Coll Surg 2009; 208: 67–74. [DOI] [PubMed] [Google Scholar]
- 8. Vlastos G, Rubio IT, Mirza NQ, et al. Impact of multicentricity on clinical outcome in patients with T1–2, N0–1, M0 breast cancer. Ann Surg Oncol 2000; 7: 581–587. [DOI] [PubMed] [Google Scholar]
- 9. Joergensen LE, Gunnarsdottir KA, Lanng C, et al. Multifocality as a prognostic factor in breast cancer patients registered in Danish Breast Cancer Cooperative Group (DBCG) 1996–2001. Breast 2008; 17: 587–591. [DOI] [PubMed] [Google Scholar]
- 10. Litton JK, Eralp Y, Gonzalez‐Angulo AM, et al. Multifocal breast cancer in women < or =35 years old. Cancer 2007; 110: 1445–1450. [DOI] [PubMed] [Google Scholar]
- 11. Leopold KA, Recht A, Schnitt SJ, et al. Results of conservative surgery and radiation therapy for multiple synchronous cancers of one breast. Int J Radiat Oncol Biol Phys 1989; 16: 11–16. [DOI] [PubMed] [Google Scholar]
- 12. Yerushalmi R, Kennecke H, Woods R, et al. Does multicentric/multifocal breast cancer differ from unifocal breast cancer? An analysis of survival and contralateral breast cancer incidence. Breast Cancer Res Treat 2009; 117: 365–370. [DOI] [PubMed] [Google Scholar]
- 13. Pekar G, Hofmeyer S, Tabár L, et al. Multifocal breast cancer documented in large‐format histology sections: long‐term follow‐up results by molecular phenotypes. Cancer 2013; 119: 1132–1139. [DOI] [PubMed] [Google Scholar]
- 14. Lester SC, Bose S, Chen YY, et al. Protocol for the examination of specimens from patients with invasive carcinoma of the breast. Arch Pathol Lab Med 2009; 133: 1515–1538. [DOI] [PubMed] [Google Scholar]
- 15. Buggi F, Folli S, Curcio A, et al. Multicentric/multifocal breast cancer with a single histotype: is the biological characterization of all individual foci justified? Ann Oncol 2012; 23: 2042–2046. [DOI] [PubMed] [Google Scholar]
- 16. Choi Y, Kim EJ, Seol H, et al. The hormone receptor, human epidermal growth factor receptor 2, and molecular subtype status of individual tumour foci in multifocal/multicentric invasive ductal carcinoma of breast. Hum Pathol 2012; 43: 48–55. [DOI] [PubMed] [Google Scholar]
- 17. Pekar G, Gere M, Tarjan M, et al. Molecular phenotype of the foci in multifocal invasive breast carcinomas: intertumoural heterogeneity is related to shorter survival and may influence the choice of therapy. Cancer 2014; 120: 26–34. [DOI] [PubMed] [Google Scholar]
- 18. Varela I, Tarpey P, Raine K, et al. Exome sequencing identifies frequent mutation of the SWI/SNF complex gene PBRM1 in renal carcinoma. Nature 2011; 469: 539–542. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Ye K, Schulz MH, Long Q, et al. Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired‐end short reads. Bioinformatics 2009; 25: 2865–2871. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Wang L, Yamaguchi S, Burstein MD, et al. Novel somatic and germline mutations in intracranial germ cell tumours. Nature 2014; 511: 241–245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Papaemmanuil E, Gerstung M, Malcovati L, et al. Clinical and biological implications of driver mutations in myelodysplastic syndromes. Blood 2013; 122: 3616–3627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Campbell PJ, Yachida S, Mudie LJ, et al. The patterns and dynamics of genomic instability in metastatic pancreatic cancer. Nature 2010; 467: 1109–1113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Stephens PJ, McBride DJ, Lin ML, et al. Complex landscapes of somatic rearrangement in human breast cancer genomes. Nature 2009; 462: 1005–1010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Scheinin I, Sie D, Bengtsson H, et al. DNA copy number analysis of fresh and formalin‐fixed specimens by shallow whole‐genome sequencing with identification and exclusion of problematic regions in the genome assembly. Genome Res 2014; 24: 2022–2032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Nilsen G, Liestøl K, Van Loo P, et al. Copynumber: efficient algorithms for single‐ and multi‐track copy number segmentation. BMC Genomics 2012; 13: 591. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Carter SL, Cibulskis K, Helman E, et al. Absolute quantification of somatic DNA alterations in human cancer. Nature Biotechnol 2012; 30: 413–421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Schwarz RF, Trinh A, Sipos B, et al. Phylogenetic quantification of intra‐tumour heterogeneity. PLoS Comput Biol 2014; 10: e1003535. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Zack TI, Schumacher SE, Carter SL, et al. Pan‐cancer patterns of somatic copy number alteration. Nature Genet 2013; 45: 1134–1140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Zardavas D, Maetens M, Irrthum A, et al. The AURORA initiative for metastatic breast cancer. Br J Cancer 2014; 111: 1881–1887. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Eeles R, Knee G, Jhavar S, et al. Multicentric breast cancer: clonality and prognostic studies. Breast Cancer Res Treat 2011; 129: 703–716. [DOI] [PubMed] [Google Scholar]
- 31. Dawson PJ, Baekey PA, Clark RA. Mechanisms of multifocal breast cancer: an immunocytochemical study. Hum Pathol 1995; 26: 965–969. [DOI] [PubMed] [Google Scholar]
- 32. Noguchi S, Aihara T, Koyama H, et al. Discrimination between multicentric and multifocal carcinomas of the breast through clonal analysis. Cancer 1994; 74: 872–877. [DOI] [PubMed] [Google Scholar]
- 33. Teixeira MR, Ribeiro FR, Torres L, et al. Assessment of clonal relationships in ipsilateral and bilateral multiple breast carcinomas by comparative genomic hybridisation and hierarchical clustering analysis. Br J Cancer 2004; 91: 775–782. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Tsuda H, Hirohashi S. Identification of multiple breast cancers of multicentric origin by histological observations and distribution of allele loss on chromosome 16q. Cancer Res 1995; 55: 3395–3398. [PubMed] [Google Scholar]
- 35. Harvey JM, Clark GM, Osborne CK, et al. Estrogen receptor status by immunohistochemistry is superior to the ligand‐binding assay for predicting response to adjuvant endocrine therapy in breast cancer. J Clin Oncol 1999; 17: 1474–1481. [DOI] [PubMed] [Google Scholar]
- 36. Leake R, Barnes D, Pinder S, et al. Immunohistochemical detection of steroid receptors in breast cancer: a working protocol. UK Receptor Group, UK NEQAS, The Scottish Breast Cancer Pathology Group, and The Receptor and Biomarker Study Group of the EORTC. J Clin Pathol 2000; 53: 634–635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Wolff AC, Hammond ME, Schwartz JN, et al. American Society of Clinical Oncology/College of American Pathologists guideline recommendations for human epidermal growth factor receptor 2 testing in breast cancer. J Clin Oncol 2007; 25: 118–145. [DOI] [PubMed] [Google Scholar]
- 38. Elston CW, Ellis IO. Pathological prognostic factors in breast cancer. I. The value of histological grade in breast cancer: experience from a large study with long‐term follow‐up. Histopathology 1991; 19: 403–410. [DOI] [PubMed] [Google Scholar]
- 39. Genestie C, Zafrani B, Asselain B, et al. Comparison of the prognostic value of Scarff–Bloom–Richardson and Nottingham histological grades in a series of 825 cases of breast cancer: major importance of the mitotic count as a component of both grading systems. Anticancer Res 1998; 18: 571–576. [PubMed] [Google Scholar]
- 40. Nik‐Zainal S, Van Loo P, Wedge DC, et al. The life history of 21 breast cancers. Cell 2012; 149: 994–1007. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
AppendixS1. SUPPORTING INFORMATION
FigureS1. Validation of the mutations using the alternative sequencing platform. (A) Histogram representing the distribution of the coverage of the interrogated mutations across all samples. (B‐C) Scatterplot and Receiver Operating Characteristic (ROC) analyses illustrating the concordance of the allelic frequencies estimated by the two sequencing technologies.
FigureS2. Relationship between the mutational burden, the number of samples and lesions interrogated per patient, and the group of MFBCs. (A‐B) Boxplots of mutational burden per patient in terms of the number of lesions and samples that have been interrogated per patient, respectively; (C‐D) Boxplots of the number of interrogated lesions and samples per patient in terms of the group of MFBC, respectively; and (E) Boxplot of the mutational burden per patient in terms of the group of MFBC.
FigureS3. Inter‐lesion heterogeneity and inter‐lesion distance. (A) Boxplot of inter‐lesion heterogeneity in terms of oncogenic mutations and largest inter‐lesion distance. Here patients were classified in two groups: those sharing oncogenic mutations between their lesions and those only having oncogenic mutations private to some of their lesions. Patients without identified oncogenic mutations were not considered here. (B) Boxplot of inter‐lesion heterogeneity in terms of the three groups identified according to the targeted sequencing data considering all mutations, and largest inter‐lesion distance.
FigureS4. Genome‐wide copy number alterations. Log2 based estimate of copy number (Log2 Ratio) aberrations, represented in red, across the patients with available whole genome sequencing data that were not represented in Figure 4.
TableS1. Patient and tumour's characteristics.
TableS2. List of all tumour samples and the applications they have been used for.
TableS3. List of genes interrogated by targeted sequencing.
TableS4. Somatic substitutions and indels. Abbreviations: WT = wild type; MT = mutated; AF = allelic fraction; NA = not available; sub = substitution; del = deletion; ins = insertion. Note concerning the ‘Validation. IonTorrent’ column: ‘Confirmed’ means that the particular mutation was detected by both technologies in that given sample. ‘Present’ means that the particular mutation was not detected by Illumina, but was detected in the validation phase in that particular sample.
TableS5. Physical and sequence coverage from the whole‐genome sequencing for the identification of the rearrangements.
TableS6. List of validated private and common rearrangements.
TableS7. Absolute copy number status of the focal regions reported to be specifically amplified in breast cancer according to ref 28, for the 16 samples with available copy number data.
TableS8. Absolute copy number status of the focal regions reported to be specifically deleted in breast cancer according to ref 28, for the 16 samples with available copy number data.
