Skip to main content
PLOS One logoLink to PLOS One
. 2015 Sep 25;10(9):e0136192. doi: 10.1371/journal.pone.0136192

Targeted Sequencing of the Mitochondrial Genome of Women at High Risk of Breast Cancer without Detectable Mutations in BRCA1/2

Sophie Blein 1, Laure Barjhoux 1; GENESIS investigators, Francesca Damiola 1, Marie-Gabrielle Dondon 2,3,4, Séverine Eon-Marchais 2,3,4, Morgane Marcou 2, Olivier Caron 5, Alain Lortholary 6, Bruno Buecher 7, Philippe Vennin 8, Pascaline Berthet 9, Catherine Noguès 10, Christine Lasset 11,12, Marion Gauthier-Villars 5, Sylvie Mazoyer 1, Dominique Stoppa-Lyonnet 5,13,14, Nadine Andrieu 2,3,4, Gilles Thomas 15, Olga M Sinilnikova 1,16, David G Cox 1,*
Editor: Paolo Peterlongo17
PMCID: PMC4583250  PMID: 26406445

Abstract

Breast Cancer is a complex multifactorial disease for which high-penetrance mutations have been identified. Approaches used to date have identified genomic features explaining about 50% of breast cancer heritability. A number of low- to medium penetrance alleles (per-allele odds ratio < 1.5 and 4.0, respectively) have been identified, suggesting that the remaining heritability is likely to be explained by the cumulative effect of such alleles and/or by rare high-penetrance alleles. Relatively few studies have specifically explored the mitochondrial genome for variants potentially implicated in breast cancer risk. For these reasons, we propose an exploration of the variability of the mitochondrial genome in individuals diagnosed with breast cancer, having a positive breast cancer family history but testing negative for BRCA1/2 pathogenic mutations. We sequenced the mitochondrial genome of 436 index breast cancer cases from the GENESIS study. As expected, no pathogenic genomic pattern common to the 436 women included in our study was observed. The mitochondrial genes MT-ATP6 and MT-CYB were observed to carry the highest number of variants in the study. The proteins encoded by these genes are involved in the structure of the mitochondrial respiration chain, and variants in these genes may impact reactive oxygen species production contributing to carcinogenesis. More functional and epidemiological studies are needed to further investigate to what extent variants identified may influence familial breast cancer risk.

Introduction

In 2008, 14 million breast cancers (BC) were diagnosed worldwide, representing more than 10% of all cancer diagnoses[1]. One woman out of nine will develop breast cancer in her lifetime in developed countries[1,2]. Breast cancer is a complex multifactorial disease, with genetic, life-style and environmental susceptibility factors. Today we estimate that the genetic component of breast cancer represents 5% to 10% of all breast cancers, with 4% to 5% due to high-penetrance mutations in susceptibility genes[35] Mutations in two high-penetrance susceptibility genes have been identified: BRCA1 and BRCA2. Pathogenic mutations in BRCA1 confer a lifetime risk of BC of 60% to 85%[6,7], whereas such mutations in BRCA2 confer a lifetime risk of approximately 40% to 85%[6,7]. Other genomic variations (for example in genes interacting with BRCA1 and BRCA2) have been identified as modifiers of BC cancer risk and alter the risk initially conferred by BRCA1 or BRCA2 mutation[8].

Early methods such as linkage or candidate gene studies, and more recent approaches like genome-wide association studies (GWAS) have been able to identify genomic features explaining about 50% of breast cancer heritability[9]. A number of low- to moderate-effect Single Nucleotide Polymorphisms (SNPs) have been identified as associated with BC risk. However, today the large amount of genome-wide studies performed that have failed to identify other highly penetrant breast cancer susceptibility genes suggests that the remaining heritability is likely to be explained by the cumulative effect of low-penetrant and low- to moderate-effect genomic variations, by moderately penetrant but rare alleles, and by the synergistic effect of environmental exposure combined to specific genomic variations.

Oxidative stress has been shown to play a role in BC development[10,11]. Mitochondria are intimately linked to oxidative stress, as they are a main source of Reactive Oxygen Species (ROS) in the cell. ROS are cytotoxic and mutagenic components that can cause damage to DNA, in particular double strand breaks. When not correctly handled, this damage leads to genomic instability, and therefore may contribute to tumorigenesis. Germline and somatic variations in the mitochondrial genome have been linked to cancer. Mitochondrial genome mutations are found in various cancers and may alter mitochondrial metabolism, enhance tumorigenesis and permit cancer cell adaptation to changing environments[12,13].

Whereas the mitochondrial genome is haploid, classical linkage studies are dedicated to the analysis of diploid genomes. Furthermore, commercially available GWAS arrays are not suited to capture variation in specific regions of the genome, in particular the mitochondrial genome. Therefore methods specifically targeting the mitochondrial genome are needed to further explore variability in the mitochondrial genome of breast cancer patients. For these reasons, we have sequenced the mitochondrial genome of 436 women diagnosed with breast cancer, having a positive familial breast cancer history, but testing negative for BRCA1/2 pathogenic mutations.

Materials and Methods

GENESIS study

GENESIS (GENE SISters) is a French national study designed to identify new breast cancer susceptibility genes[14]. GENESIS includes pairs of sisters diagnosed with breast cancer, but testing negative for BRCA1/BRCA2 mutation. Testing was performed in the context of an oncogenetic consultation following French national guidelines. GENESIS also includes matched non-related controls of the same age group and sharing the same living environment, i.e. close friends or colleagues of index cases.

Ethics Statement

The GENESIS study has been reviewed and approved by the appropriate ethics committee (Comité de Protection des Personnes Ile de France III, 3 october 2006, agreement n°2373). Informed consent has been obtained for each woman included in the GENESIS Study.

Sample Selection

We selected unrelated women among index breast cancer cases of the study. Index cases were ranked by breast cancer phenotypic aggressiveness: i.e. women diagnosed at an early age, and presenting bilateral cancer (see Table 1). Table 1 also presents the number of cases selected from different inclusion categories. Blood samples are centralized in the Breast Cancer Genetics laboratory of the Cancer Research Center of Lyon, where DNA is extracted using automated protocols. As our hypothesis was that rare, potentially deleterious alleles would be discovered, we chose to include as many index cases as possible, and no control individuals. Statistical power to carry out classical association testing in this context would be insufficient, and formal association testing was not the aim of this study.

Table 1. Inclusion categories of selected index cases.

Inclusion category Counts
Index Case’s BC bilateral, diagnosed before 50 Sister’s BC bilateral, diagnosed before 50 8
Index Case’s BC bilateral, diagnosed before 60 Sister’s BC bilateral 8
Index Case’s BC diagnosed before 60 Sister’s BC bilateral 36
Index Case’s BC bilateral, diagnosed before 50 Sister’s BC unilateral, diagnosed before 50 23
Index Case’s BC unilateral, diagnosed before 50 Sister’s BC bilateral, diagnosed before 50 23
Index Case’s BC unilateral, diagnosed before 50 Sister’s BC unilateral, diagnosed before 50 164
Index Case’s BC bilateral, diagnosed before 60 174
Total 436
Age at 1st BC for index cases (in years, mean ± SD) 46.1 ± 7.9
Age at 1st BC for sister (in years, mean ± SD) 45.7 ± 6.7
Bilateral cancer for index cases Yes: 51.2% No: 48.8%

Mitochondrial Genome Sequencing

Primers corresponding to eleven amplicons covering the whole mitochondrial genome were designed and tested for specificity to the mitochondrial genome (Table A in S1 File). Sequencing libraries were prepared with NEBNext Fast DNA Fragmentation & Library Prep Set for Ion Torrent. The mean target fragment size for fragmentation was 175 bp. Samples were multiplexed by 48. Libraries were loaded on Ion 316 chips for sequencing (Single end sequencing) using the Personalized Genome MachineTM system (PGM, Life Technologies, Carlsbad, CA, USA). A total of 10 sequencing runs were performed.

Bioinformatics Analyses

Reads were aligned to the Mitochondrial Genome Reference sequence rCRS deposed in Genbank under the accession NC_012920.1 with BWA[15] software (version 0.7.5a). Read quality was assessed using the FastQC (v.0.10.1)[16] Toolkit. Samtools (v.0.1.19)[17], Picard tools (v.1.96)[18], and GATK (v.2.5–2)[1921] were used to process aligned bam files. Indels were realigned after primary alignment with the GATK walker IndelRealigner. Base quality scores were recalibrated with the GATK walker BaseRecalibrator. Variants were called using the GATK UnifiedGenotyper algorithm. Visual examination of read alignment in regions surrounding detected variants was performed with Integrative Genomics Viewer[22,23].

Annotation and filtering

Variants called were annotated with the Ensembl tool Variant Effect Predictor (VEP)[24], and with locally developed Python scripts. Given the haploid status of mitochondria, variants called as heterozygous were considered unreliable and filtered as false-positives. Variants for which the alternative allele was not balanced on both strands, i.e. variants having less than 10% of reads supporting alternative allele on one strand are considered as unreliable and filtered. Remaining reliable variants were annotated according to MITOMAP[25], a catalog of mitochondrial variants including mini insertions and deletions (accessed September 2013). We also annotated positions that are conserved among vertebrates, i.e. positions strictly conserved between reference genomes of nine superior eukaryotic species (data not shown). If mutated, variants at these positions are potentially more likely to have a functional impact. Finally, VEP provided gene annotation, substitution effect in transcribed region, codon and amino acid change and functional effect prediction with PolyPhen[26] and SIFT[27] for non-synonymous substitutions.

To estimate mitochondrial variability at the gene level, 2 statistics were computed. The global variant enrichment rate estimates the number of distinct variants observed per gene and per Mb. For a given gene x of length l x, if N x distinct variants were observed in our study, then the global variant enrichment rate r g for this gene is:

rg = Nxlx× 1000 variants/Mb

This statistic does not take into account the frequency of each variant observed in our study. Therefore, the weighted variant enrichment rate r w, taking into account both gene variability and variants frequency was also computed. With i from 1 to N x representing the i th variant observed on gene x, and n i the count of individual carrying variant i, then:

rw = Σi=1Nxnilx × 1000 variants.person/Mb

The statistics were computed for all mitochondrial genes, except for those encoding for mitochondrial transfer RNAs, as their very small length would have biased the results.

Results

Sequencing characteristics, coverage assessment, and variant description

More than 20 million reads were obtained, with a mean read count per sample higher than 45,000 (Table B in S1 File). On average, more than 27,000 of the 45,000 sequenced reads were successfully aligned against the reference genome, which gives an alignment rate of 57.4%. (Table C in S1 File)

In our experiment, approximately 90% of the mitochondrial genome is covered with a depth of coverage of 50X. The global mean coverage is 198X. More details, including coverage profile along the mitochondrial genome, are presented in Figure A in S1 File.

1157 distinct variants successfully passed our quality filters; and are detailed in S2 File. More than 99% of variants are substitutions. About 80% of variants are located in the coding sequence of mitochondrial genes. This is not surprising given the probable bacterial origin of mitochondria, and the low proportion of intergenic sequences in its genome. There are no missense substitutions or indels detected in genes coding for mitochondrial transfer RNA (Mt-tRNA). Comparing concordance between analyses with SIFT and Polyphen shows that 24 variants are predicted as deleterious by SIFT and as probably damaging by Polyphen, and these variants are detailed in Table 2.

Table 2. Description of variants predicted as deleterious by SIFT and as probably damaging by Polyphen.

Position Ref. Alt. rsID MITOMAP Counts Conserved Gene CodonChange a.a Change
3388 C A . Known 1 No MT-ND1 Cta/Ata L/M
6237 C A . Known 2 No MT-CO1 Ctg/Atg L/M
6489 C A rs28461189 Known 3 Yes MT-CO1 Ctc/Atc L/I
7941 A G . Known 1 Yes MT-CO2 aAc/aGc N/S
7964 T C . Known 1 No MT-CO2 Ttc/Ctc F/L
7976 G A . Known 1 Yes MT-CO2 Ggc/Agc G/S
8563 A G . Known 1 No MT-ATP6 Aca/Gca T/A
8839 G A . Known 2 Yes MT-ATP6 Gcc/Acc A/T
8920 G A . Known 1 No MT-ATP6 Ggc/Agc G/S
9010 G A . Known 1 Yes MT-ATP6 Gct/Act A/T
9448 A G . Known 1 Yes MT-CO3 tAc/tGc Y/C
9500 C A . Unknown 1 No MT-CO3 ttC/ttA F/L
9577 T C . Unknown 1 Yes MT-CO3 cTa/cCa L/P
9903 T C rs199999390 Known 1 Yes MT-CO3 Ttt/Ctt F/L
11087 T C . Known 1 Yes MT-ND4 Ttc/Ctc F/L
12634 A G . Known 3 Yes MT-ND5 Atc/Gtc I/V
12923 G T . Known 1 No MT-ND5 tGa/tTa W/L
13129 C T . Known 1 Yes MT-ND5 Ccc/Tcc P/S
13973 A T . Known 1 No MT-ND5 cAa/cTa Q/L
14180 T C . Known 2 No MT-ND6 tAt/tGt Y/C
14484 T C . Known 2 No MT-ND6 Atg/Gtg M/V
14769 A G rs28357679 Known 2 Yes MT-CYB aAc/aGc N/S
15218 A G rs2853506 Known 10 No MT-CYB Aca/Gca T/A
15773 G A . Known 1 Yes MT-CYB Gta/Ata V/M

Ref.: Reference allele

Alt.: Alternative allele

Freq.: Minor allele frequency observed in our data

a. a. change: amino acid change

Table 3 contains a description of observed variants unknown in MITOMAP, which represent 3% of all detected variants. None of these variants is detected at high frequency in our dataset (all were observed once, with the exception of the A-T variant at bp3385 which was observed twice), and most do not affect conserved positions. These observations are consistent with unknown rare polymorphisms, and these variants are also absent from dbSNP.

Table 3. Description of variants observed but unknown in MITOMAP.

Pos. Ref. Alt. Conserved Gene Effect Sift Polyphen
393 T A No - - - -
1713 A G No MT-RNR2 Non coding exon - -
1807 T C No MT-RNR2 Non coding exon - -
2150 T TA Yes MT-RNR2 Non coding exon - -
2156 A AT No MT-RNR2 Non coding exon - -
3385 A T No MT-ND1 missense deleterious possibly damaging
4875 C T No MT-ND2 synonymous - -
5573 A G No MT-TW Non coding exon - -
5746 GA G No - - - -
6113 A T No MT-CO1 synonymous - -
6200 C T No MT-CO1 synonymous - -
6569 C T No MT-CO1 synonymous - -
6608 C T No MT-CO1 synonymous - -
6812 A G No MT-CO1 synonymous - -
7004 A G No MT-CO1 synonymous - -
7366 C T Yes MT-CO1 missense tolerated benign
8263 C T No MT-CO2 synonymous - -
8465 C T No MT-ATP8 missense tolerated probably damaging
8673 A G No MT-ATP6 synonymous - -
9138 C T No MT-ATP6 synonymous - -
9370 A T No MT-CO3 missense tolerated benign
9500 C A No MT-CO3 missense deleterious probably damaging
9577 T C Yes MT-CO3 missense deleterious probably damaging
9873 C A No MT-CO3 missense tolerated probably damaging
9890 A G No MT-CO3 synonymous - -
10030 C T No MT-TG Non coding exon - -
10094 C A No MT-ND3 synonymous - -
12098 C T No MT-ND4 synonymous - -
12266 A G No MT-TL2 Non coding exon - -
13380 C T No MT-ND5 synonymous - -
13792 C T No MT-ND5 synonymous - -
13806 C T No MT-ND5 synonymous - -
15620 C T Yes MT-CYB missense deleterious possibly damaging
16229 T A No - - - -
16454 C T No - - - -

Pos.: position on mitochondrial genome

Ref.: Reference allele

Alt.: Alternative allele

Table 4 summarizes the distribution of variants by mitochondrial gene, and displays the global variant enrichment rate r g and the weighted enrichment rate r w for each gene. The mean global variant enrichment per gene (Mt-tRNA genes excluded) is 64.2 variants per 1Mb. The mean weighted variant enrichment rate is 0.44 variants.person/Mb. Genes with the highest enrichment are MT-ATP6 with 86.5 variants/Mb, and MT-CYB (cytochrome B) with 84.1 variants/Mb. A Shapiro test was used to verify that r g follows a normal distribution (p-value = 0.34). r g values of MT-ATP6 and MT-CYB are positioned at the 93th and at the 90th percentile of the distribution. They are also among genes having the highest weighted variant enrichment rate (0.92 and 0.91 respectively).

Table 4. Distribution of variants and description by mitochondrial gene.

Variant Enrichment is performed for non Mt-tRNA genes (Variants per 1Mb).

Gene Symbol Type Length Variant count Enrichment r g Enrichment r w
MT-RNR1 Mt_rRNA 955 34 35.6 1.07
MT-RNR2 Mt_rRNA 1561 57 37.2 0.64
MT-ND1 protein_coding 957 58 60.6 0.27
MT-ND2 protein_coding 1043 64 61.4 0.62
MT-CO1 protein_coding 1543 98 63.5 0.29
MT-CO2 protein_coding 685 44 64.2 0.11
MT-ATP8 protein_coding 208 15 72.1 0.12
MT-ATP6 protein_coding 682 59 86.5 0.92
MT-CO3 protein_coding 785 60 76.4 0.23
MT-ND3 protein_coding 347 24 69.2 0.30
MT-ND4L protein_coding 298 20 67.1 0.23
MT-ND4 protein_coding 1379 85 61.6 0.46
MT-ND5 protein_coding 1813 134 73.9 0.27
MT-ND6 protein_coding 526 26 49.4 0.24
MT-CYB protein_coding 1142 96 84.1 0.91

r g: global variants enrichment rate, in variants/Mb.

r w weighted variants enrichment rate, in variants.person/Mb

Two genes, MT-RNR1 and MT-RNR2, show low global enrichment rates (35.6 and 37.2 variants/Mb, respectively). These two genes code for the mitochondrial 12S and 16S ribosomal RNAs, respectively structural components of the small and large ribosomal subunits of mitoribosomes. Given their essential structural function, it is not surprising to see that these genes are more conserved than other mitochondrial genes.

As represented in Fig 1, we can observe 3 groups of genes. MT-RNR1 and MT-RNR2 represent the group of MT-RNAs genes. They are characterized by a low number of distinct variants, some having a high frequency in our population. On the other hand, MT-CYB and MT-ATP6 are both characterized by a high number of distinct variants (high r g value). They also are the most frequently altered genes in our population (high r w value). The third group is composed of the remaining genes.

Fig 1. Distribution of r g and r w for all non-MT-tRNAs genes.

Fig 1

A. Distribution of r g by bins of 1. B. Distribution of r w by bins of 1. White dots represents MT-CYB and MT-ATP6. Black dots represents MT-RNR1 and MT-RNR2. Grey dots represent other protein coding genes. * r w value for MT-RNR1 gene is artificially increased because of 2 rare variants carried in the reference sequence in this gene.

Discussion

In this study we have performed a deep characterization of mitochondrial genome variability among 436 women diagnosed with breast cancer, having a positive familial breast cancer history, and negative for BRCA1/2 mutation screening. Women were chosen based on the age at diagnosis and on the laterality of their cancer.

We identified 1157 distinct variants compared to the reference sequence rCRS. All the frequent variants observed in this study (i.e. with a relative frequency > 5%) are known in the MITOMAP database. The majority of these variants are common mitochondrial SNPs, some of which have been associated with an increased risk of breast cancer[28, 29]. Of the 35 variants (including 3 indels) that are unknown in MITOMAP database, (also unknown in dbSNP), none was observed in more than 3 individuals among the 436 samples that were sequenced. There is no reason to reject these variants after visual examination of alignments with IGV. They could be rare variants or private mutations, i.e. rare mutations restricted to a few families, appeared recently in mitochondrial genome evolution. 4 variants are predicted as deleterious by SIFT, 2 are predicted possibly damaging, and 4 probably damaging by PolyPhen. None of these 6 loci have been described in the literature.

No variant predicted as deleterious by SIFT and probably damaging by PolyPhen was observed more than 3 times in the 436 samples sequenced, except the polymorphism rs2853506 (A15218G), which was observed in 10 samples (2%). This polymorphism has been found associated with epileptogenesis[30]. This SNP is located in the gene coding for cytochrome b, MT-CYB. Along with cytochrome c1 and Rieske protein, cytochrome b is one of the three respiratory subunits of complex III, the third complex involved in Electron Transport Chain. Alterations in this subunit have been shown to modify catalytic capacities of complex III[31].

Although the coverage profile is not homogeneous or constant along the mitochondrial genome, this profile is robust through the 10 independent sequencing runs that were performed in this study. Furthermore, despite the relatively low alignment rate, we have a mean depth > 50X for approximately 90% of mitochondrial genome. Similar coverage profiles were obtained in other studies on Ion Torrent sequencing platforms and with various aligners[32,33].

Whereas the Ion Torrent sequencing technology has been reported to have very high confidence rates regarding point mutation sequencing, the reliability of detecting insertions and deletions (indels) has been a subject of intense debate and a significant proportion of indels called with this technology is likely to represent false positives [3335]. In our study, only 10 out of 111 indels initially called passed filters, and 93% of these were discarded in at least one sample due to an unbalanced repartition of reads supporting alternative alleles on the two DNA strands, with less than 10% of reads on one strand. This strand specific deletion bias has already been observed in Ion Torrent [36]. Furthermore, in our results, 8 out of the 10 deletions that have passed filters are located in homopolymer tracts. The only 2 remaining indels seem highly reliable when visualized with a viewer like IGV[22,23]. Ion Torrent sequencing is known to be prone to errors for homopolymeric tracts of length more than 3, the uncertainty of pH measure often leading to a misestimation of homopolymer length. Loman et al. have estimated the Ion Torrent PGM homopolymer-associated indel errors at 1.5 per 100 bases[37]. Better results are usually obtained on Ion Torrent PGM data by using an aligner based on Burrows-Wheeler Transform (BWT) algorithm[38], an alignment algorithm used by BWA (the aligner we used) or the mapper included in Nextgene sequencing analysis software (SoftGenetics, State College, PA, USA).

Filtering non-homozygous variant calls enables us to obtain a restricted list of variants after having eliminated the high level of noise generated by Ion torrent sequencing. However, by doing this we choose to not analyze the potential heteroplasmy, or potential somatic changes in the mitochondrial genome in the blood sample sequenced, and to focus on inherited variants. We did so because heteroplasmy in blood may not be of interest for studying breast cancer etiology, as compared to heteroplasmy observed in breast tissue.

Our aim is to characterize in detail mitochondrial genome variability of women with a strong familial history of breast cancer, but without BRCA1/2 pathogenic mutations. Given the diminished capability of familial linkage and GWAS analyses to detect mitochondrial genome features, a targeted sequencing approach was needed. Several recent studies[38] underline the possibility that the remaining unexplained heritability of breast cancer could be explained by rare variants specific of only one or a few families. Population based methods like case-control studies are therefore not able to detect any associations with these variants. For this reason we have not included any control subjects in our study, and have included as many cases as feasible.

We identified 1157 variants; some of which were previously associated with breast cancer risk. However, as expected, we did not identify a common potentially pathogenic genomic pattern among the 436 women included in our study. It is therefore unlikely that mutations in the mitochondrial genome have the potential to explain a high proportion of the excess of breast cancer risk of women with a familial history but without BRCA1/2 mutations. New variants, unreferenced in MITOMAP and not described in the pubic literature were characterized, and some of them are predicted to potentially alter the function of the protein encoded by their genes. MT-ATP6 and MT-CYB, two genes coding for important structural subunits of the mitochondrial respiration chain, show the highest variability with the highest frequency in our data. These results are interesting in this context, and echo recent results underlying the potential role of mitochondrial variants as susceptibility factors for familial breast cancer risks[39]. Based on these elements, further studies are needed to examine if these variants influence breast cancer risk.

Alternative approaches are now required to try to understand a larger part of breast cancer heritability. Powerful technologies such as next generation sequencing can help to achieve this goal, and has already been applied in the context of breast cancer[40,41]. New consortiums are emerging, such as COMPLEXO which aims at deciphering breast cancer missing heritability by looking further into the human exome by using massive parallel sequencing technologies[42]. Our results provide evidence that the mitochondrial genome should be considered when designing studies in order assess the role of mitochondrial genome variability in breast cancer risk and etiology.

Supporting Information

S1 File. Technical details of sequencing experiments.

Details of PCR primers used to isolate the mitochondrial genome including primer sequence, melting temperature, and amplicon size (Table A). Post-sequencing read characteristics (Table B). Post-sequencing mitochondrial genome coverage (Table C). Coverage distribution along the mitochondrial genome with standard deviation for all 436 samples. Coverage in overlapping regions is divided by two to take into account the overlap (Figure A).

(DOCX)

S2 File. Full list of variants identified.

Details of all variants identified by sequencing 436 mitochondrial genomes (Table A).

(XLSX)

Acknowledgments

We wish to thank the genetic epidemiology platform (the PIGE, Plateforme d'Investigation en Génétique et Epidémiologie: S. Eon-Marchais, M. Marcou, D. Le Gal, L. Toulemonde, J. Beauvallet, N. Mebirouk, E. Cavaciuti, A. Fescia), the biological resource center (C. Verny-Pierre, L. Barjhoux, V. Sornin) and all the GENESIS collaborating cancer clinics (Clinique Sainte Catherine, Avignon: H. Dreyfus; Hôpital Saint Jacques, Besançon: M-A. Collonge-Rame; Institut Bergonié, Bordeaux: M.Longy, A. Floquet, E. Barouk-Simonet; CHU, Brest: S. Audebert; Centre François Baclesse, Caen: P. Berthet; Hôpital Dieu, Chambéry: S. Fert-Ferrer; Centre Jean Perrin, Clermont-Ferrand: Y-J. Bignon; Hôpital Pasteur, Colmar: J-M. Limacher; Hôpital d’Enfants CHU—Centre Georges François Leclerc, Dijon: L. Faivre-Olivier; CHU, Fort de France: O. Bera; CHU Albert Michallon, Grenoble: D. Leroux; Hôpital Flaubert, Le Havre: V. Layet; Centre Oscar Lambret, Lille: P. Vennin†, C. Adenis; Hôpital Jeanne de Flandre, Lille: S. Lejeune-Dumoulin, S. Manouvier-Hanu; CHRU Dupuytren, Limoges: L. Venat-Bouvet; Centre Léon Bérard, Lyon: C. Lasset, V. Bonadona; Hôpital Edouard Herriot, Lyon: S. Giraud; Institut Paoli-Calmettes, Marseille: F. Eisinger, L. Huiart; Centre Val d’Aurelle—Paul Lamarque, Montpellier: I. Coupier; CHU Arnaud de Villeneuve, Montpellier: I. Coupier, P. Pujol; Centre René Gauducheau, Nantes: C. Delnatte; Centre Catherine de Sienne, Nantes: A. Lortholary; Centre Antoine Lacassagne, Nice: M. Frénay, V. Mari; Hôpital Caremeau, Nîmes: J. Chiesa; Réseau Oncogénétique Poitou Charente, Niort: P. Gesta; Institut Curie, Paris: D. Stoppa-Lyonnet, M. Gauthier-Villars, B. Buecher, A. de Pauw, C. Abadie, M. Belotti; Hôpital Saint-Louis, Paris: O. Cohen-Haguenauer; Centre Viggo-Petersen, Paris: F. Cornélis; Hôpital Tenon, Paris: A. Fajac; GH Pitié Salpétrière et Hôpital Beaujon, Paris: C. Colas, F. Soubrier, P. Hammel, A. Fajac; Institut Jean Godinot, Reims: C. Pennet, T. D. Nguyen; Polyclinique Courlancy, Reims: L. Demange†, C. Pennet; Centre Eugène Marquis, Rennes: C. Dugast; Centre Henri Becquerel, Rouen: A. Chevrier, T. Frebourg, J. Tinat, I. Tennevet, A. Rossi; Hôpital René Huguenin/Institut Curie, Saint Cloud: C. Noguès, L. Demange†, E. Mouret-Fourme; CHU, Saint-Etienne: F. Prieur; Centre Paul Strauss, Strasbourg: J-P. Fricker, H. Nehme-Schuster; Hôpital Civil, Strasbourg: O. Caron, C. Maugard; Institut Claudius Regaud, Toulouse: L. Gladieff, V. Feillel; Hôpital Bretonneau, Tours: I. Mortemousque; Centre Alexis Vautrin, Vandoeuvre-les-Nancy: E. Luporsi; Hôpital de Bravois, Vandoeuvre-les-Nancy: P. Jonveaux; Gustave Roussy, Villejuif: A. Chompret†, O. Caron).

Data Availability

All relevant data are within the paper and its Supporting Information files.

Funding Statement

This work was supported by Ligue Nationale contre le Cancer (3 grants: PRE05/DSL, PRE07/DSL, PRE11/NA), www.ligue-cancer.net. GENESIS: study design and data collection. This work also received support from the French National Institute of Cancer (Grant INCa n°2008-029/LL-LC), http://www.e-cancer.fr/. (GENESIS: study design and data collection); and the comprehensive cancer center SiRIC Curie, (Site de Recherche Intégrée sur le Cancer: Grant INCa-DGOS-4654), http://www.institut-curie.org/research/institut-curie-integrated-cancer-research-site-siric; www.ligue-cancer.net. GENESIS: study design and data collection.

References

  • 1. Ferlay J., Shin HR, Bray F, Forman D, Mathers C, Parkin DM. Estimates of worldwide burden of cancer in 2008: GLOBOCAN 2008. Int. J. Cancer 2010. 127(12) [DOI] [PubMed] [Google Scholar]
  • 2. Lalloo F, Evans DG. Familial Breast Cancer. Clin. Genet. 2012. 82(2) [DOI] [PubMed] [Google Scholar]
  • 3. Newman B, Austin MA, Lee M, King MC. Inheritance of human breast cancer: evidence for autosomal dominant transmission in high-risk families. Proc. Natl. Acad. Sci. U. S. A. 1988. May; 85(9) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Hall JM, Lee MK, Newman B, Morrow JE, Anderson LA, Huey B, et al. Linkage of early-onset familial breast cancer to chromosome 17q21. Science 1990. December 21; 250(4988). [DOI] [PubMed] [Google Scholar]
  • 5. Miki Y, Swensen J, Shattuck-Eidens D, Futreal PA, Harshman K, Tavtigian S, et al. A strong candidate for the breast and ovarian cancer susceptibility gene BRCA1. Science 1994. October 7; 266(5182). [DOI] [PubMed] [Google Scholar]
  • 6. Evans DG., Shenton A, Woodward E, Lalloo F, Howell A, Maher ER. Penetrance estimates for BRCA1 and BRCA2 based on genetic testing in a Clinical Cancer Genetics service setting: risks of breast/ovarian cancer quoted should reflect the cancer burden in the family. BMC Cancer 2008. May 30; 8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Antoniou A, Pharoah PD., Narod S, Risch HA, Eyfjord JE, Hopper JL. et al. Average risks of breast and ovarian cancer associated with BRCA1 or BRCA2 mutations detected in case Series unselected for family history: a combined analysis of 22 studies. Am. J. Hum. Genet. 2003. May; 72(5). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Lynch HT, Snyder C, Casey MJ. Hereditary ovarian and breast cancer: what have we learned? Ann. Oncol. 2013. November; 24 [DOI] [PubMed] [Google Scholar]
  • 9. Melchor L, Benítez J. The complex genetic landscape of familial breast cancer. Hum. Genet. 2013. August; 132(8) [DOI] [PubMed] [Google Scholar]
  • 10. Jezierska-Drutel A, Rosenzweig SA, Neumann CA Role of oxidative stress and the microenvironment in breast cancer development and progression. Adv. Cancer Res. 2013; 119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Loft S, Olsen A, Møller P, Poulsen HE, Tjønneland A. Association between 8-oxo-7,8-dihydro-2’-deoxyguanosine excretion and risk of postmenopausal breast cancer: nested case-control study. Cancer Epidemiol. Biomark. Prev. 2013. July; 22(7). [DOI] [PubMed] [Google Scholar]
  • 12. Wallace DC. Mitochondria and cancer. Nat. Rev. Cancer 2012. October; 12(10). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Brandon M, Baldi P, Wallace DC. Mitochondrial mutations in cancer. Oncogene 2006. August 7; 25(34). [DOI] [PubMed] [Google Scholar]
  • 14.Andrieu N, Dondon MG, Eon-Marchais S, Damiola F, Marcou M, Barjhoux L, et al. GENESIS: a French national resource to study the missing heritability of breast cancer. Abstract presented at the 2014 American Society of Human Genetics meeting available at http://www.ashg.org/2014meeting/abstracts/fulltext/f140122138.htm
  • 15. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009. July 15; 25(14). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.FastQC project. at <http://www.bioinformatics.bbsrc.ac.uk/projects/fastqc>
  • 17. Li H, Handsaker B, Wysoker A, Fennel T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009. August 15; 25(16). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Picard project. at <http://picard.sourceforge.net>
  • 19. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010. September; 20(9). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. DePristo MA, Banks E, Polin R, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 2011. May; 43(5). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Van der Auwera GA, Carneiro MO, Hartl C, Poplin R., del Angel G, Levy-Moonshine A et al. in Current Protocols in Bioinformatics (John Wiley & Sons, Inc, 2002). at <http://onlinelibrary.wiley.com/doi/10.1002/0471250953.bi1110s43/abstract> [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Robinson JT, Thorvaldsdottir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative genomics viewer. Nat. Biotechnol. 2011. January; 29(1). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Thorvaldsdóttir H, Robinson JT, Mesirov JP. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief. Bioinform. 2013. March; 14(2). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. McLaren W, Pritchard B, Rios D, Chen Y, Flicek P, Cunningham F. Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor. Bioinformatics 2010. August 15; 26(6). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Ruiz-Pesini E, Lott MT, Procaccio V, Poole JC, Brandon MC, Mishmar D, et al. An enhanced MITOMAP with a global mtDNA mutational phylogeny. Nucleic Acids Res. 2007. January; 35(Database issue). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, et al. A method and server for predicting damaging missense mutations. Nat. Methods 2010. April; 7(4). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Kumar P, Henikoff S, Ng PC. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat. Protoc. 2009; 4(7). [DOI] [PubMed] [Google Scholar]
  • 28. Bai R-K, Leal SM, Covarrubias D, Liu A, Wong L-JC. Mitochondrial genetic background modifies breast cancer risk. Cancer Res. 2007. May 15; 67(10). [DOI] [PubMed] [Google Scholar]
  • 29. Cheng M, Guo Z, Li H, Li Z, Li C, Geng C. Identification of sequence polymorphisms in the mitochondrial displacement loop as risk factors for sporadic and familial breast cancer. Tumour Biol. 2014. May; 35(5). [DOI] [PubMed] [Google Scholar]
  • 30. Soini HK, Moilanen JS, Vilmi-Kerälä T, Finnilä S, Majamaa K. Mitochondrial DNA variant m.15218A > G in Finnish epilepsy patients who have maternal relatives with epilepsy, sensorineural hearing impairment or diabetes mellitus. BMC Med. Genet. 2013. July 19; 14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Legros F, Chatzoglou E, Frachon P, Ogier De Baulny H, Laforet P, Jardel C, et al. Functional characterization of novel mutations in the human cytochrome b gene. Eur. J. Hum. Genet. 2001. July; 9(7). [DOI] [PubMed] [Google Scholar]
  • 32. Picardi E, Pesole G. Mitochondrial genomes gleaned from human whole-exome sequencing. Nat. Methods 2012. May 30; 9(6). [DOI] [PubMed] [Google Scholar]
  • 33.Ion Torrent Sequencing on Humans. BioLektures at <http://biolektures.wordpress.com/2011/12/10/ion-torrent-sequencing-on-humans/>
  • 34. Bragg LM, Stone G, Butler MK, Hugenholtz P, Tyson GW. Shining a light on dark Sequencing: characterising errors in Ion Torrent PGM Data. PLoS Comput Biol 2013. April; 9(4). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Quail M, Smith M, Coupland P, Otto TD, Harris SR, Connor TR, et al. A tale of three next generation sequencing platforms: comparison of Ion torrent, pacific biosciences and illumina MiSeq sequencers. BMC Genomics 2012. July 24; 13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Loman NJ, Misra RV, Dallman TJ, Constantinidou C, Gharbia SE, Wain J, et al. Performance comparison of benchtop high-throughput sequencing platforms. Nat. Biotechnol. 2012. May; 30(5). [DOI] [PubMed] [Google Scholar]
  • 37. Parson W, Strobi C, Huber G, Zimmermann B, Gomes SM, Souto L, et al. Reprint of: Evaluation of next generation mtGenome sequencing using the Ion Torrent Personal Genome Machine (PGM). Forensic Sci. Int. Genet. 2013. December; 7(6). doi: 10.1016/j.fsigen.2013.09.007 [DOI] [PubMed] [Google Scholar]
  • 38. Lynch H, Wen H, Kim YC, Snyder C, Kinarsky Y, Chen PX, et al. Can unknown predisposition in familial breast cancer be family-specific? Breast J. 2013. Dep-Oct; 19(5) [DOI] [PubMed] [Google Scholar]
  • 39. Tommasi S, Favia P, Weigl S, Bianco A, Pilato B, Russo L, et al. Mitochondrial DNA variants and risk of familial breast cancer: An exploratory study. Int. J. Oncol. 2014. May; 44(5) [DOI] [PubMed] [Google Scholar]
  • 40. Snape K, Ruark E, Tarpey P, Renwick A, Turnbull C, Seal S, et al. Predisposition gene identification in common cancers by exome sequencing: insights from familial breast cancer. Breast Cancer Res. Treat 2012. July; 134(1) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Thompson ER, Doyle MA, Ryland GL, Rowley SM, Choong DY, Tothill RW, et al. Exome sequencing identifies rare deleterious mutations in DNA repair genes FANCC and BLM as potential breast cancer susceptibility alleles. PLoS Genet. 2012. September; 8(9). doi: 10.1371/journal.pgen.1002979 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. COMPLEXO, Southey MC, Park DJ, Nguyen-Dumont T, Campbell I, Thompson E, et al. COMPLEXO: identifying the missing heritability of breast cancer via next generation collaboration. Breast Cancer Res. 2013. June 21; 15(3) [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 File. Technical details of sequencing experiments.

Details of PCR primers used to isolate the mitochondrial genome including primer sequence, melting temperature, and amplicon size (Table A). Post-sequencing read characteristics (Table B). Post-sequencing mitochondrial genome coverage (Table C). Coverage distribution along the mitochondrial genome with standard deviation for all 436 samples. Coverage in overlapping regions is divided by two to take into account the overlap (Figure A).

(DOCX)

S2 File. Full list of variants identified.

Details of all variants identified by sequencing 436 mitochondrial genomes (Table A).

(XLSX)

Data Availability Statement

All relevant data are within the paper and its Supporting Information files.


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES