Summary
Associations with EOC for variants at CHEK2 were found with the strongest association for the CHEK2 SNP rs6005807 with serous EOC. Common variants at 22q12.1 are associated with risk of serous EOC and CHEK2 as a plausible target susceptibility gene.
Abstract
Genome-wide association studies have identified 20 genomic regions associated with risk of epithelial ovarian cancer (EOC), but many additional risk variants may exist. Here, we evaluated associations between common genetic variants [single nucleotide polymorphisms (SNPs) and indels] in DNA repair genes and EOC risk. We genotyped 2896 common variants at 143 gene loci in DNA samples from 15 397 patients with invasive EOC and controls. We found evidence of associations with EOC risk for variants at FANCA, EXO1, E2F4, E2F2, CREB5 and CHEK2 genes (P ≤ 0.001). The strongest risk association was for CHEK2 SNP rs17507066 with serous EOC (P = 4.74 x 10–7). Additional genotyping and imputation of genotypes from the 1000 genomes project identified a slightly more significant association for CHEK2 SNP rs6005807 (r 2 with rs17507066 = 0.84, odds ratio (OR) 1.17, 95% CI 1.11–1.24, P = 1.1×10−7). We identified 293 variants in the region with likelihood ratios of less than 1:100 for representing the causal variant. Functional annotation identified 25 candidate SNPs that alter transcription factor binding sites within regulatory elements active in EOC precursor tissues. In The Cancer Genome Atlas dataset, CHEK2 gene expression was significantly higher in primary EOCs compared to normal fallopian tube tissues (P = 3.72×10−8). We also identified an association between genotypes of the candidate causal SNP rs12166475 (r 2 = 0.99 with rs6005807) and CHEK2 expression (P = 2.70×10-8). These data suggest that common variants at 22q12.1 are associated with risk of serous EOC and CHEK2 as a plausible target susceptibility gene.
Introduction
Epithelial ovarian cancer (EOC) is the leading cause of death from cancers of the female reproductive tract (1). Established protective risk factors for the disease include pregnancy, oral contraceptives and breast-feeding, factors which reduce the number of lifetime ovulatory cycles. The biological mechanisms underlying these associations are not well understood but may involve proliferation, inflammation, oxidative stress, mutation and DNA repair in the ovarian and/or fallopian tube epithelia (2–4). Several studies have highlighted the critical role in ovarian tumorigenesis of maintaining genomic integrity, either through DNA repair or apoptotic pathways (5).
The importance of highly penetrant genetic risk factors for EOC has become evident from studies of familial clustering of breast and ovarian cancers, largely caused by mutations in the BRCA1 and BRCA2 (6–8). Several additional susceptibility genes exist that confer more moderate risks of EOC (e.g. MSH6, MSH2, MLH1, RAD51C, RAD51D and BRIP1 (9–12)). Finally, several common genetic variants conferring relatively mild risks of EOC have been identified in genome-wide association studies (GWAS) (13–21).
Whereas GWAS are highly successful at identifying common variant susceptibility alleles for multiple complex traits and diseases using case–control study designs, only a fraction of the total number of risk variants for any one disease have been identified to date. For example, Michailidou et al. (22) estimated that many common risk variants for breast cancer await discovery; and the same is likely true of other cancers. One approach to identify additional susceptibility alleles is to integrate knowledge of disease biology with germline genetic datasets (i.e. candidate gene or pathway studies). This approach has recently been successful in identifying the 5p15 (TERT-CLPTM1L) susceptibility locus for breast, prostate and ovarian cancers (18).
CHEK2 is a putative tumor suppressor gene that encodes a protein kinase activated in response to DNA damage and has also been shown to interact with and phosphorylate BRCA1, promoting cellular survival after DNA damage (23). A deletion variant in CHEK2 (CHEK2*1100delC) is associated with a 2–3-fold increased risk of breast cancer but is not associated with ovarian cancer risk (24). However, other CHEK2 coding variants may confer susceptibility to ovarian cancer, and in one study, five CHEK2 missense variants were identified in an analysis of 360 EOC cases, two of which were predicted to be damaging in functional assays (25). Common low penetrance variants may also exist in and around the CHEK2 gene. We previously evaluated associations for germline genetic variants spanning 53 DNA damage response and repair genes and risk of invasive serous EOC in 364 EOC cases and 761 controls from the North Carolina Ovarian Cancer Study (NCOCS), and identified borderline evidence of an association for two variants (rs5762746 and rs6005835) in CHEK2 (26).
In this study, we expanded the examination of risks associated with DNA repair genes by genotyping 2896 single nucleotide polymorphisms (SNPs) and indel variants spanning 143 gene loci in DNA samples of 15 397 EOC cases and 30 816 controls as part of the Collaborative Oncological Gene-environment Study (COGS). Imputation of additional SNPs in COGS further defined risk associations at the CHEK2 locus. Finally, we integrated germline genetic data with functional annotation of the region to identify the most likely disease-causing susceptibility variants and target genes at this locus.
Materials and methods
Genetic association analyses
Study datasets
Genetic association analyses were carried out using data from several Ovarian Cancer Association Consortium (OCAC) genotyping projects. Study subjects were of European ancestry (determined using principal components analysis of genotype data): 2162 cases and 2564 controls from a GWAS from North America (‘US GWAS’), 1763 cases and 6118 controls from a UK-based GWAS (‘UK GWAS’) and 441 cases and 442 controls from a second GWAS from North America (‘Mayo GWAS’) (13–19,21). In total, 11 030 cases and 21 693 controls from 41 OCAC studies were genotyped using the iCOGS array (‘OCAC-iCOGS’ stage 1 data). The USA and UK GWAS were comprised of several independent case–control studies, and samples from some of these studies were also subsequently genotyped using the iCOGS array. Combined, these studies comprised 15 396 independent cases and 30 817 controls. All duplicates were removed. Further details of the component studies are in Supplementary Table 1, available at Carcinogenesis Online. Details of genotyping platform are shown in Supplementary Table 2, available at Carcinogenesis Online.
Variant selection
To select genes, we expanded upon the 53 genes included in our earlier investigation (26) using the gene sets described in Wood et al. (27) plus literature and gene ontology database searches, ultimately identifying 143 genes whose functions relate to DNA damage recognition and repair processes for inclusion in this study. For each gene, we identified all SNPs within genome windows ranging from 10kb upstream of the transcription start sites to 10kb downstream of the transcription end sites of the 143 genes DNA repair genes. These variants were included in the US ovarian cancer GWAS database of SNPs imputed by the MACH software package against Hapmap Phase II genotypes (Release 22, NCBI build 36) for 60 CEU founders. We used these data to conduct a preliminary association analysis and to identify tag sets for each region, tagging polymorphisms with minor allele frequencies (MAF) of at least 0.025 to an r 2 of 0.80 or above. We ranked the genes on basis of the most significant variants within each gene. We tagged SNPs at TP53, CHEK2 and the top five ranked genes to an r 2 of 0.975, the genes ranked 6 through 124 to an r 2 of 0.90 and the 20 lowest ranked genes to an r 2 of 0.80. We then chose the SNP with the highest Illumina design score as the tag in each r 2 bin, choosing the most highly significant SNP when there were ties. This yielded 3651 variants that were included on the COGS Illumina custom iSelect chip (iCOGS). After quality control analysis, 3252 variants passed QC of which 53 had MAF < 0.02 (Supplementary Table 3 and Figure 1, available at Carcinogenesis Online).
Imputation
OCAC-iCOGS samples and each of the GWAS sets were imputed separately. Variants were imputed from the 1000 Genomes Project data using the v3 April 2012 release as the reference panel. We used a two-step procedure, which involved prephasing in the first step (using the SHAPEIT software and imputation, of the phased data in the second step using the IMPUTE version 2 software (28). To perform the imputation, we divided the data into segments of ~5Mb and excluded variants from the association analysis if their imputation accuracy was r 2 < 0.25 or their MAF was <0.005. The number of successfully imputed SNPs by MAF is shown in Supplementary Table 4, available at Carcinogenesis Online.
Data analysis
Analyses were restricted to women of European intercontinental ancestry. We performed principal components analysis using a set of ~37 000 unlinked markers to control for population substructure; an in-house program was utilized (available at http://ccge.medschl.cam.ac.uk/software/). Unconditional logistic regression treating the number of alternate alleles carried as an ordinal variable (log-additive, codominant model) was used to evaluate the association between each variant and EOC risk. A likelihood ratio statistic was used to examine significance of association, and per-allele log odds ratios (OR) and 95% confidence interval (CI) were estimated. The logistic regression model was adjusted for study and population substructure by including study-specific indicators and a variable number of eigenvalues from the principal components analyses. The number of principal components was chosen based on the position of the inflexion of the principal components screen plot. Two principal components were included in the analysis of the UK and US GWAS data sets, one was used for the Mayo GWAS and five were used for the COGS-OCAC dataset. Results from the three GWAS and COGS were combined using fixed-effects, inverse variance weighted meta-analysis.
Functional analyses
Public databases
To perform functional annotation of 293 candidate variants and expression quantitative trait locus (eQTL) analysis at the CHEK2 locus, we mined the following databases: ENCODE (http://genome.ucsc.edu); Haploreg (http://www.broadinstitute.org/mammals/haploreg/haploreg.php); and The Blood eQTL Browser (http://genenetwork.nl/bloodeqtlbrowser/) (29).
Profiling of epigenetic marks in ovarian cancer and ovarian cancer precursor cells
FAIREseq and ChIPseq profiles were generated for two immortalized normal ovarian surface epithelial cell lines (IOE4 and IOE11, generated in-house) and two fallopian secretory epithelial cell lines (FT33 and FT246, from Dr R Drapkin) as described previously (30). The CaOV3 and UWB1.289 (31) ovarian cancer cell lines (from CRUK and ATCC, respectively) were also profiled. Prior to performing ChIPseq/FAIREseq, cell lines were authenticated using the Promega PowerPlex16HS Assay (performed at the University of Arizona Genetic Core), and mycoplasma-specific PCR was performed to ensure cell lines were not contaminated with mycoplasma infections. ChIPseq was performed using antibodies that recognized histone 3 lysine 27 acetylation (H3K27ac) and histone 3 lysine 4 monomethylation (H3K4me) (Abcam). Identification of variants predicted significantly to alter transcription factor motifs was performed as described by Hazelett et al. (32).
Collection of normal epithelial samples
Early passage primary normal ovarian surface epithelial cells and fallopian tube secretory epithelial cells were obtained from disease-free ovaries and fallopian tubes collected during gynecological surgical procedures taking place at the University of Southern California, Los Angeles (the Gynecological Tissue and Fluid Repository), University College Hospital, London and Oregon Health and Science University. All samples were collected with informed patient consent. Methods for the collection have been described previously (33,34). RNA extraction was performed from cells at 80% confluence using standard protocols. Cell lines were confirmed to be free of mycoplasma, but were not authenticated as they were novel cell lines used at an early passage.
eQTL data analysis
For each sample, 500ng RNA were reverse transcribed using the Superscript III kit (Life Technologies). TaqMan® was used to quantify CHEK2 gene expression using a TaqMan® gene expression probe (Hs00200485_m1, Life Technologies). Four control genes were also included: ACTB, Hs00357333_g1; GAPDH, Hs02758991_g1; HMBS, Hs00609293_g1; and HPRT1 Hs0280069_m1 (all Life Technologies). Relative expression levels were calculated using the ΔΔCt method. Correlations between genotype and gene expression were calculated in R using a Jonckheere–Terpstra trend test (for three groups) or the Wilcoxon rank sum statistic (for 2 groups).
eQTL analysis using TCGA data
Publicly available microarray and germline genotyping data for high-grade serous EOCs was downloaded from TCGA (Lawrenson et al, Nat Comm, accepted). For each case, germline genotyping data were used to determine ancestry using principal components through EIGENSTRAT software (HapMap profiles were used as a control set). Only cases with complete Northern or Western European ancestry were included. Cis-eQTL analyses were performed for all genes in a 1MB region spanning the top variant, using a method we have described previously (35). Associations between risk variant genotypes and mRNA expression for 339 cases were evaluated using a linear regression model adjusting for the effects of copy number and methylation. The Benjamini–Hochberg method was used to adjust for multiple testing. A significant association was defined by a false discovery rate of less than 0.1.
Results
Genetic association analyses of DNA repair gene loci
For 143 DNA repair genes, we identified 2896 tagging SNPs (minor allele frequencies (MAFs) > 2%) that lie within 10kb upstream and downstream of the transcription start and end sites of each gene; these variants were genotyped as part of iCOGS. Of these, 2 621 were successfully genotyped in 46 213 subjects from 43 studies. This sample included 15 397 women diagnosed with invasive EOC, of whom 9608 cases had serous ovarian cancer, and 30 816 controls. Details of the study populations, genotyping platforms used for each data set and quality control analysis are given in Supplementary Tables 1–3 and Figure 1, available at Carcinogenesis Online.
Supplementary Table 5, available at Carcinogenesis Online, lists the DNA repair genes evaluated in this study, including the number of tag SNPs at each locus, and the significance of association (P value) with serous ovarian cancer for the most significant risk-associated variant for each gene. The data for all SNPs and genes are illustrated in Figure 1A. SNPs at 6 different genes were associated with serous ovarian cancer risk at a P value threshold of 0.001: FANCA on chromosome 16q24.3 (P = 0.001); EXO1 on chromosome 1q43 (P = 0.0005); E2F4 on chromosome 16q22.1 (P = 0.0005); E2F2 on chromosome 1p36.12 (P = 0.0004); CREB5 on chromosome 7p15.1 (P = 0.0002) and CHEK2 on chromosome 22q12.1 (P = 4.7×10−7).
Figure 1.
(A) Manhattan plot illustrating associations between 2621 SNPs spanning 143 DNA repair genes and risk of high-grade serous ovarian cancer. SNPs colored red are the top ranked risk associations at the CHEK2 gene locus. (B) Regional association plot for the 22q12.1 locus showing the distribution of genotyped (black dots) and imputed (red dots) SNPs and the genetic architecture with respect to the 19 genes in the region.
The genomic inflation factor λ for the combined meta-analysis analysis was 1.15 (adjusted value to 1000 cases and controls λ1000 = 1.01). This may be due to cryptic population structure not accounted for by adjusting for principal components. However, there was no residual inflation observed for association with clear cell and mucinous ovarian cancers (λ = 1.01 and 0.93, respectively) and minimal inflation for the larger set of SNPs on the iCOGS array that had not been selected as candidates for EOC susceptibility (λ = 1.07, λ1000 = 1.004).
Genetic association analyses of the CHEK2 gene locus
CHEK2 showed the strongest evidence of association with serous ovarian cancer risk, for SNP rs17507066 (odds ratio (OR) 0.86; 95% confidence interval (95% CI) 0.81–0.91). Because CHEK2 is a known moderately penetrant susceptibility gene for breast cancer, additional common variants in the region spanning this gene had been included on the iCOGS array at providing a greater density within the region than for other DNA repair genes (16). Genotype data were available for a further 176 variants in this sample set in addition to the 24 tagging SNPs originally evaluated. Further genotyping identified rs9625477 with a marginally more significant association with serous ovarian cancer risk (P = 2.4×10−7). The data for the association analysis of all 200 genotyped variants in the region in serous ovarian cancer are given in Supplementary Table 6, available at Carcinogenesis Online.
We further evaluated this region after imputing genotypes for variants identified through the 1000 Genomes Project for all participants of European ancestry (Supplementary Table 7, available at Carcinogenesis Online). After excluding poorly imputed SNPs, a total of 4785 SNPs with an imputation r 2 > 0.3 and an estimated MAF >0.02% spanning a 2Mb region on 22q12.1 (nucleotide position 28 000 000 to 30 000 000) were analyzed for their associations with high-grade serous ovarian cancer (HGSOC) risk. This analysis identified multiple additional variants highly correlated with rs9625477, several of which were more significantly associated with disease risk. All imputed risk-associated variants with P value less than threshold 10−6 are given in Supplementary Table 7, available at Carcinogenesis Online. The most significant risk association was for SNP rs6005807 (OR 1.17, 95%CI 1.11–1.24, P = 1.1 − 10−7), which is correlated with rs9625477 (r 2 = 0.99). Risk associations for genotyped and imputed SNPs in HGSOC across the regions are illustrated in Figure 1B.
We stratified risk associations by histological subtype. The most significant SNP for serous ovarian cancer (rs6005807) was more weakly associated with all invasive subtypes of EOC combined (P = 2.9×10−6) and showed no evidence of association for clear cell (P = 0.65), endometrioid (P = 0.55) or mucinous (P = 0.33) subtypes. However, other variants in the region (all imputed) showed subtype-specific associations: rs78371015 (r 2 0.97 with rs6005807) was the strongest risk allele for the clear cell subtype (P = 0.0002); rs34051361 (r 2 = 0.56 with rs6005807) was associated with the endometrioid subtype (P = 0.0001); and the variant 22:29126347:D (r 2 = 0.92 with rs6005807) was associated with the mucinous subtype (P = 0.0001). Summary results are given in Table 1.
Table 1.
Risk associations for the top SNPs at the CHEK2 locus, by histological subtype
Top SNP | EOC subtype | Position | Number of cases | SNP location (nearest gene) | r 2 COGS | Odds ratio | 95% CI | P value |
---|---|---|---|---|---|---|---|---|
rs6005807 | Serous | 28934313 | 9608 | Intronic (TTC28) | 1 | 1.17 | 1.11–1.24 | 1.1×10−07 |
All invasive | 15 397 | 1.12 | 1.07–1.18 | 2.9×10−06 | ||||
Clear cell | 1172 | 1.04 | 0.89–1.20 | 0.65 | ||||
Endometrioid | 2385 | 1.03 | 0.93–1.15 | 0.55 | ||||
Mucinous | 1112 | 1.07 | 0.92–1.24 | 0.33 | ||||
rs5752754 | All invasive | 28925542 | 15 397 | Intronic (TTC28) | 0.94 | 1.11 | 1.06–1.16 | 2.4×10−06 |
rs78371015 | Clear cell | 29245611 | 1172 | Intergenic (ZNRF3) | 0.97 | 1.40 | 1.17–1.67 | 0.0002 |
rs34051361 | Endometrioid | 29582557 | 2385 | Intergenic (KREMEN1) | 0.56 | 1.57 | 1.24–1.98 | 0.0001 |
chr22:29126347:D | Mucinous | 29126347 | 1112 | Intergenic (ZNRF3) | 0.92 | 1.56 | 1.25–1.98 | 0.0001 |
Functional annotation of risk-associated variants
Two hundred and ninety-three variants (genotyped or imputed), representing the most likely candidate causal variants at the locus, had likelihood ratios greater than 1:100 compared with the most significant SNP in serous ovarian cancer. The majority of candidate causal variants were SNPs (267/293, 91.1%); the remaining 26/293 (8.9%) were indel polymorphisms. We annotated these SNPs with respect to protein coding genes, predicted functional motifs and regulatory elements cataloged in ENCODE and Haploreg. Two SNPs were located in protein coding regions of the TTC28 gene; both were synonymous and therefore unlikely to be of functional importance. The remaining variants were located in non-coding DNA regions: 274 (93.5%) were located within introns of the TTC28 gene, and 12 (4.1%) were located within introns of the CHEK2 gene (Figure 2). Fifty-four SNPs (18.4%) coincide with enhancer or promoter elements annotated in ENCODE, 51 SNPs (17.4%) are located in DNase hypersensitivity domains and 241 SNPs (82.3%) are predicted to alter transcription factor binding motifs in Haploreg (Supplementary Table 8, available at Carcinogenesis Online). To define further the overlaps between risk variants and putative functional features, we identified those variants predicted significantly to alter transcription factor binding sites (TFBSs) identified using data from FactorBook (32,36). We only considered TFBS variants that lie within regulatory DNA regions active in EOC precursor tissues. Active regulatory elements in normal ovarian and fallopian epithelial cells were profiled using formaldehyde assisted isolation of regulatory element sequencing (FAIRE-seq) to identify regions of open chromatin, and chromatin immunoprecipitation sequencing (ChIP-seq) for histone modification marks H3K4me1 and H3K27ac (37). We identified 25 instances where candidate SNPs altered TFBSs within active regulatory sequences (Supplementary Table 9, available at Carcinogenesis Online), suggesting that these SNPs may be the most likely candidate causal variants at this locus. HOCOMOCO (38) was also used to identify transcription factors that may bind to the risk associated SNPs. We found that rs12166475 is predicted to affect binding of WT1, and that rs9620817 and rs16986509 are predicted to alter TFBSs for BRCA1; both transcription factors are known to be important in risk of development of HGSOC. These data are summarized in Supplementary Table 10, available at Carcinogenesis Online and Figure 2C.
Figure 2.
Functional annotation of candidate genes and SNPs at the 22q12 locus. We analysed the genes and ovarian cancer risk associated variants in a 1MB region spanning the top ranked risk SNP at this locus: (A) The location of all protein coding genes in the 1 MB region spanning the ranked risk SNP, and the 298 SNPs in the region with a 100:1 likelihood of being causal; (B) Illustration of the regulatory elements identified by RNAseq and ChIPseq analysis (H3K4me1, H3K4me3 and H3K27ac) catalogued by ENCODE, illustrates the extent of overlap between candidate SNPs and epigenetic marks; (C) The three most significant candidate SNPs identified through a combination of functional annotation, eQTL analyses and transcription factor binding site prediction. Position weight matrices for each factor are shown with SNP position indicated. Ref, reference allele; Alt, alternative allele; PCT; percent of maximum position weight matrix score for each allele; (D) Gene expression in 489 HGSOCs and 8 normal fallopian tube tissue specimens performed by TCGA indicates CHEK2 is the most significantly differentially expressed gene in the region; (E) mRNA expression of each gene in an in vitro early stage transformation model of ovarian cancer. Expression in CMYC and KRAS transformed cells compared to untransformed immortalized ovarian epithelial (IOE) cells (*P < 0.05).
Functional analyses of candidate genes
We used somatic data to evaluate the role in EOC development of all protein-coding genes within a 1Mb region spanning the most significant risk-associated SNP (rs6005807) to identify the most likely susceptibility target gene. Six genes lie in the region: rs6005807 is located in an intron of TTC28 (tetratricopeptide repeat domain 28); 5′ prime of TTC28 are CHEK2, HSCB (HscB mitochondrial iron-sulfur cluster co-chaperon), CCDC117 (coiled-coil domain containing 117), XBP1 (X-box binding protein 1), and ZNRF3 (zinc and ring finger 3). We evaluated somatic genetic alterations in primary ovarian tumors for these six genes using The Cancer Genome Atlas (TCGA) data and other public databases (summarized in Table 2).
Table 2.
Candidate gene analyses and eQTL analyses
Gene | Functiona | TCGA gene expression tumor (n = 489) versus normal FT (n = 8) | TCGA somatic mutation rate HGSOC N = 316 (%) | COSMIC somatic mutations N unique mutations/N unique samples tested (%) |
Expression quantitative trait locus (eQTL) analysis | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Lymphocyte eQTL analyses | TCGA eQTL analyses | Normal OSEC (n = 65) rs1750706 P value |
|||||||||||
P value | Mean Normal FT | Mean HGSOC | SNP | P value | FDR | SNP | P value | FDR | |||||
TTC28 | Involved in mitosis and cytokinesis (39) | 0.010 | 0.347 | 0.001 | 0.0 | 67/8204 (0.8%) | NS | NS | NS | N/A | N/A | N/A | ND |
CHEK2 | Involved in DNA damage response | 3.7×10−8 | −1.29 | 0.011 | 0.3 | 249/9830 (2.5%) | rs12166475b | 2.70×10−8 | 0 | rs6005790 | 0.018 | 1.0 | 0.184 |
HSCB | Involved in iron-sulfur cluster biogenesis (40) | 0.368 | 0.065 | −0.020 | 0.0 | 16/8533 (0.2%) | rs9620817 | 1.21×10−6 | 0 | N/A | N/A | N/A | ND |
CCDC117 | Not characterized | 0.098 | −0.151 | −0.005 | 0.0 | 25/8326 (0.3%) | rs16986509 | 1.39×10−6 | 0 | N/A | N/A | N/A | ND |
XBP1 | Transcription factor, involved in the immune system | 0.130 | 0.268 | −0.015 | 0.0 | 21/8204 (0.3%) | rs12165715 | 1.16×10−127 | 0 | rs714191 | 0.045 | 1.0 | ND |
ZNRF3 | Ubiquitin ligase, involved in Wnt signaling (41) | 0.975 | 0.006 | 0.004 | 0.3 | 67/8305 (0.8%) | NS | NS | NS | N/A | N/A | N/A | ND |
NS, no significant eQTLs detected; ND, not done; OSEC, ovarian surface epithelial cell; N/A, not available, in TCGA level 3 data there were no data available for HSCB, CCDC117 and ZNRF3.
aGene functions were identified using the NCBI ‘Gene’ pages unless alternative references are provided. Blood eQTL analyses were performed using data from Ref (29), the top disease-associated eQTL SNP is shown for each gene. For CHEK2, HSCB, CCDC117 and XBP1, the top eQTL SNP was also disease associated. bMinor alleles were associated with increased expression of CHEK2, HSCB, CCDC117 and XBP1.
These analyses revealed that 11% of high-grade serous ovarian cancer (HGSOC) cases showed copy number gain or amplification in the region spanning the six candidate genes, whereas homozygous deletions were rare (<1% cases). We identified a somatic coding sequence mutation in both the CHEK2 and ZNRF3 genes out of 316 sequenced HGSOC cases. The mutation in CHEK2 is a missense (R346H) predicted to be of ‘high impact’ (mutationassesor.org). The ZNRF3 mutation is also missense (P805H), but predicted to have little functional impact. Using another database of somatic mutation frequencies (COSMIC), which includes data for over 8000 tumors, we observed that CHEK2 was the most frequently mutated (in 2.5% of cases) of all the genes in the region (42).
We examined differences in gene expression between normal and cancer tissues (Figure 2D). CHEK2 gene expression was significantly higher in HGSOCs (n = 489) compared to normal fallopian tube tissues (P = 3.7×10−8). TTC28 was the only other gene in the region that was differentially expressed in ovarian cancers with lower expression in tumors compared to normal control tissues (P = 0.01). We also evaluated the expression of these genes in a stepwise model of early-stage ovarian epithelial cell transformation driven by overexpression of the CMYC gene and mutant KRAS (Figure 2E) (43). In this model, CHEK2, HSBC, CCDC117, XBP1 and ZNFR3 all showed significantly increased expression in ovarian epithelial cells (P < 0.05) that had undergone early stage neoplastic transformation, suggesting that the upregulation of these genes may be an early event in EOC development.
eQTL analyses
We used eQTL analysis to evaluate associations between risk genotypes and the levels of mRNA expression for candidate genes in the region (29). We looked for cis-eQTL associations in both normal and tumor tissues. We did not detect an eQTL for CHEK2 in normal ovarian/fallopian epithelial cultures. In peripheral blood samples (N > 5300) we observed a particularly strong association between rs12165715 and XBP1 expression (P = 1.16×10−127) (29). We also detected associations between rs12166475 and CHEK2 expression (P = 2.70×10−8), rs9620817 and HSCB expression (P = 1.21×10−6), and rs16986509 and CCDC117 expression (P = 1.29×10−6). The variant most significantly associated with ovarian cancer risk at this locus (rs6005807) showed significant eQTL associations for CHEK2 (P = 9.22×10−6) and HSCB (P = 1.85×10−5). Importantly, for CHEK2, HSCB, CCDC117 and XBP1 the most significant eQTL SNPs were also disease associated. These data are summarized in Table 2. Three of the top 25 candidate variants from in silico functional annotation were amongst the most significant eQTL associations: rs12166475, associated with expression of CHEK2 coincides with a TFBS for EGR1, a transcription factor previously implicated in EOC development; rs9620817, associated with HSCB expression, is predicted to alter CEBPB and ETS1 motifs; and rs16986509 associated with CCDC117 expression is predicted to alter TFBSs for TAL1 and UA2 (Figure 2C).
Finally, we evaluated eQTL associations for all six protein-coding genes in the region in 339 primary HGSOC tissues using publicly available data from TCGA. Variations in gene expression were adjusted for changes in DNA copy number and methylation variation in each tumor. We observed no significant associations between rs6005807 and CHEK2, TTC28 or XBP1 at a P value threshold of 0.05, and false discovery rate threshold of 0.1. Details of the eQTL analyses are provided in Table 2.
Discussion
DNA repair mechanisms are important in the initiation and development of EOC and the current study represents the most comprehensive analysis of common genetic variation at DNA repair genes and EOC risk to date. We evaluated 2621 candidate variants spanning 143 gene regions at several different DNA repair pathways and found strong evidence of risk associations for SNPs at the CHEK2 gene locus that were just below the threshold for genome-wide significance. This is consistent with a smaller previous study in which we showed borderline evidence of risk associations for SNPs spanning this locus (26). Even though we did not find strong statistical evidence of risk associations for SNPs at other DNA repair gene loci, we cannot rule out that germline genetic variation at these genes is associated with EOC risk. We only performed detailed genotyping and imputation analysis at the CHEK2 locus because of the strength of its association from our initial screen, but additional analyses of the other gene loci in the future may identify other associations. COGS represents the largest genetic association study reported for EOC, but is still substantially smaller in sample size compared to GWAS for more common diseases such as breast cancer and coronary artery disease (22,44,45). Sample size has a substantial impact on the ability to identify risk associations, which partly explains why more common diseases have identified the most risk associations using GWAS. Disease heterogeneity may also have restricted our ability to identify risk variants for EOC as some ovarian cancer risk loci are subtype-specific (16,19). It is likely that common variants in various DNA repair genes confer susceptibility to subtype-specific EOC as observed for more highly-penetrant genes. Finally, rarer variants (MAF < 0.02) in these genes may confer susceptibility to EOC, but we did not have adequate power to detect such rarer associations in this study.
The genetic data suggest that variants at the CHEK2 locus are associated with risk of invasive EOC, but that the association for the top ranked SNP (rs6005807) is stronger with serous histology. We found no evidence that rs6005807 was associated with other EOC histologies but different imputed variants within the region showed evidence of association with clear cell, endometrioid and mucinous EOC at P values of 0.0002. Because these three EOC subtypes are less common than serous EOC, the weakness of these associations may simply reflect the smaller sample sizes available for their genotyping. One caveat to the association analyses lies in the minor residual inflation observed in the test statistics. This may be due to population structure but given that this inflation was greater than for other sets of SNPs this seems an unlikely explanation. An alternative explanation is an overall burden of weak susceptibility signals within this set of SNPs.
The most significant associations were identified using imputed genotypes, based on an estimated imputation r 2 that indicates a very high correlation between imputed genotypes and actual genotypes. It is therefore unlikely that there are other common variants within the region that may represent more highly associated SNPs than those already identified, and so these alleles represent the candidate causal variants for this locus. We identified 293 candidate causal polymorphisms that are virtually indistinguishable from each other with respect to their risk associations and any one (or even several) could be the causal SNP(s) influencing expression of the target susceptibility gene. Only two of the variants were in protein-coding regions and both were synonymous changes, suggesting that the causal SNP(s) likely reside in non-coding DNA. As such we neither know the functional basis for the genetic susceptibility, nor the target susceptibility gene(s). Our in silico analysis of these variants with respect to non-coding regulatory biofeatures profiled in multiple different cell lines by ENCODE, and in EOC precursor tissues, identified 25 risk SNPs that intersect with annotated functional elements. While this represents a relatively small number of candidate causal polymorphisms and functional targets at this locus, several other candidate causal SNPs may exist. None of the cell lines we evaluated have been comprehensively analysed for the full catalogue of non-coding regulatory elements and is possible that additional variants overlap regulatory marks that were not profiled; for example CTCF repressor marks and non-coding RNAs.
We identified three risk SNPs located within regulatory elements active in ovarian cells. These SNPs were also the most significant SNPs from eQTL analysis in the region: rs12166475 associated with CHEK2; rs9620187 associated with HSCB; and rs16986509 associated with CCDC117. In each case the minor allele was associated with increased gene expression; consequently higher CHEK2, HSCB and CCDC117 expression was associated with reduced cancer risk, whereas higher XBP1 expression was associated with higher cancer risk. The strongest candidate SNP is rs12166475, which alters the binding site for EGR1, a transcription factor involved in epithelial-to-mesenchymal transition in EOC (46). The alternative allele of this variant is also predicted to increase the binding affinity of WT1, a biomarker commonly expressed in HGSOCs. WT1 can have both repressive and activating effects on gene expression (47). Additional functional analysis of the possible interaction between rs12166475 and CHEK2 will be required to validate these findings and to elucidate the transcriptional consequences of allele-dependent EGR1/WT1 binding at the site of this SNP. The SNP rs9620817, which is most significantly associated with HSCB expression, is also a strong candidate. This SNP is predicted to alter CEBPB and ETS1 transcription factor binding sites (TFBS), although the difference in predicted binding affinity between the two SNP alleles is much greater for ETS1. Both rs16986509 and rs9620817 also alter binding sites for BRCA1, which may be of significance given that BRCA1-associated pathways are deregulated in approximately half of all HGSOCs (48). BRCA1 can function as a co-repressor or co-activator, and regulates gene expression by interacting with a myriad of different transcription factors, including TP53 and CMYC (49).
Although we conditioned our analysis at 22q12.1 on tagging variants spanning the CHEK2 gene, five other genes lie within a 500-kb region at either side of the most risk associated SNP that could be the target of risk-associated variants at this locus. However, somatic analysis of ovarian tumors from TCGA suggests that CHEK2 is the most likely target. It is the only gene in the region that is differentially expressed in ovarian tumors compared to normal fallopian tube tissues, suggesting that it may play a role in EOC development. While CHEK2 was overexpressed in ovarian tumors compared to normal fallopian tubes, in our eQTL analyses reduced CHEK2 expression was associated with increased cancer risk, which may suggest that overexpression occurs at later stages of tumorigenesis but lower CHEK2 expression is involved in early cancer development. This hypothesis is consistent with the moderate risk of breast cancer conferred by CHEK2 loss-of-function variants, where large population-based studies report estimated odds ratios for rare protein-truncating and splice-junction variants on the order of 6.18 (95% CI: 1.76–21.8) and 8.75 (95% CI: 1.06–72.2) for missense substitutions (50). Breast and EOC have shared genetic etiology for both high and low penetrance susceptibility genes, providing a rationale for why germline genetic variants in or around CHEK2 may be associated with EOC risk. No similar rationale applies for other genes in the region but eQTL analyses identified several genotype-gene expression associations that indicate alternative candidate target genes. The strongest association was with the XBP1 gene in peripheral lymphocytes, although the most significant eQTL SNP for this gene was not predicted to alter a TFBS within an active regulatory element in EOC precursor cells. We also identified highly statistically significant cis-eQTL associations between risk SNPs and expression of both CHEK2 and HSCB. However, the true importance of these eQTL associations is unclear given that they were identified in tissues that are not associated with EOC development. Several previous studies have highlighted the importance of tissue-specific gene expression when evaluating eQTLs, and stressed the need to perform eQTL analysis in tissues relevant to disease development (51). We did not identify eQTL associations for any of these genes in primary EOCs, although these studies were underpowered.
In summary, we provide evidence that common genetic variants in a region on chromosome 22q12.1 are associated with risk of serous ovarian cancer. The most likely target susceptibility gene at this locus is CHEK2 based on a combination of its known role in DNA damage response pathways, somatic variation in gene expression suggesting a role in EOC development, and a significant eQTL association with risk-associated variants. Future studies will be needed to increase the power of our genetic association studies by increasing sample size to confirm that this region is a true susceptibility locus for ovarian cancer. Finally, detailed functional characterization of this locus will be needed to confirm the functional impact of these candidate SNPs on their regulatory elements and establish their interactions and influence on the target susceptibility gene, as has been shown for other common variant susceptibility genes and alleles.
Supplementary material
Supplementary Tables 1–10 and Figure 1 can be found at http://carcin.oxfordjournals.org/
Funding
The scientific development and funding for this project were supported by the following: the Genetic Associations and Mechanisms in Oncology (GAME-ON): a NCI Cancer Post-GWAS Initiative (U19-CA148112); the COGS project is funded through a European Commission’s Seventh Framework Programme grant (agreement number 223175-HEALTH-F2-2009-223175); the Ovarian Cancer Association Consortium is supported by a grant from the Ovarian Cancer Research Fund thanks to donations by the family and friends of Kathryn Sladek Smith (PPD/RPCI.07); and Department of Defense Award (W81XWH-12-1-0561, W81XWH-07-0449). F.M. was supported by National Institutes of Health K07-CA080668; S.S.T. and E.M.P. were supported in part by Department of Defense Award W81XWH-12-1-0561. Funding of the constituent studies was provided by: California Cancer Research Program (00-01389V-20170, 2II0200); Cancer Prevention Institute of California; Department of Defense (DAMD17-02-1-0666, DAMD17-02-1-0669, W81XWH-10-1-02802); Fred C. and Katherine B. Andersen Foundation; Lon V Smith Foundation grant LVS-39420; Mayo Foundation; Minnesota Ovarian Cancer Alliance; National Institutes of Health (K07-CA095666, K07-CA143047, K22-CA138563); National Center for Research Resources/General Clinical Research Center grant M01-RR000056, N01-CN025403, N01-CN55424, N01-PC67001, N01-PC67010, P01-CA17054, P30-CA14089, P30-CA15083, P30-CA072720, P50-CA105009, P50-CA136393, P50-CA159981, R01-CA058860, R01-CA074850, R01-CA080742, R01-CA092044, R01-CA112523, R01-CA122443, R01-CA126841, R01-CA16056, R01-CA54419, R01-CA58598, R01-CA61132, R01-CA76016, R01-CA83918, R01-CA87538, R01-CA95023, R01-CA063678, R01 CA114343, R03-CA113148, R03-CA115195, U01-CA69417, U01-CA71966),UM1-CA182910; Rutgers Cancer Institute of New Jersey; and the US Public Health Service (PSA-042205). Personal support: K.L is supported by a K99/R00 grant from the National Cancer Institute (Grant number 1K99CA184415-01). D.F.E. is a Principal Research Fellow of Cancer Research UK. G.C.-T. and P.M.W. are supported by the National Health and Medical Research Council. B.K. holds an American Cancer Society Early Detection Professorship (SIOP-06-258-01- COUN) and the National Center for Advancing Translational Sciences (NCATS), Grant UL1TR000124.
Supplementary Material
Acknowledgements
This study would not have been possible without the contributions of the following: A. M. Dunning, P. Hall (COGS); D. C. Tessier, F. Bacot, D. Vincent,, S. LaBoissière and F. Robidoux and the staff of the genotyping unit (Genome Quebec); D. C. Whiteman, P. M. Webb, A. C. Green, N. K. Hayward, P. G. Parsons, D. M. Purdie, B. M. Smithers, D. Gotley, A. Clouston, I. Brown, S. Moore. K. Harrap, T. Sadkowski, S. O’Brien, E. Minehan, D. Roffe, S. O’Keefe, S. Lipshut, G. Connor, H. Berry, F. Walker, T. Barnes, J. Thomas, L. Terry, M. Connard, L. Bowes, M-R. Malt, J. White, C. Mosse, N. Tait, C. Bambach, A. Biankan, R. Brancatisano, M. Coleman, M. Cox, S. Deane, G. L. Falk, J. Gallagher, M. Hollands, T. Hugh, D. Hunt, J. Jorgensen, C. Martin, M. Richardson, G. Smith, R. Smith, D. Storey, J. Avramovic, J. Croese, J. D’Arcy, S. Fairley, J. Hansen, J. Masson, L. Nathanson, B. O’Loughlin, L. Rutherford, R. Turner, M. Windsor, J. Bessell, P. Devitt, G. Jamieson, D. Watson, S. Blamey, A. Boussioutas, R. Cade, G. Crosthwaite, I. Faragher, J. Gribbin, G. Hebbard, G. Kiroff, B. Mann, R. Millar, P. O’Brien, R. Thomas, S. Wood, S. Archer, K. Faulkner, J. Hamdorf (ACS); R. Stuart-Harris, F. Kirsten, J. Rutovitz, P. Clingan, A.Glasgow, A. Proietto, S. Braye, G. Otton, J. Shannon, T. Bonaventura, J. Stewart, S. Begbie, M. Friedlander, D. Bell, S. Baron-Hay, G. Gard, D. Nevell, N. Pavlakis, S. Valmadre, B. Young, C Camaris, R. Crouch, L. Edwards, N. Hacker, D. Marsden, G. Robertson, P. Beale, J. Beith, J. Carter, C. Dalrymple, R. Houghton, P. Russell, L. Anderson, M. Links, J. Grygiel, J. Hill, A. Brand, K. Byth, R. Jaworski, P. Harnett, R. Sharma,.G Wain, D. Purdie, D. Whiteman, B. Ward, D. Papadimos, A. Crandon, M. Cummings, K. Horwood. A. Obermair, L. Perrin, D. Wyld, J. Nicklin, M. Davy, M. K. Oehler, C. Hall, T. Dodd, T. Healy, K. Pittman, D. Henderson, J. Miller, J. Pierdes, A. Achan, P. Blomfield, D. Challis, R. McIntosh, A. Parker, B. Brown, R. Rome, D. Allen, P. Grant, S. Hyde, R. Laurie, M. Robbie, D. Healy, T. Jobling, T. Manolitsas, J. McNealage, P Rogers, B. Susil, E. Sumithran, I. Simpson, I. Haviv, K. Phillips, D. Rischin, S. Fox, D. Johnson, S. Lade, P. Waring, M. Loughrey, N. O’Callaghan, B. Murray, L. Mileshkin, P. Allan; V. Billson, J. Pyman, D. Neesham, M. Quinn, A. Hamilton, C. Underhill, R. Bell, L. F Ng, R. Blum, V.Ganju, I. Hammond, C. Stewart, Y. Leung, M. Buck, N. Zeps (ACS); G. Peuteman, T. Van Brussel and D. Smeets (BEL); U. Eilber(GER); L. Gacucova (HMO); P. Schürmann, F. Kramer, W. Zheng, T.-W. Park-Simon, K. Beer-Grondke and D. Schmidt (HJO); G.S. Keeney, S. Windebank, C. Hilker and J. Vollenweider (MAY); the state cancer registries of AL, AZ, AR, CA, CO, CT, DE, FL, GA, HI, ID, IL, IN, IA, KY, LA, ME, MD, MA, MI, NE, NH, NJ, NY, NC, ND, OH, OK, OR, PA, RI, SC, TN, TX, VA, WA, and WYL (NHS); L. Paddock, M. King, U. Chandran, A. Samoila, and Y. Bensman (NJO); L. Brinton, M. Sherman, A. Hutchinson, N. Szeszenia- Dabrowska, B. Peplonska, W. Zatonski, A. Soni, P. Chao and M. Stagner (POL);); C. Luccarini, P. Harrington the SEARCH team and ECRIC (SEA); the Scottish Gynaecological Clinical Trails group and SCOTROC1 investigators (SRO); W-H. Chow, Y-T. Gao (SWH); Information about TCGA and the investigators and institutions who constitute the TCGA research network can be found at http://cancergenome.nih.gov/ (TCGA); I. Jacobs, M. Widschwendter, E. Wozniak, N. Balogun, A. Ryan and J. Ford (UKO); Carole Pye (UKR); a full list of the investigators who contributed to the generation of the WTCCC data is available from http://www.wtccc.org.uk/ (WTCCC).
Conflict of Interest Statement: None declared.
Glossary
Abbreviations
- COGS
Collaborative Oncological Gene-environment Study
- EOC
epithelial ovarian cancer
- eQTL
expression quantitative trait locus
- GWAS
genome-wide association studies
- HGSOC
high-grade serous ovarian cancer
- MAF
minor allele frequency
- SNP
single nucleotide polymorphism
- TFBS
transcription factor binding site
References
- 1. American Cancer Society. (2012) Cancer Facts & Figures 2012. American Cancer Society, Atlanta. [Google Scholar]
- 2. Murdoch W.J., et al. (2001) Ovulation-induced DNA damage in ovarian surface epithelial cells of ewes: prospective regulatory mechanisms of repair/survival and apoptosis. Biol. Reprod., 65, 1417–1424. [DOI] [PubMed] [Google Scholar]
- 3. Bahar-Shany K., et al. (2014) Exposure of fallopian tube epithelium to follicular fluid mimics carcinogenic changes in precursor lesions of serous papillary carcinoma. Gynecol. Oncol., 132, 322–327. [DOI] [PubMed] [Google Scholar]
- 4. Schildkraut J.M., et al. (1997) Relationship between lifetime ovulatory cycles and overexpression of mutant p53 in epithelial ovarian cancer. J. Natl. Cancer Inst., 89, 932–938. [DOI] [PubMed] [Google Scholar]
- 5. Moynahan M.E., et al. (1999) Brca1 controls homology-directed DNA repair. Mol. Cell, 4, 511–518. [DOI] [PubMed] [Google Scholar]
- 6. Wooster R, et al. (1995) Identification of the breast cancer susceptibility gene BRCA2. Nature, 378, 789–792. [DOI] [PubMed] [Google Scholar]
- 7. Futreal P.A., et al. (1994) BRCA1 mutations in primary breast and ovarian carcinomas. Science, 266, 120–122. [DOI] [PubMed] [Google Scholar]
- 8. Miki Y., et al. (1994) A strong candidate for the breast and ovarian cancer susceptibility gene BRCA1. Science, 266, 66–71. [DOI] [PubMed] [Google Scholar]
- 9. Song H., et al. (2014) The contribution of deleterious germline mutations in BRCA1, BRCA2 and the mismatch repair genes to ovarian cancer in the population. Hum. Mol. Genet., 23, 4703–4709. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Meindl A., et al. (2010) Germline mutations in breast and ovarian cancer pedigrees establish RAD51C as a human cancer susceptibility gene. Nat. Genet., 42, 410–414. [DOI] [PubMed] [Google Scholar]
- 11. Loveday C., et al. (2011) Germline mutations in RAD51D confer susceptibility to ovarian cancer. Nat. Genet., 43, 879–882. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Rafnar T., et al. (2011) Mutations in BRIP1 confer high risk of ovarian cancer. Nat. Genet., 43, 1104–1107. [DOI] [PubMed] [Google Scholar]
- 13. Song H., et al. (2009) A genome-wide association study identifies a new ovarian cancer susceptibility locus on 9p22.2. Nat. Genet., 41, 996–1000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Goode E.L., et al. (2010) A genome-wide association study identifies susceptibility loci for ovarian cancer at 2q31 and 8q24. Nat. Genet., 42, 874–879. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Bolton K.L., et al. (2010) Common variants at 19p13 are associated with susceptibility to ovarian cancer. Nat. Genet., 42, 880–884. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Pharoah P.D., et al. (2013) GWAS meta-analysis and replication identifies three new susceptibility loci for ovarian cancer. Nat. Genet., 45, 362–370. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Permuth-Wey J., et al. (2013) Identification and molecular characterization of a new ovarian cancer susceptibility locus at 17q21.31. Nat. Commun., 4, 16–27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Bojesen S.E., et al. (2013) Multiple independent variants at the TERT locus are associated with telomere length and risks of breast and ovarian cancer. Nat. Genet., 45, 371–384. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Shen H., et al. (2013) Epigenetic analysis leads to identification of HNF1B as a subtype-specific susceptibility gene for ovarian cancer. Nat. Commun., 4, 16–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Chen K., et al. (2014) Genome-wide association study identifies new susceptibility loci for epithelial ovarian cancer in Han Chinese women. Nat. Commun., 5, 4682. [DOI] [PubMed] [Google Scholar]
- 21. Kuchenbaecker K.B., et al. (2015) Identification of six new susceptibility loci for invasive epithelial ovarian cancer. Nat. Genet., 47, 164–171. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Michailidou K., et al. (2013) Large-scale genotyping identifies 41 new loci associated with breast cancer risk. Nat. Genet., 45, 353–361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Lee J.S., et al. (2000) hCds1-mediated phosphorylation of BRCA1 regulates the DNA damage response. Nature, 404, 201–204. [DOI] [PubMed] [Google Scholar]
- 24. Consortium C.B.C.C.-C. (2004) CHEK2*1100delC and susceptibility to breast cancer: a collaborative analysis involving 10,860 breast cancer cases and 9,065 controls from 10 studies. Am. J. Hum. Genet., 74, 1175–1182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Walsh T., et al. (2011) Mutations in 12 genes for inherited ovarian, fallopian tube, and peritoneal carcinoma identified by massively parallel sequencing. Proc. Natl. Acad. Sci. USA, 108, 18032–18037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Schildkraut J.M., et al. (2010) Association between DNA damage response and repair genes and risk of invasive serous ovarian cancer. PLoS One, 5, e10061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Wood R.D., et al. (2005) Human DNA repair genes, 2005. Mutat. Res., 577, 275–283. [DOI] [PubMed] [Google Scholar]
- 28. Howie B.N., et al. (2009) A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet., 5, e1000529. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Westra H.J., et al. (2013) Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat. Genet., 45, 1238–1243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Karst A.M., et al. (2012) Primary culture and immortalization of human fallopian tube secretory epithelial cells. Nat. Protoc., 7, 1755–1764. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. DelloRusso C., et al. (2007) Functional characterization of a novel BRCA1-null ovarian cancer cell line in response to ionizing radiation. Mol. Cancer Res., 5, 35–45. [DOI] [PubMed] [Google Scholar]
- 32. Hazelett D.J., et al. (2014) Comprehensive functional annotation of 77 prostate cancer risk loci. PLoS Genet., 10, e1004–102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Lawrenson K., et al. (2009) In vitro three-dimensional modelling of human ovarian surface epithelial cells. Cell Prolif., 42, 385–393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Li N.F., et al. (2004) A modified medium that significantly improves the growth of human normal ovarian surface epithelial (OSE) cells in vitro . Lab. Invest., 84, 923–931. [DOI] [PubMed] [Google Scholar]
- 35. Li Q., et al. (2013) Integrative eQTL-based analyses reveal the biology of breast cancer risk loci. Cell, 152, 633–641. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Wang J., et al. (2013) Factorbook.org: a Wiki-based database for transcription factor-binding data generated by the ENCODE consortium. Nucleic Acids Res., 41, D171–D176. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Coetzee S.G., et al. (2015) Cell type specific enrichment of risk associated regulatory elements at ovarian cancer susceptibility loci. Hum. Mol. Genet, 24, 3595–3607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Kulakovskiy I.V., et al. (2013) HOCOMOCO: a comprehensive collection of human transcription factor binding sites models. Nucleic Acids Res., 41, D195–D202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Izumiyama T., et al. (2012) A novel big protein TPRBK possessing 25 units of TPR motif is essential for the progress of mitosis and cytokinesis. Gene, 511, 202–217. [DOI] [PubMed] [Google Scholar]
- 40. Uhrigshardt H., et al. (2010) Characterization of the human HSC20, an unusual DnaJ type III protein, involved in iron-sulfur cluster biogenesis. Hum. Mol. Genet., 19, 3816–3834. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Hao H.X., et al. (2012) ZNRF3 promotes Wnt receptor turnover in an R-spondin-sensitive manner. Nature, 485, 195–200. [DOI] [PubMed] [Google Scholar]
- 42. Forbes S.A., et al. (2014) COSMIC: exploring the world’s knowledge of somatic mutations in human cancer. Nucleic Acids Res, 43, D805–D811. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Lawrenson K., et al. (2011) Modelling genetic and clinical heterogeneity in epithelial ovarian cancers. Carcinogenesis, 32, 1540–1549. [DOI] [PubMed] [Google Scholar]
- 44. Eeles R.A., et al. (2013) Identification of 23 new prostate cancer susceptibility loci using the iCOGS custom genotyping array. Nat. Genet., 45, 385–391. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Deloukas P., et al. (2013) Large-scale association analysis identifies new risk loci for coronary artery disease. Nat. Genet., 45, 25–33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Cheng J.C., et al. (2013) Egr-1 mediates epidermal growth factor-induced downregulation of E-cadherin expression via Slug in human ovarian cancer cells. Oncogene, 32, 1041–1049. [DOI] [PubMed] [Google Scholar]
- 47. Hohenstein P., et al. (2006) The many facets of the Wilms' tumour gene, WT1. Hum. Mol. Genet., 15, R196–R201. [DOI] [PubMed] [Google Scholar]
- 48. Network C.G.A.R. (2011) Integrated genomic analyses of ovarian carcinoma. Nature, 474, 609–615. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Mullan P.B., et al. (2006) The role of BRCA1 in transcriptional regulation and cell cycle control. Oncogene, 25, 5854–5863. [DOI] [PubMed] [Google Scholar]
- 50. Le Calvez-Kelm F., et al. (2011) Rare, evolutionarily unlikely missense substitutions in CHEK2 contribute to breast cancer susceptibility: results from a breast cancer family registry case-control mutation-screening study. Breast Cancer Res., 13, R6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Pomerantz M.M., et al. (2010) Analysis of the 10q11 cancer risk locus implicates MSMB and NCOA4 in human prostate tumorigenesis. PLoS Genet., 6, e1001204. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.