Skip to main content
The Journal of Clinical Endocrinology and Metabolism logoLink to The Journal of Clinical Endocrinology and Metabolism
. 2019 Apr 30;104(9):3835–3850. doi: 10.1210/jc.2018-02496

Family-Based Quantitative Trait Meta-Analysis Implicates Rare Noncoding Variants in DENND1A in Polycystic Ovary Syndrome

Matthew Dapas 1, Ryan Sisk 1, Richard S Legro 2, Margrit Urbanek 1,3,4, Andrea Dunaif 5,2,, M Geoffrey Hayes 1,3,6,2,
PMCID: PMC6660913  PMID: 31038695

Abstract

Context

Polycystic ovary syndrome (PCOS) is among the most common endocrine disorders of premenopausal women, affecting 5% to15% of this population depending on the diagnostic criteria applied. It is characterized by hyperandrogenism, ovulatory dysfunction, and polycystic ovarian morphology. PCOS is highly heritable, but only a small proportion of this heritability can be accounted for by the common genetic susceptibility variants identified to date.

Objective

The objective of this study was to test whether rare genetic variants contribute to PCOS pathogenesis.

Design, Patients, and Methods

We performed whole-genome sequencing on DNA from 261 individuals from 62 families with one or more daughters with PCOS. We tested for associations of rare variants with PCOS and its concomitant hormonal traits using a quantitative trait meta-analysis.

Results

We found rare variants in DENND1A (P = 5.31 × 10−5, adjusted P = 0.039) that were significantly associated with reproductive and metabolic traits in PCOS families.

Conclusions

Common variants in DENND1A have previously been associated with PCOS diagnosis in genome-wide association studies. Subsequent studies indicated that DENND1A is an important regulator of human ovarian androgen biosynthesis. Our findings provide additional evidence that DENND1A plays a central role in PCOS and suggest that rare noncoding variants contribute to disease pathogenesis.


Whole-genome sequencing analysis finds rare variants in associated with altered hormone levels in families affected by PCOS.


Polycystic ovary syndrome (PCOS) is a common endocrine disorder affecting 5% to 15% of premenopausal women worldwide (1), depending on the diagnostic criteria applied. PCOS is diagnosed by two or more of its reproductive features of hyperandrogenism, ovulatory dysfunction, and polycystic ovarian morphology. It is frequently associated with insulin resistance and pancreatic β-cell dysfunction, making it a leading risk factor for type 2 diabetes in young women (2).

PCOS is a highly heritable complex genetic disorder. Analogous to other complex traits (3), common susceptibility loci identified in genome-wide association studies (GWASs) (4–8) account for only a small proportion of the estimated genetic heritability of PCOS (9). As GWASs were designed to assess common allelic variants, usually with minor allele frequencies (MAFs) ≥2% to 5%, it has been proposed that less frequently occurring variants with greater effect sizes account for the observed deficit in heritability (10). Next-generation sequencing approaches have identified rare variants that contribute to complex disease pathogenesis (11–18).

We tested the hypothesis that rare variants contribute to PCOS by conducting family-based association analyses using whole-genome sequencing data. We filtered and weighted rare variants (MAF ≤2%) according to their predicted levels of deleteriousness and grouped them regionally and by genes. We not only tested for associations with PCOS diagnosis, but we also tested for association with its quantitative reproductive and metabolic trait levels: testosterone (T), dehydroepiandrosterone sulfate (DHEAS), LH, FSH, SHBG, and fasting insulin. The quantitative trait phenotypes were combined using a meta-analysis approach to identify sets of rare variants that associate with altered hormonal levels in PCOS.

Materials and Methods

Subjects

This study included 261 individuals from 62 families with PCOS who were whites of European ancestry. Families were ascertained by an index case who fulfilled the National Institutes of Health criteria for PCOS (19). Each family consisted of at least a proband–parent trio. Brothers were not included in this study. Phenotypic data and some genetic analyses on these subjects have been previously reported (20–22). The study was approved by the Institutional Review Boards of Northwestern University Feinberg School of Medicine, Penn State Health Milton S. Hershey Medical Center, and Brigham and Women’s Hospital. Written informed consent was obtained from all subjects prior to the study.

Women were ages 14 to 63 years, in good health, and had both ovaries and a uterus. They were not taking medications known to alter reproductive or metabolic hormone levels for at least 1 month prior to the study. Exogenous gonadal steroid administration was discontinued at least 3 months prior to the study. Thyroid, pituitary, and adrenal disorders were excluded by appropriate tests (23). Women were considered to be of reproductive age when they were between the ages of at least 2 years postmenarche and 45 years old, and had FSH levels ≤40 mIU/mL. Hyperandrogenemia was defined by elevated levels of T (>58 ng/dL), non-SHBG bound T (>15 ng/dL), and/or DHEAS (>2683 ng/mL), as established in Legro et al. (23). Ovarian dysfunction was defined as six or fewer menses per year. Ovarian morphology was not assessed because it does not correlate with the endocrine phenotype (6, 24).

Phenotypes in reproductive-age women were assigned as previously reported (23). Reproductive-age women with hyperandrogenemia and ovarian dysfunction were assigned a PCOS phenotype. Reproductive-age women with normal androgen levels and regular menses (every 27 to 35 days) were assigned an unaffected phenotype. Reproductive-age women with hyperandrogenemia and regular menses were assigned a hyperandrogenemic (HA) phenotype. Because androgen levels do not decrease during the menopausal transition (25, 26), women between 46 and 63 years of age with hyperandrogenemia were also assigned the HA phenotype, regardless of menstrual cycle pattern. One index case fulfilled the criteria for PCOS when she was 45 years of age but she was 46 years of age when enrolled in the study. She was confirmed to have persistent hyperandrogenemia and ovarian dysfunction. Women with either the PCOS or HA phenotype were considered affected, as we have done in our previous linkage (24) and family-based association testing (27) studies. In the current study, we also included HA women between 46 and 63 years of age as affected. Women with normal androgen levels who were not of reproductive age and all fathers in the study were not assigned a phenotype.

The quantitative trait analysis examined associations between rare variants and T, DHEAS, SHBG, LH, FSH, and insulin levels. In addition to the women included in the dichotomous trait analysis, women were included in the quantitative trait association testing as follows. Women 46 to 72 years old were included in the analyses for T and DHEAS because androgen levels do not change during the menopausal transition (25, 26). These women were also included in the analysis of SHBG and insulin. Women with bilateral oophorectomy (n = 10) not receiving postmenopausal hormone therapy were included in the analysis of the adrenal androgen, DHEAS, and insulin (28, 29). We compared hormonal traits in women receiving postmenopausal hormone therapy (n = 15) to women from the cohort of comparable age who were not receiving hormonal therapy (n = 10). Only SHBG levels differed significantly. Therefore, women receiving postmenopausal hormone therapy were included in the analyses of T and DHEAS. They were not included in the insulin analysis because of the effect of estrogen on circulating insulin levels (30). No additional women were included in the LH or FSH analyses.

Fathers were included in the insulin level association test because there are not sex differences in this parameter (31). Subjects receiving glucocorticoids (men, 0; women, 2) were excluded from all quantitative trait association tests. Subjects receiving antidiabetic medications (men, 6; women, 7) were excluded from the T, SHBG, and insulin analyses but were included in the DHEAS analysis (32). Subjects with type 2 diabetes not receiving medications (men, 3; women, 7) were excluded from the insulin analysis, but the women were included in the T, DHEAS, and SHBG analyses.

Reference ranges for hormonal parameters from concurrently studied reproductively normal control subjects of comparable age, sex, body mass index (BMI), and ancestry (6, 33, 34) are included in Table 1. All control subjects had normal glucose tolerance with a 75 g oral glucose tolerance test (35). Reproductive-age control women were 18 to 45 years old with regular menses, FSH <40 mIU/mL, and normal androgen levels. Older control women were 46 to 65 years of age with a history of regular menses and normal androgen levels. Control men were 46 to 65 years of age. The hormonal data from control subjects in Table 1 were provided to depict normal reference ranges and were not used for any statistical comparisons.

Table 1.

Clinical and Biochemical Characteristics of Study Population

Subject Type N Age (y)a N BMI (kg/m2) N Testosterone (ng/dL) N SHBG (nmol/L) N DHEAS (ng/mL) N Insulin (μU/mL) N LH (mIU/mL) N FSH (mIU/mL)
Index cases 62 29 (19–48) 62 35.6 (27.9–42.7) 62 74 (65–90) 59 56 (36–92) 61 2095 (1638–2710) 62 21 (14–29) 59 14 (7–20) 59 9 (8–11)
Fathers 62 58 (43–85) 61 28.9 (27.0–32.2) 49 17 (11–26)
Mothers
 HA 6 51 (40–63) 6 34.5 (25.2–38.2) 5 51 (49–72) 5 117 (105–135) 6 2630 (1264–2694) 3 19 (15–22)
 Unaffected 2 44 (42–45) 2 24.0 (20.6–27.4) 2 32 (27–37) 2 280 (218–342) 2 1081 (901–1260) 2 12 (8–15) 2 7 (6–7) 2 8 (6–10)
 Age >45 y 54 57 (46–72) 53 28.6 (26.6–35.5) 36 26 (16–34) 25 93 (74–166) 50 645 (501–993) 28 19 (11–25)
Sisters
 PCOS 10 29 (19–36) 10 32.3 (29.0–37.3) 10 63 (49–74) 10 55 (45–104) 10 1665 (807–2774) 10 28 (23–32) 10 10 (5–15) 10 11 (10–12)
 HA 5 32 (15–40) 5 22.3 (22.3–28.5) 5 67 (54–68) 5 126 (69–177) 5 2047 (1509–3775) 5 15 (11–17) 4 4 (3–18) 4 10 (7–14)
 Unaffected 57 32 (14–45) 57 25.1 (22.2–29.3) 57 30 (23–35) 55 128 (90–181) 57 1408 (1028–1814) 55 12 (9–15) 55 5 (3–10) 55 10 (7–12)
 Age >45 y 3 47 (46–49) 3 26.4 (24.5–32.1) 3 30 (20–48) 3 137 (88–150) 3 772 (765–1588) 3 13 (12–15)
Reference ranges
 Reproductive–aged women 346 30 (25–35) 346 28.5 (23.0–35.4) 227 29 (21–37) 188 100 (70–144) 226 1357 (1018–1756) 185 12 (10–16) 173 4 (3–8) 173 9 (7–12)
 Men 55 53 (50–57) 55 28.0 (25.9–31.0) 46 14 (10–17)
 Older women (46 to 65 y) 69 56 (52–60) 69 26.4 (23.2–29.6) 69 23 (15–30) 60 102 (75–156) 66 645 (487–1016) 69 13 (10–17) 48 50 (35–69) 49 80 (57–100)

Traits reported as count, median (25th to 75th percentiles). Values reported here are unadjusted. Trait values were adjusted and normalized for variant association testing.

a

Age reported as count, median (minimum to maximum).

Hormone assays

T, DHEAS, SHBG, LH, FSH, and insulin levels were measured as previously reported (6).

Whole-genome sequencing

Genomic DNA was isolated from whole blood samples using the Gentra Puregene Blood Kit (Qiagen, Valencia, CA). Whole-genome sequencing was performed by Complete Genomics, Inc. (CGI) using their proprietary sequencing technology. Their sequencing platform used high-density nanoarrays populated with amplified DNA clusters called DNA nanoballs. DNA was read using a novel, iterative hybridization and ligation approach, which produces paired-end reads up to 70 bases in length (36). CGI’s sequencing service included sample quality control, library construction, whole-genome DNA sequencing, and variant calling. Reads were mapped to the National Center for Biotechnology Information Build 37.2 reference genome (GRCh37).

Variant calling

Variant calling was performed using CGI’s Assembly Pipeline version 2.0. Although raw read data were provided, because of the unique gapped read structure produced by CGI sequencing, the use of other mapping or variant-call software was not recommended by CGI. Results were provided in CGI’s unique variant-call format. Variants included single nucleotide variants (SNVs), as well as insertions, deletions, and substitutions ≤50 bases in length. Calls were assigned confidence scores assuming equal allele fractions for the diploid genome (varScoreEAF). A Bayesian probability model was used to evaluate potential locus calls. The model accounted for read depth, base call quality scores, mapping/alignment probabilities, and empirical priors on gap sizes and discordance rates (37). Based on the relative allele likelihoods a quality score was assigned for the chosen call:

varScore=10log10(P(call is true)P(call is false)). (1)

Scores were therefore reported in decibels, where a score of 10 dB represents a likelihood ratio of 10:1, 20 dB means a 100:1 likelihood, 30 dB means a 1000:1 likelihood, and so forth. Quality thresholds for reporting variants were minimized (≥10 dB for homozygous and ≥20 dB for heterozygous variant calls) to maximize sensitivity. Variants were assigned a basic high/low quality flag (VQHIGH or VQLOW) based on a quality score threshold of 20 dB for homozygous and 40 dB for heterozygous variant calls. CGI variant calls were validated previously using Sanger sequencing (38).

Called variant filtering

Variants were considered rare when they appeared with a frequency ≤2% in the Scripps Wellderly Genome Resource, which consisted of 597 unrelated participants of European ancestry from the Scripps Wellderly Study (39). The Wellderly study population is composed entirely of elderly individuals ≥80 years of age with no history of chronic disease. Within each sequenced family, reported variants that were inconsistent with Mendelian patterns of inheritance were removed from consideration. A variant was considered consistent with Mendelian inheritance when it was called (VQHIGH) in one or more of the offspring and in at least one parent. The vast majority of DNA sequencing errors can be eliminated using Mendelian inheritance analysis (40). As previously described (41–44), an additional set of filters was applied to called variants for each sample to minimize the number of false-positive calls: (i) variants with VQLOW allele tags were removed; (ii) variants in microsatellite regions were removed; (iii) variants within simple tandem repeat regions were removed; (iv) three or more single-nucleotide polymorphisms (SNPs) clustered within a distance of 10 bp were removed; (v) SNPs located within 10 bp of an insertion or deletion (indel) were removed; (vi) calls located within known regions of segmental duplication were removed; and (vii) calls with an observed read depth greater than threefold the average read depth (>168) were removed.

Selection of optimal read depth and quality score thresholds

After systematically applying the filter matrix outlined above, optimal read depth and quality score thresholds were determined for each variant type by comparing calls between replicated samples for a particular family that was sequenced twice by CGI. Variants that were concordant between offspring sample pairs—above a given coverage depth and quality score threshold—were considered as true positive (TP) calls, whereas those that were discordant between replicates were considered as false positive (FP). Accordingly, concordant variant calls that fell below a given depth and quality threshold were classified as false negative (FN), and discordant calls that fell below a given depth and quality threshold were classified as true negative (TN). Optimal depth and quality score thresholds were determined by selecting the thresholds that yielded the greatest Matthews correlation coefficient (MCC) values across each variant type (41):

MCC= TP×TNFP×FN(TP+FP)(FP+FN)(TN+FP)(TN+FN), (2)

The thresholds chosen for association testing corresponded with the greatest MCC values for each variant type (45).

Assessing deleteriousness

After filtering for rare variants called with high confidence, as described above, variants were further characterized according to their predicted effects. Only variants that were predicted to have a deleterious effect were included in the association testing. Deleteriousness was primarily assessed using the Combined Annotation Dependent Depletion (CADD) tool (46), which is trained on output from numerous annotation programs to predict deleteriousness based on conservation relative to our ancestral genome. Variants were retained when they produced a CADD score greater than the gene-specific mutation significance cutoff (MSC; 95% CI) suggested by Itan et al. (47). For variants outside of gene regions or in inconsistently annotated genes, the mean MSC (MSCμ = 12.889) was used as the cutoff. In a further effort to reduce false positives, only coding variants classified as at least possibly damaging by the PolyPhen2 (48) or SIFT (49) variant effect prediction tools and noncoding variants with LINSIGHT (50) scores ≥0.8 were included in our analysis, as integrating individual methods can improve variant effect prediction (47, 51–53). CADD and LINSIGHT are both primarily based on evolutionary conservation, and they therefore carry certain limitations (54). However, the applicability of prediction tools based on functional annotations from specific cell types (55, 56) is extremely limited for PCOS because its pathophysiology involves numerous cell types across multiple organs (57), and annotations for relevant cell types are largely unavailable (58–61).

Genome-wide rare variant association testing

Variants were then grouped for association testing using both gene-based and sliding window approaches. Gene regions included all coding and noncoding DNA from the 3′ untranslated region to 7.5 kb upstream of the 5′ transcriptional start site for all RefSeq annotated genes. The sliding window approach included all noncoding variation contained within sequential windows across the genome using three different window sizes: 10-kb windows with no overlap, 25-kb windows with 12.5-kb overlap, and 100-kb windows with 75-kb overlap.

In rare variant association testing, genes are often filtered according to a minimum number of rare variants detected per gene (62–65) or a minimum cumulative variant frequency (CVF) per gene (66–69) to increase power to detect disease associations. In this study, genes and windows were removed from consideration when they did not harbor deleterious variants in at least 10% of affected subjects (70, 71). For the gene-based test, to reduce the resultant bias toward larger genes, the observed CVFs were adjusted for gene length. Coding and noncoding CVFs were modeled separately using linear regressions against the coding sequence length (72, 73) and the square root of the noncoding sequence length (74, 75), respectively. The models also accounted for gene-level guanine-cytosine content. The CVFs observed in affected subjects for each gene were then adjusted accordingly prior to applying the 10% threshold. Adjusting for gene length in this manner reduced the risk for erroneous associations, as genes with longer open reading frames are more likely to be reported falsely as disease-associated (76).

Sequence kernel association test statistics were computed using the methods described in Schaid et al. (77), which account for pedigree information in calculating trait associations. Custom variant weights were applied for each variant according to their CADD Phred scores:

wv= CvMSCgmaxv,gG(CvMSCg). (3)

where Cv is the CADD Phred score for variant v, g is the gene or window, MSCg is the mutation significance threshold for g, and G is the set of all variants within the genome. In this way, variants that are more likely to be deleterious were weighted more heavily than variants where the functional consequences are less likely to be damaging. For the dichotomous trait analysis, the binary outcome was adjusted for age and BMI using a logistic regression.

Six quantitative traits, that is, T, DHEAS, LH, FSH, SHBG, and insulin levels, were tested for association. The trait distributions were each positively skewed. Testing against skewed trait distributions can result in heavily inflated type I error rates in rare variant association testing (78). Therefore, each trait was modeled against a gamma distribution when adjusting for age and BMI, and nonnormal residuals (Shapiro–Wilk <0.05) were then further normalized using a rank-based inverse normal transformation (INT). The INT has been found to be the optimal method for maintaining type I error control without sacrificing power in rare variant association testing on nonnormally distributed traits (79).

Quantitative trait meta-analysis

For complex diseases with multivariate phenotypes, combining multiple related phenotypic traits into one analysis can increase power in finding disease associations (80, 81). We combined the six aforementioned quantitative trait associations into one meta-statistic using Fisher’s combination function modified to account for correlated traits (82). Intertrait correlations were determined using the Spearman correlation coefficient (45). P values were adjusted using a conservative Bonferroni correction according to the number of variant groupings that were tested that contained at least one variant across both the dichotomous and quantitative trait associations, as well as by the genomic inflation factor, λ. Nominal P values are reported herein as Pn, and corrected P values are reported as Pc. Correlation between meta-analysis association results and dichotomous trait association results was calculated using a Spearman coefficient and tested using a two-tailed t test.

The characteristic disturbance of gonadotropin secretion associated with PCOS is increased LH relative to FSH release (83). For genes with significant meta-analysis associations, LH/FSH ratios were compared between variant carriers and noncarriers using a Wilcoxon rank sum test, adjusted for multiple testing (Bonferroni). Differences in LH/FSH ratios between variant carriers and noncarriers would indicate that the gene variants alter gonadotropin signaling.

Linkage disequilibrium between rare variants and GWAS risk alleles

To evaluate whether rare variants identified in the association tests could account for GWAS association signals in the same gene, we calculated linkage disequilibrium (LD) between each of the rare variants and each of the corresponding GWAS risk alleles. Alleles were phased utilizing pedigree information. Instances in which the allelic phasing could not be resolved between the rare variants and GWAS variants were removed from consideration.

In silico binding effect prediction

To assess the potential functional effects of noncoding variants identified in the quantitative trait meta-analysis, we predicted the corresponding impacts to transcription factor (TF) and RNA-binding protein (RBP) binding in silico. For each noncoding SNV identified in the quantitative trait meta-analysis, TF binding affinities were calculated for all subsequences overlapping the SNV position on each strand within a ±20-bp window. Binding affinity scores were calculated using position weight matrices derived from ENCODE chromatin immunoprecipitation sequencing experiments (76). Scores were determined by summing the logged frequencies for a given sequence across a motif position weight matrix. Binding P values were defined as the probability that a sequence sampled from a genomic background distribution had an affinity score greater than or equal to the largest affinity score produced from one of the tested subsequences. Genomic background sequences were generated using a first-order Markov model (84). The significance of a given change in binding affinity scores between reference and SNV alleles was assessed by determining whether the differences in relative binding affinity rank between the two alleles was significantly different than what would be expected by chance (85, 86). P values were conservatively adjusted to account for multiple testing using the Benjamini–Hochberg procedure (87).

Once binding affinities were calculated for each TF at each SNV, filters were applied to identify the most likely candidates for TF binding site disruption. Instances in which the predicted TF binding affinity score was <80% of the maximum affinity score for the given motif were excluded. SNVs in which the reference allele was not predicted to bind a particular TF with statistical significance (Benjamini–Hochberg P < 0.05) were also removed from consideration, as were variants in which both the reference and SNV alleles were predicted to bind a TF with statistical significance. Only TFs expressed in the ovary were analyzed. Tissue-specific gene expression was determined using GTEx data (88) (median reads per kilobase of transcript per million mapped reads ≥0.1).

The identified SNVs were likewise analyzed for potential alteration to RBP sites following the same procedure but for a few modifications. RBPs and their binding affinity scores were determined using the ATtTRACT database (89). Only sequences on the coding strand were evaluated as to reflect the mRNA sequences. Additionally, significant changes in RBP binding were considered regardless of the direction of effect, such that instances in which the alternate allele was predicted to induce RBP binding were also included.

Supplemental methods

For additional details regarding methodological considerations and rationale, please refer to the online repository (45).

Results

Characteristics of study population

The characteristics of the study population, including counts and trait distributions by familial relationship and the numbers of subjects included in each association test, are summarized in Table 1.

Whole-genome sequencing and variant calling

Genome sequencing yielded average genome coverage of 96.2% per sample and an overall mean sequencing depth of 56× (45). On average, 90.4% and 66.4% of the genome, including 95.2% and 78.8% of the exome, was covered with at least 20× and 40× sequencing depth, respectively. Approximately 4.04 million high-confidence small variant calls were reported per genome. By applying optimal read depth and quality thresholds as well as a series of genomic filters (41), we reduced the discrepancy rate between replicate samples from 0.23% to 0.04% for rare variants (45).

Association testing and quantitative trait meta-analysis

We found 339 genes that had rare, deleterious variants in at least 10% of cases after adjusting for gene length and percentage guanine-cytosine content. No set of rare variants reached genome-wide significance for association with PCOS/HA disease status in the dichotomous trait analysis. We found 32 rare variants (2 coding, 30 noncoding) in the DENND1A gene that were collectively significantly associated with quantitative trait levels (Pn = 5.31×10−5, Pc = 0.039; Table 2), after adjusting for multiple testing and for observed genomic inflation (45). Women with one or more of these DENND1A variants had significantly higher LH/FSH ratios (Pc = 0.0036). PCOS/HA phenotype women with one or more DENND1A variants had significantly higher LH/FSH ratios than did PCOS/HA phenotype women without DENND1A variants (Pc = 0.0180). Unaffected women with one or more DENND1A variants also trended toward higher LH/FSH ratios than unaffected women without DENND1A variants (Pc = 0.1758; Fig. 1). No other gene-based set of rare variants reached genome-wide significance for association with quantitative trait levels. The correlation between the meta-analysis gene associations and the dichotomous trait gene associations was 0.24 (Spearman; P = 5.12 × 10−6).

Table 2.

Top Five Rare Variant Associations From Quantitative Trait Meta-Analysis

Chromosome Gene Length Variants Familiesa ORb P n P c
9 DENND1A 550 kb 32 50% 1.20 (0.69–2.14) 5.31 × 10−5 0.039
11 PKNOX2 269 kb 18 39% 1.55 (0.87–2.94) 7.28 × 10−4 0.53
6 BMP6 155 kb 11 35% 1.41 (0.65–3.27) 3.98 × 10−3 1.00
9 C9orf3 421 kb 10 21% 2.79 (1.08–8.24) 6.14 × 10−3 1.00
1 PRDM2 125 kb 8 19% 3.26 (0.77–22.51) 6.92 × 10−3 1.00

Rare variants exceed thresholds of predicted deleteriousness.

a

Families include all that have daughters with at least one of the rare variants.

b

ORs shown with 95% CIs, estimated from logistic regression of gene variant count against disease status, with equal variant weighting, adjusted for BMI, not considering relatedness (individual variants may have larger effects).

Figure 1.

Figure 1.

LH/FSH ratios in DENND1A variant carriers. LH/FSH ratios in DENND1A rare variant carriers (+) and noncarriers (−) in unaffected women and in women with PCOS/HA. Differences in group means were analyzed using a Wilcoxon rank sum test (*Pc < 0.05). The significance bars illustrate significant differences in LH/FSH ratios between (i) all DENND1A variant carriers and all noncarriers and (ii) DENND1A variant carriers and noncarriers in affected subjects only. Note that PCOS includes HA subjects. Unaff, unaffected.

Using the sliding windows approach, we found a subset of noncoding variants within a 25-kb region of DENND1A (chr9:126,537,500 to 126,562,500) that were significantly associated with altered quantitative trait levels (Pn = 1.92 × 10−5, Pc = 0.019) (45). The region included three noncoding variants that were collectively present in eight subjects with PCOS/HA but in no unaffected subjects. Notably, one of these rare variants, rs117893097 (MAFWellderly = 0.013), was homozygous in one of the subjects. This 25-kb region encompasses one of the DENND1A GWAS risk variants [rs10986105; MAFWellderly = 0.034; ORMeta=1.39 (90)], although all carriers of the rare variants in the region lacked the rs10986105 risk allele. The relative positions of all of the rare variants found in DENND1A are shown in Fig. 2 (91–93). No other windows across the genome were found to have significant associations with disease state or hormonal levels.

Figure 2.

Figure 2.

Rare variants in DENND1A. The locations of deleterious rare variants and previously reported GWAS SNPs within the DENND1A gene, including the two primary isoforms DENND1A.V1 and DENND1A.V2, are shown. The 25-kb region significantly associated with altered hormone levels is highlighted in light blue. The conservation track was measured on multiple alignments of 100 vertebrate species by phyloP (91). The H3K4Me1 track shows enrichment of monomethylation of lysine 4 of the H3 histone protein, which is associated with enhancers and DNA regions downstream of transcription starts, as determined by chromatin immunoprecipitation sequencing assay and layered by different cell types (58). The DNase clusters track shows regions of DNase hypersensitivity, an indicator of regulatory activity, with darkness proportional to maximum signal strength (58). The GTEx eQTL track displays gene expression quantitative trait loci for DENND1A, as identified from GTEx RNA sequencing and genotype data, with red and blue indicating positive and negative effects on gene expression, respectively (92). The linkage disequilibrium heat map was generated using phase 1 CEU data from the 1000 Genomes Project (93).

Several other PCOS GWAS candidate genes appeared in our filtered set of genes, including C9orf3 (5, 6, 8) (Pn = 6.14 × 10−3), HMGA2 (5) (Pn = 0.062), ZBTB16 (8) (Pn = 0.20), TOX3 (5, 8) (Pn = 0.22), and THADA (4, 5, 7, 8) (Pn = 0.74). C9orf3 had the fourth strongest association overall (45), but it failed to reach genome-wide significance after correction for multiple testing. The relative quantitative trait associations for these genes are illustrated in Fig. 3.

Figure 3.

Figure 3.

Trait associations for PCOS GWAS genes. The relative quantitative trait associations are shown for PCOS GWAS susceptibility loci included in meta-analysis results, with meta-analysis association ranking. I, fasting insulin.

Linkage disequilibrium with GWAS variants

On average, the 32 rare variants identified in DENND1A were in low LD with each of the previously identified DENND1A GWAS risk alleles: rs2479106 (MAF = 0.25, Dμ = 0.28, r2μ = 0.0077), rs10818854 (MAF = 0.06, Dμ = 0.035, r2μ = 0.0024), and rs10986105 (MAF = 0.05, Dμ = 0.031, r2μ = 0.0026), although individually, some were in high LD with rs2479106 (D′ = 1). Pairwise LD values are provided in Table 3.

Table 3.

Deleterious, Rare Variants in DENND1A

Position ID Variant Allele Frequency CADD LINSIGHT TF − (Disrupted) RBP − (Disrupted) RBP + (Enhanced) GWAS Allelic LD (D′)
Affa Unrelb Popc rs2479106 rs10818854 rs10986105
9:126,144,390d rs189947178 G → T 0.018 0.020 0.003 11.91 N/A N/A <0.01 0.13 <0.01
9:126,154,100 rs147370674 C → T 0.006 0.004 0.004 21.10 0.97 XPO5 <0.01 <0.01 <0.01
9:126,154,582 rs529224231 T → G 0.006 0.004 0.000 21.30 0.97 MAFK, MAFB, NRL IFIH1, XPO5 N/A N/A N/A
9:126,169,258 rs561100869 C → T 0.000 0.004 0.000 16.94 0.97 IRF1 NOVA1 <0.01 <0.01 <0.01
9:126,180,758 rs750425892 G → C 0.000 0.004 0.000 19.31 0.96 STAT6, MZF1, SPI1 PCBP2 <0.01 <0.01 <0.01
9:126,203,546 T → C 0.006 0.004 0.000 20.50 0.93 FOXM1 RBMX <0.01 <0.01 <0.01
9:126,225,586 rs538451690 T → A 0.006 0.004 0.002 16.07 0.94 RC3H1, XPO5 1 <0.01 <0.01
9:126,231,902 C → T 0.006 0.004 0.000 20.20 0.97 <0.01 <0.01 <0.01
9:126,247,645 rs543947590 C → A 0.006 0.004 0.003 21.00 0.97 STAT5A OAS1 1 <0.01 <0.01
9:126,267,980 rs558809288 C → T 0.006 0.004 0.000 19.97 0.84 ARNT <0.01 <0.01 <0.01
9:126,284,213 rs149244424 C → T 0.012 0.004 0.008 12.44 0.92 ESRRA, ELF1 <0.01 <0.01 <0.01
9:126,312,990 rs748274474 A → G 0.006 0.004 0.003 21.10 0.96 YBX1 <0.01 <0.01 <0.01
9:126,313,234 C → A 0.006 0.004 0.000 17.84 0.97 SRSF2 <0.01 <0.01 <0.01
9:126,326,081 rs138249397 C → T 0.018 0.016 0.002 9.32 0.88 OAS1 SRP68 0.73 <0.01 <0.01
9:126,331,427 rs184609118 A → C 0.000 0.004 0.002 18.48 0.97 ELAVL1, SSB CELF2, NOVA1 1 <0.01 <0.01
9:126,370,689 rs564042790 C → T 0.006 0.004 0.003 16.84 0.89 SMAD3 IFIH1 YBX1 <0.01 <0.01 <0.01
9:126,402,348 rs182167487 C → T 0.000 0.004 0.000 14.18 0.93 <0.01 <0.01 <0.01
9:126,404,312 G → A 0.006 0.004 0.000 15.72 0.91 SRP54, SRP68 PTBP1 1 <0.01 <0.01
9:126,414,365d rs141759269 T → G 0.000 0.004 0.001 26.00 N/A N/A N/A <0.01 <0.01
9:126,440,857 rs147844210 T → C 0.000 0.004 0.008 21.60 0.95 GATA6 CMTR1, FUS, SRSF3, YBX1 <0.01 <0.01 <0.01
9:126,441,904 rs75342773 T → C 0.006 0.004 0.008 16.64 0.95 HNRNPH1–2 TRA2B <0.01 <0.01 <0.01
9:126,447,680 rs117984673 T → A 0.018 0.016 0.008 17.70 0.94 SOX10, SOX17 NOVA2 0.73 <0.01 <0.01
9:126,458,124 rs112188193 G → C 0.012 0.008 0.013 14.43 0.92 STAT4, STAT5A, TEAD1 RC3H1 NUDT21 <0.01 <0.01 <0.01
9:126,480,236 rs543924878 A → C 0.006 0.004 0.004 18.31 0.81 SREBF1 HNRNPH1–3, HNRNPF, KHSRP, SRP14 <0.01 <0.01 <0.01
9:126,513,154 rs147058034 A → G 0.006 0.004 0.003 19.81 0.97 FOXG1, FOXP3, FOXO4, FOXO6, FOXC1, FOXK1 CELF1, XPO5 NOVA1 1 1 1
9:126,549,983 T → C 0.018 0.004 0.000 19.45 0.81 1 <0.01 <0.01
9:126,555,003 rs117893097 C → G 0.030 0.020 0.013 21.40 0.99 HNRNPH1–3, HNRNPF, KHSRP <0.01 <0.01 <0.01
9:126,557,679 rs552299287 A → T 0.006 0.004 0.000 18.00 0.84 STAT5A, STAT6 CELF1–2, RC3H1, XPO5 1 0.84 1
9:126,597,096 T → C 0.006 0.004 0.000 16.64 0.97 SOX5 HNRNPA0, HNRNPA1, HNRNPD, ELAVL1, ZFP36 XPO5 <0.01 <0.01 <0.01
9:126,603,402 rs78012023 TTA → ATG 0.018 0.012 0.000 8.99 0.91 N/A <0.01 <0.01
9:126,625,643 rs116887221 C → A 0.006 0.004 0.008 16.24 0.8 HNRNPH1–3, HNRNPF, KHSRP <0.01 <0.01 <0.01
9:126,691,321 rs79740971 G → A 0.006 0.008 0.010 13.52 0.84 ESRRA <0.01 <0.01 <0.01

Positions correspond to GRCh37. Variant queries were facilitated by Kaviar (94). CADD and LINSIGHT columns show CADD and LINSIGHT deleteriousness prediction scores, accordingly. TF− shows TFs for which binding is predicted to be negatively impacted. RBP− and RBP+ show RBPs for which binding is predicted to be negatively impacted or enhanced, respectively. GWAS allelic LDs show linkage disequilibrium D′ values between the rare variants and each of the previously identified DENND1A GWAS risk alleles.

Abbreviation: N/A, not applicable for coding variant.

a

Affected (aff) cohort allele frequencies represent proportion of alleles with variant in PCOS/HA subjects.

b

Unrelated (unrel) cohort allele frequencies represent proportion of variant alleles in parents.

c

Population (pop) allele frequencies correspond to Wellderly cohort.

d

Coding variant (rs189947178 and rs141759269).

Protein binding effect prediction

Nine of the DENND1A variants were predicted to significantly impact TF binding motifs, whereas most variants were predicted to significantly alter RBP binding motifs (17 disrupted, 14 induced). The specific TF and RBP motifs associated with each noncoding variant are listed in Table 3. Binding by the heterogeneous nuclear ribonucleoprotein family of RBPs appeared to be the most commonly impacted.

Discussion

We identified rare variants in DENND1A that were significantly associated with altered reproductive and metabolic hormone levels in PCOS. Fifty percent of the PCOS families included in this study carried at least one of these rare DENND1A variants. These findings are of considerable interest because common variants in DENND1A were associated with PCOS diagnosis in a GWAS of Han Chinese women (4); these associations were replicated in women of European ancestry (95). Subsequently, McAllister and colleagues (61, 96) discovered that DENND1A plays a key role in androgen biosynthesis in human theca cells and is upregulated in PCOS theca cells. Our study, using an independent family-based WGS analytical approach, further implicates DENND1A in the pathogenesis of PCOS. Additionally, our results provide genetic evidence to support our hypothesis that hyperandrogenemia is a core biologic pathway in PCOS (22, 97). The increased LH/FSH ratios that we observed in rare variant carriers (Fig. 1) also suggest that DENND1A contributes to the disruption of gonadotropin secretion that is a characteristic feature of PCOS (98).

DENND1A encodes two transcripts as the result of alternative splicing, DENND1A.V1 and DENND1A.V2 (61). The encoded V2 protein was found in ovarian theca cells and its abundance was correlated with increased androgen production (96). Forced expression of the V2 transcript produced a PCOS phenotype in normal theca cells, whereas knockdown of V2 in PCOS theca cells reduced thecal androgen biosynthesis (96). The three DENND1A risk variants identified by GWASs (rs2479106, rs10818854, rs10986105) were located in introns, and the functional consequences of these variants are unknown (61). There have been no reported large-scale sequencing studies to fine-map causal variants in DENND1A (90). Targeted (61, 96, 99) and whole-exome sequencing (100) in small PCOS case-control cohorts have failed to identify any variants in DENND1A that were associated with PCOS or with V2 isoform expression. The GWAS risk variants and most of the rare DENND1A variants in the current study lie well upstream (100 to 400 kb) of this splicing region (Fig. 2). Nevertheless, based on the LD we observed between the rare variants and the GWAS risk variants, some of the rare variants (particularly those with D′ = 1) may contribute to the GWAS signal at rs2479106, but a larger cohort would be needed to conduct conditional association analyses.

A number of our rare DENND1A variants were predicted to disrupt conserved TF binding motifs, which could affect gene expression, but most of the variants were predicted to alter affinities of RBPs to the mRNA transcript (Table 3). It is plausible, therefore, that these rare variants were selectively driving V2 expression via posttranscriptional regulation. Collectively, the rare DENND1A variants were found in 50% of families, but each individual variant was typically found in only one or two families. Our findings, therefore, support a model of PCOS in which causal variants are individually uncommon but collectively occur in key genes regulating core biologic pathways contributing to disease pathogenesis, such as androgen biosynthesis (23). Our recent findings (18, 38) of multiple rare variants in the AMH and AMHR2 genes that reduce their biologic activity in ∼7% of women with PCOS is consistent with this model.

Several PCOS GWAS candidate genes besides DENND1A appeared among the top gene associations, but they failed to reach genome-wide significance. These genes included C9orf3 (5, 6, 8), HMGA2 (5), ZBTB16 (8), TOX3 (5, 8), and THADA (4, 5, 7, 8) (Fig. 3). Two additional genes with strong, but not genome-wide significant, associations with PCOS quantitative traits were highly plausible PCOS candidate genes. BMP6 had the third strongest association in our meta-analysis (P = 4.00 × 10−3, Table 2). It is a member of the bone morphogenetic protein family, which are growth factors involved in folliculogenesis. BMP6 expression was previously found to be significantly higher in granulosa cells from women with PCOS compared with reproductively normal control women (101). Moreover, BMP6 was found to increase expression of the FSH receptor, inhibin/activin β subunits, and AMH genes in human granulosa cells (102). PRDM2 had the fifth strongest association in our meta-analysis (P = 6.95×10−3, Table 2). PRDM2 is an estrogen receptor coactivator (103) that is highly expressed in the ovary and pituitary gland (88). Ligand-bound estrogen receptor α binds with PRDM2 to open chromatin at estrogen receptor α target genes (103, 104). PRDM2 also binds with the retinoblastoma protein (105), which has been shown to play an important role in follicular development in granulosa cells (106, 107).

Our results align with the emerging evidence that rare coding variants with large effect sizes do not play a major role in complex disease (108). The average estimated effect size of the rare DENND1A variants—assuming additive, equal variant weighting—was relatively modest (OR, 1.20; 95% CI, 0.69 to 2.14; Table 2), although individual variants may have larger effects. Of the 32 rare variants we identified in DENND1A, 30 were noncoding. Indeed, it appears that complex traits are primarily driven by noncoding variation (109, 110), both common and rare (110, 111). Paired WGS and transcriptome sequencing analysis from one extensive kindred demonstrated that rare noncoding variants had strong effects on individual gene-expression profiles (113). In a study designed similarly to ours, Ament et al. (114) found rare variants, the vast majority of which were noncoding, associated with increased risk for bipolar disorder. Owing to the relative cost-effectiveness of whole-exome sequencing (109), the limited availability of computational tools designed to predict the effects of noncoding variants on phenotypes (115), and our relatively poor understanding of regulatory mechanisms in the genome (116), noncoding variants have been noticeably understudied in complex trait genetics. As larger WGS data sets are accumulated and the focus of complex trait studies shifts more toward understanding regulatory mechanisms, the contribution of rare noncoding variants in various complex diseases will become clearer.

The central statistical challenge in studying rare variants is achieving adequate power to detect significant associations while controlling for type I error. The family-based structure of our cohort provided an enrichment of individual rare variants and enabled modeling of familial segregation (117). To mitigate variant calling errors, we used replicate samples from one family to determine optimal read depth and quality thresholds. To remove irrelevant variants from consideration, we applied a LINSIGHT score threshold and gene-specific CADD score thresholds (77) to filter for deleteriousness. We further prioritized variants by weighting them by their relative CADD scores. To group rare variants effectively, we applied several windows-based binning methods, in addition to the gene-based approach, to ensure that different kinds of functionally correlated genomic regions were tested, both of fixed length and of variable length. To limit our search to genes that were more likely to have specific roles in PCOS etiology, we only considered genes with rare deleterious variants in at least 10% of cases, as causal rare variants are more likely to accumulate in core disease genes (109). We greatly increased our power to detect relevant disease genes and account for pleiotropic effects by consolidating quantitative trait association results into a meta-analysis (80, 81). To reduce type I error, in addition to the variant calling quality control measures, we modeled the quantitative traits against skewed distributions and further normalized trait residuals using an INT.

Given the size of our cohort, it was necessary to apply relatively strict filters based on predicted variant effects and cumulative allele frequencies to detect rare variant associations. Very large sample sizes are otherwise required for WGS studies of rare variants (118, 119). By only including genes with rare, likely-deleterious variants in at least 10% of cases, we greatly reduced the multiple-testing burden of our analysis, thereby increasing our power to detect core PCOS genes (120, 121). Applying a priori hypotheses regarding which variants and genes may be relevant to disease, however, is analogous to a candidate gene approach (118). Any sets of rare variants that contribute to PCOS in smaller subpopulations of women with PCOS, as we found for AMH (38), were likely removed from consideration. Likewise, any causal rare variants that were not predicted bioinformatically to be deleterious based on existing annotations and evolutionary conservation (46, 50) would not have been detected. Furthermore, because variants were filtered for consistency with Mendelian inheritance, de novo mutations were not considered.

Because the number of genetic variants is directly correlated with the size of the gene (76), the CVF threshold introduced a bias toward larger genes in some of the analyses. However, this bias was mitigated by adjusting for gene length. Furthermore, any such bias was not applicable to our fixed-length windows-based approach, which replicated our DENND1A findings. It is also possible that causal rare variants with larger effect sizes were omitted from the meta-analysis because we tested against normalized trait residuals in an effort to reduce type I errors. Using normalized trait residuals may have excluded variants with large effects that produced outliers. However, as mentioned above, recent evidence has demonstrated that complex traits are primarily driven by noncoding variation with modest effect sizes (109, 110).

Despite the numerous steps taken to increase power, our study ultimately remained underpowered to detect rare variant associations with PCOS diagnosis in the dichotomous trait analysis. Although an association with PCOS quantitative traits implicates genetic variants in disease pathogenesis, it does not necessarily mean that a gene is associated with PCOS itself. The noncoding variants identified in this study, despite being rare and predicted to be deleterious, may be in LD with the actual pathogenic variant on the same alleles that were removed by the applied allele frequency or predicted effect thresholds. These noncoding variants may also affect the expression of one or more distant genes outside of the gene in which the variants are located. Importantly, the results of this study have not been replicated in an independent cohort and therefore, despite the established association of DENND1A variation with PCOS, should be regarded as hypothesis generating. Replication and functional studies are needed to confirm individual variant functionality and disease associations. Finally, because the women with PCOS studied herein were diagnosed according to National Institutes of Health criteria, our results may not be generalizable to other PCOS phenotypes defined by different sets of diagnostic criteria.

In summary, by applying family-based sequence kernel association tests on filtered whole-genome variant call data from a cohort of PCOS families, we were able to identify rare variants in the DENND1A gene that were associated with quantitative hormonal traits of PCOS. Because DENND1A is an important regulator of androgen biosynthesis, these findings provide additional genetic evidence to support our hypothesis that hyperandrogenemia is a core biologic pathway in PCOS (23, 97). Our results also suggest that rare noncoding variants contribute to the distinctive hormonal profile of PCOS. This study further demonstrates that using a quantitative trait meta-analysis can be a powerful approach in rare variant association testing, particularly for complex diseases with pleiotropic etiologies.

Acknowledgments

We thank Drs. Terry Farrah and Gustavo Glusman, from the Institute for Systems Biology, for facilitating variant queries via Kaviar (http://db.systemsbiology.net/kaviar). We thank the Northwestern University Research Computing Services team for supporting the computational needs of this research. We are very grateful to all of the families who took part in the study.

Financial Support: This study was supported by National Institutes of Health Grants P50 HD044405 (to A.D.) and R01 HD085227 (to A.D.). M.D. was supported by National Institutes of Health National Research Service Award T32 DK007169. This study uses data from the Scripps Wellderly Genome Resource, which is funded under National Institutes of Health Grant UL1 TR001114 and a Scripps Translational Science Institute Clinical and Translational Science Award.

Disclosure Summary: The authors have nothing to disclose.

Glossary

Abbreviations:

BMI

body mass index

CADD

Combined Annotation Dependent Depletion

CGI

Complete Genomics, Inc.

CVF

cumulative variant frequency

DHEAS

dehydroepiandrosterone sulfate

GWAS

genome-wide association study

HA

hyperandrogenemic

INT

inverse normal transformation

LD

linkage disequilibrium

MAF

minor allele frequency

MSC

mutation significance cutoff

Pc

corrected P value

Pn

nominal P value

PCOS

polycystic ovary syndrome

RBP

RNA-binding protein

SNP

single-nucleotide polymorphism

SNV

single nucleotide variant

T

testosterone

TF

transcription factor

WGS

whole-genome sequencing

μ (as subscript)

mean

References and Notes

  • 1. Azziz R. New insights into the genetics of polycystic ovary syndrome (published correction appears in Nat Rev Endocrinol. 2016;12(3):183). Nat Rev Endocrinol. 2016;12(2):74–75. [DOI] [PubMed] [Google Scholar]
  • 2. Gambineri A, Patton L, Altieri P, Pagotto U, Pizzi C, Manzoli L, Pasquali R. Polycystic ovary syndrome is a risk factor for type 2 diabetes: results from a long-term prospective study. Diabetes. 2012;61(9):2369–2374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Shi H, Kichaev G, Pasaniuc B. Contrasting the genetic architecture of 30 complex traits from summary association data. Am J Hum Genet. 2016;99(1):139–153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Chen ZJ, Zhao H, He L, Shi Y, Qin Y, Shi Y, Li Z, You L, Zhao J, Liu J, Liang X, Zhao X, Zhao J, Sun Y, Zhang B, Jiang H, Zhao D, Bian Y, Gao X, Geng L, Li Y, Zhu D, Sun X, Xu JE, Hao C, Ren CE, Zhang Y, Chen S, Zhang W, Yang A, Yan J, Li Y, Ma J, Zhao Y. Genome-wide association study identifies susceptibility loci for polycystic ovary syndrome on chromosome 2p16.3, 2p21 and 9q33.3. Nat Genet. 2011;43(1):55–59. [DOI] [PubMed] [Google Scholar]
  • 5. Shi Y, Zhao H, Shi Y, Cao Y, Yang D, Li Z, Zhang B, Liang X, Li T, Chen J, Shen J, Zhao J, You L, Gao X, Zhu D, Zhao X, Yan Y, Qin Y, Li W, Yan J, Wang Q, Zhao J, Geng L, Ma J, Zhao Y, He G, Zhang A, Zou S, Yang A, Liu J, Li W, Li B, Wan C, Qin Y, Shi J, Yang J, Jiang H, Xu JE, Qi X, Sun Y, Zhang Y, Hao C, Ju X, Zhao D, Ren CE, Li X, Zhang W, Zhang Y, Zhang J, Wu D, Zhang C, He L, Chen ZJ. Genome-wide association study identifies eight new risk loci for polycystic ovary syndrome. Nat Genet. 2012;44(9):1020–1025. [DOI] [PubMed] [Google Scholar]
  • 6. Hayes MG, Urbanek M, Ehrmann DA, Armstrong LL, Lee JY, Sisk R, Karaderi T, Barber TM, McCarthy MI, Franks S, Lindgren CM, Welt CK, Diamanti-Kandarakis E, Panidis D, Goodarzi MO, Azziz R, Zhang Y, James RG, Olivier M, Kissebah AH, Stener-Victorin E, Legro RS, Dunaif A; Reproductive Medicine Network. Genome-wide association of polycystic ovary syndrome implicates alterations in gonadotropin secretion in European ancestry populations (published correction appears in Nat Commun. 2016;7:10762). Nat Commun. 2015;6(1):7502. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Day FR, Hinds DA, Tung JY, Stolk L, Styrkarsdottir U, Saxena R, Bjonnes A, Broer L, Dunger DB, Halldorsson BV, Lawlor DA, Laval G, Mathieson I, McCardle WL, Louwers Y, Meun C, Ring S, Scott RA, Sulem P, Uitterlinden AG, Wareham NJ, Thorsteinsdottir U, Welt C, Stefansson K, Laven JS, Ong KK, Perry JR. Causal mechanisms and balancing selection inferred from genetic associations with polycystic ovary syndrome. Nat Commun. 2015;6(1):8464. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Day F, Karaderi T, Jones MR, Meun C, He C, Drong A, Kraft P, Lin N, Huang H, Broer L, Magi R, Saxena R, Laisk T, Urbanek M, Hayes MG, Thorleifsson G, Fernandez-Tajes J, Mahajan A, Mullin BH, Stuckey BGA, Spector TD, Wilson SG, Goodarzi MO, Davis L, Obermayer-Pietsch B, Uitterlinden AG, Anttila V, Neale BM, Jarvelin M-R, Fauser B, Kowalska I, Visser JA, Andersen M, Ong K, Stener-Victorin E, Ehrmann D, Legro RS, Salumets A, McCarthy MI, Morin-Papunen L, Thorsteinsdottir U, Stefansson K, Styrkarsdottir U, Perry JR, Dunaif A, Laven J, Franks S, Lindgren CM, Welt CK; 23andMe Research Team. Large-scale genome-wide meta-analysis of polycystic ovary syndrome suggests shared genetic architecture for different diagnosis criteria. PLoS Genet. 2018;14(12):e1007813. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Vink JM, Sadrzadeh S, Lambalk CB, Boomsma DI. Heritability of polycystic ovary syndrome in a Dutch twin-family study. J Clin Endocrinol Metab. 2006;91(6):2100–2104. [DOI] [PubMed] [Google Scholar]
  • 10. Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, McCarthy MI, Ramos EM, Cardon LR, Chakravarti A, Cho JH, Guttmacher AE, Kong A, Kruglyak L, Mardis E, Rotimi CN, Slatkin M, Valle D, Whittemore AS, Boehnke M, Clark AG, Eichler EE, Gibson G, Haines JL, Mackay TF, McCarroll SA, Visscher PM. Finding the missing heritability of complex diseases. Nature. 2009;461(7265):747–753. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Li X, Kim Y, Tsang EK, Davis JR, Damani FN, Chiang C, Hess GT, Zappala Z, Strober BJ, Scott AJ, Li A, Ganna A, Bassik MC, Merker JD, Hall IM, Battle A, Montgomery SB; GTEx Consortium; Laboratory, Data Analysis and Coordinating Center (LDACC)—Analysis Working Group; Statistical Methods groups—Analysis Working Group; Enhancing GTEx (eGTEx) groups; NIH Common Fund; NIH/NCI; NIH/NHGRI; NIH/NIMH; NIH/NIDA; Biospecimen Collection Source Site—NDRI; Biospecimen Collection Source Site—RPCI; Biospecimen Core Resource—VARI; Brain Bank Repository—University of Miami Brain Endowment Bank;Leidos Biomedical—Project Management; ELSI Study; Genome Browser Data Integration and Visualization—EBI; Genome Browser Data Integration and Visualization—UCSC Genomics Institute, University of California Santa Cruz. The impact of rare variation on gene expression across tissues. Nature. 2017;550(7675):239–243. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Walter K, Min JL, Huang J, Crooks L, Memari Y, McCarthy S, Perry JR, Xu C, Futema M, Lawson D, Iotchkova V, Schiffels S, Hendricks AE, Danecek P, Li R, Floyd J, Wain LV, Barroso I, Humphries SE, Hurles ME, Zeggini E, Barrett JC, Plagnol V, Richards JB, Greenwood CM, Timpson NJ, Durbin R, Soranzo N; UK10K Consortium. The UK10K project identifies rare variants in health and disease. Nature. 2015;526(7571):82–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Surakka I, Horikoshi M, Mägi R, Sarin AP, Mahajan A, Lagou V, Marullo L, Ferreira T, Miraglio B, Timonen S, Kettunen J, Pirinen M, Karjalainen J, Thorleifsson G, Hägg S, Hottenga JJ, Isaacs A, Ladenvall C, Beekman M, Esko T, Ried JS, Nelson CP, Willenborg C, Gustafsson S, Westra HJ, Blades M, de Craen AJ, de Geus EJ, Deelen J, Grallert H, Hamsten A, Havulinna AS, Hengstenberg C, Houwing-Duistermaat JJ, Hyppönen E, Karssen LC, Lehtimäki T, Lyssenko V, Magnusson PK, Mihailov E, Müller-Nurasyid M, Mpindi JP, Pedersen NL, Penninx BW, Perola M, Pers TH, Peters A, Rung J, Smit JH, Steinthorsdottir V, Tobin MD, Tsernikova N, van Leeuwen EM, Viikari JS, Willems SM, Willemsen G, Schunkert H, Erdmann J, Samani NJ, Kaprio J, Lind L, Gieger C, Metspalu A, Slagboom PE, Groop L, van Duijn CM, Eriksson JG, Jula A, Salomaa V, Boomsma DI, Power C, Raitakari OT, Ingelsson E, Järvelin MR, Thorsteinsdottir U, Franke L, Ikonen E, Kallioniemi O, Pietiäinen V, Lindgren CM, Stefansson K, Palotie A, McCarthy MI, Morris AP, Prokopenko I, Ripatti S; ENGAGE Consortium. The impact of low-frequency and rare variants on lipid levels. Nat Genet. 2015;47(6):589–597. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Zou Y, Lu P, Shi J, Liu W, Yang M, Zhao S, Chen N, Chen M, Sun Y, Gao A, Chen Q, Zhang Z, Ma Q, Ning T, Ying X, Jin J, Deng X, Shen B, Zhang Y, Yuan B, Kauderer S, Liu S, Hong J, Liu R, Ning G, Wang W, Gu W, Wang J. IRX3 promotes the browning of white adipocytes and its rare variants are associated with human obesity risk. EBioMedicine. 2017;24:64–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Turcot V, Lu Y, Highland HM, Schurmann C, Justice AE, Fine RS, Bradfield JP, Esko T, Giri A, Graff M, Guo X, Hendricks AE, Karaderi T, Lempradl A, Locke AE, Mahajan A, Marouli E, Sivapalaratnam S, Young KL, Alfred T, Feitosa MF, Masca NGD, Manning AK, Medina-Gomez C, Mudgal P, Ng MC, Reiner AP, Vedantam S, Willems SM, Winkler TW, Abecasis G, Aben KK, Alam DS, Alharthi SE, Allison M, Amouyel P, Asselbergs FW, Auer PL, Balkau B, Bang LE, Barroso I, Bastarache L, Benn M, Bergmann S, Bielak LF, Blüher M, Boehnke M, Boeing H, Boerwinkle E, Böger CA, Bork-Jensen J, Bots ML, Bottinger EP, Bowden DW, Brandslund I, Breen G, Brilliant MH, Broer L, Brumat M, Burt AA, Butterworth AS, Campbell PT, Cappellani S, Carey DJ, Catamo E, Caulfield MJ, Chambers JC, Chasman DI, Chen YI, Chowdhury R, Christensen C, Chu AY, Cocca M, Collins FS, Cook JP, Corley J, Corominas Galbany J, Cox AJ, Crosslin DS, Cuellar-Partida G, D’Eustacchio A, Danesh J, Davies G, Bakker PIW, Groot MC, Mutsert R, Deary IJ, Dedoussis G, Demerath EW, Heijer M, Hollander AI, Ruijter HM, Dennis JG, Denny JC, Di Angelantonio E, Drenos F, Du M, Dubé MP, Dunning AM, Easton DF, Edwards TL, Ellinghaus D, Ellinor PT, Elliott P, Evangelou E, Farmaki AE, Farooqi IS, Faul JD, Fauser S, Feng S, Ferrannini E, Ferrieres J, Florez JC, Ford I, Fornage M, Franco OH, Franke A, Franks PW, Friedrich N, Frikke-Schmidt R, Galesloot TE, Gan W, Gandin I,Gasparini P, Gibson J, Giedraitis V, Gjesing AP, Gordon-Larsen P, Gorski M, Grabe HJ, Grant SF, Grarup N, Griffiths HL, Grove ML, Gudnason V, Gustafsson S, Haessler J, Hakonarson H, Hammerschlag AR, Hansen T, Harris KM, Harris TB, Hattersley AT, Have CT, Hayward C, He L, Heard-Costa NL, Heath AC, Heid IM, Helgeland Ø, Hernesniemi J, Hewitt AW, Holmen OL, Hovingh GK, Howson JM, Hu Y, Huang PL, Huffman JE, Ikram MA, Ingelsson E, Jackson AU, Jansson JH, Jarvik GP, Jensen GB, Jia Y, Johansson S, Jørgensen ME, Jørgensen T, Jukema JW, Kahali B, Kahn RS, Kähönen M, Kamstrup PR, Kanoni S, Kaprio J, Karaleftheri M, Kardia SLR, Karpe F, Kathiresan S, Kee F, Kiemeney LA, Kim E, Kitajima H, Komulainen P, Kooner JS, Kooperberg C, Korhonen T, Kovacs P, Kuivaniemi H, Kutalik Z, Kuulasmaa K, Kuusisto J, Laakso M, Lakka TA, Lamparter D, Lange EM, Lange LA, Langenberg C, Larson EB, Lee NR, Lehtimäki T, Lewis CE, Li H, Li J, Li-Gao R, Lin H, Lin KH, Lin LA, Lin X, Lind L, Lindström J, Linneberg A, Liu CT, Liu DJ, Liu Y, Lo KS, Lophatananon A, Lotery AJ, Loukola A, Luan J, Lubitz SA, Lyytikäinen LP, Männistö S, Marenne G, Mazul AL, McCarthy MI, McKean-Cowdin R, Medland SE, Meidtner K, Milani L, Mistry V, Mitchell P, Mohlke KL, Moilanen L, Moitry M, Montgomery GW, Mook-Kanamori DO, Moore C, Mori TA, Morris AD, Morris AP, Müller-Nurasyid M, Munroe PB, Nalls MA, Narisu N, Nelson CP, Neville M, Nielsen SF, Nikus K, Njølstad PR, Nordestgaard BG, Nyholt DR, O’Connel JR, O’Donoghue ML, Olde Loohuis LM, Ophoff RA, Owen KR, Packard CJ, Padmanabhan S, Palmer CNA, Palmer ND, Pasterkamp G, Patel AP, Pattie A, Pedersen O, Peissig PL, Peloso GM, Pennell CE, Perola M, Perry JA, Perry JRB, Pers TH, Person TN, Peters A, Petersen ER, Peyser PA, Pirie A, Polasek O, Polderman TJ, Puolijoki H, Raitakari OT, Rasheed A, Rauramaa R, Reilly DF, Renström F, Rheinberger M, Ridker PM, Rioux JD, Rivas MA, Roberts DJ, Robertson NR, Robino A, Rolandsson O, Rudan I, Ruth KS, Saleheen D, Salomaa V, Samani NJ, Sapkota Y, Sattar N, Schoen RE, Schreiner PJ, Schulze MB, Scott RA, Segura-Lepe MP, Shah SH, Sheu WH, Sim X, Slater AJ, Small KS, Smith AV, Southam L, Spector TD, Speliotes EK, Starr JM, Stefansson K, Steinthorsdottir V, Stirrups KE, Strauch K, Stringham HM, Stumvoll M, Sun L, Surendran P, Swift AJ, Tada H, Tansey KE, Tardif JC, Taylor KD, Teumer A, Thompson DJ, Thorleifsson G, Thorsteinsdottir U, Thuesen BH, Tönjes A, Tromp G, Trompet S, Tsafantakis E, Tuomilehto J, Tybjaerg-Hansen A, Tyrer JP, Uher R, Uitterlinden AG, Uusitupa M, Laan SW, Duijn CM, Leeuwen N, van Setten J, Vanhala M, Varbo A, Varga TV, Varma R, Velez Edwards DR, Vermeulen SH, Veronesi G, Vestergaard H, Vitart V, Vogt TF, Völker U, Vuckovic D, Wagenknecht LE, Walker M, Wallentin L, Wang F, Wang CA, Wang S, Wang Y, Ware EB, Wareham NJ, Warren HR, Waterworth DM, Wessel J, White HD, Willer CJ, Wilson JG, Witte DR, Wood AR, Wu Y, Yaghootkar H, Yao J, Yao P, Yerges-Armstrong LM, Young R, Zeggini E, Zhan X, Zhang W, Zhao JH, Zhao W, Zhao W, Zhou W, Zondervan KT, Rotter JI, Pospisilik JA, Rivadeneira F, Borecki IB, Deloukas P, Frayling TM, Lettre G, North KE, Lindgren CM, Hirschhorn JN, Loos RJ;. CHD Exome+ Consortium; EPIC-CVD Consortium; ExomeBP Consortium; Global Lipids Genetic Consortium; GoT2D Genes Consortium; EPIC InterAct Consortium; INTERVAL StudyReproGen Consortium; T2D-Genes Consortium; MAGIC Investigators; Understanding Society Scientific Group. Protein-altering variants associated with body mass index implicate pathways that control energy intake and expenditure in obesity (published corrctions appear in Nat Genet. 2018;50(5):765–766 and Nat Genet. 2018;50(5):766–767). Nat Genet. 2018;50(1):26–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Russo A, Di Gaetano C, Cugliari G, Matullo G. Advances in the genetics of hypertension: the effect of rare variants. Int J Mol Sci. 2018;19(3):E688. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Gorsik LK, Kosova G, Werstein B, Sisk R, Legro RS, Hayes MG, Teixeira JM, Dunaif A, Urbanek M. Pathogenic anti-Müllerian hormone variants in polycystic ovary syndrome. J Clin Endocrinol Metab. 2017;102(8):2862–2872. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Gorsik LK, Dapas M, Legro RS, Hayes MG, Urbanek M. Functional genetic variation in the anti-Müllerian hormone pathway in women with polycystic ovary syndrome. J Clin Endocrinol Metab. 2019;104(7):2855–2874. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Zawadski JK, Dunaif A. Diagnostic criteria for polycystic ovary syndrome: towards a rational approach. In: Dunaif A, Givens JR, Haseltine FP, Merriam GR, eds. Polycystic Ovary Syndrome. Boston, MA: Blackwell Scientific; 1992:377–384. [Google Scholar]
  • 20. Diamanti-Kandarakis E, Dunaif A. Insulin resistance and the polycystic ovary syndrome revisited: an update on mechanisms and implications. Endocr Rev. 2012;33(6):981–1030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Kosova G, Urbanek M. Genetics of the polycystic ovary syndrome. Mol Cell Endocrinol. 2013;373(1–2):29–38. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Dunaif A. Perspectives in polycystic ovary syndrome: from hair to eternity. J Clin Endocrinol Metab. 2016;101(3):759–768. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Legro RS, Spielman R, Urbanek M, Driscoll D, Strauss JF III, Dunaif A. Phenotype and genotype in polycystic ovary syndrome. Recent Prog Horm Res. 1998;53:217–256. [PubMed] [Google Scholar]
  • 24. Urbanek M, Legro RS, Driscoll DA, Azziz R, Ehrmann DA, Norman RJ, Strauss JF III, Spielman RS, Dunaif A. Thirty-seven candidate genes for polycystic ovary syndrome: strongest evidence for linkage is with follistatin. Proc Natl Acad Sci USA. 1999;96(15):8573–8578. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Janssen I, Powell LH, Kazlauskaite R, Dugan SA. Testosterone and visceral fat in midlife women: the Study of Women’s Health Across the Nation (SWAN) fat patterning study. Obesity (Silver Spring). 2010;18(3):604–610. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Kim C, Harlow SD, Zheng H, McConnell DS, Randolph JF Jr. Changes in androstenedione, dehydroepiandrosterone, testosterone, estradiol, and estrone over the menopausal transition. Womens Midlife Health. 2017;3(1):9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Urbanek M, Woodroffe A, Ewens KG, Diamanti-Kandarakis E, Legro RS, Strauss JF III, Dunaif A, Spielman RS. Candidate gene region for polycystic ovary syndrome on chromosome 19p13.2. J Clin Endocrinol Metab. 2005;90(12):6623–6629. [DOI] [PubMed] [Google Scholar]
  • 28. Kotsopoulos J, Shafrir AL, Rice M, Hankinson SE, Eliassen AH, Tworoger SS, Narod SA. The relationship between bilateral oophorectomy and plasma hormone levels in postmenopausal women. Horm Cancer. 2015;6(1):54–63. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Yazdani S, Sharbatdaran M, Abedi Samakoosh M, Bouzari Z, Masoudi Z. Glucose tolerance and lipid profile changes after surgical menopause. Caspian J Intern Med. 2014;5(2):114–117. [PMC free article] [PubMed] [Google Scholar]
  • 30. Espeland MA, Hogan PE, Fineberg SE, Howard G, Schrott H, Waclawiw MA, Bush TL. Effect of postmenopausal hormone therapy on glucose and insulin concentrations. Diabetes Care. 1998;21(10):1589–1595. [DOI] [PubMed] [Google Scholar]
  • 31. Magkos F, Wang X, Mittendorfer B. Metabolic actions of insulin in men and women. Nutrition. 2010;26(7–8):686–693. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Sohrevardi SM, Nosouhi F, Hossein Khalilzade S, Kafaie P, Karimi-Zarchi M, Halvaei I, Mohsenzadeh M. Evaluating the effect of insulin sensitizers metformin and pioglitazone alone and in combination on women with polycystic ovary syndrome: an RCT. Int J Reprod Biomed (Yazd). 2016;14(12):743–754. [PMC free article] [PubMed] [Google Scholar]
  • 33. Torchen LC, Kumar A, Kalra B, Savjani G, Sisk R, Legro RS, Dunaif A. Increased antimüllerian hormone levels and other reproductive endocrine changes in adult male relatives of women with polycystic ovary syndrome. Fertil Steril. 2016;106(1):50–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Sam S, Legro RS, Essah PA, Apridonidze T, Dunaif A. Evidence for metabolic and reproductive phenotypes in mothers of women with polycystic ovary syndrome. Proc Natl Acad Sci USA. 2006;103(18):7030–7035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. World Health Organization. Definition and diagnosis of diabetes mellitus and intermediate hyperglycaemia: report of a WHO/IDF consultation. Available at: https://apps.who.int/iris/handle/10665/43588. Accessed 16 August 2018.
  • 36. Drmanac R, Sparks AB, Callow MJ, Halpern AL, Burns NL, Kermani BG, Carnevali P, Nazarenko I, Nilsen GB, Yeung G, Dahl F, Fernandez A, Staker B, Pant KP, Baccash J, Borcherding AP, Brownley A, Cedeno R, Chen L, Chernikoff D, Cheung A, Chirita R, Curson B, Ebert JC, Hacker CR, Hartlage R, Hauser B, Huang S, Jiang Y, Karpinchyk V, Koenig M, Kong C, Landers T, Le C, Liu J, McBride CE, Morenzoni M, Morey RE, Mutch K, Perazich H, Perry K, Peters BA, Peterson J, Pethiyagoda CL, Pothuraju K, Richter C, Rosenbaum AM, Roy S, Shafto J, Sharanhovich U, Shannon KW, Sheppy CG, Sun M, Thakuria JV, Tran A, Vu D, Zaranek AW, Wu X, Drmanac S, Oliphant AR, Banyai WC, Martin B, Ballinger DG, Church GM, Reid CA. Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays. Science. 2010;327(5961):78–81. [DOI] [PubMed] [Google Scholar]
  • 37. Baccash J, Halpern A, Tian C, Pant K, Carnevali P, inventors; Complete Genomics, Inc., assignee. Determining variants in genome of a heterogeneous sample. US patent application 2013/0110407-A1. 2 May 2013.
  • 38. Gorsic LK, Kosova G, Werstein B, Sisk R, Legro RS, Hayes MG, Teixeira JM, Dunaif A, Urbanek M. Pathogenic anti-Müllerian hormone variants in polycystic ovary syndrome. J Clin Endocrinol Metab. 2017;102(8):2862–2872. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Erikson Galina A, Bodian Dale L, Rueda M, Molparia B, Scott Erick R, Scott-Van Zeeland Ashley A, Topol Sarah E, Wineinger Nathan E, Niederhuber John E, Topol Eric J, Torkamani A.. Whole-Genome Sequencing of a Healthy Aging Cohort. Cell. 2016;165(4):1002–1011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Roach JC, Glusman G, Smit AF, Huff CD, Hubley R, Shannon PT, Rowen L, Pant KP, Goodman N, Bamshad M, Shendure J, Drmanac R, Jorde LB, Hood L, Galas DJ. Analysis of genetic inheritance in a family quartet by whole-genome sequencing. Science. 2010;328(5978):636–639. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Reumers J, De Rijk P, Zhao H, Liekens A, Smeets D, Cleary J, Van Loo P, Van Den Bossche M, Catthoor K, Sabbe B, Despierre E, Vergote I, Hilbush B, Lambrechts D, Del-Favero J. Optimized filtering reduces the error rate in detecting genomic variants by short-read sequencing. Nat Biotechnol. 2011;30(1):61–68. [DOI] [PubMed] [Google Scholar]
  • 42. Yuen RK, Thiruvahindrapuram B, Merico D, Walker S, Tammimies K, Hoang N, Chrysler C, Nalpathamkalam T, Pellecchia G, Liu Y, Gazzellone MJ, D’Abate L, Deneault E, Howe JL, Liu RS, Thompson A, Zarrei M, Uddin M, Marshall CR, Ring RH, Zwaigenbaum L, Ray PN, Weksberg R, Carter MT, Fernandez BA, Roberts W, Szatmari P, Scherer SW. Whole-genome sequencing of quartet families with autism spectrum disorder. Nat Med. 2015;21(2):185–191. [DOI] [PubMed] [Google Scholar]
  • 43. Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, Gibbs RA, Hurles ME, McVean GA; 1000 Genomes Project Consortium. A map of human genome variation from population-scale sequencing (published correction appears in Nature. 2011;473(7348):544). Nature. 2010;467(7319):1061–1073. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Jiang YH, Yuen RK, Jin X, Wang M, Chen N, Wu X, Ju J, Mei J, Shi Y, He M, Wang G, Liang J, Wang Z, Cao D, Carter MT, Chrysler C, Drmic IE, Howe JL, Lau L, Marshall CR, Merico D, Nalpathamkalam T, Thiruvahindrapuram B, Thompson A, Uddin M, Walker S, Luo J, Anagnostou E, Zwaigenbaum L, Ring RH, Wang J, Lajonchere C, Wang J, Shih A, Szatmari P, Yang H, Dawson G, Li Y, Scherer SW. Detection of clinically relevant genetic variants in autism spectrum disorder by whole-genome sequencing. Am J Hum Genet. 2013;93(2):249–263. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Dapas M, Sisk R, Legro RS, Urbanek M, Dunaif A, Hayes MG.. Data from: Family-based quantitative trait meta-analysis implicates rare noncoding variants in DENND1A in polycystic ovary syndrome. figshare 2019. Deposited 12 March 2019. 10.6084/m9.figshare.7834658. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Kircher M, Witten DM, Jain P, O’Roak BJ, Cooper GM, Shendure J. A general framework for estimating the relative pathogenicity of human genetic variants. Nat Genet. 2014;46(3):310–315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Itan Y, Shang L, Boisson B, Ciancanelli MJ, Markle JG, Martinez-Barricarte R, Scott E, Shah I, Stenson PD, Gleeson J, Cooper DN, Quintana-Murci L, Zhang SY, Abel L, Casanova JL. The mutation significance cutoff: gene-level thresholds for variant predictions. Nat Methods. 2016;13(2):109–110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, Kondrashov AS, Sunyaev SR. A method and server for predicting damaging missense mutations. Nat Methods. 2010;7(4):248–249. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Kumar P, Henikoff S, Ng PC. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat Protoc. 2009;4(7):1073–1081. [DOI] [PubMed] [Google Scholar]
  • 50. Huang YF, Gulko B, Siepel A. Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data. Nat Genet. 2017;49(4):618–624. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Dong C, Wei P, Jian X, Gibbs R, Boerwinkle E, Wang K, Liu X. Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies. Hum Mol Genet. 2015;24(8):2125–2137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Gnad F, Baucom A, Mukhyala K, Manning G, Zhang Z. Assessment of computational methods for predicting the effects of missense mutations in human cancers. BMC Genomics. 2013;14(Suppl 3):S7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. González-Pérez A, López-Bigas N. Improving the assessment of the outcome of nonsynonymous SNVs with a consensus deleteriousness score, Condel. Am J Hum Genet. 2011;88(4):440–449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Nishizaki SS, Boyle AP. Mining the unknown: assigning function to noncoding single nucleotide polymorphisms. Trends Genet. 2017;33(1):34–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Ernst J, Kellis M. Chromatin-state discovery and genome annotation with ChromHMM. Nat Protoc. 2017;12(12):2478–2492. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Zhang Y, Hardison RC. Accurate and reproducible functional maps in 127 human cell types via 2D genome segmentation. Nucleic Acids Res. 2017;45(17):9823–9836. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Rosenfield RL, Ehrmann DA. The pathogenesis of polycystic ovary syndrome (PCOS): the hypothesis of PCOS as functional ovarian hyperandrogenism revisited. Endocr Rev. 2016;37(5):467–520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Consortium EP, Bernstein BE, Birney E, Dunham I, Green ED, Gunter C, Snyder M; ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489(7414):57–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Forgacs AL, Ding Q, Jaremba RG, Huhtaniemi IT, Rahman NA, Zacharewski TR. BLTK1 murine Leydig cells: a novel steroidogenic model for evaluating the effects of reproductive and developmental toxicants. Toxicol Sci. 2012;127(2):391–402. [DOI] [PubMed] [Google Scholar]
  • 60. Huang-Doran I, Franks S. Genetic rodent models of obesity-associated ovarian dysfunction and subfertility: insights into polycystic ovary syndrome. Front Endocrinol (Lausanne). 2016;7:53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Tee MK, Speek M, Legeza B, Modi B, Teves ME, McAllister JM, Strauss JF III, Miller WL. Alternative splicing of DENND1A, a PCOS candidate gene, generates variant 2. Mol Cell Endocrinol. 2016;434:25–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Erikson GA, Bodian DL, Rueda M, Molparia B, Scott ER, Scott-Van Zeeland AA, Topol SE, Wineinger NE, Niederhuber JE, Topol EJ, Torkamani A. Whole-genome sequencing of a healthy aging cohort. Cell. 2016;165(4):1002–1011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Tombácz D, Maróti Z, Kalmár T, Csabai Z, Balázs Z, Takahashi S, Palkovits M, Snyder M, Boldogkői Z. High-coverage whole-exome sequencing identifies candidate genes for suicide in victims with major depressive disorder. Sci Rep. 2017;7(1):7106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Wong JK, Gui H, Kwok M, Ng PW, Lui CH, Baum L, Sham PC, Kwan P, Cherny SS. Rare variants and de novo variants in mesial temporal lobe epilepsy with hippocampal sclerosis. Neurol Genet. 2018;4(3):e245. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. He Z, Zhang D, Renton AE, Li B, Zhao L, Wang GT, Goate AM, Mayeux R, Leal SM. The rare-variant generalized disequilibrium test for association analysis of nuclear and extended pedigrees with application to Alzheimer disease WGS data (published correction appears in Am J Hum Genet.2017;100(2):371). Am J Hum Genet. 2017;100(2):193–204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Igartua C, Myers RA, Mathias RA, Pino-Yanes M, Eng C, Graves PE, Levin AM, Del-Rio-Navarro BE, Jackson DJ, Livne OE, Rafaels N, Edlund CK, Yang JJ, Huntsman S, Salam MT, Romieu I, Mourad R, Gern JE, Lemanske RF, Wyss A, Hoppin JA, Barnes KC, Burchard EG, Gauderman WJ, Martinez FD, Raby BA, Weiss ST, Williams LK, London SJ, Gilliland FD, Nicolae DL, Ober C. Ethnic-specific associations of rare and low-frequency DNA sequence variants with asthma. Nat Commun. 2015;6(1):5965. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Lubitz SA, Brody JA, Bihlmeyer NA, Roselli C, Weng LC, Christophersen IE, Alonso A, Boerwinkle E, Gibbs RA, Bis JC, Cupples LA, Mohler PJ, Nickerson DA, Muzny D, Perez MV, Psaty BM, Soliman EZ, Sotoodehnia N, Lunetta KL, Benjamin EJ, Heckbert SR, Arking DE, Ellinor PT, Lin H; NHLBI GO Exome Sequencing Project. Whole exome sequencing in atrial fibrillation. PLoS Genet. 2016;12(9):e1006284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Li M, Li Y, Weeks O, Mijatovic V, Teumer A, Huffman JE, Tromp G, Fuchsberger C, Gorski M, Lyytikäinen LP, Nutile T, Sedaghat S, Sorice R, Tin A, Yang Q, Ahluwalia TS, Arking DE, Bihlmeyer NA, Böger CA, Carroll RJ, Chasman DI, Cornelis MC, Dehghan A, Faul JD, Feitosa MF, Gambaro G, Gasparini P, Giulianini F, Heid I, Huang J, Imboden M, Jackson AU, Jeff J, Jhun MA, Katz R, Kifley A, Kilpeläinen TO, Kumar A, Laakso M, Li-Gao R, Lohman K, Lu Y, Mägi R, Malerba G, Mihailov E, Mohlke KL, Mook-Kanamori DO, Robino A, Ruderfer D, Salvi E, Schick UM, Schulz CA, Smith AV, Smith JA, Traglia M, Yerges-Armstrong LM, Zhao W, Goodarzi MO,Kraja AT, Liu C, Wessel J, Boerwinkle E, Borecki IB, Bork-Jensen J, Bottinger EP, Braga D, Brandslund I, Brody JA, Campbell A, Carey DJ, Christensen C, Coresh J, Crook E, Curhan GC, Cusi D, de Boer IH, de Vries AP, Denny JC, Devuyst O, Dreisbach AW, Endlich K, Esko T, Franco OH, Fulop T, Gerhard GS, Glümer C, Gottesman O, Grarup N, Gudnason V, Hansen T, Harris TB, Hayward C, Hocking L, Hofman A, Hu FB, Husemoen LL, Jackson RD, Jørgensen T, Jørgensen ME, Kähönen M, Kardia SL, König W, Kooperberg C, Kriebel J, Launer LJ, Lauritzen T, Lehtimäki T, Levy D, Linksted P, Linneberg A, Liu Y, Loos RJ, Lupo A, Meisinger C, Melander O, Metspalu A, Mitchell P, Nauck M, Nürnberg P, Orho-Melander M, Parsa A, Pedersen O, Peters A, Peters U, Polasek O, Porteous D, Probst-Hensch NM, Psaty BM, Qi L, Raitakari OT, Reiner AP, Rettig R, Ridker PM, Rivadeneira F, Rossouw JE, Schmidt F, Siscovick D, Soranzo N, Strauch K, Toniolo D, Turner ST, Uitterlinden AG, Ulivi S, Velayutham D, Völker U, Völzke H, Waldenberger M, Wang JJ, Weir DR, Witte D, Kuivaniemi H, Fox CS, Franceschini N, Goessling W, Köttgen A, Chu AY; CHARGE Glycemic-T2D Working Group;CHARGE Blood Pressure Working Group. SOS2 and ACP1 loci identified through large-scale exome chip analysis regulate kidney development and function. J Am Soc Nephrol. 2017;28(3):981–994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Crosby J, Peloso GM, Auer PL, Crosslin DR, Stitziel NO, Lange LA, Lu Y, Tang ZZ, Zhang H, Hindy G, Masca N, Stirrups K, Kanoni S, Do R, Jun G, Hu Y, Kang HM, Xue C, Goel A, Farrall M, Duga S, Merlini PA, Asselta R, Girelli D, Olivieri O, Martinelli N, Yin W, Reilly D, Speliotes E, Fox CS, Hveem K, Holmen OL, Nikpay M, Farlow DN, Assimes TL, Franceschini N, Robinson J, North KE, Martin LW, DePristo M, Gupta N, Escher SA, Jansson JH, Van Zuydam N, Palmer CN, Wareham N, Koch W, Meitinger T, Peters A, Lieb W, Erbel R, Konig IR, Kruppa J, Degenhardt F, Gottesman O, Bottinger EP, O’Donnell CJ, Psaty BM, Ballantyne CM, Abecasis G, Ordovas JM, Melander O, Watkins H, Orho-Melander M, Ardissino D, Loos RJ, McPherson R, Willer CJ, Erdmann J, Hall AS, Samani NJ, Deloukas P, Schunkert H, Wilson JG, Kooperberg C, Rich SS, Tracy RP, Lin DY, Altshuler D, Gabriel S, Nickerson DA, Jarvik GP, Cupples LA, Reiner AP, Boerwinkle E, Kathiresan S; TG and HDL Working Group of the Exome Sequencing Project, National Heart, Lung, and Blood Institute. Loss-of-function mutations in APOC3, triglycerides, and coronary disease. N Engl J Med. 2014;371(1):22–31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Dai W, Zheng H, Cheung AK, Tang CS, Ko JM, Wong BW, Leong MM, Sham PC, Cheung F, Kwong DL, Ngan RK, Ng WT, Yau CC, Pan J, Peng X, Tung S, Zhang Z, Ji M, Chiang AK, Lee AW, Lee VH, Lam KO, Au KH, Cheng HC, Yiu HH, Lung ML. Whole-exome sequencing identifies MST1R as a genetic susceptibility gene in nasopharyngeal carcinoma. Proc Natl Acad Sci USA. 2016;113(12):3317–3322. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Ferre-Fernández JJ, Aroca-Aguilar JD, Medina-Trillo C, Bonet-Fernández JM, Méndez-Hernández CD, Morales-Fernández L, Corton M, Cabañero-Valera MJ, Gut M, Tonda R, Ayuso C, Coca-Prados M, García-Feijoo J, Escribano J. Whole-exome sequencing of congenital glaucoma patients reveals hypermorphic variants in GPATCH3, a new gene involved in ocular and craniofacial development. Sci Rep. 2017;7(1):46175. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72. Strom SP, Gorin MB. Evaluation of autosomal dominant retinal dystrophy genes in an unaffected cohort suggests rare or private missense variants may often be benign. Mol Vis. 2013;19:980–985. [PMC free article] [PubMed] [Google Scholar]
  • 73. Schroeder JW, Hirst WG, Szewczyk GA, Simmons LA. The effect of local sequence context on mutational bias of genes encoded on the leading and lagging strands. Curr Biol. 2016;26(5):692–697. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74. Iossifov I, Levy D, Allen J, Ye K, Ronemus M, Lee YH, Yamrom B, Wigler M. Low load for disruptive mutations in autism genes and their biased transmission. Proc Natl Acad Sci USA. 2015;112(41):E5600–E5607. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75. Gao L, Fang Z, Zhang K, Zhi D, Cui X. Length bias correction for RNA-seq data in gene set analyses. Bioinformatics. 2011;27(5):662–669. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76. Shyr C, Tarailo-Graovac M, Gottlieb M, Lee JJ, van Karnebeek C, Wasserman WW. FLAGS, frequently mutated genes in public exomes [published correction appears in BMC Med Genomics. 2017;10(1):69]. BMC Med Genomics. 2014;7(1):64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77. Schaid DJ, McDonnell SK, Sinnwell JP, Thibodeau SN. Multiple genetic variant association testing by collapsing and kernel methods with pedigree or population structured data. Genet Epidemiol. 2013;37(5):409–418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78. Wei P, Cao Y, Zhang Y, Xu Z, Kwak IY, Boerwinkle E, Pan W. On robust association testing for quantitative traits and rare variants. G3 (Bethesda). 2016;6(12):3941–3950. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79. Auer PL, Reiner AP, Leal SM. The effect of phenotypic outliers and non-normality on rare-variant association testing. Eur J Hum Genet. 2016;24(8):1188–1194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80. Solovieff N, Cotsapas C, Lee PH, Purcell SM, Smoller JW. Pleiotropy in complex traits: challenges and strategies. Nat Rev Genet. 2013;14(7):483–495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81. Wu B, Pankow JS. Fast and accurate genome-wide association test of multiple quantitative traits. Comput Math Methods Med. 2018;2018:2564531. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82. Yang JJ, Li J, Williams LK, Buu A. An efficient genome-wide association test for multivariate phenotypes based on the Fisher combination function. BMC Bioinformatics. 2016;17(1):19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83. Lal L, Bharti A, Perween A.. To study the status of LH: FSH ratio in obese and non-obese patients of polycystic ovarian syndrome. J Dent Med Sci. 2017;16(1):20–23. [Google Scholar]
  • 84. Chan HP, Zhang NR, Chen LH. Importance sampling of word patterns in DNA and protein sequences. J Comput Biol. 2010;17(12):1697–1709. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85. Macintyre G, Bailey J, Haviv I, Kowalczyk A. is-rSNP: a novel technique for in silico regulatory SNP detection. Bioinformatics. 2010;26(18):i524–i530. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86. Zuo C, Shin S, Keleş S. atSNP: transcription factor binding affinity testing for regulatory SNP detection. Bioinformatics. 2015;31(20):3353–3355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87. Benjamini Y, Hochberg Y. Controlling the false discovery rate—a practical and powerful approach to multiple testing. J R Stat Soc B. 1995;57(1):289–300. [Google Scholar]
  • 88. GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat Genet. 2013;45(6):580–585. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89. Giudice G, Sánchez-Cabo F, Torroja C, Lara-Pezzi E.. ATtRACT—a database of RNA-binding proteins and associated motifs. Database (Oxford). 2016;2016;baw035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90. Gao J, Xue JD, Li ZC, Zhou L, Chen C. The association of DENND1A gene polymorphisms and polycystic ovary syndrome risk: a systematic review and meta-analysis. Arch Gynecol Obstet. 2016;294(5):1073–1080. [DOI] [PubMed] [Google Scholar]
  • 91. Pollard KS, Hubisz MJ, Rosenbloom KR, Siepel A. Detection of nonneutral substitution rates on mammalian phylogenies. Genome Res. 2010;20(1):110–121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92. Battle A, Brown CD, Engelhardt BE, Montgomery SB;. GTEx Consortium; Laboratory, Data Analysis and Coordinating Center (LDACC)—Analysis Working Group; Statistical Methods groups—Analysis Working Group; Enhancing GTEx (eGTEx) groups; NIH Common Fund; NIH/NCI; NIH/NHGRI; NIH/NIMH; NIH/NIDA; Biospecimen Collection Source Site—NDRI; Biospecimen Collection Source Site—RPCI; Biospecimen Core Resource—VARI; Brain Bank Repository—University of Miami Brain Endowment Bank; Leidos Biomedical—Project Management; ELSI Study; Genome Browser Data Integration andVisualization—EBI; Genome Browser Data Integration and Visualization—UCSC Genomics Institute, University of California Santa Cruz; Lead analysts; Laboratory, Data Analysis and Coordinating Center (LDACC); NIH program management; Biospecimen collection; Pathologye; QTL manuscript working group. Genetic effects on gene expression across human tissues [published correction appears in Nature. 2018;553(7689):530]. Nature. 2017;550(7675):204–213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93. Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, Korbel JO, Marchini JL, McCarthy S, McVean GA, Abecasis GR; 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature. 2015;526(7571):68–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94. Glusman G, Caballero J, Mauldin DE, Hood L, Roach JC. Kaviar: an accessible system for testing SNV novelty. Bioinformatics. 2011;27(22):3216–3217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95. Welt CK, Styrkarsdottir U, Ehrmann DA, Thorleifsson G, Arason G, Gudmundsson JA, Ober C, Rosenfield RL, Saxena R, Thorsteinsdottir U, Crowley WF, Stefansson K. Variants in DENND1A are associated with polycystic ovary syndrome in women of European ancestry. J Clin Endocrinol Metab. 2012;97(7):E1342–E1347. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96. McAllister JM, Modi B, Miller BA, Biegler J, Bruggeman R, Legro RS, Strauss JF III. Overexpression of a DENND1A isoform produces a polycystic ovary syndrome theca phenotype. Proc Natl Acad Sci USA. 2014;111(15):E1519–E1527. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97. Torchen LC, Idkowiak J, Fogel NR, O’Neil DM, Shackleton CH, Arlt W, Dunaif A. Evidence for increased 5α-reductase activity during early childhood in daughters of women with polycystic ovary syndrome. J Clin Endocrinol Metab. 2016;101(5):2069–2075. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98. McCartney CR, Eagleson CA, Marshall JC. Regulation of gonadotropin secretion: implications for polycystic ovary syndrome. Semin Reprod Med. 2002;20(4):317–326. [DOI] [PubMed] [Google Scholar]
  • 99. Eriksen MB, Nielsen MF, Brusgaard K, Tan Q, Andersen MS, Glintborg D, Gaster M. Genetic alterations within the DENND1A gene in patients with polycystic ovary syndrome (PCOS). PLoS One. 2013;8(9):e77186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100. Khan MJ, Nazli R, Ahmed J, Basit S. Whole genome sequencing instead of whole exome sequencing is required to identify the genetic causes of polycystic ovary syndrome in Pakistani families. Pak J Med Sci. 2018;34(3):540–545. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101. Khalaf M, Morera J, Bourret A, Reznik Y, Denoual C, Herlicoviez M, Mittre H, Benhaim A. BMP system expression in GCs from polycystic ovary syndrome women and the in vitro effects of BMP4, BMP6, and BMP7 on GC steroidogenesis. Eur J Endocrinol. 2013;168(3):437–444. [DOI] [PubMed] [Google Scholar]
  • 102. Shi J, Yoshino O, Osuga Y, Koga K, Hirota Y, Hirata T, Yano T, Nishii O, Taketani Y. Bone morphogenetic protein-6 stimulates gene expression of follicle-stimulating hormone receptor, inhibin/activin β subunits, and anti-Müllerian hormone in human granulosa cells. Fertil Steril. 2009;92(5):1794–1798. [DOI] [PubMed] [Google Scholar]
  • 103. Di Zazzo E, De Rosa C, Abbondanza C, Moncharmont B. PRDM proteins: molecular mechanisms in signal transduction and transcriptional regulation. Biology (Basel). 2013;2(1):107–141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104. Carling T, Kim KC, Yang XH, Gu J, Zhang XK, Huang S. A histone methyltransferase is required for maximal response to female sex hormones. Mol Cell Biol. 2004;24(16):7032–7042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105. Liu L, Shao G, Steele-Perkins G, Huang S. The retinoblastoma interacting zinc finger gene RIZ produces a PR domain-lacking product through an internal promoter. J Biol Chem. 1997;272(5):2984–2991. [DOI] [PubMed] [Google Scholar]
  • 106. Andreu-Vieyra C, Chen R, Matzuk MM. Conditional deletion of the retinoblastoma (Rb) gene in ovarian granulosa cells leads to premature ovarian failure. Mol Endocrinol. 2008;22(9):2141–2161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107. Yang QE, Nagaoka SI, Gwost I, Hunt PA, Oatley JM. Inactivation of retinoblastoma protein (Rb1) in the oocyte: evidence that dysregulated follicle growth drives ovarian teratoma formation in mice. PLoS Genet. 2015;11(7):e1005355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108. Wray NR, Wijmenga C, Sullivan PF, Yang J, Visscher PM. Common disease is more complex than implied by the core gene omnigenic model. Cell. 2018;173(7):1573–1580. [DOI] [PubMed] [Google Scholar]
  • 109. Boyle EA, Li YI, Pritchard JK. An expanded view of complex traits: from polygenic to omnigenic. Cell. 2017;169(7):1177–1186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110. Ma M, Ru Y, Chuang LS, Hsu NY, Shi LS, Hakenberg J, Cheng WY, Uzilov A, Ding W, Glicksberg BS, Chen R. Disease-associated variants in different categories of disease located in distinct regulatory elements. BMC Genomics. 2015;16(Suppl 8):S3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111. Igartua C, Mozaffari SV, Nicolae DL, Ober C. Rare non-coding variants are associated with plasma lipid traits in a founder population. Sci Rep. 2017;7(1):16415. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112. Zhao J, Akinsanmi I, Arafat D, Cradick TJ, Lee CM, Banskota S, Marigorta UM, Bao G, Gibson G. A burden of rare variants associated with extremes of gene expression in human peripheral blood. Am J Hum Genet. 2016;98(2):299–309. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113. Li X, Battle A, Karczewski KJ, Zappala Z, Knowles DA, Smith KS, Kukurba KR, Wu E, Simon N, Montgomery SB. Transcriptome sequencing of a large human family identifies the impact of rare noncoding variants. Am J Hum Genet. 2014;95(3):245–256. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114. Ament SA, Szelinger S, Glusman G, Ashworth J, Hou L, Akula N, Shekhtman T, Badner JA, Brunkow ME, Mauldin DE, Stittrich AB, Rouleau K, Detera-Wadleigh SD, Nurnberger JI Jr, Edenberg HJ, Gershon ES, Schork N, Price ND, Gelinas R, Hood L, Craig D, McMahon FJ, Kelsoe JR, Roach JC; Bipolar Genome Study. Rare variants in neuronal excitability genes influence risk for bipolar disorder. Proc Natl Acad Sci USA. 2015;112(11):3576–3581. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115. Liu Q, Chen C, Gao A, Tong HH, Xie L; Alzheimer’s Disease Neuroimaging Initiative. VariFunNet, an integrated multiscale modeling framework to study the effects of rare non-coding variants in genome-wide association studies: applied to Alzheimer’s Disease. Proceedings IEEE Int Conf Bioinformatics Biomed. 2017;2017:2177–2182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116. Zhou L, Zhao F. Prioritization and functional assessment of noncoding variants associated with complex diseases. Genome Med. 2018;10(1):53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117. Cirulli ET, Goldstein DB. Uncovering the roles of rare variants in common disease through whole-genome sequencing. Nat Rev Genet. 2010;11(6):415–425. [DOI] [PubMed] [Google Scholar]
  • 118. Wray NR, Gratten J. Sizing up whole-genome sequencing studies of common diseases. Nat Genet. 2018;50(5):635–637. [DOI] [PubMed] [Google Scholar]
  • 119. Visscher PM, Wray NR, Zhang Q, Sklar P, McCarthy MI, Brown MA, Yang J. 10 Years of GWAS discovery: biology, function, and translation. Am J Hum Genet. 2017;101(1):5–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120. Bourgon R, Gentleman R, Huber W. Independent filtering increases detection power for high-throughput experiments. Proc Natl Acad Sci USA. 2010;107(21):9546–9551. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121. Kim S, Schliekelman P. Prioritizing hypothesis tests for high throughput data. Bioinformatics. 2016;32(6):850–858. [DOI] [PubMed] [Google Scholar]

Articles from The Journal of Clinical Endocrinology and Metabolism are provided here courtesy of The Endocrine Society

RESOURCES