Genome-Wide Association Studies in Psychiatry: What Have We Learned?

Ann L Collins; Patrick F Sullivan

doi:10.1192/bjp.bp.112.117002

. Author manuscript; available in PMC: 2014 Oct 8.

Published in final edited form as: Br J Psychiatry. 2013 Jan;202(1):1–4. doi: 10.1192/bjp.bp.112.117002

Genome-Wide Association Studies in Psychiatry: What Have We Learned?

Ann L Collins ¹, Patrick F Sullivan ^1,²

PMCID: PMC4188920 NIHMSID: NIHMS632377 PMID: 23284144

Summary

Genome-wide association studies (GWAS) have been the focus of considerable effort in psychiatry. These efforts have markedly increased knowledge of the genetic basis of psychiatric disorders, and yielded empirical data on genetic architecture critical to addressing long-standing debates in the field. There is a now a clear path to increased knowledge of the “parts lists” for these disorders.

Keywords: genetics, genome-wide association, review

GWAS in Psychiatry

GWAS usually contrast the frequencies of genetic variants between cases and controls for a large set of genetic markers distributed across the genome (see Corvin et al. for an introduction to these methods). (1) GWAS are notable for large numbers of genetic makers (usually 500K–1M) and large sample sizes (often >10,000 subjects).

Since 2005, 1,050 GWAS of human diseases and biometrical traits have been published.^‡ These studies of 575 phenotypes implicated 2,881 genetic variants at stringent levels of significance, often with compelling independent replication. Since 2007, 115 GWAS focused on psychiatric disorders have appeared, most on Alzheimer’s disease, bipolar disorder, and schizophrenia along with efforts in ADHD, autism, major depressive disorder, and licit and illicit drug use and dependence. Given these efforts, it is timely to ask what we have gained from this body of work.

Culture Shifts

GWAS efforts have directly or indirectly led to important changes in the conduct of genetic studies. First, large-scale collaboration has become the norm in psychiatric genetics. Because the sample sizes needed for GWAS discovery and replication are beyond the reach of single groups, multiple consortia have emerged to foster scientific discovery. As an example, the Psychiatric Genomics Consortium (PGC)(2) has ~300 investigators and >75,000 subjects with GWAS data under analysis, and may be the largest consortium in the history of psychiatry.

Second, prompt sharing of results and full genomic data is now standard. This welcome change maximizes progress toward understanding the genetic basis of critically important psychiatric disorders, and fosters reproducibility by allowing the independent evaluation of claims of association. For example, anyone can view or obtain the results from the PGC studies, and qualified investigators can obtain individual-level data in order to conduct additional analyses (Web Resources).

Third, as critically, uncompromising statistical rigor is now required. Genomic studies must explicitly account for the 10⁵–10⁶ statistical comparisons. Thresholds for declaring significance are severe but appropriate, and most journals require replication in independent samples. GWAS can observe dozens of associations with P~10⁻⁶–10⁻⁷ resulting from the play of chance. Most investigators now understand that “intriguing biology” is irrelevant to establishing a robust genetic association. For example, we observed multiple associations of MDD with PCLO (piccolo) with P ~10⁻⁷. (3) The biology of PCLO is fascinating (among its other functions, PCLO “touches” serotonin); however, this “intriguing” association did not withstand the test of replication. (3, 4) The community standard in human genetics now requires significance well beyond chance plus replication in independent samples. The biology of a gene does not play a role in establishing the association.

Discoveries of Genetic Loci

The central goal of psychiatric genetics is to discover loci that are robustly and repeatedly associated with a disorder and thereby gain insight into etiology (Table 1). Around 10 copy number variants (CNVs) have been discovered (included in this review as most were discovered using GWAS technology). These CNVs are rare (<0.1% in controls), potent (odds ratios of 4–20), and often non-specific risk factors for psychiatric disorders. (5) In addition, GWAS have strongly implicated common variation in ~30 different genomic loci for psychiatric disorders along with loci for nicotine (CHRNA3, BNDF, CYP2A6) and alcohol consumption (ADH1B, ALDH2, AUTS2). These genetic variants are relatively common (allele frequencies > 5%) and subtly increase disease risk (odds ratios 1.10–1.25).

Table 1.

Empirical findings for psychiatric disorders.

Finding	Location or SNP	Band, nearest gene, or finding	Class	Disorder
CNV	chr1:145.0–148.0	1q21.1	Rare, potent	SCZ
	chr2:50.1–51.2	2p16.3	Rare, potent	ASD, SCZ
	chr3:195.7–197.3	3q29	Rare, potent	SCZ
	chr7:72.7–74.1	7q11.23	Rare, potent	ASD
	chr7:158.8–158.9	7q36.3	Rare, potent	SCZ
	chr15:23.6–28.4	15q11.2	Rare, potent	ASD
	chr15:30.9–33.5	15q13.3	Rare, potent	ADHD, ASD, SCZ
	chr16:15.4–16.3	16p13.11	Rare, potent	ADHD
	chr16:29.5–30.2	16p11.2	Rare, potent	ASD, SCZ
	chr17:34.8–36.2	17q12	Rare, potent	ASD, SCZ
	chr22:18.7–21.8	22q11.21	Rare, potent	ASD, SCZ
SNP	rs3818361	CR1	Common, subtle	AD
	rs744373	BIN1	Common, subtle	AD
	rs9349407	CD2AP	Common, subtle	AD
	rs11767557	EPHA1	Common, subtle	AD
	rs11136000	CLU	Common, subtle	AD
	rs610932	MS4A cluster	Common, subtle	AD
	rs3851179	PICALM	Common, subtle	AD
	rs3764650	ABCA7	Common, subtle	AD
	rs2075650	APOE, TOMM40	Common, notably strong	AD
	rs3865444	CD33	Common, subtle	AD
	rs12576775	ODZ4	Common, subtle	BIP
	rs4765913	CACNA1C	Common, subtle	BIP
	rs1064395	NCAN	Common, subtle	BIP
	rs1625579	MIR137	Common, subtle	SCZ
	rs2312147	VRK2	Common, subtle	SCZ
	rs1344706	ZNF804A	Common, subtle	SCZ
	rs17662626	PCGEM1	Common, subtle	SCZ
	rs13211507	MHC	Common, subtle	SCZ
	rs7004635	MMP16	Common, subtle	SCZ
	rs10503253	CSMD1	Common, subtle	SCZ
	rs16887244	LSM1	Common, subtle	SCZ
	rs7914558	CNNM2	Common, subtle	SCZ
	rs11191580	NT5C2	Common, subtle	SCZ
	rs11819869	AMBRA1	Common, subtle	SCZ
	rs12807809	NRGN	Common, subtle	SCZ
	rs12966547	CCDC68	Common, subtle	SCZ
	rs9960767	TCF4	Common, subtle	SCZ
	rs1344706	ZNF804A	Common, subtle	SCZ+BIP
	rs2239547	ITIH3-ITIH4	Common, subtle	SCZ+BIP
	rs10994359	ANK3	Common, subtle	SCZ+BIP
	rs4765905	CACNA1C	Common, subtle	SCZ+BIP
Heritability		0.40	Common variation	BIP
		0.30	Common variation	SCZ
Pathway		miR-137 network	Common variation	SCZ
Analysis		Cholesterol, innate immune	Common variation	AD
		Calcium signaling	Common variation	BIP
		Post-synaptic signaling	Rare CNVs	BIP, SCZ
Burden		Increased, multiple studies	Rare CNVs	ASD, SCZ
		Increased	Rare CNVs	ADHD, BIP, MDD
		Increased	Common variation	SCZ, BIP, MDD

Open in a new tab

See Tables 2 and 3 in reference (5) for full citations. The CNV and SNP findings meet genome-wide significance in large samples. Most are likely secure but some may not stand the test of time. The heritability, pathway, and burden results have replicated in multiple samples and/or represent consistent results from different analytical methods. Genomic locations are NCBI Build 37/UCSC hg19. Abbreviations: CNV=copy number variation, SNP=single nucleotide polymorphism, AD=Alzheimer’s disease, ASD=autism spectrum disorders, BIP=bipolar disorder, MDD=major depressive disorder, SCZ=schizophrenia.

Associations popular in the literature before 2007 (e.g., COMT, DRD3, DRD2, HTR2A, NRG1, BDNF, DTNBP1 and SLC6A4) have generally not fared well in GWAS (6). The reasons are unclear as is whether these genes have definitively been excluded from consideration. A few associations have stood the test of time, and are limited to loci with unusually large effect sizes (Alzheimer’s disease-APOE, schizophrenia-22q11.21, and alcohol dependence-alcohol metabolic genes).

More Complex Analyses

The basic analytic model used in most GWAS is very simple and considers single genetic markers in isolation. This simple model is not optimal given empirical data that psychiatric disorders are polygenic, and analyses of sets of markers could provide further insight by better reflecting the fundamental genetic architecture.

Of the ways in which multiple markers can be analyzed in combination, three have been the focus of particular effort (Table 1). First, large sets of genetic markers can be used to estimate heritability. Unlike twin or family studies whose assumptions continue to be criticized, these approaches yield assessments of heritability based directly on the genome. For schizophrenia and bipolar disorder, these results confirm that substantial proportions of the variance in liability (~25% or approximately ⅓ to ½ of the heritability) are accounted for by the current generation of genotyping arrays. (7) Thus, the critical assumption that has driven a generation of genetic studies now seems particularly secure.

Second, pathway analyses evaluate whether associations in pre-defined sets of genes have smaller p-values than expected. (5) These analyses have provided new ideas about the biology of these disorders as hypotheses for future work, and include: micro-RNA miR-137 and multiple genes containing binding sites for miR-137 in schizophrenia; calcium signaling in bipolar disorder; cholesterol metabolism and the innate immune response in Alzheimer’s disease; and post-synaptic signaling in schizophrenia and bipolar disorder (via CNVs).

Third, it is possible to compare the “burden” of a type of genetic variant between cases and controls. (5) Cases with autism or schizophrenia have consistently been reported to have a greater burden of rare CNVs than controls, and similar findings may hold for ADHD, bipolar disorder, and major depressive disorder. For common genetic variation, cases with schizophrenia, bipolar disorder, and major depressive disorder have greater burden of risk alleles than controls. For schizophrenia, this finding is highly replicable and highly significant (P< 10⁻²⁵). (5)

Genetic Architecture

GWAS has provided real data about the genetic basis of psychiatric disorders. Genetic architecture refers to the number of loci conferring risk for a disorder and their frequencies, effect sizes, modes of action, and interactions with other genetic loci and environmental factors. In the many, mostly philosophical debates on this topic in the past century, two extreme views have been articulated: psychiatric disorders are caused by rare mutations of strong effect with most cases having a different causal variant “versus” a causal model consisting of the cumulative effects of many common variants of relatively subtle effects.

Where the results are sufficient to afford the ability to judge, the answer is that both common and rare variants have roles. This general conclusion applies to Alzheimer’s disease (rare mutations with very strong effects (APP, PSEN1, PSEN2) and ten common variants of far more subtle effects) and bipolar disorder and schizophrenia (although no Mendelian-like mutations have been identified, rare CNVs play a role along with multiple common variants). For autism spectrum disorders, the rare variant catalog is more complete (rare Mendelian syndromes with autistic features like Rett syndrome, karyotype abnormalities in ~5% of cases, and CNVs in 5–10% of cases). A recent series of papers in Nature used exome sequencing to identify just three candidate genes (SCN2A, KATNAL2, CHD8) containing rare de novo variants. Indeed, the hypothesis that autism results only from many different Mendelian-like mutations could be rejected. The role of common variation in autism is currently unknown as the available GWAS samples are small by current standards.

Common variation has been implicated for alcohol and nicotine consumption. For ADHD, several rare CNVs have been reported. For anorexia nervosa, obsessive compulsive disorder, post-traumatic stress disorder, and Tourette’s syndrome, the published data are sparse, and the roles of common and rare variation are unknown. For all of these disorders, GWAS sample sizes are not large by current standards, and considerably smaller than the sample sizes that were required to identify robust and replicable findings for other biomedical diseases.

For major depressive disorder, a large GWAS mega-analysis (~19,000 subjects)failed to identify findings of genome-wide significance. In context, nearly all GWAS with sample sizes above 11,000 subjects identified at least one genome-wide significant finding.

What Have We Learned?

For most researchers, the rare “versus” common variant debate is settled. Unsurprisingly, where there are reasonable amounts of data, the answers contain elements of both models. Although some have wished for these disorders to conform to a classical Medical Genetic model whereby the etiology of complex psychiatric disorders would resolve into a series of highly penetrant mutations, such hopes have now been demonstrated to be inconsistent with results for Alzheimer’s disease, autism, and schizophrenia. The “many Mendelians” model now seems to be very unlikely.

Psychiatric disorders are polygenic. The rare variant and the common variant results indicate that many different loci are involved in Alzheimer’s disease, bipolar disorder, schizophrenia, autism spectrum disorders, and drug consumption. A parsimonious hypothesis is that these variants encode or regulate multi-component biological pathways. (8) Several intriguing hypotheses have emerged from GWAS data that suggest novel mechanisms underlying these disorders.

There are more discoveries to be made. From the experiences of other biomedical disorders for which GWAS has been conspicuously successful (e.g., type 2 diabetes mellitus or inflammatory bowel disease), we can confidently project that larger studies will yield more robust and replicable findings. To our knowledge, this is the first time in the history of psychiatry where there is a clear path to increasing our fundamental understanding of these disorders.

Acknowledgments

We are indebted to the tens of thousands of individuals who chose to participate in the work summarized here, and to the hundreds of scientific colleagues who have donated thousands of person-hours to these efforts. A multitude of sources have funded this work, including governmental funds and philanthropy but considerable effort has been donated by individual researchers. The PGC has been funded by the NIMH via MH085520 and MH094421.

Footnotes

^‡

NHGRI GWAS catalog, http://www.genome.gov/26525384, downloaded 4 June 2012.

Web Resources

Genome-wide results for PGC analyses for ADHD, bipolar disorder, MDD, and schizophrenia can be freely downloaded (https://pgc.unc.edu) and visualized in genomic regional context (http://www.broadinstitute.org/mpg/ricopili). Full results and individual data are available by application to the NIMH Repository (https://www.nimhgenetics.org), dbGaP (http://www.ncbi.nlm.nih.gov/dbgap), and/or the Wellcome Trust (http://www.wtccc.org.uk). The top findings for published GWAS are also available (http://www.genome.gov/26525384).

Financial Disclosures

Dr Sullivan was on the SAB for Expression Analysis, Durham, NC, USA, and received unrestricted support for genetic research in schizophrenia from Eli Lilly.

References

1.Corvin A, Craddock N, Sullivan PF. Genome-wide association studies: a primer. Psychologal Medicine. 2010;40:1063–77. doi: 10.1017/S0033291709991723. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Psychiatric GWAS Consortium. A framework for interpreting genomewide association studies of psychiatric disorders. Molecular Psychiatry. 2009;14:10–7. doi: 10.1038/mp.2008.126. [DOI] [PubMed] [Google Scholar]
3.Sullivan P, de Geus E, Willemsen G, James MR, Smit JH, Zandbelt T, Arolt V, Baune BT, Blackwood D, Cichon S, Coventry WL, Domschke K, Farmer A, Fava M, Gordon SD, He Q, Heath A, Heutink P, Holsboer F, Hoogendijk WJ, Hottenga JJ, Hu Y, Kohli M, Lin D, Lucae S, MacIntyre DJ, Maier W, McGhee KA, McGuffin P, Montgomery GW, Muir WJ, Nolen W, Nöthen MM, Perlis RH, Pirlo K, Posthuma D, Rietschel M, Rizzu P, Schosser A, Smit AB, Smoller JW, Tzeng J-Y, van Dyck R, Verhage M, GZF, Martin NG, Wray NR, Boomsma DI, Penninx B. Genomewide association for major depressive disorder: a possible role for the presynaptic protein piccolo. Molecular Psychiatry. 2009;14:359–75. doi: 10.1038/mp.2008.125. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Major Depressive Disorder Working Group of the PGC. A mega-analysis of genome-wide association studies for major depressive disorder. Molecular Psychiatry. 2012 doi: 10.1038/mp.2012.21. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Sullivan PF, Daly MJ, O’Donovan M. Genetic architectures of psychiatric disorders: the emerging picture and its implications. Nature Reviews Genetics. 2012;13:537–51. doi: 10.1038/nrg3240. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Collins AL, Kim Y, Sklar P, O’Donovan MC, Sullivan PF International Schizophrenia Consortium. Hypothesis-driven candidate genes for schizophrenia compared to genome-wide association results. Psychol Med. 2012;42:607–16. doi: 10.1017/S0033291711001607. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Visscher PM, Brown MA, McCarthy MI, Yang J. Five Years of GWAS Discovery. American journal of human genetics. 2012 Jan 13;90(1):7–24. doi: 10.1016/j.ajhg.2011.11.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Sullivan PF. Schizophrenia as a pathway disease. Nature Medicine. 2012;18:210–11. doi: 10.1038/nm.2670. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R1] 1.Corvin A, Craddock N, Sullivan PF. Genome-wide association studies: a primer. Psychologal Medicine. 2010;40:1063–77. doi: 10.1017/S0033291709991723. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R2] 2.Psychiatric GWAS Consortium. A framework for interpreting genomewide association studies of psychiatric disorders. Molecular Psychiatry. 2009;14:10–7. doi: 10.1038/mp.2008.126. [DOI] [PubMed] [Google Scholar]

[R3] 3.Sullivan P, de Geus E, Willemsen G, James MR, Smit JH, Zandbelt T, Arolt V, Baune BT, Blackwood D, Cichon S, Coventry WL, Domschke K, Farmer A, Fava M, Gordon SD, He Q, Heath A, Heutink P, Holsboer F, Hoogendijk WJ, Hottenga JJ, Hu Y, Kohli M, Lin D, Lucae S, MacIntyre DJ, Maier W, McGhee KA, McGuffin P, Montgomery GW, Muir WJ, Nolen W, Nöthen MM, Perlis RH, Pirlo K, Posthuma D, Rietschel M, Rizzu P, Schosser A, Smit AB, Smoller JW, Tzeng J-Y, van Dyck R, Verhage M, GZF, Martin NG, Wray NR, Boomsma DI, Penninx B. Genomewide association for major depressive disorder: a possible role for the presynaptic protein piccolo. Molecular Psychiatry. 2009;14:359–75. doi: 10.1038/mp.2008.125. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] 4.Major Depressive Disorder Working Group of the PGC. A mega-analysis of genome-wide association studies for major depressive disorder. Molecular Psychiatry. 2012 doi: 10.1038/mp.2012.21. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] 5.Sullivan PF, Daly MJ, O’Donovan M. Genetic architectures of psychiatric disorders: the emerging picture and its implications. Nature Reviews Genetics. 2012;13:537–51. doi: 10.1038/nrg3240. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] 6.Collins AL, Kim Y, Sklar P, O’Donovan MC, Sullivan PF International Schizophrenia Consortium. Hypothesis-driven candidate genes for schizophrenia compared to genome-wide association results. Psychol Med. 2012;42:607–16. doi: 10.1017/S0033291711001607. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] 7.Visscher PM, Brown MA, McCarthy MI, Yang J. Five Years of GWAS Discovery. American journal of human genetics. 2012 Jan 13;90(1):7–24. doi: 10.1016/j.ajhg.2011.11.029. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] 8.Sullivan PF. Schizophrenia as a pathway disease. Nature Medicine. 2012;18:210–11. doi: 10.1038/nm.2670. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Genome-Wide Association Studies in Psychiatry: What Have We Learned?

Ann L Collins, PhD

Patrick F Sullivan, MD, FRANZCP

Summary

GWAS in Psychiatry

Culture Shifts

Discoveries of Genetic Loci

Table 1.

More Complex Analyses

Genetic Architecture

What Have We Learned?

Acknowledgments

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Genome-Wide Association Studies in Psychiatry: What Have We Learned?

Ann L Collins, PhD

Patrick F Sullivan, MD, FRANZCP

Summary

GWAS in Psychiatry

Culture Shifts

Discoveries of Genetic Loci

Table 1.

More Complex Analyses

Genetic Architecture

What Have We Learned?

Acknowledgments

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases