Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2017 Jan 30.
Published in final edited form as: Nat Genet. 2016 Dec 5;49(1):152–156. doi: 10.1038/ng.3736

Genome-wide analyses for personality traits identify six genomic loci and show correlations with psychiatric disorders

Min-Tzu Lo 1, David A Hinds 2, Joyce Y Tung 2, Carol Franz 3, Chun-Chieh Fan 1,4, Yunpeng Wang 5,6, Olav B Smeland 6, Andrew Schork 1,4, Dominic Holland 5, Karolina Kauppi 1,7, Nilotpal Sanyal 1, Valentina Escott-Price 8, Daniel J Smith 9, Michael O'Donovan 8, Hreinn Stefansson 10, Gyda Bjornsdottir 10, Thorgeir E Thorgeirsson 10, Kari Stefansson 10, Linda K McEvoy 1, Anders M Dale 1,3,5, Ole A Andreassen 6, Chi-Hua Chen 1,*
PMCID: PMC5278898  NIHMSID: NIHMS838768  PMID: 27918536

Summary

Personality is influenced by genetic and environmental factors1, and associated with mental health. However, the underlying genetic determinants are largely unknown. We identified six genetic loci, including five novel loci2,3, significantly associated with personality traits in a meta-analysis of genome-wide association studies (N=123,132–260,861). Of these genome-wide significant loci, extraversion was associated with variants in WSCD2 and near PCDH15, and neuroticism with variants on chromosome 8p23.1 and in L3MBTL2. We performed a principal component analysis to extract major dimensions underlying genetic variations among five personality traits and six psychiatric disorders (N=5,422–18,759). The first genetic dimension separated personality traits and psychiatric disorders, except that neuroticism and openness to experience were clustered with the disorders. High genetic correlations were found between extraversion and attention-deficit/hyperactivity disorder (ADHD), and between openness and schizophrenia/bipolar disorder. The second genetic dimension was closely aligned with extraversion-introversion and grouped neuroticism with internalizing psychopathology (e.g., depression/anxiety).


The Five Factor Model (FFM) of personality, also known as “the Big Five” is commonly used to measure individual differences in personality. It models personality according to five broad domains4. Extraversion (versus introversion) reflects talkativeness, assertiveness and activity level. Neuroticism (versus emotional stability) denotes negative affect like anxiety and depression. Agreeableness (versus antagonism) measures cooperativeness and compassion. Conscientiousness (versus undependability) depicts order and discipline. Openness to experience (versus closedness) captures intellectual curiosity and creativity4,5. Personality phenotypes, measured by various questionnaires, are represented by continuous quantitative scores on each of the five traits4.

A meta-analysis of twin and family studies found that approximately 40% of the variance in personality could be attributed to genetic factors1. Genome-wide association studies (GWAS) have discovered several variants associated with FFM traits68. Neuroticism was reported to be associated with an intronic variant in MAGI1 (P=9.26×10−9, N=63,661)7, conscientiousness with an intronic variant in KATNAL2 (P=4.9×10−8, N=17,375)6, and openness with variants near RASA1 (P=2.8×10−8, N=17,375)6 and near PTPRD (P=1.67×10−8, N=1,089)8. Recent UK Biobank studies (N=106,716–170,908) yielded several single nucleotide polymorphisms (SNPs) associated with neuroticism2,3.

Another large study, 23andMe, contains well-phenotyped data on personality and offers opportunity to identify additional genetic variants, since all five personality traits were measured in all individuals using the same personality inventory (Online Methods). We performed a meta-analysis based on GWAS summary statistics to identify genetic variants associated with FFM traits. We included participants with European ancestry from 23andMe (N=59,225) and two samples from the Genetics of Personality Consortium (GPC)6,7. GPC-1 (N=17,375)6 contains data on agreeableness, conscientiousness, and openness, whereas GPC-2 (N=63,661)7 contains information on extraversion and neuroticism.

Summary statistics of GWAS from 23andMe (available in Supplementary Data Sets 1–5 for the top 10K SNPs) were combined with the two GPC samples separately, yielding a total of 76,600 and 122,886 subjects as the discovery/stage 1 sample. Eight linkage-disequilibrium (LD)-independent SNPs (LD r2<0.05) were discovered exceeding GWAS significance (P<5×10−8) in the combined meta-analysis (Table 1 and Fig. 1).

Table 1.

LD-independent genetic variants significantly associated with personality traits

SNP Chr Closest
gene
(region)
A1/
A2
Frq Discovery/Stage 1 Replication/Stage 2 Final combined analysis of
stage 1 and stage 2

23andMe
(N = ~59,200)
GPC
(N = 17,375/
63,661)
Combined analysis 23andMe replication
(N = ~39,500)
deCODE
(N = ~7,100)
UK Biobank
(N = 91,370)

β SE P-value β SE P-value P-value N β SE P-value β SE P-value β SE P-value P-value N R2 (%)
Conscientiousness
rs3814424 5q LINC00461* T/C 0.17 −0.289 0.050 9.75×10−9 −0.138 0.131 0.294 2.98×10−8 76,551 −0.051 0.051 0.313 −0.005 0.027 0.855 6.19×10−7 123,132 0.0202
Extraversion
rs57590327 3p GBE1
(intergenic)
T/G 0.26 0.236 0.054 1.37×10−5 0.026 0.006 2.03×10−5 1.61×10−9 122,886 0.088 0.052 0.091 0.007 0.019 0.713 1.26×10−9 169,507 0.0217
rs2164273 8p MTMR9
(intron)
G/A 0.39 0.179 0.047 1.14×10−4 0.024 0.006 4.08×10−5 1.79×10−8 122,845 0.093 0.045 0.037 0.021 0.018 0.255 1.61×10−9 169,466 0.0215
rs6481128 10q PCDH15
(intergenic)
G/A 0.45 0.205 0.046 7.10×10−6 0.018 0.005 0.0010 4.15×10−8 122,886 0.154 0.045 5.58×10−4 −0.011 0.017 0.528 5.44×10−10 169,507 0.0227
rs1426371 12q WSCD2
(intron)
A/G 0.28 −0.308 0.053 4.65×10−9 −0.023 0.006 2.56×10−4 2.09×10−11 122,886 −0.177 0.051 5.09×10−4 −0.037 0.021 0.077 9.54×10−15 169,507 0.0354
rs7498702 16p RBFOX1
(intron)
C/T 0.29 −0.166 0.050 8.94×10−4 −0.026 0.006 1.17×10−5 4.73×10−8 122,886 −0.006 0.048 0.907 −0.005 0.018 0.777 1.89×10−6 169,507 0.0134
Neuroticism
rs6981523 8p XKR6
(intergenic)
T/C 0.50 0.250 0.042 2.68×10−9 0.022 0.006 1.01×10−4 4.25×10−12 122,867 0.138 0.042 1.05×10−3 0.032 0.018 0.070 0.098 0.015 1.04×10−10 3.17×10−24 260,861 0.0395
rs9611519 22q L3MBTL2
(exon)
CHADL
(intron)
T/C 0.31 0.235 0.046 4.05×10−7 0.020 0.007 0.003 1.87×10−8 122,867 0.002 0.047 0.966 −0.002 0.023 0.931 0.053 0.017 0.0015 9.16×10−9 260,861 0.0127

Chr: chromosome; A1: effect allele; A2: non-effect allele; Frq: allele frequency of A1; β: linear regression association coefficient; SE: standard error; N: sample size. β and SE may have varying scales in different cohorts; thus sample-based meta-analyses were used.

*

SNP in non-protein coding region.

The sample sizes of GPC1 and GPC2 are 17,375 and 63,661, respectively.

Due to absence of rs9611519 in the UK Biobank data, a proxy SNP (rs2273085, LD r2 = 0.99) was used.

Figure 1. Manhattan plots for personality traits in the combined sample of 23andMe and GPC data (discovery/stage1 sample).

Figure 1

Sample size: Agreeableness: N=76,551; conscientiousness: N=76,551; extraversion: N=122,886; neuroticism: N=122,867; openness: N=76,581. Number of SNPs: Agreeableness: N=2,165,398; conscientiousness: N=2,166,809; extraversion: N=6,343,667; neuroticism: N=6,337,541; openness: N=2,167,320.

To evaluate the consistency of association signals between 23andMe and GPC samples, we conducted genome-wide polygenic analyses using LD Score regression to examine genetic correlations (rg)9 of personality traits between the two samples. The estimated rg were highly significant (rg=0.86–0.96), suggesting that genetic effects are consistent and replicable between the samples at a polygenic level (Supplementary Fig. 1), and that a considerable number of SNPs below the GWAS significance threshold contain trait-associated genetic effects.

To assess replicability of the eight significant SNPs identified in the discovery/stage1 sample, we obtained their summary statistics from three independent samples, including an independent 23andMe replication sample, UK Biobank cohort (only neuroticism) and an Icelandic sample from deCODE Genetics (Online Methods and Table 1). In the final combined meta-analysis, six SNPs remained GWAS significant. The other two fell just below GWAS significance but had consistent direction of effects in all samples, suggesting that these may be significant in larger samples. Overall, the directions of effects were consistent for all eight SNPs between the discovery and replication tests, except two SNPs in the smaller (N=7,137) deCODE sample.

The strongest associations were detected for neuroticism within a subregion of 8p23.1, which spans ~4 Mb (Chr8: 8,091,701–11,835,712) with highly correlated SNPs in one big LD block (Fig. 2a). The 8p23.1 region comprises genes related to innate immunity and the nervous system, and is considered as a potential hub for cancer and developmental neuropsychiatric disorders10. Our conditional analysis indicated the existence of multiple associations (conditional P~10−7) independent of the top SNP within the 8p23.1 locus but these were not GWAS significant.

Figure 2. Regional association plot.

Figure 2

The figure shows the distribution of -log10(p-value) of SNPs on chromosome 8p of the significant SNPs for neuroticism (a) and extraversion (b) in the combined discovery analysis. These two SNPs (LD r2=0.5 in LDlink) have opposite signs of β's in GWAS results of neuroticism and extraversion. The opposite signals might be attributable to negative phenotypic association between neuroticism and extraversion. Regional plots with detailed annotation information for significant SNPs are also shown in Supplementary Fig. 4.

The UK Biobank studies also identified multiple associations with neuroticism in 8p23.12,3, which were attributed to an inversion polymorphism2. Our association signals reside in the same inversion region, with an LD of r2=0.35 (LDlink) between the lead SNP found here and in the UK Biobank study3. Additionally, we identified an intronic variant of MTMR9 within 8p23.1 that was associated with extraversion, with opposite direction of association with neuroticism (Fig.2b). Together, these findings provide converging evidence for the association of 8p23.1 with personality.

For extraversion, we found a significant locus on 12q23.3 within WSCD2. This locus has been implicated in a GWAS of temperament in bipolar disorder11, and linkage analysis12, suggesting that 12q likely harbors important alleles for temperament and personality. Another SNP significantly associated with extraversion is near PCDH15, a member of the cadherin superfamily important for calcium-dependent cell-cell adhesion.

All six SNPs discovered here reside in loci for which genome-wide significant associations with other phenotypes have been reported (NHGRI GWAS catalog). For example, we found a variant associated with neuroticism in L3MBTL2, a gene reported to be associated with schizophrenia13. Etiologically, neuroticism has been associated with schizophrenia risk14. Further, one gene in which we found a variant associated with extraversion, MTMR9 has been related to response to antipsychotic medications15. The SNP associated with conscientiousness in the discovery sample, though not significant in the final meta-analysis, was located in a locus linked to educational attainment16, and high conscientiousness was found to correlate positively with academic performance17.

These six SNPs have been found to be significantly associated with gene expression and all are listed as expression quantitative trait loci (eQTL) for brain tissues, to varying degrees (Supplementary Table 1). We performed a Bayesian test18 to examine whether GWAS signals co-localize with eQTL. The COLOC-estimated posterior probabilities18 (see Online Methods) indicated that one SNP-associated locus (rs57590327) and its corresponding eQTL (Supplementary Table 1) were probably attributable to a common causal variant (posterior probability=0.76). Another SNP (rs216273) showed evidence of independence with eQTL (posterior probability =0.75). For the rest of the SNPs, the posterior probability ranged between 0 and 0.45, failing to support any of the specified hypotheses. Our analyses did not show consistent evidence for these SNPs influencing personality traits through gene expression levels in the brain, but caution is warranted owing to the small eQTL sample (N=134).

Beyond identifying single genetic variants that each account for very little phenotypic variance, we estimated SNP-based heritability of the traits. All heritability estimates were significant in our 23andMe discovery sample, with the largest estimate for extraversion (0.18) (Supplementary Table 2). These findings extend those from a previous heritability analysis of FFM traits (N=5,011), in which SNP-based heritability estimates were significant for neuroticism and openness19. As expected, SNP-based heritability estimates were lower than those reported in family studies1.

Relationships among personality traits are also of interest. Although the FFM traits were derived through factor analysis and thus orthogonal in the original findings, most studies observe some degree of phenotypic correlation between traits19. Using 23andMe data, we found that neuroticism was inversely related to the other personality traits, whereas agreeableness, conscientiousness, extroversion, and openness were positively correlated. Almost all phenotypic correlations were highly significant, except for openness vs. conscientiousness (Supplementary Table 3). Genetic correlation patterns were congruent with phenotypic correlations but the association was more apparent in genetic structure, reflecting clear shared genetic factors contributing to the correlations (Fig. 3a).

Figure 3. Genetic correlations between personality traits (23andMe sample) and psychiatric disorders.

Figure 3

(a) The heat map illustrates genetic correlations between phenotypes. The values in the color squares correspond to genetic correlations. Asterisks denote genetic correlations significantly different from zero: * P<0.05; ** P<0.00091 (Bonferroni correction threshold). (b) The loading plot shows loadings of the personality traits and psychiatric disorders on the first two principal components derived from the genetic correlation matrix on the left. A small angle between arrows indicates a high correlation between variables and arrows pointing to opposite directions indicate a negative correlation in the space of the two principal components.

A notable feature of personality is its link with a wide range of social, mental and physical health outcomes5. High levels of neuroticism, extraversion and openness have been associated with bipolar disorder20, and high neuroticism with major depression and anxiety21. Low agreeableness has been associated with narcissism, Machiavellianism and psychopathy22. In addition to phenotypic relationships, twin and GWAS studies have demonstrated genetic correlations between personality traits and psychiatric disorders3,21,23, though most focus on only neuroticism (Supplementary Note for details).

We thus sought to quantify the genetic correlations between the five personality traits and six psychiatric disorders from the Psychiatric Genomics Consortium: schizophrenia (N=17,115), bipolar disorder (N=16,731), major depressive disorder (N=18,759), ADHD (N=5,422) and autism spectrum disorder (N=10,263), and from Genetic Consortium for Anorexia Nervosa (N=17,767) (see Online Methods and Supplementary Table 2). A pair-wise genetic correlation matrix (11×11) was constructed, which revealed several significant correlations (Fig. 3a, Supplementary Table 4). For example, neuroticism was highly correlated with depression, and extraversion with ADHD. To complement genetic correlation estimation via LD Score regression9, we compared the pattern of GWAS results by assessing whether signs of genetic effects were concordant between the top associations among these traits and disorders. The results of the sign tests of directional effects closely matched the genetic correlations (Supplementary Fig. 2).

Given the moderate and high genetic correlations, we subsequently conducted a principal component analysis (PCA) to extract principal components of genetic variation (Fig. 3b). We projected all phenotypes onto a two dimensional space spanned by the top two principal components (PC1 and PC2) of genetic variation. This loading plot summarizes the genetic relationships between personality traits and psychiatric disorders. The analysis integrates genomic information with traditionally defined phenotypes to better understand basic dimensions of the full range of human behavior, from typical to pathological, in line with the research strategy of the Research Domain Criteria (RDoC)24.

Our results indicate that openness, bipolar disorder, and schizophrenia cluster in the first quadrant (Fig. 3b). Interestingly, all three share phenotypic commonality in that they have been linked to heightened creativity and dopamine activity25,26. Most personality traits (conscientiousness, agreeableness and extraversion) cluster in the second quadrant. Neuroticism and depression are in the fourth quadrant. Autism and anorexia nervosa are captured by factors in higher dimensions and have relatively low loadings on the first two components, as indicated by short arrows on these two dimensions. Notably, ADHD has a high genetic correlation with extraversion and low correlations with other psychiatric disorders (except bipolar disorder), as also shown in hierarchical clustering analysis in which ADHD clustered with personality traits rather than psychiatric disorders (Supplementary Fig. 3). This may indicate that ADHD, or some ADHD subtypes, represent a variant of extraversion personality trait. Of note, our ADHD data consists of cases ranging in age from 5–19 years old. Phenotypically, positive emotionality has been linked with a subgroup of children with ADHD27. Future genetic studies considering ADHD heterogeneity (e.g., subtypes, child/adult ADHD) may help characterize its diverse etiologies and relationships with personality traits.

Overall, we observed a systematic pattern with all psychiatric disorders showing positive loadings on PC1, and agreeableness and conscientiousness with negative loadings. A combination of low agreeableness and low conscientiousness is thought to reflect Eysenk’s psychoticism personality4. PC2 is closely aligned with extraversion-introversion which has been associated with externalizing/internalizing traits and activation/inhibition28,29. Internalizing traits (e.g., neuroticism, depression, anxiety and withdrawal)21 have negative loadings on PC2. Externalizing traits are predicted by high extraversion, low agreeableness and low conscientiousness29.

These findings provide additional support for shared genetic influences between personality traits and psychiatric disorders3,21,23 and for the notion that personality traits and psychiatric disorders exist on a continuum in phenotypic and genomic space5,11. Maladaptive or extreme variants of personality may contribute to the persistence of, or vulnerability to, psychiatric disorders and comorbidity5,11,21,23. Further genomic research in which categorical disease entities are viewed as variants of quantitative dimensions in a polygenic framework may help elucidate this issue30.

Caveats of this study include that the sample size, while large, may still be underpowered to detect the majority of associated SNPs, given the conservative GWAS significance threshold. Because we used only summary statistics of GWAS, we cannot estimate non-additive genetic variance such as dominance and epistasis, and genetic contribution from structural (e.g., inversions) and rare variants. Additionally, genetic correlations indicate the degree of shared genetic influences across traits at the genome-wide level, but other studies using different methods are needed to identify specific pleiotropic variants underlying the observed correlations.

In summary, by studying all FFM traits we found six replicable genetic variants associated with personality, five of which are novel and one replicates a recently published finding2,3. We also observed that personality traits are correlated at the genetic level, with neuroticism showing an inverse association with the other traits. Other novel aspects of this study include description of the genetic correlations among five personality traits and six psychiatric disorders, and depiction of their relationships through principal component analysis. Personality traits are likely influenced by many gene variants and by gene-environment interactions. We are only in the beginning of understanding the genetics of personality and their relation to psychiatric disorders. The overall effort promises to have great relevance to public health.

Online Methods

23andMe sample

The GWAS summary statistics were obtained from a subset of 23andMe participants. 23andMe uses a survey design to collect a number of phenotypes including the personality traits reported here, and the sample has been described previously for other phenotypes31,32. We included only those participants (N=59,225) who showed >97% European ancestry as determined by analyzing local ancestry and comparing to three HapMap 2 populations33. Relatedness between participants was examined by a segmental identity-by-descent (IBD) method34 to ensure that only unrelated individuals (sharing less than 700 cM IBD) were included in the sample. All participants included in the analyses provided informed consent and answered surveys online according to a human subject research protocol, which was reviewed and approved by Ethical & Independent Review Services, an AAHRPP-accredited private institutional review board (http://www.eandireview.com).

Additionally, we obtained independent replication results of GWAS from 23andMe replication sample. This sample included ~39,500 participants (N=39,452 for conscientiousness, 39,484 for extraversion and 39,488 for neuroticism) who met the same inclusion criteria as described above.

Genetics of Personality Consortium (GPC) sample

Genetics of Personality Consortium (GPC) is a large collaboration of GWAS for personality. Summary statistics of the PGC data used in the current study included the first meta-analysis of GWAS (GPC-1)6 for three traits (agreeableness, conscientiousness and openness) and the second meta-analysis of GWAS (GPC-2) for neuroticism7,35,36 and extraversion. The results of 10 discovery cohorts for GPC-1 and of 29 discovery cohorts for GPC-2 are available in the public domain, which respectively consist of 17,375 and 63,661 participants with European ancestry across Europe, Australia and United States. These studies were performed with oversight from local ethic committees, and all participants provided informed consent6,7,35,36.

UK Biobank sample

UK (United Kingdom) Biobank is a large prospective cohort of more than 502,000 participants (aged 40–69 years)3 with genetic data and a wide range of phenotypic data including social, cognitive, personality (neuroticism trait), lifestyle, and physical health measures collected at baseline. We used a subsample of this cohort for neuroticism replication. Exclusion criteria included UK Biobank genomic analysis exclusions, relatedness, gender mismatch, non-white UK ancestry and failure of quality control of UK BiLEVE genotyping3, resulting in a sample of 91,370 individuals. Association analysis was conducted using linear regression under a model of additive allelic effects with sex, age, array and the first eight PCs as covariates3. Informed consent was obtained from all participants and the study was approved by National Health Service National Research Ethics Service3.

deCODE sample

Icelandic participants (N=7,137 for extraversion, 7,136 for neuroticism and 7,129 for conscientiousness) were enrolled in various ongoing deCODE studies administering the NEO-FFI measure of the Big Five personality traits37,38. All deCODE studies were approved by the appropriate bioethics and data protection authorities and all participating subjects donating blood signed informed consent forms. The personal identities of participants from whom phenotype information and biological samples were obtained were encrypted by a third-party system overseen by the Icelandic Data Protection Authority39. A generalized form of linear regression that accounts for relatedness between individuals was used to test the correlation between normalized NEO-FFI trait scores and genotypes.

Personality assessment

In the 23andMe sample, individuals completed a web-based implementation of the Big Five Inventory (BFI)40,41, with 44 questions. Scores for agreeableness, conscientiousness, extraversion, neuroticism, and openness were computed using 8 to10 items per factor40.

In GPC-1, scores of personality traits were based on the 60 item NEO Five-Factor Inventory (NEO-FFI) with 12 items per factor6,37. In GPC-2, harmonization of measures for neuroticism and extraversion across 9 inventories and 29 cohorts were performed by applying Item Response Theory (IRT) to avoid personality scores being influenced by the number of items and the specific inventory. Because the personality measures were not assessed similarly across GPC-2 cohorts, the harmonized/calibrated scores of personality are more comparable, thereby increasing power for meta-analysis of GWAS using fixed-effect models7,35,36. As described in the main text, high genetic correlations between 23andMe and GPC samples were found, suggesting a highly consistent pattern of associations despite the discrepancy in questionnaires (Supplementary Fig. 1).

In the UK Biobank sample, neuroticism was scored between 0 to 12 using the 12 items of the Eysenck Personality Questionnaire-Revised Short Form (EPQ-R-S)42 with high reliability and concurrent validity42.

In the deCODE sample, NEO-FFI personality trait scores37,38 were adjusted for sex and age at measurement and were then normalized to a standard normal distribution using quantile normalization.

Distributions and correlations for personality scores in the 23andMe sample

Quantile-quantile (QQ) plots of covariate-adjusted personality scores to examine normality are shown in Supplementary Fig. 5. The distributions at the top tail deviates from normality due to the limited range of the scores and those at the bottom tail deviate due to the limited range (for neuroticism and extraversion) and/or extreme values. This violation of the normality assumption can be influential for genetic variants with very low minor allele frequencies (e.g., rare variants)43. However, this did not affect our results because our GWAS and LD Score regression9 only include common variants.

Pearson correlations, unadjusted and after adjusting for the covariates (age, sex, top five principal components for population structure correction44), were used to assess phenotypic correlations among the five traits (Supplementary Table 3).

Genotyping and imputation

In the 23andMe sample, DNA extraction and genotyping were performed on saliva samples by National Genetics Institute (NGI), a CLIA licensed clinical laboratory and a subsidiary of Laboratory Corporation of America. Samples have been genotyped on one of four genotyping platforms. The V1 and V2 platforms were variants of the Illumina HumanHap550+ BeadChip, including about 25,000 custom SNPs selected by 23andMe, with a total of about 560,000 SNPs. The V3 platform was based on the Illumina OmniExpress+ BeadChip, with custom content to improve the overlap with 23andMe's V2 array, with a total of about 950,000 SNPs. The 23andMe's V4 platform in current use is a fully custom array, including a lower redundancy subset of V2 and V3 SNPs with additional coverage of lower-frequency coding variation, and about 570,000 SNPs. Samples that failed to reach 98.5% call rate were re-analyzed. As part of 23andMe standard practice, individuals whose analyses failed repeatedly were re-contacted and asked to provide a new sample.

23andMe participant genotype data were imputed using the 1000 Genomes Project phase 1 version 3 reference panel45. The phasing and imputation for each genotyping platform were separated. First, chromosomal segments of no more than 10,000 genotyped SNPs, with overlaps of 200 SNPs, were phased using Beagle (version 3.3.1)46. Then, each phased segment was imputed against all-ethnicity 1000 Genomes Project haplotypes (excluding monomorphic and singleton sites) using a high-performance version of Minimac47 for 5 rounds and 200 states to estimate parameters. SNPs were filtered by procedures including Hardy-Weinberg equilibrium P<10−20 (stringent threshold for large sample size), call rate<95% and allele frequencies apparently different from European 1000 Genomes Project reference data. A total of 13,341,935 SNPs was retained after filtering and excluding chromosome X, Y and mitochondria. We focus on autosomal SNPs, which are available for 23andMe, GPC and UK Biobank samples.

Genotyping in cohorts of GPC-16 and GPC-27,35 was conducted on Illumina or Affymetrix platforms. Quality control of genotype data was examined in each cohort independently, including checks for European ancestry, sex inconsistencies, Mendelian errors, high genome-wide homozygosity, relatedness, minor allele frequencies (MAF), SNP call rate, sample call rate and Hardy-Weinberg equilibrium6,7,35,36. Genotype data of GPC-1 were then imputed using HapMap phase II CEU as a reference panel including ~2.5M SNPs6 and, alternatively, a reference panel from 1000 Genomes Project phase 1 version 3 was used to impute the genotype data of GPC-27,35,36. Poorly imputed SNPs (r2<0.3 or proper_info<0.36 or 0.47,35) and low MAF (<0.016 or 5/N7,35) were excluded in the meta-analyses, resulting in a total number of 1.1–6.6 million SNPs7,35 across cohorts.

In the UK Biobank first release genetic data of 152,729 participants (June 2015), about two thirds of the sample was genotyped using Affymetrix UK Biobank Axiom array (Santa Clara, CA, USA) and the remaining were genotyped using the Affymetrix UK BiLEVE Axiom array3. Outlier, multi-allelic and low-MAF (<1%) SNPs were excluded from phasing and imputation procedures. The reference panel of imputation was based on the 1000 Genomes Phase 3 and UK10K haplotype panels3. Further quality control procedures were applied after imputation, yielding a total of 8,268,322 SNPs for further analyses3.

Genotyping, imputation methods and the association analysis method used in the deCODE sample are previously described48. A total of 676,913 autosomal SNPs were typed using Illumina SNP chips48. SNPs with low MAF (<0.1%) and low imputation information (<0.8) were excluded and 99.5% of SNPs remained after imputation.

Genome-wide association analysis

Association tests were performed by regressing personality traits on imputed dosages of SNPs in the 23andMe sample. Age, sex, and the top five principal components (PCs)44 for population structure correction were included as covariates and p-values were computed using likelihood ratio tests. For all five personality traits, the correlation structure of SNPs was determined by an LD matrix of 9,270,523 autosomal SNPs generated from European reference sample in 1000 Genomes Project phase 1 v3 within 1,000,000 base pairs (1 Mb)49,50 using Plink 1.0751. The original 13,341,935 SNPs were reduced into 9,270,523 SNPs in our subsequent analyses (e.g., LD correlation structure is used to determine LD-independent SNPs). All SNPs' positions were mapped to Genome Reference Consortium Human Build 37 (GRCh37) and UCSC Genome Browser on Human hg19 assembly. We made QQ plots with GWAS summary statistics of the 23andMe sample. The QQ plots lie along the expected null line for large p-values (P>10−3), indicating that the GWAS results are not inflated by population stratification or cryptic relatedness. This pattern is consistent with the genomic inflation factors (λ)52 close to 1, as shown in Supplementary Fig. 6.

In each cohort of GPC-16 and GPC-27,35, linear regressions with covariates of sex, age and PCs were conducted for association tests using dosage data. The meta-analyses of GWAS results of cohorts for GPC-1 and GPC-2 were performed by the inverse-variance method using METAL53 released on the GPC website (see URLs). Given improved power for detection of genetic effects with larger sample sizes in GWAS, we performed a combined meta-analysis of 23andMe and GPC samples using METAL53 based on the sample-size based method. SNPs available in one cohort only were excluded. The totals of 2,305,461, 2,305,682 and 2,305,640 SNPs were available for traits of agreeableness, conscientiousness and openness (respectively) in GPC-1, as well as 6,941,603 SNPs for extraversion and 6,949,614 SNPs for neuroticism in GPC-2. Genomic inflation factors (λ) are 1.01, 1.01, 1.03, 1.02 and 1.02 for agreeableness, conscientiousness, extraversion, neuroticism and openness, respectively.

Meta-analysis of 23andMe and GPC samples

Given improved power for detection of genetic effects with larger sample sizes in GWAS, we performed a combined meta-analysis of 23andMe and GPC samples using METAL53 based on the sample-size based method. To assess the quality of meta-analysis, SNPs with heterogeneity p-values<0.05 were excluded. Eight significant LD-independent SNPs were identified after removing correlated SNPs at LD r2>0.05 that are within 1 Mb of the top SNP. In Table 1, the percentage of variance explained by each SNP is calculated using equation: (z2/(nk−1+z2))×100, where z is the z value for each SNP controlling for covariates, n is the sample size for each SNP and k is the number of covariates in the regression model (k=7 for age, sex, and top five PCs)54,55.

Conditional analysis within 1 Mb region of significant SNPs

We performed a conditional analysis56 within the 1 Mb genomic region of each of the six LD-independent SNPs. In our study, we used 1000 Genomes Project reference panel of European ancestry to estimate LD correlations (r2) and excluded SNPs correlated at LD r2>0.9 with the top associated SNP within 1 Mb window. We did not detect additional significant SNPs conditional on the top SNPs under the stringent GWAS threshold. However, for the significant loci in 8p, several SNPs still showed substantial association signals (P~10−7) conditioning on the top SNPs, rs6981523 or rs2164273.

Regional association and annotation plot

The regional plot of chromosome 8p (Fig. 2) was constructed by a web-interface tool, LocusZoom57. In Fig. 2a and 2b, the most significant SNPs (rs6981523 and rs2164273) are shown in purple, otherwise the colors of the circles denote their correlations (LD r2) with the top SNP. The bottom panel displays gene symbol and location within the region derived from UCSC Genome Browser on Human hg19 assembly. The regional and annotation plots for other significant SNPs are also shown in Supplementary Fig. 4.

Genetic correlation analysis

We used the LD Score regression method to examine the pattern of genetic correlations (rg)9,58 across personality traits within/between 23andMe and GPC samples (Fig. 3a, Supplementary Fig. 1 and Supplementary Table 4) based on GWAS summary statistics. The LD Score for each SNP measures the amount of pair-wise LD r2 with other SNPs within 1-cM windows from 1000 Genomes Project reference panel of European ancestry. All SNPs were filtered by LD Score regression built-in procedures, including INFO > 0.9 and MAF > 0.1, and merged to SNPs in HapMap 3 reference panel. Approximately 0.8–1.1 million SNPs (Supplementary Table 2) were retained to estimate genetic correlations.

We also examined genetic correlations among the five traits, which have been estimated previously using a twin design59,60, and unrelated individuals' SNP data from a relatively smaller sample, in which many estimates did not converge19. Our LD Score regression analysis based on a large sample provided additional contribution to this effort.

We further quantified genetic correlations between personality traits and psychiatric disorders, including schizophrenia61, bipolar disorder62, major depressive disorder63, ADHD61, autism spectrum disorder61 and anorexia nervosa64.

Query for eQTL Database

We queried eQTL evidence for our significant SNPs from Braineac65,66 (the Brain eQTL Almanac). The results are listed in Supplementary Table 1. We display the brain region with the lowest p-value among all 10 regions. To check the rank of eQTL p-values of six LD-independent SNPs in the Braineac database, we randomly selected 50,000 SNPs and queried the database to extract the lowest p-value for each SNP, resulting in a total of 36,190 SNPs with eQTL results. In order to match allele frequency and distances to transcription start site (TSS) with the significant SNPs, the randomly selected SNPs were stratified into four groups: (1) within transcript, (2) downstream 0–200 kilobase pairs (kb), (3) upstream 0–200 kb and (4) upstream 200–400 kb. SNPs that fell outside these ranges were removed. The SNPs in the ‘within transcript’ group were further stratified into three subgroups according to allele frequency. This procedure resulted in six distributions of eQTL p-values that matched the significant SNPs in terms of allele frequency and TSS, and these were used to determine the ranking of eQTL associations (see Supplementary Table 1 & 5). Two SNPs are ranked high for their significance as eQTL compared to randomly sampled eQTL markers with matched allele frequencies and distance to TTS from the Braineac database (top 10–20% ranking: rs6981523 and top 20–30% ranking: rs9611519; see Supplementary Table 5).

Colocalisation analysis between GWAS and eQTL

To investigate whether GWAS significant SNPs and their eQTL are colocalised with a shared candidate causal variant, we performed a colocalisation analysis, COLOC, that use Bayesian posterior probability to assess colocalisation18. The SNP-associated locus was defined as within a 1 Mb window18 for each of the six SNPs (Table 1). The prior probabilities that the locus is associated with only trait 1 (i.e., personality traits), only trait 2 (i.e., eQTL) and both are respectively 10−5, 10−4 and 10−6. The posterior probabilities (PP0, PP1, PP2, PP3 and PP4) for five hypotheses (H0: no association with either trait; H1: association with trait 1, not with trait 2; H2: association with trait 2, not with trait 1; H3: independent association with two traits, two independent SNPs; H4: association with both traits, one shared SNP)18 were calculated to determine which hypothesis is supported by the data. A limitation of this analysis is the potentially low power in the small eQTL sample (N=134).

SNP concordant test for the top GWAS signals

To investigate concordance of SNP effects between personality traits and psychiatric disorders, we followed a similar procedure described previously67,68 by counting the number of same direction effect sizes for the LD-independent top SNPs (P<10−4) in the pairwise phenotypes data and calculated the proportion of the same direction effects in the total number of LD-independent top SNPs. The one-sided p-value for the proportion of each pairwise phenotype was computed using a binomial test to examine the deviation from 0.5 for the proportion. In Supplementary Fig. 2, a heat map of the proportions of the same direction effect for pairwise phenotypes shows a similar pattern with a heat map of genetic correlations in Fig 3a.

Hierarchical clustering analysis

We performed hierarchical clustering analysis using dissimilarity measures (1-genetic correlation) implemented in hclust function of R to investigate and display relationships between personality traits and psychiatric disorders. Based on genetic correlations, the more highly correlated phenotypes were grouped in the same clusters and displayed by a dendrogram (Supplementary Fig. 3), showing an agreement with classifications of the loading plot (Fig. 3b).

Supplementary Material

Supplementary Figure
Supplementary Table

Acknowledgments

We would like to thank the customers, research participants, and employees of 23andMe for making this work possible. This project was funded by National Institute of Mental Health R01MH100351 (M.-T. Lo, N. Sanyal, C.-H. Chen), NARSAD Young Investigator award (C.-H. Chen), South-East Norway Regional Health Authority (2016-064) (O.B. Smeland), and Research Council of Norway through a FRIPRO Mobility Grant, contract no. 251134 (Y. Wang). The FRIPRO Mobility grant scheme (FRICON) is co-funded by the European Union’s Seventh Framework Programme for research, technological development and demonstration under Marie Curie grant agreement no. 608695. D.J. Smith is a Lister Institute Prize Fellow. The research leading to deCODE results was supported in part by NIH (NIDA) (R01-DA017932 and R01-DA034076) and the Innovative Medicines Initiative Joint Undertaking under grant agreement no. 115008 of which resources are composed of EFPIA in-kind contribution and financial contribution from the European Union's Seventh Framework Programme (FP7/2007-2013) and EU funded FP7-People-2011-IAPP grant PsychDPC (GA 28613) (H. Stefansson, G. Bjornsdottir, T.E. Thorgeirsson and K. Stefansson).

Footnotes

URLs.

LDlink, http://analysistools.nci.nih.gov/LDlink/?tab=ldpair;GPC-1and GPC-2 summary statistics, http://www.tweelingenregister.org/GPC/; LocusZoom, http://locuszoom.sph.umich.edu/locuszoom/; The Bain eQTL Almanac (Braineac), http://www.braineac.org/; Psychiatric Genomics Consortium (PGC) summary statistics (schizophrenia, bipolar disorder, major depressive disorder, ADHD, autism spectrum disorder and anorexia nervosa), https://www.med.unc.edu/pgc/results-and-downloads; LD score regression, https://github.com/bulik/ldsc; GCTA-COJO (Genome-wide Complex Trait Analysis - Conditional and Joint Genome-wide Association Analysis), http://cnsgenomics.com/software/gcta/cojo.html; METAL (Meta-analysis of Genome-wide Association Scans), http://csg.sph.umich.edu//abecasis/metal/; PLINK 1.9, https://www.cog-genomics.org/plink2.

Data Availability Statement

The top 10K SNPs for five personality traits from the 23andMe discovery data set are available in Supplementary Data Sets 1–5. The full GWAS summary statistics for the 23andMe discovery data set will be made available through 23andMe to qualified researchers under an agreement with 23andMe that protects the privacy of the 23andMe participants. Please contact David Hinds (dhinds23andme.com) for more information and to apply for data access.

Author Contributions

C.-H.C., M.-T.L. and O.A.A. designed the study. M.-T.L. and C.-H.C. analysed data and wrote the manuscript. D.A.H. and J.Y.T. analysed the 23andMe data. V.E.-P., D.J.S. and M.O'D. analysed the UK Biobank data. H.S., G.B., T.E.T and K.S. analysed the deCODE data. C.F., C.-C.F., Y.W., O.B.S., A.S. D.H., K.K., N.S., L.K.M., A.M.D. and O.A.A. contributed to manuscript preparation. All authors commented on and approved the manuscript.

Competing Financial Interests Statement

H.S., T.E.T., G.B. and K.S. are employees of deCODE Genetics/Amgen. D.A.H. and J.Y.T. are employees of 23andMe, Inc. The remaining authors declare no competing financial interests.

References

  • 1.Vukasovic T, Bratko D. Heritability of personality: A meta-analysis of behavior genetic studies. Psychol Bull. 2015;141:769–785. doi: 10.1037/bul0000017. [DOI] [PubMed] [Google Scholar]
  • 2.Okbay A, et al. Genetic variants associated with subjective well-being, depressive symptoms, and neuroticism identified through genome-wide analyses. Nat Genet. 2016;48:624–633. doi: 10.1038/ng.3552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Smith DJ, et al. Genome-wide analysis of over 106 000 individuals identifies 9 neuroticism-associated loci. Mol Psychiatry. 2016;21:749–757. doi: 10.1038/mp.2016.177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Goldberg LR. The structure of phenotypic personality traits. Am Psychol. 1993;48:26–34. doi: 10.1037//0003-066x.48.1.26. [DOI] [PubMed] [Google Scholar]
  • 5.Trull TJ, Widiger TA. Dimensional models of personality: the five-factor model and the DSM-5. Dialogues Clin Neurosci. 2013;15:135–146. doi: 10.31887/DCNS.2013.15.2/ttrull. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.de Moor MH, et al. Meta-analysis of genome-wide association studies for personality. Mol Psychiatry. 2012;17:337–349. doi: 10.1038/mp.2010.128. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Genetics of Personality Consortium. et al. Meta-analysis of Genome-wide Association Studies for Neuroticism, and the Polygenic Association With Major Depressive Disorder. JAMA Psychiatry. 2015;72:642–650. doi: 10.1001/jamapsychiatry.2015.0554. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Kim HN, et al. Genome-wide association study of the five-factor model of personality in young Korean women. Journal of Human Genetics. 2013;58:667–674. doi: 10.1038/jhg.2013.75. [DOI] [PubMed] [Google Scholar]
  • 9.Bulik-Sullivan BK, et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet. 2015;47:291–295. doi: 10.1038/ng.3211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Tabares-Seisdedos R, Rubenstein JLR. Chromosome 8p as a potential hub for developmental neuropsychiatric disorders: implications for schizophrenia, autism and cancer. Molecular Psychiatry. 2009;14:563–589. doi: 10.1038/mp.2009.2. [DOI] [PubMed] [Google Scholar]
  • 11.Greenwood TA, Akiskal HS, Akiskal KK, Study BG, Kelsoe JR. Genome-Wide Association Study of Temperament in Bipolar Disorder Reveals Significant Associations with Three Novel Loci. Biological Psychiatry. 2012;72:303–310. doi: 10.1016/j.biopsych.2012.01.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Green E, et al. Localization of bipolar susceptibility locus by molecular genetic analysis of the chromosome 12q23-q24 region in two pedigrees with bipolar disorder and Darier's disease. Am J Psychiatry. 2005;162:35–42. doi: 10.1176/appi.ajp.162.1.35. [DOI] [PubMed] [Google Scholar]
  • 13.Schizophrenia Working Group of the Psychiatric Genomics Consortium. Biological insights from 108 schizophrenia-associated genetic loci. Nature. 2014;511:421–427. doi: 10.1038/nature13595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Van Os J, Jones PB. Neuroticism as a risk factor for schizophrenia. Psychological Medicine. 2001;31:1129–1134. doi: 10.1017/s0033291701004044. [DOI] [PubMed] [Google Scholar]
  • 15.Aberg K, et al. Genomewide association study of movement-related adverse antipsychotic effects. Biol Psychiatry. 2010;67:279–282. doi: 10.1016/j.biopsych.2009.08.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Rietveld CA, et al. Common genetic variants associated with cognitive performance identified using the proxy-phenotype method. Proceedings of the National Academy of Sciences of the United States of America. 2014;111:13790–13794. doi: 10.1073/pnas.1404623111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Poropat AE. A meta-analysis of adult-rated child personality and academic performance in primary education. British Journal of Educational Psychology. 2014;84:239–252. doi: 10.1111/bjep.12019. [DOI] [PubMed] [Google Scholar]
  • 18.Giambartolomei C, et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 2014;10:e1004383. doi: 10.1371/journal.pgen.1004383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Power RA, Pluess M. Heritability estimates of the Big Five personality traits based on common genetic variants. Transl Psychiatry. 2015;5:e604. doi: 10.1038/tp.2015.96. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Barnett JH, et al. Personality and bipolar disorder: dissecting state and trait associations between mood and personality. Psychological Medicine. 2011;41:1593–1604. doi: 10.1017/S0033291710002333. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Hettema JM, Neale MC, Myers JM, Prescott CA, Kendler KS. A population-based twin study of the relationship between neuroticism and internalizing disorders. American Journal of Psychiatry. 2006;163:857–864. doi: 10.1176/ajp.2006.163.5.857. [DOI] [PubMed] [Google Scholar]
  • 22.Jakobwitz S, Egan V. The dark triad and normal personality traits. Personality and Individual Differences. 2006;40:331–339. [Google Scholar]
  • 23.Kendler KS, Myers J. The genetic and environmental relationship between major depression and the five-factor model of personality. Psychological Medicine. 2010;40:801–806. doi: 10.1017/S0033291709991140. [DOI] [PubMed] [Google Scholar]
  • 24.Insel T, et al. Research Domain Criteria (RDoC): Toward a New Classification Framework for Research on Mental Disorders. American Journal of Psychiatry. 2010;167:748–751. doi: 10.1176/appi.ajp.2010.09091379. [DOI] [PubMed] [Google Scholar]
  • 25.Deyoung CG. The neuromodulator of exploration: A unifying theory of the role of dopamine in personality. Front Hum Neurosci. 2013;7:762. doi: 10.3389/fnhum.2013.00762. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Power RA, et al. Polygenic risk scores for schizophrenia and bipolar disorder predict creativity. Nat Neurosci. 2015;18:953–955. doi: 10.1038/nn.4040. [DOI] [PubMed] [Google Scholar]
  • 27.Karalunas SL, et al. Subtyping attention-deficit/hyperactivity disorder using temperament dimensions: toward biologically based nosologic criteria. JAMA Psychiatry. 2014;71:1015–1024. doi: 10.1001/jamapsychiatry.2014.763. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
  • 28.Gray JA. The psychophysiological basis of introversion-extraversion. Behav Res Ther. 1970;8:249–266. doi: 10.1016/0005-7967(70)90069-0. [DOI] [PubMed] [Google Scholar]
  • 29.Mezquita L, et al. Five-factor model and internalizing and externalizing syndromes: A 5-year prospective study. Personality and Individual Differences. 2015;79:98–103. [Google Scholar]
  • 30.Plomin R, Haworth CM, Davis OS. Common disorders are quantitative traits. Nat Rev Genet. 2009;10:872–878. doi: 10.1038/nrg2670. [DOI] [PubMed] [Google Scholar]

Methods-only References

  • 31.Hu Y, et al. GWAS of 89,283 individuals identifies genetic variants associated with self-reporting of being a morning person. Nat Commun. 2016;7:10448. doi: 10.1038/ncomms10448. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Pickrell JK, et al. Detection and interpretation of shared genetic influences on 42 human traits. Nat Genet. 2016;48:709–717. doi: 10.1038/ng.3570. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Falush D, Stephens M, Pritchard JK. Inference of population structure using multilocus genotype data: Linked loci and correlated allele frequencies. Genetics. 2003;164:1567–1587. doi: 10.1093/genetics/164.4.1567. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Henn BM, et al. Cryptic distant relatives are common in both isolated and cosmopolitan genetic samples. PLoS One. 2012;7:e34267. doi: 10.1371/journal.pone.0034267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.van den Berg SM, et al. Meta-analysis of Genome-Wide Association Studies for Extraversion: Findings from the Genetics of Personality Consortium. Behav Genet. 2016;46:170–182. doi: 10.1007/s10519-015-9735-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.van den Berg SM, et al. Harmonization of Neuroticism and Extraversion phenotypes across inventories and cohorts in the Genetics of Personality Consortium: an application of Item Response Theory. Behav Genet. 2014;44:295–313. doi: 10.1007/s10519-014-9654-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Costa PT, McCrae RR. Revised NEO personality inventory (NEO PI-RTM) and NEO five-factor inventory (NEO-FFI) : professional manual. Florida: Psychological Assessment Resources; 1992. p. vi.p. 101. [Google Scholar]
  • 38.Bjornsdottir G, et al. Psychometric properties of the Icelandic NEO-FFI in a general population sample compared to a sample recruited for a study on the genetics of addiction. Pers Individ Dif. 2014;58 doi: 10.1016/j.paid.2013.10.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Gulcher JR, Kristjansson K, Gudbjartsson H, Stefansson K. Protection of privacy by third-party encryption in genetic research in Iceland. Eur J Hum Genet. 2000;8:739–742. doi: 10.1038/sj.ejhg.5200530. [DOI] [PubMed] [Google Scholar]
  • 40.John OPDEM, Kentle RL. The Big Five Inventory--Versions 4a and 54. Berkeley, CA: University of California, Berkeley, Institute of Personality and Social Research; 1991. [Google Scholar]
  • 41.Soto CJ, John OP. Ten facet scales for the Big Five Inventory: Convergence with NEO PI-R facets, self-peer agreement, and discriminant validity. Journal of Research in Personality. 2009;43:84–90. [Google Scholar]
  • 42.Eysenck SBG, Eysenck HJ, Barrett P. A Revised Version of the Psychoticism Scale. Personality and Individual Differences. 1985;6:21–29. [Google Scholar]
  • 43.Buzkova P. Linear regression in genetic association studies. PLoS One. 2013;8:e56976. doi: 10.1371/journal.pone.0056976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Lehoucq RB, Sorensen DC, Yang C Society for Industrial and Applied Mathematics. Software, environments, tools 6 1 electronic text. Philadelphia, Pa: Society for Industrial and Applied Mathematics (SIAM, 3600 Market Street, Floor 6, Philadelphia, PA 19104); 1998. ARPACK users' guide solution of large-scale eigenvalue problems with implicitly restarted Arnoldi methods; p. xv.p. 142. [Google Scholar]
  • 45.1000 Genomes Project Consortium et al. A map of human genome variation from population-scale sequencing. Nature. 2010;467:1061–1073. doi: 10.1038/nature09534. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Browning BL, Browning SR. Rapid and accurate haplotype phasing and missing data inference for whole genome association studies using localized haplotype clustering. Genetic Epidemiology. 2007;31:606–606. doi: 10.1086/521987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Howie B, Fuchsberger C, Stephens M, Marchini J, Abecasis GR. Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat Genet. 2012;44:955–959. doi: 10.1038/ng.2354. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Gudbjartsson DF, et al. Large-scale whole-genome sequencing of the Icelandic population. Nat Genet. 2015;47:435–444. doi: 10.1038/ng.3247. [DOI] [PubMed] [Google Scholar]
  • 49.Thompson WK, et al. An Empirical Bayes Mixture Model for Effect Size Distributions in Genome-Wide Association Studies. PLoS Genet. 2015;11:e1005717. doi: 10.1371/journal.pgen.1005717. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Wang Y, et al. Leveraging Genomic Annotations and Pleiotropic Enrichment for Improved Replication Rates in Schizophrenia GWAS. PLoS Genet. 2016;12:e1005803. doi: 10.1371/journal.pgen.1005803. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Purcell S, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Devlin B, Roeder K. Genomic control for association studies. Biometrics. 1999;55:997–1004. doi: 10.1111/j.0006-341x.1999.00997.x. [DOI] [PubMed] [Google Scholar]
  • 53.Willer CJ, Li Y, Abecasis GR. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics. 2010;26:2190–2191. doi: 10.1093/bioinformatics/btq340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Hibar DP, et al. Common genetic variants influence human subcortical brain structures. Nature. 2015;520:224–229. doi: 10.1038/nature14101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.So HC, Li M, Sham PC. Uncovering the total heritability explained by all true susceptibility variants in a genome-wide association study. Genet Epidemiol. 2011;35:447–456. doi: 10.1002/gepi.20593. [DOI] [PubMed] [Google Scholar]
  • 56.Yang J, et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat Genet. 2012;44:369–375. S1–S3. doi: 10.1038/ng.2213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Pruim RJ, et al. LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics. 2010;26:2336–2337. doi: 10.1093/bioinformatics/btq419. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Bulik-Sullivan B, et al. An atlas of genetic correlations across human diseases and traits. Nat Genet. 2015;47:1236–1241. doi: 10.1038/ng.3406. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Distel MA, et al. The five-factor model of personality and borderline personality disorder: a genetic analysis of comorbidity. Biol Psychiatry. 2009;66:1131–1138. doi: 10.1016/j.biopsych.2009.07.017. [DOI] [PubMed] [Google Scholar]
  • 60.Ono Y, et al. Genetic structure of the five-factor model of personality in a Japanese twin population. Keio J Med. 2000;49:152–158. doi: 10.2302/kjm.49.152. [DOI] [PubMed] [Google Scholar]
  • 61.Cross-Disorder Group of the Psychiatric Genomics Consortium. Identification of risk loci with shared effects on five major psychiatric disorders: a genome-wide analysis. Lancet. 2013;381:1371–1379. doi: 10.1016/S0140-6736(12)62129-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Psychiatric GWAS Consortium Bipolar Disorder Working Group. Large-scale genome-wide association analysis of bipolar disorder identifies a new susceptibility locus near ODZ4. Nat Genet. 2011;43:977–983. doi: 10.1038/ng.943. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Major Depressive Disorder Working Group of the Psychiatric GWAS Consortium et al. A mega-analysis of genome-wide association studies for major depressive disorder. Mol Psychiatry. 2013;18:497–511. doi: 10.1038/mp.2012.21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Boraska V, et al. A genome-wide association study of anorexia nervosa. Mol Psychiatry. 2014;19:1085–1094. doi: 10.1038/mp.2013.187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Rosander P, Backstrom M. Personality traits measured at baseline can predict academic performance in upper secondary school three years late. Scandinavian Journal of Psychology. 2014;55:611–618. doi: 10.1111/sjop.12165. [DOI] [PubMed] [Google Scholar]
  • 66.Trabzuni D, et al. Quality control parameters on a large dataset of regionally dissected human control brains for whole genome expression studies. Journal of Neurochemistry. 2011;119:275–282. doi: 10.1111/j.1471-4159.2011.07432.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Franke B, et al. Genetic influences on schizophrenia and subcortical brain volumes: large-scale proof of concept. Nat Neurosci. 2016;19:420–431. doi: 10.1038/nn.4228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Nyholt DR. SECA: SNP effect concordance analysis using genome-wide association summary results. Bioinformatics. 2014;30:2086–2088. doi: 10.1093/bioinformatics/btu171. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Figure
Supplementary Table

RESOURCES