Low-frequency and rare variants may contribute to elucidate the genetics of major depressive disorder

Chenglong Yu; Mauricio Arcos-Burgos; Bernhard T Baune; Volker Arolt; Udo Dannlowski; Ma-Li Wong; Julio Licinio

doi:10.1038/s41398-018-0117-7

. 2018 Mar 27;8:70. doi: 10.1038/s41398-018-0117-7

Low-frequency and rare variants may contribute to elucidate the genetics of major depressive disorder

Chenglong Yu ^1,^2,^3,^✉, Mauricio Arcos-Burgos ⁴, Bernhard T Baune ⁵, Volker Arolt ⁶, Udo Dannlowski ^6,⁷, Ma-Li Wong ^2,^3,⁸, Julio Licinio ^8,^✉

PMCID: PMC5913271 PMID: 29581422

Abstract

Major depressive disorder (MDD) is a common but serious psychiatric disorder with significant levels of morbidity and mortality. Recent genome-wide association studies (GWAS) on common variants increase our understanding of MDD; however, the underlying genetic basis remains largely unknown. Many studies have been proposed to explore the genetics of complex diseases from a viewpoint of the “missing heritability” by considering low-frequency and rare variants, copy-number variations, and other types of genetic variants. Here we developed a novel computational and statistical strategy to investigate the “missing heritability” of MDD. We applied Hamming distance on common, low-frequency, and rare single-nucleotide polymorphism (SNP) sets to measure genetic distance between two individuals, and then built the multi-dimensional scaling (MDS) pictures. Whole-exome genotyping data from a Los Angeles Mexican-American cohort (203 MDD and 196 controls) and a European-ancestry cohort (473 MDD and 497 controls) were examined using our proposed methodology. MDS plots showed very significant separations between MDD cases and healthy controls for low-frequency SNP set (P value < 2.2e−16) and rare SNP set (P value = 7.681e−12). Our results suggested that low-frequency and rare variants may play more significant roles in the genetics of MDD.

Introduction

Major depressive disorder (MDD) is a common mental illness with tremendous medical, economic, and social impact. MDD, as a principal contributor to disease load worldwide, leads to high levels of morbidity and mortality^1–5. One significant avenue for preventing and treating depression lies in uncovering the genetics of this condition^6,7. Despite rapid advances on genome-wide association studies (GWAS)^8–10, little is understood about its fundamental biological basis and much further research needs to be carried out to fully unravel the genetic elements that confer susceptibility to this disorder^11,12.

Many studies have been proposed to explore the genetic causes of complex diseases from a point view of the “missing heritability”^13–16. For example, some genetic effects are not owing to the common single-nucleotide polymorphisms (SNPs) examined in the candidate-gene studies or GWAS, but due to low-frequency and rare variants, copy-number variations, and other types of genetic mutations¹⁷. Actually, GWAS focus on the identification of significant common (minor allele frequency (MAF) ≥ 5%) variants, thus analyses of low-frequency (0.5% ≤ MAF < 5%) and rare (MAF < 0.5%) variants would be promising to elucidate additional disease risk or trait variability¹⁸. Furthermore, using next-generation sequencing, family-based linkage analysis has also provided an important way to understand the role of rare variants in disease etiology^19,20. For example, some family-control studies applied Hamming distance to identify disease genes based on sequencing data^21,22. However, high-priced sequencing expenses are currently a concern that restricts acquiring large datasets.

Recently, we have applied GWAS and rare-variant analysis to investigate the genetics of MDD based on a whole-exome genotyping data from a Mexican-American cohort in Los Angeles and a replication European-ancestry cohort²³. Our results suggested that the “missing heritability” in MDD may be partly explained by rare variants, because most of the functional variations detected in the cohorts were rare. In this study, we designed a novel computational and statistical strategy to further investigate this conclusion. In our methodology, we used Hamming distance on common, low-frequency, and rare SNP sets to measure the genetic distance between two individuals. Then we built the multi-dimensional scaling pictures, in which separation between MDD cases and healthy controls revealed valuable information for hidden genetic factors of major depression. The corresponding statistical results in the pictures were reported.

Materials and methods

The two cohorts used in this study

In our previous work²³, we have investigated a cohort of MDD cases (n = 203) and controls (n = 196) of Los Angeles Mexican-Americans. They were mostly recent immigrants born in Mexico and experienced high levels of hyperactivation of the hypothalamic-pituitary-adrenal axis related to distress, challenges, and acculturation issues caused by immigration. MDD were diagnosed using the Structured Clinical Interview for DSM-IV (Diagnostic and Statistical Manual IV edition) (SCID, for abbreviation). Subjects met the diagnostic criteria for current, unipolar major depressive episode, attended a pharmacogenetic study on antidepressant treatment, and had an initial 21-Item Hamilton Depression Rating Scale (HAM-D21 for abbreviation) score of ≥18 with item number 1 (depressed mood) rated ≥2. Controls responded that they were in good health and replied questionnaires about acculturation. But they were not screened for medical illnesses and did not respond to structured psychiatric interviews. The controls were also Mexican-American and recruited from the same community in Los Angeles. The control group have similar sex ratio and age distribution (mean and standard error) to the MDD group (see Table S1). Participants submitted written informed consent, and their demographic, epidemiological, and clinical descriptions were previously described in detail^24–26. We have registered this study in ClinicalTrials.gov (NCT00265291). The research was approved by the Institutional Review Boards of the University of California Los Angeles and University of Miami, USA, and by the Human Research Ethics Committees of the Australian National University and Bellbery Ltd, Australia.

We also included the European-ancestry cohort of MDD cases (n = 473) and controls (n = 497), which was used for replication in our previous study²³. In this cohort, the MDD group also have similar sex ratio and age distribution (mean and standard error) to the control group (see Table S1). Those participants provided written informed consent and were recruited under two protocols: (1) Münster mood disorder studies (consisted of the neuroimaging and the mood-in-flame studies), which have been conducted by the Department of Psychiatry and Psychotherapy, University of Münster, Münster, Germany, and (2) the Characteristics of the Cognitive Function and Mood Study (CoFaM-Study) conducted by the Discipline of Psychiatry, University of Adelaide, South Australia, Australia²⁷. The SCID/MINI (Mini International Neuropsychiatric Interview) was used to ascertain that healthy controls were free from lifetime history of psychiatric disorders; for this cohort, we also used DSM-IV criteria and HAM-D21 for the main diagnostic of MDD and mood assessment. The study on this cohort was approved by Human Research Ethics Committee protocols at the University of Münster, Germany, and University of Adelaide and Flinders University, South Australia, Australia.

Our previous power analysis on the same cohorts²³ suggested that, when 100,000 variants are tested for association studies, 200 cases and 200 controls are sufficient to detect 80% true positives and a medium size of effect defined by the Cohen’s h parameter. This value of 100,000 overcomes the numbers of SNPs in common, low-frequency and rare-variant groups studied here. Actually, based on the effect sizes for the 19 MDD GWAS significant variants in the Mexican-American cohort²³, the post hoc statistical power ranges between >60% (SNP-exm 2249659, Cohen’s h = 0.335) and >99% (SNP-exm1508600, Cohen’s h = 0.643). Cohen’s h suggests 0.2, 0.5, and 0.8 represent small, medium, and large effect sizes, respectively; thus, it indicates that our study had enough power to detect medium to large effect sizes for current association tests.

Whole-exome SNP genotyping

The two cohorts were genotyped by the Australian Genome Research Facility (North Melbourne, VIC, Australia; www.agrf.org.au) using the Illumina HumanExome BeadChip-12v1_A, in which exonic content consists of >250,000 markers representing diverse populations and a range of common conditions. All the human samples passed the Illumina expected SNP calling rate (>99%). Then we filtered the raw whole-exome SNPs by a pipeline considering variant call rate, allele numbers, and Hardy–Weinberg equilibrium deviations. For this follow-up study, we analyzed 83,898 SNPs for the Mexican-American cohort and 121,174 SNPs for the European-ancestry cohort, which remained after quality control (QC) and filtering out criteria. Detailed QC and filtering analyses have been well reported in our previous work²³.

SNP classification and population stratification

Considering MAF, we divided the 83,898 SNPs in the Mexican-American population into 27,575 common variants, 17,838 low-frequency variants, and 38,485 rare variants, and divided the 121,174 SNPs in the European-ancestry population into 12,530 common variants, 12,902 low-frequency variants, and 95,742 rare variants. As expected, we found that most SNPs are rare (MAF < 0.5%) in the HumanExome BeadChip, because this chip has been designed to concentrate on rare variants rather than common ones²⁸, which contrast to conventional genome-wide genotyping arrays that do not tag low-frequency and rare variants²⁹.

We then used the four classes of SNPs (all, common, low-frequency and rare) to check population stratifications of the two cohorts. PLINK software³⁰, which provides a powerful tool for population stratification based on pairwise identity-by-state (IBS) distance and multi-dimensional scaling (MDS) plots, was used here.

Hamming distance between two individuals

In this study, we use Hamming distance³¹, a natural distance without assuming any model mutation/substitution rate, to investigate the genetic distance between two individuals based on a set of SNPs.

Let S be an SNP set which contains n SNPs. We use $S N P_{k}$ to represent the SNP indexed k (k = 1, …, n). Thus, S = { $S N P_{1}, S N P_{2}, . . ., S N P_{n}$ }. Suppose that X and Y are two individuals who have their own genotypes on this SNP set S, namely and respectively, S_X and S_Y . Let S_X be { $S N P_{1}^{X}, S N P_{2}^{X}, . . ., S N P_{n}^{X}$ } and S_Y be { $S N P_{1}^{Y}, S N P_{2}^{Y}, . . ., S N P_{n}^{Y}$ }. Then the Hamming distance between the two individuals X and Y is defined as

H (X, Y) = \sum_{i = 1}^{n} δ (S N P_{i}^{X}, S N P_{i}^{Y}),

where $δ (a, b) = \{\begin{matrix} 0 & if a and b are the same \\ 1 & otherwise \end{matrix}$ , that is, the number of positions at which the corresponding SNPs are different on the SNP set S. Considering the size of the SNP set, we can also get the normalized Hamming distance as

N H (X, Y) = \frac{\sum_{i = 1}^{n} δ (S N P_{i}^{X}, S N P_{i}^{Y})}{n} .

Take Table 1 as a simple example, individuals X, Y, and Z show their genotypes on an SNP set of eight SNPs. The Hamming distance between X and Y is 7 (SNP1, SNP3, SNP4, SNP5, SNP6, SNP7, and SNP8 are different). Similarly, the Hamming distance between Y and Z is 3, and the Hamming distance between X and Z is 5. Our hypothesis was that if two individuals have closer Hamming distance in this way, then those two individuals would have closer genetic distance and more similar phenotypes such as diseases or traits. We assume that Y and Z have more similar phenotypes in the above example.

Table 1.

Hamming distances of three subjects in an 8-SNP set

Genotype	SNP1	SNP2	SNP3	SNP4	SNP5	SNP6	SNP7	SNP8
	(A/T)	(G/T)	(C/G)	(C/T)	(C/T)	(A/G)	(A/C)	(C/T)
Subject X	AT	GG	CG	CC	CC	AG	AC	TT
Subject Y	AA	GG	CC	TT	CT	GG	CC	CT
Subject Z	AA	GG	CC	CC	CT	AA	CC	TT

Open in a new tab

SNP single-nucleotide polymorphism

Given a group of individuals, we can compute their Hamming distance matrix based on a specific SNP set such as common, low-frequency, or rare-variant set. After obtaining the distance matrix between all pairs of individuals, MDS approach³² can be used to observe the distance relationships among those individuals in a two-dimensional graph. The display of scatters representing individuals which shows separating variability between MDD cases and healthy controls can reveal interesting genetic information hidden in the SNP sets. Then for statistical analysis, Hotelling’s T² test for two independent samples is used to examine whether the means of the two groups (case and control) are equal.

Code availability

The codes (by R software; www.r-project.org) of data analysis for this study can be accessed from the authors.

Results

Population stratification

In Figs. 1 and 2, we presented the population stratification results based on all, common, low-frequency, and rare SNP sets for the Mexican-American cohort and the European-ancestry cohort. Although several far outliers are found in Mexican-American cohort for low-frequency and rare SNPs and in European-ancestry cohort for rare SNPs, there is no significant separation between depressed cases (blue points) and controls (red points) in the IBS-MDS plots.

Fig. 1 — Population stratification based on IBS distance and MDS for Mexican-American cohort

Fig. 2 — Population stratification based on IBS distance and MDS for European-ancestry cohort

MDS on Hamming distance

In Fig. 3 we presented MDS results on Hamming distance for all, common, low-frequency, and rare SNP sets of the Mexican-American cohort. For common SNPs, there is no significant separation between MDD cases (blue points) and controls (red points). However, for low-frequency and rare SNPs, we found that all the healthy controls were scattered in the lower half-plane, and all points in the upper half-plane were blue points representing depressed cases. We use Hotelling’s T² test to statistically examine these visual separations between case and control points in the MDS plane. For common variants, the result is P value = 5.891e−05 and T² statistic = 9.983. For low-frequency variants, the Hotelling’s T² test result is P value <2.2e−16 and T² statistic = 42.958. For rare variants, the Hotelling’s T² test result is P value = 7.681e−12 and T² statistic = 27.32. Therefore, the separation of cases and controls shown in the MDS plane is much more significant for low-frequency and rare SNPs than for common SNPs.

In Fig. 4, we showed the results of MDS on Hamming distance for all, common, low-frequency, and rare SNP sets of the European-ancestry cohort. For all the SNP sets, we see that there is no significant visual separation between MDD cases (blue points) and controls (red points) in the MDS planes. The same results can also be found in Figure S1 by excluding some far outliers.

Discussion

Genetic factors play important roles in the susceptibility to major depression, as indicated by family, twin, and adoption studies³³. The heritability of MDD is estimated to range between 40 and 70%³⁴. In this article, we designed a novel methodology to explore the “missing heritability” of MDD. The significant separations between cases and controls for low-frequency and rare variants in the planes of MDS on Hamming distance supported our previous conclusion²³ that most of the functional variants detected in the Mexican-American cohort were rare. The corresponding statistical results also show that the separation for low-frequency and rare SNPs is much more significant than for common SNPs. Thus, our findings further suggest that low-frequency and rare variants may play more significant roles in the development of MDD. Low-frequency and rare variants are currently not tagged by conventional genome-wide genotyping arrays, thus they may represent an important but understudied component of MDD genetics²⁹. There are many different types of technical designs that identify low-frequency and rare variants. In the current study, we applied whole-exome-wide genotyping array data, which are relatively cost-effective. With the rapid development of next-generation sequencing technologies (whole-genome sequencing, whole-exome sequencing, and targeted sequencing of candidate genes), it is now possible to collect most or even all low-frequency and rare genetic variants in large samples and test their roles in human disease risk^35–37. Thus, future work would be needed to further investigate our methodology on much larger SNP sets.

The traditional genetic distance³⁸ considers mutation rates and is designed as a measure of genetic divergence between populations within a species. Therefore, it is not appropriate to examine the genetic variation associated with a complex disorder within a human population, namely Mexican-American. In this work, Hamming distance was used to investigate the genetic distance between two individuals based on their SNP sets. As sequencing costs are currently dropping further, we may examine single-nucleotide variants (SNVs) which involve much more individual genetic information. For example, we recently proposed a new concept of SNV proportion in genes and employed it to develop a predictive approach for major depression³⁹. Using similar classification and cluster analysis methods, a potential tool for MDD diagnosis could also be constructed based on Hamming distance and low-frequency/rare SNPs.

Using our methodology with a case–control study, we could examine the effect of a group of variants within a specific range of MAF. The design and methodology we have developed can also be extended to other complex disorders. Our approach based on exome genotyping array or sequencing data may reveal the “missing heritability” resulted from allele frequency for many complex diseases.

Our European-ancestry cohort failed to show significant separations of cases and controls in the MDS planes on Hamming distance. The reasons could be as follows. First, MDD is clearly a gene–environment interaction disorder³⁴. Our Mexican-American cohort is comprised of first-generation individuals (60%) who have experienced significant levels of stress and hyperactivation of the hypothalamic-pituitary-adrenal axis related to acculturation issues^40,41. In contrast to the European-ancestry cohort, the significant stressful life events for Mexican-Americans could cause much higher levels of depression. Therefore, the depression effect differences between case and control in two cohorts may be large. Further studies using our methodology could be tested on a larger size of European-ancestry sample. Second, the European-ancestry cohort that we studied have much lower levels of genetic variants. In our previous work²³, whole-genome sequencing analyses of a subset of the two cohorts revealed that European-ancestry subjects have a significantly reduced (around 50%) number of SNVs compared with Mexican-American subjects. For this reason, the roles of low-frequency and rare variants may vary across populations.

Electronic supplementary material

Supplementary Information(PDF 263 kb)^{(263.1KB, pdf)}

Acknowledgements

We have been supported by grants APP1051931 (M.-L.W. and M.A.-B.), APP1070935 (M.-L.W.), and APP1060524 (B.T.B.) from NHMRC of Australia, the German Research Foundation Grant FOR 2107, DA1151/5-1 (UD), NIH Grant GM61394 (M.-L.W.), and institutional funds from the South Australian Health and Medical Research Institute, Flinders University. and the Australian National University.

Conflict of interest

The authors declare that they have no conflict of interest.

Footnotes

Electronic supplementary material

Supplementary Information accompanies this paper at 10.1038/s41398-018-0117-7.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Chenglong Yu, Email: chenglong.yu@flinders.edu.au.

Julio Licinio, Email: licinioj@upstate.edu.

References

1.Kessler RC, et al. Lifetime and 12-month prevalence of DSM-III-R psychiatric disorders in the United States. Results from the National Comorbidity Survey. Arch. Gen. Psychiatry. 1994;51:8–19. doi: 10.1001/archpsyc.1994.03950010008002. [DOI] [PubMed] [Google Scholar]
2.Lopez AD, Murray CC. The global burden of disease, 1990–2020. Nat. Med. 1998;4:1241–1243. doi: 10.1038/3218. [DOI] [PubMed] [Google Scholar]
3.Wong ML, Licinio J. Research and treatment approaches to depression. Nat. Rev. Neurosci. 2001;2:343–351. doi: 10.1038/35072566. [DOI] [PubMed] [Google Scholar]
4.Wong ML, Licinio J. From monoamines to genomic targets: a paradigm shift for drug discovery in depression. Nat. Rev. Drug. Discov. 2004;3:136–151. doi: 10.1038/nrd1303. [DOI] [PubMed] [Google Scholar]
5.Kessler RC, Chiu WT, Demler O, Merikangas KR, Walters EE. Prevalence, severity, and comorbidity of 12-month DSM-IV disorders in the National Comorbidity Survey Replication. Arch. Gen. Psychiatry. 2005;62:617–627. doi: 10.1001/archpsyc.62.6.617. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Lohoff FW. Overview of the genetics of major depressive disorder. Curr. Psychiatry Rep. 2010;12:539–546. doi: 10.1007/s11920-010-0150-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Flint J, Kendler KS. The genetics of major depression. Neuron. 2014;81:484–503. doi: 10.1016/j.neuron.2014.01.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.CONVERGE Consortium. Sparse whole-genome sequencing identifies two loci for major depressive disorder. Nature. 2015;523:588–591. doi: 10.1038/nature14659. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Amin N, et al. Exome-sequencing in a large population-based study reveals a rare Asn396Ser variant in the LIPG gene associated with depressive symptoms. Mol. Psychiatry. 2017;22:537–543. doi: 10.1038/mp.2016.101. [DOI] [PubMed] [Google Scholar]
10.Hyde CL, et al. Identification of 15 genetic loci associated with risk of major depression in individuals of European descent. Nat. Genet. 2016;48:1031–1036. doi: 10.1038/ng.3623. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Sullivan PF, Daly MJ, O’Donovan M. Genetic architectures of psychiatric disorders: the emerging picture and its implications. Nat. Rev. Genet. 2012;13:537–551. doi: 10.1038/nrg3240. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Peterson RE, et al. The genetic architecture of major depressive disorder in Han Chinese women. JAMA Psychiatry. 2017;74:162–168. doi: 10.1001/jamapsychiatry.2016.3578. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Manolio TA, et al. Finding the missing heritability of complex diseases. Nature. 2009;461:747–753. doi: 10.1038/nature08494. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Eichler EE, et al. Missing heritability and strategies for finding the underlying causes of complex disease. Nat. Rev. Genet. 2010;11:446–450. doi: 10.1038/nrg2809. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Lee SH, Wray NR, Goddard ME, Visscher PM. Estimating missing heritability for disease from genome-wide association studies. Am. J. Hum. Genet. 2011;88:294–305. doi: 10.1016/j.ajhg.2011.02.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Zuk O, Hechter E, Sunyaev SR, Lander ES. The mystery of missing heritability: genetic interactions create phantom heritability. Proc. Natl. Acad. Sci. USA. 2012;109:1193–1198. doi: 10.1073/pnas.1119675109. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Wray NR, Maier R. Genetic basis of complex genetic disease: the contribution of disease heterogeneity to missing heritability. Curr. Epidemiol. Rep. 2014;1:220–227. doi: 10.1007/s40471-014-0023-3. [DOI] [Google Scholar]
18.Lee S, Abecasis GR, Boehnke M, Lin X. Rare-variant association analysis: study designs and statistical tests. Am. J. Hum. Genet. 2014;95:5–23. doi: 10.1016/j.ajhg.2014.06.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Ott J, Wang J, Leal SM. Genetic linkage analysis in the age of whole-genome sequencing. Nat. Rev. Genet. 2015;16:275–284. doi: 10.1038/nrg3908. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Knowles EE, et al. Genome-wide linkage on chromosome 10q26 for a dimensional scale of major depression. J. Affect. Disord. 2016;191:123–131. doi: 10.1016/j.jad.2015.11.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Imai A, et al. Beyond homozygosity mapping: family-control analysis based on Hamming distance for prioritizing variants in exome sequencing. Sci. Rep. 2015;5:12028. doi: 10.1038/srep12028. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Imai A, et al. HDR: a statistical two-step approach successfully identifies disease genes in autosomal recessive families. J. Hum. Genet. 2016;61:959–963. doi: 10.1038/jhg.2016.85. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Wong ML, et al. The PHF21B gene is associated with major depression, and modulates stress response. Mol. Psychiatry. 2017;22:1015–1025. doi: 10.1038/mp.2016.174. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Dong C, Wong ML, Licinio J. Sequence variations of ABCB1, SLC6A2, SLC6A3, SLC6A4, CREB1, CRHR1 and NTRK2: association with major depression and antidepressant response in Mexican-Americans. Mol. Psychiatry. 2009;14:1105–1118. doi: 10.1038/mp.2009.92. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Wong ML, Dong C, Andreev V, Arcos-Burgos M, Licinio J. Prediction of susceptibility to major depression by a model of interactions of multiple functional genetic variants and environmental factors. Mol. Psychiatry. 2012;17:624–633. doi: 10.1038/mp.2012.13. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Wong ML, et al. Clinical outcomes and genome-wide association for a brain methylation site in an antidepressant pharmacogenetics study in Mexican Americans. Am. J. Psychiatry. 2014;171:1297–1309. doi: 10.1176/appi.ajp.2014.12091165. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Baune BT, Air T. Clinical, functional, and biological correlates of cognitive dimensions in major depressive disorder-rationale, design, and characteristics of the cognitive function and mood study (CoFaM-Study) Front. Psychiatry. 2016;7:150. doi: 10.3389/fpsyt.2016.00150. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Guo Y, et al. Illumina human exome genotyping array clustering and quality control. Nat. Protoc. 2014;9:2643–2662. doi: 10.1038/nprot.2014.174. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Auer PL, Lettre G. Rare variant association studies: considerations, challenges and opportunities. Genome Med. 2015;7:16. doi: 10.1186/s13073-015-0138-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Purcell S, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007;81:559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Hamming RW. Error detecting and error correcting codes. Bell Syst. Tech. J. 1950;29:147–160. doi: 10.1002/j.1538-7305.1950.tb00463.x. [DOI] [Google Scholar]
32.Torgerson WS. Multidimensional scaling: I. theory and method. Psychometrika. 1952;17:401–419. doi: 10.1007/BF02288916. [DOI] [Google Scholar]
33.Sullivan PF, Neale MC, Kendler KS. Genetic epidemiology of major depression: review and meta-analysis. Am. J. Psychiatry. 2000;157:1552–1562. doi: 10.1176/appi.ajp.157.10.1552. [DOI] [PubMed] [Google Scholar]
34.Lesch KP. Gene–environment interaction and the genetics of depression. J. Psychiatry Neurosci. 2004;29:174–184. [PMC free article] [PubMed] [Google Scholar]
35.Dunn EC, et al. Genetic determinants of depression: recent findings and future directions. Harv. Rev. Psychiatry. 2015;23:1–18. doi: 10.1097/HRP.0000000000000054. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Yu C, Baune BT, Licinio J, Wong ML. A novel strategy for clustering major depression individuals using whole-genome sequencing variant data. Sci. Rep. 2017;7:44389. doi: 10.1038/srep44389. [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Yu C, Baune BT, Licinio J, Wong ML. Whole-genome single nucleotide variant distribution on genomic regions and its relationship to major depression. Psychiatry Res. 2017;252:75–79. doi: 10.1016/j.psychres.2017.02.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Nei M, Kumar S. Molecular Evolution and Phylogenetics. New York: Oxford Univ. Press; 2000. [Google Scholar]
39.Yu C, Baune BT, Licinio J, Wong ML. Single-nucleotide variant proportion in genes: a new concept to explore major depression based on DNA sequencing data. J. Hum. Genet. 2017;62:577–580. doi: 10.1038/jhg.2017.2. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Caplan S, et al. Cultural influences on causal beliefs about depression among Latino immigrants. J. Transcult. Nurs. 2013;24:68–77. doi: 10.1177/1043659612453745. [DOI] [PubMed] [Google Scholar]
41.Korenblum W, et al. Elevated cortisol levels and increased rates of diabetes and mood symptoms in Soviet Union-born Jewish immigrants to Germany. Mol. Psychiatry. 2005;10:974–975. doi: 10.1038/sj.mp.4001720. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information(PDF 263 kb)^{(263.1KB, pdf)}

[CR1] 1.Kessler RC, et al. Lifetime and 12-month prevalence of DSM-III-R psychiatric disorders in the United States. Results from the National Comorbidity Survey. Arch. Gen. Psychiatry. 1994;51:8–19. doi: 10.1001/archpsyc.1994.03950010008002. [DOI] [PubMed] [Google Scholar]

[CR2] 2.Lopez AD, Murray CC. The global burden of disease, 1990–2020. Nat. Med. 1998;4:1241–1243. doi: 10.1038/3218. [DOI] [PubMed] [Google Scholar]

[CR3] 3.Wong ML, Licinio J. Research and treatment approaches to depression. Nat. Rev. Neurosci. 2001;2:343–351. doi: 10.1038/35072566. [DOI] [PubMed] [Google Scholar]

[CR4] 4.Wong ML, Licinio J. From monoamines to genomic targets: a paradigm shift for drug discovery in depression. Nat. Rev. Drug. Discov. 2004;3:136–151. doi: 10.1038/nrd1303. [DOI] [PubMed] [Google Scholar]

[CR5] 5.Kessler RC, Chiu WT, Demler O, Merikangas KR, Walters EE. Prevalence, severity, and comorbidity of 12-month DSM-IV disorders in the National Comorbidity Survey Replication. Arch. Gen. Psychiatry. 2005;62:617–627. doi: 10.1001/archpsyc.62.6.617. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] 6.Lohoff FW. Overview of the genetics of major depressive disorder. Curr. Psychiatry Rep. 2010;12:539–546. doi: 10.1007/s11920-010-0150-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR7] 7.Flint J, Kendler KS. The genetics of major depression. Neuron. 2014;81:484–503. doi: 10.1016/j.neuron.2014.01.027. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] 8.CONVERGE Consortium. Sparse whole-genome sequencing identifies two loci for major depressive disorder. Nature. 2015;523:588–591. doi: 10.1038/nature14659. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR9] 9.Amin N, et al. Exome-sequencing in a large population-based study reveals a rare Asn396Ser variant in the LIPG gene associated with depressive symptoms. Mol. Psychiatry. 2017;22:537–543. doi: 10.1038/mp.2016.101. [DOI] [PubMed] [Google Scholar]

[CR10] 10.Hyde CL, et al. Identification of 15 genetic loci associated with risk of major depression in individuals of European descent. Nat. Genet. 2016;48:1031–1036. doi: 10.1038/ng.3623. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR11] 11.Sullivan PF, Daly MJ, O’Donovan M. Genetic architectures of psychiatric disorders: the emerging picture and its implications. Nat. Rev. Genet. 2012;13:537–551. doi: 10.1038/nrg3240. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Peterson RE, et al. The genetic architecture of major depressive disorder in Han Chinese women. JAMA Psychiatry. 2017;74:162–168. doi: 10.1001/jamapsychiatry.2016.3578. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR13] 13.Manolio TA, et al. Finding the missing heritability of complex diseases. Nature. 2009;461:747–753. doi: 10.1038/nature08494. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Eichler EE, et al. Missing heritability and strategies for finding the underlying causes of complex disease. Nat. Rev. Genet. 2010;11:446–450. doi: 10.1038/nrg2809. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Lee SH, Wray NR, Goddard ME, Visscher PM. Estimating missing heritability for disease from genome-wide association studies. Am. J. Hum. Genet. 2011;88:294–305. doi: 10.1016/j.ajhg.2011.02.002. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Zuk O, Hechter E, Sunyaev SR, Lander ES. The mystery of missing heritability: genetic interactions create phantom heritability. Proc. Natl. Acad. Sci. USA. 2012;109:1193–1198. doi: 10.1073/pnas.1119675109. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.Wray NR, Maier R. Genetic basis of complex genetic disease: the contribution of disease heterogeneity to missing heritability. Curr. Epidemiol. Rep. 2014;1:220–227. doi: 10.1007/s40471-014-0023-3. [DOI] [Google Scholar]

[CR18] 18.Lee S, Abecasis GR, Boehnke M, Lin X. Rare-variant association analysis: study designs and statistical tests. Am. J. Hum. Genet. 2014;95:5–23. doi: 10.1016/j.ajhg.2014.06.009. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR19] 19.Ott J, Wang J, Leal SM. Genetic linkage analysis in the age of whole-genome sequencing. Nat. Rev. Genet. 2015;16:275–284. doi: 10.1038/nrg3908. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Knowles EE, et al. Genome-wide linkage on chromosome 10q26 for a dimensional scale of major depression. J. Affect. Disord. 2016;191:123–131. doi: 10.1016/j.jad.2015.11.012. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR21] 21.Imai A, et al. Beyond homozygosity mapping: family-control analysis based on Hamming distance for prioritizing variants in exome sequencing. Sci. Rep. 2015;5:12028. doi: 10.1038/srep12028. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] 22.Imai A, et al. HDR: a statistical two-step approach successfully identifies disease genes in autosomal recessive families. J. Hum. Genet. 2016;61:959–963. doi: 10.1038/jhg.2016.85. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Wong ML, et al. The PHF21B gene is associated with major depression, and modulates stress response. Mol. Psychiatry. 2017;22:1015–1025. doi: 10.1038/mp.2016.174. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] 24.Dong C, Wong ML, Licinio J. Sequence variations of ABCB1, SLC6A2, SLC6A3, SLC6A4, CREB1, CRHR1 and NTRK2: association with major depression and antidepressant response in Mexican-Americans. Mol. Psychiatry. 2009;14:1105–1118. doi: 10.1038/mp.2009.92. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR25] 25.Wong ML, Dong C, Andreev V, Arcos-Burgos M, Licinio J. Prediction of susceptibility to major depression by a model of interactions of multiple functional genetic variants and environmental factors. Mol. Psychiatry. 2012;17:624–633. doi: 10.1038/mp.2012.13. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR26] 26.Wong ML, et al. Clinical outcomes and genome-wide association for a brain methylation site in an antidepressant pharmacogenetics study in Mexican Americans. Am. J. Psychiatry. 2014;171:1297–1309. doi: 10.1176/appi.ajp.2014.12091165. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR27] 27.Baune BT, Air T. Clinical, functional, and biological correlates of cognitive dimensions in major depressive disorder-rationale, design, and characteristics of the cognitive function and mood study (CoFaM-Study) Front. Psychiatry. 2016;7:150. doi: 10.3389/fpsyt.2016.00150. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR28] 28.Guo Y, et al. Illumina human exome genotyping array clustering and quality control. Nat. Protoc. 2014;9:2643–2662. doi: 10.1038/nprot.2014.174. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR29] 29.Auer PL, Lettre G. Rare variant association studies: considerations, challenges and opportunities. Genome Med. 2015;7:16. doi: 10.1186/s13073-015-0138-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR30] 30.Purcell S, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007;81:559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR31] 31.Hamming RW. Error detecting and error correcting codes. Bell Syst. Tech. J. 1950;29:147–160. doi: 10.1002/j.1538-7305.1950.tb00463.x. [DOI] [Google Scholar]

[CR32] 32.Torgerson WS. Multidimensional scaling: I. theory and method. Psychometrika. 1952;17:401–419. doi: 10.1007/BF02288916. [DOI] [Google Scholar]

[CR33] 33.Sullivan PF, Neale MC, Kendler KS. Genetic epidemiology of major depression: review and meta-analysis. Am. J. Psychiatry. 2000;157:1552–1562. doi: 10.1176/appi.ajp.157.10.1552. [DOI] [PubMed] [Google Scholar]

[CR34] 34.Lesch KP. Gene–environment interaction and the genetics of depression. J. Psychiatry Neurosci. 2004;29:174–184. [PMC free article] [PubMed] [Google Scholar]

[CR35] 35.Dunn EC, et al. Genetic determinants of depression: recent findings and future directions. Harv. Rev. Psychiatry. 2015;23:1–18. doi: 10.1097/HRP.0000000000000054. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR36] 36.Yu C, Baune BT, Licinio J, Wong ML. A novel strategy for clustering major depression individuals using whole-genome sequencing variant data. Sci. Rep. 2017;7:44389. doi: 10.1038/srep44389. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR37] 37.Yu C, Baune BT, Licinio J, Wong ML. Whole-genome single nucleotide variant distribution on genomic regions and its relationship to major depression. Psychiatry Res. 2017;252:75–79. doi: 10.1016/j.psychres.2017.02.041. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR38] 38.Nei M, Kumar S. Molecular Evolution and Phylogenetics. New York: Oxford Univ. Press; 2000. [Google Scholar]

[CR39] 39.Yu C, Baune BT, Licinio J, Wong ML. Single-nucleotide variant proportion in genes: a new concept to explore major depression based on DNA sequencing data. J. Hum. Genet. 2017;62:577–580. doi: 10.1038/jhg.2017.2. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR40] 40.Caplan S, et al. Cultural influences on causal beliefs about depression among Latino immigrants. J. Transcult. Nurs. 2013;24:68–77. doi: 10.1177/1043659612453745. [DOI] [PubMed] [Google Scholar]

[CR41] 41.Korenblum W, et al. Elevated cortisol levels and increased rates of diabetes and mood symptoms in Soviet Union-born Jewish immigrants to Germany. Mol. Psychiatry. 2005;10:974–975. doi: 10.1038/sj.mp.4001720. [DOI] [PubMed] [Google Scholar]

PERMALINK

Low-frequency and rare variants may contribute to elucidate the genetics of major depressive disorder

Chenglong Yu

Mauricio Arcos-Burgos

Bernhard T Baune

Volker Arolt

Udo Dannlowski

Ma-Li Wong

Julio Licinio

Abstract

Introduction