Abstract
Background and Aims
Twin and family studies suggest that genetic influences are shared across substances of abuse. However, despite evidence of heritability, genome-wide association and candidate gene studies have indicated numerous markers of limited effects, suggesting that much of the heritability remains missing. We estimated (1) the aggregate effect of common single nucleotide polymorphisms (SNPs) on multiple indicators of comorbid drug problems that are typically employed across community and population-based samples, and (2) the genetic covariance across these measures.
Participants
2596 unrelated subjects from the “Study of Addiction: Genetics and Environment” provided information on alcohol, tobacco, cocaine, cannabis, and other illicit substance dependence. Phenotypic measures included: (1) a factor score based on DSM-IV drug dependence diagnoses (DD), (2) a factor score based on problem use (PU; i.e., 1+ DSM-IV symptoms), and (3) dependence vulnerability (DV; a ratio of DSM-IV symptoms to the number of substances used).
Findings
Univariate and bivariate Genome-wide complex trait analyses of this selected sample indicated that common SNPs explained 25-36% of the variance across measures, with DD and DV having the largest effects [h2SNP (CI)=0.36 (0.11-0.62) and 0.33(0.07-0.58), respectively; PU = 0.25 (-0.01-0.51)]. Genetic effects were shared across the three phenotypic measures of comorbid drug problems (rSNP; rDD-PU = 0.92 (0.76-1.00), rDD-DV = 0.97 (0.87-1.00), and rPU-DV = 0.96 (0.82-1.00)).
Conclusion
At least 20% of the variance in the generalized vulnerability to substance dependence is attributable to common single nucleotide polymorphisms. The additive effect of common single nucleotide polymorphisms is shared across important indicators of comorbid drug problems.
Keywords: Addiction, Substance Dependence, Genetics, Genome-wide Complex Trait Analysis
Introduction
Substance dependence is a complex behavior influenced by both genetic and environmental factors. The role of genetics in substance dependence is well established in twin and family studies. Unfortunately, molecular genetic studies have had limited success identifying individual genetic variants that are common across multiple substances of abuse (1-10). First, the primary conclusions from molecular genetic studies are that common single nucleotide polymorphisms (SNPs) modestly contribute to substance dependence phenotypes (e.g., rs1614972 in the alcohol dehydrogenase gene (ADH1C) was the only replicable SNP in a recent alcoholism genome-wide association study (GWAS) (11), and variants in the CHRNA5-A3-B4 gene cluster have been repeated linked to tobacco addiction/dependence (12, 13)). Second, multiple genetic polymorphisms influence substance dependence. In part to address the possibility that most variants truly associated with complex traits have effect sizes too small to detect individually using GWAS studies, Yang et al. (14) developed a new method, Genome-wide Complex Trait Analysis (GCTA), that focuses on the estimation of the phenotypic variance explained by genome-wide similarity at genotyped single nucleotide polymorphism (SNPs). Rather than testing each SNP individually, GCTA decomposes the phenotypic variance into two components: (1) effects due to the additive influences of all measured SNPs (h2SNP) and (2) the effects due to unmeasured environmental influences, random noise, or the effects of genetic variants that were not measured by the genotyping array. This approach allows for an estimate of phenotypic variability explained by genome-wide SNP data.
In part, some of the variability in findings across studies can also be attributed to differences in how drug dependence phenotypes are scored for analysis. Prior studies have primarily utilized clinically-defined phenotypes (based on the Diagnostic and Statistical Manual of Mental Disorders dependence symptoms (version four; DSM-IV) (15, 16)), such as dependence diagnosis (i.e., 3+ DSM-IV dependence symptoms all occurring in a 12 month period). However, alternative and dimensional summary scores, such as problem usage (i.e., 1+ DSM-IV dependence symptoms) and symptom counts have also been utilized to help overcome low-level levels of diagnosis often observed in community and population-based samples. Epidemiological studies (17, 18) show that individuals who meet the clinical criteria for dependence diagnosis for one substance are at greatly increased risk of using or becoming dependent/addicted to other substances, suggesting a generalized pattern of problematic drug usage. Furthermore, studies suggest that the “common liability” (i.e., each substance has its own set of genetic and environmental liabilities that are shared with other substances) and “alternative forms” (i.e., comorbidity across substances arises because each substance is an alternate manifestation of a common underlying liability for deviant behaviors) models best explain the observed comorbidity for dependence across different substances (19-22). Like Figure 1, these models assume that dependence upon multiple substances is attributable to correlated latent liabilities or a single latent continuous liability, respectively. Recently, a comparison of three multivariate genetic models indicated that a model which attributes the covariance among different substances to a single latent trait parsimoniously describes alcohol, tobacco, and cannabis dependence in a community-based sample (21). The identified factor, which was referred to as “Substance Dependence Vulnerability” was highly heritable (64%) across genders and has been demonstrated to be stable over time (23). Likewise, evidence for a general tendency to misuse substances has also given rise to dimensional measures of comorbid drug problems, in particular, dependence vulnerability (DV), which is a heritable (h2 = 0.40) summary measure that reflects the average number of DSM-IV dependence symptoms across substance used (24).
Figure 1.
The Common Pathway Model. The observed phenotypic variance/covariance of the measured dependence phenotypes (rectangles) represent alternate manifestations of a generalized vulnerability to dependence (indicated by the latent trait (solid circle), Substance Dependence Vulnerability). Variation in the latent trait and the variance in the observed phenotypes are decomposable into genetic (G) and environmental/residual (E) influences (dashed-circles).
The current study aimed to identify genetic effects on the vulnerability to substance dependence. We hypothesized that common genetic variants account for at least one-half of the genetic variance observed in twin/adoption studies. Furthermore, to demonstrate the validity of these findings we utilized three DSM-IV based definitions of comorbid drug dependence and further hypothesized that a common set of genetic factors would account for any and all genetic variance identified across the definitions. To test these hypotheses we first used factor analysis to replicate prior work (21) and determine the existence of a common factor indicated by either drug dependence diagnosis (DD; i.e., 3+ DSM-IV criteria in a 12-month period) or problem use (PU; i.e., 1+ DSM-IV criteria (lifetime)). Second, we examined genome-wide additive genetic influences on the observed factors, as well as the summary measure, dependence vulnerability (DV). Clinical definitions (DD), sub-clinical thresholds (PU), and the DV summary score were analyzed separately to explore how the measurement approach used to reflect substance-related problems affects the magnitude of identified additive genetic effects. Finally, we used a bivariate GCTA model to test whether DD, PU and DV index the same genetic liability.
Methods
Sample
Data were from the Study of Addiction: Genetics and Environment (SAGE), which is part of the National Human Genome Research Institute's Gene Environment Association Study Initiative (Database for Genotypes and Phenotypes (dbGaP) study accession phs000092.v1.p1). SAGE is a multi-ethnic sample of 4121 unrelated individuals from three large, complementary datasets designed to study drug addiction: the Collaborative Study on the Genetics of Alcoholism (COGA), the Family Study of Cocaine Dependence (FSCD), and the Collaborative Genetic Study of Nicotine Dependence (COGEND). Subjects in COGA were recruited from several sites throughout the United States of America by identifying alcoholic probands from treatment facilities and recruiting other family members, as well as comparison families from the same communities. COGA subjects utilized in the current study are a case-control subset of independent individuals. The FSCD is a case-control family study that ascertained cocaine dependent individuals from chemical dependency treatment units in the greater St. Louis metropolitan area along with community-based control subjects matched on several criteria such as age and race. The COGEND is a community-based case-control family study that recruited nicotine dependent cases and non-dependent smoking controls from the cities of Detroit and St. Louis. The current analyses focused on a subset of 2596 (COGA N = 957, FSCD N = 541, COGEND N = 1098) unrelated participants (44% male, mean age = 38.58 years, standard deviation = 9.80) of European descent (confirmed via principal component analysis including HAPMAP CEPH, Yoruban, Han Chineese, and Japanese samples as ancestral reference groups) drawn from all three studies that comprise SAGE. SAGE was selected for this study because it is the largest cohort of complementary datasets that is publicly available (additional description of SAGE is available at http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000092.v1.p1). While the ascertainment strategy of each study focuses on a particular substance, the fact that substance users have a tendency to become involved with multiple substances allows for sampling of both single and multiple drug users, thereby increasing our power to examine the latent trait that is vulnerability to substance addiction (Supplemental Table S1 provides a description of rates of drug dependence across studies in SAGE).
Measures
Self-report data on the endorsement of DSM-IV symptoms for dependence on alcohol, nicotine, cocaine, cannabis, and other illicit drugs (i.e., dependence on drugs other than cannabis or cocaine (e.g., opiates, phencyclidine, hallucinogens, sedatives)) were gathered using the Semi-Structured Assessment for the Genetics of Alcoholism (SSAGA) (25) and its modified versions (the Semi-Structured Assessment of Nicotine Dependence used in COGEND and the Semi-Structured Assessment for Cocaine Dependence used in FSCD).
Derivation of phenotypes
Several studies strongly support a general tendency to use and become dependent on multiple substances (18, 26-28). As such, we defined substance addiction phenotypically by combining dependence data across various classes of drugs. Further, because phenotypic definitions affect heritability estimates, we created three DSM-IV-based phenotypes intended to approximate those that have commonly been used in the literature. Consistent with the extant literature, we defined dependence vulnerability (DV) as a summary score based on the ratio of the total dependence symptoms endorsed over the total number of substances used (mean = 2.13 (SD = 1.72)) (24, 29). Exploratory and confirmatory analysis was used to for the remaining measures.
To understand the factor structure of DD and PU items, we first used exploratory factor analysis (EFA) on a random half of subjects and then conducted confirmatory factor analysis (CFA) on the remaining half of the sample. For all factor analyses, we used MPlus (Version 7) (30) using weighted least squares mean variance estimation. Parallel analysis (31) and scree plots were used to determine the number of factors, and standard EFA/CFA fit indices (e.g., root mean square error of approximation (RMSEA), comparable fit index (CFI), and Tucker-Lewis index (TLI)) to compare nested models (32, 33). EFA and CFA models indicated a single factor when using either the substance dependence diagnosis items or problem use items (see Supplemental Table S2 and Supplemental Figure S1 for detailed results). Based on the consensus between the EFA and CFA models, genetic analyses utilized factor scores (Mean (M) = 0, standard deviation (SD) = 1) extracted from CFA models using all individuals of European descent. Genetic analyses of DV utilized rank normalized scores (using the BLOM approximation method (34); M = 0, SD = 1). These analyses yielded a factor score based on the dichotomous DSM-IV diagnoses of “drug dependence” (DD) and a factor score of “problem usage” (PU) that was based on items measuring the presence or absence of at least one of the seven total substance dependence symptoms for each of the five diagnostic categories. Unlike recent GWAS studies using factor scores based on dependence diagnoses, DV has previously been shown to localize regions of interest using linkage analysis (29).
Genetic Analyses
Genotyping
Genotyping of SAGE was done using the Illumina 1M platform using blood samples deposited at the Rutgers University Cell and DNA Repository (http://www.rucdr.org) and was performed at the Johns Hopkins Center for Inherited Disease Research (CIDR) using Illumina Human1Mv1_C BeadChips and the Illumina Infinium II assay protocol. SNP calls were made using Illumina BeadStudio Genotyping Module v3.1.14. Strict quality-control standards were implemented, and genotypes were released by CIDR for 1,040,106 SNPs (99.15% of attempted). Further details are provided in the comprehensive data cleaning report posted at dbGaP http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/document.cgi?study_id=phs000092.v1.p1&phv=22928&phd=2274&pha=&pht=116&phvf=&phdf=20&phaf=&phtf=&dssp=1&consent=&temp=1.
Population Stratification
We performed principal component analysis (PCA) using a random sample of 30,000 independent SNPs to control for any residual population stratification effects among the individuals of European descent. Principal components have been demonstrated to capture subtle ancestral differences even within ethnicity as well certain technical artifacts (e.g., batch effects) (35). Accordingly, all mixed effects analyses included the first five ancestral principal components as covariates.
Quality Control (QC)
For all genetic analyses, we retained 796,125 autosomal markers with an allele frequency > 1%, a call rate ≥ 99%, and a Hardy–Weinberg Equilibrium (HWE) p-value greater than 0.0001 among subjects of European descent.
Estimation of variance/covariance explained by the SNPs
We used GCTA to decompose the phenotypic variance in each measure into variance components due to the additive effects of all genotyped SNPs and residual effects. GCTA consists of two steps in which the genetic similarity between all pairs of individuals is obtained via a pairwise genetic relationship matrix (GRM), followed by construction of a mixed effects model using genetic similarity as a random effect to predict each phenotype. In our identification of subjects of European descent from SAGE, we used GCTA to systematically remove one of any pair of individuals who were more related than second cousins in order to control for cryptic relatedness, which could artificially inflate SNP-heritability estimates. The GRM used in all analyses comprised the 2596 unrelated individuals. Univariate and bivariate models were fitted to the phenotypic data while controlling for age, gender, study origin (to account for mean differences/batch effects between the different samples within SAGE), and the first five ancestral principal components to account for stratification effects within individuals of European descent (11).
Results
Prevalence and comorbidity of drug dependence and problem use
Table 1 shows the prevalence rates and correlations between the variables used to construct the DD and PU factor scores. Alcohol and nicotine dependence represented the most common substance dependence diagnoses (∼45% prevalence). Rates for the problem use items were higher and followed the same pattern. Phenotypic tetrachoric correlations among all items were generally high and are shown in Table 1. For example, correlations among the problem use items ranged from 0.47 (between Other Drug and Nicotine problem use) to ∼0.84 (between Other Drug and Cocaine problem use).
Table 1. Prevalence, sample size, and correlations among Drug Dependence (DD)/Problem Use (PU) variables.
Phenotype | % (N) | Tetrachoric correlations (ASE) | ||||
---|---|---|---|---|---|---|
Drug Dependence Diagnosis (3+ symptoms) | Alcohol | Cocaine | Cannabis | Nicotine | Other Drugs | |
Alcohol | 45.65 (1185) | 1.00 | ||||
Cocaine | 19.29 (500) | 0.78 (0.02) | 1.00 | |||
Cannabis | 16.76 (434) | 0.79 (0.02) | 0.82 (0.02) | 1.00 | ||
Nicotine | 45.53 (1146) | 0.66 (0.02) | 0.55 (0.03) | 0.60 (0.03) | 1.00 | |
Other Drugs | 15.63 (405) | 0.80 (0.02) | 0.83 (0.02) | 0.77 (0.02) | 0.54 (0.03) | 1.00 |
Problem Use (1+ symptoms) | Alcohol | Cocaine | Cannabis | Nicotine | Other Drugs | |
Alcohol | 70.80 (1838) | 1.00 | ||||
Cocaine | 22.71 (589) | 0.73 (0.03) | 1.00 | |||
Cannabis | 30.35 (788) | 0.67 (0.02) | 0.80 (0.02) | 1.00 | ||
Nicotine | 70.69 (1835) | 0.59 (0.02) | 0.47 (0.03) | 0.50 (0.03) | 1.00 | |
Other Drug | 22.13 (574) | 0.74 (0.03) | 0.84 (0.01) | 0.78 (0.02) | 0.47 (0.03) | 1.00 |
Table showing the level of dependence diagnoses (3+ symptoms in a year) and problem use (1+ symptoms) for each substance in the sample, as well as the tetrachoric correlation between substances. Abbreviations: ASE - Asymptotic standard error.
Phenotypic variance/covariance attributable to common SNPs
The additive genetic variance due to all SNPs (h2SNP) was similar across DD, PU, and DV. The univariate h2SNP estimate of DD was 0.36 (SE = 0.13, p-value (p) = 2.30E-3). The observed univariate h2SNP estimates for PU (h2SNP = 0.25, SE = 0.13, p = 0.03) and DV (h2SNP = 0.32, SE = 0.13, p = 8.91E-3) were slightly smaller. Consistent with the hypothesis that a large number of alleles throughout the genome contribute to substance dependence, longer chromosomes accounted for more phenotypic variation than smaller ones (Figure 2), but not for all phenotypes. For PU h2SNP increased with chromosome length (Figure 2; R2=0.27, β=1.87×10-4, t(20)=2.75, p=0.01). This was not the case with DD (R2=0.12, β=1.47×10-4, t(20)=1.64, p=0.12) and DV (R2=0.00, β=1.60×10-5, t(20)=0.20, p=0.84) phenotypes. Post hoc analyses of the h2SNP estimates within each cohort of SAGE (see Supplemental Table S3) showed internal consistency in the GCTA estimates for each of the phenotypes; though the small sample sizes and differences in ascertainment strategies across studies complicate direct comparison of these estimates.
Figure 2.
Proportion of variance explained (VG/VP) of individual chromosomes on each phenotype while imposing strict QC and adjusting for cryptic relatedness. Note: Additive effects attributable to variants across chromosome were estimated at 1E-6 across all phenotypes. Abbreviations: DD = Dependence Diagnosis Factor; PU = Problem Use Factor; DV = Dependence Vulnerability Score.
Table 2 presents SNP-heritability and correlation estimates from the bivariate analyses. As expected, the three phenotypes had high phenotypic correlations, ranging from 0.84 to 0.86. A strong significant SNP-correlation was observed from bivariate analyses between DD and DV (rSNP = 0.97, SE = 0.08) and strong SNP-correlations were found between DD and PU (rSNP = 0.92, SE = 0.08) and between PU and DV (rG = 0.96, SE = 0.07). This suggested that the different ways of scoring substance dependence or vulnerability to substance dependence are associated with largely overlapping SNPs.
Table 2.
SNP-heritability and SNP-correlation estimates (SE).
Trait 1 | Trait 2 | VSNP-Trait 1 | VSNP-Trait 2 | CSNP | h2 SNP-Trait 1 | h2SNP-Trait 2 | rSNP | P (rSNP=0) |
---|---|---|---|---|---|---|---|---|
DD | PU | 0.16 (0.06) | 0.13 (0.07) | 0.13 (0.06) | 0.36 (0.13) | 0.24 (0.13) | 0.92 (0.08) | 0.0096 |
DD | DV | 0.16 (0.06) | 0.30 (0.12) | 0.21 (0.08) | 0.36 (0.13) | 0.33 (0.13) | 0.97 (0.05) | 0.0029 |
PU | DV | 0.13 (0.07) | 0.31 (0.12) | 0.20 (0.09) | 0.25 (0.13) | 0.35 (0.13) | 0.96 (0.07) | 0.0002 |
Note: SE = Standard Error; DD = Dependence Diagnosis Factor; PU = Problem Use Factor; DV = Dependence Vulnerability Score; VSNP-Trait = Genetic Variance of each trait; CSNP = Genetic Covariance between trait-1 and trait-2; h2SNP = SNP heritability; rSNP = Genetic Correlation
Discussion
This is the first study of its kind to estimate the SNP heritability of generalized vulnerability to substance dependence and compare the genetic liability across multiple phenotypic definitions. The results provide an indication of the additive effect of common SNPs on nicotine, cannabis, cocaine, and alcohol in particular. Given the high loadings of these substances on the DD and PU factors, the additive effect of the common SNPs on individual substance dependence/problems ranges from ∼25-36%. Although DD evidenced the highest heritability, the effects attributed to common SNPs are highly correlated across the three phenotypes. Overall, the evidence supports the utility of common SNPs to index the genetic liability to substance addiction.
Despite the paucity of findings from GWAS on substance addiction, the present analyses indicate that common SNPs on existing GWA platforms capture substantial genetic information regarding the vulnerability to substance dependence in subjects of European ancestry. The bivariate findings also confirm the expectation that different phenotypic representations of generalized substance dependence are influenced by the same SNPs. Our ability to find largely overlapping SNP effects across substances of abuse may be attributable to our analytic approach. As described and demonstrated elsewhere (36-39), relative to GWAS or candidate studies that look for the independent effects of individual SNPs, GCTA aggregates variance across all SNPs and typically explains a greater proportion of variance because the effects of individually non-significant SNPs are included in the estimate. Our results suggest that GWA studies on these phenotypes that have larger samples will discover additional significant genetic associations, as has occurred for other phenotypes (40, 41). It should be noted that additional research is needed to confirm these effects in African populations and to determine whether the observed effects can be attributed to shared genetic factors across different ethnic backgrounds.
Our findings also suggest that the subtle differences in substance vulnerability definitions may not be of major concern in genetic analyses. All three alternative measures of substance vulnerability showed broadly consistent estimates, and the high SNP-correlations among them suggest the same genetic influences affect the three phenotypes. Thus, it is likely that results of genetic analyses across these phenotypes can be meaningfully compared, and meta- or mega- analyses of datasets using alternative phenotypic definitions of substance vulnerability will be informative.
An additional implication of these findings is that we should expect to have a better chance of finding and replicating genetic association analyses (using common SNPs) across phenotypes indicative of a known underlying dimension of risk. The current findings suggest that the same SNPs affect each of these three phenotypes, and therefore genetic associations from studies across these three phenotypes can be meaningfully compared. While the current study is limited to DSM criteria, the evidence suggests the possibility of replication success across studies using non-DSM phenotypes, but that likelihood is dependent upon the extent to which the selected phenotypes reflect common biological variation. Non-DSM phenotypes were beyond the scope of the current manuscript and additional research comparing additive genetic effects across DSM and non-DSM-based phenotypes is necessary in order to obtain a definitive conclusion on expectations of replication using broad phenotypes and endophenotypes that may not necessarily harmonize. Findings of common underlying genetic contributions across different phenotypic representations of the vulnerability to substance dependence tested here may also be extended to non-genetic studies to inform models of substance etiology and treatment. This is possible because the phenotypes used here summarize involvement with disparate substances of abuse (such as alcohol versus heroin).
Limitations
The current study utilized subjects ascertained for studying alcohol (COGA), nicotine (COGEND), and cocaine (FSCD) dependence, and none of the studies excluded individuals because of their involvement with other substances. Our analyses did account for mean differences/batch effects between the different samples within SAGE by controlling for study origin in all analyses. However, our estimates are uncorrected for ascertainment because there is currently no way in GCTA to correct for ascertainment on continuous phenotypes, such as liability for substance dependence, which we focused on here. Our estimates of SNP-heritability are therefore relevant to a population with the same distribution of liability as that observed in our sample; we expect that the SNP-heritability is lower in the general (non-ascertained) population (42). Future research, using larger epidemiologic samples recruited for population representativeness rather than substance abuse is needed to replicate the present findings. However, given the low rates of comorbid drug involvement in community- and general-population samples, samples in the tens of thousands will be necessary to obtain sufficient variability across alcohol, tobacco, cannabis, and in particular illicit substances. It should also be noted that the strength of the associations among substances and the evidence for the common phenotypic factors defined by drug dependence and problem use items, may depend on the prevalence of symptoms in our sample. Larger samples of substance users might show different patterns of comorbidity. However, large studies among drug users in community- and general population samples have provided substantial evidence in support of a general vulnerability to drug dependence (18, 43). As such, one would expect to identify at least a common factor (under similar genetic influence [as evidenced in the current study]) that accounts for the majority of the phenotypic covariance and possibly one or more additional factors that reflect unique variance shared by other substances (22).
Conclusions
The present findings provide important support for a significant contribution of common SNPs in the prediction of vulnerability to substance dependence. Although our findings provided some support for highest heritability observed in DD relative to other ways of defining vulnerability to substance dependence, the associations among the definitions point to shared underlying genetic variation across all three different ways of defining vulnerability to substance dependence. Taken together, these important findings reinforce the utility of examining common variation in SNPs as well as some ability to generalize across studies using different definitions of vulnerability to substance dependence.
Supplementary Material
Acknowledgments
Funding support for the Study of Addiction: Genetics and Environment (SAGE) was provided through the NIH Genes, Environment and Health Initiative [GEI] (U01HG004422). SAGE is one of the genome-wide as association studies funded as part of the Gene Environment Association Studies (GENEVA) under GEI. Assistance with phenotype harmonization and genotype cleaning, as well as with general study coordination, was provided by the GENEVA Coordinating Center (U01 HG004446). Assistance with data cleaning was provided by the National Center for Biotechnology Information. Support for collection of datasets and samples was provided by the Collaborative Study on the Genetics of Alcoholism (COGA; U10 AA008401), the Collaborative Genetic Study of Nicotine Dependence (COGEND; P01 CA089392), and the Family Study of Cocaine Dependence (FSCD; R01 DA013423). Funding support for genotyping, which was performed at the Johns Hopkins University Center for Inherited Disease Research, was provided by the NIH GEI (U01HG004438), the National Institute on Alcohol Abuse and Alcoholism, the National Institute on Drug Abuse, and the NIH contract “High throughput genotyping for studying the gene tic contributions to human disease’ (HHSN268200782096C). The datasets used for the analyses described in this manuscript were obtained from dbGaP at http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000092.v1.p1 through dbGaP accession number phs000092.v1.p1. The views expressed in this article are those of the authors and do not necessarily reflect the position or policy of the Department of Veterans Affairs.
Role of Funding Sources: Funding for this study was provided by AA021113 (Palmer), DA023134 (Knopik), DA033302 (Bidwell), MH087240 (Nugent), MH100141 (Keller). NIAAA, NIMH, and NIDA had no role in the study design, collection, analysis, or interpretation of the data, writing the manuscript, or the decision to submit the paper for publication.
Footnotes
Declaration of interests: Contributors: Authors Rohan Palmer, Matthew Keller, John McGeary, and Valerie Knopik designed the current study approach. Authors Palmer, Nugent, and Bidwell conducted literature searches and provided summaries of previous research studies. Author Leslie Brick conducted the statistical analysis under the guidance of Drs. Palmer and Keller. Authors Palmer and Brick wrote the first draft of the manuscript and all authors contributed to and have approved the final manuscript.
Conflict of Interest: All of the listed authors declare that they have no conflicts of interests.
References
- 1.Kendler KS, Kalsi G, Holmans PA, Sanders AR, Aggen SH, Dick DM, et al. Genomewide association analysis of symptoms of alcohol dependence in the molecular genetics of schizophrenia (MGS2) control sample. Alcoholism, clinical and experimental research. 2011;35:963–975. doi: 10.1111/j.1530-0277.2010.01427.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Yang C, Li C, Kranzler HR, Farrer LA, Zhao H, Gelernter J. Exploring the genetic architecture of alcohol dependence in African-Americans via analysis of a genomewide set of common variants. Hum Genet. 2013 doi: 10.1007/s00439-013-1399-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Vrieze SI, McGue M, Iacono WG. The interplay of genes and adolescent development in substance use disorders: leveraging findings from GWAS meta-analyses to test developmental hypotheses about nicotine consumption. Hum Genet. 2012;131:791–801. doi: 10.1007/s00439-012-1167-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Yang BZ, Han S, Kranzler HR, Farrer LA, Gelernter J. A genomewide linkage scan of cocaine dependence and major depressive episode in two populations. Neuropsychopharmacology : official publication of the American College of Neuropsychopharmacology. 2011;36:2422–2430. doi: 10.1038/npp.2011.122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gelernter J, Panhuysen C, Weiss R, Brady K, Hesselbrock V, Rounsaville B, et al. Genomewide linkage scan for cocaine dependence and related traits: significant linkages for a cocaine-related trait and cocaine-induced paranoia. American journal of medical genetics Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics. 2005;136B:45–52. doi: 10.1002/ajmg.b.30189. [DOI] [PubMed] [Google Scholar]
- 6.Cornelis MC, Monda KL, Yu K, Paynter N, Azzato EM, Bennett SN, et al. Genome-wide meta-analysis identifies regions on 7p21 (AHR) and 15q24 (CYP1A2) as determinants of habitual caffeine consumption. PLoS Genet. 2011;7:e1002033. doi: 10.1371/journal.pgen.1002033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Gelernter J, Kranzler HR, Sherva R, Koesterer R, Almasy L, Zhao H, et al. Genome-Wide Association Study of Opioid Dependence: Multiple Associations Mapped to Calcium and Potassium Pathways. Biological psychiatry. 2013 doi: 10.1016/j.biopsych.2013.08.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Nishizawa D, Fukuda K, Kasai S, Hasegawa J, Aoki Y, Nishi A, et al. Genome-wide association study identifies a potent locus associated with human opioid sensitivity. Molecular psychiatry. 2014;19:55–62. doi: 10.1038/mp.2012.164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Drgon T, Zhang PW, Johnson C, Walther D, Hess J, Nino M, et al. Genome wide association for addiction: replicated results and comparisons of two analytic approaches. PLoS One. 2010;5:e8832. doi: 10.1371/journal.pone.0008832. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Yu Y, Kranzler HR, Panhuysen C, Weiss RD, Poling J, Farrer LA, et al. Substance dependence low-density whole genome association study in two distinct American populations. Hum Genet. 2008;123:495–506. doi: 10.1007/s00439-008-0501-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Biernacka JM, Geske JR, Schneekloth TD, Frye MA, Cunningham JM, Choi DS, et al. Replication of genome wide association studies of alcohol dependence: support for association with variation in ADH1C. PLoS One. 2013;8:e58798. doi: 10.1371/journal.pone.0058798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Bierut LJ. Convergence of genetic findings for nicotine dependence and smoking related diseases with chromosome 15q24-25. Trends Pharmacol Sci. 2010;31:46–51. doi: 10.1016/j.tips.2009.10.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Berrettini WH, Doyle GA. The CHRNA5-A3-B4 gene cluster in nicotine addiction. Mol Psychiatry. 2012;17:856–866. doi: 10.1038/mp.2011.122. [DOI] [PubMed] [Google Scholar]
- 14.Yang J, Lee SH, Goddard ME, Visscher PM. Genome-wide complex trait analysis (GCTA): methods, data analyses, and interpretations. Methods Mol Biol. 2013;1019:215–236. doi: 10.1007/978-1-62703-447-0_9. [DOI] [PubMed] [Google Scholar]
- 15.Diagnostic and statistical manual of mental disorders. Washington D.C: 1994. [Google Scholar]
- 16.Diagnostic and statistical manual of mental disorders. Washington, DC: American Psychiatric Association; 2000. [Google Scholar]
- 17.Administration S. A. A. M. H. S. Results from the 2010 National Survey on Drug Use and Health: Summary of National Findings. Rockville, MD: Substance Abuse and Mental Health Services Administration; 2011. HHS Publication No. (SMA) 11-4658. [Google Scholar]
- 18.Palmer RH, Young SE, Hopfer CJ, Corley RP, Stallings MC, Crowley TJ, et al. Developmental epidemiology of drug use and abuse in adolescence and young adulthood: Evidence of generalized risk. Drug Alcohol Depend. 2009;102:78–87. doi: 10.1016/j.drugalcdep.2009.01.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Kendler KS, Jacobson KC, Prescott CA, Neale MC. Specificity of genetic and environmental risk factors for use and abuse/dependence of cannabis, cocaine, hallucinogens, sedatives, stimulants, and opiates in male twins. The American journal of psychiatry. 2003;160:687–695. doi: 10.1176/appi.ajp.160.4.687. [DOI] [PubMed] [Google Scholar]
- 20.Kendler KS, Myers J, Prescott CA. Specificity of genetic and environmental risk factors for symptoms of cannabis, cocaine, alcohol, caffeine, and nicotine dependence. Arch Gen Psychiatry. 2007;64:1313–1320. doi: 10.1001/archpsyc.64.11.1313. [DOI] [PubMed] [Google Scholar]
- 21.Palmer RH, Button TM, Rhee SH, Corley RP, Young SE, Stallings MC, et al. Genetic etiology of the common liability to drug dependence: evidence of common and specific mechanisms for DSM-IV dependence symptoms. Drug and alcohol dependence. 2012;123(Suppl 1):S24–32. doi: 10.1016/j.drugalcdep.2011.12.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Rhee SH, Hewitt JK, Young SE, Corley RP, Crowley TJ, Neale MC, et al. Comorbidity between alcohol dependence and illicit drug dependence in adolescents with antisocial behavior and matched controls. Drug and alcohol dependence. 2006;84:85–92. doi: 10.1016/j.drugalcdep.2005.12.003. [DOI] [PubMed] [Google Scholar]
- 23.Palmer RH, Young SE, Corley RP, Hopfer CJ, Stallings MC, Hewitt JK. Stability and change of genetic and environmental effects on the common liability to alcohol, tobacco, and cannabis DSM-IV dependence symptoms. Behav Genet. 2013;43:374–385. doi: 10.1007/s10519-013-9599-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Button TM, Hewitt JK, Rhee SH, Young SE, Corley RP, Stallings MC. Examination of the causes of covariation between conduct disorder symptoms and vulnerability to drug dependence. Twin Res Hum Genet. 2006;9:38–45. doi: 10.1375/183242706776402993. [DOI] [PubMed] [Google Scholar]
- 25.Bucholz KK, Cadoret R, Cloninger CR, Dinwiddie SH, Hesselbrock VM, Nurnberger JI, Jr, et al. A new, semi-structured psychiatric interview for use in genetic linkage studies: a report on the reliability of the SSAGA. J Stud Alcohol. 1994;55:149–158. doi: 10.15288/jsa.1994.55.149. [DOI] [PubMed] [Google Scholar]
- 26.Kapusta ND, Plener PL, Schmid R, Thau K, Walter H, Lesch OM. Multiple substance use among young males. Pharmacol Biochem Behav. 2007;86:306–311. doi: 10.1016/j.pbb.2006.10.007. [DOI] [PubMed] [Google Scholar]
- 27.Sartor CE, Grant JD, Bucholz KK, Madden PA, Heath AC, Agrawal A, et al. Common genetic contributions to alcohol and cannabis use and dependence symptomatology. Alcoholism, clinical and experimental research. 2010;34:545–554. doi: 10.1111/j.1530-0277.2009.01120.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Iacono WG, Malone SM, McGue M. Behavioral Disinhibition and the Development of Early-Onset Addiction: Common and Specific Influences. Annual Review of Clinical Psychology. 2008;4:325–348. doi: 10.1146/annurev.clinpsy.4.022007.141157. [DOI] [PubMed] [Google Scholar]
- 29.Stallings MC, Corley RP, Hewitt JK, Krauter KS, Lessem JM, Mikulich SK, et al. A genome-wide search for quantitative trait loci influencing substance dependence vulnerability in adolescence. Drug and alcohol dependence. 2003;70:295–307. doi: 10.1016/s0376-8716(03)00031-0. [DOI] [PubMed] [Google Scholar]
- 30.Muthen LK, Muthen BO. Mplus User's Guide. Seventh. Los Angeles, CA: Muthen & Muthen: Muthen & Muthen; 1998-2012. [Google Scholar]
- 31.Hayton JC, Allen DG, Scarpello V. Factor retention decisions in exploratory factor analysis: A tutorial on parrallel analysis. Organizational Research Methods. 2004;7:191–205. [Google Scholar]
- 32.Hu LT, Bentler PM. Cutoff criteria for fit indices in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling. 1999;6:1–55. [Google Scholar]
- 33.Yu CY. Evaluation of model fit indices for latent variable models with categorical and continuous outcomes. Unpublished dissertation. 2002 Retrieved February 2, 2014, from Mplus website http://www.statmodel.com/download/Yudissertation.pdf.
- 34.SAS. SAS and all other SAS Institute Inc product or service names are registered trademarks or trademarks of SAS Institute Inc. Cary, NC, USA.: 2000. The analyses for this paper were generated using SAS software Version 9.3 of the SAS System for Windows. Copyright © 2000-2002 SAS Institute Inc. [Google Scholar]
- 35.Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D. Principal components analysis corrects for stratification in genome-wide association studies. Nature genetics. 2006;38:904–909. doi: 10.1038/ng1847. [DOI] [PubMed] [Google Scholar]
- 36.Yang J, Ferreira T, Morris AP, Medland SE, Madden PAF, Heath AC, et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat Genet. 2012;44:369–375. doi: 10.1038/ng.2213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Galarneau G, Palmer CD, Sankaran VG, Orkin SH, Hirschhorn JN, Lettre G. Fine-mapping at three loci known to affect fetal hemoglobin levels explains additional genetic variation. Nat Genet. 2010;42:1049–1051. doi: 10.1038/ng.707. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Sanna S, Li B, Mulas A, Sidore C, Kang HM, Jackson AU, et al. Fine Mapping of Five Loci Associated with Low-Density Lipoprotein Cholesterol Detects Variants That Double the Explained Heritability. PLoS Genet. 2011;7:e1002198. doi: 10.1371/journal.pgen.1002198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Vrieze SI, McGue M, Miller MB, Hicks BM, Iacono WG. Three mutually informative ways to understand the genetic relationships among behavioral disinhibition, alcohol use, drug use, nicotine use/dependence, and their co-occurrence: twin biometry, GCTA, and genome-wide scoring. Behavior genetics. 2013;43:97–107. doi: 10.1007/s10519-013-9584-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Rahmioglu N, Macgregor S, Drong AW, Hedman AK, Harris HR, Randall JC, et al. Genome-wide enrichment analysis between endometriosis and obesity-related traits reveals novel susceptibility loci. Hum Mol Genet. 2014 doi: 10.1093/hmg/ddu516. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Biological insights from 108 schizophrenia-associated genetic loci. Nature. 2014;511:421–427. doi: 10.1038/nature13595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Lee SH, Yang J, Goddard ME, Visscher PM, Wray NR. Estimation of pleiotropy between complex diseases using single-nucleotide polymorphism-derived genomic relationships and restricted maximum likelihood. Bioinformatics. 2012;28:2540–2542. doi: 10.1093/bioinformatics/bts474. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Derringer J, Krueger RF, Dick DM, Agrawal A, Bucholz KK, Foroud T, et al. Measurement invariance of DSM-IV alcohol, marijuana and cocaine dependence between community-sampled and clinically overselected studies. Addiction. 2013;108:1767–1776. doi: 10.1111/add.12187. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.