Abstract
An earlier age at onset (AAO) has been associated with greater genetic loadings in schizophrenia. This study aimed to identify modifier loci associated with an earlier AAO of schizophrenia. A genome-wide association analysis (GWAS) was conducted in 94 schizophrenia probands with the earliest AAO and 91 with the latest AAO. Candidate single nucleotide polymorphisms (SNPs) were then genotyped in the co-affected siblings and unrelated probands. Multi-SNP genetic risk scores (GRS) composed of the candidate loci were used to distinguish patients with an early or late AAO. The 14-SNP GRS could distinguish the co-affected siblings (n = 90) of the earliest probands from those (n = 91) of the latest probands. When 132 patients with an earlier AAO and 158 patients with a later AAO were included, a significant trend in the 14-SNP GRS was detected among those unrelated probands from 4 family groups with the earliest, earlier, later, and latest AAO. The overall effect of the 14 SNPs on an AAO in schizophrenia was verified using co-affected siblings of the GWAS probands and trend effect across unrelated patients. Preliminary network analysis of these loci revealed the involvement of PARK2, a gene intensively reported in Parkinson’s disease and schizophrenia research.
Introduction
Schizophrenia has long been described as a complex disorder and its aetiology remains elusive1–3. Family and twin studies have indicated that genetic factors contributed substantially to schizophrenia, with an estimated heritability of approximately 80%4. When hundreds of microsatellite markers were examined in genome-wide linkage studies (GWLS) to look for genetic variants that co-segregated with the illness within families, suggestive evidence for a dozen of linkage regions was found5. With the advent of genome-wide association studies (GWAS), in which hundreds of thousands of single nucleotide polymorphisms (SNPs) were examined, approximately 100 markers conferring small increments in risk have been found in schizophrenia6, 7.
Nevertheless, the combined variance explained by the SNPs reaching genome-wide significance remained low6. Alternatively, an application of GWAS data, the polygenic risk score (PRS), has been discovered6. The PRS, consisting of SNPs with P-values less than certain thresholds in GWAS, has been demonstrated to provide clues to functional analysis6, 8. Examination of the PRS derived from the results of large-scale GWAS6, 7 has been used to predict individuals with first-episode psychosis from controls9 and those predisposed to schizophrenia in adolescence10. The associations between the PRS and some clinical dimensions of schizophrenia11 and first-episode psychosis12 have also been reported.
In search of the genetic factors contributing to schizophrenia, another class of genes involved in the clinical manifestations of the illness, i.e., modifier genes, also needs to be considered13. Modifier loci associated with clinical features, such as the age at onset (AAO)14, 15 and positive and negative symptoms16 but not with schizophrenia susceptibility per se, have been identified using GWAS. Furthermore, our previous ordered subset linkage analysis identified a prominent linkage peak in a subset of schizophrenia patients characterized by earlier AAO17.
In this study, we aimed to identify genetic loci associated with earlier AAO of schizophrenia in multiplex families from a single ethnicity. We performed a case-only GWAS for AAO of schizophrenia in a multiplex family sample of Han Chinese in Taiwan. Individual SNPs above certain significance thresholds were further tested in two subsamples, including co-affected siblings and other unrelated patients from the same recruitment scheme. Then, a multi-SNP genetic risk score (GRS) was applied to estimate an overall effect of the candidate genetic loci. The influence of the AAO-modifying genetic loci on the susceptibility of schizophrenia was also evaluated to clarify the potential effect using additional population-based case-control and family-based association tests. Finally, candidate SNPs were subjected to gene network analysis to explore their possible functions associated with the AAO of schizophrenia.
Materials and Methods
Participants
The multiplex family samples used in this study were selected from the Taiwan Schizophrenia Linkage Study (TSLS). First-degree relatives from Han Chinese families with more than two children affected by schizophrenia were recruited across Taiwan. Detailed methods of the fieldwork have been described elsewhere18. Briefly, interviews at recruitment were performed using the Diagnostic Interview for Genetic Studies (DIGS)19, accompanied with the Family Diagnostic Interview for Genetic Studies (FIGS)20. Research assistants carried out a Chinese version of the DIGS, following standard training as described by Chen and colleagues21. The diagnosis of schizophrenia was made independently by at least two research psychiatrists based on the criteria of the fourth edition of the Diagnostic and Statistical Manual (DSM-IV), joined with the record of DIGS, FIGS, interviewer notes, and hospital anamnesis. Information regarding an AAO of the first psychotic episode was extracted from the psychosis section of DIGS, or from the medical history where necessary. Whole blood samples were collected and sent to the National Institute of Mental Health (NIMH) Repository and Genomics Resource (RGR) to be transformed into lymphoblastoid cell lines and stored. DNA samples were extracted from the cell lines and then transported to Taiwan.
The study design and selection of family samples are illustrated in Supplementary Figure S1. Among the 607 families recruited by the TSLS18, 2242 individuals from 557 multiplex families, including 1207 affected (1150 siblings, 57 parents) and 1035 unaffected (271 siblings and 764 parents) individuals, were successfully genotyped in a genome-wide linkage analysis22. Previous GWLS of these multiplex families indicated that a subset of families with early AAO led to a significant linkage signal17. Hence, we postulated that a GWAS screening based on a subsample with extreme contrast in AAO might maximize the power to identify candidate modifier genetic variants. Nevertheless, the chosen variants could then be genotyped in the remaining samples for the confirmation of the association with AAO. First, the 557 families were ranked according to the average AAO of the siblings affected by schizophrenia from each family. Second, a power evaluation of a two-group design (patients with extremely early AAO vs. patients with extremely late AAO) with 80 patients in each group reached 0.7–0.9 using Genetic Power Calculator23 under the following conditions: the minor allele frequencies of the SNPs are 0.2–0.45; the genotype relative risk for heterozygote and homozygote are 1.9–2.0 and 2.0–2.2, respectively; and the prevalence of early AAO among patients on average is 0.3–0.5, according to our empirical data. The final sample size increased from 80 to 95 per group since the microarray chips used in this study could hold 190 samples. Hence, the 95 families with the earliest average AAO and the 95 families with the latest average AAO were identified as the subsample with an extreme contrast in the AAO. One proband with an AAO conforming to the contrast from each family (i.e., earlier AAO in the extremely early subgroup and later AAO in the extremely late subgroup) was selected for GWAS genotyping. After sample quality control (discussed later), 185 probands (94 of the earliest AAO subgroup and 91 of the latest AAO subgroup) remained. SNPs with the strongest association signals (P-value < 10−4) in the association tests were identified as a set of candidate SNPs. Third, the co-affected siblings (n = 181), whose DNA material were available, from the initial 185 families were then genotyped for the consideration that their genetic background was similar to the initial probands.
Another two subgroups of schizophrenia patients were selected from the remaining families with less contrast in average AAO. However, the number of patients that could be genotyped was determined by the slots available for candidate loci genotyping after the co-affected sibling and parents of the initial 185 probands, i.e., at most 300 samples. Using similar principles in selecting one patient from each family as before, 132 patients from families of earlier AAO and 158 patients from families of later AAO were then genotyped for the candidate SNPs. Along with the 185 initial probands with extremely early AAO (n = 94) and extremely late AAO (n = 91), those unrelated schizophrenia patients could be divided into four subgroups (the earliest, earlier, later, and latest AAO) and exhibited an increasing trend of average AAO (Supplementary Figure S2). This four-group contrast provided an opportunity to examine a gradient effect of the candidate loci on AAO. The remaining 77 families with modest contrast in the average AAO were not included for the genotyping of candidate loci in consideration of the efficiency of the trend analyses.
To evaluate whether the candidate SNPs associated with AAO were also involved in susceptibility for schizophrenia, two strategies of association analysis were adopted. The first strategy was to genotype the parents (n = 288) from the initial 185 probands and then to pool them with the affected siblings to undergo family-based association test for schizophrenia. The second strategy was to conduct a case-control association analysis. Community controls free of major mental disorders were selected from the Han Chinese Cell and Genome Bank in Taiwan (HCCGB)24 using a frequency matching in age and gender distribution with a 5 to 1 ratio of controls versus cases25. A total of 925 community controls were selected to match the 185 probands with genome-wide SNP data (Supplementary Figure S1).
SNP genotyping and quality controls
The genome-wide SNP genotyping was performed on 190 schizophrenia probands of extreme contrast in AAO using the Axiom Genome-wide CHB 1 Array Plate (Affymetrix, Santa Clara, CA, USA). A total of 642,832 SNP genotypes were examined. Quality control of the samples and markers was performed according to the standard procedure for GWAS26, and was mainly performed using PLINK 1.0727. Briefly, SNP data from each proband were checked for gender inconsistency and sample genotyping call rate (>98%).
All genetic markers were checked for marker genotyping call rate (>98%), minor allele frequency (>0.05), and Hardy-Weinberg equilibrium (HWE) with a P-value of >0.001. For population stratification, or check for the homogeneity of the sample, we followed the procedures provided by PLINK. The whole-genome identical by state (IBS) distances were firstly calculated between all individuals. Four multidimensional scaling (MDS) components of each individual were then estimated using the Hamming distance. A MDS component with a significant difference in the comparison would be adjusted as a covariate in the GWAS analyses. Relatedness between probands was estimated based on pairwise kinship using the KING software28. Batch effects were also checked and adjusted. Finally, 185 probands with schizophrenia remained after the quality control measures, with 5 patients being deleted due to low call rates (n = 2), a high kinship with another proband (n = 1), or population stratification suggesting different ancestry (n = 2). A total of 564,836 SNPs remained after the quality control. The overall genotyping error was 0.42%. Afterwards, the genomic inflation factor was very small (λ = 1.0013).
Genotyping for the candidate SNPs in available co-affected siblings and parents and an independent set of schizophrenia probands with less contrast in AAO was carried out using the Illumina GoldenGate Genotyping Assay, which allows for customising probes of the SNPs of interest. The SNP genotyping of the community controls was performed by the biobank using the same microarray chip (Affymetrix Axiom Genome-wide CHB 1 Array Plate) as used for the schizophrenia probands. Only the data of the candidate SNPs in the community controls are presented in this study.
Data analyses
A two-tailed Fisher’s exact test for dichotomous variables and a two-tailed t-test for continuous variables were used for group comparisons. Multiple logistic regression with adjustment for one of the four MDS components was used for association analysis for individual SNPs between GWAS probands from families with extreme contrast in AAO. We did not include covariates such as age and gender in the regression models to increase the power when searching for new genetic loci in diseases with low prevalence like schizophrenia29. Because age and AAO of the patients in the TSLS sample (n = 1,285) were highly correlated, with a Pearson’s correlation coefficient (r) = 0.52, we did not adjust for age in subsequent association analyses in the data sets of individual genotyping. Furthermore, we did not adjust for gender in these analyses since only autosomal SNPs were included in our GWAS analysis. Multiple comparisons of the association tests on the candidate SNPs were adjusted using Bonferroni correction. A P-value lower than 7.2 × 10−8 was considered the threshold of genome-wide significance30, while a less stringent threshold of P-value < 10−4 was used to filter SNPs with the strongest association. Owing to these considerations, simple logistic regression analyses were used for the analysis of the co-affected siblings of the initial GWAS probands. Ordinal logistic regression was used to examine whether there was a linear trend in the allele frequency of a candidate SNP across schizophrenia probands from the earliest, earlier, later, and latest AAO family subgroups. Nevertheless, to exclude potential confounding by gender we also performed alternative statistic models with adjustment for gender in the analysis of co-affected siblings and the 4 groups of unrelated probands.
Multi-SNP GRS was constructed by sum of logarithm of odds ratio (logOR) multiplied by genotypes (coded as 0, 1 and 2) of the candidate SNPs with 0 for the homozygosity of major allele, 1 for the heterozygosity, and 2 for homozygosity of minor allele. The GRS were estimated using the logOR from the GWAS analysis of 185 probands and presented as the means ± standard error. Comparisons between the earliest and the latest AAO family subgroups using non-parametric Mann-Whitney U-test were performed separately among the probands and their co-affected siblings. The trend test for the GRS between the four AAO family subgroups was carried out using linear regression.
The influence of the AAO-associated SNPs on the susceptibility to schizophrenia was evaluated using both family-based and population-based designs. First, for those families with SNP genotyping information, family-based association analyses were performed using software family-based association test (FBAT) version 2.0.331, with affected status as outcome and the additive model for genotypes. Second, the schizophrenia probands with GWAS genotyping and their age- and gender-matched community controls were subjected to association analysis using multiple logistic regression with adjustment for age and gender to avoid selection bias.
The possible function of a SNP was predicted by the GoldenPath as described in the F-SNP database (http://compbio.cs.queensu.ca/F-SNP). Then, those candidate SNPs with gene annotation were subjected to network analyses using the Ingenuity Pathway Analysis (IPA; http://www.qiagen.com/ingenuity). The significance level of the gene network analysis was determined using -log10(P-value) and noted as the p-score.
Ethics statement
This study has been approved by the National Taiwan University Hospital Research Ethics Committee (NTUH-REC, No.: 201003106 R and 201411086RINC) for the analyses including all relevant details. The use of personal information and DNA samples for genotyping experiments were performed in accordance with the NTUH-REC guidelines and regulations. The sample collections of families with schizophrenia and normal controls were approved by the National Taiwan University Hospital’s Internal Review Board of Human Studies and the Internal Review Board of the Institute of Biomedical Sciences, Academia Sinica as described in the articles of the filed work18, 24.
Data Availability
The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.
Results
For the 185 families with extreme contrast in AAO, the difference in AAO for both the probands and their co-affected sibling subsamples were significant between families with the earliest AAO and those with the latest AAO (Table 1). Further, the difference in AAO for an independent set of probands (n = 290) from other families with less contrast in AAO was also significant. For those families with affected parents, the average AAO of the affected parents from families with the earliest AAO was younger than those with the latest AAO, and parental AAOs were later than their affected children, regardless of family subgroups. The distribution of gender did not differ among groups. However, the average age of the members from families with an early AAO was statistically younger than that of the families with an early AAO.
Table 1.
Extreme contrast in the AAO | Less contrast in the AAO | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
All | Earliest AAO | Latest AAO | P-value | All | Earlier AAO | Later AAO | P-value | ||||||||
Probands | n = 185 | n = 94 | n = 91 | n = 290 | n = 132 | n = 158 | |||||||||
Male, n (%) | 117 | (63%) | 53 | (56%) | 64 | (70%) | 0.06 | 192 | (66%) | 87 | (66%) | 105 | (66%) | 1 | |
Age, mean (SD) | 35.0 | (8.4) | 30.2 | (7.5) | 39.9 | (6.0) | <0.0001 | 35.1 | (7.4) | 32.2 | (7.1) | 37.6 | (6.6) | <0.0001 | |
AAO, mean (SD) | 23.7 | (9.2) | 15.1 | (1.6) | 32.6 | (3.7) | <0.0001 | 23.1 | (5.8) | 18.6 | (2.4) | 26.8 | (5.0) | <0.0001 | |
Co-affected siblings | n = 181 | n = 90 | n = 91 | ||||||||||||
Male, n (%) | 111 | (61%) | 57 | (63%) | 54 | (59%) | 0.64 | ||||||||
Age, mean (SD) | 34.8 | (8.6) | 30.6 | (7.5) | 38.9 | (7.6) | <0.0001 | ||||||||
AAO, mean (SD) | 21.4 | (4.8) | 19.6 | (3.4) | 23.1 | (5.3) | <0.0001 | ||||||||
Parents | n = 288 | n = 156 | n = 132 | ||||||||||||
Male, n (%) | 127 | (44%) | 73 | (47%) | 54 | (41%) | 0.34 | ||||||||
Age, mean (SD) | 63.1 | (9.2) | 59.3 | (8.7) | 67.7 | (7.5) | <0.0001 | ||||||||
Affected, n (%) | 21 | (7.3%) | 13 | (8.3%) | 8 | (6.1%) | 0.50 | ||||||||
AAO, mean (SD) | 32.9 | (11.0) | 28.8 | (8.7) | 43.0 | (14.8) | 0.02 |
AAO: age at onset.
Genome-wide association study
The Manhattan plot of the initial GWAS analysis among schizophrenia probands from families with extreme contrast in AAO is displayed Fig. 1, with none reaching genome-wide significance. Nevertheless, 17 SNPs reached a suggestive threshold (P-values < 10−4). Table 2 displays the names, locations, minor allele frequencies (MAF), odds ratios (OR), and P-values of the association on 17 candidate SNPs, which are located on chromosome 1, 4, 6, 7, 18, 19, and 21. All the empirical P-values of the 17 SNPs remained less than 10−4 after 108 times of permutation. Among these SNPs, 9 were mapped to the introns of 7 genes, whereas the remaining SNPs were in intergenic regions. Three intronic SNPs (rs3016537, rs7755434, and rs60117510) were mapped to the same intron in PARK2, spaced less than 300 base pairs apart with high linkage disequilibrium (LD); therefore, only one SNP (rs60117510) with the smallest P-value was included in the subsequent analyses.
Table 2.
Chr | SNP | Position | Gene | GWAS probands of extreme contrast in the AAO (n = 185) | Co-affected siblings of extreme contrast in the AAO (n = 181) | Trend of the AAO among 4 groups of probands§ (n = 475) | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
ALm | MAFE | MAFL | ORa | P-value | Empirical P-value | MAFE | MAFL | ORa | P-value | ORb | P-value | ||||
1 | rs12118952 | 81,445,881 | ADGRL2 | A | 0.25 | 0.46 | 0.38 | 3.8 × 10−5 | 1.6 × 10−5 | 0.30 | 0.40 | 0.66 | 0.06 | 0.66 | 8.9 × 10−4 |
1 | rs6426994 | 166,361,616 | FMO7P | G | 0.30 | 0.49 | 0.39 | 9.6 × 10−5 | 5.5 × 10−5 | — | — | — | — | — | — |
1 | rs609832 | 211,552,050 | RD3/SLC30A1 | G | 0.18 | 0.04 | 6.32 | 3.6 × 10−5 | 5.3 × 10−6 | 0.14 | 0.07 | 2.44 | 0.02 | 1.95 | 8.9 × 10−4 |
1 | rs2378013 | 218,778,121 | TGFB2 | A | 0.41 | 0.20 | 2.68 | 7.5 × 10−5 | 3.6 × 10−5 | 0.33 | 0.26 | 1.44 | 0.13 | 1.62 | 5.9 × 10−4 |
4 | rs923673 | 20,789,574 | KCNIP4 | C | 0.30 | 0.12 | 4.08 | 1.3 × 10−5 | 3.2 × 10−6 | 0.25 | 0.17 | 1.69 | 0.06 | 1.69 | 3.4 × 10−4 |
4 | rs11930588 | 40,179,089 | N4BP2 | T | 0.49 | 0.30 | 3.02 | 5.1 × 10−5 | 1.6 × 10−5 | 0.43 | 0.36 | 1.41 | 0.14 | 1.46 | 1.7 × 10−4 |
6 | rs2506754 | 103,548,429 | R3HDM2P2 | A | 0.30 | 0.12 | 3.94 | 1.2 × 10−5 | 1.9 × 10−6 | 0.22 | 0.18 | 1.37 | 0.25 | 1.88 | 2.6 × 10−3 |
6 | rs12210422 | 103,595,851 | R3HDM2P2 | G | 0.35 | 0.15 | 3.01 | 8.9 × 10−5 | 3.8 × 10−5 | 0.30 | 0.25 | 1.31 | 0.26 | 1.64 | 5.5 × 10−6 |
6 | rs6900852 | 125,901,926 | NCOA7 | G | 0.37 | 0.43 | 0.39 | 9.8 × 10−5 | 5.2 × 10−5 | 0.42 | 0.49 | 0.73 | 0.14 | 0.62 | 1.0 × 10−4 |
6 | rs3016537 | 161,816,611 | PARK2 | C | 0.13 | 0.30 | 0.27 | 4.1 × 10−5 | 1.6 × 10−5 | — | — | — | — | — | — |
6 | rs7755434 | 161,816,624 | PARK2 | A | 0.12 | 0.29 | 0.26 | 2.7 × 10−5 | 9.6 × 10−6 | — | — | — | — | — | — |
6 | rs60117510 | 161,816,881 | PARK2 | C | 0.12 | 0.29 | 0.25 | 2.2 × 10−5 | 9.1 × 10−6 | 0.12 | 0.26 | 0.40 | 0.001 | 0.64 | 2.5 × 10−3 |
7 | rs6964070 | 4,463,521 | SDK1/FOXK1 | G | 0.53 | 0.35 | 2.63 | 9.3 × 10−5 | 4.5 × 10−5 | 0.49 | 0.36 | 1.73 | 0.01 | 1.51 | 1.1 × 10−3 |
7 | rs1589988 | 16,227,085 | ISPD | A | 0.18 | 0.34 | 0.34 | 7.6 × 10−5 | 2.9 × 10−5 | 0.20 | 0.31 | 0.55 | 0.02 | 0.62 | 4.0 × 10−4 |
18 | rs571002 | 13,090,667 | CEP192 | T | 0.22 | 0.42 | 0.33 | 7.7 × 10−5 | 3.8 × 10−5 | 0.29 | 0.36 | 0.73 | 0.17 | 0.61 | 1.5 × 10−4 |
19 | rs371164 | 6,489,802 | DENND1C/TUBB4A | G | 0.53 | 0.40 | 2.52 | 6.3 × 10−5 | 2.9 × 10−5 | 0.49 | 0.34 | 1.98 | 0.004 | 1.53 | 4.2 × 10−4 |
21 | rs2830964 | 27,474,617 | RPL10P1/NCSTNP1 | C | 0.22 | 0.39 | 0.34 | 8.4 × 10−5 | 3.8 × 10−5 | 0.27 | 0.32 | 0.72 | 0.19 | 0.65 | 8.0 × 10−4 |
Chr: chromosome; bold: SNP within the gene; italics: SNP near the gene; ALm: minor allele; AAO: age at onset; MAFE/MAFL: minor allele frequency in patients from families with earliest/latest AAO; ORa: odds ratio from association test using logistic regression; ORb: odds ratio from trend test using ordinal logistic regression. Empirical P presented the P-values obtained after up to 108 times of permutation of the multiple logistic model. §The trend effect of each SNP was tested using ordinal logistic regression among 4 groups of probands from families with earliest, earlier, later, and latest AAO of schizophrenia.
Candidate SNPs in additional subsamples
These 15 SNPs were then genotyped in two additional samples: 1) the 181 co-affected siblings of the initial probands with extreme contrast in AAO and 2) the 290 unrelated probands from the remaining families with less contrast in average AAO (Table 2), with rs6426994 being excluded from analysis due to its low call rate. Among the co-affected sibling subsample from the families with extreme contrast in AAO, 5 SNPs exhibited significant associations (P-values < 0.05). Only the SNP on PARK2 (rs60117510 with P < 0.001) remained significant after adjustment for multiple comparisons (P-value < 0.0036 = 0.05/14).
Regarding the 290 unrelated probands from the remaining families with less contrast in the AAO, none of the 14 SNPs reached statistical significance. Nevertheless, when the patients from the initial 185 probands with extreme contrast in the AAO and the 290 probands from the remaining families with less contrast in the AAO were examined as unrelated patients from four family subgroups (earliest, earlier, later, and latest AAO), all the 14 SNP genotypes showed a significant trend for the AAO (Table 2). The results of statistical models with adjustment for gender among the co-affected siblings and the unrelated gradient AAO groups remain similar (Supplementary Table S2).
Multi-SNP genetic risk scores
To obtain a multi-SNP GRS composed of the 14 candidate SNPs, the 185 probands with GWAS data were used as the learning set, in which the logOR were used as the weighting system. The probands with earliest AAO had greater 14-SNP GRS than those with latest AAO (left panel in Fig. 2). Then, the weights of the 14 SNPs were carried over to calculate the corresponding GRS among the 181 co-affected siblings of the GWAS probands, and the comparisons of the GRS between the two types of families remained significant, although the magnitude of contrast decreased (left panel in Fig. 2). Furthermore, the 14-SNP GRS for the 290 unrelated probands from the remaining families with less contrast in the AAO were calculated and used in the analysis of the unrelated patients from four family subgroups (earliest, earlier, later, and latest AAO), a significant trend of the GRS was observed (right panel in Fig. 2). All comparisons of the 14-SNP GRS maintained the statistical significance (P < 0.0001) with adjustment for gender (data not shown).
AAO-associated SNPs and susceptibility
The relationship between the 14 AAO-associated SNPs and susceptibility to schizophrenia was analysed using two approaches (Table S1). First, the association between the 14 candidate SNPs and affliction with schizophrenia was examined using an additive model of the FBAT within the 185 multiplex families. The results showed that only 1 SNP (rs571002) had a weak association signal (P = 0.049). Second, by incorporating 925 age- and gender-matched community controls with the initial 185 schizophrenia probands, the conventional case-control association tests indicated that none of the 14 SNPs had different allele frequencies between the cases and the controls.
Network analysis
Out of the 14 candidate SNPs identified associated with an early AAO, 7 were located within introns. These 7 SNPs were then subjected to the network analysis. The results revealed 5 significant gene networks with p-score >2, including a larger network centering on PARK2 and 4 small networks consisting of N4BP2, and KCNIP4, ADGRL2, and NCOA7, respectively (Supplementary Figure S1). These 5 networks could be merged into a complicated scheme that may be involved in known diseases or biological functions, such as nervous system development, molecular transport, hereditary disorder, or cell-to-cell signaling and interaction. Detailed information on these networks is shown in Supplementary Table S3.
Discussion
In this study, we identified 14 genetic loci represented by SNPs that were associated with an earlier AAO in a series of patients affected by schizophrenia from multiplex families with Han Chinese ancestry in Taiwan. Initially, 14 SNPs with the most significant association signals (P-value < 10−4) were obtained in a GWAS analysis for AAO among schizophrenia probands from two subgroups of families with extreme differences in the AAO (earliest AAO versus latest AAO). The association of the 14 SNPs were further evaluated in the co-affected siblings of the GWAS probands as well as in the probands from the families with less contrast in the AAO. When unrelated probands from four family subgroups (earliest, earlier, later, and latest AAO) were examined, all the 14 SNPs showed a significant trend for the AAO. A multi-SNP GRS consisting of the 14 candidate SNPs was able to distinguish probands with the earliest AAO from those with the latest AAO. Then, the discriminative validity of the GRS was further confirmed in the co-affected siblings of the GWAS probands as well as the unrelated probands from the four family subgroups characterized by an average AAO.
Some issues on our study design should be noted. First, this study primarily aimed to identify modifier loci associated with an earlier AAO of schizophrenia using a case-only design rather than susceptibility loci of schizophrenia using a case-control design. Second, we selected patients from families with contrast in AAO to conduct a dichotic comparison for association analyses instead of using patient onset age as a continuous outcome. Nevertheless, the linear trends detected for individual SNPs among the four ordinary AAO subgroups (earliest, earlier, later, and latest AAO) did indicate certain dosage effects. Third, limiting our patients to a single ethnicity may help decrease potential confounding effects from population admixture. Finally, adopting the PRS typically constructed from thousands of SNPs, we estimated a multi-SNP GRS composed of the 14 SNPs and confirmed its combined effect on the modification of the AAO across our family samples. The converging results from multiple lines of evidence provide robustness to our findings.
An important feature of this study is that we used patients from multiplex families with co-affected sib-pairs to search for modifier genetic loci associated with an early AAO in schizophrenia. Both an earlier AAO and a positive family history imply greater genetic contributions in these patients with schizophrenia14, 15. Given the same number of individuals that could be genotyped using GWAS chip under the constraints of resources, comparing two groups of multiplex schizophrenia patients with extreme contrast in AAO might have better power in detecting the associated genetic variants. The remaining co-affected sibling in each family become a genetically similar group for confirming the associations derived from the initial earliest vs. the latest onset probands. Using patients with relatively homogeneity in genetic loadings might also account for our success in confirming the initial hits from the GWAS results, which is typically more difficult in regular case-control studies due to heterogeneous background6, 7. In addition, we chose a threshold less stringent than the genome-wide significance level to select suggestive loci for further investigation, which has been adopted in many post-GWAS studies of schizophrenia15, 32. Nevertheless, the associations of these suggestive loci with AAO among the other co-affected siblings as well as multiplex patients from other families with less contrast in AAO lend consistent support to the initial selection.
Further, we applied a series of samples to confirm the association of 14 candidate loci with AAO in schizophrenia. The 2 groups of unrelated probands with less contrast of AAO was pooled into the initial 185 probands to conduct a gradient characteristic, considering the use of patients from multiplex families to reduce heterogeneity. Although not all individual SNPs conformed to the prediction in the samples consisting of patients with less contrast in the AAO, all of them exhibited a significant linear trend when pooling all 4 groups of unrelated probands together. With the gradient AAO among the 4 family groups, we substantiated that the AAO-modifying loci found in a dichotic design had implied a trend effect across the whole multiplex family sample. Nevertheless, alternative statistic models with adjustment for gender did not show evident changes in the significance of the association in single SNP and multi-SNP GRS analyses.
Among the 14 genetic loci we identified as AAO-associated in schizophrenia, 7 are intronic and another 7 are intergenic. The 7 SNPs or their surrounding regions within genes have been reported to be related to neurological disorders, such as schizophrenia22, 33–37 and Parkinson’s disease38–40 in previous GWLS and GWAS. The network analysis of the 7 annotated genes revealed 5 networks, each with PARK2, N4BP2, KCNIP4, ADGRL2, or NCOA7 as its focus gene, which may have joint influence on the initiation of schizophrenia.
Though preliminary in nature, there have been clues about possible involvement of these genes in neurological diseases. First, ADGRL2 at chromosome 1p31.1 had a nonparametric linkage Z score of 2.08 for marker D1S551 related to schizophrenia in the GWLS conducted in the original TSLS families22, indicating a consistent signal in both linkage and association studies. Intriguingly, the same linkage signal on 1p31.1 was also reported in a GWLS for schizophrenia in an Indian sample of 124 affected sib-pair families33. Second, a few SNPs on KCNIP4 were suggestively associated with schizophrenia34, suicidal ideation in major depression41 and childhood ADHD42, 43 in GWAS, implying a function underlying these neuropsychiatric illnesses. Third, N4BP2 at chromosome 4p14 was reported in linkage studies of schizophrenia35 and Parkinson’s disease39. Fourth, NCOA7 was associated with Parkinson’s disease in GWAS40, and its expression was upregulated in the brain than in other human tissues44. Last but not least, mutations in PARK2 at 6q25.2-q27 have been intensively studied in Parkinson’s disease, especially those with early onset45. There is increasing evidence pointing to a potential overlapping pathogenesis between Parkinson’s disease and schizophrenia since they had opposite dopaminergic dysfunctions, similar psychotic symptoms46, and shared some gene networks47.
Furthermore, 2 of the intergenic loci (SNP rs609832 and rs2378013) were predicted to have transcriptional regulation function, although the targets remained unclear. Unlike non-synonymous variants in exons, intronic and intergenic loci may serve as markers reflecting nearby genetic variants with functional meaning in schizophrenia48. Taken together, these genetic loci may have modifier functions for an earlier AAO in schizophrenia, although biological evidence is needed for further confirmation. The results of our GWAS for AAO in schizophrenia provide potential genetic markers and functional targets for future investigation.
While many GWAS for schizophrenia have been published, only two GWAS focusing on modifier loci responsible for AAO of schizophrenia have been conducted14, 15. Using schizophrenia patients of European ancestry, these two studies did not separate patients into those with and without family history when they analysed GWAS for the AAO. Each study identified multiple genetic loci that were associated with a quantitative AAO with a nominal significance level of P-value < 10−6 but could not validate them in the replication samples. Nevertheless, the loci identified by these two studies and our study did not overlap. One explanation for the inconsistent results could be the ethnic heterogeneity among studies49. Another explanation could be the difference in the scale of AAO, i.e., our use of subgroups with extreme contrast in AAO versus the use of continuous AAO in the other two studies.
It is noteworthy that none of the 14 AAO-modifying genetic loci found in this study were reported in the 108 genetic loci for schizophrenia susceptibility reported by an international collaborative effort7. This highlights a growing recognition that modifier genes may involve mechanisms in the pathogenesis of schizophrenia that are different from those of susceptibility genes13. The discovery of modifier genetic loci may also contribute to the phenomenon of missing heritability in the GWA results of schizophrenia50.
Some limitations should be noted in this study. First, the number of probands was small in the initial GWAS for the AAO of schizophrenia. Nevertheless, using multiplex patients of single ethnic group may help increase the homogeneity of our family sample. Second, an independent sample for replication would be required for further confirmation of our results. Of note, collecting multiplex families of sib-pairs co-affected with schizophrenia would be a substantial endeavour. Third, our network analysis was limited by both the restricted annotation of SNPs located within genes and the inability to incorporate the intergenic regions in the analysis. Finally, the functions of the AAO-modifying loci warrant further investigation.
In conclusion, this study identified 14 modifier loci that are associated with the AAO of schizophrenia, and a multi-SNP GRS composed of these 14 loci can distinguish patients with an early AAO from those with a late AAO. The use of co-affected siblings and unrelated patients with a gradient AAO for confirmation of the initial GWAS provided a novel strategy to overcome a drawback of the small sample. Future investigation of these AAO-modifying genetic loci may help further illuminate the pathogenesis of schizophrenia.
Electronic supplementary material
Acknowledgements
This study was supported by grants from the National Health Research Institutes, Taiwan (NHRI-EX102-10048PI; NHRI-EX104-10432PI); Ministry of Science and Technology, Taiwan (101-2314-B-002-134-MY3; MOST 103-2325-B-002-025), and Ministry of Education, Taiwan (‘Aim for the Top University Project’ to National Taiwan University, 2011–2017). The authors thank Dr. Steve Faraone, who was the Co-PI of the Taiwan Schizophrenia Linkage Study supported by the National Institute of Mental Health, USA (1R01 MH59624-01, 1R01 MH59624-02), and other collaborators who helped in the recruitment and evaluation of schizophrenia patients, including Drs Ming-Hsien Hsieh, Tzung-Jeng Hwang, Shi K. Liu, Ching-Jui Chang, Hung-Jung Chang, Hai Ho, Ping-Ju Chang, Shi-Chin Guo, Hsien-Yuan Lane, Su-Kuan Lin, Fu-Chuan Wei, and Joseph J. and Cheng.
Author Contributions
A.W. and W.C. wrote the main manuscript text and prepared tables and figures. P.K. and Y.L. designed the study. P.H. and S.W. analysed the GWAS data. C.L., H.H., and M.T. acquired the family samples of schizophrenia. L.C., C.C., and J.W. provided the genotype data of controls. T.L. and E.C. interpreted the network analysis. All authors reviewed the manuscript.
Competing Interests
The authors report no biomedical financial interests or potential conflicts of interest.
Footnotes
Electronic supplementary material
Supplementary information accompanies this paper at doi:10.1038/s41598-017-06795-8
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Gottesman II, Shields J. A polygenic theory of schizophrenia. Proceedings of the National Academy of Sciences. 1967;58:199–205. doi: 10.1073/pnas.58.1.199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Tsuang M. Schizophrenia: genes and environment. Biological Psychiatry. 2000;47:210–220. doi: 10.1016/S0006-3223(99)00289-9. [DOI] [PubMed] [Google Scholar]
- 3.van Os J, Kapur S. Schizophrenia. The Lancet. 2009;374:635–645. doi: 10.1016/S0140-6736(09)60995-8. [DOI] [PubMed] [Google Scholar]
- 4.Sullivan PF, Kendler KS, Neale MC. Schizophrenia as a complex trait: evidence from a meta-analysis of twin studies. Archives of General Psychiatry. 2003;60:1187–1192. doi: 10.1001/archpsyc.60.12.1187. [DOI] [PubMed] [Google Scholar]
- 5.Ng MY, et al. Meta-analysis of 32 genome-wide linkage studies of schizophrenia. Molecular Psychiatry. 2009;14:774–785. doi: 10.1038/mp.2008.135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.The International Schizophrenia Consortium. Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature460, 748–752, doi:10.1038/nature08185 (2009). [DOI] [PMC free article] [PubMed]
- 7.Schizophrenia Working Group of the Psychiatric Genomics Consortium Biological insights from 108 schizophrenia-associated genetic loci. Nature. 2014;511:421–427. doi: 10.1038/nature13595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Wray NR, et al. Research review: Polygenic methods and their application to psychiatric traits. Journal of Child Psychology and Psychiatry. 2014;55:1068–1087. doi: 10.1111/jcpp.12295. [DOI] [PubMed] [Google Scholar]
- 9.Vassos E, et al. An examination of polygenic score risk prediction in individuals with first-episode psychosis. Biological Psychiatry. 2016 doi: 10.1016/j.biopsych.2016.06.028. [DOI] [PubMed] [Google Scholar]
- 10.Jones HJ, et al. Phenotypic manifestation of genetic risk for schizophrenia during adolescence in the general population. JAMA Psychiatry. 2016;73:221–228. doi: 10.1001/jamapsychiatry.2015.3058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Fanous AH, et al. Genome-wide association study of clinical dimensions of schizophrenia: polygenic effect on disorganized symptoms. American Journal of Psychiatry. 2012;169:1309–1317. doi: 10.1176/appi.ajp.2012.12020218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Sengupta SM, et al. Polygenic Risk Score associated with specific symptom dimensions in first-episode psychosis. Schizophr Res. 2016 doi: 10.1016/j.schres.2016.11.039. [DOI] [PubMed] [Google Scholar]
- 13.Fanous AH, Kendler KS. Genetic heterogeneity, modifier genes, and quantitative phenotypes in psychiatric illness: searching for a framework. Molecular Psychiatry. 2005;10:6–13. doi: 10.1038/sj.mp.4001571. [DOI] [PubMed] [Google Scholar]
- 14.Wang KS, Liu X, Zhang Q, Aragam N, Pan Y. Genome-wide association analysis of age at onset in schizophrenia in a European-American sample. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics. 2011;156B:671–680. doi: 10.1002/ajmg.b.31209. [DOI] [PubMed] [Google Scholar]
- 15.Bergen SE, et al. Genetic modifiers and subtypes in schizophrenia: investigations of age at onset, severity, sex and family history. Schizophrenia Research. 2014;154:48–53. doi: 10.1016/j.schres.2014.01.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Edwards AC, et al. Meta-analysis of positive and negative symptoms reveals schizophrenia modifier genes. Schizophrenia Bulletin. 2016;42:279–287. doi: 10.1093/schbul/sbv119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Lien YJ, et al. A genome-wide linkage scan for distinct subsets of schizophrenia characterized by age at onset and neurocognitive deficits. Public Library of Science One. 2011;6:e24103. doi: 10.1371/journal.pone.0024103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Hwu HG, et al. Taiwan schizophrenia linkage study: the field study. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics. 2005;134B:30–36. doi: 10.1002/ajmg.b.30139. [DOI] [PubMed] [Google Scholar]
- 19.Nurnberger, J. I. Jr. et al. Diagnostic interview for genetic studies. Rationale, unique features, and training. NIMH Genetics Initiative. Archives of General Psychiatry51, 849–859, discussion 863–844 (1994). [DOI] [PubMed]
- 20.NIMH Genetics Initiative. Family Interview for Genetic Studies. (National Institute of Mental Health, 1992).
- 21.Chen WJ, et al. Sustained attention deficit and schizotypal personality features in nonpsychotic relatives of schizophrenic patients. American Journal of Psychiatry. 1998;155:1214–1220. doi: 10.1176/ajp.155.9.1214. [DOI] [PubMed] [Google Scholar]
- 22.Faraone SV, et al. Genome scan of Han Chinese schizophrenia families from Taiwan: confirmation of linkage to 10q22.3. American Journal of Psychiatry. 2006;163:1760–1766. doi: 10.1176/ajp.2006.163.10.1760. [DOI] [PubMed] [Google Scholar]
- 23.Purcell S, Cherny SS, Sham PC. Genetic Power Calculator: design of linkage and association genetic mapping studies of complex traits. Bioinformatics. 2003;19:149–150. doi: 10.1093/bioinformatics/19.1.149. [DOI] [PubMed] [Google Scholar]
- 24.Pan WH, et al. Han Chinese cell and genome bank in Taiwan: purpose, design and ethical considerations. Human Heredity. 2006;61:27–30. doi: 10.1159/000091834. [DOI] [PubMed] [Google Scholar]
- 25.Nsengimana J, Bishop DT. Design considerations for genetic linkage and association studies. Methods in Molecular Biology. 2012;850:237–262. doi: 10.1007/978-1-61779-555-8_13. [DOI] [PubMed] [Google Scholar]
- 26.Turner, S. et al. Unit 1.19 Quality control procedures for genome-wide association studies. Current Protocols in Human Genetics doi:10.1002/0471142905.hg0119s68 (Wiley, 2011). [DOI] [PMC free article] [PubMed]
- 27.Purcell S, et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. American Journal of Human Genetics. 2007;81:559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Manichaikul A, et al. Robust relationship inference in genome-wide association studies. Bioinformatics. 2010;26:2867–2873. doi: 10.1093/bioinformatics/btq559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Pirinen M, Donnelly P, Spencer CC. Including known covariates can reduce power to detect genetic effects in case-control studies. Nature Genetics. 2012;44:848–851. doi: 10.1038/ng.2346. [DOI] [PubMed] [Google Scholar]
- 30.Dudbridge F, Gusnanto A. Estimation of significance thresholds for genomewide association scans. Genetic Epidemiology. 2008;32:227–234. doi: 10.1002/gepi.20297. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Laird NM, Lange C. Family-based methods for linkage and association analysis. Advances in Genetics. 2008;60:219–252. doi: 10.1016/S0065-2660(07)00410-5. [DOI] [PubMed] [Google Scholar]
- 32.Ikeda M, et al. Genome-wide association study of schizophrenia in a Japanese population. Biological Psychiatry. 2011;69:472–478. doi: 10.1016/j.biopsych.2010.07.010. [DOI] [PubMed] [Google Scholar]
- 33.Holliday EG, et al. Strong evidence for a novel schizophrenia risk locus on chromosome 1p31.1 in homogeneous pedigrees from Tamil Nadu, India. American Journal of Psychiatry. 2009;166:206–215. doi: 10.1176/appi.ajp.2008.08030442. [DOI] [PubMed] [Google Scholar]
- 34.Sullivan PF, et al. Genomewide association for schizophrenia in the CATIE study: results of stage 1. Molecular Psychiatry. 2008;13:570–584. doi: 10.1038/mp.2008.25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Hong KS, et al. Genome-widely significant evidence of linkage of schizophrenia to chromosomes 2p24.3 and 6q27 in an SNP-Based analysis of Korean families. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics. 2009;150B:647–652. doi: 10.1002/ajmg.b.30884. [DOI] [PubMed] [Google Scholar]
- 36.Mukherjee O, et al. Evidence of linkage and association on 18p11.2 for psychosis. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics. 2006;141B:868–873. doi: 10.1002/ajmg.b.30363. [DOI] [PubMed] [Google Scholar]
- 37.Alkelai A, et al. Identification of new schizophrenia susceptibility loci in an ethnically homogeneous, family-based, Arab-Israeli sample. The FASEB Journal. 2011;25:4011–4023. doi: 10.1096/fj.11-184937. [DOI] [PubMed] [Google Scholar]
- 38.Lucking CB, et al. Association between early-onset Parkinson’s disease and mutations in the parkin gene. The New England Journal of Medicine. 2000;342:1560–1567. doi: 10.1056/NEJM200005253422103. [DOI] [PubMed] [Google Scholar]
- 39.Grimes DA, Bulman DE. Parkinson’s genetics–creating exciting new insights. Parkinsonism and Related Disorders. 2002;8:459–464. doi: 10.1016/S1353-8020(02)00030-5. [DOI] [PubMed] [Google Scholar]
- 40.Nguyen TT, Huang J, Wu Q, Nguyen T, Li M. Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests. BMC Genomics. 2015;16(Suppl 2):S5. doi: 10.1186/1471-2164-16-S2-S5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Perroud N, et al. Genome-wide association study of increasing suicidal ideation during antidepressant treatment in the GENDEP project. Pharmacogenomics Journal. 2012;12:68–77. doi: 10.1038/tpj.2010.70. [DOI] [PubMed] [Google Scholar]
- 42.Neale BM, et al. Population differences in the International Multi-Centre ADHD Gene Project. Genetic Epidemiology. 2008;32:98–107. doi: 10.1002/gepi.20265. [DOI] [PubMed] [Google Scholar]
- 43.Lasky-Su J, et al. Family-based association analysis of a statistically derived quantitative traits for ADHD reveal an association in DRD4 with inattentive symptoms in ADHD individuals. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics. 2008;147B:100–106. doi: 10.1002/ajmg.b.30567. [DOI] [PubMed] [Google Scholar]
- 44.Shao W, Halachmi S, Brown M. ERAP140, a conserved tissue-specific nuclear receptor coactivator. Molecular and Cellular Biology. 2002;22:3358–3372. doi: 10.1128/MCB.22.10.3358-3372.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Bruggemann, N. & Klein, C. Parkin Type of Early-Onset Parkinson Disease. 2001 April 17 [Updated 2013 April 4]. In: Pagon, R. A. et al., editors. GeneReviews® [Internet] (University of Washington, Seattle, 1993-2017).
- 46.Chang A, Fox SH. Psychosis in Parkinson’s Disease: Epidemiology, Pathophysiology, and Management. Drugs. 2016;76:1093–1118. doi: 10.1007/s40265-016-0600-5. [DOI] [PubMed] [Google Scholar]
- 47.Birtwistle, J. & Baldwin, D. Role of dopamine in schizophrenia and Parkinson’s disease. British Journal of Nursing7, 832–834, 836, 838–841, doi:10.12968/bjon.1998.7.14.5636 (1998). [DOI] [PubMed]
- 48.Roussos P, et al. A role for noncoding variation in schizophrenia. Cell Reports. 2014;9:1417–1429. doi: 10.1016/j.celrep.2014.10.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.McClellan J, King MC. Genetic heterogeneity in human disease. Cell. 2010;141:210–217. doi: 10.1016/j.cell.2010.03.032. [DOI] [PubMed] [Google Scholar]
- 50.Gershon ES, Alliey-Rodriguez N, Liu C. After GWAS: searching for genetic risk for schizophrenia and bipolar disorder. American Journal of Psychiatry. 2011;168:253–256. doi: 10.1176/appi.ajp.2010.10091340. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.