Abstract
Background.
Obesity and diabetes are major modifiable risk factors for pancreatic cancer. Interactions between genetic variants and diabetes/obesity have not previously been comprehensively investigated in pancreatic cancer at the genome-wide level.
Methods.
We conducted a gene-environment interaction (GxE) analysis including 8,255 cases and 11,900 controls from four pancreatic cancer GWAS datasets (PanScan I-III and PanC4). Obesity (BMI≥30 kg/m2) and diabetes (duration ≥ 3 years) were the environmental variables of interest. Approximately 870,000 SNPs (MAF ≥ 0.005, genotyped in at least one dataset) were analyzed. Case-control (CC), case-only (CO), and joint-effect test methods were used for SNP-level GxE analysis. As a complementary approach, gene-based GxE analysis was also performed. Age, sex, study site and principal components accounting for population substructure were included as covariates. Meta-analysis was applied to combine individual-GWAS summary statistics.
Results.
No genome-wide significant interactions (departures from a log-additive odds model) with diabetes or obesity were detected at the SNP level by the CC or CO approaches. The joint-effect test detected numerous genome-wide significant GxE signals in the GWAS main effects top hit regions but the significance diminished after adjusting for the GWAS top hits. In the gene-based analysis, a significant interaction of diabetes with variants in the FAM63A (family with sequence similarity 63 member A) gene (significance threshold P<1.25 × 10−6) was observed in the meta-analysis (PGxE= 1.2 × 10−6, PJoint= 4.2 × 10−7).
Conclusions.
Our current analyses did not find significant GxE interactions at the SNP level but found one significant interaction with diabetes at the gene level. A larger sample size might unveil additional genetic factors via GxE scans.
Impact.
This study may contribute to discovering the mechanism of diabetes-associated pancreatic cancer.
Keywords: Gene-environment interaction, Obesity, Diabetes, GWAS, Pancreatic cancer
Introduction
Pancreatic cancer is the third leading cause of cancer death, accounting for more than 47,000 deaths each year in the United States (1). It is a highly lethal disease with a 5-year survival rate of 9% (2). Epidemiological studies have shown that 20–25% of pancreatic cancer cases are attributable to cigarette smoking (3). However, the incidence of pancreatic cancer has been rising slightly each year in the United States since 2002; this is unexpected given the decreasing prevalence of cigarette smoking and may be due to the rising prevalence of obesity and diabetes. Accumulating evidence suggests that obesity and long-term type II diabetes are associated with increased risk of pancreatic cancer. For example, a pooled analysis of 14 cohort studies of body mass index (BMI) has shown that obesity (BMI ≥30 kg/M2) was associated with 47% (95% confidence interval of 23%−75%) increased risk of pancreatic cancer (4). A meta-analysis of 23 cohort or case-control studies suggests that the association between BMI and pancreatic cancer is not linear (5). At least four meta-analyses of large datasets from cohort and case-control studies have shown that long-term diabetes was associated with a 1.5- to 2-fold increased risk of pancreatic cancer (6–9). Because only a small portion of obese and diabetic individuals develop pancreatic cancer, understanding how genetic factors affect risk among those individuals could inform targeted interventions or screening. Identifying variants that are only associated with risk of cancer (or have stronger associations) among obese or diabetic individuals is of particular interest.
Genome-wide association studies (GWAS) conducted by the Pancreatic Cancer Cohort Consortium (PanScan) and Pancreatic Cancer Case Control Consortium (PanC4) have identified 21 genetic loci and chromosome regions significantly associated with the risk of pancreatic cancer (10–15). However, these findings explain limited heritability of the disease, i.e, the established GWAS loci explain 2.1% of the heritability of pancreatic cancer in contrast to the estimated heritability of 36% from a large population-based twin study (13,16). Beyond main effects, some genetic factors may contribute to the risk of pancreatic cancer only in the presence of specific risk factors for the disease such as obesity and diabetes, i.e., gene-obesity/diabetes interaction, and broadly referred as gene-environment interaction (GxE) herein. Therefore, a genome-wide GxE scan may help find the missing heritability of pancreatic cancer. Several of the susceptibility genes identified by GWAS, i.e. NR5A2, PDX1, HNF1B, and HNF4G, are important for pancreas development (17). These genes are important components of the transcriptional networks governing embryonic pancreatic development and differentiation as well as maintaining pancreatic homeostasis. Mutations in some of these genes are responsible for maturity onset diabetes of the young (MODY) and common variants of these genes have been associated with BMI and risk of type II diabetes in GWAS (17). Therefore, in addition to their roles in regulating the development and function of the pancreas these genes may contribute to pancreatic cancer, partially through an increased risk of obesity and diabetes. Whether these genes and other unidentified genes have an interactive action with obesity and diabetes in modifying the risk of pancreatic cancer is the focus of the current investigation.
We have previously performed GxE analyses at SNP/gene/pathway-levels using GWAS data from 2,028 cases and 2,109 controls from PanScan I and II. No significant interactions at the SNP or gene levels were observed for diabetes or obesity. At the pathway level, NF-kappaB-mediated chemokine signaling and axonal guidance signaling pathway, respectively, were identified as the top pathway interacting with obesity and smoking in modifying the risk of pancreatic cancer (18,19). These studies were limited by the small sample size, and underpowered for genome-wide GxE analysis (20). To address this limitation, the current analysis is conducted in a much larger combined dataset of PanScan I-III and PanC4 with 8,255 cases and 11,900 controls and further leverages recently developed, more powerful SNP-set/gene-based GxE tests (21,22), to discover novel genetic variants that may modify the association between diabetics/obesity and pancreatic cancer.
Materials and Methods
Study population and datasets
The present genome-wide GxE study includes 8255 cases and 11900 controls of European-ancestry drawn from the PanScan and PanC4 consortia. Cases were patients with known or presumed primary pancreatic ductal adenocarcinoma (ICD-O-3 code C250-C259) and controls were free of pancreatic cancer. Individual studies were approved by the respective institutional review board following the institution’s requirement. Written informed consent was obtained from each study participant. The approaches for data harmonization and meta-analysis were approved by the University of Texas MD Anderson Cancer Center Institutional Review Board.
Genotype data were generated in four previously reported GWAS studies, i.e. PanScan I, II, III and PanC4, and the details of these studies have previously been described (10–13). Genotyping in PanScan I, II, and III was conducted at the Cancer Genomics Research Laboratory (CGR) of the National Cancer Institute (NCI) of the National Institutes of Health (NIH) using the Illumina HumanHap550 Infinium II, Human 610-Quad, and OmniExpress series arrays, respectively. PanC4 employed the HumanOmniExpressExome-8v1 array. Because different genotyping platforms were used in these studies, missing genotypes were imputed using the University of Michigan imputation server (https://imputationserver.sph.umich.edu/index.html) with the Haplotype Reference Consortium (23) as the reference panel or IMPUTE2 with the 1000 Genomes Phase 3 as the reference panel (https://mathgen.stats.ox.ac.uk/impute/impute_v2.html). After imputation, SNPs that were identified by imputation only (not genotyped in any of the four GWAS), having minor allele frequency (MAF) ≤ 0.005, imputation quality score < 0.3, or Hardy-Weinberg equilibrium test p value <1 × 10−6 in controls were excluded; a total of about 870,000 common SNPs to all four studies were included in the current GxE analysis. The PanScan (I, II, and III) and PanC4 GWAS data are available through dbGaP (accession numbers phs000206.v5.p3 and phs000648.v1.p1, respectively).
Exposure variables
The exposure variables considered in this GxE analysis were obesity (BMI≥30 kg/m2 vs. <30 kg/m2) and diabetes (diabetes with ≥3-years of duration vs. non-diabetes). Because diabetes could be a manifestation of occult pancreatic cancer, we excluded diabetes with a short duration (< 3 years) for studies with diabetes duration information to control reverse causality. Covariates for adjustment included age (continuous), sex, study sites and principal components accounting for population substructure. The distribution of demographics and risk factors of participants in each GWAS study included in the current analysis is summarized in Table S1.
Statistical analyses
We applied case-control (CC), case-only (CO), and 2 degrees-of-freedom (df) joint-effect test (24) methods at the SNP level, and the “rareGE” method (21) at the gene level in the genome-wide GxE scan. The 2-df joint-effect test is more powerful in detecting a susceptibility SNP in the presence of strong genetic main effect (SNP), strong interaction effect (SNPxE), or a combination of weak/moderate main and interaction effects (SNP + SNPxE). Thus, the joint-effect test is a useful complementary approach to CC, CO and single-SNP marginal association analysis in identifying disease susceptible loci (20).
The PanScan I-III and PanC4 datasets were analyzed individually using the CC, CO, and joint-effect test at the SNP level. The “rareGE” method was used for gene-based G x E analysis. The summary statistics for each consortium were then subjected to meta-analysis.
SNP-level tests
In order to perform SNP-level analysis, we ran the logistic regression model as follows:
| [1] |
where Y is the disease status (1 for case; 0 for control), β0 is the intercept, E is the exposure variable of interest (diabetes or obesity), SNP is the dosage of the genetic variant of interest, coded additively accounting for genotype imputation uncertainty (ranging from 0 to 2), C is the vector of all covariates including age (continuous), sex, study indicators, principal components accounting for population substructures, and either diabetes or BMI (e.g., diabetes serves as the exposure of interest with BMI (continuous) included in the covariate vector). For the CC study design, the null hypothesis to be tested is H0: βGE =0. is referred as the interaction odds ratio (OR).
Joint-effect analysis of SNP and SNP*E were run using the approach by Aschard et al (25) by testing the null hypothesis H0: βG = βGE =0, derived from model [1] with a 2-df χ2 Wald test. For the CO study design, a logistic regression model was run in the case group only as follows:
| [2] |
where the coefficients in model [2] are denoted the same as those in model [1].
Gene-level tests
Gene regions were defined according to coordinates of the hg19 assembly, retrieved from the University of California, Santa Cruz (UCSC) Genome Browser (26). About 22,300 genes were downloaded from UCSC server, of which ~20, 000 genes covering ≥ 2 GWAS genotyped SNPs were analyzed in the current study.
We performed gene-based GxE analysis using the “rareGE” method (21) based on common SNPs (MAF ≥ 0.005, located within 20 kb upstream or downstream of a given gene). For a gene with p SNPs, the full model is as follows:
| [3] |
where βGj and βGEj are the regression coefficients for the genetic main effect and GxE effect for the jth SNP, respectively.
Two tests were implemented in the “rareGE” R package: G x E test with genetic main effects estimated as random effects (PInt) under the null hypothesis of no G x E, i.e., H0: βGE1 = βGE2 = ⋯ = βGEp = 0, and a joint test of G and GxE (PJoint) with H0: βG1 = βG2 = ⋯ = βGp = 0 and βGE1 = βGE2 = ⋯ = βGEp = 0 analogous to the 2-df SNP-level joint-effect test.
Meta analyses
We applied fixed-effects meta-analysis in METAL to combine SNP-level GxE results from the CC or CO method across individual consortia (27). Fisher’s meta-analysis was used to combine gene-level G x E p-values from the rareGE method (28).
Statistical thresholds
All tests were two sided. We consider P < 2.5 × 10−8 and P < 1.25 × 10−6 as genome-wide significant at the SNP- and gene-level, respectively (29), for each individual study and each meta-analysis, adjusted for 1 million SNPs, 20,000 genes and 2 exposures of interest by the Bonferroni correction at family-wise error rate of 0.05. P < 5.0 × 10−2 was considered as nominally significant for all analyses.
Statistical power estimation
We used the QUANTO software (version 1.2.4) (30) to perform power estimation for the current GxE scans. With 8,255 cases and 11,900 controls we had 80% power to detect an interaction OR of 1.5 and 1.6, respectively, for obesity (main effect OR = 1.2 with 20% prevalence in controls based on Table S1) and diabetes (main effect OR = 1.7 with 10% prevalence in controls based on Table S1) for a SNP with MAF of 20% at a significance level of 2.5 × 10−8 by the standard CC test.
Results
We examined the GxE (obesity and diabetes) interactions at the SNP level using the CC, CO and joint tests in each individual GWAS first, followed by meta-analysis of the summary statistics. Figure S1 shows the quantile-quantile (Q-Q) plots for the CC and CO meta-analyses. There was no discernable abnormal behavior in the QQ plots for CC and CO study designs (genomic control λ ranged from 0.942 to 1.023). QQ plots also performed well for meta-analysis of joint-effect tests (λs: 0.94–1.045).
Case-control (CC) and case-only (CO) analyses
No signal at a genome-wide threshold of significance (P < 2.5 × 10−8) was detected in CC or CO analyses on interactions of gene with diabetes or obesity. Using the CC approach, four SNPs on chromosomes 10, 18 and 20 showed evidence of interactions with diabetes at near genome-wide significance (P <1 × 10−6) and six SNPs on chromosomes 2, 7, 11 and 16 showed weaker evidence of interactions with obesity (P < 1 ×10−5) (Table 1). By the CO approach, four SNPs on chromosome 3 and chromosome 10 showed evidence of interactions with diabetes at near genome-wide significance (P < 1 × 10−6, Table 2). Of these, two SNPs (rs12255372 and rs7901695) were near TCF7L2 and are in linkage disequilibrium (LD) (r2=0.74 and 0.87, respectively) with the lead SNP from a recent GWAS of Type 2 diabetes (rs7903146; P=1×10−347) (31). Thus, the CO signals for these two SNPs likely reflect violations of the gene-environment independence assumption rather than evidence for gene-environment interaction. In addition, five SNPs on chromosomes 4, 8, 14 and 17 had possible interactions with obesity at a P value < 1 ×10−5 (Table 2). In addition, no significant across-consortium heterogeneity was found for the meta-analysis results in Table 1 and 2 (all heterogeneity test P-value > 0.05).
Table 1.
Top SNPs interacting with diabetes and obesity (Case-Control)
| SNP | Chr. | Position | Genea | Effect/Ref Allele | MAFb | Meta analysis | |
|---|---|---|---|---|---|---|---|
| OR (95% CI) | P | ||||||
| Diabetes | |||||||
| rs7505930 | 18 | 4092001 | *ROCK1P1-SLC35G4 | G/A | 0.35 | 1.60 (1.34–1.91) | 1.9E-07 |
| rs2777534 | 10 | 34109601 | *GTPBP4-FGF8 | A/G | 0.12 | 2.04 (1.56–2.67) | 2.3E-07 |
| rs2812656 | 10 | 34116863 | *GTPBP4-FGF8 | G/A | 0.12 | 0.50 (0.38–0.65) | 2.4E-07 |
| rs11086650 | 20 | 57183256 | APCDD1L_AS1 | C/T | 0.32 | 0.61 (0.51–0.74) | 5.8E-07 |
| Obesity | |||||||
| rs7802442 | 7 | 22736446 | *COX19-SLC12A9 | C/A | 0.31 | 0.73 (0.65–0.83) | 1.2E-06 |
| rs4298423 | 7 | 151643909 | PRKAG2_AS1-GALNTL5* | A/G | 0.34 | 1.34 (1.19–1.51) | 2.3E-06 |
| rs559449 | 11 | 55340379 | OR4C16 | A/G | 0.45 | 1.31 (1.17–1.47) | 3.6E-06 |
| rs7608326 | 2 | 37903390 | *GRHL1-CHST10 | C/T | 0.07 | 0.51 (0.38–0.68) | 4.2E-06 |
| rs759831 | 16 | 82863660 | CDH13 | A/C | 0.31 | 1.32 (1.17–1.49) | 5.5E-06 |
| rs1476483 | 7 | 22731199 | *COX19-SLC12A9 | G/A | 0.20 | 0.72 (0.62–0.83) | 8.5E-06 |
Abbreviations: Chr. for chromosome; MAF for minor allele frequency; OR (95%CI) for interaction odds ratio (95% confidence interval);
Gene region was defined by the UCSC Genome Browser;
marks the nearest gene to the SNP.
Derived from the PanC4 dataset.
Table 2.
Top interaction signals from case-only analyses
| SNP | Chr. | Position | Genea | Eff/Ref Allele | MAFb | PCO |
|---|---|---|---|---|---|---|
| Diabetes | ||||||
| rs608841 | 3 | 138764229 | *PRR23C-BPESC1 | G/A | 0.24 | 1.6E-07 |
| rs696638 | 3 | 138775377 | *PRR23C-BPESC1 | A/G | 0.16 | 2.2E-07 |
| rs12255372 | 10 | 114808902 | TCF7L2 | A/C | 0.28 | 2.3E-07 |
| exm-rs7903146 | 10 | 114758349 | TCF7L2 | A/G | 0.29 | 4.9E-07 |
| Obesity | ||||||
| rs2018572 | 17 | 11599798 | BHLHA9-DNAH9* | G/A | 0.19 | 1.3E-07 |
| rs4791473 | 17 | 11574959 | BHLHA9-DNAH9* | G/T | 0.18 | 1.9E-06 |
| rs4413478 | 4 | 48491651 | SLC10A4-ZAR1* | A/G | 0.25 | 2.8E-06 |
| rs925611 | 8 | 9768690 | OR4F21-C8orf49* | T/G | 0.097 | 3.2E-06 |
| rs961044 | 14 | 87608094 | *LOC283585-GALC | G/A | 0.14 | 6.9E-06 |
Abbreviations: Chr. for chromosome; MAF for minor allele frequency; PCO for case-only test p-value;
Gene region was defined by the UCSC Genome Browser;
marks the nearest gene to the SNP.
Derived from the PanC4 dataset.
2-df joint-effect test
Meta-analysis of joint-effect tests for SNP and SNP x diabetes or SNP x obesity detected numerous genome-wide significant signals that are all located in the chromosome regions containing previously identified GWAS top hits (Table S2). Conditional analysis adjusting for the GWAS top hits in each region resulted in null findings, indicating that joint effect test signals were all driven by the strong main effects of the SNPs.
Gene-level G x E analysis
Possible interactions of nine genes with diabetes and three genes with obesity at a meta-analysis significance level of P < 1 × 10−4 in at least one of the interaction-only and joint tests are listed in Table 3. Among these genes, a significant (P <1.25 × 10−6) interaction of diabetes with FAM63A (family with sequence similarity 63 member A) gene was observed in the meta-analysis (PInteraction = 1.2 × 10−6, PJoint = 4.2 × 10−7) (Table 3). The SNPs contributing to this gene are listed in Table S3. No individual SNP of this gene showed a significant interaction with diabetes.
Table 3.
Top genes interacting with diabetes and obesity by rareGE method
| Gene | Chr. | Meta | PanScan I | PanScan II | PanScan III | PanC4 | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| PInt | PJoint | PInt | PJoint | PInt | PJoint | PInt | PJoint | PInt | PJoint | ||
| Diabetes | |||||||||||
| FAM63A | 1q21.3 | 1.2E-6* | 4.2E-7* | 3.8E-2 | 6.8 E-2 | 0.024 | 0.04 | 2.2E-4 | 8.8E-6 | 3.3E-3 | 8.1E-3 |
| CLTCL1 | 22q11.21 | 1.5E-4 | 5.2E-4 | 0.85 | 0.98 | 4.9E-3 | 0.01 | 0.77 | 0.95 | 6.0E-5 | 1.0E-4 |
| MIR561 | 2q32.1 | 4.1E-5 | 6.6E-4 | 9.3E-4 | 1.7E-3 | 0.043 | 8.5E-2 | 8.1E-3 | 3.5E-2 | 0.13 | 0.25 |
| GNG2 | 14q22.1 | 3.4E-5 | 1.1E-3 | 0.76 | 0.66 | 3.4E-3 | 7.9E-3 | 2.5E-5 | 5.9E-4 | 0.51 | 0.76 |
| ADA | 20q13.12 | 6.8E-5 | 4.6E-4 | 0.14 | 0.27 | 1.8E-3 | 3.7E-3 | 0.47 | 0.6 | 0.97 | 0.38 |
| TP53I3 | 2p23.2 | 7.0E-5 | 1.7E-3 | 0.31 | 0.52 | 0.28 | 0.36 | 2.0E-6 | 3.0E-5 | 0.46 | 0.71 |
| SF3B14 | 2p23.3 | 6.9E-5 | 1.9E-3 | 0.31 | 0.52 | 0.28 | 0.36 | 2.0E-6 | 3.6E-5 | 0.45 | 0.7 |
| DCAF6 | 1q24.1 | 2.7E-2 | 1.6E-5 | 2.4E-2 | 0.05 | 0.49 | 2.7E-5 | 0.21 | 7.3E-2 | 0.07 | 0.14 |
| OR6K2 | 1q23.1 | 3.3E-6 | 4.0E-3 | 6.4E-2 | 0.12 | 0.037 | 7.3E-2 | 2.0E-6 | 2.9E-3 | 0.45 | 0.48 |
| MIR4457 | 5p15.33 | 0.57 | 9.9E-6 | 0.14 | 2.9E-2 | 0.93 | 0.61 | 0.61 | 5.8E-4 | 0.44 | 7.6E-4 |
| Obesity | |||||||||||
| CDC42EP3 | 2p22.2 | 3.40E-04 | 2.10E-05 | 0.18 | 0.17 | 0.62 | 4.3E-3 | 5.50E-02 | 1.70E-01 | 9.10E-05 | 1.5E-4 |
| FSD1L | 9q31.2 | 6.50E-02 | 3.60E-05 | 4.2E-2 | 0.066 | 6.1E-2 | 0.13 | 8.80E-01 | 8.70E-06 | 2.80E-01 | 0.48 |
| MIR4457 | 5p15.33 | 3.10E-01 | 5.10E-06 | 0.45 | 0.0045 | 0.82 | 0.32 | 6.20E-01 | 6.50E-03 | 4.00E-02 | 3.8E-4 |
Abbreviations: Chr. for chromosome; PInt, PJoint for p values, respectively, derived from random-effect GxE interaction test and joint-effect test.
Genome-wide significant p-values (<1.25E-6)
Discussion
In this genome-wide G x obesity/diabetes interaction study of pancreatic cancer, no significant departures from a log-linear odds model at the SNP level were identified by the CC or CO approaches. In the gene-based analysis, a significant interaction between variants in the FAM63A gene and diabetes was observed.
FAM63A (family with sequence similarity 63, member A), also known as MINDY-1 (MINDY lysine 48 deubiquitinase 1) is a member of an evolutionarily conserved and structurally distinct family of deubiquitinating enzymes (32), which specifically cleaves K48-linked poly-ubiquitin chain to regulate protein degradation. This distinct deubiquitinase class localizes to DNA lesions, where it plays an important role in genome stability pathways, functioning to prevent spontaneous DNA damage and to promote cellular survival in response to exogenous DNA damage (33). Previous GWAS have associated FAM63A or FAM63A homolog gene variants with risk of primary rhegmatogenous retinal detachment (34) and chronic renal disease (35). Genetic analysis of a diabetes-prone mouse strain has revealed gene regions homologous to FAM63A contributing to diabetes susceptibility (36). Although the role of FAM63A in pancreatic cancer is unknown at present, the observed interaction with diabetes deserves further investigation.
Genome-wide GxE analysis has unique challenges compared with genetic main effects analysis in GWAS. First, GxE analysis requires a much larger sample size to detect a realistic interaction OR than does a GWAS scan for a comparable main effect OR (20,37), largely explaining why few positive findings have been reported in GxE studies (38–40). For example, the current GxE scan with 8,255 cases and 11,900 controls, even though about four times as large as our previous G x obesity/diabetes interaction analysis (18), had 80% power to detect an interaction OR of 1.5 and 1.6, respectively, for obesity and diabetes for a SNP with MAF of 20% at a significance level of 2.5 × 10−8 by the standard CC test; in contrast, the same sample size had 80% power to detect a genetic main effect OR of 1.18 at the same MAF and significance level. To boost the power for a given sample size, novel statistical and analytical methods have been proposed to leverage a priori biological knowledge in the form of genes, pathways or other functional genomics annotations such as those derived from the ENCODE and NIH Epigenomics Roadmap projects (18,19,41). Second, exposure variability and measurement accuracy play a considerable role in determining the power and reproducibility of GxE studies (42,43). Third, there is no single most powerful statistical method for either SNP or gene-level genome-wide GxE analysis due to the largely unknown patterns of GxE interaction signals and combinations of genetic main and GxE effects (20,22). Therefore, we suggest that the GxE analysis should make use of multiple methods with complementary strengths, as used here and suggested by other investigators (44), to discover the missing heritability of pancreatic cancer (45).
This study identified a statistically significant interaction of diabetes with variants in FAM63A in gene-based GxE analysis but no significant SNP-level GxE interactions with either diabetes or obesity. We note that the absence of interaction on the log-odds scale has potentially important implications for risk modeling, as it typically implies presence of interaction on the risk difference scale, sometimes referred to as “public health interaction” (46). Developing and validating a multifactorial risk model is beyond the scope of this paper, but we note that our results lend support to the common assumption of additive log odds when combining genetic, clinical, and environmental risk factors to predict risk (47,48).
The current study has strengths and limitations. This is by far the largest GxE analysis in pancreatic cancer. Quality control was strictly performed in steps of genotyping, population structure definition, exposure measurement and harmonization. Diabetes was defined as disease with ≥3-year duration, avoiding reverse causality. Along the same line, since it is common for pancreatic cancer patients to experience severe weight loss (43), we avoided using body weight at or close to cancer diagnosis for cases when calculating the BMI. Following the state-of-the-art analysis strategies in large consortium-based GxE scans (49,50), we only adjusted for a “minimum” set of covariates, including age, sex, study sites and principal components accounting for population substructure, in the regression analysis. As shown by the well-behaved QQ plots in Figure S1, there was no indication of uncontrolled confounding effects. Finally, genome-wide significant thresholds based on the Bonferroni correction were applied to reduce false-positive discovery. Nevertheless, relatively small sample sizes curbed the power of genome-wide GxE scan from case-control and case-only study designs. Despite this, the current GxE analysis discovered a novel susceptibility locus for pancreatic cancer using a gene-based GxE test, and may contribute to discovering the mechanism of diabetes-associated pancreatic cancer.
Supplementary Material
Acknowledgments
This work was supported by the National Institutes of Health grants R01CA169122 (to P. Wei) and UH2CA191284 (to P. Kraft).
The IARC/Central Europe study was supported by NIH grant R03 CA123546–02 and grants from the Ministry of Health of the Czech Republic (NR 9029–4/2006, NR9422–3, NR9998–3, MH CZ-DRO-MMCI 00209805).
The work at Johns Hopkins University was supported by the NCI Grants P50CA062924 and R01CA97075. Additional support was provided by the Lustgarten Foundation, Susan Wojcicki and Dennis Troper and the Sol Goldman Pancreas Cancer Research Center. This work was supported by a NCI grant R01 CA154823.
The Mayo Clinic Biospecimen Resource for Pancreas Research study is supported by the Mayo Clinic SPORE in Pancreatic Cancer (P50 CA102701).
The MD Anderson Cancer Center study was supported by the NIH grant R01CA98030 and a grant from the Khalifa Bin Zayed Al Nahyan Foundation.
The Memorial Sloan Kettering Cancer Center Pancreatic Tumor Registry is supported by P30CA008748, the Geoffrey Beene Foundation, the Arnold and Arlene Goldstein Family Foundation, and the Society of MSKCC.
The PACIFIC Study was supported by the NCI grant R01CA102765, Kaiser Permanente and Group Health Cooperative.
The Queensland Pancreatic Cancer Study was supported by a grant from the National Health and Medical Research Council of Australia (NHMRC) (Grant number 442302). RE Neale is supported by a NHMRC Senior Research Fellowship (#1060183).
The UCSF pancreas study was supported by NIH-NCI grants (R01CA1009767, R01CA109767-S1 and R0CA059706) and the Joan Rombauer Pancreatic Cancer Fund.
The Yale (CT) pancreas cancer study was supported by the NCI grant 5R01CA098870. The cooperation of 30 Connecticut hospitals, including Stamford Hospital, in allowing patient access, is gratefully acknowledged. The Connecticut Pancreas Cancer Study was approved by the State of Connecticut Department of Public Health Human Investigation Committee. Certain data used in that study were obtained from the Connecticut Tumor Registry in the Connecticut Department of Public Health. The authors assume full responsibility for analyses and interpretation of these data.
Studies included in PANDoRA were partly funded by: the Czech Science Foundation (No. P301/12/1734), the Internal Grant Agency of the Czech Ministry of Health (IGA NT 13 263); the Baden-Württemberg State Ministry of Research, Science and Arts (Prof. H. Brenner), the Heidelberger EPZ-Pancobank (Prof. M.W. Büchler and team: Prof. T. Hackert, Dr. N. A. Giese, Dr. Ch. Tjaden, E. Soyka, M. Meinhardt; Heidelberger Stiftung Chirurgie and BMBF grant 01GS08114), the BMBH (Prof. P. Schirmacher; BMBF grant 01EY1101), the “5×1000” voluntary contribution of the Italian Government, the Italian Ministry of Health (RC1203GA57, RC1303GA53, RC1303GA54, RC1303GA50), the Italian Association for Research on Cancer (Prof. A. Scarpa; AIRC n. 12182), the Italian Ministry of Research (Prof. A. Scarpa; FIRB - RBAP10AHJB), the Italian FIMP-Ministry of Health (Prof. A. Scarpa; 12 CUP_J33G13000210001), and by the National Institute for Health Research Liverpool Pancreas Biomedical Research Unit, UK. We would like to acknowledge the contribution of Dr Frederike Dijk and Prof. Oliver Busch (Academic Medical Center, Amsterdam, the Netherlands).
The American Cancer Society (ACS) funds the creation, maintenance, and updating of the Cancer Prevention Study II cohort.
Cancer incidence data for CLUE were provided by the Maryland Cancer Registry, Center for Cancer Surveillance and Control, Department of Health and Mental Hygiene, 201 W. Preston Street, Room 400, Baltimore, MD 21201, http://phpa.dhmh.maryland.gov/cancer, 410-767-4055.
We acknowledge the State of Maryland, the Maryland Cigarette Restitution Fund, and the National Program of Cancer Registries of the Centers for Disease Control and Prevention for the funds that support the collection and availability of the cancer registry data.” We thank all the CLUE participants.
The Melbourne Collaborative Cohort Study (MCCS) recruitment was funded by VicHealth and Cancer Council Victoria. The MCCS was further supported by Australian NHMRC grants 209057 and 396414 and by infrastructure provided by Cancer Council Victoria. Cases and their vital status were ascertained through the Victorian Cancer Registry and the Australian Institute of Health and Welfare, including the National Death Index and the Australian Cancer Database.
The NYU study (AZJ and AAA) was funded by NIH R01 CA098661, UM1 CA182934 and center grants P30 CA016087 and P30 ES000260.
The PANKRAS II Study in Spain was supported by research grants from Instituto de Salud Carlos III-FEDER, Spain: Fondo de Investigaciones Sanitarias (FIS) (#PI13/00082 and #PI15/01573) and Red Temática de Investigación Cooperativa en Cáncer, Spain (#RD12/0036/0050); and European Cooperation in Science and Technology (COST Action #BM1204: EU_Pancreas), Ministerio de Ciencia y Tecnología (CICYT SAF 2000-0097), Fondo de Investigación Sanitaria (95/0017), Madrid, Spain; Generalitat de Catalunya (CIRIT - SGR); ‘Red temática de investigación cooperativa de centros en Cáncer’ (C03/10), ‘Red temática de investigación cooperativa de centros en Epidemiología y salud pública’ (C03/09), and CIBER de Epidemiología (CIBERESP), Madrid.
The Physicians’ Health Study was supported by research grants CA-097193, CA-34944, CA-40360, HL-26490, and HL-34595 from the National Institutes of Health, Bethesda, MD USA.
The Women’s Health Study was supported by research grants CA-047988, HL-043851, HL-080467, and HL-099355 from the National Institutes of Health, Bethesda, MD USA.
Health Professionals Follow-up Study is supported by NIH grant UM1 CA167552 from the National Cancer Institute, Bethesda, MD USA
Nurses’ Health Study is supported by NIH grants UM1 CA186107, P01 CA87969, and R01 CA49449 from the National Cancer Institute, Bethesda, MD USA
Additional support from the Hale Center for Pancreatic Cancer Research, U01 CA21017 from the National Cancer Institute, Bethesda, MD USA, and the United States Department of Defense CA130288, Lustgarten Foundation, Pancreatic Cancer Action Network, Noble Effort Fund, Peter R. Leavitt Family Fund, Wexler Family Fund, and Promises for Purple to B.M. Wolpin.
The WHI program is funded by the National Heart, Lung, and Blood Institute, National Institutes of Health, U.S. Department of Health and Human Services through contracts HHSN268201600018C, HHSN268201600001C, HHSN268201600002C, HHSN268201600003C, and HHSN268201600004C. The authors thank the WHI investigators and staff for their dedication, and the study participants for making the program possible. A full listing of WHI investigators can be found at: http://www.whi.org/researchers/Documents%20%20Write%20a%20Paper/WHI%20Investigator%20Long%20List.pdf.
Where authors are identified as personnel of the International Agency for Research on Cancer/World Health Organization, the authors alone are responsible for the views expressed in this article and they do not necessarily represent the decisions, policy or views of the International Agency for Research on Cancer /World Health Organization.
The abbreviations used are:
- CC
case-control
- CO
case-only
- GxE
gene-environment interaction
- GWAS
genome-wide association study
- PanC4
Pancreatic Cancer Case Control Consortium
- PanScan
Pancreatic Cancer Cohort Consortium
- MAF
minor allele frequency
Footnotes
The authors declare no conflict of interest.
References
- 1.Society AC. Cancer Facts and Figures 2019. Atlanta: American Cancer Society; 2019. [Google Scholar]
- 2.Ward EM, Sherman RL, Henley SJ, Jemal A, Siegel DA, Feuer EJ, et al. Annual Report to the Nation on the Status of Cancer, Featuring Cancer in Men and Women Age 20–49 Years. J Natl Cancer Inst 2019;111(12):1279–97 doi 10.1093/jnci/djz106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Bosetti C, Lucenteforte E, Silverman DT, Petersen G, Bracci PM, Ji BT, et al. Cigarette smoking and pancreatic cancer: an analysis from the International Pancreatic Cancer Case-Control Consortium (Panc4). Ann Oncol 2012;23(7):1880–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Genkinger JM, Spiegelman D, Anderson KE, Bernstein L, van den Brandt PA, Calle EE, et al. A pooled analysis of 14 cohort studies of anthropometric factors and pancreatic cancer risk. Int J Cancer 2011;129(7):1708–17 doi 10.1002/ijc.25794. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Aune D, Greenwood DC, Chan DSM, Vieira R, Vieira AR, Navarro Rosenblatt DA, et al. Body mass index, abdominal fatness and pancreatic cancer risk: a systematic review and non-linear dose-response meta-analysis of prospective studies. Annals of Oncology 2012;23(4):843–52. [DOI] [PubMed] [Google Scholar]
- 6.Everhart J, Wright D. Diabetes mellitus as a risk factor for pancreatic cancer. A meta-analysis. JAMA 1995;273(20):1605–9. [PubMed] [Google Scholar]
- 7.Huxley R, Ansary-Moghaddam A, Berrington de Gonzalez A, Barzi F, Woodward M. Type-II diabetes and pancreatic cancer: a meta-analysis of 36 studies. Br J Cancer 2005;92(11):2076–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Elena JW, Steplowski E, Yu K, Hartge P, Tobias GS, Brotzman MJ, et al. Diabetes and risk of pancreatic cancer: a pooled analysis from the pancreatic cancer cohort consortium. Cancer Causes Control 2013;24(1):13–25 doi 10.1007/s10552-012-0078-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Bosetti C, Rosato V, Li D, Silverman D, Petersen GM, Bracci PM, et al. Diabetes, antidiabetic medications, and pancreatic cancer risk: an analysis from the International Pancreatic Cancer Case-Control Consortium. Ann Oncol 2014;25(10):2065–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Amundadottir L, Kraft P, Stolzenberg-Solomon RZ, Fuchs CS, Petersen GM, Arslan AA, et al. Genome-wide association study identifies variants in the ABO locus associated with susceptibility to pancreatic cancer. Nat Genet 2009;41(9):986–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Petersen GM, Amundadottir L, Fuchs CS, Kraft P, Stolzenberg-Solomon RZ, Jacobs KB, et al. A genome-wide association study identifies pancreatic cancer susceptibility loci on chromosomes 13q22.1, 1q32.1 and 5p15.33. Nat Genet 2010;42(3):224–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Wolpin BM, Rizzato C, Kraft P, Kooperberg C, Petersen GM, Wang Z, et al. Genome-wide association study identifies multiple susceptibility loci for pancreatic cancer. Nat Genet 2014;46(9):994–1000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Childs EJ, Mocci E, Campa D, Bracci PM, Gallinger S, Goggins M, et al. Common variation at 2p13.3, 3q29, 7p13 and 17q25.1 associated with susceptibility to pancreatic cancer. Nat Genet 2015;47(8):911–6 doi 10.1038/ng.3341. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Zhang M, Wang Z, Obazee O, Jia J, Childs EJ, Hoskins J, et al. Three new pancreatic cancer susceptibility signals identified on chromosomes 1q32.1, 5p15.33 and 8q24.21. Oncotarget 2016;7(41):66328–43 doi 10.18632/oncotarget.11041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Klein AP, Wolpin BM, Risch HA, Stolzenberg-Solomon RZ, Mocci E, Zhang M, et al. Genome-wide meta-analysis identifies five new susceptibility loci for pancreatic cancer. Nat Commun 2018;9(1):556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Chen F, Childs EJ, Mocci E, Bracci P, Gallinger S, Li D, et al. Analysis of Heritability and Genetic Architecture of Pancreatic Cancer: A PanC4 Study. Cancer Epidemiol Biomarkers Prev 2019;28(7):1238–45 doi 10.1158/1055-9965.EPI-18-1235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Li D, Duell EJ, Yu K, Risch HA, Olson SH, Kooperberg C, et al. Pathway analysis of genome-wide association study data highlights pancreatic development genes as susceptibility factors for pancreatic cancer. Carcinogenesis 2012;33(7):1384–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Tang H, Wei P, Duell EJ, Risch HA, Olson SH, Bueno-de-Mesquita HB, et al. Genes-environment interactions in obesity- and diabetes-associated pancreatic cancer: a GWAS data analysis. Cancer Epidemiol Biomarkers Prev 2014;23(1):98–106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Tang H, Wei P, Duell EJ, Risch HA, Olson SH, Bueno-de-Mesquita HB, et al. Axonal guidance signaling pathway interacting with smoking in modifying the risk of pancreatic cancer: a gene- and pathway-based interaction analysis of GWAS data. Carcinogenesis 2014;35(5):1039–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Gauderman WJ, Mukherjee B, Aschard H, Hsu L, Lewinger JP, Patel CJ, et al. Update on the State of the Science for Analytical Methods for Gene-Environment Interactions. Am J Epidemiol 2017;186(7):762–70 doi 10.1093/aje/kwx228. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Chen H, Meigs JB, Dupuis J. Incorporating gene-environment interaction in testing for association with rare genetic variants. Hum Hered 2014;78(2):81–90 doi 10.1159/000363347. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Yang T, Chen H, Tang H, Li D, Wei P. A powerful and data-adaptive test for rare-variant-based gene-environment interaction analysis. Stat Med 2019;38(7):1230–44 doi 10.1002/sim.8037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Das S, Forer L, Schonherr S, Sidore C, Locke AE, Kwong A, et al. Next-generation genotype imputation service and methods. Nat Genet 2016;48(10):1284–7 doi 10.1038/ng.3656. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kraft P, Yen YC, Stram DO, Morrison J, Gauderman WJ. Exploiting gene-environment interaction to detect genetic associations. Hum Hered 2007;63(2):111–9. [DOI] [PubMed] [Google Scholar]
- 25.Aschard H, Hancock DB, London SJ, Kraft P. Genome-wide meta-analysis of joint tests for genetic and gene-environment interaction effects. Hum Hered 2010;70(4):292–300 doi 10.1159/000323318. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Petrovski S, Wang Q, Heinzen EL, Allen AS, Goldstein DB. Genic intolerance to functional variation and the interpretation of personal genomes. PLoS genetics 2013;9(8):e1003709doi 10.1371/journal.pgen.1003709. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Willer CJ, Li Y, Abecasis GR. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 2010;26(17):2190–1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Hedges LV, Olkin I. The inverse chi-square method In: Statistical Methods for Meta-Analysis. San Diago, CA: Academic Press, Inc; 1985. 37–9 p. [Google Scholar]
- 29.Barsh GS, Copenhaver GP, Gibson G, Williams SM. Guidelines for genome-wide association studies. PLoS genetics 2012;8(7):e1002812. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Gauderman WJ. Sample size requirements for matched case-control studies of gene-environment interaction. Stat Med 2002;21(1):35–50 doi 10.1002/sim.973. [DOI] [PubMed] [Google Scholar]
- 31.Xue A, Wu Y, Zhu Z, Zhang F, Kemper KE, Zheng Z, et al. Genome-wide association analyses identify 143 risk variants and putative regulatory mechanisms for type 2 diabetes. Nat Commun 2018;9(1):2941doi 10.1038/s41467-018-04951-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Abdul Rehman SA, Kristariyanto YA, Choi SY, Nkosi PJ, Weidlich S, Labib K, et al. MINDY-1 Is a Member of an Evolutionarily Conserved and Structurally Distinct New Family of Deubiquitinating Enzymes. Molecular cell 2016;63(1):146–55 doi 10.1016/j.molcel.2016.05.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Kwasna D, Abdul Rehman SA, Natarajan J, Matthews S, Madden R, De Cesare V, et al. Discovery and Characterization of ZUFSP/ZUP1, a Distinct Deubiquitinase Class Important for Genome Stability. Molecular cell 2018;70(1):150–64 e6 doi 10.1016/j.molcel.2018.02.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Kirin M, Chandra A, Charteris DG, Hayward C, Campbell S, Celap I, et al. Genome-wide association study identifies genetic risk underlying primary rhegmatogenous retinal detachment. Hum Mol Genet 2013;22(15):3174–85 doi 10.1093/hmg/ddt169. [DOI] [PubMed] [Google Scholar]
- 35.Kottgen A, Pattaro C, Boger CA, Fuchsberger C, Olden M, Glazer NL, et al. New loci associated with kidney function and chronic kidney disease. Nat Genet 2010;42(5):376–84 doi 10.1038/ng.568. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Mao HZ, Roussos ET, Peterfy M. Genetic analysis of the diabetes-prone C57BLKS/J mouse strain reveals genetic contribution from multiple strains. Biochim Biophys Acta 2006;1762(4):440–6 doi 10.1016/j.bbadis.2006.01.002. [DOI] [PubMed] [Google Scholar]
- 37.McAllister K, Mechanic LE, Amos C, Aschard H, Blair IA, Chatterjee N, et al. Current Challenges and New Opportunities for Gene-Environment Interaction Studies of Complex Diseases. Am J Epidemiol 2017;186(7):753–61 doi 10.1093/aje/kwx227. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Gong J, Hutter CM, Newcomb PA, Ulrich CM, Bien SA, Campbell PT, et al. Genome-Wide Interaction Analyses between Genetic Variants and Alcohol Consumption and Smoking for Risk of Colorectal Cancer. PLoS genetics 2016;12(10):e1006296doi 10.1371/journal.pgen.1006296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Kantor ED, Hutter CM, Minnier J, Berndt SI, Brenner H, Caan BJ, et al. Gene-environment interaction involving recently identified colorectal cancer susceptibility Loci. Cancer Epidemiol Biomarkers Prev 2014;23(9):1824–33 doi 10.1158/1055-9965.EPI-14-0062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Ritz BR, Chatterjee N, Garcia-Closas M, Gauderman WJ, Pierce BL, Kraft P, et al. Lessons Learned From Past Gene-Environment Interaction Successes. Am J Epidemiol 2017;186(7):778–86 doi 10.1093/aje/kwx230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Ritchie MD, Davis JR, Aschard H, Battle A, Conti D, Du M, et al. Incorporation of Biological Knowledge Into the Study of Gene-Environment Interactions. Am J Epidemiol 2017;186(7):771–7 doi 10.1093/aje/kwx229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Kraft P, Aschard H. Finding the missing gene-environment interactions. European journal of epidemiology 2015;30(5):353–5 doi 10.1007/s10654-015-0046-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Wei P, Tang H, Li D. Functional logistic regression approach to detecting gene by longitudinal environmental exposure interaction in a case-control study. Genetic epidemiology 2014;38(7):638–51 doi 10.1002/gepi.21852. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Hsu L, Jiao S, Dai JY, Hutter C, Peters U, Kooperberg C. Powerful cocktail methods for detecting genome-wide gene-environment interaction. Genetic epidemiology 2012;36(3):183–94 doi 10.1002/gepi.21610. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, et al. Finding the missing heritability of complex diseases. Nature 2009;461(7265):747–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Siemiatycki J, Thomas DC. Biological models and statistical interactions: an example from multistage carcinogenesis. Int J Epidemiol 1981;10(4):383–7 doi 10.1093/ije/10.4.383. [DOI] [PubMed] [Google Scholar]
- 47.Kim Jihye, Yuan Chen, Babic Ana, Bao Ying, Clish Clary, Pollak Michael N., et al. Genetic and circulating biomarker data improve risk prediction for pancreatic cancer in the general population Cancer Epidemiology, Biomarkers & Prevention 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Klein AP, Lindstrom S, Mendelsohn JB, Steplowski E, Arslan AA, Bueno-de-Mesquita HB, et al. An Absolute Risk Model to Identify Individuals at Elevated Risk for Pancreatic Cancer in the General Population. Plos One 2013;8(9). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Rao DC, Sung YJ, Winkler TW, Schwander K, Borecki I, Cupples LA, et al. Multiancestry Study of Gene-Lifestyle Interactions for Cardiovascular Traits in 610 475 Individuals From 124 Cohorts Design and Rationale. Circ-Cardiovasc Gene 2017;10(3). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Nan HM, Hutter CM, Lin Y, Jacobs EJ, Ulrich CM, White E, et al. Association of Aspirin and NSAID Use With Risk of Colorectal Cancer According to Genetic Variants. Jama-J Am Med Assoc 2015;313(11):1133–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
