Supplemental Digital Content is available in the text.
Keywords: calcium, diet, fiber, folate, gene expression, meat, vegetables, whole grains
Abstract
Background
Although the association between diet and disease is well documented, the biologic mechanisms involved have not been entirely elucidated. In this study, we evaluate how dietary intake influences gene expression to better understand the underlying mechanisms through which diet operates.
Methods
We used data from 144 individuals who had comprehensive dietary intake and gene expression data from RNAseq using normal colonic mucosa. Using the DESeq2 statistical package, we identified genes that showed statistically significant differences in expression between individuals in high-intake and low-intake categories for several dietary variables of interest adjusting for age and sex. We examined total calories, total fats, vegetable protein, animal protein, carbohydrates, trans-fatty acids, mutagen index, red meat, processed meat, whole grains, vegetables, fruits, fiber, folate, dairy products, calcium, and prudent and western dietary patterns.
Results
Using a false discovery rate of less than 0.1, meat-related foods were statistically associated with 68 dysregulated genes, calcium with three dysregulated genes, folate with four dysregulated genes, and nonmeat-related foods with 65 dysregulated genes. With a more stringent false discovery rate of less than 0.05, there were nine meat-related dysregulated genes and 23 nonmeat-related genes. Ingenuity pathway analysis identified three major networks among genes identified as dysregulated with respect to meat-related dietary variables and three networks among genes identified as dysregulated with respect to nonmeat-related variables. The top networks (Ingenuity Pathway Analysis network score >30) associated with meat-related genes were (i) cancer, organismal injury, and abnormalities, tumor morphology, and (ii) cellular function and maintenance, cellular movement, cell death, and survival. Among genes related to nonmeat consumption variables, the top networks were (i) hematological system development and function, nervous system development and function, tissue morphology and (ii) connective tissue disorders, organismal injury, and abnormalities.
Conclusion
Several dietary factors were associated with gene expression in our data. These findings provide insight into the possible mechanisms by which diet may influence disease processes.
Introduction
Diet has been associated with several diseases including atherosclerosis 1,2, inflammatory bowel disease 3, steatosis and nonalcoholic fatty liver disease 4,5, seizures 6, and various cancers including colon and breast carcinoma 7–9. Moreover, diet has also been identified as a risk factor for type 2 diabetes mellitus, obesity, and metabolic syndrome 10. Beyond its role in disease status, diet has been shown to play an important role in a number of physiologic and metabolic processes including bone remodeling 11 and aspects of the immune response 12. The biologic mechanisms involved in these processes are less clear.
Studies have evaluated the role of diet in the regulation of specific physiologic and pathologic processes, showing that single dietary metabolites and nutrients play a role in the regulation of certain signaling pathways 11,13–15 or cytokine profiles 16. Other studies have focused on dietary effects on oxidative stress 17 or the role of potentially chemoprotective nutrients 18. Some of these studies also have looked at localized gene expression within specific hypothesized pathways and processes, suggesting that diet plays a role in the regulation of gene expression 7,8,16,19. Recently, the effects of dietary patterns on the entire gene expression profile were investigated, suggesting a differential expression of genes in individuals with a prudent dietary pattern verses a western dietary pattern and identified specific canonical pathways associated with both dietary patterns 20.
In this study, we hypothesize that levels of gene expression in normal colonic mucosa will be associated with dietary factors. Dietary factors that we consider include sources of calories, including animal protein and vegetable protein, carbohydrates, and total fat. We also consider trans-fatty acids, total red meat, processed meat, mutagen index, dairy products, fruit, vegetables, refined grains, whole grains, sucrose, dietary fiber, calcium, and folate with gene expression profiles. Finally, we examined the association between prudent dietary pattern and western dietary pattern with gene expression.
Methods
Total RNA was available from normal colonic mucosa for colon cancer cases who were part of the diet, activity, and lifestyle study, an incident, population-based, case–control study of colon cancer from Utah and the Kaiser Permanente Medical Research Program (KPMRP) 21. Cases were identified using a rapid-reporting system, had tumor registry verification of a first primary adenocarcinoma of the colon, and were diagnosed between October 1991 and September 1994; tumor tissue blocks (used for RNA) were obtained for 97% of all Utah cases and for 85% of all KPMRP cases 22. Individuals with known adenomatous polyposis coli, Crohn’s disease, or inflammatory bowel disease were not eligible for the study. The study was approved by the Institutional Review Board of the University of Utah and at KPMRP; all study participants signed informed consent forms before participation.
Dietary data
Data were collected by trained and certified interviewers using laptop computers shortly after diagnosis. All interviews were audio-taped as described previously and reviewed for quality control (QC) purposes 23. Any interview deemed questionable by the interviewer was reviewed centrally; this enabled us to evaluate the data by distinguishing between an interview that was difficult for the interviewer versus one that was difficult for the participant. In addition, we reviewed audio tapes for all individuals whose nutrient levels were considered outliers (over the 95% tile or under the 5% tile of intake); this enabled us to correct any coding errors and determine the quality and face validity of the interview. Dietary information was obtained for the year, 2 years before diagnosis using an extensive diet history questionnaire adapted from the validated CARDIA diet history; most individuals were asked to recall usual dietary intake from 2 to 3 years ago to obtain the prediagnosis diet 24. Foods were converted into nutrients using the Nutrition Coding Center Nutrient Data System (University of Minnesota, Nutrition Coding Center, Minneapolis, Minnesota, USA), version 19 and were also grouped into categories of similar foods. We assessed both foods and nutrients with gene expression. Foods units were standard servings per day, which was half cup of fruit, vegetable, or dairy product, meat servings were 2–3 oz, and grain products were half cup of rice-type grains or one slice of bread. Prudent and western dietary patterns were developed on the basis of the principal component program 25. Our prudent dietary pattern was heavily loaded toward diets high in fruits, vegetables, whole grains, fish, and chicken, whereas the western dietary pattern was highly loaded toward red meat, processed meats, and refined grains and high-sugar, high-fat foods. Additional questions were asked about meat consumption, doneness, and preparation methods that were combined and used to create a mutagen index score 26.
RNA processing
RNA was extracted from formalin-fixed paraffin-embedded colonic tissues. Formalin-fixed paraffin-embedded tissue has been shown to be a reliable source of RNA for use in conjunction with RNA-seq 27. The study pathologist reviewed slides to delineate carcinoma and normal colonic mucosa. Total RNA was extracted, isolated, and purified using the RecoverAll Total Nucleic Acid isolation kit (Ambion, Carlsbad, California, USA), RNA yields were determined using a NanoDrop spectrophotometer (Wilmington, Delaware, USA).
Sequencing library preparation
Library construction was performed using the TruSeq stranded total RNA sample preparation kit (Illumina, San Diego, California, USA) with Ribo-Zero (Illumina) as described previously 28. Briefly, ribosomal RNA was removed from 100 ng total RNA using biotinylated Ribo-Zero oligos attached to magnetic beads. Following purification, the rRNA-depleted sample was fragmented and primed with random hexamers. First-strand reverse transcription was performed using superscript II reverse transcriptase (Invitrogen, Carlsbad, California, USA). Second-strand cDNA synthesis was performed using DNA polymerase I and Rnase H under conditions in which dUTP is substituted for dTTP, yielding blunt-ended cDNA fragments in which the second strand contains dUTP. An A-base is added to the blunt ends as a means to prepare the cDNA fragments for adapter ligation and block concatamer formation during the ligation step. Adapters containing a T-base overhang were ligated to the A-tailed DNA fragments. Ligated fragments were PCR amplified (13 cycles) under conditions in which the PCR reaction enables amplification of the first-strand cDNA product. The PCR-amplified library was purified using Agencourt AMPure XP beads (Beckman Coulter Genomics, Danvers, Massachusetts, USA).
Sequencing and data processing
Sequencing libraries (18 pM) were chemically denatured and applied to an Illumina TruSeq v3 single-read flow cell using an Illumina cBot. Hybridized molecules were clonally amplified and annealed to sequencing primers with reagents from an Illumina TruSeq SR Cluster Kit v3-cBot-HS. Following transfer of the flowcell to an Illumina HiSeq instrument, a 50 cycle single-read sequence run was performed using TruSeq SBS v3 sequencing reagents. The single-end 50-base reads from the Illumina HiSeq2500 were aligned to a sequence database containing the human genome (build GRCh37/hg19, February 2009, from http://www.genome.ucsc.edu) plus all splice junctions generated using the USeq MakeTranscriptome application (version 8.8.1, available at: http://www.useq.sourceforge.net/). Alignment was performed using NovoAlign, version 2.08.01 available from http://www.novocraft.com, which also trimmed any adapter sequence. Genome alignments to splice junctions were translated back to genomic coordinates using the USeq SamTranscriptomeParser application. Alignments were then sorted and indexed using the Picard SortSam application (version 1.100, available at: http://broadinstitute.github.io/picard/). Aligned read counts for each gene were calculated using pysam (https://code.google.com/p/pysam/) and samtools (http://www.samtools.sourceforge.net/). A python script using the pysam library was given a list of the genome coordinates for each gene, and counts to the exons and UTRs of those genes were calculated. Gene coordinates were downloaded from http://www.genome.ucsc.edu. Over 17 000 genes that were expressed in colon mucosa were analyzed with dietary factors. Rigid QC procedures were carried out to ensure high-quality data; participants failing QC were excluded from the analysis.
Statistical methods
Of the 197 initial tumor/nontumor tissue pairs, 22 patients failed QC on the basis of a low number of sequences, leaving 175 patients with high-quality expression data. Of these, 144 had dietary questionnaire data. For each dietary factor, our analysis focused on contrasting individuals with lower intake to those of individuals with higher intake. Dietary data were evaluated using nutrients per 1000 calories and standard servings of food; sample tertiles were computed for the consumption levels of each dietary factor. For each such dietary variable, we determined which genes showed statistically significant differential expression between the lowest and the highest intake tertile categories after adjusting for age and sex using the Bioconductor package DESeq2 (Stanford, California, USA), written for the R statistical programming environment; these genes were considered dysregulated. DESeq2 assumes that the RNA-Seq counts are distributed according to negative binomial distributions. It utilizes generalized linear modeling and variance-reduction techniques for estimated coefficients to test individual null hypotheses of zero log2-fold changes between high and low categories (i.e. no differential expression) for each gene. It uses both an independent-filtering method and the Benjamini and Hochberg 29 procedure to improve power and control the false discovery rate (FDR). The default DESeq2 options were used, including replacement of outliers, as defined by Cook’s distance, and the Wald test. For further details on DESeq2, see Love et al. 30. In identifying genes with differential expression, an FDR of 0.10 was used. The fold change calculations for differentially expressed genes was determined by DESeq2 and represents the log2 change in expression level (i.e. counts) for the high versus low dietary categories.
Bioinformatics analysis was carried out on the list of Ensemble IDs associated with genes identified as differentially expressed between high-consumption and low-consumption categories for the dietary variables of interest using QIAGEN’s Ingenuity Pathway Analysis (IPA) 31. We used genes from ingenuity knowledge base and considered both indirect and direct relationships. Causal and interaction networks were generated. Interaction networks were limited to 35 molecules per network and 25 networks per analysis, and excluded endogenous chemicals. We focused on algorithmically derived interaction networks, which are assigned a score on the basis of their relevance to the genes in the input dataset, the number of focus genes (i.e. dysregulated genes in our data that are in that network), and their connectivity 32. The score is calculated as –log10P, where P is generated using a Fisher’s exact test 33. Studies have found scores greater than 3 to be significant, with a score of 3 indicating a 1/1000 chance that the focus genes are in a network because of random chance 34–36. Other studies have opted to utilize more stringent criteria and higher scores to ensure that their discovered networks are highly significant 37,38; we utilized highly stringent criteria, only including networks with scores over 20. We applied the Benjamini and Hochberg multiple testing correction to assess pathways in IPA.
Availability of data is restricted to that authorized in the patient consent form and in accordance with data transfer agreements and institutional review board requirements.
Results
The majority of our study population included men and the median age was 65 years; 13.2% were current smokers and 37.3% currently took aspirin or nonsteroidal anti-inflammatory drugs on a regular basis (Table 1). The median BMI was 29.6. Within this population, few individuals consumed processed meat, with the highest level of intake being less than one serving per day. The highest red meat consumption was less than two servings per day. Over half of the study participants had over two servings of vegetables per day and one serving of whole grains per day.
Table 1.
Seven genes were differentially expressed for dietary calcium and folate (Table 2). Sixty-five genes were differentially expressed between high-intake and low-intake categories among nonmeat variables (i.e. prudent dietary pattern, fruits, vegetables, and whole grains). This can be further broken down into one gene with prudent dietary pattern, one gene with fruit intake, 26 genes with vegetable intake, and 37 genes with whole grain intake (Table 3, FDR<0.1). Several genes were identified as differentially expressed between consumption categories for multiple dietary variables. TXNDC17 was upregulated for both prudent dietary pattern and vegetable intake; MUC5AC was downregulated for vegetable intake, whole grain intake, and dietary folate. FOXJ2, NECAP1, and C3AR1 were upregulated for both calcium intake and vegetable intake. Using a more stringent FDR of less than 0.05, we identify one gene with differential expression between calcium intake categories, one with prudent dietary pattern, 13 with vegetable intake, and eight with whole grain intake (Tables 2 and 3). Among the genes with an FDR of less than 0.05, FOXJ2 was upregulated with high calcium intake and TXNDC17 was unregulated with a high prudent dietary pattern. In contrast, five of the eight genes associated with whole grains were downregulated and four of the 13 genes associated with vegetables were downregulated.
Table 2.
Table 3.
Our IPA analysis found three interaction networks to be significantly associated with nonmeat consumption (IPA network score ≥20). These IPA networks were identified as functioning with cancer, organismal injury and abnormalities, and tumor morphology (score=31, focus molecules=11) (Fig. 1a), cellular function, maintenance, cellular movement, cell death and survival (score=31, focus molecules=15) (Fig. 1b), and drug metabolism, molecular transport, and small molecule biochemistry (score=26, focus molecules=13) (Fig. 1c).
Among genes identified as significantly differentially expressed between high and low meat-related intake categories, 64 genes were differentially expressed with red meat, two genes were differentially expressed with processed meat, and three genes were differentially expressed with a western dietary pattern (Table 4). Of these, eight genes were differentially expressed by the level of red meat and LSAMP was differentially expressed by the level of western dietary pattern when using a more stringent selection criterion of a maximal FDR of less than 0.05. The majority of these genes were downregulated by red meat consumption; of all of the genes identified, only IL22RA1, SLC17A4, SLC25A25, MUC17, ZFY, FRK, PEX11G, NEDD9, NR3C2, and RPS4Y1 were upregulated by red meat consumption. The two genes identified as being differentially expressed for western dietary pattern were also differentially expressed for red meat; C3 was downregulated by red meat and processed meat.
Table 4.
In IPA analysis, three interaction networks were significantly associated (score≥20) with meat-related variables (i.e. red meat, processed meat, and western dietary pattern). These IPA networks were identified as hematological system developmental function, nervous system development and function, tissue morphology (IPA network score=31, focus molecules 22) (Fig. 2a); connective tissue disorders, organismal injury and abnormalities, cardiovascular system development and function (score=29, focus molecules=14) (Fig. 2b); and cellular compromise, cellular function and maintenance, and protein degradation (score=21, focus molecules 11) (Fig. 2c). Supplement Table 1, Supplemental digital content 1, http://links.lww.com/FPC/B6 shows genes upregulated and downregulated in each of these networks. Some genes were differentially expressed between high-consumption and low-consumption categories for both meat and non-meat dietary variables. COL1A1 was downregulated for vegetables and meat intake; MUC5AC was downregulated for folate, vegetables, whole grains, and meat. Foods and nutrients analyzed that were not associated with any differential gene expression (FDR<0.1) were mutagen index, total dairy products, carbohydrates, trans-fatty acids, total dietary fat, dietary fiber, refined grain products, and sucrose. In addition, we observed no differences in gene expression by level of vigorous physical activity.
Discussion
Our study provides support for associations between level of dietary intake and gene expression in normal colonic mucosa utilizing RNA-Seq, which enabled the assessment of over 17 000 genes. Some genes were differentially expressed by intake level across multiple dietary variables, generally in the same direction, although there was limited overlap between meat and nonmeat variables. We observed three IPA interaction networks to be highly associated with meat consumption and three IPA interaction networks to be associated with nonmeat consumption. These data suggest that dietary influence on gene expression plays an important role in a number of physiologic and pathologic processes.
The majority of genes that were differentially expressed with high red meat consumption were downregulated. These genes predominantly play a role in extracellular matrix (ECM) assembly and structure, calcium binding, and the regulation of signal transduction through the Wnt signaling pathway. The canonical Wnt signaling pathway triggers β-catenin-dependent gene expression, allowing cell cycle progression at the G1/S transition 39. In particular, we found that high intake of meat correlated with downregulation of FZD3, TMEM132B, CCDC150, EGFLAM, and NKD1. Both TMEM132B and NKD1 are known to downregulate Wnt signaling 40. Thus, meat consumption can contribute toward the dysregulation of a proto-oncogenic pathway. At the same time, we found that high red meat consumption was associated with a downregulation of TP53, encoding for the tumor suppressor p53. Both of these findings are consistent with previous findings that red meat intake can induce procarcinogenic gene expression changes through the p53 and Wnt signaling pathways 19. This is possibly because of the fact that red meat is high in dietary heme, a luminal irritant associated with hyperproliferation and hyperplasia through the downregulation of feedback inhibitors, including Wnt inhibitors 17. These findings add to the literature that suggest that red meat increases the risk of several chronic diseases including cancer.
High red meat consumption also downregulated the expression of genes associated with ECM. Dysregulation of genes involved in the regulation of the ECM and cell matrix adhesion, such as matrix metalloproteinases (MMPs), collagen-related genes such as COL4A2, COL1A2, and COL1A1, tissue inhibitor of metalloproteinases (TIMPs), and one cut homeobox 2 (ONECUT2) have been implicated in a variety of diseases 41. Moreover, red meat is associated with both acute oxidative stress and delayed cytotoxic stress 17. Several genes dysregulated by meat, including THY1, NCL, COL4A2, and KRT1, contribute to metastasis and angiogenesis. We found that diet downregulated a variety of genes associated with ECM assembly and maintenance, including TIMP1, providing an additional mechanistic link between red meat and disease processes.
It is also worth noting that red meat upregulated some genes, namely, genes with products involved in phosphate transport (SLC17A4 and SLC25A25) and cytokine signaling (MUC17 and IL22RA1). Diet is known to play an important role in bone health, with meat protein increasing acid load and regulating calcium balance 42. The upregulation of genes involved in phosphate transport as well as our observed downregulation of calcium-binding genes (SPARC, CEMIP, HSPA5, HSP901B, VWDE, SLC25A25, C1R, CLDN1, and EGFLAM) may help explain the mechanism by which meat consumption modulates bone metabolism, beyond the effects of dietary protein on acid load. Moreover, the upregulation of MUC17 and IL22RA1 may play a role in meat-induced inflammation and inflammation-related diseases. The exact function of MUC17 is unknown; however, other mucins are reportedly involved in the regulation of inflammation 43 and MUC17 expression is related to certain cancers, including pancreatic cancer 44. IL22 is produced by T lymphocytes and mediates cellular inflammatory responses through the activation of intracellular kinases and has been implicated in a variety of inflammatory diseases, including colon cancer and ulcerative colitis 45, both of which have been associated with red meat consumption 17,46.
Several genes associated with immune response were downregulated by high meat consumption, including TRIL and IGFGH1. We found that high red meat consumption decreased the expression of C1R and C1S. C1S encodes for a component of C1 in the classic complement pathway; C1R is the R subcomponent of C1. The downregulation of the components of the complement cascade by meat consumptions seems paradoxical; however, evidence suggests that the proximal classical complement components may provide a protective effect against atherosclerosis 47,48. Although atherosclerosis is an inflammatory disease, Hovland et al. 48 suggest that the classic complement pathway may be important for tissue homeostasis by the clearance of cell debris and immune complexes. Thus, it is possible that downregulation of proximal components of the classic pathway, such as components of C1, could contribute paradoxically toward the proinflammatory effects of red meat. However, upregulation of MUC17 by meat could help explain its association with a number of factors, including inflammation 20,46.
Further evaluation of the genes associated with nonmeat-related variables showed that genes associated with calcium were upregulated, two of the four genes associated with folate were upregulated, and the TXNDC17 gene associated with prudent dietary pattern was upregulated. The genes associated with calcium consumption were involved in transmembrane transport, regulation of transcription and translation, cell survival, and the complement system. Calcium plays a key role in muscle contraction and relaxation, blood coagulation, nerve transmission, and keratinocyte differentiation 49. The genes upregulated by high calcium consumption include FOXJ2, encoding for a transcription activator; NECAP1, whose product is involved in receptor-mediated endocytosis; and C3AR1, encoding for a C3A receptor. A prudent dietary pattern was associated with roughly three servings of milk per day in one study 20 and was associated with a decreased risk of cancer and the B cell receptor pathways 20. Our observed calcium-associated upregulation of tumor suppressors and a C3A receptor is consistent with these earlier findings.
Folate-rich foods include green vegetables and citrus fruits, and folate plays a key role as a coenzyme involved in DNA synthesis, amino acid metabolism, and methylations 50. High versus low folate intake was associated with the upregulation of SLC37A2 and CLDN4 and the downregulation of MUC5AC. The gene products for SLC37A2 and CLDN4 are involved in carbohydrate transport and tight junction organization, respectively. Tight junction proteins, such as claudins, are related to NF-κB, mTOR, and Nrf2 51,52, whereas the expression of tight junction proteins is associated with amino acids, such as arginine and tryptophan 51,52. As folic acid regulates amino acid metabolism, this could explain its role in modulating the expression of tight junction proteins. Dietary folate also downregulates the expression of MUC5AC as some mucins have been associated with inflammation 43; this could help explain the reported association between folate and certain cancers and cardiovascular diseases 50. We also found that a prudent dietary pattern upregulated TXNDC17, which modulates NF-κB. NF-κB signaling, plays a role in inflammation, apoptosis, and a variety of cellular stress responses 15. TXNDC17 overexpression has been shown to inhibit TNF-α-induced activation of NF-κB 53; thus, its upregulation could contribute toward the decrease in inflammation and cancer noted previously with a prudent dietary pattern 20.
We found that the majority of genes differentially expressed between intake levels of whole grains and vegetables were downregulated, especially for vegetable intake. The genes that are differentially expressed with level of whole grain and vegetable consumption are associated with NF-κB signaling, regulation of apoptosis, cytoskeleton dynamics, and carbohydrate metabolism. Carbohydrates are important modulators of insulin action on glucose metabolism 54, which is consistent with our findings of PFKL, ABCC3, and GMPPB downregulated expression with whole grain consumption, and HSPG2 and REG4, which were downregulated by high vegetable consumption. Both fiber and polyphenols attenuate inflammation 46. Phenolic compounds are major bioactive compounds in whole grains and polyphenols are known to interfere with the NF-κB signaling pathway 55, which is consistent with our data showing a downregulation of genes involved in the NF-κB signaling. Our data suggest that the previously described polyphenolic perturbation of nuclear signaling and anti-inflammatory effects 56 may be explained by the differential gene expression observed with both whole grain and vegetable levels of intake.
Some genes were upregulated by high vegetable intake, including ANP32A, NECAP1, and HELZ. ANP32A, which encodes for a tumor suppressor 57; and HELZ, which encodes a RNA helicase, which has been shown in conjunction with other genes to regulate downstream genes as a transcription factor containing histone methyltransferase activity 58. In association with SMYD3, HELZ plays a role in the proliferation of cancer cells 58. It has also been suggested that HELZ plays a wider role in global translation activation and the phosphorylation (activation) of ribosomal protein S6, which in turn activates eIF4b 59. This indicates that the upregulation of HELZ could help explain the association of vegetables with several factors, including inflammation 20,46, through the translational activation of numerous genes within the inflammatory process.
IPA found three interaction networks with meat consumption and three interaction networks associated with nonmeat consumption. This correlates with earlier IPA studies showing that the western dietary pattern is associated with five canonical pathways related to cancer, six pathways related to immune and/or inflammatory responses, and three pathways related to cardiovascular signaling and that the prudent dietary pattern is associated with nine IPA canonical pathways associated with immune/inflammatory response and six pathways associated with cancer 20. It has been noted previously that western dietary patterns are associated with a proinflammatory gene expression profile and that prudent dietary patterns are associated with an anti-inflammatory gene expression profile 60, which would be consistent with our described networks. Moreover, our networks suggest that although both meat-related and nonmeat- related dietary variables are associated with the cellular functions of intracellular transport and ECM regulation, the signal transduction pathways differ between meat and nonmeat networks. We found meat consumption to be associated with TP53, NFκB1, and MAPK signaling through the ERK1/2 signaling pathway. The Ras-MAPK system is involved in cell proliferation 61; moreover, our findings in this network are consistent with earlier findings that high-fat diets, the cooked meat carcinogen 2-amino-1-methyl-6-phenylimidazo [4,5-b]-pyridine (PhiP), and sodium nitrate can activate the MAPK pathways 9,16,62. Nonmeat consumption appears to play a greater role in the regulation of the PI3K complex, TNF-signaling, and TGFB1-signaling pathways. All of these pathways have been identified as major signaling pathways for colon cancer 63–66.
To the best of our knowledge, this is one of the few studies to examine the role of diet in gene expression using data from a population-based study and comprehensive gene expression data. Our study has a number of strengths including using RNA-Seq, which produces global gene expression data for each RNA sample and is an ideal method to carry out a discovery study such as ours 67,68. However, it is important to keep in mind that the gene expression profile is most relevant to current diet exposure. The time between tissue ascertainment and the referent period for diet in our study could be from several months to 3 or possibly 4 years. Although the lack of findings could indicate a disparity in time between exposure and tissue sample acquirement, finding associations would imply that the exposure is recent enough, in that dietary patterns are consistent over time, to alter the expression. In addition, we have utilized normal colonic mucosa; thus, genes would have to be expressed in colon tissue for detection. It should be recognized that different platforms carry different technical strengths and weaknesses that can influence results. Thus, it is essential to validate these findings in other populations using the methods that we used here as well as other platforms to better understand associations between diet and gene expression. We utilized DESeq2 to assess gene expression data adjusting for age and sex. We have previously shown associations between cigarette smoking and alcohol and gene expression in these samples 69. Similarly, we report statistically significant differential expression as an indicator of dysregulated genes; however, the level of dysregulation that is necessary to result in functional significance is not clear.
Although this study was carried out in a rigorous manner, there are limitations. Our dietary data were collected shortly after diagnosis. Although our dietary questionnaire allowed for reporting of over 800 food items and we implemented extremely rigid QC procedures, it is possible that recall could have been influenced by the disease status. Although this is a possibility, it should be noted that our risk estimates for diet and colon cancer are almost identical to those reported by large cohort studies 70. In addition, these data reflect foods consumed in the early 1990s. Although we believe that individuals still eat the foods that we report here, different dietary patterns may be more important today than the western and prudent diet that we report.
Our data support the hypothesis that diet is associated with gene expression. Our data showed that diet was involved in deregulation of genes involved in numerous pathways and functions such as Wnt-signaling, MAPK, NF-κB, TP53, ECM maintenance, cytoskeletal structure, and cell cycle regulation. These data suggest that differential gene expression associated with dietary intake is a possible mechanism by which diet can influence a variety of biological processes and functions. Focused functionality studies evaluating dietary influence on gene expression are needed.
Supplementary Material
Acknowledgements
This study was funded by NCI grants CA48998. The contents of this manuscript are solely the responsibility of the authors and do not necessarily represent the official view of the National Cancer Institute. The authors would like to acknowledge the contributions of Dr Bette Caan, Judy Morse, and Donna Schaffer, the Kaiser Permanente Medical Research Program, Sandra Edwards for data collection and organization, Jennifer Herrick for data management, Erica Wolff and Michael Hoffman for RNA extraction, Wade Samowitz for slide review, and Brett Milash at the Bioinformatics Core Facility at the University of Utah.
Author contributions: A.P. compiled and interpreted the data and wrote the manuscript; D.P. carried out statistical analysis, edited the manuscript, and approved the final version of manuscript; L.M. carried out bioinformatics analysis and approved the final manuscript; and R.W. oversaw RNAseq data collection and approved the final manuscript. M.S. obtained funding, collected data for study, assisted in data interpretation and analysis, edited, and approved the final manuscript.
Conflicts of interest
There are no conflicts of interest.
References
- 1.McDaniel J, Askew W, Bennett D, Mihalopoulos J, Anantharaman S, Fjeldstad AS, et al. Bison meat has a lower atherogenic risk than beef in healthy men. Nutr Res 2013; 33:293–302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Torres N, Guevara-Cruz M, Velazquez-Villegas LA, Tovar AR. Nutrition and atherosclerosis. Arch Med Res 2015; 46:408–426. [DOI] [PubMed] [Google Scholar]
- 3.Sarbagili-Shabat C, Sigall-Boneh R, Levine A. Nutritional therapy in inflammatory bowel disease. Curr Opin Gastroenterol 2015; 31:303–308. [DOI] [PubMed] [Google Scholar]
- 4.Yang CQ, Shu L, Wang S, Wang JJ, Zhou Y, Xuan YJ, Wang SF. Dietary patterns modulate the risk of non-alcoholic fatty liver disease in Chinese adults. Nutrients 2015; 7:4778–4791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Garcia Caraballo SC, Comhair TM, Houten SM, Dejong CH, Lamers WH, Koehler SE. High-protein diets prevent steatosis and induce hepatic accumulation of monomethyl branched-chain fatty acids. J Nutr Biochem 2014; 25:1263–1274. [DOI] [PubMed] [Google Scholar]
- 6.Jeong EA, Jeon BT, Shin HJ, Kim N, Lee DH, Kim HJ, et al. Ketogenic diet-induced peroxisome proliferator-activated receptor-γ activation decreases neuroinflammation in the mouse hippocampus after kainic acid-induced seizures. Exp Neurol 2011; 232:195–202. [DOI] [PubMed] [Google Scholar]
- 7.Zhu QC, Gao RY, Wu W, Guo BM, Peng JY, Qin HL. Effect of a high-fat diet in development of colonic adenoma in an animal model. World J Gastroenterol 2014; 20:8119–8129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Lauber SN, Ali S, Gooderham NJ. The cooked food derived carcinogen 2-amino-1-methyl-6-phenylimidazo[4,5-b] pyridine is a potent oestrogen: a mechanistic basis for its tissue-specific carcinogenicity. Carcinogenesis 2004; 25:2509–2517. [DOI] [PubMed] [Google Scholar]
- 9.Wang Z, Pei H, Kaeck M, Lu J. Mammary cancer promotion and MAPK activation associated with consumption of a corn oil-based high-fat diet. Nutr Cancer 1999; 34:140–146. [DOI] [PubMed] [Google Scholar]
- 10.Marangoni F, Corsello G, Cricelli C, Ferrara N, Ghiselli A, Lucchin L, Poli A. Role of poultry meat in a balanced diet aimed at maintaining health and wellbeing: an Italian consensus document. Food Nutr Res 2015; 59:27606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Kim HJ, Yoon HJ, Kim SY, Yoon YR. A medium-chain fatty acid, capric acid, inhibits RANKL-induced osteoclast differentiation via the suppression of NF-κB signaling and blocks cytoskeletal organization and survival in mature osteoclasts. Mol Cells 2014; 37:598–604. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Rafi MM, Shafaie Y. Dietary lutein modulates inducible nitric oxide synthase (iNOS) gene and protein expression in mouse macrophage cells (RAW 264.7). Mol Nutr Food Res 2007; 51:333–340. [DOI] [PubMed] [Google Scholar]
- 13.Cao HH, Cheng CY, Su T, Fu XQ, Guo H, Li T, et al. Quercetin inhibits HGF/c-Met signaling and HGF-stimulated melanoma cell migration and invasion. Mol Cancer 2015; 14:103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Lee WJ, Chen WK, Wang CJ, Lin WL, Tseng TH. Apigenin inhibits HGF-promoted invasive growth and metastasis involving blocking PI3K/Akt pathway and beta 4 integrin function in MDA-MB-231 breast cancer cells. Toxicol Appl Pharmacol 2008; 226:178–191. [DOI] [PubMed] [Google Scholar]
- 15.Haar L, Ren X, Liu Y, Koch SE, Goines J, Tranter M, et al. Acute consumption of a high-fat diet prior to ischemia-reperfusion results in cardioprotection through NF-κB-dependent regulation of autophagic pathways. Am J Physiol Heart Circ Physiol 2014; 307:H1705–H1713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Al-Gayyar MM, Al Youssef A, Sherif IO, Shams ME, Abbas A. Protective effects of arjunolic acid against cardiac toxicity induced by oral sodium nitrite: effects on cytokine balance and apoptosis. Life Sci 2014; 111 (1–2):18–26. [DOI] [PubMed] [Google Scholar]
- 17.Ijssennagger N, Rijnierse A, de Wit NJ, Boekschoten MV, Dekker J, Schonewille A, et al. Dietary heme induces acute oxidative stress, but delayed cytotoxicity and compensatory hyperproliferation in mouse colon. Carcinogenesis 2013; 34:1628–1635. [DOI] [PubMed] [Google Scholar]
- 18.Guo X, Yin S, Dong Y, Fan L, Ye M, Lu J, Hu H. Enhanced apoptotic effects by the combination of curcumin and methylseleninic acid: potential role of Mcl-1 and FAK. Mol Carcinog 2013; 52:879–889. [DOI] [PubMed] [Google Scholar]
- 19.Hebels DG, Sveje KM, de Kok MC, van Herwijnen MH, Kuhnle GG, Engels LG, et al. Red meat intake-induced increases in fecal water genotoxicity correlate with pro-carcinogenic gene expression changes in the human colon. Food Chem Toxicol 2012; 50:95–103. [DOI] [PubMed] [Google Scholar]
- 20.Bouchard-Mercier A, Paradis AM, Rudkowska I, Lemieux S, Couture P, Vohl MC. Associations between dietary patterns and gene expression profiles of healthy men and women: a cross-sectional study. Nutr J 2013; 12:24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Slattery ML, Potter J, Caan B, Edwards S, Coates A, Ma KN, Berry TD. Energy balance and colon cancer – beyond physical activity. Cancer Res 1997; 57:75–80. [PubMed] [Google Scholar]
- 22.Slattery ML, Edwards SL, Palmer L, Curtin K, Morse J, Anderson K, Samowitz W. Use of archival tissue in epidemiologic studies: collection procedures and assessment of potential sources of bias. Mutat Res 2000; 432 (1–2):7–14. [DOI] [PubMed] [Google Scholar]
- 23.Edwards S, Slattery ML, Mori M, Berry TD, Caan BJ, Palmer P, Potter JD. Objective system for interviewer performance evaluation for use in epidemiologic studies. Am J Epidemiol 1994; 140:1020–1028. [DOI] [PubMed] [Google Scholar]
- 24.Liu K, Slattery M, Jacobs D, Jr, Cutter G, McDonald A, Van Horn L, et al. A study of the reliability and comparative validity of the cardia dietary history. Ethn Dis 1994; 4:15–27. [PubMed] [Google Scholar]
- 25.Slattery ML, Boucher KM, Caan BJ, Potter JD, Ma KN. Eating patterns and risk of colon cancer. Am J Epidemiol 1998; 148:4–16. [DOI] [PubMed] [Google Scholar]
- 26.Kampman E, Slattery ML, Bigler J, Leppert M, Samowitz W, Caan BJ, Potter JD. Meat consumption, genetic susceptibility, and colon cancer risk: a United States multicenter case–control study. Cancer Epidemiol Biomarkers Prev 1999; 8:15–24. [PubMed] [Google Scholar]
- 27.Li P, Conley A, Zhang H, Kim HL. Whole-Transcriptome profiling of formalin-fixed, paraffin-embedded renal cell carcinoma by RNA-seq. BMC Genomics 2014; 15:1087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Slattery ML, Herrick JS, Mullany LE, Gertz J, Wolff RK. Improved survival among colon cancer patients with increased differentially expressed pathways. BMC Med 2015; 13:75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc 1995; 57:289–300. [Google Scholar]
- 30.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 2014; 15:550. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.QIAGEN’s Ingenuity Pathway Analysis; 2014, Valencia, California, USA. Available at: http://www.ingenuity.com/products/ipa. [Accessed March 2016].
- 32.Savli H, Szendröi A, Romics I, Nagy B. Gene network and canonical pathway analysis in prostate cancer: a microarray study. Exp Mol Med 2008; 40:176–185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Li Y, Carrillo JA, Ding Y, He Y, Zhao C, Zan L, Song J. Ruminal transcriptomic analysis of grass-fed and grain-fed angus beef cattle. PLoS One 2015; 10:e0116437. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Baranzini SE, Galwey NW, Wang J, Khankhanian P, Lindberg R, Pelletier D, et al. Pathway and network-based analysis of genome-wide association studies in multiple sclerosis. Hum Mol Genet 2009; 18:2078–2090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Yan-Fang T, Dong W, Li P, Wen-Li Z, Jun L, Na W, et al. Analyzing the gene expression profile of pediatric acute myeloid leukemia with real-time PCR arrays. Cancer Cell Int 2012; 12:40. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Naito Y, Kuroda M, Mizushima K, Takagi T, Handa O, Kokura S, et al. Transcriptome analysis for cytoprotective actions of rebamipide against indomethacin-induced gastric mucosal injury in rats. J Clin Biochem Nutr 2007; 41:202–210. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Reyes-Gibby CC, Yuan C, Wang J, Yeung SC, Shete S. Gene network analysis shows immune-signaling and ERK1/2 as novel genetic markers for multiple addiction phenotypes: alcohol, smoking and opioid addiction. BMC Syst Biol 2015; 9:25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Jia P, Kao CF, Kuo PH, Zhao Z. A comprehensive network and pathway analysis of candidate genes in major depressive disorder. BMC Syst Biol 2011; 5 (Suppl 3):S12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Stolz A, Bastians H. Fresh WNT into the regulation of mitosis. Cell Cycle 2015; 14:2566–2570. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Lv ZD, Zhang L, Liu XP, Jin LY, Dong Q, Li FN, et al. NKD1 down-regulation is associated with poor prognosis in breast invasive ductal carcinoma. Int J Clin Exp Pathol 2015; 8:4015–4021. [PMC free article] [PubMed] [Google Scholar]
- 41.Piccardi B, Palumbo V, Nesi M, Nencini P, Gori AM, Giusti B, et al. Unbalanced metalloproteinase-9 and tissue inhibitors of metalloproteinases ratios predict hemorrhagic transformation of lesion in ischemic stroke patients treated with thrombolysis: results from the MAGIC study. Front Neurol 2015; 6:121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Hannan MT, Tucker KL, Dawson-Hughes B, Cupples LA, Felson DT, Kiel DP. Effect of dietary protein on bone loss in elderly men and women: the Framingham Osteoporosis Study. J Bone Miner Res 2000; 15:2504–2512. [DOI] [PubMed] [Google Scholar]
- 43.Ng GZ, Menheniott TR, Every AL, Stent A, Judd LM, Chionh YT, et al. The MUC1 mucin protects against Helicobacter pylori pathogenesis in mice by regulation of the NLRP3 inflammasome. Gut 2015. [Epub ahead of print]. [DOI] [PubMed] [Google Scholar]
- 44.Shibahara H, Higashi M, Yokoyama S, Rousseau K, Kitazono I, Osako M, et al. A comprehensive expression analysis of mucins in appendiceal carcinoma in a multicenter study: MUC3 is a novel prognostic factor. PLoS One 2014; 9:e115613. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Nardinocchi L, Sonego G, Passarelli F, Avitabile S, Scarponi C, Failla CM, et al. Interleukin-17 and interleukin-22 promote tumor progression in human nonmelanoma skin cancer. Eur J Immunol 2015; 45:922–931. [DOI] [PubMed] [Google Scholar]
- 46.Andersen V, Olsen A, Carbonnel F, Tjonneland A, Vogel U. Diet and risk of inflammatory bowel disease. Dig Liver Dis 2012; 44:185–194. [DOI] [PubMed] [Google Scholar]
- 47.Vlaicu SI, Tatomir A, Rus V, Mekala AP, Mircea PA, Niculescu F, Rus H. The role of complement activation in atherogenesis: the first 40 years. Immunol Res 2016; 64:1–13. [DOI] [PubMed] [Google Scholar]
- 48.Hovland A, Jonasson L, Garred P, Yndestad A, Aukrust P, Lappegård KT, et al. The complement system and toll-like receptors as integrated players in the pathophysiology of atherosclerosis. Atherosclerosis 2015; 241:480–494. [DOI] [PubMed] [Google Scholar]
- 49.Jiang Y, Liao L, Shrestha C, Li D, Li M, Mu Y, et al. Inhibition of 4-nitroquinoline-1-oxide-induced oral carcinogenesis by dietary calcium. Int J Clin Exp Pathol 2015; 8:3529–3542. [PMC free article] [PubMed] [Google Scholar]
- 50.Mönch S, Netzel M, Netzel G, Ott U, Frank T, Rychlik M. Folate bioavailability from foods rich in folates assessed in a short term human study using stable isotope dilution assays. Food Funct 2015; 6:242–248. [DOI] [PubMed] [Google Scholar]
- 51.Wang B, Feng L, Jiang WD, Wu P, Kuang SY, Jiang J, et al. Copper-induced tight junction mRNA expression changes, apoptosis and antioxidant responses via NF-κB, TOR and Nrf2 signaling molecules in the gills of fish: preventive role of arginine. Aquat Toxicol 2015; 158:125–137. [DOI] [PubMed] [Google Scholar]
- 52.Jiang WD, Wen HL, Liu Y, Jiang J, Kuang SY, Wu P, et al. The tight junction protein transcript abundance changes and oxidative damage by tryptophan deficiency or excess are related to the modulation of the signalling molecules, NF-kappaB p65, TOR, caspase-(3,8,9) and Nrf2 mRNA levels, in the gill of young grass carp (Ctenopharyngodon idellus). Fish Shellfish Immunol 2015; 46:168–180. [DOI] [PubMed] [Google Scholar]
- 53.Jeong W, Chang TS, Boja ES, Fales HM, Rhee SG. Roles of TRP14, a thioredoxin-related protein in tumor necrosis factor-alpha signaling pathways. J Biol Chem 2004; 279:3151–3159. [DOI] [PubMed] [Google Scholar]
- 54.Tremblay F, Lavigne C, Jacques H, Marette A. Dietary cod protein restores insulin-induced activation of phosphatidylinositol 3-kinase/Akt and GLUT4 translocation to the T-tubules in skeletal muscle of high-fat-fed obese rats. Diabetes 2003; 52:29–37. [DOI] [PubMed] [Google Scholar]
- 55.Kim YH, Kim JL, Lee EJ, Park SH, Han SY, Kang SA, Kang YH. Fisetin antagonizes cell fusion, cytoskeletal organization and bone resorption in RANKL-differentiated murine macrophages. J Nutr Biochem 2014; 25:295–303. [DOI] [PubMed] [Google Scholar]
- 56.Farzaei MH, Abdollahi M, Rahimi R. Role of dietary polyphenols in the management of peptic ulcer. World J Gastroenterol 2015; 21:6499–6517. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Buddaseth S, Göttmann W, Blasczyk R, Huyton T. Dysregulation of cell cycle control caused by overexpression of the oncogene pp32r1 (ANP32C) and the Tyr>His mutant pp32r1Y140H. Biochim Biophys Acta 2013; 1833:1212–1221. [DOI] [PubMed] [Google Scholar]
- 58.Hamamoto R, Furukawa Y, Morita M, Iimura Y, Silva FP, Li M, et al. SMYD3 encodes a histone methyltransferase involved in the proliferation of cancer cells. Nat Cell Biol 2004; 6:731–740. [DOI] [PubMed] [Google Scholar]
- 59.Hasgall PA, Hoogewijs D, Faza MB, Panse VG, Wenger RH, Camenisch G. The putative RNA helicase HELZ promotes cell proliferation, translation initiation and ribosomal protein S6 phosphorylation. PLoS One 2011; 6:e22107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.van Dijk SJ, Feskens EJ, Bos MB, Hoelen DW, Heijligenberg R, Bromhaar MG, et al. A saturated fatty acid-rich diet induces an obesity-linked proinflammatory gene expression profile in adipose tissue of subjects at risk of metabolic syndrome. Am J Clin Nutr 2009; 90:1656–1664. [DOI] [PubMed] [Google Scholar]
- 61.Walczak K, Turski WA, Rajtar G. Kynurenic acid inhibits colon cancer proliferation in vitro: effects on signaling pathways. Amino Acids 2014; 46:2393–2401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Creton SK, Zhu H, Gooderham NJ. The cooked meat carcinogen 2-amino-1-methyl-6-phenylimidazo[4,5-b]pyridine activates the extracellular signal regulated kinase mitogen-activated protein kinase pathway. Cancer Res 2007; 67:11455–11462. [DOI] [PubMed] [Google Scholar]
- 63.Slattery ML, Lundgreen A, Herrick JS, Caan BJ, Potter JD, Wolff RK. Associations between genetic variation in RUNX1, RUNX2, RUNX3, MAPK1 and eIF4E and riskof colon and rectal cancer: additional support for a TGF-β-signaling pathway. Carcinogenesis 2011; 32:318–326. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Slattery ML, Lundgreen A, Herrick JS, Wolff RK. Genetic variation in RPS6KA1, RPS6KA2, RPS6KB1, RPS6KB2, and PDK1 and risk of colon or rectal cancer. Mutat Res 2011; 706:13–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Slattery ML, Herrick JS, Lundgreen A, Wolff RK. Genetic variation in the TGF-beta-signaling pathway and colon and rectal cancer risk. Cancer Epidemiol Biomarkers Prev 2011; 20:57–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Slattery ML, Lundgreen A, Bondurant KL, Wolff RK. Tumor necrosis factor-related genes and colon and rectal cancer. Int J Mol Epidemiol Genet 2011; 2:328–338. [PMC free article] [PubMed] [Google Scholar]
- 67.Dorr C, Wu B, Guan W, Muthusamy A, Sanghavi K, Schladt DP, et al. Differentially expressed gene transcripts using RNA sequencing from the blood of immunosuppressed kidney allograft recipients. PLoS One 2015; 10:e0125045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Guo Y, Sheng Q, Li J, Ye F, Samuels DC, Shyr Y. Large scale comparison of gene expression levels by microarrays and RNAseq using TCGA data. PLoS One 2013; 8:e71462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Slattery ML, Pellatt DF, Mullany LE, Wolff RK. Differential gene expression in colon tissue associated with diet, lifestyle, and related oxidative stress. PLoS One 2015; 10:e0134406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Figueiredo JC, Hsu L, Hutter CM, Lin Y, Campbell PT, Baron JA, et al. Genome-wide diet-gene interaction analyses for risk of colorectal cancer. PLoS Genet 2014; 10:e1004228. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.