Skip to main content
G3: Genes | Genomes | Genetics logoLink to G3: Genes | Genomes | Genetics
. 2015 Jan 12;5(3):417–425. doi: 10.1534/g3.114.016477

A Genome-Wide Association Study for Nutritional Indices in Drosophila

Robert L Unckless 1,1, Susan M Rottschaefer 1, Brian P Lazzaro 1
PMCID: PMC4349095  PMID: 25583649

Abstract

Individuals are genetically variable for the way in which they process nutrients and in the effects of dietary content on reproductive success, immunity, and development. Here, we surveyed genetic variation for nutrient stores (glucose, glycogen, glycerol, protein, triglycerides, and wet weight) in the Drosophila Genetic Reference Panel (DGRP) after rearing the flies on either a low-glucose or high-glucose diet. We found significant genetic variation for these nutritional phenotypes and identified candidate genes that underlie that variation using genome-wide associations. In addition, we found several significant correlations between the nutritional phenotypes measured in this study and other previously published phenotypes, such as starvation stress resistance, oxidative stress sensitivity, and endoplasmic reticulum stress, which reinforce the notion that these lines can be used to robustly measure related phenotypes across distinct laboratories.

Keywords: Drosophila, DGRP, glycogen, glucose, triglyceride, protein, weight


The quality of dietary nutrition and the assimilation of dietary nutrients have significant influence on many traits, including lifespan (Piper et al. 2005; Piper and Partridge 2007; Skorupa et al. 2008), development (Layalle et al. 2005), reproduction (Fricke et al. 2008), and immunity (Ayres and Schneider 2009; Fellous and Lazzaro 2010; Vass and Nappi 1998). Resources such as the Drosophila Genetic Reference Panel (DGRP) provide a practical means of using natural genetic variation to both untangle the genetic basis of complex traits and understand the intersection of selection and genetics in the maintenance of that variation (Mackay et al. 2012). The DGRP is a set of approximately 200 D. melanogaster genetic lines that have been genome-sequenced and are available to the community for the mapping of complex genetic traits. Here, we present the results of a genome-wide scan for SNPs associated with several nutritional indices measured after rearing on either a low -glucose (1 glucose: 2 yeast) diet or a high-glucose (2 glucose: 1 yeast) diet. We found significant genetic variation for all traits (total soluble protein, glucose, glycogen, free glycerol, triglycerides, and wet weight) and were able to map underlying genes. We additionally note correlations between our nutritional indices and several previously published DGRP phenotypes (Mackay et al. 2012; Jordan et al. 2012; Ayroles et al. 2009; Chow et al. 2013b).

Materials and Methods

Drosophila stocks and husbandry

We assayed nutritional indices in the DGRP (Mackay et al. 2012), a collection of approximately 200 inbred lines of Drosophila melanogaster derived from wild-caught females (2003, Raleigh, NC). Our study utilized 172 of these lines, although not every line was available for every day of the experiment.

Before measuring any phenotypes, each line was reared for at least three generations on two diets that varied in glucose content. The low-glucose diet consisted of 5% weight by volume brewer’s yeast (MP Biomedicals, Santa Ana, CA), 2.5% glucose (Sigma-Aldrich, St. Louis, MO), and 1% Drosophila agar (Genesee Scientific, San Diego, CA) supplemented with 800 mg/L methyl paraben (Sigma-Aldrich), and 6 mg/L carbendazim (Sigma-Aldrich). The high-glucose diet was exactly the same but consisted of 10% glucose.

Nutrient indices in the DGRP

We assayed nutritional indices in pools of 10 adult males from each line aged 3–6 days after eclosion. We measured free glucose, glycogen stores, total triglycerides, free glycerol, and soluble protein in groups of 10 male flies, with three biological replicates of rearing on each diet. Each group of flies was weighed using a MX5 microbalance (Mettler-Toledo, Columbus, OH) and then homogenized in 200 μL buffer (10 mM Tris, 1 mM EDTA, pH 8.0 with 0.1% v/v Triton-X-100) using lysing matrix D (MP Biomedicals, Santa Ana, CA) on a FastPrep-24 homogenizer (MP Biomedicals). We immediately froze 50 μL of the homogenate to be used for the total protein assay and incubated the remaining 150 μL at 72° for 20 min to denature enzymes naturally present in the homogenate. Each nutritional index was assayed using modifications of commercially available kits (see Unckless et al. unpublished data; Ridley et al. 2012): glucose with the oxidase kit (GAGO-20; Sigma-Aldrich); glycogen using the glucose kit and amyloglucosidase from Aspergillus niger (A7420; Sigma-Aldrich) in 10 mM acetate buffer at pH 4.6; free glycerol and triglycerides using reagent kits F6428 and T2449, respectively (Sigma-Aldrich); and soluble protein with the DC Protein Assay (BIO-RAD, Hercules, CA).

Data analysis

Before genome-wide association mapping, we estimated line means for each nutritional index using abundance of metabolite per mg of fly. The model used was:

Yijklmn=m+Wolbi+Dietj+Linek(Wolbi)+Blockn(Dietj)+Dietj×Linek(Wolbi)+eijklmn

where Y is the estimated mass (μg) per fly of each nutrient divided by the mass of the flies measured in mg (except, obviously, in the case where wet mass is itself the response variable). Wolbi (i = 1,2) indicates endosymbiotic bacterium Wolbachia pipientis infection status, Dietj (j = 1,2) indicates rearing diet, Blockn(Dietj) (n = 1,3) differentiates among the three replicate blocks within each diet, Linek(Wolbi) (k = 1,2,…,172) tests the influence of inbred line on nutritional index nested within Wolbachia infection status (52.2% of lines were infected), and the Dietj × Linek(Wolbi) interaction term tests whether inbred lines differ in their responsiveness to the two diets. All factors were considered fixed. All models were run in SAS 9.3 (Cary, NC) using the “GLM” procedure and least squares means were extracted. For modeling on each diet individually, the model used was: Yijkl=m+Wolbi+Linek(Wolbi)+Blockl+eikl.

We also obtained a more holistic view of fly metabolic status by performing a principal component analysis on the collective set of nutritional measures, excluding wet weight, because that is implicitly contained in mass-scaled measures of individual nutrients. This allowed us to distill the higher-order interactions of our nutritional phenotypes into several one-dimensional components. Line estimates for each nutritional principal component were determined using the prcomp function in R (R Core Team 2012) with tol = 0.1 and unit variance scaling turned on. This analysis was completed with flies reared on the two diets considered separately.

Genome-wide association mapping

The set of SNPs for genome-wide association mapping was described in Huang et al. (2014) and consists of only SNPs with minor alleles present in at least four of the lines (MAF >2%; 2,415,518 total SNPs). For genome-wide associations, we formatted this SNP set for PLINK-assessed (Purcell et al. 2007) associations between SNP and line estimates from the above models using the “–assoc” flag to perform associations and the “–qt-means” flag for estimates of effect size. PLINK uses an ordinary least squares model for each SNP. These analyses were performed for the high-glucose diet, low-glucose diet, and when data from both diets were pooled. We used a nominal P value threshold of P < 10−6 for declaring SNPs to be significantly associated with trait variation but relaxed this to P < 10−4 for gene ontology enrichment analysis (see below).

GO term analysis

We performed Gene Ontology (GO) analysis corrected for gene size using GOWINDA (Kofler and Schlötterer 2012) to test for the enrichment of particular functional groups in genes bearing SNPs associated with variation in phenotypic traits. Significantly associated SNPs (P < 10−4) for each treatment (low glucose, high glucose, main effect) were used as the query set with a background SNP set consisting of all remaining SNPs used in the genome-wide mapping. We used this relaxed P value threshold to increase the number of significant SNPs in this analysis. GO slim (Adams et al. 2000) terms were used to reduce redundancy in GO categories. GOWINDA was run using gene mode, including all SNPs within 1000 bp of a gene, a minimum gene number of 5, and with 100,000 simulations. We report all GO terms with a nominal P < 0.1.

Phenotypic correlations with other traits

We examined correlations among our measured traits, and between our nutritional phenotypes and independent traits that have been measured in the DGRP lines by other research groups. Correlation analyses were performed in R (R Core Team 2012) using our line mean estimates, and we report both correlation coefficient and P value. For significantly correlated traits, we queried whether a single gene or a few genes might drive the correlation by determining whether the same SNPs were significantly associated with variation in both traits with a relaxed P value threshold of 10−5.

Results

Genetic and environmental variation for nutritional status across the DGRP

ANOVA for each nutritional index (both pooled across diets and on each diet individually) is presented in Supporting Information, Table S1. When the data from each diet are analyzed separately, all nutritional indexes showed a significant (or nearly significant) line effect except soluble protein after rearing on the low-glucose diet and triglycerides after rearing on the high-glucose diet (Table S1b), indicating that most traits are genetically variable. When the data from both diets were pooled, all nutritional indices except free triglycerides and glycogen showed a significant effect of rearing diet, with glucose, glycerol, and triglycerides occurring at higher levels in flies reared on the high-glucose diet, whereas glycogen, soluble protein, and total wet mass were lower in flies reared on the high-glucose diet. All nutritional indices showed a significant effect of line. Only wet weight showed a significant interaction between line and diet (Table S1a). In addition, total soluble protein showed a significant effect of Wolbachia infection status (P = 0.047). All phenotypic values are presented in Table S2.

Principal components of nutritional indices

We considered that our nutritional indices might give more information about the metabolic status of the fly when considered in aggregate, so we used a principle component (PC) analysis to extract the top five PCs from the full nutritional data set. The top five principal components summarizing the NIs on each diet each explain 12–31% of the total in nutritional state, with loadings of each NI given in Table S3. Principal component loadings show variation in both sign and magnitude of contribution from each NI, suggesting they capture complex integrations of the nutritional indices to reflect overall metabolic state.

Phenotypic correlations with other traits

We measured correlations between our nutritional phenotypes and several other traits that have been measured in the DGRP and for which the data are publically available (starvation stress resistance, chill coma recovery, startle response, oxidative stress response, endoplasmic reticulum stress) (Mackay et al. 2012; Jordan et al. 2012; Chow et al. 2013b). Table 1 contains the correlation coefficient and P value for each trait combination. Note that for all nutritional indices, we present correlations between other phenotypes and line means estimated when data from both diets were pooled. We did not perform principal components analysis on this pooled data; however, diet-specific principal components were used for the analysis.

Table 1. Correlations between our nutritional indices and traits previously measured by other groups: principal components.

Phenotype Starvation Stress Resistance Chill Coma Startle Response Paraquat MSB ER Hazard Ratio ER T50 Male Reproductive Fitnessd Lifespand Male Aggressiond Matingd Ethanol Toleranced
Source 1 1 1 2 2 3 3 4 4 4 4 4
Glucose 0.246b −0.245b 0.309b 0.069 0.043 −0.057 0.076 −0.134 −0.073 0.303 −0.043 −0.048
Glycogen 0.307c −0.168a 0.249b 0.186a 0.197a 0.099 0.279b 0.163 0.183 0.224 0.212 0.041
Glycerol 0.079 0.005 0.008 0.012 −0.013 −0.020 0.286b −0.050 −0.227 −0.235 −0.175 −0.225
Triglycerides −0.071 −0.081 0.003 −0.045 −0.107 −0.038 -0.236a 0.067 0.042 0.245 0.244 −0.129
Protein −0.113 −0.177a −0.093 −0.183a −0.037 −0.157 −0.178 −0.428a 0.057 −0.285 −0.030 0.178
Wet weight 0.241b −0.238b 0.030 0.180a 0.191a −0.175 0.165 −0.146 0.166 −0.358a 0.066 0.231
LGD PC1 −0.261b −0.090 −0.143 −0.132 −0.136 −0.058 −0.305b 0.148 0.066 0.111 0.115 0.124
LGD PC2 0.207a −0.052 0.177a 0.152 0.114 0.072 0.037 0.184 0.113 0.242 0.288 0.050
LGD PC3 0.226b −0.318c 0.013 0.152 0.090 0.196a 0.368c 0.214 0.058 0.021 −0.280 0.155
LGD PC4 −0.098 −0.110 0 −0.020 −0.136 0.042 0.049 0.217 −0.462b −0.003 0.013 0.105
LGD PC5 0.087 −0.070 0.093 0.016 −0.065 −0.052 −0.054 0.459b −0.027 0.158 −0.130 0.021
HGD PC1 0.309c −0.248b 0.279c 0.096 0.051 −0.016 0.186 −0.015 0.086 0.338 0.104 −0.112
HGD PC2 −0.003 −0.019 −0.102 −0.093 −0.038 −0.043 0.091 −0.154 0.006 −0.281 −0196 0.120
HGD PC3 −0.035 −0.035 −0.068 −0.070 −0.079 0.046 −0.251a −0.232 0.292 0.166 −0.042 0.025
HGD PC4 0.028 −0.087 −0.094 0.026 −0.096 −0.044 0.016 −0.008 −0.026 −0.043 −0.082 −0.737c
HGD PC5 0.135 −0.116 −0.001 0.141 0.096 0.163 0.216a 0.059 0.102 0.144 0.253 −0.086

All nutritional indices (protein, glucose, etc.) are values found when data from both diets were pooled. For correlation coefficients, cells in italics are P < 0.05 and cells in bold are P < 0.01. HGD, high-glucose diet; LGD, low -glucose diet; 1, Mackay et al. 2012; 2, Jordan et al. 2012; 3, Chow, Wolfner, and Clark 2013b; 4, Ayroles et al. 2009.

a

P < 0.05

b

P < 0.01.

c

P < 0.001.

d

Correlations performed with only 40 DGRP lines.

Several interesting correlations are evident. In particular, starvation stress resistance as measured by Mackay et al. (2012) is correlated with several metabolic principal components and is positively correlated with wet weight (P = 0.005) and with levels of glucose (P = 0.004) and glycogen (P < 0.001). Chill coma recovery, also measured by Mackay et al. (2012), is correlated with two metabolic principal components as well as with wet weight (P = 0.005), levels of glucose (P = 0.004), glycogen (P = 0.048), and protein (P = 0.038). Startle response (Mackay et al. 2012) is correlated with two metabolic principal components and with glucose (P < 0.001) and triglyceride (P = 0.003) levels. Sensitivity to oxidative stress, induced by either paraquat or menadione sodium bisulfate (MSB) (Jordan et al. 2012), was positively correlated with glycogen stores (P = 0.029 and P = 0.021, respectively) and wet weight (P = 0.035 and P = 0.025, respectively). Sensitivity to paraquat was also negatively correlated with soluble protein (P = 0.032). Interestingly, several nutritional indices were significantly correlated with time to 50% mortality after endoplasmic reticulum stress (ER T50) (Chow et al. 2013b), including glycogen stores (P = 0.005), glycerol level (P = 0.004), total triglycerides (P = 0.020), as well as PC1 and PC3 on the low-glucose diet and PC1 on the high-glucose diet.

Phenotypic values for male reproductive fitness, male aggression, lifespan, and ethanol tolerance were also reported for a smaller set of 40 DGRP lines (Ayroles et al. 2009). With only 40 lines, we have less power to find correlations with these data, although we do still detect some significant correlations. Male reproductive fitness (proportion of offspring sired during competition for matings with males from a marked stock) is negatively correlated with our measure of soluble protein (P = 0.015) and positively correlated with low-glucose diet PC5. Lifespan is positively correlated with low-glucose diet PC4. Surprisingly, male aggression as determined by Ayroles et al. was negatively correlated with our measure of wet weight (P = 0.044), where we might have naively expected larger flies to be more aggressive. Finally, ethanol tolerance is significantly positively correlated with high-glucose PC4.

Genome-wide association results

SNPs that are significantly associated with variation in each nutritional phenotype (P < 10−6) are presented in Table 2 and Table 3. Overall, SNPs significantly associated with variation in our nutritional phenotypes are disproportionately found as nonsynonymous substitutions or in introns and UTRs, as opposed to synonymous substitutions or positions more than 1000 bp from known genes, relative to the distribution of all variants across the genome. For the nutritional indices, 33 out of 48 (69%) total significantly associated SNPs across phenotypes and diets are found in introns, UTRs, less than 1000 bp from an annotated gene, or as nonsynonymous SNPs. For principal components, this fraction is 17 of 24 (71%). In contrast, less than half of all SNPs meeting criteria for inclusion in this study are found in introns or UTRs, are less than 1000 bp from an annotated gene, or are nonsynonymous. This enrichment for putatively functional SNPs is significant (χ2 = 6.75, df = 1, P = 0.009 for nutritional indices; χ2 = 4.17, df = 1, P = 0.041 for principal components). For example, across the three mapping strategies (low glucose, high glucose and data pooled across diets), there were seven unique SNPs meeting our threshold for association with glucose levels. Of these, one was synonymous and one was not associated with any known gene. The remaining five mapped SNPs were intronic. For triglyceride levels, all four significantly associated SNPs were intronic. Each SNP that associates significantly with variation in a measured phenotype is given in Table 2, including significance level, estimated effect size, minor allele frequency, type of SNP, and gene functional categorization. No SNPs were significantly associated with more than one distinct nutritional phenotype, even when the significance threshold was relaxed to 10−5.

Table 2. SNPs significantly associated with variation in nutritional indices at P < 10−6.

NI Diet SNP P Effect MAF Gene FBgn Type Function Reference
Glucose HGD 3L.4811585 4.83E-07 −0.020 0.394 Dhc64C FBgn0051025 Syn. Cellularization Papoulas et al. 2005
3R.6404817 3.41E-07 −0.022 0.222 hth FBgn0001235 Intron Brain development Nagao et al. 2000
3R.23998828 9.64E-07 −0.021 0.307 CG34354 FBgn0085383 Intron Nucleic acid binding Tweedie et al. 2009
LGD NA NA NA NA NA NA NA NA NA
Pooled 3R.6290881 8.25E-07 −0.023 0.075 NA NA NA NA NA
3R.6404817 1.35E-07 −0.015 0.226 hth FBgn0001235 Intron Brain development Nagao et al. 2000
3R.6440408 9.67E-07 −0.016 0.176 hth FBgn0001235 Intron Brain development Nagao et al. 2000
3R.6446314 8.68E-07 −0.016 0.179 hth FBgn0001235 Intron Brain development Nagao et al. 2000
3R.6455818 3.57E-08 −0.018 0.154 hth FBgn0001235 Intron Brain development Nagao et al. 2000
Glycerol HGD 2R.18726642 5.41E-07 0.014 0.297 CG9825 FBgn0034783 Syn. Transmembrane transport Tweedie et al. 2009
LGD X.6541116 4.83E-07 0.013 0.224 pig FBgn0029881 Intron Small body Tweedie et al. 2009
X.6541138 2.35E-07 0.014 0.229 pig FBgn0029881 Intron Small body Tweedie et al. 2009
X.6541155 7.14E-07 0.013 0.229 pig FBgn0029881 Intron Small body Tweedie et al. 2009
X.6541215 2.24E-07 0.014 0.215 pig FBgn0029881 Intron Small body Tweedie et al. 2009
Pooled 2L.307423 1.16E-07 −0.016 0.141 Plc21C FBgn0004611 Intron Lipid catabolic process Tweedie et al. 2009
3L.11508784 7.70E-07 0.011 0.353 CG7512 FBgn0036168 Intron Metal ion binding Tweedie et al. 2009
3R.14453686 1.54E-07 −0.010 0.482 Qin FBgn0263974 Intron Protein autoubiquination Tweedie et al. 2009
Glycogen HGD 2R.7673484 4.56E-07 −0.023 0.210 ths FBgn0033652 Intron Fibroblast growth factor binding Itoh and Ornitz 2004
2R.17598285 2.15E-07 0.021 0.328 CG30403 FBgn0050403 Intron DNA binding Tweedie et al. 2009
2R.17598285 2.14E-07 0.021 0.338 CG30403 FBgn0050403 Intron DNA binding Tweedie et al. 2009
LGD 2L.8316116 4.93E-07 −0.022 0.073 CG7806 FBgn0032018 Syn. Transmembrane transport Tweedie et al. 2009
Pooled 2L.11397732 4.66E-07 −0.023 0.217 NA NA NA NA NA
Mean weight HGD NA NA NA NA NA NA NA NA NA
LGD 2L.3261343 6.08E-07 56.1 0.321 NA NA NA NA NA
2L.3271697 7.11E-07 53.6 0.420 CG3347 FBgn0031513 3′ UTR Zinc ion binding Tweedie et al. 2009
3R.25948794 7.31E-07 60.9 0.239 CG45072 FBgn0266442 Nonsyn. Unknown NA
3R.25948812 8.9E-07 60.1 0.245 CG45072 FBgn0266442 5′ UTR Unknown NA
3R.25952830 4.90E-09 84.3 0.150 Ppi1 FBgn0051025 Nonsyn. Protein phosphatase inhibitor Bennett et al. 2006
3R.25952966 1.37E-08 83.2 0.143 Ppi1 FBgn0051025 Syn. Protein phosphatase inhibitor Bennett et al. 2006
3R.25953010 9.67E-09 78.5 0.169 Ppi1 FBgn0051025 Nonsyn. Protein phosphatase inhibitor Bennett et al. 2006
3R.25953104 5.00E-09 84.1 0.152 Ppi1 FBgn0051025 Syn. Protein phosphatase inhibitor Bennett et al. 2006
3R.25953203 2.30E-08 76.4 0.155 Ppi1 FBgn0051025 Syn. Protein phosphatase inhibitor Bennett et al. 2006
3R.25953305 4.36E-08 76.9 0.161 Ppi1 FBgn0051025 Syn. Protein phosphatase inhibitor Bennett et al. 2006
Pooled 3R.25952830 2.90E-08 73.9 0.148 Ppi1 FBgn0051025 Nonsyn. Protein phosphatase inhibitor Bennett et al. 2006
3R.25952966 1.21E-07 71.5 0.141 Ppi1 FBgn0051025 Syn. Protein phosphatase inhibitor Bennett et al. 2006
3R.25953010 3.49E-07 65.1 0.168 Ppi1 FBgn0051025 Nonsyn. Protein phosphatase inhibitor Bennett et al. 2006
3R.25953104 5.44E-08 72.3 0.150 Ppi1 FBgn0051025 Syn. Protein phosphatase inhibitor Bennett et al. 2006
X.5877626 5.01E-07 −48.3 0.423 Grip FBgn0029830 Intron Glutamate receptor binding; muscle attachment Swan et al. 2004
Protein HGD NA NA NA NA NA NA NA NA NA
LGD 2L.1888490 6.48E-07 −0.016 0.298 CG7337 FBgn0031374 Intron Quinonprotein alcohol dehydrogenase activity Tweedie et al. 2009
2L.7008495 2.62E-07 −0.021 0.149 uif FBgn0031879 5′ UTR Notch binding Tweedie et al. 2009
3R.4370437 7.68E-07 −0.020 0.145 NA NA NA NA NA
3R.15771872 7.28E-07 −0.018 0.216 Hs6st FBgn0038755 Intron Sulfotransferase Ghabrial et al. 2003
Protein (cont.) 3R.18325276 2.41E-07 −0.023 0.140 oa2 FBgn0038980 Intron Octopamine receptor activity Balfanz et al. 2005
Pooled 2R.17648180 6.69E-07 −0.013 0.512 NA NA NA NA NA
3L.6131752 5.82E-07 0.012 0.482 Cpr65Av FBgn0052405 Down (571) Insect cuticle protein Karouzou et al. 2007
Lcp65Ae FBgn0020640 Up (534) Insect cuticle protein Karouzou et al. 2007
Triglycerides HGD X.20411124 3.85E-07 −0.020 0.222 RunxB FBgn0259162 Intron Cellular process Boutros et al. 2004
LGD 2L.4905518 8.54E-07 0.025 0.353 CG2837 FBgn0031646 Intron Unknown NA
2R.15064256 9.66E-07 −0.025 0.331 CG10737 FBgn0034420 Intron Intracellular signal transduction Tweedie et al. 2009
X.5445429 4.50E-07 0.033 0.188 Vsx2 FBgn0263512 Intron DNA binding Tweedie et al. 2009
Pooled NA NA NA NA NA NA NA NA NA

Effect, effect size of minor allele; SNPs labeled NA are not within 1000 bp of an annotated gene. Lines with all NAs indicate no SNPs met significance threshold; MAF, minor allele frequency.

Table 3. SNPs significantly associated with variation in principal components of nutritional indices at P < 10−6.

PC Diet SNP P Effect MAF Gene FBgn Type Function Reference
PC1 HGD 2L.15990382 9.79E-07 −0.988 0.465 CR43412 FBgn0263331 Down (435) Nonprotein coding Tweedie et al. 2009
X.16918901 9.24E-07 1.046 0.300 CG43997 FBgn0264739 Down (923) Unknown NA
PC2 HGD 3L.1235270 7.76E-08 −1.015 0.309 CG33966 FBgn0053966 3′ UTR Vitamin E binding Tweedie et al. 2009
3L.1235273 8.26E-07 −0.927 0.324 CG33966 FBgn0053966 3′ UTR Vitamin E binding Tweedie et al. 2009
3L.2644168 9.74E-07 0.974 0.264 CG14949 FBgn0035358 Up (744) Unknown NA
3L.17498584 9.88E-07 0.970 0.270 Oatp74D FBgn0036732 Intron Organic anion transport Tweedie et al. 2009
PC3 HGD X.20411124 5.38E-07 −0.977 0.222 RunxB FBgn0259162 Intron DNA binding Boutros et al. 2004
PC4 HGD 3R.14205878 2.25E-07 −0.976 0.178 CG7675 FBgn0038610 Intron Glucose/ribitol reductase Tweedie et al. 2009
3R.14206166 2.70E-08 −1.016 0.200 CG7675 FBgn0038610 Intron Glucose/ribitol reductase Tweedie et al. 2009
3R.14206170 2.34E-07 −0.924 0.211 CG7675 FBgn0038610 Intron Glucose/ribitol reductase Tweedie et al. 2009
X.8316242 5.06E-07 −1.630 0.080 NA NA NA NA NA
PC5 HGD 2L.18537420 8.35E-07 −0.817 0.181 Pde11 FBgn0085370 Syn. Phosphodiesterase Day et al. 2005
2L.19068086 9.35E-07 −1.693 0.036 CG10702 FBgn0032752 Intron Protein phosphorylation Tweedie et al. 2009
CG17343 FBgn0032751 Intron Regulation of mitotic anaphase Tweedie et al. 2009
3L.12761401 1.40E-07 −1.800 0.035 CG32113 FBgn0052113 Syn. Vesicle-mediated transport Tweedie et al. 2009
3R.22249607 4.98E-07 −1.144 0.085 CG6503 FBgn0040606 Down (813) Unknown NA
PC1 LGD NA NA NA NA NA NA NA NA NA
PC2 LGD 2L.14109774 3.30E-07 1.122 0.225 CG31769 FBgn0051769 Syn. Unknown NA
2R.6893862 6.94E-07 1.126 0.196 luna FBgn0040765 Intron DNA binding Boutros et al. 2004
X.12603566 9.74E-07 0.889 0.373 Smr FBgn0263865 Intron Regulation of mitotic cell cycle Pile et al. 2002
PC3 LGD 3R.25503463 7.36E-07 0.884 0.302 CAP-D2 FBgn0039680 Intron Mitotic sister chromatin segregation Boutros et al. 2004
3R.25504118 9.66E-07 1.009 0.269 CAP-D2 FBgn0039680 Syn. Mitotic sister chromatin segregation Boutros et al. 2004
PC4 LGD 2R.4176279 6.74E-07 1.640 0.042 NA NA NA NA NA
3L.12262931 6.95E-07 0.666 0.442 Pbgs FBgn0036271 3′ UTR Porphobolinogin synthase Golombieski et al. 2008
3L.12412047 4.3E-07 2.33 0.021 NA NA NA NA NA
3L.13348431 1.72E-07 1.682 0.042 CG17687 FBgn0036348 Down (391) Unknown NA
PC5 LGD NA NA NA NA NA NA NA NA NA

Effect, effect size of minor allele; MAF, minor allele frequency. SNPs labeled NA are not within 1000 bp of an annotated gene. Lines with all NAs indicate no SNPs met significance threshold.

Gene ontology analysis for enrichment

To determine whether the SNPs significantly associated with variation in each phenotype were clustered in genes with particular biological functions, we performed gene ontology (GO) enrichment analysis. Across all NIs and all diets, few categories were even nominally significant for enrichment and none was significant after correcting for multiple testing (Table 4). This may not be surprising because GO analysis of mapping results implicitly assumes the “infinitesimal model” of quantitative genetics, where many genes each contribute small but meaningful effects on the overall phenotype. We have no evidence that this is an appropriate model for our nutritional phenotypes, and we expect that, given the sample size of the DGRP, our experiment lacks power to identify SNPs of small effects.

Table 4. Gene Ontology term enrichment analysis for SNPs with P < 10−4.

Index Low-Glucose Diet High-Glucose Diet Both Diets Pooled
Glucose Apoptosis (P = 0.016) Signal transduction (P = 0.020) Lipid transport (P = 0.030)
DNA binding TF activitya (P = 0.065) Enzyme activator activity (P = 0.039) Protein folding (P = 0.059)
Catalytic activity (P = 0.070) G-protein-coupled receptor (P = 0.068)
Plasma membrane (P = 0.076) Intracellular (P = 0.075)
Receptor activity (P = 0.079)
Glycerol Mitochondrion organization (P = 0.044) DNA packaging (P = 0.052) RNA binding (P = 0.002)
Ion transport (P = 0.055) Structural constituent of ribosome (P = 0.052) Translation (P = 0.041)
Transporter activity (P = 0.056) Ribosome (P = 0.058) Neurotransmitter transporter act. (P = 0.065)
Endopeptidase activity (P = 0.084) Plasma membrane (P = 0.084) Nucleob (P = 0.072)
Transport (P = 0.087) Transport (P = 0.089) Behavior (P = 0.077)
Defense response (P = 0.090)
Endocytosis (P = 0.092)
Apoptosis (P = 0.096)
Glycogen Structural constituent of cytoskeleton (P = 0.051) Multicellular organismal development (P = 0.014) Lipid metabolic process (P = 0.030)
Molecular function (P = 0.095) Lipid transport (P = 0.087)
Transport (P = 0.099)
Mean wet weight Protein kinase activity (P = 0.010) Intracellular (P = 0.005) Cytoskeleton organization (P = 0.032)
Protein modification process (P = 0.025) Cytoskeleton organization (P = 0.009) Extracellular region (P = 0.035)
Response to stress (P = 0.031) Organelle development (P = 0.013)
RNA binding (P = 0.036) Endopeptidase activity (P = 0.033)
Cytosol (P = 0.054) Protein kinase activity (P = 0.039)
Intracellular protein transport (P = 0.086) Peptidase activity (P = 0.049)
Transcription factor binding (P = 0.089) Proteolysis (P = 0.033)
RNA binding (P = 0.072)
Cytoskeleton (P = 0.079)
Protein modification process (P = 0.095)
DNA-dependent transcription (P = 0.096)
Protein Extracellular region (P = 0.008) Cell death (P = 0.010) Centrosome (P = 0.005)
Intracellular (P = 0.049) Sensory perception (P = 0.032)
DNA binding (P = 0.056) Cytoskeleton organization (P = 0.058)
Extracellular region (P = 0.061) Neurotransmitter transporter activity (P = 0.099)
Molecular function (P = 0.079)
Triglyceride Motor activity (P = 0.029) DNA binding TF activitya (P = 0.012) Transporter activity (P = 0.005)
Cell death (P = 0.033) Nucleus (P = 0.024) Centrosome (P = 0.016)
Cell communication (P = 0.043) Nucleic acid binding (P = 0.078) Cellular component (P = 0.017)
Intracellular (P = 0.091) DNA binding TF activitya (P = 0.025)
Transport (P = 0.026)
Ion transport (P = 0.079)
DNA binding (P = 0.091)
a

“DNA binding TF activity” is “sequence-specific DNA binding transcription factor activity.”

b

“Nucleo” is “nucleobase, nucleoside, nucleotide, and nucleic acid metabolic process.

Discussion

We found significant genetic variation for wet weight as well as five nutritional indices (levels of glycogen, free glucose, soluble protein, triglycerides, and free glycerol) in the DGRP after rearing on two different diets that varied in glucose content. Several of these nutritional indices and the principal components describing them jointly are correlated with phenotypes that have been measured by other researchers. Because the complete genomes have been sequenced for all of the lines in the DGRP, we could conduct genome-wide association mapping to identify candidate genes that may influence Drosophila metabolic status in response to diet.

We were able to identify genetic correlations among the traits we measured and between our traits and phenotypes measured by independent groups in other studies. Many of these correlations make good biological sense. For example, starvation stress resistance is positively correlated with wet weight and with stores of glucose and glycogen, consistent with a simple interpretation that genotypes that store more nutrients are more resistant to starvation. The correlations among other phenotypes were less intuitive but may motivate follow-up examination. For example, we found correlations between endoplasmic reticulum stress and several nutritional indices (glycogen, glycerol, triglycerides), suggesting that nutrients play a role in modulating the ER stress response. One concern could be that spurious correlations arise due to variable inbreeding depression among the lines. However, we do not believe this would be a sufficient explanation because at least some of the correlations appear to be negatively correlated with respect to fitness. For example, wet weight was negatively correlated with male aggression (P = 0.044), where we would presume that both greater wet weight and more aggressive males would be more “fit.” However, guessing at the fitness value of nutritional indices is obviously difficult. For example, we simply do not know a priori whether flies with more glycogen stores are inherently more or less fit than flies storing less glycogen, and the answer probably depends on the environmental conditions.

Our genome-wide association mapping implicated many genes as explaining natural variation for nutritional phenotypes, and these can be targeted for more thorough follow-up study. One striking pattern is the over-representation of genes involved in nervous system development and behavior. This may be an artifact of the observation that neurological genes tend to be large and therefore provide a larger target for association studies (Mackay et al. 2012; Chow et al. 2013a). Neurological terms were generally not enriched in our GO analysis that controlled for gene size. A majority of significantly associated SNPs were intronic, suggesting that gene expression variation may play a major role in determining variability in nutritional phenotypes. Generally speaking, the mapping results presented here can provide a starting point for further research on these important traits.

Supplementary Material

Supporting Information
supp_5_3_417__index.html (1.7KB, html)

Acknowledgments

We would like to thank Julien Ayroles and members of the Lazzaro lab for comments and suggestions. This work was supported by NIH grant R01 AI083932 to BPL and NIH grant F32-HD071703 to RLU.

Footnotes

Supporting information is available online at http://www.g3journal.org/lookup/suppl/doi:10.1534/g3.114.016477/-/DC1

Communicating editor: R. J. Kulathinal

Literature Cited

  1. Adams M. D., Celniker S. E., Holt R. A., Evans C. A., Gocayne J. D., et al. , 2000.  The genome sequence of Drosophila melanogaster. Science 287: 2185–2195. [DOI] [PubMed] [Google Scholar]
  2. Ayres J. S., Schneider D. S., 2009.  The role of anorexia in resistance and tolerance to infections in Drosophila. PLoS Biol. 7: e1000150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Ayroles J. F., Carbone M. A., Stone E. A., Jordan K. W., Lyman R. F., et al. , 2009.  Systems genetics of complex traits in Drosophila melanogaster. Nat. Genet. 41: 299–307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Balfanz S., Strünker T., Frings S., Baumann A., 2005.  A family of octopamine receptors that specifically induce cyclic AMP production or Ca2+ release in Drosophila melanogaster. J. Neurochem. 93: 440–451. [DOI] [PubMed] [Google Scholar]
  5. Bennett D., Lyulcheva E., Alphey L., Hawcroft G., 2006.  Towards a comprehensive analysis of the protein phosphatase 1 interactome in Drosophila. J. Mol. Biol. 364: 196–212. [DOI] [PubMed] [Google Scholar]
  6. Boutros M., Kiger A. A., Armknecht S., Kerr K., Hild M., et al. , 2004.  Genome-wide RNAi analysis of growth and viability in Drosophila cells. Science 303: 832–835. [DOI] [PubMed] [Google Scholar]
  7. Chow C. Y., Wolfner M. F., Clark A. G., 2013a Large neurological component to genetic differences underlying biased sperm use in Drosophila. Genetics 193: 177–185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Chow C. Y., Wolfner M. F., Clark A. G., 2013b Using natural variation in Drosophila to discover previously unknown endoplasmic reticulum stress genes. Proc. Natl. Acad. Sci. USA 110: 9013–9018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Day J. P., Dow J. A. T., Houslay M. D., Davies S.-A., 2005.  Cyclic nucleotide phosphodiesterases in Drosophila melanogaster. Biochem. J. 388: 333–342. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Fellous S., Lazzaro B. P., 2010.  Larval food quality affects adult (but not larval) immune gene expression independent of effects on general condition. Mol. Ecol. 19: 1462–1468. [DOI] [PubMed] [Google Scholar]
  11. Fricke C., Bretman A., Chapman T., 2008.  Adult male nutrition and reproductive success in Drosophila melanogaster. Evolution 62: 3170–3177. [DOI] [PubMed] [Google Scholar]
  12. Ghabrial A., Luschnig S., Metzstein M. M., Krasnow M. A., 2003.  Branching morphogenesis of the Drosophila tracheal system. Annu. Rev. Cell Dev. Biol. 19: 623–647. [DOI] [PubMed] [Google Scholar]
  13. Golombieski R. M., Graichen D. A. S., Pivetta L. A., Nogueira C. W., Loreto E. L. S., et al. , 2008.  Diphenyl diselenide [(PhSe)2] inhibits Drosophila melanogaster delta-aminolevulinate dehydratase (delta-ALA-D) gene transcription and enzyme activity. Comparative biochemistry and physiology. Toxicology & pharmacology. CBP 147: 198–204. [DOI] [PubMed] [Google Scholar]
  14. Huang W., Massouras A., Inoue Y., Peiffer J., Ràmia M., et al. , 2014.  Natural variation in genome architecture among 205 Drosophila melanogaster genetic reference panel lines. Genome Res. 24: 1193–1208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Itoh N., Ornitz D. M., 2004.  Evolution of the Fgf and Fgfr gene families. Trends in genetics. TIG 20: 563–569. [DOI] [PubMed] [Google Scholar]
  16. Jordan K. W., Craver K. L., Magwire M. M., Cubilla C. E., Mackay T. F. C., et al. , 2012.  Genome-wide association for sensitivity to chronic oxidative stress in Drosophila melanogaster. PLoS ONE 7: e38722. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Karouzou M. V., Spyropoulos Y., Iconomidou V. A., Cornman R. S., Hamodrakas S. J., et al. , 2007.  Drosophila cuticular proteins with the R&R Consensus: annotation and classification with a new tool for discriminating RR-1 and RR-2 sequences. Insect Biochem. Mol. Biol. 37: 754–760. [DOI] [PubMed] [Google Scholar]
  18. Kofler R., Schlötterer C., 2012.  Gowinda: unbiased analysis of gene set enrichment for genome-wide association studies. Bioinformatics 28: 2084–2085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Layalle, S., E. Coessens, A. Ghysen, and C. Dambly-Chaudière, 2005 Smooth, a hnRNP encoding gene, controls axonal navigation in Drosophila. Genes Cells 10: 119–125. [DOI] [PubMed]
  20. Mackay T. F. C., Richards S., Stone E. A., Barbadilla A., Ayroles J. F., et al. , 2012.  The Drosophila melanogaster Genetic Reference Panel. Nature 482: 173–178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Nagao T., Endo K., Kawauchi H., Walldorf U., Furukubo-Tokunaga K., 2000.  Patterning defects in the primary axonal scaffolds caused by the mutations of the extradenticle and homothorax genes in the embryonic Drosophila brain. Dev. Genes Evol. 210: 289–299. [DOI] [PubMed] [Google Scholar]
  22. Papoulas O., Hays T. S., Sisson J. C., 2005.  The golgin Lava lamp mediates dynein-based Golgi movements during Drosophila cellularization. Nat. Cell Biol. 7: 612–618. [DOI] [PubMed] [Google Scholar]
  23. Pile L. A., Schlag E. M., Wassarman D. A., 2002.  The SIN3/RPD3 deacetylase complex is essential for G(2) phase cell cycle progression and regulation of SMRTER corepressor levels. Mol. Cell. Biol. 22: 4965–4976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Piper M. D. W., Partridge L., 2007.  Dietary restriction in Drosophila: delayed aging or experimental artefact? PLoS Genet. 3: e57. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Piper M. D. W., Skorupa D., Partridge L., 2005.  Diet, metabolism and lifespan in Drosophila. Exp. Gerontol. 40: 857–862. [DOI] [PubMed] [Google Scholar]
  26. Purcell S., Neale B., Todd-Brown K., Thomas L., Ferreira M. A. R., et al. , 2007.  PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81: 559–575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Ridley E. V., Wong A. C.-N., Westmiller S., Douglas A. E., 2012.  Impact of the resident microbiota on the nutritional phenotype of Drosophila melanogaster. PLoS ONE 7: e36765. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. R Core Team, 2012 R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. Available at: http://www.R-project.org/.
  29. R Core Team, 2014 R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. Available at: http://www.R-project.org/.
  30. Skorupa D. A., Dervisefendic A., Zwiener J., Pletcher S. D., 2008.  Dietary composition specifies consumption, obesity, and lifespan in Drosophila melanogaster. Aging Cell 7: 478–490. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Swan L. E., Wichmann C., Prange U., Schmid A., Schmidt M., et al. , 2004.  A glutamate receptor-interacting protein homolog organizes muscle guidance in Drosophila. Genes Dev. 18: 223–237. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Tweedie S., Ashburner M., Falls K., Leyland P., McQuilton P., et al. , 2009.  FlyBase: enhancing Drosophila Gene Ontology annotations. Nucleic Acids Res. 37: D555–D559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Vass E., Nappi A. J., 1998.  The effects of dietary yeast on the cellular immune response of Drosophila melanogaster against the larval parasitoid, Leptopilina boulardi. J. Parasitol. 84: 870–872. [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information
supp_5_3_417__index.html (1.7KB, html)

Articles from G3: Genes|Genomes|Genetics are provided here courtesy of Oxford University Press

RESOURCES