Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Mar 1.
Published in final edited form as: Public Health Nutr. 2014 Mar;17(3):510–518. doi: 10.1017/S1368980012005411

Using exploratory factor analysis of food frequency questionnaires to identify dietary patterns among Yup’ik People

Tove K Ryman 1, Melissa A Austin 1, Scarlett Hopkins 2, Jacques Philip 2, Diane O’Brien 2, Kenneth Thummel 3, Bert B Boyer 2
PMCID: PMC3972766  NIHMSID: NIHMS428214  PMID: 23290469

Abstract

Objective

A food frequency questionnaire (FFQ) developed by the Center for Alaska Native Health Research for studies in Yup’ik people includes market foods and subsistence foods such as moose, seal, waterfowl and salmon that may be related to disease risk. Because the FFQ contains >100 food items, we sought to characterize dietary patterns more simply for use in ongoing pharmacogenomics studies.

Design

Exploratory factor analysis was used to derive a small number of “factors” that explain a substantial amount of the variation in the Yup’ik diet. We estimated factor scores and measured associations with demographic characteristics and biomarkers.

Setting

Southwest Alaska, US

Subjects

358 Yup’ik people, ≥ 18 years

Results

We identified 3 factors that each accounted for ≥10% of the common variance: the first characterized by “processed foods” (e.g., salty snacks, sweetened cereals); the second by “fruits and vegetables” (e.g., fresh citrus, potato salad); and the third by “subsistence foods” (seal or walrus soup, non-oily fish). Participants from coastal communities had higher values for the “subsistence” factor, whereas participants from inland communities had higher values for the “fruits and vegetables” factor. A biomarker of marine intake, δ 15N, was correlated with the “subsistence” factor, whereas a biomarker of corn and sugar-cane based market food intake, δ 13C, was correlated with “processed foods”.

Conclusions

This exploratory factor analysis identified 3 factors that appear to reflect dietary patterns among Yup’ik based on associations with participant characteristics and biomarkers. These factors will be useful for chronic disease studies in this population.

Keywords: exploratory factor analysis, food-frequency questionnaire, FFQ, Yup’ik, Alaska Natives, diet, traditional foods, seasonality

INTRODUCTION

Traditionally, the diet of Yup’ik people living in the Yukon-Kuskokwim Delta of Southwest Alaska has included a variety of culturally important regional subsistence foods such as moose, seal, waterfowl and salmon, many of which are eaten seasonally, as well as foods purchased in the community market (i.e., small grocery store). In this study population, as in other indigenous populations, there is an ongoing nutritional transition from subsistence foods to more market-based foods (15). Bersamin et al, (4) found that an average of 22% of energy intake of Yup’ik people was from traditional foods, with a higher percentage in older people. Increases in consumption of market-based foods among indigenous circumpolar populations may lead to increased prevalence of obesity and a variety of chronic diseases such as coronary heart disease and diabetes (6, 7). Conversely, the traditional marine-based diet of Yup’ik people typically includes high levels of eicosapentaenoic (EPA) and docosahexaenoic (DHA) acid, and may have strong beneficial effects in preventing chronic disease risk by lowering circulating triglycerides and inflammatory markers normally associated with obesity (8, 9).

Food frequency questionnaires (FFQ) are a useful method for collecting food exposure data in large epidemiological studies because they are relatively easy and economical to administer (10), and they can capture with 1 interview foods than may only be eaten seasonally. This is particularly beneficial for the use in this Yup’ik population where it can be difficult to access communities during all seasons of the year. Given that FFQs collect frequency of consumption for a large number of foods in order to characterize usual dietary patterns, methods are needed to reduce the amount of dietary data needed to a few underlying variables that can be used for both association studies between risk factors and health outcomes and to reduce participant burden during data collection. Factor analysis methods have been used to confirm hypothesized dietary factors (11) and to describe dietary patterns (1215). For example Hu et al., identified 2 major dietary patterns using factor analysis of a 131-item FFQ, which were then used for assessing the relative risk of coronary heart disease by differing diets (14).

The objectives of this study were to identify dietary patterns using exploratory factor analysis, and to use these factors to evaluate the differences in dietary patterns by demographic characteristics and dietary biomarkers among this Yup’ik study population. Dietary data were collected in this study population of Yup’ik people for research on dietary and gene by diet interactions, associated with chronic disease. The long term goals of this research include assessing the association of genetic variants in drug-metabolizing enzymes of warfarin with clotting time and how dietary factors like vitamin K intake from leafy green vegetables and DHA/EPA intake from subsistence foods may modify these associations.

METHODS

Study sample

A semi-quantitative FFQ developed by the Center for Alaska Native Health Research was administered as part of a community-based participatory research study in the Yukon-Kuskokwim Delta region of Alaska. Recruitment methods are published elsewhere (16, 17). Briefly, the 435 study participants, representing a convenience sample of Yup’ik people from 9 communities (3 inland [>5 miles from the coast] and 6 coastal) were recruited between September 2009 and September 2011. Sampling was longitudinal such that participants could have completed more than 1 FFQ. In this case, only the most recent FFQ was used for this analysis. Participants could elect to be interviewed in Yup’ik by a native Yup’ik speaker trained to administer the FFQ. Participants were excluded from analysis if they self-reported their ethnicity as non-native Alaskans or “other”, were <18 years of age, or the FFQ was considered by the interviewer to be of “poor” or “unknown” quality.

Demographic data were also collected, including location of community (i.e., coastal or inland), participant age, household size, sex, and cultural identification (as self-reported adherence to “Kass’aq” [white] and Yup’ik lifestyle). Answers to cultural identification questions were consolidated into those participants reporting “not at all” or “some” in one group and “a lot” into a second group. The cultural identification questions were not mutually exclusive, for example a participant could report “a lot” for both the Yup’ik and Kass’aq lifestyle. Age was categorized into “younger” and “older” using the mean age (41.6 years) in the sample as the cut point.

This study was conducted according to the guidelines laid down in the Declaration of Helsinki and all procedures involving human subjects were approved by the University of Alaska Fairbanks Institutional Review Board and the Yukon-Kuskokwim Health Corporation Human Studies Committee. Informed consent was obtained prior to participation in the study.

Food Frequency Data

This FFQ was developed from a list of the most common foods consumed in this population and was derived from approximately 2,000 24-hour food recalls collected in this region. The resulting FFQ was piloted and further reduced in size to decrease participant burden. The FFQ version used for this analysis included 163 specific food questions (e.g., frequency of fresh citrus) grouped into 15 categories (e.g., fruits, vegetables, ocean meats). Frequency of alcohol consumption was not included in the questionnaire as possession of alcohol is illegal in these communities. Participants reported how frequently they typically consumed each food during the previous 12 months and, for traditional subsistence foods, if they ate the food seasonally. Four frequency scales were used in the FFQ for the 163 food items. As described below, we used 2 of these scales that were the most amenable to factor analysis, and that were used for most of the foods (132 of the 163 foods, 81%).

Among the 132 food questions included in our analysis (Supplemental Table 1), frequency of consumption was measured on a 9-point scale which differed for foods and beverages. For both scales the lowest frequency option was “never or less than once per month,” for foods the highest frequency option was “2+ times per day,” and for beverages the highest frequency value was “6+ times per day”. After considering possible methodological approaches to quantify food frequency over the course of a year, we elected to convert the frequency of consumption from the 9-point scales to a continuous scale of annual consumption by multiplying the reported frequency of consumption to a 365 day scale (e.g., if the food was reportedly eaten weekly, then the annual consumption was 52.17) for analysis. If the frequency was reported as a range (e.g., eaten every 2–3 weeks), the mean of the range was used to calculate annual consumption. For the 38 traditional subsistence foods typically eaten seasonally (e.g., wild bird eggs, wild greens, seal and walrus soup [see Supplemental Table 1 for complete list]), it was elicited from the participant if they ate the food seasonally or year round. If the participant reported eating the food seasonally, the annual consumption was calculated as the product of the reported annual consumption and the proportion of the year that food was typically available (determined by local participants and referred to here as the “seasonal weight”). Weights for all seasonal foods are provided in Supplemental Table 1.

For the factor analysis, annual food consumption values were transformed to the natural log scale to improve normality of the frequency distributions. We assessed potential outliers as annual consumption values >2 standard deviations from the mean, but retained all potential outlier values as they were judged realistic frequencies of consumption (BB, SH).

Biomarkers of food intake

We analyzed 2 biomarkers of dietary intake for the Yup’ik study population in relation to the estimated factor scores. Because the nitrogen isotope ratio (δ 15N) is elevated in traditional marine foods, red blood cell (RBC) δ 15N is strongly correlated with traditional marine intake (18) and intake of the n-3 polyunsaturated fatty acids eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) (19). The carbon isotope ratio (δ 13C) is elevated in market foods made from corn or sugar cane (18, 20) and meat from corn-fed animals (18, 21). In this Yup’ik study population, RBC δ 13C is most strongly associated with corn-based market foods (18). We analyzed these biomarkers as a way to assess how well the estimated factors from the FFQ characterize the dietary components measured with biomarkers. RBC samples were prepared and assessed for carbon and nitrogen stable isotope ratios at the Alaska Stable Isotope Facility as previously described (18, 19). By convention and for ease of interpretation, isotope ratios are presented as delta values in “permil” relative to international standards : δX = [(Rsample –Rstandard)/(Rstandard)] · 1000‰, where X is 15N or 13C, R is the ratio of heavy to light isotope, and the standards are atmospheric nitrogen and Vienna Pee Dee Belemnite for carbon. We concurrently prepared and ran multiple laboratory standards to assess analytical accuracy and precision; these were within 0.1 and 0.2‰, respectively, for both isotopes. The range of isotopic variation in our dataset (4.1 for δ 13C, 8.0 for δ 15N) was large relative to analytical precision (0.2‰).

Factor Analysis

As our objective was to estimate underlying dietary patterns among this Yup’ik study population, we used exploratory factor analysis. Factor analysis is a statistical approach that uses many correlated dietary components to derive a smaller number of latent variables or “factors” that explain a substantial amount of the variation in diet. Exploratory factor analysis does not require a priori hypotheses about the data; rather, it is used to understand the underlying structure of a large set of variables. All analyses were conducted in Statistical Analysis System software package version 9.3 (SAS Institute, Cary, NC, USA), with the exploratory factor analysis procedure using the SAS maximum-likelihood parameter estimate and orthogonal varimax rotation. The number of factors retained in the analysis was based on retaining factors that accounted for greater than 10% of the common variance as well as interpretability. Figures were created using R (http://www.r-project.org/) with the default Gaussian kernel for smoothing. To reduce the 132 foods to a number that could be analyzed using factor analysis with our sample size, we developed a 2-stage exploratory factor analysis process to identify relevant factors.

The objective of the first stage of the analysis was to reduce the number of food variables to less than 36, i.e. <10 foods per participant in our sample. We first assessed correlations between foods and conducted exploratory factor analyses within each of the 15 food categories. In each of these categories, we then evaluated the 3 foods that had the highest loading for each factor. One food from among the top 3 loading foods on each factor was selected to be used in the final exploratory factor analysis, by 2 of the investigators (BB, SH). Additionally, although it was not in the top 3 foods for the vegetable factor, the food “wild greens” was selected to be included in the final exploratory factor analysis as it was particularly important to our vitamin K intake research questions. In total, 22 foods were selected for inclusion in our final factor analysis. The 22 food variables selected for the final factor analysis are noted in the last column of Supplemental Table 1.

In the second stage, exploratory factor analysis was conducted using the 22 selected food variables. We qualitatively considered foods with a factor loading coefficient >0.40 to load highly on that factor, and these foods were used to provide the factor with a descriptive name. Using the factor loading coefficients, we calculated estimated factor scores for each factor for all participants by summing the product of the factor loading coefficient and the participants’ reported annual consumption frequency for each food item. For a participant, a higher estimated factor score for a particular factor indicated a higher correlation with foods that had high loading coefficients on that factor. The estimated factor scores were assessed for normality using descriptive statistics and frequency histograms.

To measure associations between the estimated factors and demographic characteristics (i.e., community location, age, gender, cultural identification) for each of the estimated factors, we conducted t tests assuming unequal variance. We also assessed the associations using the non-parametric Wilcoxon-Mann-Whitney test for estimated factor scores which were not normally distributed, and the results were similar to the t test. Additionally, we conducted a multivariate linear regression analysis for each factor including all of the study participant characteristics (community location, age, sex, Kass’aq lifestyle, and Yup’ik lifestyle) to assess independent associations. We measured associations between the dietary biomarkers and the estimated factor scores using Pearson correlations. To assess if the estimated factors were associated with the biomarkers independent of other participant characteristics, we conducted a multivariate linear regression analysis with the participant characteristics (listed above) and the estimated factor score as the independent variables. P values ≤0.05 were considered statistically significant for all analyses.

RESULTS

Sample characteristics

FFQ data were available from a total of 435 participants, of which 68 were excluded based on age, ethnicity and quality of the interview. Of the remaining 367 participants, 358 had complete data for the selected foods used in the final factor analysis (Table 1). From this sample, more participants were from coastal communities (58%) than inland communities (42%). Participants ranged in age from 18 to 91 years, with a mean age of 41.6. A larger proportion (58%) of the sample was female than male. Only 16% of the participants reported that they adhere “a lot” (versus “not at all” or “some”) to a Kass’aq lifestyle. Nearly half of participants reported “not at all/some” adherence to Yup’ik lifestyle with the other half reporting “a lot”, and trends were similar in coastal and inland communities. “A lot” of self-reported adherence to Yup’ik lifestyle was associated with being in the older age group (chi-square=46.4, P <0.001).

TABLE 1.

Characteristics of Yup’ik people with complete food frequency questionnaire data for all items included in factor analysis (September 2009–2011)

All Coastal communities Inland communities
Sample size 358 208 150
Average age in years (range) 41.6 (18–91) 41.9 (18–91) 41.1 (18–79)
Average household size (range) 6.0 (1–21) 6.0 (1–21) 5.9 (1–14)
Sex (% female) 58.4 59.1 57.3
Kass’aq (white) lifestyle (%)*
 Missing 0.8 1.0 0.7
 Not at all/some 83.5 79.3 89.3
 A lot 15.6 19.7 10.0
Yup’ik lifestyle (%)*
 Missing 0.6 1.0 0.0
 Not at all/some 48.3 50.0 46.0
 A lot 51.1 49.0 54.0
*

Self-reported

Exploratory factor analysis

The estimated factor analysis within the 15 food categories resulted in 1 factor per food category for 11 of the categories. For beverages, desserts and snacks, and fruits, 2 factors loaded, and for vegetables 4 factors loaded. Thus, the selection of 1 food from each factor plus the addition of wild greens resulted in 22 foods used in the factor analysis.

The distributions of annual consumption of the 22 foods selected for the final exploratory factor analysis were generally highly right skewed, due to a large number of participants reporting consumption frequencies of “never or less than once per month.” Transformation of values to the natural log scale improved normality considerably, as illustrated by skewness and kurtosis values (Table 2). That is, prior to the log transformation, skewness and kurtosis values were >1 for all foods, compared to after, when the absolute skewness or kurtosis value was ≤1 for 15 and 5 foods respectively. Pearson correlations between the 22 selected food variables were typically positive, with the strongest correlations between salty snacks and sweetened drinks (r=0.43) and salty snacks and pizza (r=0.40) (Supplemental Table 2).

TABLE 2.

Characteristics of the annual food consumption distributions for 22 foods included in the factor analysis, values represent the annual frequency of consumption (n=358)

Foods Untransformed values
Natural log transformed values
Mean Median SD* Skewness Kurtosis Mean Median SD* Skewness Kurtosis
Salty snacks 55.38 30.00 88.85 3.01 12.32 1.63 3.40 3.71 −0.95 −0.77
Sweetened cereals 51.04 12.00 96.40 3.37 15.33 0.40 2.48 4.21 −0.26 −1.75
Pizza 31.22 12.00 72.56 6.04 49.40 0.05 2.48 4.02 −0.22 −1.80
Sweetened drinks 229.75 78.27 389.52 2.43 6.17 2.25 4.36 4.43 −0.76 −1.12
Hot dogs and lunch meat 29.91 12.00 62.12 5.86 50.79 0.40 2.48 3.92 −0.43 −1.67
Fried chicken 22.83 12.00 36.94 3.82 18.94 0.59 2.48 3.72 −0.62 −1.48
Canned tuna 8.69 0.01 28.03 7.79 81.93 −2.53 −4.61 3.43 1.09 −0.74
Fresh citrus 31.84 12.00 64.61 5.61 44.87 0.90 2.48 3.72 −0.72 −1.28
Potato salad 11.40 0.01 33.07 6.26 44.90 −1.75 −4.61 3.70 0.56 −1.60
Citrus juice 67.44 24.00 170.07 3.99 16.46 −0.06 3.18 4.40 0.01 −1.84
Corn 56.14 30.00 89.94 3.67 19.08 2.08 3.40 3.39 −1.29 0.12
Green beans 42.32 12.00 70.31 2.76 8.25 1.08 2.48 3.83 −0.70 −1.27
Green salad 8.88 0.01 27.55 6.17 45.96 −2.44 −4.61 3.46 1.01 −0.89
Pudding and jello 10.82 0.01 23.31 6.58 64.21 −1.30 −4.61 3.76 0.29 −1.86
Dried salmon 116.93 52.18 128.64 1.72 4.30 3.26 3.95 3.13 −1.82 2.09
Wild game soup 89.48 41.09 106.76 2.48 9.88 3.39 3.68 2.36 −2.13 4.84
Pancakes 59.64 30.00 107.94 4.03 19.90 2.11 3.40 3.36 −1.31 0.23
Market berries in akutaq 19.44 0.01 54.45 7.95 87.67 −1.01 −4.61 3.94 0.23 −1.84
Seal or walrus soup 23.86 12.00 40.49 3.49 15.06 0.60 2.48 3.66 −0.62 −1.40
Non-oily fish (not dried) 20.63 3.96 42.45 4.38 24.88 −0.32 1.38 3.87 −0.12 −1.83
Wild greens 10.03 0.01 30.64 7.48 71.68 −1.71 −4.61 3.54 0.51 −1.55
Bird soup 40.96 30.00 51.53 2.81 9.93 2.47 3.40 2.70 −1.94 2.65
*

Standard deviation

seasonal food items

The final exploratory factor analysis of the 22 selected foods resulted in 3 factors (Table 3). The 3 factors had eigenvalues >1.0 and each accounted for at least 10% of the common variance, whereas the 19 factors not retained had eigenvalues <1.0. The first factor, with an eigenvalue of 5.93, and high loading coefficients (>0.40) for salty snacks, sweetened cereals, pizza, sweetened drinks and hot dogs and lunch meat, was labeled “processed foods”. The second factor, with an eigenvalue of 3.15 had high loading coefficients for fresh citrus, potato salad, citrus juice, corn, green beans and green salad, was labeled “fruits and vegetables”. Finally, the third factor, with an eigenvalue of 1.06 and high loading factors for seasonal foods, seal or walrus soup and non-oily fish (not dried), was labeled “subsistence foods”. Estimated factor weights were approximately normally distributed for “processed foods” and “fruits and vegetables”, while “subsistence foods” had a bimodal distribution.

TABLE 3.

Standardized factor loadings obtained from exploratory factor analysis, using varimax rotation, of annual consumption of food frequency questionnaire items (n=358)*

Foods Processed foods (Factor 1) Fruits & vegetables (Factor 2) Subsistence foods (Factor 3)
Salty snacks 0.60 0.25 −0.03
Sweetened cereals 0.55 0.06 −0.02
Pizza 0.53 0.21 −0.09
Sweetened drinks 0.52 0.13 0.04
Hot dogs and lunch meat 0.46 0.34 0.11
Fried chicken 0.36 0.29 0.07
Canned tuna 0.35 0.24 0.08
Fresh citrus 0.26 0.54 0.15
Potato salad 0.25 0.47 −0.01
Citrus juice 0.31 0.47 0.12
Corn 0.37 0.46 −0.04
Green beans 0.20 0.45 0.12
Green salad 0.22 0.45 0.04
Pudding and jello 0.25 0.35 0.07
Dried salmon 0.11 0.29 0.06
Wild game soup 0.17 0.23 0.15
Pancakes −0.01 0.21 0.10
Market berries in akutaq 0.17 0.20 0.03
Seal or walrus soup 0.24 −0.13 0.82
Non-oily fish (not dried) −0.06 0.09 0.47
Wild greens −0.08 0.32 0.39
Bird soup 0.00 0.16 0.35
*

gray highlighted cell indicate factor loadings >0.40 for interpretation

seasonal food items

Estimated factor score associations

Participants residing in inland communities had greater mean estimated scores for “fruits and vegetables” (Factor 2) (P < 0.001), indicating higher frequency of consumption of foods with greater loading coefficients on these factors (Table 4 and Figure 1A). In contrast, participants residing in coastal communities had a significantly greater mean score for “subsistence foods” (Factor 3) (P <0.001) (Figure 1C). Younger participants (age < 41.6 years) had significantly greater estimated mean factor scores for “processed foods” (Factor 1) (P <0.001) and significantly lower mean factor score for “subsistence foods” (Factor 3) (P = 0.035) compared to older participants (age ≥ 41.6 years) (Table 4). There were no significant differences in mean factor scores between males and females for any of the factors. Reporting “a lot” for Kass’aq lifestyle was significantly associated with a greater estimated mean factor score for “processed foods” (Factor 1) (P = 0.001). Conversely, participants reporting “a lot” for Yup’ik lifestyle had a significantly lower estimated mean factor score for “processed foods” (Factor 1) (P <0.001) and a significantly greater estimated mean factor score for “subsistence foods” (Factor 3) (P = 0.025) (Table 4). All associations remained significant in multivariate models except the association between age and the “subsistence foods” factor (P = 0.374). Completion of high school was not associated with any of the 3 factors (data not presented).

TABLE 4.

Means of natural log transformed factor scores for each factor by characteristics of study participants (n=358)

N Processed foods (Factor 1)
Fruits & vegetables (Factor 2)
Subsistence foods (Factor 3)
Mean P* P Mean P* P Mean P* P
Community location
 Coastal 208 −0.064 0.081 0.032 −0.227 <0.001 <0.001 0.245 <0.001 <0.001
 Inland 150 0.089 0.315 −0.340
Age
 Younger (<41.6 years) 186 0.412 <0.001 <0.001 −0.007 0.870 0.897 −0.089 0.044 0.374
 Older (≥41.6 years) 172 −0.445 0.007 0.097
Sex
 Female 209 −0.052 0.157 0.727 0.018 0.632 0.495 −0.033 0.400 0.286
 Male 149 0.074 −0.025 0.046
Kass’aq (white) lifestyle
 Not at all/some 299 −0.064 0.001 <0.001 −0.011 0.488 0.059 0.009 0.702 0.507
 A lot 56 0.324 0.083 −0.037
Yup’ik lifestyle
 Not at all/some 173 0.301 <0.001 <0.001 −0.042 0.326 0.419 −0.103 0.025 0.042
 A lot 183 −0.286 0.044 0.104
*

Unadjusted t test assuming unequal variance, for factor 3 the Wilcoxon-Mann-Whitney test p-values were similar t test p-values

Adjusted for the other study participant characteristics using a mulitvariate linear regression model, n=354 due to missing values

Sample sizes vary due to missing values

FIGURE 1.

FIGURE 1

Distributions of estimated factor scores for each factor by coastal and inland community with Gaussian kernel density line, x-axis refers to the natural log transformed estimated factor scores (n=358)

RBC δ 15N was negatively correlated with the “processed foods” (Factor 1) (r= −0.492, P <0.001) and “fruits and vegetables” (Factor 2) (r= −0.144, P = 0.007) factors and positively correlated with the “subsistence foods” factor (Factor 3) (r=0.252, P <0.001) (Table 5). In comparison, RBC δ 13C was positively correlated with “processed foods” (Factor 1) (r=0.364, P <0.001). δ 13C was not associated with either “fruits and vegetables” (Factor 2) or “subsistence foods” (Factor 3). Adjusting δ 13C for δ15N did not improve the association between δ 13C and the “processed foods” factor (Factor 1). With the exception of the association between RBC δ 15N and “fruits and vegetables” (Factor 2) (P = 0.258), all significant associations remained significant in the multivariate analysis assessing the associations adjusted for demographic characteristics.

TABLE 5.

Pearson correlations between red blood cell (RBC) δ15N and δ13C and the estimated factor scores (n=356) *

Processed Foods (Factor 1)
Fruits & vegetables (Factor 2)
Subsistence Foods (Factor 3)
r P P r P P r P P
RBC δ15N −0.492 <0.001 <0.001 −0.144 0.007 0.258 0.252 <0.001 0.027
RBC δ13C 0.364 <0.001 <0.001 0.000 0.996 0.211 0.058 0.276 0.173
*

Biomarker data missing on 2 participants

Adjusted for the other study participant characteristics (community location, age, sex, Kass’aq lifestyle and Yup’ik lifestyle) using a mulitvariate linear regression model, n=352 due to missing values

DISCUSSION

In this exploratory factor analysis we identified 3 factors that characterized dietary patterns among Yup’ik people: “processed foods”, “fruits and vegetables”, and “subsistence foods”. These 3 factors were easily interpretable and the associations were consistent with other studies of Native Alaskan diet (1, 4, 18, 22). Moreover, the factors were associated with participant characteristics and biomarkers, as hypothesized based on other studies. Specifically, we observed differences in factor scores between community locations, participant ages, self-reported cultural identification and δ15N and δ13C biomarkers.

Greater mean factor scores for “processed foods” were found in younger participants, those with self-reported greater adherence to Kass’aq lifestyle and those reporting lower adherence to a Yup’ik lifestyle in the bivariate analysis and additionally in inland communities in the multivariate analysis. A previously described marker of corn and sugar cane based market foods, δ13C (18), was positively correlated with this factor, and δ15N, a marker of traditional marine foods (18) had a strong negative correlation with this factor. Because these biomarkers were previously validated for this Yup’ik study population using 24-hour food recalls and 3-day food records (18), the correlations between the biomarkers and the identified dietary pattern strengthens our confidence that the factor analysis approach is accurately characterizing the dietary patterns in this population.

The “fruit and vegetable” factor was significantly associated only with residing in an inland community. This could partially be explained by the easier and more regular access to a regional hub of approximately 4,000 people that has grocery stores that closely resemble those found in a large city (e.g., having a full fresh produce and dairy section). Some of the foods included in the “fruits and vegetables” factor are fresh (e.g., fresh citrus and green salad) and thus would not be easily available in more remote communities. As expected, this factor was not highly correlated with either δ15N or δ13C.

The mean “subsistence foods” factor score was significantly higher in coastal communities, in older participants and among those participants with greater adherence to a Yup’ik lifestyle. Other studies of diet in indigenous populations have reported similar trends of older participants consuming more subsistence foods (1, 4, 22). However, older age was no longer significantly associated with the “subsistence foods” factor after adjusting for other characteristics in the multivariate analysis, potentially due to the strong association between being older and adhering to a Yup’ik lifestyle. A biomarker of traditional marine intake (δ 15N) was correlated with this factor, as expected from previous studies (18). Taken together, these 3 factors will be useful for characterizing Yup’ik diet for future studies in this population.

Our findings have potential implications for research on individualized medicine, where diet may be an important modulator of drug disposition or response. For example, we are currently investigating the impact of diet on anticoagulation response during warfarin therapy among Alaskan Native people. Both vitamin K and polyunsaturated fatty acid intake in the diet may affect the function of the vitamin K cycle and warfarin response (23, 24). If quantitative information about overall dietary intake of DHA, EPA and vitamin K could be captured with a simplified FFQ and reliable factors representing dietary patterns identified, this could greatly enhance our ability to conduct a large-scale warfarin pharmacogenetic study in the future that includes an examination of gene-diet interactions. Indeed, data are already available supporting the use of FFQ for assessing vitamin K status in population-based studies (25). A similar approach based on the present work supports use of a FFQ for evaluating DHA/EPA status as well.

This study population consumes many foods seasonally, particularly the foods included in the “subsistence factor”. The FFQ was administered during different seasons, and although the questionnaire asked about diet over the previous year, there may have been recall bias for seasonal foods not eaten during the season when the questionnaire was administered (26,27). Moreover, for the foods eaten seasonally, it was challenging to identify the most appropriate way to account for this variability in diet for the analysis. We elected to use annual frequency weighted based on seasonal availability. However, this aggregation across seasons could modify some of the findings. For example, Receveur et al. found differences in dietary intake between sexes and age groups were more pronounced during seasons of peak traditional food consumption in 16 indigenous communities in the Canadian Northwest Territories (22). We might anticipate similar findings, particularly among the older participants that have greater mean levels of seasonal foods included in the “subsistence” factor.

In this study population there may also be seasonality in consumption of non-subsistence foods due to other geographical and economic influences. In inland communities, travel to regional hubs can be restricted due to weather and ice conditions affecting access to market-based foods. In all communities, market-based food consumption (i.e., those foods in the “processed foods” and “fruits and vegetables” factors) is also associated with income and there is seasonality to the availability of income (e.g., Alaskan Permanent Fund payments) and support for purchasing of foods (e.g., food stamps, WIC coupons). These variations may result in seasonal fluctuation in consumption of fruits and vegetables, and to a lesser degree processed foods, and may have important implications for vitamin K levels and clotting time, as well as other diet and chronic disease associations. Additional work is needed to better capture seasonal variation in diet.

In addition to the challenge of accurately measuring seasonal variation in diet, this study was subject to other limitations related to data collection and analysis. A convenience sampling method was used, possibly limiting the generalizability. Additionally, participants from the same household were not excluded, and thus diet data from all participants may not be independent. The questionnaire collected the frequency that foods were eaten, not actual consumption. For participants that ate a particular food very frequently, the FFQ was not designed to capture a frequency greater than 2 or more times per day, and for beverages 6 or more times per day. Because the “subsistence foods” factor had a bimodal distribution, we conducted non-parametric association tests for this factor, and the results were found to be comparable using parametric methods. The bimodal distribution is the result of a high proportion of participants that reported eating the subsistence foods “never or less than once per month,” possibly reflecting a dietary transition away from subsistence foods to more Western foods. However, given the importance of describing this factor, we elected to retain foods even if they were not frequently eaten if they were in one of the top 3 loading factors for the food category. This FFQ has not undergone a rigorous validation exercise. However, the correlations with biomarkers that have been validated in this population (18) strengthen our confidence in the validity, as the biomarkers have been rigorously evaluated and compared to actual fatty acid composition in RBCs (19). Although the specific dietary patterns identified in this analysis are likely unique to this Yup’ik population, this methodology used could be generalized to other difficult to access or minority populations.

Based on these findings, the next steps in this research will be to collect additional FFQ data from a new set of participants and conduct a confirmatory factor analysis. If these results are found to be consistent, factor analysis will be used to reduce the number of FFQ questions asked, decreasing the expense of the interview and the time burden on participants, and finally the conducting of a rigorous validation study. In this analysis the identified dietary factors were not adjusted for energy intake as our interest was in dietary patterns, not specific nutrient intake. However, for future analyses it may become important to adjust for energy intake.

Conclusions

We found that exploratory factor analysis identified 3 dietary factors, “processed foods”, “fruits and vegetables”, and “subsistence foods”, which based on the observed associations with participant characteristics and biomarkers, appear to be accurate reflections of dietary pattern among Yup’ik people. The factors identified through these analyses can be used on an ongoing basis to better understand the role of dietary factors in health disparities associated with chronic disease health outcomes in this study population and facilitate future pharmacogenetic studies.

Supplementary Material

Acknowledgments

MA, SH, DO, BB and KT designed research; BB, SH and DO conducted research; TR, JP and MA analyzed data; TR and MA wrote the paper; TR had primary responsibility for final content. All authors read and approved the final manuscript. The study protocols were reviewed and approved by the University of Alaska Institutional Review Board, the National and Alaska Area Indian Health Service Institutional Review Boards, and the Yukon-Kuskokwim Health Corporation Human Studies Committee. We gratefully acknowledge the Center for Alaska Native Health Research field research team, including Eliza Orr and Jynene Black, as well as Tim Howe and Matthew Wooller at the Alaska Stable Isotope Facility for assistance with the stable isotope analyses. This manuscript was improved by comments from Barbara Kavanaugh, and Nick Au. The authors also thank the community field research assistants who helped with the development of the FFQ, study recruitment, and data collection. Finally, the Center for Alaska Native Health Research team would like to express our sincere appreciation to all of our study participants and their communities for welcoming us and teaching us so much about the Yup’ik way of life. Quyana!

Funding: This work was supported by U01GM092676 (PIs: Thummel and Burke), R01DK074842 and P20RR016430 (PI: Boyer)

Footnotes

Reprints: Reprints not available.

Conflicts of Interest: TR, MA, SH, JP, DO, KT and BB report no personal or financial conflict of interest.

References

  • 1.Bersamin A, Luick BR, Ruppert E, et al. Diet Quality among Yup’ik Eskimos Living in Rural Communities Is Low: The Center for Alaska Native Health Research Pilot Study. J Am Diet Assoc. 2006;106:1055–63. doi: 10.1016/j.jada.2006.04.026. [DOI] [PubMed] [Google Scholar]
  • 2.Kuhnlein HV, Receveur O, Soueida R, et al. Arctic Indigenous Peoples Experience the Nutrition Transition with Changing Dietary Patterns and Obesity. J Nutr. 2004;134:1447–1453. doi: 10.1093/jn/134.6.1447. [DOI] [PubMed] [Google Scholar]
  • 3.Bersamin A, Luick BR, King IB, et al. Westernizing Diets Influence Fat Intake, Red Blood Cell Fatty Acid Composition, and Health in Remote Alaskan Native Communities in the Center for Alaska Native Health Study. J Am Diet Assoc. 2008;108:266–73. doi: 10.1016/j.jada.2007.10.046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Bersamin A, Zidenberg-Cherr S, Stern JS, et al. Nutrient intakes are associated with adherence to a traditional diet among Yup’ik Eskimos living in remote Alaska Native communities: the CANHR Study. Int J Circumpolar Health. 2007;66:62–70. doi: 10.3402/ijch.v66i1.18228. [DOI] [PubMed] [Google Scholar]
  • 5.Bjerregaard P, Young TK, Dewailly E, et al. Indigenous health in the Arctic: an overview of the circumpolar Inuit population. Scand J Public Health. 2004;32:390–5. doi: 10.1080/14034940410028398. [DOI] [PubMed] [Google Scholar]
  • 6.McLaughlin JB, Middaugh JP, Utermohle CJ, et al. Changing Patterns of Risk Factors and Mortality for Coronary Heart Disease Among Alaska Natives, 1979–2002. JAMA. 2004;291:2545–2546. doi: 10.1001/jama.291.21.2545. [DOI] [PubMed] [Google Scholar]
  • 7.Haman F, Fontaine-Bisson B, Batal M, et al. Obesity and type 2 diabetes in Northern Canada’s remote First Nations communities: the dietary dilemma. Int J Obes (Lond) 2010;34(Suppl 2):S24–31. doi: 10.1038/ijo.2010.236. [DOI] [PubMed] [Google Scholar]
  • 8.Makhoul Z, Kristal AR, Gulati R, et al. Associations of obesity with triglycerides and C-reactive protein are attenuated in adults with high red blood cell eicosapentaenoic and docosahexaenoic acids. Eur J Clin Nutr. 2011;65:808–17. doi: 10.1038/ejcn.2011.39. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Makhoul Z, Kristal AR, Gulati R, et al. Associations of very high intakes of eicosapentaenoic and docosahexaenoic acids with biomarkers of chronic disease risk among Yup’ik Eskimos. Am J Clin Nutr. 2010;91:777–85. doi: 10.3945/ajcn.2009.28820. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Willett W. Nutritional Epidemiology. New York: Oxford University Press; 1990. [Google Scholar]
  • 11.Pierce BL, Austin MA, Crane PK, et al. Measuring dietary acculturation in Japanese Americans with the use of confirmatory factor analysis of food-frequency data. Am J Clin Nutr. 2007;86:496–503. doi: 10.1093/ajcn/86.2.496. [DOI] [PubMed] [Google Scholar]
  • 12.van Dam RM, Rimm EB, Willett WC, et al. Dietary Patterns and Risk for Type 2 Diabetes Mellitus in U.S. Men. Ann Intern Med. 2002;136:201–209. doi: 10.7326/0003-4819-136-3-200202050-00008. [DOI] [PubMed] [Google Scholar]
  • 13.Hu FB, Rimm E, Smith-Warner SA, et al. Reproducibility and validity of dietary patterns assessed with a food-frequency questionnaire. Am J Clin Nutr. 1999;69:243–249. doi: 10.1093/ajcn/69.2.243. [DOI] [PubMed] [Google Scholar]
  • 14.Hu FB, Rimm EB, Stampfer MJ, et al. Prospective study of major dietary patterns and risk of coronary heart disease in men. Am J Clin Nutr. 2000;72:912–921. doi: 10.1093/ajcn/72.4.912. [DOI] [PubMed] [Google Scholar]
  • 15.Khani BR, Ye W, Terry P, et al. Reproducibility and Validity of Major Dietary Patterns among Swedish Women Assessed with a Food-Frequency Questionnaire. J Nutr. 2004;134:1541–5. doi: 10.1093/jn/134.6.1541. [DOI] [PubMed] [Google Scholar]
  • 16.Mohatt GV, Plaetke R, Klejka J, et al. The Center for Alaska Native Health Research Study: a community-based participatory research study of obesity and chronic disease-related protective and risk factors. Int J Circumpolar Health. 2007;66:8–18. doi: 10.3402/ijch.v66i1.18219. [DOI] [PubMed] [Google Scholar]
  • 17.Boyer BB, Mohatt GV, Lardon C, et al. Building a community-based participatory research center to investigate obesity and diabetes in Alaska Natives. Int J Circumpolar Health. 2005;64:281–90. doi: 10.3402/ijch.v64i3.18002. [DOI] [PubMed] [Google Scholar]
  • 18.Nash SH, Bersamin A, Kristal AR, et al. Stable nitrogen and carbon isotope ratios indicate traditional and market food intake in an indigenous circumpolar population. J Nutr. 2012;142:84–90. doi: 10.3945/jn.111.147595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.O’Brien DM, Kristal AR, Jeannet MA, et al. Red blood cell delta15N: a novel biomarker of dietary eicosapentaenoic acid and docosahexaenoic acid intake. Am J Clin Nutr. 2009;89:913–9. doi: 10.3945/ajcn.2008.27054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Jahren AH, Saudek C, Yeung EH, et al. An isotopic method for quantifying sweeteners derived from corn and sugar cane. Am J Clin Nutr. 2006;84:1380–4. doi: 10.1093/ajcn/84.6.1380. [DOI] [PubMed] [Google Scholar]
  • 21.Jahren AH, Kraft RA. Carbon and nitrogen stable isotopes in fast food: Signatures of corn and confinement. PNAS. 2008;105:17855–60. doi: 10.1073/pnas.0809870105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Receveur O, Boulay M, Kuhnlein HV. Decreasing Traditional Food Use Affects Diet Quality for Adult Dene/Métis in 16 Communities of the Canadian Northwest Territories. J Nutr. 1997;127:2179–86. doi: 10.1093/jn/127.11.2179. [DOI] [PubMed] [Google Scholar]
  • 23.Leray C, Wiesel M-L, Freund M, et al. Long-Chain N-3 Fatty Acids Specifically Affect Rat Coagulation Factors Dependent on Vitamin K: Relation to Peroxidative Stress. Arterioscler Thromb Vasc Biol. 2001;21:459–65. doi: 10.1161/01.atv.21.3.459. [DOI] [PubMed] [Google Scholar]
  • 24.Kim KH, Choi WS, Lee JH, et al. Relationship between dietary vitamin K intake and the stability of anticoagulation effect in patients taking long-term warfarin. Thromb Haemost. 2010;104:755–9. doi: 10.1160/TH10-04-0257. [DOI] [PubMed] [Google Scholar]
  • 25.McKeown NM, Jacques PF, Gundberg CM, et al. Dietary and Nondietary Determinants of Vitamin K Biochemical Measures in Men and Women. J Nutr. 2002;132:1329–34. doi: 10.1093/jn/132.6.1329. [DOI] [PubMed] [Google Scholar]
  • 26.Fowke JH, Schlundt D, Gong Y, et al. Impact of season of food frequency questionnaire administration on dietary reporting. Ann Epidemiol. 2004;14:778–85. doi: 10.1016/j.annepidem.2004.02.002. [DOI] [PubMed] [Google Scholar]
  • 27.Subar AF, Frey CM, Harlan LC, et al. Differences in reported food frequency by season of questionnaire administration: the 1987 National Health Interview Survey. Epidemiology. 1994;5:226–33. doi: 10.1097/00001648-199403000-00013. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

RESOURCES