Abstract
The Food Compass is a nutrient profiling system (NPS) to characterize the healthfulness of diverse foods, beverages and meals. In a nationally representative cohort of 47,999 U.S. adults, we validated a person’s individual Food Compass Score (i.FCS), ranging from 1 (least healthful) to 100 (most healthful) based on cumulative scores of items consumed, against: (a) the Healthy Eating Index (HEI) 2015; (b) clinical risk factors and health conditions; and (c) all-cause mortality. Nationally, the mean (SD) of i.FCS was 35.5 (10.9). i.FCS correlated highly with HEI-2015 (R = 0.81). After multivariable-adjustment, each one SD (10.9 point) higher i.FCS associated with more favorable BMI (−0.60 kg/m2 [−0.70,−0.51]), systolic blood pressure (−0.69 mmHg [−0.91,−0.48]), diastolic blood pressure (−0.49 mmHg [−0.66,−0.32]), LDL-C (−2.01 mg/dl [−2.63,−1.40]), HDL-C (1.65 mg/d [1.44,1.85]), HbA1c (−0.02% [−0.03,−0.01]), and fasting plasma glucose (−0.44 mg/dL [−0.74,−0.15]); lower prevalence of metabolic syndrome (OR = 0.85 [0.82,0.88]), CVD (0.92 [0.88,0.96]), cancer (0.95 [0.91,0.99]), and lung disease (0.92 [0.88,0.96]); and higher prevalence of optimal cardiometabolic health (1.24 [1.16,1.32]). i.FCS also associated with lower all-cause mortality (HR = 0.93 [0.89,0.96]). Findings were similar by age, sex, race/ethnicity, education, income, and BMI. These findings support validity of Food Compass as a tool to guide public health and private sector strategies to identify and encourage healthier eating.
Subject terms: Risk factors, Nutrition, Public health
The Food Compass is a nutrient profiling system to characterize the healthfulness of diverse foods, beverages and meals. In a nationally representative cohort of 47,999 U.S. adults, the authors validate a person’s individual Food Compass Score against health outcomes.
Introduction
Uncertainty and inconsistency around selecting healthier food products contribute to consumer confusion, inconsistent and suboptimal government food policy, varying industry goals regarding reformulation, and lack of nutrition-based impact investing1–4. To address these challenges, nutrient profiling systems (NPS) aim to provide a scientifically grounded means for evaluating the overall healthfulness of food and beverage products. Momentum around NPS is growing globally, including for front-of-package (FOP) and product warning labeling, school meal procurement, taxation and subsidy programs, food reformulation by industry, company rankings, and more. The European Union, Australia, New Zealand, and other governments are promoting FOP labeling requirements5,6; retailers are using NPS to guide consumer choices (e.g., Hannaford’s Guiding Stars, Walmart’s Great For You, Kroger’s Opt Up)7–9; and industry is voluntarily self-labeling (i.e., Danone adopted NutriScore for FOP)10. In addition, organizations like Access to Nutrition Index (ATNI) are using NPS to rate and rank food manufacturers and retailers globally11; while investors are pledging to hold their food sector investees accountable to using an NPS to report on healthfulness of product sales and product portfolios12. Clearly, the availability of accurate, objective, and validated NPS is of major importance for advancing nutrition and health.
Current major NPS have important limitations, including evaluating a limited range of nutritional attributes; focusing on certain attributes (e.g., total fat) inconsistent with updated scientific evidence; omitting other attributes of importance to health (e.g., diverse food ingredients, bioactive phytochemicals, processing characteristics)1,13,14; and scoring products by weight, volume, or serving size, creating inconsistencies depending on water and fat content1. For most current NPS, these limitations also result in untenable scoring of certain major food groups, requiring differing, tailored scoring principles and algorithms for different food and beverage categories, which increases subjectivity as well as contradictions for scoring mixed dishes and meals.
The Food Compass is a novel NPS that incorporates a range of 54 potentially protective and harmful nutrients, ingredients, bioactives, additives, and processing attributes, grouped across 9 domains, and selected and weighted based on the latest evidence about their relative healthfulness15. The Food Compass Score (FCS), ranging from 1 (least healthful) to 100 (most healthful), enables a more consistent and universal algorithm across all food and beverage categories; permits similar scoring for mixed dishes and meals; and improves convergent and discriminatory validity of product scoring compared to other major NPS15. However, the validity of extending FCSs of individual food product to the daily dietary patterns of individuals, and the corresponding validity for the association between a person’s FCS with a validated healthy dietary pattern and major health outcomes, have not been established. Such research is critical to substantiate the utility of Food Compass as a tool for identifying healthful foods for consumers, food businesses, investors, and policy stakeholders.
In this work, we found that a person’s FCS, based on the individual food and beverages they consume, was favorably associated with three validation endpoints: (a) the Healthy Eating Index (HEI) 2015, an established measure of a healthy dietary pattern; (b) the prevalence of major clinical risk factors and health conditions; and (c) the prospective risk of all-cause mortality. We also explored how each of the 9 scoring domains of the Food Compass separately related to these outcomes.
Results
Participant characteristics
We evaluated a nationally representative sample of 47,999 adults aged 20–85 years enrolled in the 10 most recent cycles of NHANES, 1999–2000 through 2017–2018 (Fig. S1). The mean (SD) age was 47.2 y (17.1), 52.2% were female, and 27.8% had a college degree or more (Table 1). Mean (SD) BMI was overweight (28.8 kg/m2 [6.8]); blood glucose levels, HbA1c: 5.6% (0.9); fasting plasma glucose: 105.8 mg/dL (30.8); total: HDL-C ratio, high (4.0 [1.5]), and blood pressure levels, systolic: 122.8 mmHg (18.0); diastolic: 70.9 mmHg (12.5). About 42.0% of U.S. adults had metabolic syndrome; 12.9%, diabetes; 7.7%, clinical CVD; 18.9%, lung disease; and 9.8%, cancer. Only 7.4% had optimal cardiometabolic health.
Table 1.
Baseline characteristics of the national sample of 47,999 U.S. adults in this investigation, 1999–2018a
| Sociodemographic characteristics | n = 47,999 |
|---|---|
| Age group, years | |
| 20–34 | 12479 (27.9) |
| 35–49 | 11957 (28.2) |
| 50–64 | 11708 (25.5) |
| 65+ | 11855 (18.4) |
| Age, years | 47.2 (17.1) |
| Sex | |
| Male | 23029 (47.8) |
| Female | 24970 (52.2) |
| Race/ethnicity | |
| Mexican-American | 8419 (8.2) |
| Other Hispanic | 3908 (5.3) |
| Non-Hispanic White | 21590 (68.5) |
| Non-Hispanic Black | 9962 (11.1) |
| Asian/Other | 4120 (6.8) |
| Education Level | |
| <HS graduate | 12769 (17.0) |
| HS graduate | 11138 (24.1) |
| Some college or AA school | 13657 (31.2) |
| ≥College graduate | 10435 (27.8) |
| Ratio of Family Income to Povertyb | |
| <1.3 | 14734 (22.0) |
| 1.3–2.9 | 15384 (28.8) |
| 3+ | 17881 (49.2) |
| Lifestyle and metabolic health characteristics | |
| Alcohol use, percent total energyc | 2.7 (6.1) |
| Total physical activity, MET-hours/weekd | 63.0 (103.7) |
| Smoking status, n (%) | |
| Non-smoker | 26067 (53.6) |
| Former smoker | 12039 (25.0) |
| Current smoker | 9893 (21.4) |
| Waist circumference, cm | 98.5 (16.4) |
| BMI, kg/m2 | 28.8 (6.8) |
| HbA1c, % | 5.6 (0.9) |
| Fasting plasma glucose, mg/dL | 105.8 (30.8) |
| Total cholesterol, mg/dL | 196.7 (42.2) |
| HDL-cholesterol, mg/dL | 53.2 (16.3) |
| LDL-cholesterol, mg/dL | 119.0 (43.6) |
| Triglycerides, mg/dL | 122.7 (110.5) |
| Total cholesterol: HDL ratio | 4.0 (1.5) |
| Systolic blood pressure, mm Hg | 122.8 (18.0) |
| Diastolic blood pressure, mm Hg | 70.9 (12.5) |
| i.FCSc | 35.5 (10.9) |
| HEI 2015c | 57.3 (8.5) |
| Total energy intake, kcal/dayc | 2090 (803) |
| Energy reporting, TEI:BMRe | |
| Underreport | 12062 (23.4) |
| Acceptable | 34623 (75.4) |
| Overreport | 574 (1.1) |
| Presence of health conditions and medication usage | |
| Metabolic syndromef | 21935 (42.0) |
| Diabetesg | 8126 (12.9) |
| Cardiovascular diseaseh | 4713 (7.7) |
| Lung disease | 8709 (18.9) |
| Cancer | 4514 (9.8) |
| Optimal cardiometabolic healthi | 2923 (7.4) |
| Diabetes medication | 5229 (8.0) |
| Cholesterol lowering medication | 8618 (16.3) |
| Hypertension medication | 14836 (26.3) |
| Angina medication | 645 (1.0) |
AA Associates Degree, BMI body mass index, BMR basal metabolic rate, CI confidence interval, CVD cardiovascular disease, HbA1c hemoglobin A1c, HDL high-density lipoprotein, HEI Healthy Eating Index, HS high school, i.FCS individual, dietary Food Compass Score, LDL low-density lipoprotein, MET metabolic equivalent of task, SD standard deviation, TC:HDL total cholesterol to HDL-cholesterol ratio, TEI total energy intake, WC waist circumference.
aValues are mean (SD) for continuous variables and N (%) for categorical variables. NHANES dietary recall sample weights were incorporated in all analyses to account for the complex survey design and response rates, providing nationally representative estimates for the non-institutionalized U.S. adult population. Any missing values were imputed using predictive mean matching, incorporating all available demographics, risk factors, and conditions data in the dataset. 81 fewer subjects were included in the prospective mortality analyses due to lack of linkage to the National Death Index.
bRepresents the ratio of family income to the federal poverty threshold, adjusted for household size. A higher ratio indicates a higher level of income.
cFor all dietary factors (i.e., percentage of total energy from alcohol (%); i.FCS; HEI 2015; and average total energy intake (in kcal/day)): the value was calculated based on the total number of recall days available for each respondent. For example, when two days of data were available, an arithmetic average of that factor was calculated.
dMethodological differences in how physical activity was assessed in NHANES from 1999 to 2006 and from 2007 to 2018 preclude the use of a consistent variable for total physical activity across survey cycles. As such, we used a total physical activity variable (MET-hours/week) evaluated from 2007 to 2018, a leisure time physical activity (MET-hrs/week) variable for all survey cycles, and demographic and health data to impute total physical activity (MET-hrs/week) for 1999–2006 survey cycles using predictive mean matching. See Supplementary Materials (Supplementary Note 4, Table S13–15) for further methodological details.
eTEI:BMR, the ratio of average total energy intake (TEI, in kcal/day) to basal metabolic rate (BMR, kcal/day), was used to determine the plausibility of total energy reporting based on Goldberg cut-offs. BMR was estimated via the Schofield sex- and age-specific predictive equations based on weight and height. TEI was calculated based on the total number of recall days available for each respondent. For example, when two days of data were available, an arithmetic average TEI was calculated. Under-reporters, acceptable reporters and over-reporters were defined as having TEI:BMR < 0·96, 0·96–2·49 and >2·49 for 2-day dietary recall data, and < 0·87, 0·87–2·75 and >2·75 for 1-day dietary recall data, respectively, as previously derived in NHANES57. Over-reporters were excluded from sub-group analyses given sample size and number of events limitations.
fDefined according to AHA/NHLBI as the presence of three or more of: HDL-C < 40 mg/dL (men)/<50 mg/dL (women) or on drug treatment for low HDL-C; triglycerides ≥150 mg/dL or on drug treatment for elevated triglycerides; blood pressure ≥130 SBP or ≥85 DBP or on antihypertensive drug treatment with a history of hypertension; WC ≥102 cm (men)/ ≥88 cm (women); and fasting plasma glucose ≥100 mg/dL or on drug treatment for elevated glucose58.
gDefined based on presence of at least one of: HbA1c level > 6.5%, fasting plasma glucose > 126 mg/dL, or use of diabetes medications.
hDefined based on the presence of both (1) self-reported coronary heart disease, heart failure, stroke, myocardial infarction, or angina and also (2) cardiovascular medication use including use of angina, hypertension, or lipid medications.
iDefined based on optimal levels for each of the following: adiposity (BMI < 25 kg/m2 and WC ≤ 88 cm (women)/WC ≤ 102 cm (men)); blood glucose (FPG < 100 mg/dL and HbA1c < 5.7% and not taking diabetes medication); blood lipids (TC:HDL < 3.5:1 and not taking lipid lowering medication); blood pressure (SBP < 120 mmHg, DBP < 80 mmHg and not taking blood-pressure lowering medication); and history of CVD (no self-reported CHD, heart failure, myocardial infarction, stroke, or angina).
Individual Food Compass Scores
To extend FCS for specific products to an individual’s overall diet, the FCS for each item reported in a person’s diet was weighted by its percent contribution to that person’s total energy intake, and then summed, to calculate an individual’s Food Compass Score (i.FCS, theoretical range: 1 to 100). Among U.S. adults, the mean (SD) i.FCS was 35.5 (10.9), with 5th and 95th values of 19.5 and 55.3 (Fig. 1). Nearly all (99.5%) U.S. adults had an i.FCS below 70 (a threshold previously defined at the product level for foods to encourage); and 32.7%, an i.FCS of 30 or below (a threshold previously defined at the product level for foods to minimize).
Fig. 1. Population energy-weighted, individual Food Compass Score (i.FCS) and individual component Domain Score densities among U.S. adults, 1999–2018.
Density plots reflect the population distribution of i.FCS among 47999 U.S. adults from 1999 to 2018. i.FCS was calculated as the energy-weighted mean of the Food Compass Score of all foods consumed, as reported in all available 24 hr recalls, ranging from 1 to 100. The same methods were used for calculating the component domain scores at the individual level. i.FCS individual Food Compass Score.
Characteristics of U.S. adults across i.FCS quintiles are presented in Table S4. Those with higher scores tended to be older, female, and have higher education and income. In crude (unadjusted) analyses, adults with a higher i.FCS were more likely to be of Asian or other racial descent and less likely to be of Non-Hispanic Black race/ethnicity; tended to have lower total energy intake; and were more likely to be non-smokers, drink more alcohol, and be less physically active. However, after adjusting these crude associations by age and sex, the differences by i.FCS quintiles in total energy intake and physical activity were attenuated.
Individuals with higher i.FCS (≥70) consumed a greater number of and percentage total energy contribution from products with FCS ≥ 70 (median [IQR] count: 13 [8, 20]; percentage energy: 65.6% [57.9, 71.4%])) compared to products with FCS 31–69 (6 [3, 9]; 14.2% [7.3, 23.2]) or FCS ≤ 30 (2 [1, 4]; 2.3% [0.2, 6.1]) (Table S5). In comparison, individuals with lower i.FCS (≤ 30) consumed a greater number of and percentage energy contribution from products with FCS ≤ 30 (11 [7, 15]; 65.0% [56.1, 74.4]) compared to products with FCS 31–69 (8 [5, 12]; 39.1% [28.3, 48.8]) or FCS ≥ 70 (2 [1,4]; 5.9% [2.1, 10.4]). At the same time, however, for any i.FCS, there was also substantial heterogeneity in the counts and energy contribution from products of different FCS (Fig. 2), indicating that different people could arrive at similar overall i.FCS in different ways.
Fig. 2. Examples of individuals having the mean U.S. i.FCS of 35.5, demonstrating variation in the consumption of products with FCS ≤ 30, 31–69, and ≥70 that achieves a similar overall score.
The distribution (count and contribution to total energy intake [%]) of consumed food and beverage products, based on previously defined product healthfulness thresholds (i.e., FCS ≤ 30 as products to minimize; FCS 31-69 as products to be consumed in moderation; and FCS ≥ 70 as products to be encouraged) was assessed for all NHANES participants. Examples are shown for 10 participants with i.FCS at the U.S. mean (35.5 ±1) and consuming at least 10 different products over 2 days of 24 hr recalls. The numbers within each stacked bar graph indicate the count of food and beverage products consumed from that category of FCS across two days of reported intake; and the color bars represent the percentage energy contribution of food and beverage products from that category of FCS across two days of reported intake. FCS Food Compass Score, grad graduate, HS high school, i.FCS individual Food Compass Score.
Validation of Food Compass against a healthy dietary pattern
To understand how a person’s i.FCS, derived from the product scores for the individual food and beverage products they consume, correlated with an externally derived healthy dietary pattern, we calculated the Healthy Eating Index (HEI) 2015 for each individual, a measure of adherence to the Dietary Guidelines for Americans that has been widely validated against health outcomes including mortality, cardiovascular and metabolic endpoints, and cancer16–19. The i.FCS for individuals highly correlated with HEI-2015 (R = 0.81) (Fig. 3), with strong correlations persisting across population subgroups ranging from 0.76 to 0.83 (Table S6). Individually, the 9 domains of the FCS were not as highly correlated, ranging from 0.23 for i.Specific Lipids to 0.76 for i.Nutrient Ratios (Fig. S2).
Fig. 3. Relationship between the Healthy Eating Index (HEI) 2015 and energy-weighted, individual Food Compass Score (i.FCS) among U.S. adults, 1999–2018.

i.FCS was calculated by taking the energy-weighted mean of the respective score for all foods consumed by that individual reported in up to two 24 hr recalls. Black dots represent each NHANES respondent; solid line, the line of best fit between all NHANES respondents; and R value, the correlation and p-value (2-sided) for the Spearman correlation. i.FCS individual Food Compass Score, NHANES National Health and Nutrition Examination Survey.
Validation of Food Compass against clinical risk factors
To understand how a person’s i.FCS was cross-sectionally associated with major clinical risk factors in NHANES (body mass index [BMI] [weight in kilograms divided by height in meters squared]; systolic and diastolic blood pressure [mm Hg]; HDL-cholesterol, LDL-cholesterol, triglycerides [mg/dL], and total cholesterol: HDL ratio; and Hba1c [%] and fasting plasma glucose [mg/dL]), we evaluated survey-weighted linear regression models for each standard deviation (SD) unit increase in i.FCS, adjusting for key sociodemographic and lifestyle factors. Higher i.FCS was significantly associated with more favorable levels of nearly all risk factors evaluated (Table 2). For example, each SD increase in i.FCS (~10.9 points out of 100) was associated with lower BMI (−0.60 kg/m2 [−0.70, −0.51]), systolic blood pressure (−0.69 mmHg [−0.91, −0.48]), LDL-C (−2.01 mg/dl [−2.63, −1.40]), HbA1c (−0.02% [−0.03, −0.01]); and fasting plasma glucose (−0.44 mg/dL [−0.74, −0.15]); and higher HDL-C (1.65 mg/d: [1.44, 1.85]). The association between i.FCS and triglyceride levels was nonsignificant (−1.55 mg/dL [−3.13, 0.03]). Further adjustment for total energy intake and NHANES cycle had no appreciable effect on the magnitude or statistical significance of these relationships (Table S7).
Table 2.
Multivariable-adjusted, cross-sectional associations of the individual Food Compass Score (i.FCS) with major clinical risk factors and prevalent health conditions among U.S. adults, 1999–2018
| i.FCS, per SD (10.9 points)a | |
|---|---|
| Clinical risk factors | Difference (95% CI)b |
| BMI, kg/m2 | −0.60 (−0.70, −0.51) |
| Systolic blood pressure, mmHg | −0.69 (−0.91, −0.48) |
| Diastolic blood pressure, mmHg | −0.49 (−0.66, −0.32) |
| LDL-C, mg/dL | −2.01 (−2.63, −1.40) |
| HDL-C, mg/dL | 1.65 (1.44, 1.85) |
| Triglycerides, mg/d | −1.55 (−3.13, 0.03) |
| TC: HDL ratio, 1 unit | −0.13 (−0.15, −0.12) |
| HbA1c, % | −0.02 (−0.03, −0.01) |
| Fasting plasma glucose, (mg/dL) | −0.44 (−0.74, −0.15) |
| Prevalent health conditions | Odds ratio (95% CI)b |
| Metabolic syndromec | 0.85 (0.82, 0.88) |
| Diabetesd | 0.96 (0.91, 1.01) |
| Cardiovascular diseasee | 0.92 (0.88, 0.96) |
| Cancer | 0.95 (0.91, 0.99) |
| Lung disease | 0.92 (0.88, 0.96) |
| Optimal cardiometabolic healthf | 1.24 (1.16, 1.32) |
BMI body mass index, CI confidence interval, CVD cardiovascular disease, HbA1c hemoglobin A1c, HDL-C high-density lipoprotein cholesterol, i.FCS individual, dietary Food Compass Score, LDL low density lipoprotein cholesterol, TC:HDL total cholesterol to HDL-cholesterol ratio.
aIndividual Food Compass Score (i.FCS) calculated as the energy-weighted mean of FCS of all foods consumed, as reported in up to two 24 hr recalls per person, with a potential range from 1 to 100.
bSurvey-weighted, multivariable-adjusted linear or logistic regression models incorporated NHANES dietary recall sample weights to account for the complex survey design and response rates and provide nationally representative effect estimates and associated 95% confidence interval for the non-institutionalized U.S. population. All models adjusted for: age (years), age2 (years), sex, race/ethnicity (Mexican-American, other Hispanic, non-Hispanic White, non-Hispanic Black, Asian/other race), education level (<HS graduate, HS graduate, some college or associates degree, ≥ college graduate), income (poverty: income ratio), smoking status (non-smoker, former smoker, current smoker); total physical activity (MET-hours/week), alcohol use (%energy), and self-reported diabetes (yes, no).
cDefined according to AHA/NHLBI as the presence of three or more of: HDL-C < 40 mg/dL (men)/<50 mg/dL (women) or on drug treatment for low HDL-C; triglycerides ≥150 mg/dL or on drug treatment for elevated triglycerides; blood pressure ≥130 SBP or ≥85 DBP or on antihypertensive drug treatment with a history of hypertension; WC ≥102 cm (men)/ ≥88 cm (women); and fasting plasma glucose ≥100 mg/dL or on drug treatment for elevated glucose58.
dDefined based on presence of at least one of: HbA1c level > 6.5%, fasting plasma glucose > 126 mg/dL, or use of diabetes medications.
eDefined based on the presence of both (1) self-reported coronary heart disease, heart failure, stroke, myocardial infarction, or angina and also (2) cardiovascular medication use including use of angina, hypertension, or lipid medications.
fDefined based on optimal levels for each of the following: adiposity (BMI < 25 kg/m2 and WC ≤ 88 cm (women)/WC ≤ 102 cm (men)); blood glucose (FPG < 100 mg/dL and HbA1c < 5.7% and not taking diabetes medication); blood lipids (TC:HDL < 3.5:1 and not taking lipid lowering medication); blood pressure (SBP < 120 mmHg, DBP < 80 mmHg and not taking blood-pressure lowering medication); and history of CVD (no self-reported CHD, heart failure, myocardial infarction, stroke, or angina).
We used the same statistical methods to explore the association between an individual’s FCS domain scores (separately derived from the product level domain score) and clinical risk factors. i.Nutrient Ratios, i.Food Ingredients, and i.Minerals each generally showed stronger protective associations with clinical risk factors, while i.Fiber and Protein, i.Processing, and i.Phytochemicals generally showed less robust associations (Table 3). Considering each major clinical risk factor, i.Nutrient Ratios had the strongest protective association with BMI (−0.47 kg/m2 [−0.56, −0.39]), diastolic blood pressure (−0.52 mmHg [−0.69, −0.36]), LDL cholesterol (−2.12 mg/dL [−2.76, −1.47]), HDL-cholesterol (1.50 mg/dL [1.29, 1.71]), and the TC:HDL ratio (−0.14 [−0.16, −0.12]); and i.Food Ingredients with systolic blood pressure (−0.86 mmHg [−1.07, −0.64]), triglycerides (−3.23 mg/dL [−4.73, −1.73]), HbA1c, (−0.01% [−0.02, −0.01]), and fasting plasma glucose (−0.44 mg/dL [−0.73, −0.14]). However, an individual’s FCS domain score generally had significant protective associations with at least several of the different risk factors.
Table 3.
Multivariable adjusted, survey-weighted, cross-sectional analysis of the association between the 9 components of the Food Compass Score at the individual level (i.Domain Scores) and cardiometabolic health among U.S. adults, 1999–2018
| i.Domain Scoresa | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| i.Nutrient Ratios | i.Vitamins | i.Minerals | i.Food Ingredients | i.Additivesc | i.Processing | i.Fiber and Proteinc | i.Specific Lipids | i.Phytochemicals | |
| Continuous biomarkers (difference [95% CI])b | |||||||||
| BMI, kg/m2 | −0.47 (−0.56, −0.39)*** | −0.27 (−0.37, −0.18)*** | −0.46 (−0.55, −0.37)*** | −0.46 (−0.54, −0.37)*** | −0.21 (−0.31, −0.1)*** | −0.30 (−0.38, −0.21)*** | −0.04 (−0.13, 0.06) | −0.16 (−0.26, −0.07)*** | −0.32 (−0.41, −0.23)*** |
| Systolic blood pressure, mmHg | −0.62 (−0.85, −0.40)*** | −0.27 (−0.5, −0.05)* | −0.50 (−0.71, −0.30)*** | −0.86 (−1.07, −0.64)*** | 0.16 (−0.05, 0.37) | −0.04 (−0.25, 0.18) | −0.06 (−0.28, 0.16) | −0.24 (−0.47, −0.01)* | −0.30 (−0.49, −0.12)** |
| Diastolic blood pressure, mmHg | −0.52 (−0.69, −0.36)*** | −0.35 (−0.49, −0.2)*** | −0.42 (−0.57, −0.27)*** | −0.35 (−0.51, −0.19)*** | −0.18 (−0.34, −0.02)* | −0.08 (−0.23, 0.08) | −0.34 (−0.5, −0.18)*** | 0.01 (−0.14, 0.16) | −0.26 (−0.42, −0.11)** |
| LDL-C, mg/dL | −2.12 (−2.76, −1.47)*** | −1.24 (−1.81, −0.66)*** | −0.96 (−1.51, −0.42)*** | −1.69 (−2.24, −1.15)*** | −0.88 (−1.46, −0.30)** | −0.32 (−0.97, 0.33) | −0.88 (−1.47, −0.29)** | −1.46 (−2.07, −0.85)*** | −0.94 (−1.53, −0.34)** |
| HDL-C, mg/dL | 1.50 (1.29, 1.71)*** | 0.80 (0.61, 0.98)*** | 0.98 (0.8, 1.16)*** | 1.29 (1.09, 1.50)*** | 1.19 (0.99, 1.39)*** | 0.79 (0.58, 1.00)*** | 1.10 (0.92, 1.29)*** | 0.67 (0.47, 0.86)*** | 0.54 (0.35, 0.72)*** |
| Triglyceride, mg/dL | −1.87 (−3.49, −0.25)* | 0.03 (−1.35, 1.41) | 0.50 (−1.02, 2.01) | −3.23 (−4.73, −1.73)*** | 0.50 (−0.88, 1.88) | 1.20 (−0.45, 2.84) | 0.36 (−1.12, 1.84) | −2.35 (−3.89, −0.8)** | −0.60 (−2.08, 0.88) |
| TC: HDL ratio, 1 unit | −0.14 (−0.16, −0.12)*** | −0.07 (−0.09, −0.05)*** | −0.07 (−0.09, −0.05)*** | −0.12 (−0.13, −0.10)*** | −0.08 (−0.1, −0.06)*** | −0.04 (−0.06, −0.02)** | −0.07 (−0.09, −0.05)*** | −0.08 (−0.1, −0.05)*** | −0.05 (−0.07, −0.04)*** |
| HbA1c, % | −0.01 (−0.02, 0.00)* | −0.01 (−0.02, 0.00)** | −0.01 (−0.02, −0.01)** | −0.01 (−0.02, −0.01)*** | 0.01 (0.0, 0.01) | −0.01 (−0.02, 0.0)** | 0.00 (−0.01, 0.01) | 0.00 (−0.01, 0.01) | −0.01 (−0.02, −0.01)** |
| Fasting plasma glucose, mg/dL | −0.32 (−0.63, −0.01)* | −0.28 (−0.58, 0.01) | −0.35 (−0.65, −0.06)* | −0.44 (−0.73, −0.14)** | 0.33 (0.07, 0.6)* | −0.17 (−0.48, 0.15) | 0.04 (−0.26, 0.34) | −0.15 (−0.45, 0.15) | −0.22 (−0.51, 0.08) |
| Prevalent health conditions (odds ratio [95% CI])† | |||||||||
| Metabolic syndromed | 0.86 (0.84, 0.89)*** | 0.94 (0.91, 0.97)*** | 0.90 (0.87, 0.93)*** | 0.88 (0.85, 0.91)*** | 0.93 (0.9, 0.96)*** | 0.94 (0.91, 0.97)*** | 0.97 (0.93, 1.00)* | 0.96 (0.93, 0.99)* | 0.92 (0.89, 0.95)*** |
| Diabetese | 0.94 (0.89, 0.99)* | 1.03 (0.97, 1.09) | 0.99 (0.93, 1.05) | 0.94 (0.90, 1.00)* | 1.04 (0.99, 1.1) | 1.00 (0.95, 1.05) | 1.07 (1.01, 1.13)* | 0.96 (0.91, 1.02) | 0.97 (0.91, 1.03) |
| Cardiovascular diseasef | 0.96 (0.91, 1.01) | 1.00 (0.95, 1.04) | 0.95 (0.90, 0.99)* | 0.94 (0.89, 0.99)* | 0.92 (0.88, 0.97)** | 0.91 (0.87, 0.96)*** | 0.94 (0.89, 0.99)* | 0.95 (0.90, 1.00)* | 1.01 (0.97, 1.05) |
| Cancer | 0.94 (0.90, 0.99)* | 0.96 (0.91, 1.00) | 0.96 (0.92, 1.01) | 1.02 (0.97, 1.06) | 0.93 (0.89, 0.97)** | 0.95 (0.90, 1.00)* | 0.93 (0.88, 0.98)** | 1.02 (0.98, 1.07) | 1.00 (0.94, 1.05) |
| Lung disease | 0.90 (0.9, 0.97)*** | 0.93 (0.90, 0.96)*** | 0.91 (0.87, 0.95)*** | 1.00 (0.96, 1.04) | 0.93 (0.89, 0.96)*** | 0.92 (0.89, 0.95)*** | 0.90 (0.87, 0.93)*** | 1.05 (1.01, 1.08)** | 0.95 (0.92, 0.99)* |
| Optimal cardiometabolic healthg | 1.22 (1.15, 1.29)*** | 1.05 (0.99, 1.11) | 1.11 (1.05, 1.18)*** | 1.20 (1.13, 1.29)*** | 1.10 (1.03, 1.17)** | 1.09 (1.03, 1.15)** | 1.03 (0.98, 1.09) | 1.08 (1.03, 1.14)** | 1.12 (1.07, 1.17)*** |
BMI body mass index, CI confidence interval, CVD cardiovascular disease, HbA1c hemoglobin A1c, HDL-C high-density lipoprotein cholesterol, i.FCS individual, dietary Food Compass Score, LDL-C low density lipoprotein cholesterol, TC:HDL total cholesterol to HDL-cholesterol ratio.
*P < 0.05, **P < 0.01, ***P < 0.001.
ai.Domain Scores represent the nine component domains of the Food Compass Score at the individual level. For each i.Domain Score, the energy-weighted mean for all reported foods consumed, as reported in 24 hr dietary recalls, was calculated, as with i.FCS. Differences and odds ratios reflect a 1 standard deviation increase in respective i.Domain Score.
bSurvey-weighted, multivariable-adjusted linear regression (or logistic regression) models incorporated NHANES dietary sample weights to account for the complex survey design and response rates and provide nationally representative effect estimates (or odds ratios) and associated 95% confidence interval for the non-institutionalized U.S. population. All outcomes are reported in their units. All models adjusted for: age (years), age2 (years), sex, race/ethnicity (Mexican-American, other Hispanic, non-Hispanic White, non-Hispanic Black, Asian/other race), education level (< HS graduate, HS graduate, some college or associates degree, ≥ college graduate), income (poverty: income ratio), smoking status (non-smoker, former smoker, current smoker); total physical activity (MET-hours/week), alcohol use (%energy), and self-reported diabetes (yes, no).
cThe i.Additives domain, of which added sugar is a major component, was cross-sectionally associated with more favorable levels of BMI, diastolic blood pressure, LDL-C, HDL-C, TC:HDL, metabolic syndrome, CVD, cancer, lung disease, and optimal cardiometabolic health, but also higher fasting glucose and nonsignificant trend toward higher prevalence of diabetes; suggesting potential for some residual reverse causation (people with higher blood sugar avoiding added sugar in their diets). Similarly, the i.Fiber and Protein domain was cross-sectionally associated with more favorable levels of diastolic blood pressure, LDL-C, HDL-C, TC:HDL, metabolic syndrome, CVD, cancer, and lung disease, but also higher prevalence of diabetes, which was driven by the protein component when attributes were separately evaluated. This could relate to reverse causation, adverse effects of excess protein on hepatic de novo lipogenesis, or other adverse compounds, like heme iron, in high protein foods.
dDefined according to AHA/NHLBI as the presence of three or more of: HDL-C < 40 mg/dL (men)/<50 mg/dL (women) or on drug treatment for low HDL-C; triglycerides ≥150 mg/dL or on drug treatment for elevated triglycerides; blood pressure ≥ 130 SBP or ≥ 85 DBP or on antihypertensive drug treatment with a history of hypertension; WC ≥102 cm (men)/ ≥88 cm (women); and fasting plasma glucose ≥100 mg/dL or on drug treatment for elevated glucose58.
eDefined based on presence of at least one of: HbA1c level > 6.5%, fasting plasma glucose > 126 mg/dL, or use of diabetes medications.
fDefined based on the presence of both (1) self-reported coronary heart disease, heart failure, stroke, myocardial infarction, or angina and also (2) cardiovascular medication use including use of angina, hypertension, or lipid medications.
gDefined based on optimal levels for each of the following: adiposity (BMI < 25 kg/m2 and WC ≤ 88 cm (women)/WC ≤ 102 cm (men)); blood glucose (FPG < 100 mg/dL and HbA1c < 5.7% and not taking diabetes medication); blood lipids (TC:HDL < 3.5:1 and not taking lipid lowering medication); blood pressure (SBP < 120 mmHg, DBP < 80 mmHg and not taking blood-pressure lowering medication); and history of CVD (no self-reported CHD, heart failure, myocardial infarction, stroke, or angina).
Validation of Food Compass against prevalent health conditions
To assess how a person’s i.FCS was cross-sectionally associated with prevalent health conditions in NHANES (metabolic syndrome, cardiovascular disease, diabetes, lung disease, and optimal cardiometabolic health [optimal levels of a constellation of 5 cardiometabolic risk factors; see table S3 for detailed definition]), we evaluated survey-weighted logistic regression models for each standard deviation (SD) unit increase in i.FCS, adjusting for key sociodemographic and lifestyle factors. Higher levels of i.FCS were significantly associated with lower risk of several prevalent health conditions (Table 2). For example, each SD increase in i.FCS (~10.9 points) was associated with 15% lower prevalence of metabolic syndrome (OR = 0.85 [95%CI: 0.82, 0.88]), 8% lower prevalence of CVD (0.92 [0.88, 0.96]), 5% lower prevalence of cancer (0.95 [0.91, 0.99]),and 8% lower prevalence of lung disease (0.92 [0.88, 0.96]). A nonsignificant trend was seen toward lower prevalence of diabetes (0.96 [0.90, 1.01]). Each SD increase in i.FCS was also associated with 24% higher prevalence of optimal cardiometabolic health (1.24 [1.16, 1.32]). Further adjustment for total energy intake and NHANES cycle had no appreciable effects on the magnitude or statistical significance of results (Table S7).
We used the same statistical methods to explore the association between individual FCS domains (separately derived from the product level domain score) and prevalent health conditions. Each individual FCS domains was associated with a lower risk of metabolic syndrome, with i.Nutrient Ratios having the strongest inverse association (0.86 [0.84, 0.89]) (Table 3). For diabetes, i.Nutrient Ratios had the strongest inverse association (0.94 [0.89, 0.99]); for CVD, i.Minerals, i.Food Ingredients, i.Additives, i.Processing, i.Fiber and Protein, and i.Specific Lipids were each inversely associated (OR 0.91 to 0.95 each); for cancer, i.Nutrient Ratios, i.Additives, i.Processing and i.Fiber and Protein were each inversely associated (OR 0.93 to 0.95 each); and for lung disease, all domains except i.Food Ingredients were inversely associated. Seven of 9 individual domain scores (except i.Vitamins and i.Fiber and Protein) were also associated with higher prevalence of optimal cardiometabolic health. In further post-hoc exploratory analyses disaggregating the i.Fiber and Protein domain into its individual attributes (i.Fiber and i.Protein), i.Fiber was marginally associated with lower prevalence of diabetes (0.97 [0.92, 1.03]), while i.Protein was associated with higher prevalence of diabetes (1.10 [1.04, 1.16]).
Validation of Food Compass against all-cause and cause-specific mortality
To assess the prospective association between a person’s i.FCS and total mortality, we used linked, cause-specific mortality data from the National Death Index through 2018. Survey-weighted Cox proportional hazard models were used to estimate the hazard ratio (HR) and 95% CI between a one SD in i.FCS and risk of all-cause and cause-specific (cardiometabolic and cancer) mortality, adjusting for key sociodemographic and lifestyle factors, and with study time as the time variable. Over a total of 20.8 years of follow-up (with participants contributed person-time from the time of their enrollment until death or end-of-follow-up on January 1, 2019, whichever came first), 7481 deaths occurred, including 2619 (35.0%) due to cardiometabolic disease and 1691 (22.6%) due to cancer. In multivariable-adjusted analyses, each 1 SD (10.9) increase in i.FCS was prospectively associated with a 7% lower risk of all-cause mortality (HR = 0.93 [0.89, 0.96]) (Fig. 4, Table S8). For comparison, similar lower risk of all-cause mortality was seen among individuals with some college education compared to individuals with less than high school education (0.94 [0.86, 1.02] and for a 1 SD increase in total physical activity (0.90 [0.85, 0.94]). i.FCS was associated with a trend toward lower cancer-specific mortality (0.92 [0.85, 1.00]); and not significantly associated with cardiometabolic mortality (0.95 [0.89, 1.02]).
Fig. 4. Prospective associations of the individual Food Compass Score (i.FCS), sociodemographic factors, and other factors with all-cause mortality among U.S. adults, 1999–2018.
The i.FCS for each person was calculated as the energy-weighted mean of the FCS of all unique foods and beverages consumed, based on up to two 24 hr recalls per person, with a potential range from 1 to 100. Boxes represent the multivariable-adjusted hazard ratio; and error bars, the 95% CI; based on survey-weighted Cox proportional hazard models incorporating NHANES dietary recall sample weights to account for the complex survey design and response rates and provide nationally representative estimates for the non-institutionalized U.S. population. All variables in the Figure were included together in the multivariable model. AA Associates Degree, grad graduate, HS high school, i.FCS individual Food Compass Score, MET metabolic equivalent of task, %E percent energy, PIR ratio of family income to poverty threshold, SD standard deviation.
Further adjustment for clinical risk factors, prevalent conditions, and medications, which could each be confounders or mediators, did not appreciably alter these findings (Table S8). Censoring deaths at 85+ years as well as further adjustment for energy intake and survey cycle, slightly strengthened the magnitude of the protective association between i.FCS and all-cause mortality (0.91 [0.87, 0.95] and cancer mortality (0.92 [0.89, 0.96]) (Table S9). In addition, when censoring deaths at 85+ years, the association of i.FCS with CMD-specific mortality became marginally significant (0.92 [0.85, 1.00]).
In exploratory analyses, we investigated whether the relationship between i.FCS and total mortality varied in population subgroups according to age, sex, race/ethnicity, education, income, BMI, and plausibility of energy reporting. Findings were similar across subgroups, with no significant differences in the observed protective associations between i.FCS and mortality (p-interaction >0.05 each) (Table S10).
We used the same statistical methods to explore the association between individual FCS domains (separately derived from the product level domain score) and mortality. Exploring individual FCS domains, significant inverse associations with all-cause mortality were seen for i.Nutrient Ratios (0.92 [0.89, 0.96]), i.Food Ingredients (0.95 [0.91, 0.99]), i.Additives (0.95 [0.92, 0.98]), i.Processing (0.93 [0.89, 0.96]), i.Fiber and Protein (0.93 [0.90, 0.96]), and i.Specific Lipids (0.96 [0.92, 1.00]) (Table S8).
Using semi-parametric, restricted cubic splines, the relationship between i.FCS and all-cause mortality appeared potentially nonlinear, with a stronger protective association until an i.FCS of ~40 (approximately the 75th percentile score), with a less strong inverse relationship thereafter, but this potential nonlinearity was not statistically significant (p-nonlinearity = 0.12) (Fig. S3).
Discussion
Based on nationally representative data among U.S. adults, we assessed individuals’ dietary habits based on the Food Compass-derived nutrient profiling scores of the individual food and beverage products they consumed. An individual’s dietary quality assessed by Food Compass (i.FCS) was highly correlated with HEI-2015 (R = 0.81), validating the use of Food Compass scoring of specific food and beverage products for adequately reflecting dietary quality. Furthermore, after adjustment for other major risk factors, i.FCS was associated with more favorable levels of major clinical risk factors, major disease conditions, and optimal cardiometabolic health. Prospectively, each SD increase in i.FCS (~11 points out of 100) was associated with a 7% lower risk of all-cause mortality. Taken together, these findings support the validity of the FCS as a tool to score the healthfulness of individual food and beverage products.
These findings support the original design intent of Food Compass – to incorporate the best available science on health impacts of nutrients, food ingredients, and other emerging food and beverage characteristics of public health concern, selected and weighted to identify more healthful products15. Notably, different people could arrive at a similar i.FCS with very different combinations of healthier or less healthy foods, and yet the overall i.FCS still associated with a healthy diet pattern, health risk factors, prevalent disease conditions, and total mortality. Together with our previous demonstration of content validity, discriminatory validity, and convergent validity of the Food Compass when applied to individual products15, the current demonstration of construct validity against population diet quality, risk factors, health conditions, and mortality lends further support to this NPS as a valid measure of product healthfulness.
The identified associations for individual domains also support biologic plausibility of the findings. For example, the i.Nutrient Ratios domain was associated with blood lipid levels, adiposity, blood pressure, blood glucose levels, prevalent metabolic syndrome, and prevalent diabetes supported by robust evidence for effects of dietary fat quality on these outcomes20, of carbohydrate quality (i.e., total carbohydrate: fiber ratio) on adiposity and diabetes risk21,22, and of the joint intakes of potassium and sodium on blood pressure and stroke risk23. The i.Food Ingredients domain was also robustly associated with several major health endpoints, consistent with the established etiologic associations of the included healthful (fruits, non-starchy vegetables, beans/legumes, whole grains, nuts/seeds, seafood, yogurt, plant oils) and unhealthful (refined grains, red/processed meat) attributes in this domain24,25. i.Processing was associated with BMI, consistent with interventional evidence suggesting that diets high in ultra-processed foods cause excess caloric intake and weight gain26. The less robust (or nonsignificant) associations of i.Phytochemicals and i.Specific Lipids with several health endpoints are consistent with the more emerging nature of scientific evidence for causal effects of flavonoids, carotenoids, and specific lipids on major health outcomes, and support the lower weighting of these domains in the Food Compass scoring15. The observed harmful association between the i.Protein attribute and prevalent diabetes is consistent with meta-analyses of prospective cohort studies which identified a positive association between higher protein intake and higher incidence diabetes27; as well as randomized trials where higher protein intake was associated with hepatic de novo lipogenesis28—a driving pathway for fatty liver, visceral fat accumulation, and insulin resistance.
One striking finding was the low average i.FCS across the U.S. adult population. The mean score (~36) was not much higher than the identified threshold score (FCS ≤ 30) for individual food and beverage products to be minimized in the diet15, and fewer than 1% of U.S. adults had an i.FCS above 70 (the identified threshold score for food and beverage products to be encouraged). These results suggest that, based on NPS scoring of individual foods and beverages, the leading caloric contributors to American’s diets are, on average, mostly foods and beverages to be minimized or avoided. Using the cut-points of ≤ 30, 31–69, and ≥ 70 for i.FCS, we identified 32.7% of U.S. adults having poor diet quality, 66.8%, intermediate; and 0.5%, ideal. These results using an NPS of individual foods and beverages are consistent with previous studies based on validated diet pattern scores, which reported 46% of U.S. adults having poor diet quality; 53%, intermediate diet quality; and only 1%, ideal diet quality29,30. Our findings further highlight the suboptimal state of U.S. nutrition, and the urgent need for multistakeholder policy, business, and other systems interventions to improve diet quality.
In the U.S., the FDA is currently finalizing a definition to permit industry use of the word “healthy” on food and beverage products, including plans for a corresponding “FDA healthy” front-of-package icon31. Separately, major food retailers are also designing their own, different, industry-developed nutrient profiling systems for product labeling on their shelves and online7–9. In the European Union, seven countries have pledged to support coordination for promoting a unified front-of-pack (FOP) label5, with further calls by non-governmental bodies to adapt nutrient profiling as a tool to guide consumer choice32. The Food Compass could be a possible unified standard NPS for these efforts, given its association with a healthy diet pattern and multiple health endpoints and its use of a single, consistent algorithm for all foods, beverages, mixed ingredients, and mixed meals. Such a standard may also provide a more accurate assessment of healthfulness for consumers, industry, and procurement decisions, in comparison to strategies using isolated nutrients or ingredients, such as the FOP warning labels recently implemented in Chile, Mexico, Uruguay, and Israel33. While the Food Compass has more attributes than existing NPS, its core design strengths are the use of domains, which provides a holistic assessment of foods and beverages while also preventing the excess weight from any single attribute; integration of cutting edge science in selection of attributes (i.e., nutrient ratios which more accurately capture fat, mineral and carbohydrate quality) and omission of outdated attributes such as total fat or total calories; scoring on a basis of 100 kcal rather than 100 g, which greatly reduces bias from water weight; and incorporation of novel relevant features such as processing and additives. In future work, we plan to test FCS versions that retain these core design strengths while employing a more limited set of attributes; as well as leverage multivariable imputation to handle missing values in certain attributes in large datasets of branded products. Extension to datasets in low and middle income countries will also be important.
While FOP labeling of retail products is timely and relevant, an NPS like Food Compass that scores mixed meals and complex foods has broader applications. In this national dataset, Food Compass was able to score not only manufactured products but a person’s entire diet, including complex home-cooked, cafeteria, and restaurant mixed meals which other major NPS generally cannot do. This raises the potential for nutrient profiling for labeling of restaurant menus; as well as for expanding economic incentives and subsidies beyond fruits and vegetables in programs like the U.S. Supplemental Nutrition Assistance Program (SNAP)34 and insurance wellness programs35. Similarly, food manufacturer pledges for product reformulation, often based on internally selected goals for isolated single nutrients36,37, could be based on an independently derived, validated NPS. And, socially-driven investment metrics can incorporate nutrition and health by leveraging NPS to identify companies with healthier product portfolios and sales4,12. NPS like Food Compass can be one important tool in such strategies, allowing consumers, industry (including food manufacturers, retailers, and restaurants), investors, schools, hospitals, worksites, and policy makers to identify and shift toward healthier food and beverage options. Our current findings also support flexibility for these different stakeholders in improving healthfulness of their portfolios, as different people arrived at their i.FCS scores with very different combinations of healthier or less healthy foods, and yet still the overall i.FCS was predictive of HEI, health risk factors, prevalent disease conditions, and total mortality.
Further analyses are required to assess Food Compass’s comparative utility to other NPS across these various public and private use cases. Additionally, we have not compared the validity of different NPS against health outcomes, and in different nations and subpopulations – an important area for future work. Finally, the long-term vision of Food Compass is to score additional features of foods and beverages, such as environmental sustainability, social justice, and animal welfare – one for each direction of the compass. Future work is required to explore, add, and validate these additional dimensions.
The ability to score food and beverage items across 9 component domains is an important feature of the FCS and current investigation. Among the 9 domains, applied to individual items and then energy-weighted to persons’ diets, none by itself had as strong an association with HEI, nor with the full range of health risk factors, prevalent diseases, and total mortality, as the overall i.FCS. At the same time, each i.Domain Score had some associations with HEI as well as varying strengths of associations with different health endpoints. These findings support the complementary nature of the different domains, each providing supportive and somewhat distinct information, as components of the Food Compass. The growing evidence for complex, heterogenous effects of diet on health (e.g., via the gut microbiota, epigenetics, etc.)38 further supports the utility of a more holistic, multi-domain measure of healthfulness of foods and beverages. In addition, the domain structure ensures that no one domain or attributes can drive the overall product score—a mechanism which prevents mis-scoring based on extreme values of a single or few nutrients as well as industry “gaming” the system by fortifying food products with isolated vitamins. This holistic, domain-based scoring also permits a single, consistent scoring algorithm across all food and beverage products, including mixed meals – in contrast to all other major current NPS, which require subjective grouping of foods and beverages into multiple categories that use differing algorithms and/or scoring thresholds and have trouble scoring mixed meals that contain ingredients across two or more of these categories15.
Strengths
This investigation examined nationally representative data on nearly 50,000 U.S. adults, increasing generalizability and statistical power. Dietary data were calculated using up to two 24 hr recalls per individual, allowing precise scoring of individual food and beverage products. While some prior NPS validation studies utilized detailed product information from 24 hr recalls39–44, others utilized food frequency questionanires45–48 that only allow scoring of major food categories (e.g., fruits, vegetables, red meats, poultry, eggs, salty snacks, breakfast cereals), which does not truly validate the ability of the NPS to discriminate healthfulness of the diverse food and beverage products within each of these categories. Both the FCS and i.FCS were weighted by energy (kcal), rather than weight or portion size, which prevents bias or the need for differing scoring cut-points for different food and beverage categories due to differences in water weight, fiber, or fat content.
Our endpoints were clinically relevant and well measured, including dietary quality assessed using a validated diet pattern score; major clinical risk factors and prevalent diseases, using standardized physical examination and laboratory measures; and all-cause mortality, prospectively using the NDI. Qualitatively, we evaluated and demonstrated favorable associations with a greater number of cardiometabolic health outcomes than any prior NPS validation study to date. All analyses adjusted for a range of well measured demographic, socioeconomic, and lifestyle factors, reducing confounding. Relationships with mortality were prospectively assessed, establishing temporality. Findings were similar in several sensitivity analyses, increasing confidence in robustness of the findings.
Limitations
Cross-sectional analyses preclude assessment of temporality; however, findings were consistent in prospective analyses of all-cause mortality. Use of energy-weighting to calculate i.FCS captures overall dietary composition but provides lower weighting to certain foods with lower calories per servings, such as fruits and vegetables. However, portion- or gram-based weighting would create bias by failing to account for water weight, as described above. For practical goals of creating a single NPS score, some simplifications were required, such as using the RDA for 19–50 year-old men as target thresholds for several nutrients. In the future, a more “personalized” FCS could be crafted based on specific characteristics of the individual, such as age, sex, disease status, and more. Some nutrients such as Vitamin D, choline, and flavonoids were only available in certain NHANES cycles, requiring imputation in other cycles. Because errors in nutrient imputation would unlikely be systematic with respect to disease outcomes or total mortality, this imputation may have attenuated findings toward the null. While using two 24 hr recalls provides a reasonable measure of an individual’s diet, within-person dietary variation reduces the ability to detect associations with clinical risk factors and health outcomes49. Misreporting and omission of food items by dietary recall participants was also possible. If such error were random with respect to the outcomes, this would attenuate findings toward the null. If systematic with respect to the outcomes, for instance if individuals at higher health risk were more likely to underreport unhealthy foods and beverages, this would artificially inflate these individuals’ i.FCS – a bias which would weaken the reported findings and make the i.FCS appear less protective than it actually is.
Certain fasting biomarker levels (e.g., triglycerides, fasting plasma glucose) were missing for a significant proportion of the NHANES sample, which may have weakened the accuracy of the estimates for these health indicators. However, assessment of complementary biomarkers with less missingness for long-term glucose homeostasis (HbA1c) and blood lipid profiles (HDL-C, total cholesterol: HDL ratio) yielded generally consistent findings. The associations of i.FCS with diet quality, clinical risk factors, and mortality are observational, and residual confounding cannot be excluded. Yet, associations remained significant after adjustment for a range of well-measured covariates, and analyses of individual domains are consistent with known biologic effects of several of the attributes within these domains. Finally, we did not compare the findings to other NPS, which can be done in future work.
The Food Compass, extended based on scores of specific food and beverage products to a person’s dietary intake, associates with a healthy dietary pattern, major clinical risk factors and prevalent conditions, and longitudinal risk of all-cause mortality. These findings support the validity of Food Compass as a potential tool to guide public and private strategies to identify and encourage healthier foods and beverages.
Methods
Study design and participants
We utilized nationally representative data among 47,999 adults aged 20–85 years enrolled in the 10 most recent cycles of NHANES, 1999–2000 through 2017–2018 (Fig. S1). NHANES uses trained personnel and standardized methods including in-person physical examination to collect information on sociodemographic factors, clinical risk factors, health conditions, and dietary and other lifestyle habits from recurrent cross-sectional nationally representative samples. In addition, through the most current available linkage to the National Death Index (NDI), we prospectively evaluated mortality outcomes among 47,918 NHANES adult participants from the 1999–2000 to 2017–18 cycles (excluding 81 participants who did not have linked NDI information). All analyses accounted for the complex survey design and sampling weights to be representative of the non-institutionalized U.S. population. NHANES was approved by the Institutional Review Board at the National Center for Health Statistics (NCHS), and all participants provided informed written consent, and were provided compensation and a report of medical findings.
Dietary assessment
As part of the What We Eat in America (WWEIA) component of NHANES, dietary intake was assessed using standardized 24 hr dietary recalls, administered by trained NHANES interviewers, using United States Department of Agriculture’s (USDA) Automated Multiple Pass Method50. A single 24 hr recall was completed in 1999–2000 and 2001–2002, and a second 24 hr recall was added for most respondents (87%) from 2003–2004 onwards. We used all complete and reliable dietary records available51 (Supplementary Note 1), as determined by the National Center for Health Statistics staff, averaging the two recalls for each individual when present to reduce the influence of day-to-day dietary variation.
Product-level nutritional information was extracted from the Food and Nutrient Database for Dietary Studies (FNDDS) from 2001–2018, food ingredients data from the Food Pattern Equivalents Database (FPED), and flavonoid data from the 2010 USDA Flavonoid database. We excluded infant formula, baby foods, alcohol, specialized medical foods and dietary supplements, and items providing < 5 kcal per 100 g (e.g., unsweetened tea or coffee, diet soda). Any missing attributes (i.e., nutrient quantities) required for nutrient profiling were estimated by first propagating attribute values for an item with an identical food code available in the most recent survey cycle; and then for the remaining, by predictive mean matching (PMM) imputation using FNNDS and FPED attribute values and WWEIA indicator category (152 categories) as predictors for the food item (Supplementary Note 2, Table S11). PMM imputation fills in a value randomly from among the observed donor values from an observation whose regression-predicted values are closest to the regression-predicted value for the missing value from a simulated model to ensure the imputed values are plausible, particularly when the normality assumption is violated52,53.
Food Compass scoring
Briefly, Food Compass incorporates 54 attributes across 9 domains, namely: nutrient ratios, vitamins, minerals, food ingredients, additives, processing, specific lipids, fiber and protein, and phytochemicals (Tables S1–S2)15. Each attribute is scored from 0 to 10 for beneficial attributes, −10 to 0 for harmful attributes and −10 to 10 for ratios, with domain scores calculated as an average of its component attributes (or for the food ingredients domain, as the sum). Domain scores are summed, with the latter three domains (e.g., specific lipids, fiber and protein, and phytochemicals) each receiving a half weight; and the summary FCS scaled to range from 1 (least healthful) to 100 (most healthful). The algorithm included several minor modifications, identified since publication of our original report, such as a more graded approach to scoring processed meats and fermented foods (Supplementary Note 3, Table S12). In addition, due to data unavailability in FNDDS, seven attributes were excluded from Food Compass scoring (Table S1). The same attribute scoring and algorithm are used across all food and beverage products. Items with a score of ≥ 70 were generally identified as foods to be encouraged; with a score of 31–69, foods to be eaten in moderation; and with a score of ≤30, to be minimized15.
To extend FCS for specific products to an individual’s overall diet, the FCS score for each item reported in a person’s diet was weighted by its percent contribution to that person’s total energy intake, and then summed, to calculate an individual’s Food Compass Score (i.FCS). Because alcohol is not scored by FCS, energy from alcohol intake was excluded from the i.FCS calculation and included as a covariate in all models. The i.FCS for each individual could theoretically range from 1 to 100.
To understand how different combinations of items could contribute to a similar i.FCS, we assessed the distribution (number and contribution to total energy intake) of consumed food and beverage products, based on previously defined product healthfulness thresholds of FCS ≤ 30, 31–69, and FCS ≥ 70, among subgroups of the US populations having various i.FCS.
To explore the contribution of each separate Food Compass domain to dietary quality and health associations, we also calculated an energy-weighted individual dietary score for each of the 9 FCS domains, using similar methods, and also each ranging from 1 to 100. The i.Domain scores were not used to derive the i.FCS, but separately calculated from the product-level domain scores to explore their associations with the endpoints of this investigation.
Sociodemographic factors and covariates
In NHANES, sociodemographic factors including age, sex, race/ethnicity, household income, and education levels were assessed by trained personnel. Race/ethnicity was self-identified by each participant based on standardized response categories to questions pertaining to race and Hispanic origin: non-Hispanic White, non-Hispanic Black, Mexican-American, other Hispanic, and Asian or other race, including multiracial. The ratio of family income to poverty level was calculated by dividing the participant’s self-reported family income by the poverty guidelines specific to family size, appropriate year, and state. Additional lifestyle covariates included in the models were: smoking status, total physical activity, average total energy intake, and alcohol consumption. Smoking status was categorized as never, former, or current smoker based on a range of questions regarding the respondent’s self-reported smoking habits. Total physical activity was assessed continuously as self-reported Metabolic Equivalent of Task (MET)-hours per week. Methodological differences in how physical activity was assessed in NHANES from 1999 to 2006 and from 2007 to 2018 preclude the use of a consistent variable for total physical activity across survey cycles. As such, we used a total physical activity variable (MET-hours/week) evaluated from 2007 to 2018, a leisure time physical activity variable (MET-hours/week) for all survey cycles, and all other demographic and health data to impute total physical activity (MET-hours/week) for 1999–2006 survey cycles using PMM (Supplementary Note 4, Table S13–15). The relative energy contribution of each food item was accounted for in the dietary i.FCS calculation. Total energy intake was not included as a component of either the FCS (food level) or the i.FCS (dietary level). Thus, we adjusted for total energy intake derived from the 24 hr recall data available and adjusted for it continuously as a potential confounder in the models. Alcohol consumption was also modeled continuously as the percentage of average total energy intake from alcohol.
Outcomes
i.FCS was validated against three different outcomes: (a) the Healthy Eating Index (HEI) 2015, a validated measure of a healthy dietary pattern; (b) prevalence of major clinical risk factors and health conditions; and (c) prospective risk of all-cause and cause-specific mortality.
Healthy dietary pattern
We calculated HEI 2015 for each individual, a measure based on adherence to the 2015–2020 Dietary Guidelines for Americans (DGAs). HEI 2015 is calculated based on key food groups and nutrients emphasized in the DGAs, resulting in a score ranging from 0 to 100. HEI 2015 has been widely validated against health outcomes, including all-cause and cardiometabolic mortality, cardiometabolic health outcomes, and cancer16–19.
Clinical risk factors and prevalent health conditions
We assessed major clinical risk factors in NHANES including body mass index (BMI) (weight in kilograms divided by height in meters squared); systolic and diastolic blood pressure (mm Hg); HDL-cholesterol, LDL-cholesterol, triglycerides (mg/dL), and the total cholesterol: HDL ratio; and Hba1c (%) and fasting plasma glucose (mg/dL). For respondents with missing biomarker values (i.e., triglycerides, LDL cholesterol, and fasting plasma glucose were only collected on a fasting sub-sample of each NHANES cohort), PMM imputation was used to resolve missing values using all available health and demographic variables as predictors (Supplementary Note 5, Table S16). We also assessed health conditions, including the presence of metabolic syndrome, diabetes, cardiovascular disease (CVD), cancer, and lung disease; as well as optimal cardiometabolic health, characterized by optimal levels of adiposity, blood glucose, blood lipids, blood pressure, and absence of prior clinical CVD (Table S3). Missing values for self-reported health conditions and medications were assumed to be normal (i.e., no prevalent condition or medication use).
All-cause and cause-specific mortality
NHANES survey data were linked to the National Death Index (NDI) mortality data from the Public-Use Linked Mortality Files54, allowing prospective assessment of mortality through Jan 1, 2019. The NDI captures ~97% of deaths when social security numbers are available, which is the case for all eligible WWEIA respondents linked to the NDI55,56. NCHS staff screen respondents from NHANES for identifying information such as social security number, name, date of birth, and state of residence, and use probabilistic matching methods to link these respondents to records in the NDI. Deaths were adjudicated by the NCHS using standardized procedures. Our primary endpoint was mortality from all causes with secondary outcomes of deaths from cardiometabolic disease (sum of deaths from coronary heart disease, stroke, and diabetes) and from all cancer.
Statistical analysis
The relationship of i.FCS with HEI 2015 was assessed using Spearman correlation. To assess relationships with clinical risk factors, prevalent health conditions, and mortality outcomes, i.FCS was evaluated per each standard deviation (SD) unit. We used survey-weighted linear regressions with robust standard errors to estimate the association of i.FCS with continuous risk factors; and survey-weighted logistic regression to estimate the odds ratio (OR) and 95% confidence interval (CI) for the association with prevalent health conditions. Each FCS domain was similarly evaluated. Multivariable models were used to minimize potential confounding, with covariates included based on clinical and biological relevance to both exposures and outcomes, including age and age-squared (continuous), sex, race/ethnicity (non-Hispanic White, non-Hispanic Black, Mexican-American, Asian/other), education (< high school graduate, high school graduate, some college education, college graduate or more), income (based on family poverty: income ratio, continuous), smoking status (non-smoker, former, current), alcohol use (percentage energy contribution from alcohol, continuous), total physical activity (MET-hours per week, continuous), and self-reported prevalent diabetes, which was identified as being significantly cross-sectionally associated with dietary habits to reduce added sugar. Further adjustment for self-reported CVD and cancer had no appreciable effect on results. In sensitivity analyses, we further adjusted for total energy intake, which could be a confounder or mediator of the relationship between i.FCS and these outcomes, and for NHANES survey cycle as an indicator variable.
Survival analyses used survey-weighted Cox proportional hazard models to estimate the hazard ratio (HR) and 95% CI between i.FCS and risk of all-cause and cause-specific (cardiometabolic and cancer) mortality. Study time was used as the time variable. Participants contributed person-time from the time of their enrollment until death or end-of-follow-up on January 1, 2019, whichever came first. We censored (e.g., removed as events) early deaths within 3 months of baseline, to minimize reverse causation. The proportional hazards assumption was assessed with visual Kaplan-Meier plots and statistical testing of Schoenfeld residuals (and not rejected if p > 0.05), adjusted for age and age-squared (continuous). Multivariable models were adjusted as described for the cross-sectional analyses. In addition, a “mediator” model was assessed in which each of the following could be either confounders or mediators (i.e., in the downstream causal pathway) of the relationship between i.FCS and mortality: major risk factors (BMI, SBP, DBP, HDL, LDL, TG, TC:HDL ratio, HbA1c, FPG; all continuous), self-reported health conditions (CVD, cancer, lung disease), and medications (for diabetes, hypertension, lipid, and angina). In sensitivity analyses, we additionally censored deaths at older ages (85 + years), and further adjusted for total energy intake and NHANES survey cycle as an indicator category. We also assessed potential nonlinear relationships using semi-parametric restricted cubic splines, excluding the top and bottom 1% of i.FCS values and using the 5th percentile of i.FCS as the reference. In exploratory analyses, we investigated the relationship between i.FCS and total mortality in subgroups by age, sex, race/ethnicity, education, income, BMI, and plausibility of energy reporting to assess potential variation (interaction) in the association according to these key sociodemographic factors.
All analyses incorporated NHANES dietary sample weights to account for the complex survey design and response rates, and provide nationally representative estimates for the non-institutionalized U.S. population.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
The authors are grateful to Lu Wang, PhD (Friedman School of Nutrition Science and Policy, Tufts University) for providing NOVA classification scores for FNDDS products (2001–2018); Mengxi Du, MS RD (Friedman School of Nutrition Science and Policy, Tufts University) for assistance with the smoking variable in NHANES 1999–2018; and Mengyuan Ruan, MS (Mass General Bringham) for assistance with the physical activity variables in NHANES 1999–2018. This research was supported by the National Institutes of Health (2R01HL115189) and Vail Innovative Global Research (grant N316001 PR0677), both awarded to D.M. The funders had no role in the design or conduct of the study; collection, management, analysis, or interpretation of the data; the preparation, review, or approval of the manuscript; or the decision to submit the manuscript for publication.
Author contributions
All authors had full access to all of the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. M.O. and D.M. were responsible for the concept and design of the analysis. M.O., J.E.M., B.N.L., S.G., and D.M. acquired and interpreted the data. M.O. conducted data and statistical analyses. M.O. and D.M. drafted the manuscript. J.E.M., S.G., C.E., J.B.W., J.B.B., and D.M. critically reviewed the manuscript for its intellectual content. D.M. obtained funding and supervised the entire body of work.
Peer review
Peer review information
Nature Communications thanks Mike Rayner and the other anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Data availability
The attribute and domain scoring algorithm used to generate the Food Compass is available in Supplementary Table S2. All data used in this analysis is publicly available from the following USDA and CDC sources: (1) Nutrient composition data for foods reported in NHANES dietary recalls [USDA Food and Nutrient Database for Dietary Studies 2001-2018, FNDDS: https://www.ars.usda.gov/northeast-area/beltsville-md-bhnrc/beltsville-human-nutrition-research-center/food-surveys-research-group/docs/fndds-download-databases/]. (2) Food ingredients data for foods reported in NHANES dietary recalls [USDA Food Pattern Equivalents Database, 2001-2018: https://www.ars.usda.gov/northeast-area/beltsville-md-bhnrc/beltsville-human-nutrition-research-center/food-surveys-research-group/docs/fped-overview/]. (3) Flavonoid data for select foods reported in NHANES dietary recalls [USDA Flavonoid Database, 2007-2010: https://data.nal.usda.gov/dataset/usda-database-flavonoid-content-selected-foods-release-32-november-2015. (4) National dietary recall, sociodemographic, physical activity, smoking, cardiometabolic biomarker, and prevalent condition data for US adults [National Health and Nutrition Examination Survey 1999-2018, NHANES: https://wwwn.cdc.gov/nchs/nhanes/continuousnhanes/default.aspx. (5) All-cause and cause-specific mortality data for US adults [National Death Index 1999–2018, NDI: https://www.cdc.gov/nchs/data-linkage/mortality-public.htm.
Code availability
Custom code was developed using R (Version 4.0.3) and Stata SE (Version 15.1) for data cleaning, adjustments, and merging, algorithm scoring of FCS for individual food and beverage items, i.FCS scoring of individuals, cross-sectional and survival analyses of i.FCS with health outcomes, and summary statistics, tables, and figures. Statistical significance was defined as two-tailed alpha=0.05. Tufts University is considering licensing of expertise, knowledge, and background work for potential commercial viability of Food Compass for the private sector and non-profit applications, and thus the code is not publicly available. There are no intellectual property or patent protections associated with Food Compass. The detailed Food Compass algorithm and it scoring are specified in our earlier Nature Food Publication15, as well as in Table S2 of the current supplementary materials, and can be reproduced freely by any individual or organization.
Competing interests
Ms. O’Hearn reports research funding from the National Institutes of Health and Vail Innovative Global Research. Dr. Erndt-Marino reports income from Bespoke Analytics, LLC, which received research funding from the Florida Department of Citrus, outside the submitted work. Ms. Gerber reports research funding from the National Institute of Food and Agriculture (USDA) and the National Institutes of Health, and additional income from Inova Medical Systems in the Beatty Liver and Obesity Research Program as well as the USDA-Tufts Human Nutrition Research Center on Aging, all outside the submitted work. Ms. Lauren has received personal fees from Abt Associates and the Centers for Disease Control and Prevention, both outside the submitted work. Dr. Economos reports research funding from the United States Department of Agriculture National Institutes of Health, JPB Foundation, and Newman’s Own Foundation. She also reports her position as Vice Chair to National Academies of Science Roundtable on Obesity Solutions (unpaid) and her Advisory Board position at Care/of Scientific. None of the above relate to the manuscript. Dr. Wong reports research funding from the National Institutes of Health and membership in the US Preventive Services Task Force (unpaid) and National Academies of Sciences, Engineering and Medicine Committee on Evaluating the Process to Develop the Dietary Guidelines for Americans, 2020-2025 (unpaid), outside the submitted work. Dr. Blumberg reports being on the scientific advisory board for Bragg Live Food Products, LLC, California Prune Board, California Walnut Commission, Cranberry Institute, Good Pharma, LLC, Guiding Stars Licensing Co., Inside Tracker/Segeterra Inc., and January.ai, also all outside the submitted work. Dr. Mozaffarian reports research funding from the National Institutes of Health, Gates Foundation, Rockefeller Foundation, and Vail Innovative Global Research; and scientific advisory board for Beren Therapeutics, Brightseed, Calibrate, DayTwo (ended 6/2021), Elysium Health, Filtricine, Foodome, HumanCo, January Inc., Perfect Day, Season and Tiny Organics, all outside the submitted work.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-022-34195-8.
References
- 1.El-Abbadi NH, Taylor SF, Micha R, Blumberg JB. Nutrient profiling systems, front of pack labeling, and consumer behavior. Curr. Atherosclerosis Rep. 2020;22:36. doi: 10.1007/s11883-020-00857-5. [DOI] [PubMed] [Google Scholar]
- 2.Labonté M, et al. Nutrient profile models with applications in government-led nutrition policies aimed at health promotion and noncommunicable disease prevention: a systematic review. Adv. Nutr. 2018;9:741–788. doi: 10.1093/advances/nmy045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.McColl K, Lobstein T, Brinsden H, World Health Organization. Regional Office for, E. Nutrient profiling could be used to transform food systems and support health-promoting food policies. Public health Panor. 2017;3:586–597. [Google Scholar]
- 4.O’Hearn, M., Gerber, S., Cruz, S. M., & Mozaffarian, D. The time is ripe for ESG + Nutrition: evidence-based nutrition metrics for Environmental, Social, and Governance (ESG) investing. Eur. J. Clin. Nutr. 76, 1047–1052 (2022). [DOI] [PMC free article] [PubMed]
- 5.Southey, F. 7 European countries team up to propel Nutri-Score rollout. Food Navigator Retrieved from https://www.foodnavigator.com/Article/2021/02/12/7-Euopean-countries-team-up-to-propel-Nutri-Score-rollout# (Accessed April 2021).
- 6.Jones A, Shahid M, Neal B. Uptake of Australia’s Health Star Rating System. Nutrients. 2018;10:997. doi: 10.3390/nu10080997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hannaford Bros. Learn about Guiding Stars. Available at: https://www.hannaford.com/health-wellness/guiding-stars (2020) (Accessed 11 January 2022).
- 8.Walmart. Great For You. Available at: https://corporate.walmart.com/global-responsibility/hunger-nutrition/great-for-you (2021) (Accessed 18 January 2022).
- 9.Kroger. OptUp. Available at: https://www.kroger.com/health/pharmacy/optup (2021) (Accessed 6 January 2022).
- 10.Danone joins actors across food value chain and civil society to call on the EU to establish mandatory Nutri-Score labelling across all countries. [Press release]. Retrieved from https://www.danone.com/content/dam/danone-corp/danone-com/medias/medias-en/2020/localnews/danone-nutri-score-en.pdf (2020) (Accessed January 2022).
- 11.Access to Nutrition Index Available at: https://accesstonutrition.org/ (Accessed May 2021).
- 12.N4G Investor Pledge. Access to Nutrition Initiative. Available at: https://accesstonutrition.org/app/uploads/2021/10/Investor-Pledge-Submitted-20211202-final.pdf (2021) (Accessed January 2022).
- 13.Chantal J, Hercberg S, World Health Organization. Regional Office for, E. Development of a new front-of-pack nutrition label in France: the five-colour Nutri-Score. Public health Panor. 2017;03:712–725. [Google Scholar]
- 14.Australian Department of Health. Health Star Rating system: Calculator and Style Guide, Version 5. Available at: http://www.healthstarrating.gov.au/internet/healthstarrating/publishing.nsf/Content/guide-for-industry (2021) (Accessed October 2021).
- 15.Mozaffarian D, et al. Food Compass is a nutrient profiling system using expanded characteristics for assessing healthfulness of foods. Nat. Food. 2021;2:809–818. doi: 10.1038/s43016-021-00381-y. [DOI] [PubMed] [Google Scholar]
- 16.Schwingshackl L, Bogensberger B, Hoffmann G. Diet quality as assessed by the healthy eating index, alternate healthy eating index, dietary approaches to stop hypertension score, and health outcomes: an updated systematic review and meta-analysis of cohort studies. J. Acad. Nutr. Diet. 2018;118:74–100.e111. doi: 10.1016/j.jand.2017.08.024. [DOI] [PubMed] [Google Scholar]
- 17.Steck SE, Guinter M, Zheng J, Thomson CA. Index-based dietary patterns and colorectal cancer risk: a systematic review. Adv. Nutr. 2015;6:763–773. doi: 10.3945/an.115.009746. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Shan Z, et al. Association between healthy eating patterns and risk of cardiovascular disease. JAMA Intern Med. 2020;180:1090–1100. doi: 10.1001/jamainternmed.2020.2176. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Panizza, C. E. et al. Testing the predictive validity of the Healthy Eating Index-2015 in the Multiethnic Cohort: is the score associated with a reduced risk of all-cause and cause-specific mortality? Nutrients 10. 10.3390/nu10040452 (2018). [DOI] [PMC free article] [PubMed]
- 20.Schwingshackl L, et al. Total dietary fat intake, fat quality, and health outcomes: a scoping review of systematic reviews of prospective studies. Ann. Nutr. Metab. 2021;77:4–15. doi: 10.1159/000515058. [DOI] [PubMed] [Google Scholar]
- 21.Sawicki CM, et al. Comparison of indices of carbohydrate quality and food sources of dietary fiber on longitudinal changes in waist circumference in the framingham offspring cohort. Nutrients. 2021;13:997. doi: 10.3390/nu13030997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.AlEssa HB, et al. Carbohydrate quality and quantity and risk of type 2 diabetes in US women. Am. J. Clin. Nutr. 2015;102:1543–1553. doi: 10.3945/ajcn.115.116558. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Neal B, et al. Effect of salt substitution on cardiovascular events and death. N. Engl. J. Med. 2021;385:1067–1077. doi: 10.1056/NEJMoa2105675. [DOI] [PubMed] [Google Scholar]
- 24.Miller, V. et al. Evaluation of the Quality of Evidence of the Association of Foods and Nutrients With Cardiovascular Disease and Diabetes: A Systematic Review. JAMA Network Open.5, e2146705-e2146705 (2022). [DOI] [PMC free article] [PubMed]
- 25.WCRF/AICR Systematic Literature Review Continuous Update Project Report: The Associations between Food, Nutrition and Physical Activity and the Risk of Colorectal Cancer. World Cancer Research Fund/American Institute for Cancer Research. Available at: https://www.wcrf.org/wp-content/uploads/2021/02/Colorectal-cancer-report.pdf (2011) (Accessed December 2021).
- 26.Hall KD, et al. Ultra-processed diets cause excess calorie intake and weight gain: an inpatient randomized controlled trial of ad libitum food intake. Cell Metab. 2019;30:67–77.e63. doi: 10.1016/j.cmet.2019.05.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Tian, S. et al. Dietary protein consumption and the risk of type 2 diabetes: a systematic review and meta-analysis of cohort studies. Nutrients9, 982 (2017). [DOI] [PMC free article] [PubMed]
- 28.Charidemou, E., et al. A randomized 3-way crossover study indicates that high-protein feeding induces de novo lipogenesis in healthy humans. JCI Insight 4, e124819. 10.1172/jci.insight.124819 (2019). [DOI] [PMC free article] [PubMed]
- 29.Liu J, Micha R, Li Y, Mozaffarian D. Trends in food sources and diet quality among US children and adults, 2003–2018. JAMA Netw. Open. 2021;4:e215262–e215262. doi: 10.1001/jamanetworkopen.2021.5262. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Rehm CD, Penalvo JL, Afshin A, Mozaffarian D. Dietary intake among US adults, 1999-2012. JAMA. 2016;315:2542–2553. doi: 10.1001/jama.2016.7491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Federal Register. A Proposed Rule by the Food and Drug Administration on 09/29/2022. Available at: https://www.federalregister.gov/documents/2022/09/29/2022-20975/food-labeling-nutrient-content-claims-definition-of-term-healthy (2022).
- 32.International Agency for Resarch on Cancer. Nutri-Score: Harmonized and mandatory front-of-pack nutrition label urgently needed at the European Union level and beyond [Press release]. Retrieved from https://www.iarc.who.int/wp-content/uploads/2021/09/pr301_E.pdf (2021) (Accessed January 2022).
- 33.Reyes M, et al. Development of the Chilean front-of-package food warning label. BMC Public Health. 2019;19:906. doi: 10.1186/s12889-019-7118-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Olsho LE, Klerman JA, Wilde PE, Bartlett S. Financial incentives increase fruit and vegetable intake among Supplemental Nutrition Assistance Program participants: a randomized controlled trial of the USDA Healthy Incentives Pilot. Am. J. Clin. Nutr. 2016;104:423–435. doi: 10.3945/ajcn.115.129320. [DOI] [PubMed] [Google Scholar]
- 35.John Hancock. John Hancock Vitality program. Available at: https://www.johnhancockvitality.com/ (2018) (Accessed 17 April 2020).
- 36.Unilever. Positive nutrition: Strategies and goals. Available at: https://www.unilever.com/planet-and-society/positive-nutrition/strategy-and-goals/ (2022) (Accessed January 2022).
- 37.PepsiCo. PepsiCo Positive: planet and people. Available at: https://www.pepsico.com/pepsicopositive (2022) (Accessed January 2022).
- 38.Mozaffarian, D. Perspective: Obesity—an unexplained epidemic. Amer. J. Clin. Nutr. 10.1093/ajcn/nqac075 (2022). [DOI] [PMC free article] [PubMed]
- 39.Streppel MT, de Groot LC, Feskens EJ. Nutrient-rich foods in relation to various measures of anthropometry. Fam. Pract. 2012;29:i36–i43. doi: 10.1093/fampra/cmr093. [DOI] [PubMed] [Google Scholar]
- 40.Streppel MT, et al. Nutrient-rich foods, cardiovascular diseases and all-cause mortality: the Rotterdam study. Eur. J. Clin. Nutr. 2014;68:741–747. doi: 10.1038/ejcn.2014.35. [DOI] [PubMed] [Google Scholar]
- 41.Adriouch S, et al. Prospective association between a dietary quality index based on a nutrient profiling system and cardiovascular disease risk. Eur. J. Preventive Cardiol. 2020;23:1669–1676. doi: 10.1177/2047487316640659. [DOI] [PubMed] [Google Scholar]
- 42.Adriouch S, et al. Association between a dietary quality index based on the food standard agency nutrient profiling system and cardiovascular disease risk among French adults. Int J. Cardiol. 2017;234:22–27. doi: 10.1016/j.ijcard.2017.02.092. [DOI] [PubMed] [Google Scholar]
- 43.Julia C, et al. Prospective associations between a dietary index based on the British Food Standard Agency nutrient profiling system and 13-year weight gain in the SU.VI.MAX cohort. Prev. Med. 2015;81:189–194. doi: 10.1016/j.ypmed.2015.08.022. [DOI] [PubMed] [Google Scholar]
- 44.Julia C, et al. The nutrient profile of foods consumed using the British Food Standards Agency nutrient profiling system is associated with metabolic syndrome in the SU.VI.MAX Cohort. J. Nutr. 2015;145:2355–2361. doi: 10.3945/jn.115.213629. [DOI] [PubMed] [Google Scholar]
- 45.Chiuve SE, Sampson L, Willett WC. The association between a nutritional quality index and risk of chronic disease. Am. J. Prev. Med. 2011;40:505–513. doi: 10.1016/j.amepre.2010.11.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Deschasaux M, et al. Nutritional quality of food as represented by the FSAm-NPS nutrient profiling system underlying the Nutri-Score label and cancer risk in Europe: Results from the EPIC prospective cohort study. PLoS Med. 2018;15:e1002651. doi: 10.1371/journal.pmed.1002651. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Gómez-Donoso C, et al. Association between the nutrient profile system underpinning the Nutri-Score front-of-pack nutrition label and mortality in the SUN project: a prospective cohort study. Clin. Nutr. 2021;40:1085–1094. doi: 10.1016/j.clnu.2020.07.008. [DOI] [PubMed] [Google Scholar]
- 48.Pan, X-F, et al. Seventeen-year associations between diet quality defined by the health star rating and mortality in Australians: The Australian Diabetes, Obesity and Lifestyle Study (AusDiab). Curr. Dev. Nutr.4. 10.1093/cdn/nzaa157 (2020). [DOI] [PMC free article] [PubMed]
- 49.Willett, W. Nutritional Epidemiology: Nature of Variation in Diet Ch. 3, 3rd edn. (Oxford University Press, 2013).
- 50.Moshfegh AJ, et al. The US Department of Agriculture Automated Multiple-Pass Method reduces bias in the collection of energy intakes. Am. J. Clin. Nutr. 2008;88:324–332. doi: 10.1093/ajcn/88.2.324. [DOI] [PubMed] [Google Scholar]
- 51.Centers for Disease Control and Prevention. National Health and Nutrition Examination Survey. Available at: http://www.cdc.gov/nchs/nhanes.htm (2013) (Accessed 15 April 2020).
- 52.Heitjan DF, Rubin DB. Inference from coarse data via multiple imputation with application to age heaping. J. Am. Stat. Assoc. 1990;85:304–314. doi: 10.1080/01621459.1990.10476202. [DOI] [Google Scholar]
- 53.Schenker N, Taylor JMG. Partially parametric techniques for multiple imputation. Comput. Stat. Data Anal. 1996;22:425–446. doi: 10.1016/0167-9473(95)00057-7. [DOI] [Google Scholar]
- 54.National Center for Health Statistics Centers for Disease Control and Prevention. 2015 Public-Use Linked Mortality Files. Available at: https://www.cdc.gov/nchs/data-linkage/mortality-public.htm (2020) (Accessed April 2021).
- 55.Hermansen SW, Leitzmann MF, Schatzkin A. The impact on National Death Index ascertainment of limiting submissions to Social Security Administration Death Master File matches in epidemiologic studies of mortality. Am. J. Epidemiol. 2009;169:901–908. doi: 10.1093/aje/kwn404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Cowper DC, Kubal JD, Maynard C, Hynes DM. A primer and comparative review of major US mortality databases. Ann. Epidemiol. 2002;12:462–468. doi: 10.1016/S1047-2797(01)00285-X. [DOI] [PubMed] [Google Scholar]
- 57.Murakami K, Livingstone MB. Prevalence and characteristics of misreporting of energy intake in US adults: NHANES 2003-2012. Br. J. Nutr. 2015;114:1294–1303. doi: 10.1017/S0007114515002706. [DOI] [PubMed] [Google Scholar]
- 58.Grundy SM, et al. Diagnosis and management of the metabolic syndrome. Circulation. 2005;112:2735–2752. doi: 10.1161/CIRCULATIONAHA.105.169404. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The attribute and domain scoring algorithm used to generate the Food Compass is available in Supplementary Table S2. All data used in this analysis is publicly available from the following USDA and CDC sources: (1) Nutrient composition data for foods reported in NHANES dietary recalls [USDA Food and Nutrient Database for Dietary Studies 2001-2018, FNDDS: https://www.ars.usda.gov/northeast-area/beltsville-md-bhnrc/beltsville-human-nutrition-research-center/food-surveys-research-group/docs/fndds-download-databases/]. (2) Food ingredients data for foods reported in NHANES dietary recalls [USDA Food Pattern Equivalents Database, 2001-2018: https://www.ars.usda.gov/northeast-area/beltsville-md-bhnrc/beltsville-human-nutrition-research-center/food-surveys-research-group/docs/fped-overview/]. (3) Flavonoid data for select foods reported in NHANES dietary recalls [USDA Flavonoid Database, 2007-2010: https://data.nal.usda.gov/dataset/usda-database-flavonoid-content-selected-foods-release-32-november-2015. (4) National dietary recall, sociodemographic, physical activity, smoking, cardiometabolic biomarker, and prevalent condition data for US adults [National Health and Nutrition Examination Survey 1999-2018, NHANES: https://wwwn.cdc.gov/nchs/nhanes/continuousnhanes/default.aspx. (5) All-cause and cause-specific mortality data for US adults [National Death Index 1999–2018, NDI: https://www.cdc.gov/nchs/data-linkage/mortality-public.htm.
Custom code was developed using R (Version 4.0.3) and Stata SE (Version 15.1) for data cleaning, adjustments, and merging, algorithm scoring of FCS for individual food and beverage items, i.FCS scoring of individuals, cross-sectional and survival analyses of i.FCS with health outcomes, and summary statistics, tables, and figures. Statistical significance was defined as two-tailed alpha=0.05. Tufts University is considering licensing of expertise, knowledge, and background work for potential commercial viability of Food Compass for the private sector and non-profit applications, and thus the code is not publicly available. There are no intellectual property or patent protections associated with Food Compass. The detailed Food Compass algorithm and it scoring are specified in our earlier Nature Food Publication15, as well as in Table S2 of the current supplementary materials, and can be reproduced freely by any individual or organization.



