Abstract
Household consumption is a key measure of well-being in low- and middle-income countries (LMICs) where agriculture remains the primary livelihood, and food represents a substantial share of household total consumption expenditures. This paper introduces a new harmonized dataset of food consumption value by food categories and acquisition sources for 16 sub-Saharan African countries from 2008 to 2021. The dataset is constructed from consumption modules of large-scale, nationally representative household surveys collected by the World Bank and each country’s National Statistical Office. It adds value to these surveys by standardizing indicators, including monetizing non-market consumption, generating food item-level estimates, and making the processing code and record-level microdata publicly available for replication and use by researchers. The dataset facilitates valid cross-country comparisons of food consumption over time and can be merged with other satellite and climate data datasets for additional analysis of the drivers and impacts of food consumption in LMICs. Additionally, an indicator dashboard and visualizations have been created to make the estimates accessible to policymakers and the public.
Subject terms: Agriculture, Economics
Background & Summary
In most low- and middle-income countries (LMICs), household consumption is an important indicator of household wellbeing1–3. For several decades, prior to the COVID-19 pandemic, household consumption in many of these countries increased substantially and global poverty fell4–6. Increases in household consumption have been driven by a host of factors that include rising incomes, globalization, and increased economic integration7–9. In most LMICs, food consumption constitutes the largest share of household total consumption. In sub-Saharan Africa (SSA), household food preferences are also changing due to increased urbanization, education levels, and rising income. Studies have documented a shift away from the consumption of staple foods toward an increasingly diversified diet10,11. These changes have implications for household nutrition and agricultural transformation. Hence, it is important to understand the dynamics of household food consumption across different food items and food sources.
Most studies on household food consumption patterns are limited to a single country7,12–15. One of the constraints to cross-country analysis of food consumption patterns is the lack of comparable datasets. While most countries conduct national household living standard surveys, there are barriers to using these data. For example, most publicly available surveys with consumption modules are inconsistent between countries. Also, formulations of food consumption questions in these surveys are inconsistent across countries and sometimes vary across different survey rounds of the same country. For example, food consumption data collected in the following surveys: the Madagascar Enquête Périodique auprès des Ménages (EPM), the Rwanda Integrated Household Living Conditions Survey (EICV), and the South Africa Income and Expenditure Survey were not disaggregated by source of acquisition. Additionally, some existing datasets only report aggregate food consumption, which leaves little room for micro-level disaggregated information. For instance, the 2011 Senegal Enquête de Suivi de la Pauvreté (ESPS-II) survey did not collect disaggregated crop-level food consumption data. Standardizing food consumption indicators across countries and multiple waves of publicly available data would facilitate analyses of trends in the value of different food categories consumed and their sources.
This paper introduces a newly harmonized dataset of food consumption indicators by food category and acquisition source for 16 SSA countries over the period 2008–2021. The dataset was constructed using publicly available nationally representative household surveys collected by the World Bank’s Living Standards Measurement Study - Integrated Surveys on Agriculture (LSMS-ISA) team and individual country’s National Statistical Offices. Some countries have collected data over multiple rounds, including panel data, following the same household over multiple years. Information can be further aggregated at the household, regional, and country levels. The dataset also includes key household demographic characteristics like gender and age of household head, location of household, household size, and dependents. This dataset contributes to and complements recent efforts to make comparable cross-country socio-economic and agricultural production data for LMICs public and more widely available16–21. These data were originally prepared by the Evans School Policy and Analysis group (EPAR) to provide nutritional insights from changes in food groups consumed, but will be useful for researchers, analysts, and policymakers seeking to understand food consumption trends across multiple countries and years. This paper outlines the data processing steps in constructing the harmonized food consumption dataset, a description of the data structure, and usage recommendations.
Methods
Data sources and sample
We focus on processing and standardizing food consumption data from publicly available surveys, ensuring full compliance with their terms of use. Table 1 presents the list of survey data processed. It includes the name of the survey and the years of data collection. The processing covers 36 surveys collected from 16 SSA countries between 2008 and 2021. Table S1 in the Supplementary Information provides inclusion and exclusion criteria for countries in this harmonized dataset, and Table S2 provides links to the raw data for each included survey.
Table 1.
Source Datasets, Countries, and Years Covered.
| Country | Survey | Waves |
|---|---|---|
| Benin | Enquête Harmonisée sur les Conditions de Vie des Ménages (EHCVM) | 2018/1926 |
| Burkina Faso | Enquête Harmonisée sur les Conditions de Vie des Ménages (EHCVM)a | 2018/1927, 2021/2228 |
| Cote d’Ivoire | Enquête Harmonisée sur les Conditions de Vie des Ménages (EHCVM) | 2018/1929 |
| Ethiopia | Socioeconomic Panel Survey (ESS)b | 2015/1630, 2018/1931, 2021/2232 |
| Ghana | Socioeconomic Panel Survey (SPS) | 2009/1033 |
| Guinea Bissau | Enquête Harmonisée sur les Conditions de Vie des Ménages (EHCVM) | 2018/1934 |
| Kenya | Integrated Household Budget Survey (KIHBS) | 2015/1635 |
| Malawi | Integrated Household Survey (IHS)c | 2010/1136, 201337, 2016/1738, 2019/202039 |
| Mali | Enquête Agricole de Conjoncture Intégrée (EACI) | 201440 |
| Enquête Harmonisée sur les Conditions de Vie des Ménages (EHCVM) | 2018/1941 | |
| Niger | Enquête Harmonisée sur les Conditions de Vie des Ménages (EHCVM) | 2018/1942 |
| Nigeria | General Household Survey (GHS)d | 2010/1243, 2012/1344, 2015/1645, and 2018/1946 |
| Senegal | Enquête Harmonisée sur les Conditions de Vie des Ménages (EHCVM) | 2018/1947 |
| Sierra Leone | Integrated Household Survey (IHS) | 201848 |
| Tanzania | National Panel Survey (NPS)e | 2008/0949, 2010/1150, 2012/1351, 2014/1552, 2019/2053 |
| Togo | Enquête Harmonisée sur les Conditions de Vie des Ménages (EHCVM) | 2018/1954 |
| Uganda | National Panel Survey (UNPS)f | 2009/1055, 2010/1156, 2011/1257, 2013/1458, 2015/1659, 2018/1960, 2019/2061 |
Notes: (a) The two Burkina Faso EHCVM waves are panel datasets; (b) Ethiopia ESS 2018/19 and 2021/22 are panel datasets; (c) The four Malawi IHS waves are panel datasets; (d) The Nigeria GHS 2010/12, 2012/13, and 2015/16 waves are panel datasets, while the 2018/19 wave is a refreshed sample that only tracked a subset of the original panel sample; (e) The Tanzania NPS 2008/09, 2010/11, 2012/13, and 2014/15 waves are panel datasets, the 2019/20 wave is a refreshed sample that only tracks a subset of the original panel sample; (f) The Uganda 2009/10, 2010/11, and 2013/14 waves can be merged into a panel, while the other waves are refreshed and track only a subset of the original sample. All the other surveys are cross-sectional or repeated cross-sectional datasets
The 16 countries included in the dataset had a combined Gross Domestic Product (GDP) of about US$1057 billion in 2021, representing about 54% of the total GDP of SSA for that year22. They also accounted for about 703 million individuals or 59% of the total SSA population. Thus, the data provides a broad representation of the continent, both in terms of GDP and population. Figure 1 below shows a map of the countries included in the dataset, with color shades representing the number of rounds for which data are available. Figure 2 presents the same map but color-codes the number of households captured in each country, ranging from 5003 in Ghana to over 30,000 households in Malawi. Figure 2 reflects the total number of household-level data points across all waves in each country, not necessarily unique as, households may appear multiple times in countries with panel data.
Fig. 1.
Countries covered and the number of rounds of data availability.
Fig. 2.
Number of households captured by country across all surveys.
Data processing
We add value to publicly available consumption data by constructing comparable indicators across countries and years. The consumption modules in these surveys are not standardized; hence, processing the data involves several steps. Figure 3 summarizes these steps.
Fig. 3.
Data processing workflow.
We begin by downloading the raw data files from the websites of the World Bank or the country’s national statistical agencies (Table S2). The raw data consists of questions, enumerated across different years, asking households about the quantity of food consumption from their own production and gifts, as well as the amount and value of food purchased over a period of time. In each survey, we combine several questions from the consumption module to construct estimates of food consumption value by food items and sources of acquisition. Survey questions vary across countries, sometimes even for the same country across waves. Recall period, the food items included in the basket of goods, and how items are enumerated and aggregated can all vary. For example, questions in the Nigeria LSMS-ISA/General Household Survey consumption Wave 3 module use a 7-day recall and ask about quantity and unit by source, and expenditure on purchases only:
How much in total did your household consume of this [ITEM] (quantity)
How much in total did your household consume of this [ITEM] (unit)
How much did your household purchase of [ITEM]? (quantity)
How much did your household purchase of [ITEM]? (unit)
How much did your household spend on [ITEM](NAIRA)
How much of the consumption of [ITEM] comes from purchases? (quantity)
How much of the consumption of [ITEM] comes from purchases? (unit)
How much consumption of [ITEM] came from own production? (quantity)
How much consumption of [ITEM] came from own production? (unit)
How much consumption of [ITEM]came from gift? (quantity)
How much consumption of [ITEM]came from gift? (unit)
In contrast, in the Ghana Socioeconomic Panel Survey (SPS), the recall period is 30 days, the quantity consumed from different sources is reported in the same unit, and respondents were asked to self-estimate and report the monetary value of consumption from own production, purchase, and gifts in Ghana Cedis. Although the Ghana SPS raw data includes households’ self-estimated value of food consumption from own production, and gifts, for consistency, we reconstructed these estimates following the same data processing steps used for other surveys where such monetary valuations were not available:
Quantity of own produce?
Quantity of purchases?
Quantity of gift received?
Quantity of gift given?
Expenditure on own-produced food in the last 30 days? (GH¢)
Expenditure on purchased food in the last 30 days? (GH¢)
Expenditure on gifts received in the last 30 days? (GH¢)
Expenditure on gifts given in the last 30 days? (GH¢)
Unit given?
Number of months used?
Total monthly expenditure on food? (GH¢)
Data cleaning and variable construction
Processing each survey’s raw data involves several steps, including converting local measures into standard weight or volume units, determining unit prices, valuing consumption quantities, and constructing consistent socio-demographic variables.
Local unit name harmonization
Survey respondents can report quantities in standard conventional units (kg, liter, etc.), as well as local non-standard units (mudu, bundles, bags, heaps, etc.), or both. Using conversion files collected by the survey teams, as much as possible, we convert the reported non-standard unit measures to standardized units. When conversion factors are unavailable and converting to conventional units is impossible, we harmonize non-standard units (e.g., we code Bundle, BUNDLE, bundled into bundle) to be able to capture all quantities in the unit price estimation. Missing conversion factors for some crops were imputed using medians for crops with available conversion factors at the lowest geographic level with at least 10 valid observations.
Rationalize food item names
The description of food items consumed varies within country, reflecting local names and sub-crop varieties. For example, respondents in Ghana report consuming kenkey and banku, which are different forms of maize reported in their local names. In Kenya, variants of rice listed include aromatic and non-aromatic, white, brown, and broken rice. We rationalize food item names within the survey by aggregating the different forms of a specific food into a single category. For instance, in the case of Ghana, kenkey and banku, are aggregated into maize.
Monetary valuation of food consumption in local currency units
Next, we estimate the monetary values of the reported quantities in local currencies. In all surveys, the value of purchased consumption is provided in each country’s domestic currency. However, in most surveys, the values of non-purchased consumption (own production and gifts) are not collected. To address this, we estimate item-specific prices, calculated as unit values of food purchases (value of purchases divided by the reported quantities of purchases converted into standard units). For items for which no purchase was reported, and for consumption from own production and gifts, we impute prices by estimating the median purchase price of the same food item-unit combination at the lowest administrative level with at least 10 observations. For example, when observations in the enumeration area (EA) are less than 10, we consider the median purchase price at the next administrative level (e.g., parish in Uganda, local government area in Nigeria, and ward in Tanzania) for which there are at least 10 observations. If there are still fewer than 10 valid unit prices at that level, this process continues until the median purchase price of a food item is determined at the country level. This iterative process allows for the full exploitation of available information to compute local prices for each household and food item-unit combination, rather than relying on a fixed price for every household. After estimating unit prices for each food item, we derive the value of food consumed from own production and gifts by multiplying reported quantities by the estimated unit price of each food item. A limitation of using unit prices to value food consumption instead of market prices is that they do not account for variations within food commodity groups or quality substitution bias, i.e., households or individuals that respond to price increases by switching to lower quality or cheaper produce23. Given the lack of market prices in the surveys, we rely on self-reported unit prices to value food consumption.
Construct and merge selected socio-demographic variables
Each survey collects household demographic information in a household roster module. To improve the usability of the data, we construct household socio-economic and demographic variables. We include variables that capture the number of household members, female-headed households, the age of the household head, and the location of the household. We also generate dependency measures by constructing variables for the number of household members that are children (below 17 years), working age adults (>=18 and <65), female working age adults, male working age adults, and elders (>=65 years). To account for the crop season when data were collected, we construct variables for the day, month, and year the data collection interview was conducted. Survey weights provided in each survey wave are included to ensure the estimates are representative at the country level. Other socio-demographic variables included in the dataset are adult equivalence and households residing in rural areas. Finally, to facilitate spatial mapping, we include a variable that matches the first administrative unit where households reside to the corresponding administrative level, which is a commonly used country shapefile code from GADM. Although, this dataset does not include income variables, using codes in the EPAR repository24, income and other agricultural variables for all LSMS-ISA surveys for Ethiopia ESS, Nigeria GHS, Malawi IHS, Uganda NPS, and Tanzania NPS can be constructed and merged into the dataset using the household identifier variables.
Data standardization across surveys
After the first set of steps to clean and construct consistent variables within each survey, the next stage involves steps to standardize across surveys such that the processed data have the same structure for all countries and survey waves.
Aggregate original food items list into broad categories
The list of food items consumed varies substantially across countries, even across survey waves within a country. This reflects both the heterogeneity in local food consumption across countries and the choices made by the data collection agencies. We aggregate food items into broad categories to allow comparability across countries and waves. The food groups are: bananas and plantains, beef meat, cassava, dairy, eggs, fish and seafood, fruits, groundnuts, lamb and goat meat, maize, meals away from home, millet and sorghum, non-dairy beverages, nuts and seeds, oils and fats, other cereals, other meat, other roots and tubers, pork meat, potato, poultry meat, pulses, rice, spices, sugar, sweets, pastries, sweet potato, vegetables, wheat and yams. We also include a variable capturing a more compact grouping that has the categories: cereals, dairy, eggs, fish and seafood, fruits, vegetables, livestock products, spices, meals away from home, non-dairy beverages, oils and fats, processed foods, pulses, legumes, nuts, and roots and tubers. Table S3 in the Supplementary Information lists the components of each food category. The content of each category differs by country and survey. For example, in Ethiopia, oils and fats consist of butter, ghee, sunflower oil, and processed oils, whereas in Kenya, they include butter, ghee, margarine, groundnut butter, cooking fat, cooking oil, fortified cooking oil, lard, other oils, and fats.
Annualize value of food consumption
Different surveys had different food consumption recall periods. For example, the LSMS-ISA surveys use a 7-day recall period while the Ghana Socio-economic Panel Survey uses a 30-day recall period. In order to standardize food consumption indicators across different survey countries and years, we annualize food consumption estimates. For surveys with a 7-day recall period, we multiply the food consumption value by 52 (the standard number of weeks in a year). For surveys with a 30-day recall period, we multiply the food consumption value by 12 (the number of months in a year). This could present limitations in the data because it ignores seasonal variations in food consumption, which is often very important in agricultural communities. A variable indicating the original recall period for each survey year is included in the dataset to allow users to account for this if necessary.
Adjust monetary values for inflation and convert to 2017 PPP
To further facilitate comparisons across countries and surveys that take place in different years, we convert the nominal values calculated in local currencies into real values expressed in international dollars. Specifically, nominal values are first deflated using the National consumer price index (CPI) and then converted to 2017 Purchasing Power Parity (PPP) values. Both the CPI and PPP conversion factors are sourced from the World Bank’s World Development Indicators database.
Address outliers through winzorisation
To mitigate extreme consumption values that could lead to biased and inefficient statistical estimates, we winsorize the consumption value by replacing the top 1% values with the 99th percentile.
Append individual survey datasets into a combined dataset
Finally, all harmonized and standardized country-wave datasets are combined into a single, unified dataset, which includes all the food consumption indicators.
Data Records
The harmonized dataset described in this article is available in a Figshare repository25: 10.6084/m9.figshare.29874011. It is deposited in two formats: a single Stata file and three csv files.
Combined dataset of food consumption by items
The variables included in the combined dataset are listed in Table 2. We include several identification variables, including codes for administrative divisions, total food consumption value, food consumption value from purchase, food consumption value from own production, and food consumption value from gifts. The food consumption values are presented in nominal (local currency unit) and real (2017 PPP) values. We also include household demographic variables like rural location, age, gender of household head, number of household members, and the number of working and dependent household members.
Table 2.
List of variables and their description.
| Variable name | Variable Description |
|---|---|
| hhid | Household ID |
| weight | Household cross-section weight |
| GID_1 | Adm1 code from the GADM shapefile |
| adm1 | Administrative subdivision 1 |
| adm2 | Administrative subdivision 2 |
| Instrument | Survey name |
| Year | Survey year |
| Country | Country name |
| interview_year | Household survey interview year |
| interview_month | Household survey interview month |
| interview_day | Household survey interview day |
| crop_category1 | Aggregated Food Items 1 |
| crop_category2 | Aggregated Food Items 2 |
| food_consu_value_ppp | Annual value of food consumed, 2017 PPP |
| food_consu_value_lcu | Annual value of food consumed, nominal LCU |
| food_purch_value_ppp | Annual value of food consumed from purchases, 2017 PPP |
| food_purch_value_lcu | Annual value of food consumed from purchased, nominal LCU |
| food_prod_value_ppp | Annual value of food consumed from own production, 2017 PPP |
| food_prod_value_lcu | Annual value of food consumed from own production, nominal LCU |
| food_gift_value_ppp | Annual value of food consumed from gifts, 2017 PPP |
| food_gift_value_lcu | Annual value of food consumed from gifts, nominal LCU |
| w_food_consu_value_ppp | Annual value of food consumed, 2017 PPP - Winzorized top 1% |
| w_food_consu_value_lcu | Annual value of food consumed, nominal LCU - Winzorized top 1% |
| w_food_purch_value_ppp | Annual value of food consumed from purchases, 2017 PPP - Winzorized top 1% |
| w_food_purch_value_lcu | Annual value of food consumed from purchases, nominal LCU - Winzorized top 1% |
| w_food_prod_value_ppp | Annual value of food consumed from own production, 2017 PPP - Winzorized top 1% |
| w_food_prod_value_lcu | Annual value of food consumed from own production, nominal LCU - Winzorized top 1% |
| w_food_gift_value_ppp | Annual value of food consumed from gifts, 2017 PPP - Winzorized top 1% |
| w_food_gift_value_lcu | Annual value of food consumed from gifts, nominal LCU - Winzorized top 1% |
| recall_period | Food consumption recall period in days |
| level_representativeness | Level of sub-national representativeness of the survey |
| conv_lcu_ppp | Conversion Factor |
| age_hh | Age of household head |
| fhh | Gender of household head |
| hh_members | Number of household members |
| adulteq | Adult-Equivalent |
| nadultworking | Number of working age adults in the household |
| nadultworking_female | Number of working age female adults in the household |
| nadultworking_male | Number of working age male adults in the household |
| nchildren | Number of children in the household |
| nelders | Number of elders in the household |
| Rural | Household resides in a rural area |
Data Overview
This section illustrates some of the insights possible through the accompanying interactive Tableau visualization that rests on the harmonized dataset. Figure 4 presents graphs of the mean total per capita consumption and share of food consumption value from different sources. On average, Nigeria had the highest per capita value of food consumption in all years captured compared to the other SSA countries. About 75% of the household value of food consumed is from market purchases (in green), while own production (in yellow) and gifts (in blue) represent 20% and 5%, respectively. The data show substantial variations across countries and over time. Countries in East Africa show a greater tendency to source food from their own production than other regions of sub-Saharan Africa.
Fig. 4.
Mean total value and share of total value of per capita consumption from different sources.
The dataset also allows disaggregation of consumption estimates by food category. In Fig. 5, we illustrate such a disaggregation for two surveys, Ethiopia 2021 and Nigeria 2018. The graph shows Cereals as the dominant food item, accounting for both countries’ highest per capita consumption value. The extent of reliance on market purchases for food acquisition varies by country and food commodity. For example, in Ethiopia, the majority of dairy consumption is from own production (in yellow, 64%), while in Nigeria, the majority is from purchases (in green, 82%). Further disaggregation of food consumption value and the share of food consumption value of various food categories by household location and the gender of the household head is possible.
Fig. 5.
Mean value of consumption of different food categories and share of total value of consumption from different sources by food categories.
Technical Validation
To ensure the reliability and robustness of the harmonized food consumption dataset, we conducted multiple layers of technical validation to ensure the accuracy of data transformations, internal consistency, and statistical coherence across survey waves and countries.
Source data integrity and reproducibility
All input datasets were obtained from publicly available, nationally representative household surveys such as the Living Standards Measurement Study – Integrated Surveys on Agriculture (LSMS-ISA). The LSMS-ISA and other surveys generally undergo rigorous methodological and technical validation before public release. During data collection, the World Bank and the country statistical offices implement a data quality assurance plan to validate data entries. The survey team also utilizes CAPI software, such as Survey Solutions, which flags data inconsistencies and potential errors during fieldwork and enables real-time, high-frequency data quality checks. All data transformations were implemented using reproducible Stata code, and the harmonization workflow is documented and version-controlled.
Consistency and range checks
Internal consistency checks were applied to verify that key variables conformed to expected value ranges and logical relationships. We ensured that food values were expressed in consistent units across datasets, with local units converted to standard metric equivalents where necessary. The indicators were also carefully reviewed by inspecting summary statistics to detect any irregularities, and the code was thoroughly reviewed by at least two co-authors to ensure that any issues were not the result of coding errors.
Outlier detection and treatment
We conducted an outlier analysis in which extreme values were visually examined using box plots and density plots. To mitigate the influence of outliers while preserving the underlying data structure, values in the top 1% of the distribution were winsorized. Summary statistics before and after winsorization were compared to assess the impact of the outlier-handling procedure.
Cross-country and temporal validation
To validate the constructed food consumption indicators and further assess comparability across countries and survey years, we compare the average per capita value of food consumption to Gross Domestic Production per capita for matching survey years from the World Development Indicators database. Trends in consumption patterns over time were examined to ensure that observed changes were consistent with national economic and agricultural dynamics. Figure 6 shows a scatterplot of average per capita food consumption value and GDP per capita. As expected, we see a strong correlation between the two indicators. We also provide a scatterplot of countries’ ranking based on these two indicators (Fig. 7). These rankings are consistent, albeit with some differences, likely due to inherent noise in survey estimates and non-food consumption excluded from our dataset. Both scatterplots have a Pearson correlation coefficient of 0.4, statistically significant at the 5% level, indicating a tendency for higher average per capita food consumption value (consumption rankings) being associated with higher GDP per capita (GDP per capita rankings).
Fig. 6.
Graph showing convergence between the average food consumption value and GDP per capita for each country and year.
Fig. 7.
Graph showing convergence between the rankings of average food consumption value and GDP per capita for each country and year.
Our second validation exercise uses the FAOSTAT apparent food intake data, which reports the quantity of food consumption by food categories in grams per capita per day. The FAOSTAT data include food categories that are different from the categories captured in our consumption dataset. For this reason, we conduct the validation using aggregate consumption at the national level. Also, the apparent food intake data lacks price information to value quantities. Hence, they are not directly comparable to our estimates. Nonetheless, we plot a scatter plot of our estimated average value of food consumption, and FAOSTAT estimated aggregate per capita food intake quantities (Fig. 8). The scatter plot shows a moderate positive correlation between the two measures with a Pearson correlation coefficient of 0.36. This suggests that countries and years would be ranked similarly under either indicator.
Fig. 8.
Graph showing convergence between the average food consumption value and aggregate FAOSTAT apparent food intake quantities for each country and year.
We also rely on the Global Consumption Database (GCD), which includes two overlapping data points (Malawi 2010/11 and Uganda 2009/2010) for an additional validation. The GCD reported annual aggregate national value of household consumption. We convert these values into per capita consumption using population estimates from the World Development Indicators database. For Malawi, the average per capita value of food consumption in our dataset is $672 (2017 PPP), while the estimate based on GCD is $432 (2017 PPP). For Uganda, our estimate is $734 (2017 PPP), while the calculation using the GCD is $593 (2017 PPP). In both cases, our estimates are higher than the GCD estimates. These differences reflect variations in the data sources underlying the two estimates, as well as the use of aggregate national numbers in the GCD estimates as opposed to average per capita estimates in our case.
Usage Notes
The household food consumption estimates constructed in this dataset are based on survey data from various secondary sources (See Table 1). Each survey wave is first processed separately, and identical key variables are generated for each wave, after which data for each wave is appended to construct the harmonized dataset. The harmonized dataset is at the crop level to facilitate consumption comparisons among major food categories. The data can be aggregated to the household level using the unique survey wave and household identifier variables (country_year and hhid) for household-level analysis. Variables identifying the first and second administrative level of household residence were also included in the dataset (adm1 and adm2). To obtain nationally representative estimates and account for complex survey data design, users must use sample weights (weight) as probability weights and strata identifiers to obtain valid standard errors.
Supplementary information
Author contributions
A.P.N.: Conceptualization, Methodology, Data Curation, Formal Analysis, Validation, Visualization, Writing - Original Draft, Review and Editing. D.Y.A.: Conceptualization, Methodology, Data Curation, Formal Analysis, Validation, Visualization, Writing - Review and Editing. A.R.: Visualization, Writing - Review and Editing. C.L.A.: Conceptualization, Writing - Review and Editing, Supervision, Funding Acquisition.
Data availability
The entire dataset is available online in a Figshare repository. The following URL provides access to the dataset25: 10.6084/m9.figshare.29874011.
Code availability
The Stata codes used to process the raw data, generate standardized food consumption indicators, and append different survey waves for each country into a combined dataset are available on GitHub (https://github.com/EvansSchoolPolicyAnalysisAndResearch/Household-Consumption-Data). Each survey data processed has its own Stata dofile. Additionally, we include a dofile used to append individual datasets into a combined dataset. The do files are annotated to describe and guide the user through the different stages of data processing. This ensures full reproducibility of the data harmonization process and enables users to trace each indicator back to its original source. An interactive and more comprehensive visualization of the data is accessible on the Tableau visualization platform. Table S4 in the Supplementary Information lists the names of Stata code files available in the GitHub repository.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version contains supplementary material available at 10.1038/s41597-026-06548-1.
References
- 1.World Bank. Poverty and Shared Prosperity 2018: Piecing Together the Poverty Puzzle. http://hdl.handle.net/10986/30418 (2018).
- 2.Deaton, A. Income, health, and well-being around the world: Evidence from the Gallup World Poll. Journal of Economic perspectives22, 53–72 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Ravallion, M. The Economics of Poverty: History, Measurement, and Policy. (Oxford University Press, 2015).
- 4.Baah, S. K. T. et al. March 2023 global poverty update from the World Bank: the challenge of estimating poverty in the pandemic. Diakses pada9 (2023).
- 5.Kharas, H. & Dooley, M. The Evolution of Global Poverty, 1990–2030: Trends and Projections. (Brookings Institution. United States of America, 2022).
- 6.Debucquet, D. L. & Martin, W. Implications of the global growth slowdown for rural poverty. Agricultural Economics49, 325–338 (2018). [Google Scholar]
- 7.Damari, Y. & Kissinger, M. Quantity-based analysis of household food consumption patterns and drivers: The case of Israel. Appetite127, 373–385 (2018). [DOI] [PubMed] [Google Scholar]
- 8.Otero, G., Gürcan, E. C., Pechlaner, G. & Liberman, G. Food security, obesity, and inequality: Measuring the risk of exposure to the neoliberal diet. Journal of Agrarian Change18, 536–554 (2018). [Google Scholar]
- 9.Regmi, A. & Meade, B. Demand side drivers of global food security. Global Food Security2, 166–171 (2013). [Google Scholar]
- 10.Chen, C., Chaudhary, A. & Mathys, A. Dietary Change and Global Sustainable Development Goals. Front. Sustain. Food Syst. 6 (2022).
- 11.Pingali, P., Aiyar, A., Abraham, M. & Rahman, A. Diet Diversity and the Declining Importance of Staple Grains. in Transforming Food Systems for a Rising India (eds Pingali, P., Aiyar, A., Abraham, M. & Rahman, A.) 73–91, 10.1007/978-3-030-14409-8_4 (Springer International Publishing, Cham, 2019).
- 12.Elzaki, R., Yunus Sisman, M. & Al-Mahish, M. Rural Sudanese household food consumption patterns. Journal of the Saudi Society of Agricultural Sciences20, 58–65 (2021). [Google Scholar]
- 13.Hayat, N., Mustafa, G., Alotaibi, B. A., Nayak, R. K. & Naeem, M. Households food consumption pattern in Pakistan: Evidence from recent household integrated economic survey. Heliyon9 (2023). [DOI] [PMC free article] [PubMed]
- 14.Mekonnen, D. A. et al. Food consumption patterns, nutrient adequacy, and the food systems in Nigeria. Agric Econ9, 16 (2021). [Google Scholar]
- 15.Wolle, A., Hirvonen, K., de Brauw, A., Baye, K. & Abate, G. T. Household Food Consumption Patterns in Addis Ababa, Ethiopia. 10.2499/P15738COLL2.133654 (2020).
- 16.Waha, K., Zipf, B., Kurukulasuriya, P. & Hassan, R. M. An agricultural survey for more than 9,500 African households. Scientific data3, 1–8 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.van Wijk, M. et al. The Rural Household Multiple Indicator Survey, data from 13,310 farm households in 21 countries. Scientific Data7, 46 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Belmin, C., Hoffmann, R., Elkasabi, M. & Pichler, P.-P. LivWell: a sub-national Dataset on the Living Conditions of Women and their Well-being for 52 Countries. Scientific data9, 719 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Chemura, A., Gleixner, S. & Gornott, C. Dataset of the suitability of major food crops in Africa under climate change. Scientific Data11, 294 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Machefer, M. et al. A monthly sub-national Harmonized Food Insecurity Dataset for comprehensive analysis and predictive modeling. Scientific Data12, 741 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Lee, D. et al. HarvestStat Africa–harmonized subnational crop statistics for sub-Saharan Africa. Scientific Data12, 690 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.World Bank. World Development Indicators. World Bank, DataBank, https://databank.worldbank.org/source/world-development-indicators.
- 23.McKelvey, C. Price, unit value, and quality demanded. Journal of Development Economics95, 157–169 (2011). [Google Scholar]
- 24.Evans School Policy Analysis and Research (EPAR). EPAR GitHub Code Repository. GitHub (2025).
- 25.Nnaji, A. P., Alia, D. Y., Raina, A., & Anderson, C. L. Harmonized food consumption dataset by food category and acquisition source for Sub-Saharan African countries figshare, 10.6084/m9.figshare.29874011 (2025). [DOI] [PMC free article] [PubMed]
- 26.Institut National de la Statistique et de l’Analyse Économique (INSAE). Enquête Harmonisée sur le Conditions de Vie des Ménages 2018-2019 [Data set]. World Bank, Development Data Group, 10.48529/rn3k-z374 (2022).
- 27.Institut National de la Statistique et de la Démographie (INSD). Enquête Harmonisée sur le Conditions de Vie des Ménages 2018-2019 [Data set]. World Bank, Development Data Group, 10.48529/wv88-j486 (2022).
- 28.Institut National de la Statistique et de la Démographie (INSD). Enquête Harmonisée sur le Conditions de Vie des Ménages, Panel Survey 2021-2022 [Data set]. World Bank, Development Data Group, https://microdata.worldbank.org/index.php/catalog/6224 (2024).
- 29.Institut National de la Statistique (INS). Enquête Harmonisée sur le Conditions de Vie des Ménages 2018-2019 [Data set]. World Bank, Development Data Group, 10.48529/8wh3-bf40 (2022).
- 30.Central Statistical Agency of Ethiopia. Socioeconomic Survey 2015-2016, Wave 3 [Data set]. World Bank, Development Data Group, 10.48529/ampf-7988 (2017).
- 31.Central Statistics Agency of Ethiopia. Socioeconomic Survey 2018-2019, Wave 4 [Data set]. World Bank, Development Data Group, https://microdata.worldbank.org/index.php/catalog/3823 (2021).
- 32.Ethiopian Statistical Service (ESS). Socioeconomic Panel Survey 2021-2022 Wave 5 [Data set]. World Bank, Development Data Group, https://microdata.worldbank.org/index.php/catalog/6161 (2024).
- 33.Institute of Statistical, Social and Economic Research & Economic Growth Center. Socioeconomic Panel Survey: 2009-2010 [Data set]. World Bank, Development Data Group, 10.48529/851p-b825 (2016).
- 34.Instituto Nacional de Estatística (INE). Inquérito Harmonizado sobre as Condiçöes de vide dos Agreagados Familiares 2018-2019 [Data set]. World Bank, Development Data Group, 10.48529/1ekb-m086 (2022).
- 35.Kenya National Bureau of Statistics. Kenya Integrated Household Budget Survey 2015-2016 [Data set]. Kenya National Data Archive, https://statistics.knbs.or.ke/nada/index.php/catalog/13 (2018).
- 36.National Statistical Office (NSO). Third Integrated Household Survey 2010-2011 [Data set]. World Bank, Development Data Group, 10.48529/w1jq-qh85 (2012).
- 37.National Statistical Office (NSO). Integrated Household Panel Survey 2010-2013 [Data set]. World Bank, Development Data Group, 10.48529/4q9f-2288 (2015).
- 38.National Statistical Office (NSO). Fourth Integrated Household Survey 2016-2017 [Data set]. World Bank, Development Data Group, 10.48529/g2p9-9r19 (2017).
- 39.National Statistical Office (NSO). Fifth Integrated Household Survey 2019-2020 [Data set]. World Bank, Development Data Group, 10.48529/mpyk-ds48 (2021).
- 40.Planning and Statistics Unit, National Institute of Statistics, & National Directorate of Agriculture. 2014 Integrated Agricultural Economic Survey [Data set]. World Bank, Development Data Group, 10.48529/qqam-mn86 (2016).
- 41.Institut National de la Statistique (INSTAT). Enquête Harmonisée sur le Conditions de Vie des Ménages 2018-2019 [Data set]. World Bank, Development Data Group, 10.48529/90e9-4e91 (2022).
- 42.Institut National de la Statistique (INS). Enquête Harmonisée sur le Conditions de Vie des Ménages 2018-2019 [Data set]. World Bank, Development Data Group, 10.48529/ggam-ax39 (2022).
- 43.National Bureau of Statistics (NBS). General Household Survey, Panel 2010-2011, Wave 1 [Data set]. World Bank, Development Data Group, 10.48529/y9e2-b753 (2012).
- 44.National Bureau of Statistics (NBS). General Household Survey, Panel 2012-2013, Wave 2 [Data set]. World Bank, Development Data Group, 10.48529/kxpy-aa72 (2014).
- 45.National Bureau of Statistics (NBS). General Household Survey, Panel 2015-2016, Wave 3 [Data set]. World Bank, Development Data Group, 10.48529/7xmj-q133 (2016).
- 46.National Bureau of Statistics (NBS). General Household Survey, Panel 2018-2019, Wave 4 [Data set]. World Bank, Development Data Group, 10.48529/1hgw-dq47 (2019).
- 47.Agence National de la Statistique et de la Démographie (ANSD). Enquête Harmonisée sur le Conditions de Vie des Ménages 2018-2019 [Data set]. World Bank, Development Data Group, 10.48529/hhhx-j012 (2022).
- 48.Statistics Sierra Leone, & The World Bank. Integrated Household Survey 2018 Dataset. International Household Survey Network (IHSN) Data Catalog, https://catalog.ihsn.org/catalog/9246 (2020).
- 49.National Bureau of Statistics. National Panel Survey 2008-2009, Wave 1 [Data set]. World Bank, Development Data Group, 10.48529/hz8s-3489 (2011).
- 50.National Bureau of Statistics. National Panel Survey 2010-2011, Wave 2 [Data set]. World Bank, Development Data Group, 10.48529/jm20-c742 (2012).
- 51.National Bureau of Statistics. National Panel Survey 2012-2013, Wave 3 [Data set]. World Bank, Development Data Group, 10.48529/7vqv-5f71 (2015).
- 52.National Bureau of Statistics. National Panel Survey 2014-2015, Wave 4 [Data set]. World Bank, Development Data Group, 10.48529/y3qj-d018 (2017).
- 53.World Bank. National Panel Survey 2019-2020 [Data set]. World Bank, Development Data Group, 10.48529/0y7d-1v78 (2021).
- 54.Institut National de la Statistique et des Etudes Economiques et Démographiques (INSEED). Enquête Harmonisée sur le Conditions de Vie des Ménages 2018-2019 [Data set]. World Bank, Development Data Group, 10.48529/ww9z-d865 (2022).
- 55.Uganda Bureau of Statistics (UBOS). National Panel Survey 2005-2009 Wave I [Dataset]. World Bank, Development Data Group, 10.48529/9vep-e461 (2012).
- 56.Uganda Bureau of Statistics (UBOS). National Panel Survey 2010-2011 Wave II [Data set]. World Bank, Development Data Group, 10.48529/w2d8-t244 (2014).
- 57.Uganda Bureau of Statistics (UBOS). National Panel Survey 2011-2012 Wave III [Data set]. World Bank, Development Data Group, 10.48529/wjg5-bg56 (2014).
- 58.Uganda Bureau of Statistics (UBOS). National Panel Survey 2013-2014 [Data set]. World Bank, Development Data Group, https://microdata.worldbank.org/index.php/catalog/2663 (2016).
- 59.Uganda Bureau of Statistics (UBOS). National Panel Survey 2015-2016 [Data set]. World Bank, Development Data Group, https://microdata.worldbank.org/index.php/catalog/3460 (2018).
- 60.Uganda Bureau of Statistics (UBOS). National Panel Survey 2018-2019 [Data set]. World Bank, Development Data Group, https://microdata.worldbank.org/index.php/catalog/3795 (2021).
- 61.Uganda Bureau of Statistics (UBOS). National Panel Survey 2019-2020 [Data set]. World Bank, Development Data Group, 10.48529/nqzx-f196 (2021).
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The entire dataset is available online in a Figshare repository. The following URL provides access to the dataset25: 10.6084/m9.figshare.29874011.
The Stata codes used to process the raw data, generate standardized food consumption indicators, and append different survey waves for each country into a combined dataset are available on GitHub (https://github.com/EvansSchoolPolicyAnalysisAndResearch/Household-Consumption-Data). Each survey data processed has its own Stata dofile. Additionally, we include a dofile used to append individual datasets into a combined dataset. The do files are annotated to describe and guide the user through the different stages of data processing. This ensures full reproducibility of the data harmonization process and enables users to trace each indicator back to its original source. An interactive and more comprehensive visualization of the data is accessible on the Tableau visualization platform. Table S4 in the Supplementary Information lists the names of Stata code files available in the GitHub repository.








