Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Aug 1.
Published in final edited form as: Nat Med. 2023 Feb;29(2):283–286. doi: 10.1038/s41591-022-02171-w

American Life in Realtime: a benchmark registry of health data for equitable precision health

Ritika R Chaturvedi 1,, Marco Angrisani 2, Wendy M Troxel 3, Tania Gutsche 2, Eva Ortega 2, Monika Jain 4, Adrien Boch 4, Arie Kapteyn 2
PMCID: PMC9974879  NIHMSID: NIHMS1868377  PMID: 36737671

Social, structural, and environmental determinants of health, like food or housing insecurity, systemic racism, or chronic stress, account for 60-80% of the modifiable risk of disparities in marginalized populations.1,2 Such determinants have been difficult to systematically address due to their complexity, multidimensionality, and heterogeneity (Fig 1). Emerging precision health methods use large-scale person-generated health data from smartphones and wearables to better characterize and ultimately improve health and wellbeing through strategies customized to individual context and need.3,4 Applying artificial intelligence and machine learning to person-generated health data allows for unprecedented assessment of recursive, networked, and latent associations between everyday life and health, including social, structural, and environmental exposures, behaviors, biometrics, and health outcomes. As such, precision health provides an important opportunity to reduce health disparities amongst minoritized racial or ethnic groups, or those who are under-resourced.

Figure 1.

Figure 1.

A framework for social, structural, and environmental determinants of health.

Despite the potential to improve health equity, the research community lacks benchmark training datasets of person-generated health data, limiting the ability to develop precision health models that are equally effective across diverse populations. Both validity and generalizability of an artificial intelligence or machine learning system are intrinsically tied to the underlying training data. The ideal benchmark dataset should feature high quality, well-characterized data that comprehensively represent the target population in order to instill the highest standards of scientific transparency and rigor to model development, validation, and evaluation.

Person-generated health data cohorts within the National Institutes of Health’s All of Us, UK Biobank, the Framingham Heart Study, and the majority of commercial studies rely on convenience sampling and/or bring-your-own-device designs. Consequently, those who lack access to digital technologies, who tend to be older, Black, Latino, Indigenous, poorer, and sicker, are systematically underrepresented.5,6 The National Health and Nutrition Examination Survey is representative, but employs a cross-sectional design and a 1-week accelerometer measurement period, which limits the ability to assess temporal effects or account for seasonality. The absence of a benchmark dataset risks introducing systemic bias, exacerbating health disparities, and causing additional patient harm in already marginalized groups.7,8

To bridge this critical gap, we created American Life in Realtime (ALiR), a publicly available benchmark registry and research infrastructure for person-generated health data. ALiR has four primary objectives that advance equitable precision health: promoting inclusive representation; encouraging methodological rigor in artificial intelligence and machine learning; fostering interdisciplinary collaboration and transparency; and, facilitating comprehensive exploration of the dynamic interplay between everyday life and health. We highlight several design choices to achieve these objectives, as well as precision health uses for the data and infrastructure.

We employed several strategies to ensure that the enrolled ALiR cohort (n=1,038) would achieve inclusive representation of the adult U.S. population across demographic, socioeconomic, and health factors.9 Participants were invited from the Understanding America Study,10 an established, probability-based survey panel whose members are randomly sampled from all U.S. addresses. To reduce digital inclusion barriers, we provided a Fitbit Inspire 2 to all participants as a study incentive and a 4G Samsung Galaxy Tablet to those who would not otherwise have internet access. We designed the participant study app to be compatible with a wide range of mobile devices and operating systems and maintained a helpdesk for technical support. As factors beyond digital inclusion, like mistrust or privacy concerns, could lower participation among historically marginalized groups, we also oversampled individuals who were Black, American Indian, Alaska Native, Hawaiian, Pacific Islander, mixed race, Hispanic or Latino, and people whose education was lower than a bachelor’s degree.

These ALiR features offer several advantages. Probability sampling improves accuracy and validity of population-level inference such as generalizable predictions of health outcomes in response to population-level stressors like current events, systemic racism, natural disasters, or surges in SARS-CoV-2 cases. Hardware provisioning, in our experience, eliminated sociodemographic disparities in participation rates. Oversampling resulted in a proportionally larger sample of historically underrepresented and marginalized populations,* providing statistical power to detect subgroup-specific differences, such as heterogeneity in outcomes experienced by Black and Latino individuals.

To encourage artificial intelligence and machine learning methodological rigor, we designed a comprehensive data collection strategy with validated, longitudinal measures (labels) of participant exposures, behaviors, and outcomes over long time-scales (Table 1). Measures are derived from high-quality consensus instruments such as University of Michigan’s Health and Retirement Study, Patient Reported Outcomes Measurement Information System, and the National Institutes of Health’s Consensus Measures on Phenotypes and eXposures. A custom mobile app facilitates Fitbit integration, fields electronic surveys, deploys push notification announcements and reminders, and incentivizes long-term engagement through earned points, which are redeemable as monetary compensation.

Table 1:

ALiR’s Individual-level Data.

Electronic surveys
(repeated every 1-2 years)
Electronic surveys
(repeated every 13 months)
Fitbit
(continuous)
Public Datasets
(continuous)
Physical Environment Location Zip Code / Census Region Weather
Urbanicity Air / Water Quality
Neighborhood Features / Built Environment Perceived Safety Crime
Transportation / Walkability Police Presence
Housing Type / Characteristics
Social Environment Network Social Network Household
Marital Status / Family Structure
Social Support (PROMIS)
Context Culture / Religiosity / Politics Hope (PhenX)
Stressors Racism and Discrimination Loneliness
Childhood Trauma (ACEs) Stress (PSS-4)
Economic Environment Education and Competencies Education Fitbit Engagement
Financial Literacy Fitbit Proficiency
Digital Literacy
Health Literacy
Work and Finance Wealth / Debt (HRS) Labor Force Status / Occupation
Financial Wellbeing / Readiness Income
Housing Stability Food Security (USDA)
Healthcare Insurance Access
Quality / Satisfaction
Intrinsic Factors Demographics and Personality Personality (Big-5) Age
Decision Making Race / Ethnicity
Risk Tolerance Sex / Gender Identity
Mood / Affect Sexual Orientation
Life Satisfaction Citizenship
Immigration Generation
Birthplace
Primary Language
Health Motivations
Health Behaviors Exercise Physical Activity (HRS) Active / Sedentary Time
Step Count
Distance Covered
Walking Speed
Diet Diet (BRFSS)
Sugar Consumption
Sleep Sleep Hygiene (PROMIS) Time in Bed
Sleep / Wake Onset
Napping
Substance Use Drugs / Alcohol (ASSIST)
Healthcare Prevention / Screening (HRS) Utilization (HRS)
Medical Mistrust (PhenX)
Vaccine Hesitancy (PhenX)
Health Outcomes Health Status Self-reported Health (HRS) General Health (BRFSS)
Physical Health Chronic Condition Diagnoses (HRS) Functional Ability (iADL / ADL) Physical Health
Pregnancy (PhenX) Body Mass Index Resting / Active Heart Rate
Influenza-like Illness (PhenX) Circadian Rhythm
Sleep Health Sleep Disorders (HRS) Sleep Disturbance (PROMIS) Sleep Duration
Sleep Efficiency / Latency
Sleep Stages
Mental Health Mental Illness Diagnoses (HRS) Depression (PHQ-8)
Anxiety (GAD-7)
Substance Abuse (ASSIST)
Cognitive Health Cognitive Status / Ability
Episodic Memory Test Scores
Mental Function Test Scores

Note that this list is non-exhaustive. Core Understanding America Study survey measures, repeated quarterly for sociodemographics and at least biennially for other variables, are available from 2014. For ALiR, survey measures are repeated monthly and minute-level data from wrist-worn Fitbits are collected continuously. ALiR activities occur for at least 1 year, though participants may subsequently choose to contribute indefinitely. Contextual data from public datasets are matched geospatially and temporally to participant timelines. Specific instruments are in parentheses. HRS = University of Michigan’s Health and Retirement Study; PROMIS = Patient Reported Outcomes Measurement Information System; PhenX = National Institutes of Health’s Consensus Measures for Phenotypes and eXposures. ACEs = Adverse Childhood Events Scale; PSS-4 = Perceived Stress Scale-4; USDA = U.S. Department of Agriculture; (i)ADL = (instrumental) Activities of Daily Living; ASSIST = World Health Organization Alcohol, Smoking, and Substance Involvement Screening Test; BRFSS = Behavioral Risk Factor Surveillance System; GAD-7 = General Anxiety Disorder-7; PHQ-8 = Patient Health Questionnaire-8.

We created a flexible and sustainable infrastructure to foster growth, agility, collaboration, and transparency. ALiR is both a data resource and research platform, which can encourage agile and cost-efficient community collaboration. Data collected through ALiR will be available to registered users through the Understanding America Study website upon curation of each cohort-year of data, with year 1 anticipated in mid-2023. With appropriate privacy and data security safeguards, freely available data encourages transparency, reproducibility, and explainability of outputs from statistical, artificial intelligence, and machine learning analyses.

The research platform can accommodate additional features, including: participants; application programming interfaces from wearables, medical devices, Internet of Things, genomics and biomarkers; survey designs including preloaded information, randomization, experiments, and ecological momentary assessments; and interactive communications like notifications and visual dashboards. ALiR seeks to achieve a truly large-scale sample through leveraging the ongoing expansion of the Understanding America Study, which is expected to reach 20,000 participants by 2025, and by incorporating special populations like those with specific diseases, which are housed at academic and industry partners. Accordingly, the code will be open-sourced to encourage harmonizable data collection efforts by others.

The ultimate aim of ALiR is to facilitate broad, multidisciplinary, and equitable investigation of precision health. Engineers may leverage the platform to test performance of new hardware or sensors in diverse populations. Methodologists may characterize factors that drive or amplify selection biases, such as social and structural patterning of study participation and attrition, data quality and missingness as well as develop and test solutions to minimize their impact, such as incentive designs and imputation techniques. Social scientists may investigate social determinant clustering and importance in different populations to prioritize public health investments. Behavioral researchers may develop just-in-time interventions where deviations from individual-specific baselines trigger automated nudges and/or suggestions, such as passive detection of influenza-like symptoms via Fitbit data, which triggers a SARS-CoV-2 testing recommendation. Operations researchers may evaluate utility of case worker or health system integrations.

Collectively, ALiR is a model for achieving diversity, equity, inclusion, transparency, and multi-disciplinary collaboration in precision health.

ACKNOWLEDGEMENTS:

We are grateful to the participants who agreed to contribute their time and digital data for research. We thank Evidation Health, Inc. for study app development. This work was funded by a grant from the National Library of Medicine (NLM) at the NIH awarded to Dr. Chaturvedi (R01LM013237). ALiR relies on data from survey(s) administered by the Understanding America Study, which is maintained by the Center for Economic and Social Research (CESR) at the University of Southern California. The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Footnotes

*

Weights are provided to rebalance the sample to match the demographic composition of the general U.S. adult population.

COMPETING INTERESTS:

The authors have no financial, personal, or professional competing interests that have influenced the content of the work described herein.

References

RESOURCES