Social, structural, and environmental determinants of health, like food or housing insecurity, systemic racism, or chronic stress, account for 60-80% of the modifiable risk of disparities in marginalized populations.1,2 Such determinants have been difficult to systematically address due to their complexity, multidimensionality, and heterogeneity (Fig 1). Emerging precision health methods use large-scale person-generated health data from smartphones and wearables to better characterize and ultimately improve health and wellbeing through strategies customized to individual context and need.3,4 Applying artificial intelligence and machine learning to person-generated health data allows for unprecedented assessment of recursive, networked, and latent associations between everyday life and health, including social, structural, and environmental exposures, behaviors, biometrics, and health outcomes. As such, precision health provides an important opportunity to reduce health disparities amongst minoritized racial or ethnic groups, or those who are under-resourced.
Despite the potential to improve health equity, the research community lacks benchmark training datasets of person-generated health data, limiting the ability to develop precision health models that are equally effective across diverse populations. Both validity and generalizability of an artificial intelligence or machine learning system are intrinsically tied to the underlying training data. The ideal benchmark dataset should feature high quality, well-characterized data that comprehensively represent the target population in order to instill the highest standards of scientific transparency and rigor to model development, validation, and evaluation.
Person-generated health data cohorts within the National Institutes of Health’s All of Us, UK Biobank, the Framingham Heart Study, and the majority of commercial studies rely on convenience sampling and/or bring-your-own-device designs. Consequently, those who lack access to digital technologies, who tend to be older, Black, Latino, Indigenous, poorer, and sicker, are systematically underrepresented.5,6 The National Health and Nutrition Examination Survey is representative, but employs a cross-sectional design and a 1-week accelerometer measurement period, which limits the ability to assess temporal effects or account for seasonality. The absence of a benchmark dataset risks introducing systemic bias, exacerbating health disparities, and causing additional patient harm in already marginalized groups.7,8
To bridge this critical gap, we created American Life in Realtime (ALiR), a publicly available benchmark registry and research infrastructure for person-generated health data. ALiR has four primary objectives that advance equitable precision health: promoting inclusive representation; encouraging methodological rigor in artificial intelligence and machine learning; fostering interdisciplinary collaboration and transparency; and, facilitating comprehensive exploration of the dynamic interplay between everyday life and health. We highlight several design choices to achieve these objectives, as well as precision health uses for the data and infrastructure.
We employed several strategies to ensure that the enrolled ALiR cohort (n=1,038) would achieve inclusive representation of the adult U.S. population across demographic, socioeconomic, and health factors.9 Participants were invited from the Understanding America Study,10 an established, probability-based survey panel whose members are randomly sampled from all U.S. addresses. To reduce digital inclusion barriers, we provided a Fitbit Inspire 2 to all participants as a study incentive and a 4G Samsung Galaxy Tablet to those who would not otherwise have internet access. We designed the participant study app to be compatible with a wide range of mobile devices and operating systems and maintained a helpdesk for technical support. As factors beyond digital inclusion, like mistrust or privacy concerns, could lower participation among historically marginalized groups, we also oversampled individuals who were Black, American Indian, Alaska Native, Hawaiian, Pacific Islander, mixed race, Hispanic or Latino, and people whose education was lower than a bachelor’s degree.
These ALiR features offer several advantages. Probability sampling improves accuracy and validity of population-level inference such as generalizable predictions of health outcomes in response to population-level stressors like current events, systemic racism, natural disasters, or surges in SARS-CoV-2 cases. Hardware provisioning, in our experience, eliminated sociodemographic disparities in participation rates. Oversampling resulted in a proportionally larger sample of historically underrepresented and marginalized populations,* providing statistical power to detect subgroup-specific differences, such as heterogeneity in outcomes experienced by Black and Latino individuals.
To encourage artificial intelligence and machine learning methodological rigor, we designed a comprehensive data collection strategy with validated, longitudinal measures (labels) of participant exposures, behaviors, and outcomes over long time-scales (Table 1). Measures are derived from high-quality consensus instruments such as University of Michigan’s Health and Retirement Study, Patient Reported Outcomes Measurement Information System, and the National Institutes of Health’s Consensus Measures on Phenotypes and eXposures. A custom mobile app facilitates Fitbit integration, fields electronic surveys, deploys push notification announcements and reminders, and incentivizes long-term engagement through earned points, which are redeemable as monetary compensation.
Table 1:
Electronic surveys (repeated every 1-2 years) |
Electronic surveys (repeated every 13 months) |
Fitbit (continuous) |
Public Datasets (continuous) |
||
---|---|---|---|---|---|
Physical Environment | Location | Zip Code / Census Region | Weather | ||
Urbanicity | Air / Water Quality | ||||
Neighborhood | Features / Built Environment | Perceived Safety | Crime | ||
Transportation / Walkability | Police Presence | ||||
Housing | Type / Characteristics | ||||
Social Environment | Network | Social Network | Household | ||
Marital Status / Family Structure | |||||
Social Support (PROMIS) | |||||
Context | Culture / Religiosity / Politics | Hope (PhenX) | |||
Stressors | Racism and Discrimination | Loneliness | |||
Childhood Trauma (ACEs) | Stress (PSS-4) | ||||
Economic Environment | Education and Competencies | Education | Fitbit Engagement | ||
Financial Literacy | Fitbit Proficiency | ||||
Digital Literacy | |||||
Health Literacy | |||||
Work and Finance | Wealth / Debt (HRS) | Labor Force Status / Occupation | |||
Financial Wellbeing / Readiness | Income | ||||
Housing Stability | Food Security (USDA) | ||||
Healthcare | Insurance | Access | |||
Quality / Satisfaction | |||||
Intrinsic Factors | Demographics and Personality | Personality (Big-5) | Age | ||
Decision Making | Race / Ethnicity | ||||
Risk Tolerance | Sex / Gender Identity | ||||
Mood / Affect | Sexual Orientation | ||||
Life Satisfaction | Citizenship | ||||
Immigration Generation | |||||
Birthplace | |||||
Primary Language | |||||
Health Motivations | |||||
Health Behaviors | Exercise | Physical Activity (HRS) | Active / Sedentary Time | ||
Step Count | |||||
Distance Covered | |||||
Walking Speed | |||||
Diet | Diet (BRFSS) | ||||
Sugar Consumption | |||||
Sleep | Sleep Hygiene (PROMIS) | Time in Bed | |||
Sleep / Wake Onset | |||||
Napping | |||||
Substance Use | Drugs / Alcohol (ASSIST) | ||||
Healthcare | Prevention / Screening (HRS) | Utilization (HRS) | |||
Medical Mistrust (PhenX) | |||||
Vaccine Hesitancy (PhenX) | |||||
Health Outcomes | Health Status | Self-reported Health (HRS) | General Health (BRFSS) | ||
Physical Health | Chronic Condition Diagnoses (HRS) | Functional Ability (iADL / ADL) | Physical Health | ||
Pregnancy (PhenX) | Body Mass Index | Resting / Active Heart Rate | |||
Influenza-like Illness (PhenX) | Circadian Rhythm | ||||
Sleep Health | Sleep Disorders (HRS) | Sleep Disturbance (PROMIS) | Sleep Duration | ||
Sleep Efficiency / Latency | |||||
Sleep Stages | |||||
Mental Health | Mental Illness Diagnoses (HRS) | Depression (PHQ-8) | |||
Anxiety (GAD-7) | |||||
Substance Abuse (ASSIST) | |||||
Cognitive Health | Cognitive Status / Ability | ||||
Episodic Memory Test Scores | |||||
Mental Function Test Scores |
Note that this list is non-exhaustive. Core Understanding America Study survey measures, repeated quarterly for sociodemographics and at least biennially for other variables, are available from 2014. For ALiR, survey measures are repeated monthly and minute-level data from wrist-worn Fitbits are collected continuously. ALiR activities occur for at least 1 year, though participants may subsequently choose to contribute indefinitely. Contextual data from public datasets are matched geospatially and temporally to participant timelines. Specific instruments are in parentheses. HRS = University of Michigan’s Health and Retirement Study; PROMIS = Patient Reported Outcomes Measurement Information System; PhenX = National Institutes of Health’s Consensus Measures for Phenotypes and eXposures. ACEs = Adverse Childhood Events Scale; PSS-4 = Perceived Stress Scale-4; USDA = U.S. Department of Agriculture; (i)ADL = (instrumental) Activities of Daily Living; ASSIST = World Health Organization Alcohol, Smoking, and Substance Involvement Screening Test; BRFSS = Behavioral Risk Factor Surveillance System; GAD-7 = General Anxiety Disorder-7; PHQ-8 = Patient Health Questionnaire-8.
We created a flexible and sustainable infrastructure to foster growth, agility, collaboration, and transparency. ALiR is both a data resource and research platform, which can encourage agile and cost-efficient community collaboration. Data collected through ALiR will be available to registered users through the Understanding America Study website upon curation of each cohort-year of data, with year 1 anticipated in mid-2023. With appropriate privacy and data security safeguards, freely available data encourages transparency, reproducibility, and explainability of outputs from statistical, artificial intelligence, and machine learning analyses.
The research platform can accommodate additional features, including: participants; application programming interfaces from wearables, medical devices, Internet of Things, genomics and biomarkers; survey designs including preloaded information, randomization, experiments, and ecological momentary assessments; and interactive communications like notifications and visual dashboards. ALiR seeks to achieve a truly large-scale sample through leveraging the ongoing expansion of the Understanding America Study, which is expected to reach 20,000 participants by 2025, and by incorporating special populations like those with specific diseases, which are housed at academic and industry partners. Accordingly, the code will be open-sourced to encourage harmonizable data collection efforts by others.
The ultimate aim of ALiR is to facilitate broad, multidisciplinary, and equitable investigation of precision health. Engineers may leverage the platform to test performance of new hardware or sensors in diverse populations. Methodologists may characterize factors that drive or amplify selection biases, such as social and structural patterning of study participation and attrition, data quality and missingness as well as develop and test solutions to minimize their impact, such as incentive designs and imputation techniques. Social scientists may investigate social determinant clustering and importance in different populations to prioritize public health investments. Behavioral researchers may develop just-in-time interventions where deviations from individual-specific baselines trigger automated nudges and/or suggestions, such as passive detection of influenza-like symptoms via Fitbit data, which triggers a SARS-CoV-2 testing recommendation. Operations researchers may evaluate utility of case worker or health system integrations.
Collectively, ALiR is a model for achieving diversity, equity, inclusion, transparency, and multi-disciplinary collaboration in precision health.
ACKNOWLEDGEMENTS:
We are grateful to the participants who agreed to contribute their time and digital data for research. We thank Evidation Health, Inc. for study app development. This work was funded by a grant from the National Library of Medicine (NLM) at the NIH awarded to Dr. Chaturvedi (R01LM013237). ALiR relies on data from survey(s) administered by the Understanding America Study, which is maintained by the Center for Economic and Social Research (CESR) at the University of Southern California. The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Footnotes
Weights are provided to rebalance the sample to match the demographic composition of the general U.S. adult population.
COMPETING INTERESTS:
The authors have no financial, personal, or professional competing interests that have influenced the content of the work described herein.
References
- 1.Hood CM, et al. American Journal of Preventive Medicine 50(2):129–135. [DOI] [PubMed] [Google Scholar]
- 2.Bradley EH, et al. 2011. BMJ Quality and Safety in Health Care 20(10):826–831. [DOI] [PubMed] [Google Scholar]
- 3.Collins Francis S., et al. New England Journal of Medicine 372.9 (2015): 793–795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Khoury MJ, et al. JAMA (2016) 316(13):1357–8. 10.1001/jama.2016.12260 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Cho Peter Jaeho, et al. JMIR mHealth and uHealth 10.4 (2022): e29510. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Pratap Abhishek, et al. NPJ Digital Medicine 3.1 (2020): 1–10.31934645 [Google Scholar]
- 7.Zou J, Schiebinger L, Nature 559, 324–326 (2018). [DOI] [PubMed] [Google Scholar]
- 8.Obermeyer Ziad, et al. Science 366.6464 (2019): 447–453. [DOI] [PubMed] [Google Scholar]
- 9.USC Schaeffer Center, 2022. [Online] USC Schaeffer. Available at: <https://healthpolicy.usc.edu/article/biomedical-expert-designs-health-dataset-representing-all-demographic-groups> [Accessed 29 August 2022].
- 10.Alattar Laith, et al. Social Security Bulletin, Vol. 78 No. 2, 2018. [Google Scholar]