Skip to main content
. 2025 Aug 22;27:e63793. doi: 10.2196/63793

Table 1. “Underrepresented in biomedical research” (UBR) codebook used to classify the dataset.

UBR code and subcodesa Description
“Race and Ethnicity” (ie, “Asian,” “Black, African or African American,” “Hispanic, Spanish, or Latino,” “Native Hawaiian or Pacific Islander,” “Middle Eastern or North African,” Other, Ambiguous) “Individuals who identify as other than White and non-Hispanic”
“Access to Care” “Individuals who have not had a needed medical visit in the past 12 months or cannot easily obtain or pay for medical care as needed”
“Age” (ie, “Children 17 or younger,” “Adults 65 or older”) “Children 17 or younger and adults 65 or older”
“Annual Household Income” “Individuals with household incomes equal to or below 200% of the Federal Poverty Level”
“Disability” “Individuals with either a physical or cognitive disability”
“Educational Attainment” “Individuals with less than a high school degree or equivalent”
“Gender Identity” “Individuals who identify as gender variant, non-binary, transgender, or something else”
“Geography” “Individuals who reside in rural and non-metropolitan areas”
“Sex Assigned at Birth” “Individuals who are neither male nor female (ie, intersex)”
“Sexual Orientation” “Individuals who identify as asexual, bisexual, gay or lesbian or something else”
Nonspecific Individuals who are broadly categorized as underrepresented but whose specific group is unnamed (eg, “groups that are underrepresented in biomedical research”)
a

Codes and subcodes with quotation marks came verbatim from the All of Us Research Program [5,16], whereas codes and subcodes in italics came from the authors. Only “Race and Ethnicity” and “Age” have subcodes listed because these were the only codes with related subcodes assigned to any posts.