Abstract
Indiscriminate use of predictive models incorporating race can reinforce biases present in source data and lead to an exacerbation of health disparities. In some countries, such as the United States, there is therefore a push to remove race from prediction models; however, there are still many prediction models that use race as an input. Biomedical informaticists who are given the responsibility of using these predictive models in healthcare environments are likely to be faced with questions like how to deal with race covariates in these models. Thus, there is a need for a pragmatic framework to help model users think through how to include race in their chosen model so as to avoid inadvertently exacerbating disparities. In this paper, we use the case study of lung cancer screening to propose a simple framework to guide how model users can approach the use (or non-use) of race inputs in the predictive models they are tasked with leveraging in electronic health records and clinical workflows.
Keywords: health disparities, prediction models, decision framework, race
Structured abstract:
Objective
Indiscriminate use of predictive models incorporating race can reinforce biases present in source data and lead to an exacerbation of health disparities. The use of such prediction models is not always straightforward. Biomedical informaticists who are given the responsibility of using such models are likely to be faced with questions like how to deal with race covariates. Thus, there is a need for a pragmatic framework to help model users think through how to use predictive models incorporating race so as to avoid inadvertently exacerbating disparities.
Methods
In this paper, we use the case study of lung cancer screening to propose a simple framework to guide how model users can approach the use (or non-use) of race in the predictive models they are tasked with leveraging in electronic health records and clinical workflows.
Results
A five-step pragmatic framework to help informaticists think through how to use predictive models that include race inputs.
Conclusion
There is unlikely to be any single path to using or not using race that is appropriate for all models and conditions. Therefore, among those responsible for the use of prediction models, an explicit discussion of the role and use of race within each model is warranted. A framework to guide the process can assist model users and provide a structure to weigh these model usage decisions with a consideration for health disparities, fairness, and other institutional priorities.
INTRODUCTION
Indiscriminate use of predictive models incorporating race can reinforce biases present in source data and lead to an exacerbation of health disparities.1 For example, by systematically lowering risk estimates for Black patients, the use of models with race covariates or race ‘correction factors’ as inputs in some clinical decision support tools (e.g. STONE score,2,3 Breast Cancer Surveillance Consortium Risk Calculator,4,5 Osteoporosis Risk SCORE6,7) may inappropriately guide clinicians away from recommending further diagnostics or more aggressive surveillance.1
In response, in some countries, such as the United States, there is a push to remove race from prediction models.8 Consequently, race has been deliberately removed from some models, such as in the National Kidney Foundation’s estimated glomerular filtration rate (eGFR) calculator. While this model reflects the correlation of race with glomerular filtration rate (GFR), the differences in GFR observed between races is mediated by lean muscle mass. Black people tend to have more lean muscle mass than White people on average, but there is significant overlap in body composition between races.9 Therefore, when race-was used as ‘correction factor’ in the calculation of eGFR, Black patients were systematically estimated to have a higher eGFR than White patients, regardless of actual body composition for the individual. Thus, this potentially hindered the early detection and management of kidney disease for many Black individuals whose elevated creatinine was not due to more lean muscle mass.
Indeed, while often observed as a predictor of risk, race is rarely a causative factor in condition development pathways. While frequently used, race is often a surrogate for actual causative factors (e.g., greater lean muscle mass causing elevated creatinine). There can be multiple factors that contribute to condition development that are either unknown or difficult to measure, many of which are correlated with race, including social and environmental factors such as access to healthcare, exposure to carcinogens, and socioeconomic status, as well as disease causing alleles. Thus, if the actual causal factors could be accurately measured, race would likely drop out of many prediction models.
The removal of race from prediction models, however, is not always the best approach, particularly in settings with limited availability of data on causal factors, limited ability to routinely measure causal factors in practice, or an incomplete understanding of causal pathways. In cases where race is associated with the outcome, but there are no suitable alternatives factors to replace it (e.g., association of Black race with a higher cancer risk due to childhood exposures that are hard or impossible to measure), the removal of race from prediction models carries the potential to worsen health disparities.1 For example, Black men have a 1.5 to >3 times higher incidence rate of ischemic stroke than White men.10 Without the addition of an alternate factor to replace the race covariate to stroke incidence rates, removing race from a stroke prediction model is likely to underestimate the stroke risk for Black men. This could lead to an increased disparity in care if more intensive treatments are reserved for patients at higher stroke risk and that risk is systematically underestimated for Black men. While not a biological or genetic determinant of health outcomes,11 the very real associations observed between race and many health outcomes challenges the notion that race should not be considered in clinical decision making.12
Biomedical informaticists are often given the responsibility of using existing predictive models in the electronic health record (EHR). During this process, these model users are likely to be faced with questions on how to deal with race inputs. Healthcare information technology is used in complex and fast-paced environments. Thus, there is a need for a pragmatic framework to help informaticists think through whether and how to use models incorporating race so as to avoid inadvertently exacerbating disparities. In this paper, we use the case study of lung cancer screening (LCS) to propose a simple framework to guide how informaticists can approach the use (or non-use) of race in the predictive models they are tasked with using in EHRs and clinical workflows.
While race is considered a more biological characteristic and ethnicity more a cultural characteristic, in practice, both are often intertwined. For example, in risk models for lung cancer screening used as an example in this manuscript,13,14 an important model input combines race and ethnicity, with possible values of Hispanic, non-Hispanic Black, non-Hispanic White, and Other. Thus, for simplicity and the purposes of this manuscript, we consider race and race/ethnicity to be synonymous. However, a similar framework may be applied to race and ethnicity separately, or to other individual characteristics.
Race in lung cancer prediction
Low dose computed tomography (LDCT) LCS presents an opportunity to shift the stage at which lung cancer is diagnosed and can decrease the lung cancer death rate by 20–24% compared to other screening methods.15,16 However, uptake has been limited and there are significant racial and ethnic disparities in LCS rates, where Black and Hispanic patients have a significantly lower odds of receiving LDCT as compared to White patients.17–19 Consequently, when diagnosed with lung cancer, these groups face worse outcomes compared to White patients, including later diagnosis, lower treatment rates, and lower survival rates.20,21
LCS represents a prime example of where a decision of whether and how to use a predictive model incorporating race poses a dilemma, as the inclusion of race can influence health disparities in divergent ways.22 Specifically, Black patients have a significantly increased risk of developing and dying from lung cancer, such that using race when estimating cancer risk can reduce health disparities by promoting screening for more members of this population.23 On the other hand, Black patients also have reduced average life expectancy as a group, such that using race when estimating life expectancy can exacerbate health disparities by labeling Black patients as unlikely to benefit from screening due to limited predicted life expectancy.14,24 There are many factors that impact lung cancer risk and life expectancy that are either unknown or difficult to measure, many of which are correlated with race, such as access to healthcare, exposure to carcinogens, and other social determinants of health.23,25 In lung cancer and life expectancy prediction models, race can act as a surrogate, albeit imperfect, for these unmeasured factors. Race may also serve to account for race-associated differences in the completeness of EHR data.26–28 If the causal factors contributing to the observed racial differences in lung cancer risk and life expectancy could be measured accurately, race would likely drop out of these models, but given feasibility and the current state of science, race remains an important, if not always straightforward, variable to consider in lung cancer and life expectancy prediction models.
Removing race covariates for lung cancer risk can increase disparities by limiting the number of potentially vulnerable high-risk people who are encouraged to receive LCS.29 As it currently stands, neither the United States Preventive Services Taskforce (USPSTF) or CHEST LCS guidelines, nor the frequently used Bach et al., 2003 lung cancer risk model, take race directly into consideration; instead, they use factors including age, gender, and smoking history to designate high-risk patients.29–31 The removal of race as a potential variable in these LCS guidelines disproportionately disadvantages high-risk Black patients from LCS eligibility as compared to White patients.32 Indeed, among those diagnosed with lung cancer, a significantly lower percentage of Black smokers were eligible for screening according to USPSTF guidelines as compared with White smokers (32% vs. 56%).32
In an attempt to address potential disparities resulting from racial differences in smoking habits,32 the USPSTF LCS guidelines were updated to reduce the pack years required for screening eligibility and the CHEST guidelines provide a more nuanced approach based on patient smoking history and age.31 While this increases the number of Black patients who smoke who qualify for screening, the guideline changes have been estimated to actually widen disparities, as proportionately more White smokers qualify for screening, while Black smokers continue to be underrepresented among individuals eligible for LCS.33 Lung cancer risk prediction models that include race repeatedly outperform the race-blind guidelines for identifying high-risk Black and Hispanic patients with LCS-avertable lung cancer deaths.13,34–36
Conversely, using LCS models that incorporate race also has the potential to exacerbate health disparities. In particular, this can been seen in the use of life expectancy when estimating LCS benefit, which has been proposed as a potential strategy to identify high-benefit individuals.14,37 In the US, Black Americans have an average life expectancy several years less than other populations.24 Consequently, straightforward use of race in life-expectancy calculations could negatively impact screening recommendations for Black individuals due to a lower estimated life expectancy or lower estimated life-year gain. Thus, when estimating life expectancy, using a model that incorporates race can potentially worsen disparities.
LCS model example
To illustrate the use of the framework discussed below, we reference our experiences using the LCS models developed by Katki et al.13 and Cheung et al.,14 and freely available as an R package,38 into the EHR.39 The LCS models estimate the benefits of LCS by calculating individual risks of lung cancer incidence, lung cancer mortality, and all-cause mortality, in both the absence and presence of screening, based on covariates: age, education, sex, race/ethnicity, smoking intensity/duration/quit-years, body mass index, family history of lung-cancer, and multiple comorbidities.14 These models use the covariates to estimate the expected impact of lung cancer screening on increasing life expectancy and reducing lung cancer mortality. The model estimates both the risk of developing lung cancer and life expectancy.
A decision-making framework for considering race in prediction models
In our own implementation of an LCS decision tool into the EHR,39 acknowledgement of the disparate impacts of race covariates on the propagation of LCS disparities required the development of a framework within which we evaluated the potential impact of race inputs in our prediction models. Working within the framework provided a structured means for addressing critical questions, such as 1) Does including race help close outcome disparities? 2) Do overall gains in health improvements outweigh harms? and 3) Is there an alternate model with a non-race factor within the causal pathway that is feasible to use? When faced with the dilemma of whether and how to use prediction models with race inputs, model users may benefit from employing a similar framework to guide their inquiry.
Here we describe a framework using examples from LCS to guide model users as they think about and address race in the models they are using. Figure 1 depicts the assessment framework developed. This pragmatic framework guides users through pertinent questions to assist in decision making related to the use of models that include race covariates. The five steps of the framework are as follows:
Figure 1:

Assessing the impact of race on inequity and its application to lung cancer screening
Step 1:
Determine places in which race appears in the model.
A first step in the framework is to review the prediction model variables to identify each use where there is the inclusion of race as an input. Step 1 identifies model input variables that include race itself, or are felt to be so intimately associated with race that they are virtually synonymous. Race synonymous variables may include country of origin or preferred language. This step is not intended to identify variables that are merely correlated with race in a way that a modeler would consider including a race-based covariate to that variable.
LCS example:
For the LCS models, the model race input variables identified occur in two places as covariates for lung cancer risk and life-expectancy. In this example, lung cancer rates are higher in Black patients, so the model developer included a race-based rate modifier to increase the base rate of lung cancer for someone who is Black.
Step 2:
Evaluate the suitability of model race variables through the lens of potential comparable models that use alternative variables that represent factors within the causal pathway of the condition of interest.
Race is infrequently involved directly within causal pathways, but it is often used as a surrogate for causal factors that are unknown or unmeasured.23,25,40 While including race is potentially useful, much of the data used to generate conclusions about the associations between race and health outcomes are based on sources that represent samples with predominantly White populations, with racial differences found in large data sets most likely reflecting sampling bias or the effects of being Black in America rather than being Black itself.41 For this reason, when available, alternate but otherwise comparable models with better covariates, such as those with other more causative factors should be evaluated as replacements. If a better factor is available for a race covariate identified in step 1, seek an alternative model that employs those variables and evaluate that model beginning at step 1; otherwise, if no better alternate model exists, continue to step 3. For example, in the previously discussed case of the eGFR calculator,12,42,43 lean muscle mass represents the variable that could replace race in the model.9,44 If using a GFR calculator, an informatician should select an otherwise comparable model that includes a lean muscle mass variable as opposed to a model with a race variable.
LCS example:
No causative factors in the lung cancer development pathway were identified as potential replacements for the race covariate in the LCS prediction model and upon assessment of other existent LCS prediction models;31 thus, the decision was made to continue using the selected model. Socioeconomic status (SES) was considered as a potential alternative to race due to its significant association with lung cancer risk.45 However, SES is similarly outside the causal pathway for lung cancer and limited SES data availability in the EHR hinders the feasibility of applying SES for risk prediction.
Step 3:
Determine the impact of race in the model by answering the questions: Does using the actual race reduce inequity? Does a counter-factual race input reduce inequity?
Once uses of race as covariates are identified and potential alternate measures examined, a systematic evaluation is needed to determine the direction and magnitude of the impact on disparities of including actual race or a counter-factual race in a model. The outcomes of these analyses may not always be straight forward and opposing effects may be observed. Removing race entirely from a model may be burdensome; likely requiring model redevelopment. Therefore, one method for evaluating the potential impact of race covariates in a prediction model is to examine the counter factual to determine the disparity impact if the patient were a different race. In a counter-factual analysis, the model outcomes for someone of one race are compared to the model outcomes for another race, while holding all other factors constant. This can allow for the examination of model fairness and potential impact on disparities.46
To accomplish this step, an appropriate outcome measure to evaluate impact on inequity must be determined. These measures will be dependent on a model’s focus and purpose, as well as institutional/societal priorities. Measures may include health outcomes, life expectancy, receipt of care, etc. While our LCS model specifically assessed disparities and inequity as they relate to the differential receipt of LCS by race, the way in which outcome disparity is defined will be dependent on a model’s subject content and the priorities of the users. For example, when using a model designed to guide the use of Prostate-Specific Antigen (PSA) testing, a stakeholder with a priority to reduce over overdiagnosis47 may choose to use a fairness outcome such as rate of positive tests leading to unnecessary follow-up procedures.
LCS example:
The two outcome variables with race covariates identified in the LCS prediction model produced divergent results when assessed in counter-factual scenarios. Altering race in lung cancer risk estimates exacerbated inequities leading to fewer recommendations for LCS among Black patients as compared to White patients.48 On the other hand, altering race in life expectancy estimates reduced inequity, with more Black patients receiving LCS, but had an unintended consequence of reducing the number of Asian patients referred to LCS as a result of Asian-Americans on average having a longer life expectancy than White Americans.
Step 4:
Through the lens of fairness, and within the context of institutional (or societal etc.) priorities, weigh which race input variables should be kept or modified within the model being used.
Decisions about how to use race in the delivery of healthcare should never occur in isolation. The results of counter-factual analyses will likely need to be placed in the context of use goals and institutional priorities, along with a consideration of what would be considered “fair” for those impacted by these decisions. While there are multiple definitions within healthcare of what is “fair,” in this setting, fairness is defined as not disproportionately harming or benefiting those of a particular race.49
LCS example:
In the LCS model, the decision was made to not modify race for estimating lung cancer risk, but to modify race when estimating life-expectancy using “counterfactual eligibility”, that recalculates life-expectancy for a minority member as if they were White-American, and then uses the larger estimate.50 In fairness analyses, using an non-race specific average cancer risk variable to simulate removing race from lung cancer risk significantly increased LCS disparities and additionally could increase harms due to adverse events from over-screening among Hispanic- and Asian-Americans, who have a lower observed lung cancer risk than other groups. When estimating life expectancy, however, it was observed that modifying race (making life expectancy at least as high as a comparable White patient) had a substantial positive impact on LCS rates among Black patients, but could negatively impact Hispanic and Asian patients. Of note, a counter-factual use of race can be applied selectively, e.g., such that a counter-factual race is used only when it reduces disparities. For example, if an Asian patient would be estimated to live longer than a White patient, their actual race can be used rather than the counter-factual White race. This is in fact what we decided to do in this example for LCS. Therefore, the use of counter-factual eligibility increases African-American eligibility, without reducing eligibility for any other group. We feel that counterfactual eligibility is more fair than simply removing race from a model.50 Within the institutional context, the priority of increasing LCS among high-risk populations further reinforced these decisions.
Step 5:
Create transparency by reporting the approach to how and where race is used in the model where appropriate.
Reporting the approach of how race was used in the model can be important for creating transparency that allows end users to make a more educated decision of how to apply the model results in their decision making. With the input of potential users an appropriate and informative description of the use of race in the model can be included in the user-facing tools. In addition, such transparency has the potential to improve patient and clinician trust in the model. This step may not be applicable if there are no race variables in the model used.
LCS example:
Focus groups were held with patients and providers to determine preferences for how the model inclusion decisions would be best presented. Using feedback from these stakeholder populations, information explaining how race may impact model results and the interpretation of the results is planned to be incorporated into relevant LCS decision tools.
DISCUSSION
An empirical approach that critically and systemically evaluates the impact of race exclusion and inclusion in prediction models can aid in decision making related to prediction model use. As with the example of LCS, a framework to guide the process can assist informaticists to tease apart sometimes opposing effects of using race within prediction models and provide a structure useful for weighing these decisions with regard to health disparities and fairness, along with other priorities. However, due to the thorny nature of including race in prediction models, further research is needed to answer questions such as 1) Is it appropriate to ignore race for some measures like life expectancy, but not others? 2) How can measures that include race account for multi-racial individuals? And 3) How do you explain these decisions to relevant stakeholders?
Weighing the fairness of a model (framework step 4) is the step that requires the most work and deliberation. This is particularly true when not all groups are impacted similarly by including or excluding race from the model. When determining model fairness with disparate population-specific effects, it may be pertinent to consider the thought experiment of the “original position” proposed by political philosopher John Rawls. Rawls built on the foundational premise of social contract theory to argue that rational people will develop fair and equal conceptions of justice in society when hypothetically isolated from knowledge of their own personal circumstances (social position, intelligence, appearance) – thereby circumventing the biases of thought that emerge when protecting self-interests.51,52 Translating this mode of thinking to the use of race in prediction models would encourage developers not to think of the benefits or harms of including race as a variable from the perspective of just one population, but from the perspective of all groups. With this foundation, developers could better weigh what a “fair” model would be and strive to make decisions that would not be overwhelmingly harmful or advantageous for any one group. However, this approach remains imperfect and additional research is needed to guide developers in what to do in these cases in particular where conflict is present.
Given the current inescapability of potential bias whether including or excluding race, transparency has the potential to ensure clinicians and others using the prediction models can make educated decisions regarding clinical decisions that may be based on race. However, there is additional research needed to determine optimal strategies for conveying the choices made in model development and the implications of using race within prediction models. More generally, to improve transparency, prediction model users could be encouraged to summarize how race impacts results of a particular recommendation so clinicians can be better informed.53 A sociotechnical systems methods may serve to help guide these decisions as they are designed to elucidate and solve important challenges related to communication, human-computer interaction, cognition, and motivation.54–58
Furthermore, how race is defined deserves reflection, as definitions can differentially capture phenotypic, cultural, and genetic factors associated with race and thus impact model interpretation. Most frequently biomedical research relies on self-reported race, which may provide a representation of particular phenotypes or cultural affiliations.59 Similarly, race may also be defined by a third-party observer, such as a healthcare provider, which may capture phenotypic characteristics, but miss cultural factors. Both of these categorizations, however, may not be reflective of genetic ancestry as there is significant genetic homogeneity within racial categories. Additionally, race can also be categorized by genetic ancestry, which may capture biological factors contributing to health, but not account for societal and cultural implications of that individual’s unique phenotypes.
Finally, there is a general need to develop better data collection strategies and analyses of causative pathways to more accurately account for the mediating factors that lead to the observed racial associations with health outcomes. When possible, factors within a causal pathway should be used to replace race. However, existing models are often functionally limited by the type of data routinely collected in operational clinical care, which, more often than not, does not include potentially impactful circumstantial factors. Therefore, alternate data collection strategies could be considered to obtain usable data on key known risk factors, such as using neighborhood-level measures by zip code as a stand-in for SES.60 Along with addressing causal factor considerations, wise decisions about including race can lead to the development of less biased prediction models that improve racial disparities in care delivery.
CONCLUSION
Decision making surrounding the use of race in the use of prediction models is not always straight forward. There is unlikely to be any single path to including or excluding race that is appropriate for all models and conditions. Therefore, among those responsible for the use of prediction models, an explicit discussion of the role of race within each model is warranted. A framework to guide the process can assist model users and provide a structure to weigh these model usage decisions with a consideration for health disparities, fairness, and other institutional priorities. However, additional attention is needed to evaluate the broader implications of selectively using race in the use of these models and how to best communicate these decisions to clinicians and patients. Further efforts are also needed to identify and make data available for factors that may better replace race in prediction models.
Statement of Significance.
Problem or Issue
Indiscriminate use of race in clinical algorithms can reinforce biases present in source data and lead to an exacerbation of health disparities.
What is Already Known
Biomedical informaticists who are given the responsibility of implementing clinical algorithms are likely to be faced with questions of how to deal with race in these models. For those instances where the answer is not simple, there is a need for a pragmatic framework to help informaticists think through how to include race in their models.
What this Paper Adds
We propose a simple framework to guide the use (or non-use) of race clinical algorithms.
Acknowledgements
Funding:
This work was funded by ARHQ R18-HS028791 (Kawamoto). Stevens is supported by NIH grant number K01-AG075169.
Footnotes
Conflicts of Interest: The authors have no conflicts of interest to declare.
REFERNCES
- 1.Vyas DA, Eisenstein LG, Jones DS. Hidden in Plain Sight — Reconsidering the Use of Race Correction in Clinical Algorithms. New England Journal of Medicine. 2020;383(9):874–882. doi: 10.1056/NEJMms2004740 [DOI] [PubMed] [Google Scholar]
- 2.Wang RC, Rodriguez RM, Moghadassi M, et al. External Validation of the STONE Score, a Clinical Prediction Rule for Ureteral Stone: An Observational Multi-institutional Study. Ann Emerg Med. Apr 2016;67(4):423–432.e2. doi: 10.1016/j.annemergmed.2015.08.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Wang RC, Rodriguez RM, Moghadassi M, et al. External Validation of the STONE Score, a Clinical Prediction Rule for Ureteral Stone: An Observational Multi-institutional Study. Annals of Emergency Medicine. 2016/April/01/ 2016;67(4):423–432.e2. doi: 10.1016/j.annemergmed.2015.08.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Tice JA, Miglioretti DL, Li CS, Vachon CM, Gard CC, Kerlikowske K. Breast Density and Benign Breast Disease: Risk Assessment to Identify Women at High Risk of Breast Cancer. J Clin Oncol. Oct 1 2015;33(28):3137–43. doi: 10.1200/jco.2015.60.8869 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Hubbard RA, Pujol TA, Alhajjar E, Edoh K, Martin ML. Sources of Disparities in Surveillance Mammography Performance and Risk-Guided Recommendations for Supplemental Breast Imaging: A Simulation Study. Cancer Epidemiol Biomarkers Prev. 2023:OF1–OF11. doi: 10.1158/1055-9965.Epi-23-0330 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Lydick E, Cook K, Turpin J, Melton M, Stine R, Byrnes C. Development and validation of a simple questionnaire to facilitate identification of women likely to have low bone density. Am J Manag Care. Jan 1998;4(1):37–48. [PubMed] [Google Scholar]
- 7.Cass AR, Shepherd AJ, Carlson CA. Osteoporosis risk assessment and ethnicity. JGIM. 2006/June/01 2006;21(6):630–635. doi: 10.1111/j.1525-1497.2006.00459.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Basu A Use of race in clinical algorithms. Science Advances. 2023;9(21):eadd2704. doi:doi: 10.1126/sciadv.add2704 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Silva AM, Shen W, Heo M, et al. Ethnicity-related skeletal muscle differences across the lifespan. Am J Hum Biol. Jan-Feb 2010;22(1):76–82. doi: 10.1002/ajhb.20956 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Albright KC, Huang L, Blackburn J, et al. Racial differences in recurrent ischemic stroke risk and recurrent stroke case fatality. Neurology. Nov 6 2018;91(19):e1741–e1750. doi: 10.1212/wnl.0000000000006467 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Suzuki K, Von Vacano DA. Reconsidering Race: Social Science Perspectives on Racial Categories in the Age of Genomics. Oxford University Press; 2018. [Google Scholar]
- 12.Paulus JK, Kent DM. Race and Ethnicity: A Part of the Equation for Personalized Clinical Decision Making? Circ Cardiovasc Qual Outcomes. Jul 2017;10(7)doi: 10.1161/circoutcomes.117.003823 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Katki HA, Kovalchik SA, Berg CD, Cheung LC, Chaturvedi AK. Development and Validation of Risk Models to Select Ever-Smokers for CT Lung Cancer Screening. Jama. Jun 7 2016;315(21):2300–11. doi: 10.1001/jama.2016.6255 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Cheung LC, Berg CD, Castle PE, Katki HA, Chaturvedi AK. Life-Gained-Based Versus Risk-Based Selection of Smokers for Lung Cancer Screening. Ann Intern Med. Nov 5 2019;171(9):623–632. doi: 10.7326/m19-1263 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Potter AL, Rosenstein AL, Kiang MV, et al. Association of computed tomography screening with lung cancer stage shift and survival in the United States: quasi-experimental study. BMJ. 2022;376:e069008. doi: 10.1136/bmj-2021-069008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Lung Cancer Incidence and Mortality with Extended Follow-up in the National Lung Screening Trial. J Thorac Oncol. Oct 2019;14(10):1732–1742. doi: 10.1016/j.jtho.2019.05.044 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Lake M, Shusted CS, Juon HS, et al. Black patients referred to a lung cancer screening program experience lower rates of screening and longer time to follow-up. BMC Cancer. Jun 16 2020;20(1):561. doi: 10.1186/s12885-020-06923-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Narayan AK, Chowdhry DN, Fintelmann FJ, Little BP, Shepard JO, Flores EJ. Racial and Ethnic Disparities in Lung Cancer Screening Eligibility. Radiology. Dec 2021;301(3):712–720. doi: 10.1148/radiol.2021204691 [DOI] [PubMed] [Google Scholar]
- 19.Japuntich SJ, Krieger NH, Salvas AL, Carey MP. Racial Disparities in Lung Cancer Screening: An Exploratory Investigation. J Natl Med Assoc. Oct 2018;110(5):424–427. doi: 10.1016/j.jnma.2017.09.003 [DOI] [PubMed] [Google Scholar]
- 20.American Lung Association. State of Lung Cancer: 2022 Report. March 20, 2023. https://www.lung.org/getmedia/647c433b-4cbc-4be6-9312-2fa9a449d489/solc-2022-print-report
- 21.Olazagasti C, Seetharamu N. Disparities in lung cancer screening rates among the Hispanic/LatinX population. Lung Cancer Manag. Sep 2021;10(3):Lmt51. doi: 10.2217/lmt-2021-0004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Paulus JK, Kent DM. Predictably unequal: understanding and addressing concerns that algorithmic clinical prediction may increase health disparities. NPJ digital medicine. 2020;3(1):99. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Shusted CS, Evans NR, Juon H-S, Kane GC, Barta JA. Association of Race With Lung Cancer Risk Among Adults Undergoing Lung Cancer Screening. JAMA Network Open. 2021;4(4):e214509–e214509. doi: 10.1001/jamanetworkopen.2021.4509 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Schwandt H, Currie J, Bär M, et al. Inequality in mortality between Black and White Americans by age, place, and cause and in comparison to Europe, 1990 to 2018. Proc Natl Acad Sci U S A. Oct 5 2021;118(40)doi: 10.1073/pnas.2104684118 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Coates RD, Ferber AL, Brunsma DL. The matrix of race: Social construction, intersectionality, and inequality. Sage Publications; 2021. [Google Scholar]
- 26.Chavez-Yenter D, Goodman MS, Chen Y, et al. Association of Disparities in Family History and Family Cancer History in the Electronic Health Record With Sex, Race, Hispanic or Latino Ethnicity, and Language Preference in 2 Large US Health Care Systems. JAMA Network Open. 2022;5(10):e2234574–e2234574. doi: 10.1001/jamanetworkopen.2022.34574 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Russell CK, McNeill M. Improving Lung Cancer Screening Rates Through an Evidence-Based Electronic Health Record Smoking History. J Nurs Care Qual. Jul-Sep 01 2022;37(3):263–268. doi: 10.1097/ncq.0000000000000623 [DOI] [PubMed] [Google Scholar]
- 28.Kukhareva PV, Caverly TJ, Li H, et al. Inaccuracies in electronic health records smoking data and a potential approach to address resulting underestimation in determining lung cancer screening eligibility. J Am Med Inform Assoc. Apr 13 2022;29(5):779–788. doi: 10.1093/jamia/ocac020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Rivera MP, Katki HA, Tanner NT, et al. Addressing Disparities in Lung Cancer Screening Eligibility and Healthcare Access. An Official American Thoracic Society Statement. Am J Respir Crit Care Med. Oct 1 2020;202(7):e95–e112. doi: 10.1164/rccm.202008-3053ST [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Bach PB, Kattan MW, Thornquist MD, et al. Variations in Lung Cancer Risk Among Smokers. JNCI: Journal of the National Cancer Institute. 2003;95(6):470–478. doi: 10.1093/jnci/95.6.470 [DOI] [PubMed] [Google Scholar]
- 31.Mazzone PJ, Silvestri GA, Souter LH, et al. Screening for Lung Cancer: CHEST Guideline and Expert Panel Report. Chest. Nov 2021;160(5):e427–e494. doi: 10.1016/j.chest.2021.06.063 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Aldrich MC, Mercaldo SF, Sandler KL, Blot WJ, Grogan EL, Blume JD. Evaluation of USPSTF Lung Cancer Screening Guidelines Among African American Adult Smokers. JAMA Oncol. Sep 1 2019;5(9):1318–1324. doi: 10.1001/jamaoncol.2019.1402 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Reese TJ, Schlechter CR, Potter LN, et al. Evaluation of Revised US Preventive Services Task Force Lung Cancer Screening Guideline Among Women and Racial/Ethnic Minority Populations. JAMA Network Open. 2021;4(1):e2033769–e2033769. doi: 10.1001/jamanetworkopen.2020.33769 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Pasquinelli MM, Tammemägi MC, Kovitz KL, et al. Brief Report: Risk Prediction Model Versus United States Preventive Services Task Force 2020 Draft Lung Cancer Screening Eligibility Criteria-Reducing Race Disparities. JTO Clin Res Rep. Mar 2021;2(3):100137. doi: 10.1016/j.jtocrr.2020.100137 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Han SS, Chow E, Ten Haaf K, et al. Disparities of National Lung Cancer Screening Guidelines in the US Population. J Natl Cancer Inst. Nov 1 2020;112(11):1136–1142. doi: 10.1093/jnci/djaa013 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Young CD, Cheung LC, Berg CD, et al. Abstract PR-13: Potential effect on racial/ethnic disparities of removing racial/ethnic variables from risk models: The example of lung-cancer screening. Cancer Epidemiol Biomarkers Prev. 2022;31(1_Supplement):PR-13–PR-13. doi: 10.1158/1538-7755.Disp21-pr-13 [DOI] [Google Scholar]
- 37.Landy R, Young CD, Skarzynski M, et al. Using Prediction Models to Reduce Persistent Racial and Ethnic Disparities in the Draft 2020 USPSTF Lung Cancer Screening Guidelines. JNCI: Journal of the National Cancer Institute. 2021;113(11):1590–1594. doi: 10.1093/jnci/djaa211 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.National Cancer Institute. Lung Cancer Risk Models for Screening (R package: lcrisks). Accessed August 29, 2023, https://dceg.cancer.gov/tools/risk-assessment/lcrisks [Google Scholar]
- 39.Kukhareva PV, Li H, Caverly TJ, et al. Implementation of Lung Cancer Screening in Primary Care and Pulmonary Clinics: Pragmatic Clinical Trial of Electronic Health Record-Integrated Everyday Shared Decision-Making Tool and Clinician-Facing Prompts. Chest. May 3 2023;doi: 10.1016/j.chest.2023.04.040 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Zavala VA, Bracci PM, Carethers JM, et al. Cancer health disparities in racial/ethnic minorities in the United States. British Journal of Cancer. 2021/January/01 2021;124(2):315–332. doi: 10.1038/s41416-020-01038-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Krieger N Methods for the Scientific Study of Discrimination and Health: An Ecosocial Approach. American Journal of Public Health. 2012;102(5):936–944. doi: 10.2105/ajph.2011.300544 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.National Kidney Foundation. Changes to eGFR Calculation and What that Means for People Living with Kidney Disease. Accessed April 3, 2023, https://www.kidney.org/newsletter/changes-to-egfr-calculation-and-what-means-people-living-kidney-disease
- 43.Delgado C, Baweja M, Burrows NR, et al. Reassessing the Inclusion of Race in Diagnosing Kidney Diseases: An Interim Report From the NKF-ASN Task Force. Am J Kidney Dis. Jul 2021;78(1):103–115. doi: 10.1053/j.ajkd.2021.03.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Chew-Harris JS, Florkowski CM, Elmslie JL, Livesey J, Endre ZH, George PM. Lean mass modulates glomerular filtration rate in males of normal and extreme body composition. Intern Med J. Aug 2014;44(8):749–56. doi: 10.1111/imj.12479 [DOI] [PubMed] [Google Scholar]
- 45.Hovanec J, Siemiatycki J, Conway DI, et al. Lung cancer and socioeconomic status in a pooled analysis of case-control studies. PLoS One. 2018;13(2):e0192999. doi: 10.1371/journal.pone.0192999 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Cornacchia G, Anelli VW, Narducci F, Ragone A, Di Sciascio E. Counterfactual Fair Opportunity: Measuring Decision Model Fairness with Counterfactual Reasoning. arXiv preprint arXiv:230208158. 2023; [Google Scholar]
- 47.Marcus PM, Prorok PC, Miller AB, DeVoto EJ, Kramer BS. Conceptualizing overdiagnosis in cancer screening. J Natl Cancer Inst. Apr 2015;107(4)doi: 10.1093/jnci/djv014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Gomez I, Caverly T, Kawamoto K, et al. Effect of Removing Race and Ethnicity From Lung Cancer Prediction Models on Disparities in Eligibility for Lung Cancer Screening. A110 LUNG CANCER RISK, SCREENING, TREATMENT, AND OUTCOMES. A2722–A2722. [Google Scholar]
- 49.Olsen JA. 814 Concepts of Equity and Fairness in Health and Health Care. In: Glied S, Smith PC, eds. The Oxford Handbook of Health Economics. Oxford University Press; 2011:0. [Google Scholar]
- 50.Landy R GI, Caverly TJ, Kawamoto K, Rivera MP, Robbins HA, Young CD, Chaturvedi AK, Cheung LC, Katki HA. Comparison of methods for using race and ethnicity in prediction models for lung cancer screening eligibility in the U.S. population. JAMA Network Open. 2023;In Press; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Vickrey W Measuring Marginal Utility by Reactions to Risk. Econometrica. 1945;13(4):319–333. doi: 10.2307/1906925 [DOI] [Google Scholar]
- 52.Harsanyi JC. Cardinal Utility in Welfare Economics and in the Theory of Risk-taking. Journal of Political Economy. 1953;61(5):434–435. doi: 10.1086/257416 [DOI] [Google Scholar]
- 53.MDCalc. MDCalc Statement on Race. Accessed March 23, 2023, https://www.mdcalc.com/race
- 54.Sittig DF, Singh H. A new sociotechnical model for studying health information technology in complex adaptive healthcare systems. Quality & safety in health care. Oct 2010;19 Suppl 3(Suppl 3):i68–74. doi: 10.1136/qshc.2010.042085 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Ash JS, Sittig DF, Dykstra RH, Guappone K, Carpenter JD, Seshadri V. Categorizing the unintended sociotechnical consequences of computerized provider order entry. Int J Med Inform. Jun 2007;76 Suppl 1:S21–7. doi: 10.1016/j.ijmedinf.2006.05.017 [DOI] [PubMed] [Google Scholar]
- 56.Weir CR, Taber P, Taft T, Reese TJ, Jones B, Del Fiol G. Feeling and thinking: can theories of human motivation explain how EHR design impacts clinician burnout? J Am Med Inform Assoc. Apr 23 2021;28(5):1042–1046. doi: 10.1093/jamia/ocaa270 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Weir C, Drews FA, Leecaster MK, Barrus RJ, Hellewell JL, Nebeker JR. The orderly and effective visit: impact of the electronic health record on modes of cognitive control. AMIA Annual Symposium proceedings / AMIA Symposium AMIA Symposium. 2012;2012:979–87. [PMC free article] [PubMed] [Google Scholar]
- 58.Butler J, Gibson B, Velasquez T, Weir C. Patient Centered Care for Geriatric Patients: Information Expert Providers Use to Make Patient Centered Care Decisions. presented at: 38th Annual Meeting for Society for Medical Decision Making; 2016; Vancouver BC. [Google Scholar]
- 59.Tessum CW, Paolella DA, Chambliss SE, Apte JS, Hill JD, Marshall JD. PM2.5 polluters disproportionately and systemically affect people of color in the United States. Science Advances. 2021;7(18):eabf4491. doi:doi: 10.1126/sciadv.abf4491 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Link-Gelles R, Westreich D, Aiello AE, et al. Bias with respect to socioeconomic status: A closer look at zip code matching in a pneumococcal vaccine effectiveness study. SSM Popul Health. Dec 2016;2:587–594. doi: 10.1016/j.ssmph.2016.08.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
