Key Points
Question
Are there systematic racial and ethnic biases in pulse oximetry among patients with COVID-19, and is there an association between such biases and unrecognized or delayed recognition of eligibility for oxygen threshold–specific therapy?
Findings
In this retrospective cohort study of 7126 patients with COVID-19, an analysis of 1216 patients with oxygen saturation levels that were concurrently measured by pulse oximetry and arterial blood gas demonstrated that pulse oximetry overestimated arterial oxygen saturation among Asian, Black, and Hispanic patients compared with White patients. Separately, among 6673 patients with pulse oximetry measurements and available covariate data, predicted overestimation of arterial oxygen saturation levels by pulse oximetry among 1903 patients was associated with a systematic failure to identify Black and Hispanic patients who were qualified to receive COVID-19 therapy and a statistically significant delay in recognizing the guideline-recommended threshold for initiation of therapy.
Meaning
The study results suggest that overestimation of arterial oxygen saturation levels by pulse oximetry occurs in patients of racial and ethnic minority groups with COVID-19 and contributes to unrecognized or delayed recognition of eligibility to receive COVID-19 therapies.
Abstract
Importance
Pulse oximetry guides triage and therapy decisions for COVID-19. Whether reported racial inaccuracies in oxygen saturation measured by pulse oximetry are present in patients with COVID-19 and associated with treatment decisions is unknown.
Objective
To determine whether there is differential inaccuracy of pulse oximetry by race or ethnicity among patients with COVID-19 and estimate the association of such inaccuracies with time to recognition of eligibility for oxygen threshold–specific COVID-19 therapies.
Design, Setting, and Participants
This retrospective cohort study of clinical data from 5 referral centers and community hospitals in the Johns Hopkins Health System included patients with COVID-19 who self-identified as Asian, Black, Hispanic, or White.
Exposures
Concurrent measurements (within 10 minutes) of oxygen saturation levels in arterial blood (SaO2) and by pulse oximetry (SpO2).
Main Outcomes and Measures
For patients with concurrent SpO2 and SaO2 measurements, the proportion with occult hypoxemia (SaO2<88% with concurrent SpO2 of 92%-96%) was compared by race and ethnicity, and a covariate-adjusted linear mixed-effects model was produced to estimate the association of race and ethnicity with SpO2 and SaO2 difference. This model was applied to identify a separate sample of patients with predicted SaO2 levels of 94% or less before an SpO2 level of 94% or less or oxygen treatment initiation. Cox proportional hazards models were used to estimate differences by race and ethnicity in time to recognition of eligibility for guideline-recommended COVID-19 therapies, defined as an SpO2 level of 94% or less or oxygen treatment initiation. The median delay among individuals who ultimately had recognition of eligibility was then compared.
Results
Of 7126 patients with COVID-19, 1216 patients (63 Asian [5.2%], 478 Black [39.3%], 215 Hispanic [17.7%], and 460 White [37.8%] individuals; 507 women [41.7%]) had 32 282 concurrently measured SpO2 and SaO2. Occult hypoxemia occurred in 19 Asian (30.2%), 136 Black (28.5%), and 64 non-Black Hispanic (29.8%) patients compared with 79 White patients (17.2%). Compared with White patients, SpO2 overestimated SaO2 by an average of 1.7% among Asian (95% CI, 0.5%-3.0%), 1.2% among Black (95% CI, 0.6%-1.9%), and 1.1% among non-Black Hispanic patients (95% CI, 0.3%-1.9%). Separately, among 1903 patients with predicted SaO2 levels of 94% or less before an SpO2 level of 94% or less or oxygen treatment initiation, compared with White patients, Black patients had a 29% lower hazard (hazard ratio, 0.71; 95% CI, 0.63-0.80), and non-Black Hispanic patients had a 23% lower hazard (hazard ratio, 0.77; 95% CI, 0.66-0.89) of treatment eligibility recognition. A total of 451 patients (23.7%) never had their treatment eligibility recognized, most of whom (247 [54.8%]) were Black. Among the remaining 1452 (76.3%) who had eventual recognition of treatment eligibility, Black patients had a median delay of 1.0 hour (95% CI, 0.23-1.9 hours; P = .01) longer than White patients. There was no significant median difference in delay between individuals of other racial and ethnic minority groups and White patients.
Conclusions and Relevance
The results of this cohort study suggest that racial and ethnic biases in pulse oximetry accuracy were associated with greater occult hypoxemia in Asian, Black, and non-Black Hispanic patients with COVID-19, which was associated with significantly delayed or unrecognized eligibility for COVID-19 therapies among Black and Hispanic patients. This disparity may contribute to worse outcomes among Black and Hispanic patients with COVID-19.
This cohort study examines the differential inaccuracy of pulse oximetry by race or ethnicity among patients with COVID-19.
Introduction
The modern pulse oximeter provides a noninvasive estimate of arterial blood oxygen saturation levels based on the relative absorbance of 2 wavelengths of light and the pulsatile flow of arterial blood (SpO2).1 Since its development in the 1970s, pulse oximetry has been integrated into the routine monitoring of hospitalized patients, particularly those with respiratory illnesses. Inaccurate estimation of true arterial blood oxygen saturation levels (SaO2) by pulse oximetry can occur for various physiologic, pathologic, technical or iatrogenic reasons.1 However, several studies have reported a systematic overestimation of SpO2 compared with SaO2 among individuals with skin of darker pigmentation compared with individuals with lighter pigmentation.2,3,4 Recently, Sjoding et al5 reported 3 times the frequency of occult hypoxemia among individuals self-identified as Black compared with White race, while Wong et al6 reported significantly greater risk of occult hypoxemia for Asian and Black patients and relatively higher in-hospital mortality for Black patients with occult hypoxemia compared with White patients.
Pulse oximetry has played a prominent role in guiding triage and therapy throughout the COVID-19 pandemic, during which recommendations for hospitalization and therapy have been commonly based on SpO2 thresholds. For example, while the US Food and Drug Administration has authorized the use of remdesivir in treating hospitalized patients with COVID-19, most institutions use a threshold of an SpO2 of 94% or less or the use of supplemental oxygen to determine eligibility for treatment with remdesivir.7,8 Similarly, dexamethasone is frequently used for patients who require supplemental oxygen.9 Overestimation of SpO2 may be associated with premature deescalation of therapies or hospital discharge, or could be associated with the delay or withholding of therapies that shorten the disease course, slow progression, or reduce mortality. SpO2 is also frequently used as a variable in clinical risk predictors, either alone or as part of the ratio of SpO2 to the fraction of inspired oxygen.10,11 An overestimation of oxygen saturation levels could be associated with an underappreciation of clinical risk as presented by these calculators.
To our knowledge, the performance of pulse oximetry in patients with COVID-19 and its potential association with clinical decision-making remains unexplored and can help identify factors that may explain the disproportionate COVID-19 mortality experienced by patients of certain racial and ethnic minority groups.12 The objective of this study was to investigate the frequency and magnitude of pulse oximeter bias in estimation of SaO2 associated with race and ethnicity within a diverse cohort of patients with COVID-19 who were treated in a large multihospital health system. Furthermore, this study aimed to determine the potential association of such biases with recognition of eligibility for guideline-recommended therapies. As a respiratory illness that affects many individuals with disproportionate morbidity and mortality among racial and ethnic minority populations, the COVID-19 pandemic provides a unique opportunity to investigate the effect of race and ethnicity–based inaccuracy of pulse oximetry.
Methods
The study was conducted using the JH-CROWN registry, which includes data collected during routine clinical care at 5 hospitals comprising 2513 beds in the Johns Hopkins Health System (Johns Hopkins Hospital, Baltimore, Maryland; Bayview Medical Center, Baltimore, Maryland; Howard County General Hospital, Columbia, Maryland; Suburban Hospital, Bethesda, Maryland; and Sibley Hospital, Washington, DC). The institutional review board of each hospital approved this study as minimal risk and waived informed consent requirements. COVID-19 was defined as the detection of SARS-CoV-2 from any nucleic acid test combined with an International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) code indicating the presence of symptomatic disease.11 Patients who were evaluated in the emergency department or hospitalized between March 4, 2020, and November 12, 2021, were eligible for inclusion. Some patients were included in prior studies.10,11,13,14,15,16,17,18
Exposures and Outcomes
Patients self-reported their race and ethnicity during hospital registration, which was then coded into prespecified categories (eTable 1 in the Supplement). Individuals who self-identified their race as Asian, Black or African American, or White and their ethnicity as Hispanic or non-Hispanic were included in this analysis. To most accurately reflect skin tone and because the health outcomes of Black Hispanic patients most closely align with non-Hispanic Black patients,19,20 individuals who reported Black race and Hispanic ethnicity were categorized as Black. Consequently, race was categorized as Asian, Black or African American (which comprised Hispanic and non-Hispanic ethnicity), non-Black Hispanic, or non-Hispanic White. The SpO2 readings and SaO2 values from arterial blood gases were collected from the electronic medical record. All arterial blood gases were analyzed using the ABL825, ABL827, or ABL90 blood gas analyzer (Radiometer America), which utilize cooximetry to determine oxygen saturation values. The primary outcome of interest was the difference between SpO2 and SaO2 for readings that occurred within 10 minutes of one another. Occult hypoxemia was defined as an SaO2 of less than 88% in the setting of an SpO2 of 92% to 96%.5 The secondary outcome of interest was the relative difference in time to recognition of eligibility for treatment with oxygen threshold–specific COVID-19 therapies between patients of racial and ethnic minority groups and White patients with COVID-19. Based on the joint guidelines of the US Centers for Disease Control and Prevention and Infectious Diseases Society of America, we set the threshold of interest as an oxygen saturation of 94% or less or use of supplemental oxygen, reflecting eligibility for treatment with dexamethasone or remdesivir.21,22
Statistical Analysis
For each SaO2 value, we first identified a concurrent SpO2 value, which was defined as an SpO2 measured within 10 minutes before or after the corresponding SaO2. If there were multiple values within the period, the closest one was selected. For the paired SpO2 and SaO2 values, the mean and standard error of the paired differences were calculated within each racial and ethnic category. The occurrence of occult hypoxemia (defined as SaO2 <88% with concurrent SpO2 measurement of 92%-96%) was presented as the proportion of individuals with at least 1 instance of occult hypoxemia during their hospital encounter and the proportion of measurements consistent with occult hypoxemia by racial and ethnic category.
The association between racial and ethnic category and the difference in paired SpO2 and SaO2 measurements (response variable) was examined using an unadjusted linear mixed-effects model that accounted for repeated measurements, with statistical significance defined as P < .02 based on Bonferroni correction. The linear mixed-effects model was then adjusted for covariates that captured disease severity or an underlying comorbidity or had a known or theoretical association with pulse oximetry accuracy, including demographic characteristics along with time-varying clinical and laboratory variables.1 All covariates and their interactions with race and ethnicity were included as fixed effects, and time-varying vital and laboratory measurements were included as random effects (complete model details are described in eMethods 1 in the Supplement). Only complete cases (ie, records with no missing data in covariates) were included in the adjusted linear mixed-effects model. R-squared (the proportion of variance explained by the model’s independent variables) was calculated to measure the goodness of fit of the model. The mean absolute prediction error from 10-fold cross validation was presented to show model prediction accuracy.
The fitted linear mixed-effects model was subsequently used to explore the association of differential pulse oximetry accuracy by race and ethnicity with recognition of eligibility for treatment with oxygen threshold–based COVID-19 pharmacologic therapy. Among all patients with COVID-19 who were admitted between March 4, 2020, and November 12, 2021, who were not already receiving supplemental oxygen or mechanical ventilation, we estimated the predicted SaO2 corresponding to each SpO2 measurement using the fitted linear mixed-effects model and nearest covariates. Individuals with missing covariates, those who did not have a predicted SaO2 of 94% or less, or those who initiated treatment with oxygen or had a measured SpO2 of 94% or less before a predicted SaO2 of 94% or less were excluded from this analysis.
Proportions were presented by racial and ethnic category for patients with delayed recognition or unrecognized treatment eligibility, defined as those patients with a predicted SaO2 of 94% or less before a measured SpO2 of 94% or less or oxygen treatment initiation (delayed recognition) or those with a predicted SaO2 of 94% or less who did not initiate treatment with oxygen or have a recorded SpO2 of 94% or less at any time (unrecognized). The difference in time to recognition of treatment eligibility between patients of racial and ethnic minority groups and White patients was estimated using a Cox proportional hazards model. Patients were censored at 96 hours if they were discharged before experiencing an event or experienced the event after 96 hours (censoring 2.3% of events).
Among individuals with delayed recognition of treatment eligibility, we calculated the length of delayed recognition as the time between predicted SaO2 of 94% or less and measured SpO2 of 94% or less or oxygen treatment initiation and compared the distributions between patients of racial and ethnic minority groups and White patients using Wilcoxon rank sum tests. A sensitivity analysis excluding observations in which titration of oxygen may have occurred between SpO2 and SaO2 measurement was also conducted (eMethods 2 in the Supplement). All statistical analyses were conducted using R, version 4.0.2 (R Foundation for Statistical Computing), and the linear mixed-effects model was fitted using the lme4 package.23
Results
Of the 7448 individuals evaluated in the emergency department or hospitalized for COVID-19 during the period of interest, 1216 patients (16.3%) had a total of 32 282 concurrently measured SpO2 and SaO2 levels during their hospital encounter, with a median (IQR) of 10 (2-30) measurements per patient. A patient flow diagram is presented in Figure 1, and characteristics stratified by race and ethnicity are presented in the Table. Compared with White patients, Black and non-Black Hispanic patients were younger, had longer hospital stays, and more SaO2 measurements per person. In addition, non-Black Hispanic patients had a smaller proportion of women and fewer comorbidities. The median SaO2 for Asian and Black patients was consistently lower than the SpO2 value (ie, SpO2 overestimated the true oxygen saturation) for all SpO2 readings of 88%- to 96%, while the median SaO2 was consistently higher than the SpO2 value (SpO2 underestimated the true oxygen saturation) for White patients with an SpO2 of 88% to 96% (Figure 2). Using the unadjusted linear mixed-effects model that was limited to SpO2 values from 88% to 96%, compared with White patients, SpO2 overestimated SaO2 by 2.1% for Asian patients (P < .001), 1.4% for Black patients (P < .001), and 0.8% for non-Black Hispanic patients (P < .001). Results were attenuated but qualitatively similar when considering the entire range of SpO2 measurements (eTable 2 in the Supplement).
Table. Baseline Characteristics of Patients Testing Positive for COVID-19 Who Had Arterial Oxygen Saturation and Pulse Oximetry Measured Concurrently Within 10 Minutes by Race and Ethnicity in the Johns Hopkins Health System Between March 4, 2020, and November 12, 2021.
Characteristic | No. (%) | |||
---|---|---|---|---|
Asian (n = 63) | Black (n = 478) | Hispanic (excluding Black Hispanic) (n = 215) | White, non-Hispanic (n = 460) | |
Age, mean (SD), y | 66.0 (18.3) | 58.7 (18.3) | 50.3 (18.4) | 64.5 (18.7) |
Female | 22 (34.9) | 238 (49.8) | 52 (24.2) | 195 (42.4) |
Male | 41 (65.1) | 240 (50.2) | 163 (75.8) | 265 (57.6) |
Location | ||||
Johns Hopkins Hospital | 10 (15.9) | 215 (45) | 72 (33.5) | 129 (28) |
Bayview Medical Center | 4 (6.3) | 96 (20.1) | 36 (16.7) | 135 (29.3) |
Howard County General Hospital | 28 (44.4) | 89 (18.6) | 40 (18.6) | 82 (17.8) |
Sibley Memorial Hospital | 2 (3.2) | 30 (6.3) | 11 (5.1) | 29 (6.3) |
Suburban Hospital | 17 (27.0) | 48 (10) | 49 (22.8) | 77 (16.7) |
Hemoglobin level on admission, mean (SD), g/dL | 12.5 (2) | 11.5 (2.4) | 12.8 (2.1) | 12.1 (2.4) |
Bilirubin level on admission, mean (SD), mg/dL | 1.2 (0.9) | 2.1 (2.5) | 1.2 (1.5) | 1.6 (1.5) |
Creatinine level on admission, mean (SD), mg/dL | 0.6 (0.6) | 0.8 (1.9) | 0.7 (0.5) | 0.9 (2.5) |
BMI | 25.7 (4.2) | 31.4 (10.1) | 31.5 (13.6) | 30.4 (8.5) |
Underweight (<18.5) | 2 (3.2) | 27 (5.6) | 7 (3.3) | 15 (3.3) |
Normal or overweight (18.5 to <30) | 50 (79.4) | 198 (41.4) | 95 (44.2) | 212 (46.1) |
Obesity (≥30) | 8 (12.7) | 229 (47.9) | 92 (42.8) | 193 (42) |
Charleston Comorbidity Index score | ||||
1-4 | 41 (65.1) | 307 (64.2) | 146 (67.9) | 319 (69.3) |
≥5 | 8 (12.7) | 111 (23.2) | 9 (4.2) | 97 (21.1) |
Diabetes | 38 (60.3) | 271 (56.7) | 103 (47.9) | 199 (43.3) |
COPD | 17 (27) | 198 (41.4) | 43 (20) | 211 (45.9) |
Peripheral vascular disease | 8 (12.7) | 88 (18.4) | 12 (5.6) | 90 (19.6) |
Chronic kidney disease | 17 (27) | 184 (38.5) | 27 (12.6) | 125 (27.2) |
Maximum respiratory support needed | ||||
Room air or oxygen by mask or nasal prongs | 7 (11.1) | 85 (17.8) | 28 (13) | 90 (20) |
Noninvasive ventilation or high-flow oxygen | 9 (14.3) | 41 (8.6) | 20 (9.3) | 38 (8.3) |
Mechanical ventilation | 47 (74.6) | 352 (73.6) | 167 (77.7) | 332 (72.2) |
No. of SpO2 measurements per person, median (IQR) | 9 (2-30.5) | 11 (2-28) | 14 (2.5-43.5) | 7 (2-27) |
Length of stay, median (IQR), d | 15.4 (10.7-31.5) | 18.1 (9.2-33.2) | 20.8 (10.7-34.1) | 15.9 (7.1-26.8) |
Died during hospitalization | 24 (38.1) | 153 (32) | 64 (29.8) | 203 (44.1) |
Abbreviations: BMI, body mass index (calculated as weight in kilograms divided by height in meters squared); COPD, chronic obstructive pulmonary disease; SpO2, pulsatile flow of arterial blood.
SI conversion factors: To convert bilirubin to µmol/L, multiply by 17.104. For creatinine to µmol/L, multiply by 88.4. For hemoglobin to g/L, multiply by 10.
Occult hypoxemia (SaO2 <88% with concurrent SpO2 measurement of 92%-96%) was identified in 3.7% of samples from Asian patients, 3.7% of samples from Black patients, 2.8% of samples from non-Black Hispanic patients, and 1.7% of samples from White patients. At the patient level, 19 Asian patients (30.2%), 136 Black patients (28.5%), 64 non-Black Hispanic patients (29.8%), and 79 White patients (17.2%) had occult hypoxemia at some point during the hospital encounter.
Results of the adjusted fully specified model are presented in eTable 3 and eFigure 1 in the Supplement. The interaction of race and ethnicity with SpO2 and oxygen device was statistically significant and included in the final parsimonious model (eTable 4 in the Supplement), which had an R2 0.88 and mean (SD) absolute prediction error of 2.64 (0.11). In the adjusted parsimonious linear mixed-effects model, compared with White patients, SpO2 overestimated SaO2 by an average of 1.7% among Asian patients (95% CI, 0.5%-3.0%; P = .01), 1.2% among Black patients (95% CI, 0.6%-1.9%; P < .001), and 1.1% among non-Black Hispanic patients (95% CI, 0.3%-1.9%; P = .01; Figure 3). The absolute differences between SaO2 and SpO2 by race and ethnicity as stratified by oxygen device are presented in eFigure 2 in the Supplement.
Among 6673 patients with SpO2 measurements and available covariate data for estimation of predicted SaO2 using the mixed-effects model, 1903 (28.5%) had a predicted SaO2 of 94% or less before a measured SpO2 of 94% or less or oxygen treatment initiation and were thus included in the analysis of unrecognized or delayed treatment eligibility (see eTable 5 in the Supplement for baseline characteristics). Within each race and ethnicity, more Black and Hispanic patients unrecognized or delayed recognition of treatment eligibility compared with White patients (25 Asian [7.1%], 928 Black [35.1%], and 445 non-Black Hispanic patients [38.0%] vs 505 White patients [20.1%]).
Compared with White patients, Black patients had a 29% lower hazard of eligibility recognition (hazard ratio [HR], 0.71; 95% CI, 0.63-0.80; P < .001) and non-Black Hispanic patients had a 23% lower hazard of eligibility recognition (HR 0.77; 95% CI, 0.66-0.89; P < .001), with no difference between Asian and White patients (HR, 0.97; 95% CI, 0.62-1.5; P = .90; Figure 4). Of these 1903 patients, 451 (23.7%) never had a recorded SpO2 of 94% or less or received oxygen therapy and were considered to have unrecognized treatment eligibility. Most of these patients were Black (247 [54.8%]), followed by non-Black Hispanic (122 [27.1%]), White (78 [17.3%]), and Asian (4 [0.9%]). Among the remaining 1452 patients with eventual recognition of eligibility, the median (IQR) time until recognition was highest for Asian patients (7.7 [3.5-13.6] hours) and Black patients (7.0 [1.9-20.8] hours), followed by non-Black Hispanic patients (5.0 [1.2-15.8] hours) and White patients (5.3 [1.4-15.2] hours). Compared with White patients, Black patients had a significantly higher median difference in time to recognition of eligibility by 1 hour (95% CI, 0.23-1.9 hours; P = .01). There was no statistically significant difference in treatment delay between Asian and Hispanic patients and White patients (eFigure 3 and eTable 6 in the Supplement). The sensitivity analysis refitting the linear mixed-effects model after excluding observations in which treatment with supplemental oxygen may have been titrated between SpO2 and SaO2 measurement produced similar results (eResults in the Supplement).
Discussion
In this retrospective analysis of a large clinical data set of patients with COVID-19 who were admitted to a regional health care system, we identified persistent overestimation of arterial oxygen saturation among Asian, Black, and Hispanic individuals. This was associated with systematic misclassification of patient oxygenation status based on racial or ethnic self-identification. When comparing pulse oximetry measurements with arterial oxygen saturation, approximately one-third of patients from each racial or ethnic minority group had at least 1 unidentified episode of hypoxia compared with fewer than one-fifth of White patients. In addition, we found a systematic failure to identify Black and Hispanic patients who were likely qualified to receive COVID-19 therapy and a statistically significant delay in recognizing the guideline-recommended threshold for initiation of therapy among Black patients compared with White patients.
A recent investigation also reported overestimation of arterial oxygen saturation levels by pulse oximetry among Black individuals, with a 2.7-fold greater incidence of occult hypoxemia in Black patients compared with White patients, which was corroborated in a pre–COVID-19 pandemic sample of patients who were about to undergo extracorporeal membrane oxygenation.5,24 By limiting the patient sample to individuals who had a common pulmonary diagnosis (COVID-19), the present study minimizes the patient heterogeneity in the prior investigations while accounting for several factors that may be associated with pulse oximetry inaccuracy. This study also extends these findings by demonstrating similar overestimation of oxygen saturation levels, occult hypoxemia, and associated disparities in recognition of treatment eligibility among non-Black Hispanic patients.
The finding that occult hypoxemia affected more patients from all racial and ethnic minority groups compared with White patients is at odds with the finding by Wong et al6 that occult hypoxemia was similar between Asian and White patients. This discrepancy may result from Wong et al6 limiting their investigation to the first arterial blood gas measurement or heterogeneity of the Asian populations between the studies. Two recent studies from the United Kingdom of individuals with COVID-19 reported conflicting results and were limited by overall small sample sizes comprising predominantly White patients with few Black patients (<10%) and no Hispanic patients.25,26
By leveraging a large data set of paired SaO2-SpO2 measurements to predict the time at which therapies would become indicated, we found that most patients who had an unrecognized indication for COVID-19 therapy were Black, along with statistically significant differences in time to recognition of treatment eligibility for Black and Hispanic patients compared with White patients. While long-term clinical data may inform whether the delay in recognition of treatment eligibility had clinical consequences, these data are not yet available for this cohort. However, recognizing that delays in care may exist because of inaccurate pulse oximetry measurements informs studies that have investigated COVID-19 outcomes among patients of racial and ethnic minority groups.12,27,28,29,30 Because occult hypoxia is associated with higher in-hospital mortality, particularly among racial and ethnic minority groups,6 it is possible that occult hypoxia in COVID-19 contributes to racial and ethnic disparities in COVID-19 outcomes.31,32,33
While pulse oximetry has become a fundamental tool in diagnosis, triage, and management decisions in the acute care setting, the device’s lack of accuracy in certain populations has not been adequately investigated or addressed, although it has been recognized for several decades34 and was highlighted in a 2020 safety communication by the US Food and Drug Administration.35 With the integration of economical light-emitting diodes and semiconductors, pulse oximeters have also migrated out of the acute care setting and are now available and affordable to the average consumer for home use. The expanded use of a differentially inaccurate device potentially exacerbates racial and ethnic health disparities. For example, a Black individual who tests positive for COVID-19 in the outpatient setting may be advised against or decide to delay seeking care based on false reassurance from normal pulse oximetry readings. Such a scenario was reflected in the overrepresentation of Black patients who had unrecognized eligibility for treatment with remdesivir and dexamethasone. The differential inaccuracy of pulse oximetry in racial and ethnic minority groups may be similarly associated with treatment and triage decisions for other respiratory illnesses, such as pneumonia and acute respiratory distress syndrome. Although increased awareness of the limitations of pulse oximetry may mitigate some of the adverse effects, innovative approaches, such as the integration of additional wavelengths36 or calibration based on skin pigmentation, are needed. Unlike the use of race and ethnicity in the calculation of glomerular filtration rate37 or interpretation of pulmonary function testing,38 which have come under recent scrutiny, the race and ethnicity–based discrepancy of pulse oximetry exposes a fundamental flaw in the acquisition rather than interpretation of data, although all the aforementioned biases are associated with systematic underdiagnosis of disease or withholding of therapies for racial and ethnic minority groups.39,40,41,42,43
Although the principal objective of the study was to characterize relative biases in pulse oximetry, the absolute biases identified among all racial and ethnic minority groups are notable. While some variability may be explained by the allowance of 10 minutes between paired SpO2 and SaO2 measurements, such variability would not explain the systematic error among White patients that is separate from the pulse oximeter’s imprecision. This imprecision was more pronounced among racial and ethnic minority groups, as reflected by the standard errors in eTable 2 in the Supplement, which corroborates previous findings and may exacerbate the biases identified in this investigation.44 This study’s results suggest that for critical treatment decisions that rely on arterial oxygen saturation, use of pulse oximetry may be inadequate and produce opportunities for undertreatment and overtreatment irrespective of the patient’s race or ethnicity. This mirrors inaccuracies in capillary blood glucose levels reported among patients with critical illness.45
Strengths and Limitations
The strengths of this study include a large diverse patient population spanning multiple hospitals that allows statistical inferences to be made for several racial or ethnic minority groups; many paired measurements, which decreases susceptibility to bias; a curated data set with prior validation of important covariates; and the relative uniformity of underlying disease state. However, there are several limitations. Self-reported race and ethnicity is used in this study as a surrogate marker for skin tone, which is not routinely collected in the clinical setting. Thus, we were unable to measure or account for heterogeneity in skin tones within each racial or ethnic group. Rapid fluctuations in clinical condition among patients with COVID-19 may introduce downward bias in SpO2-SaO2 calculations if there is a preponderance of patients who quickly deteriorate after their SpO2 measurement is logged. However, such biases should not be different between racial and ethnic minority groups. Because the analysis estimating racial and ethnic difference in SpO2-SaO2 required a blood gas measurement, results may not be generalizable to healthy individuals or those with less acute illness. Furthermore, use of an oxygen saturation threshold to represent recognition of drug eligibility is an imperfect estimate of the actual delay in treatment that is attributable to pulse oximetry, as clinical attention, time to ordering, medication delivery, and drug availability are all contributory factors. However, this would tend to conservatively bias estimates. Finally, treatment delay estimates depend on the frequency with which SpO2 was measured or recorded, which may potentially differentially underestimate delay in recognition of treatment eligibility based on the patient’s level of care.
Conclusions
In this cohort study using a large multihospital COVID-19 clinical data set, we found statistically significant and persistent overestimation of arterial oxygen saturation by pulse oximetry among Asian, Black, and Hispanic patients compared with non-Hispanic White patients. Black and Hispanic patients were more likely to experience unrecognized and delayed recognition of eligibility to receive COVID-19 therapy. Differential inaccuracies in pulse oximetry should be examined as a potential explanation for disparities in COVID-19 outcomes and may have implications for the monitoring and treatment of other respiratory illnesses.
References
- 1.Jubran A. Pulse oximetry. Crit Care. 2015;19(1):272. doi: 10.1186/s13054-015-0984-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Zeballos RJ, Weisman IM. Reliability of noninvasive oximetry in black subjects during exercise and hypoxia. Am Rev Respir Dis. 1991;144(6):1240-1244. doi: 10.1164/ajrccm/144.6.1240 [DOI] [PubMed] [Google Scholar]
- 3.Bickler PE, Feiner JR, Severinghaus JW. Effects of skin pigmentation on pulse oximeter accuracy at low saturation. Anesthesiology. 2005;102(4):715-719. doi: 10.1097/00000542-200504000-00004 [DOI] [PubMed] [Google Scholar]
- 4.Feiner JR, Severinghaus JW, Bickler PE. Dark skin decreases the accuracy of pulse oximeters at low oxygen saturation: the effects of oximeter probe type and gender. Anesth Analg. 2007;105(6)(suppl):S18-S23. doi: 10.1213/01.ane.0000285988.35174.d9 [DOI] [PubMed] [Google Scholar]
- 5.Sjoding MW, Dickson RP, Iwashyna TJ, Gay SE, Valley TS. Racial bias in pulse oximetry measurement. N Engl J Med. 2020;383(25):2477-2478. doi: 10.1056/NEJMc2029240 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Wong AI, Charpignon M, Kim H, et al. Analysis of discrepancies between pulse oximetry and arterial oxygen saturation measurements by race and ethnicity and association with organ dysfunction and mortality. JAMA Netw Open. 2021;4(11):e2131674. doi: 10.1001/jamanetworkopen.2021.31674 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.US Food and Drug Administration . FDA approves first treatment for COVID-19. Accessed October 12, 2021. https://www.fda.gov/news-events/press-announcements/fda-approves-first-treatment-covid-19
- 8.Garibaldi BT, Wang K, Robinson ML, et al. Comparison of time to clinical improvement with vs without remdesivir treatment in hospitalized patients with COVID-19. JAMA Netw Open. 2021;4(3):e213071. doi: 10.1001/jamanetworkopen.2021.3071 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Horby P, Lim WS, Emberson JR, et al. ; RECOVERY Collaborative Group . Dexamethasone in hospitalized patients with Covid-19. N Engl J Med. 2021;384(8):693-704. doi: 10.1056/NEJMoa2021436 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Garibaldi BT, Fiksel J, Muschelli J, et al. Patient trajectories among persons hospitalized for COVID-19: a cohort study. Ann Intern Med. 2021;174(1):33-41. doi: 10.7326/M20-3905 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Wongvibulsin S, Garibaldi BT, Antar AAR, et al. Development of Severe COVID-19 Adaptive Risk Predictor (SCARP), a calculator to predict severe disease or death in hospitalized patients with COVID-19. Ann Intern Med. 2021;174(6):777-785. doi: 10.7326/M20-6754 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.US Centers for Disease Control and Prevention . Cases, data, and surveillance. Accessed June 8, 2021. https://www.cdc.gov/coronavirus/2019-ncov/covid-data/investigations-discovery/hospitalization-death-by-race-ethnicity.html
- 13.Karaba SM, Jones G, Helsel T, et al. Prevalence of co-infection at the time of hospital admission in COVID-19 patients: a multicenter study. Open Forum Infect Dis. 2020;8(1):ofaa578. doi: 10.1093/ofid/ofaa578 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Andersen KM, Mehta HB, Palamuttam N, et al. Association between chronic use of immunosuppressive drugs and clinical outcomes from coronavirus disease 2019 (COVID-19) hospitalization: a retrospective cohort study in a large US health system. Clin Infect Dis. 2021;73(11):e4124-e4130. Published online January 7, 2021. doi: 10.1093/cid/ciaa1488 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Scully EP, Schumock G, Fu M, et al. Sex and gender differences in testing, hospital admission, clinical presentation, and drivers of severe outcomes from COVID-19. Open Forum Infect Dis. 2021;8(9):ofab448. doi: 10.1093/ofid/ofab448 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Bowring MG, Wang Z, Xu Y, et al. Outcome-stratified analysis of biomarker trajectories for patients infected with severe acute respiratory syndrome coronavirus 2. Am J Epidemiol. 2021;190(10):2094-2106. doi: 10.1093/aje/kwab138 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Boulanger M, Molina E, Wang K, Kickler T, Xu Y, Garibaldi BT. Peripheral plasma cells associated with mortality benefit in severe COVID-19: a marker of disease resolution. Am J Med. 2021;134(8):1029-1033. doi: 10.1016/j.amjmed.2021.01.040 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Ignatius EH, Wang K, Karaba A, et al. Tocilizumab for the treatment of COVID-19 among hospitalized patients: a matched retrospective cohort analysis. Open Forum Infect Dis. 2020;8(1):ofaa598. doi: 10.1093/ofid/ofaa598 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Borrell LN. Racial identity among Hispanics: implications for health and well-being. Am J Public Health. 2005;95(3):379-381. doi: 10.2105/AJPH.2004.058172 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Cuevas AG, Dawson BA, Williams DR. Race and skin color in Latino health: an analytic review. Am J Public Health. 2016;106(12):2131-2136. doi: 10.2105/AJPH.2016.303452 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.COVID-19 Real-Time Learning Network . Immunomodulators. Accessed October 4, 2021. https://www.idsociety.org/covid-19-real-time-learning-network/therapeutics-and-interventions/immunomodulators/
- 22.COVID-19 Real-Time Learning Network . Remdesivir. Accessed October 4, 2021. https://www.idsociety.org/covid-19-real-time-learning-network/therapeutics-and-interventions/remdesivir/
- 23.Bates D, Mächler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. J Stat Softw. 2015;67:1-48. doi: 10.18637/jss.v067.i01 [DOI] [Google Scholar]
- 24.Valbuena VSM, Barbaro RP, Claar D, et al. Racial bias in pulse oximetry measurement among patients about to undergo extracorporeal membrane oxygenation in 2019-2020: a retrospective cohort study. Chest. 2022;161(4):971-978. doi: 10.1016/j.chest.2021.09.025 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Crooks CJ, West J, Morling JR, et al. Pulse oximeters’ measurements vary across ethnic groups: an observational study in patients with Covid-19 infection. Eur Respir J. 2022;2103246. doi: 10.1183/13993003.03246-2021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wiles MD, El-Nayal A, Elton G, et al. The effect of patient ethnicity on the accuracy of peripheral pulse oximetry in patients with COVID-19 pneumonitis: a single-centre, retrospective analysis. Anaesthesia. 2022;77(2):143-152. doi: 10.1111/anae.15581 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Sze S, Pan D, Nevill CR, et al. Ethnicity and clinical outcomes in COVID-19: a systematic review and meta-analysis. EClinicalMedicine. 2020;29:100630. doi: 10.1016/j.eclinm.2020.100630 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Adegunsoye A, Ventura IB, Liarski VM. Association of Black race with outcomes in COVID-19 disease: a retrospective cohort study. Ann Am Thorac Soc. 2020;17(10):1336-1339. doi: 10.1513/AnnalsATS.202006-583RL [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Williamson EJ, Walker AJ, Bhaskaran K, et al. Factors associated with COVID-19-related death using OpenSAFELY. Nature. 2020;584(7821):430-436. doi: 10.1038/s41586-020-2521-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Clift AK, Coupland CAC, Keogh RH, et al. Living risk prediction algorithm (QCOVID) for risk of hospital admission and mortality from coronavirus 19 in adults: national derivation and validation cohort study. BMJ. 2020;371:m3731. doi: 10.1136/bmj.m3731 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.US Centers for Disease Control and Prevention . COVID-19 racial and ethnic disparities. Accessed December 14, 2021. https://www.cdc.gov/coronavirus/2019-ncov/community/health-equity/racial-ethnic-disparities/index.html
- 32.Holmes L Jr, Enwere M, Williams J, et al. Black-White risk differentials in COVID-19 (SARS-COV2) transmission, mortality and case fatality in the United States: translational epidemiologic perspective and challenges. Int J Environ Res Public Health. 2020;17(12):4322. doi: 10.3390/ijerph17124322 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Ebinger JE, Achamallah N, Ji H, et al. Pre-existing traits associated with Covid-19 illness severity. PLoS One. 2020;15(7):e0236240. doi: 10.1371/journal.pone.0236240 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Tobin MJ. Respiratory monitoring. JAMA. 1990;264(2):244-251. doi: 10.1001/jama.1990.03450020096034 [DOI] [PubMed] [Google Scholar]
- 35.US Food and Drug Administration . Pulse oximeter accuracy and limitations. Accessed October 5, 2021. https://www.fda.gov/medical-devices/safety-communications/pulse-oximeter-accuracy-and-limitations-fda-safety-communication
- 36.Aoyagi T, Fuse M, Kobayashi N, Machida K, Miyasaka K. Multiwavelength pulse oximetry: theory for the future. Anesth Analg. 2007;105(6)(suppl):S53-S58. doi: 10.1213/01.ane.0000268716.07255.2b [DOI] [PubMed] [Google Scholar]
- 37.Hsu J, Johansen KL, Hsu CY, Kaysen GA, Chertow GM. Higher serum creatinine concentrations in Black patients with chronic kidney disease: beyond nutritional status and body composition. Clin J Am Soc Nephrol. 2008;3(4):992-997. doi: 10.2215/CJN.00090108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Braun L. Race correction and spirometry: why history matters. Chest. 2021;159(4):1670-1675. doi: 10.1016/j.chest.2020.10.046 [DOI] [PubMed] [Google Scholar]
- 39.Eneanya ND, Yang W, Reese PP. Reconsidering the consequences of using race to estimate kidney function. JAMA. 2019;322(2):113-114. doi: 10.1001/jama.2019.5774 [DOI] [PubMed] [Google Scholar]
- 40.Vyas DA, Eisenstein LG, Jones DS. Hidden in plain sight—reconsidering the use of race correction in clinical algorithms. N Engl J Med. 2020;383(9):874-882. doi: 10.1056/NEJMms2004740 [DOI] [PubMed] [Google Scholar]
- 41.Moffett AT, Eneanya ND, Halpern SD, Weissman GE. The impact of race correction on the interpretation of pulmonary function testing among Black patients. Am J Respir Crit Care Med. 2021;203:A1030. doi: 10.1164/ajrccm-conference.2021.203.1_MeetingAbstracts.A1030 [DOI] [Google Scholar]
- 42.Anderson MA, Malhotra A, Non AL. Could routine race-adjustment of spirometers exacerbate racial disparities in COVID-19 recovery? Lancet Respir Med. 2021;9(2):124-125. doi: 10.1016/S2213-2600(20)30571-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.McCormack MC, Balasubramanian A, Matsui EC, Peng RD, Wise RA, Keet CA. Race, lung function, and long-term mortality in the National Health and Nutrition Examination Survey III. Am J Respir Crit Care Med. 2022;205(6):723-724. doi: 10.1164/rccm.202104-0822LE [DOI] [PubMed] [Google Scholar]
- 44.Jubran A, Tobin MJ. Reliability of pulse oximetry in titrating supplemental oxygen therapy in ventilator-dependent patients. Chest. 1990;97(6):1420-1425. doi: 10.1378/chest.97.6.1420 [DOI] [PubMed] [Google Scholar]
- 45.Kulkarni A, Saxena M, Price G, O’Leary MJ, Jacques T, Myburgh JA. Analysis of blood glucose measurements using capillary and arterial blood samples in intensive care patients. Intensive Care Med. 2005;31(1):142-145. doi: 10.1007/s00134-004-2500-5 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.