This cohort study assesses whether there are racial and ethnic disparities in supplemental oxygen administration associated with inconsistent pulse oximeter performance.
Key Points
Question
Are there differences in supplemental oxygen administration among patients of different races and ethnicities associated with pulse oximeter performance discrepancies?
Findings
In this cohort study of 3069 patients in the intensive care unit, Asian, Black, and Hispanic patients had a higher adjusted time-weighted average pulse oximetry reading and were administered significantly less supplemental oxygen for a given average hemoglobin oxygen saturation compared with White patients.
Meaning
There were differences in supplemental oxygen administration between Asian, Black, and Hispanic patients and White patients that were associated with pulse oximeter performance and may contribute to racial and ethnic disparities in care.
Abstract
Importance
Pulse oximetry (SpO2) is routinely used for transcutaneous monitoring of blood oxygenation, but it can overestimate actual oxygenation. This is more common in patients of racial and ethnic minority groups. The extent to which these discrepancies are associated with variations in treatment is not known.
Objective
To determine if there are racial and ethnic disparities in supplemental oxygen administration associated with inconsistent pulse oximeter performance.
Design, Setting, and Participants
This retrospective cohort study was based on the Medical Information Mart for Intensive Care (MIMIC)-IV critical care data set. Included patients were documented with a race and ethnicity as Asian, Black, Hispanic, or White and were admitted to the intensive care unit (ICU) for at least 12 hours before needing advanced respiratory support, if any. Oxygenation levels and nasal cannula flow rates for up to 5 days from ICU admission or until the time of intubation, noninvasive positive pressure ventilation, high-flow nasal cannula, or tracheostomy were analyzed.
Main Outcomes and Measures
The primary outcome was time-weighted average supplemental oxygen rate. Covariates included race and ethnicity, sex, SpO2–hemoglobin oxygen saturation discrepancy, data duration, number and timing of blood gas tests on ICU days 1 to 3, partial pressure of carbon dioxide, hemoglobin level, average respiratory rate, Elixhauser comorbidity scores, and need for vasopressors or inotropes.
Results
This cohort included 3069 patients (mean [SD] age, 66.9 [13.5] years; 83 were Asian, 207 were Black, 112 were Hispanic, 2667 were White). In a multivariable linear regression, Asian (coefficient, 0.602; 95% CI, 0.263 to 0.941; P = .001), Black (coefficient, 0.919; 95% CI, 0.698 to 1.140; P < .001), and Hispanic (coefficient, 0.622; 95% CI, 0.329 to 0.915; P < .001) race and ethnicity were all associated with a higher SpO2 for a given hemoglobin oxygen saturation. Asian (coefficient, −0.291; 95% CI, −0.546 to −0.035; P = .03), Black (coefficient, −0.294; 95% CI, −0.460 to −0.128; P = .001), and Hispanic (coefficient, −0.242; 95% CI, −0.463 to −0.020; P = .03) race and ethnicity were associated with lower average oxygen delivery rates. When controlling for the discrepancy between average SpO2 and average hemoglobin oxygen saturation, race and ethnicity were not associated with oxygen delivery rate. This discrepancy mediated the effect of race and ethnicity (−0.157; 95% CI, −0.250 to −0.057; P = .002).
Conclusions and Relevance
In this cohort study, Asian, Black, and Hispanic patients received less supplemental oxygen than White patients, and this was associated with differences in pulse oximeter performance, which may contribute to known race and ethnicity–based disparities in care.
Introduction
Since its invention in 1974, near-infrared pulse oximetry (SpO2) has allowed for convenient, noninvasive transcutaneous monitoring of arterial hemoglobin oxygen saturation.1,2 This technology uses spectrophotometry to indirectly calculate the arterial hemoglobin saturation by determining the proportion of oxyhemoglobin in peripheral arterial blood. The accuracy of SpO2 is usually within 2 to 3 percentage points of true blood hemoglobin oxygen saturation when arterial saturations are greater than 90%, but the accuracy is reduced as saturations decrease to less than 90%.3,4
It has been known for decades that these readings are affected by various surface pigmentations, including nail polish and skin melanin, which may affect light absorption and scattering.5,6 Previous clinical studies in adults and children have shown clear differences in temporally associated SpO2 and blood hemoglobin oxygen saturation associations according to race and ethnicity, generally with higher SpO2 readings in patients of racial and ethnic minority groups.7,8,9,10 This increases the risk of hidden hypoxemia, in which patients have falsely elevated SpO2 readings, usually defined as 92% or greater, with a blood hemoglobin oxygen saturation less than 88%.9
While patients of all races and ethnicities are subject to hidden hypoxemia, the higher incidence in patients of racial and ethnic minority groups could possibly lead to more insufficient treatment in this population and contribute to known population disparities in outcomes, including those seen during the COVID-19 pandemic.11,12,13,14 In fact, it has been shown that hidden hypoxemia is associated with higher mortality rates.9
However, few studies have directly assessed the factors that may mediate these downstream effects.15 Potential culprits may include insufficient administration of supplemental oxygen and differences in initiation and management of noninvasive and invasive mechanical ventilation. Pulse oximeter performance disparities may also play a role in decision-making regarding fluid management, specialty service consultation, and intensive care unit (ICU) admission.16 Artificially high SpO2 readings in the emergency department could affect the perceived need for cardiology service admission for heart failure management, possibly explaining the finding that Black and Hispanic patients were less likely than White patients to be admitted to a cardiology service.17
The objective of this study was to assess if there are disparities in supplemental oxygen administration between Asian, Black, and Hispanic patients and White patients in the ICU and whether they are associated with discrepancies in pulse oximeter performance.
Methods
Research Ethics
This study is based on the Medical Information Mart for Intensive Care (MIMIC)-IV critical care data set. It was approved for research by the institutional review boards of Beth Israel Deaconess Medical Center in Boston, Massachusetts (2001-P-001699/14), and the Massachusetts Institute of Technology (0403000206) without a requirement for individual patient informed consent because data were deidentified and publicly available.18
Cohort Selection
MIMIC-IV data were queried from the Google BigQuery (Alphabet Inc) cloud platform using RStudio, version 1.4 (RStudio PBC) with the R, version 4.0.3 programming language (R Foundation for Statistical Computing). MIMIC-IV includes data on 76 540 ICU stays for 53 150 unique patients (computed by authors) at Beth Israel Deaconess Medical Center. These include the years 2008 through 2019, and therefore there were no patients with COVID-19.19 MIMIC-IV does not provide an admission diagnosis for each patient, but Sequential Organ Failure Assessment (SOFA) scores are used to specify the type and severity of organ dysfunction.
Selection criteria, as well as the number and proportion of patients of each race and ethnicity that were excluded at each step, are shown in Figure 1. An initial cohort was identified by downloading vital sign data for patients’ first ICU admission in the data set. Data were limited to the first period of up to 5 days from ICU admission or until the time point at which the patient received invasive or noninvasive mechanical ventilation, high-flow nasal cannula, or a tracheostomy, whichever came first. This period is subsequently referred to as the index period. Patients were excluded if they did not have a documented race or ethnicity of Asian, Black, Hispanic, or White (groups that were excluded included “other,” “unknown,” “unable to obtain,” and “American Indian/Alaska Native”). Patients were also excluded if they were missing key variables from the index period, including supplemental oxygen (or room air) records, initial partial pressure of carbon dioxide (pCO2), hemoglobin level, and vital sign data, and/or had an index period of less than 12 hours. This study followed the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guidelines for observational studies.20
Pulse Oximetry and Vital Signs
Vital sign records within the index period, including temperatures (converted to Celsius), heart rates, mean arterial pressures, respiratory rates, and SpO2, were extracted. For each vital sign type, area under the curve was calculated using the trapezoid method with the “MLmetrics” R package21 and was divided by the number of minutes between the first and last reading to produce a time-weighted average value.
Hemoglobin Oxygen Saturation
To avoid aberrant values as well as likely venous blood gas values, which are not directly distinguished from arterial blood gas values under some laboratory codes, readings were removed if the hemoglobin oxygen saturation was less than 70% or greater than 100%. For each patient, the area under the curve was calculated from all remaining timed blood gas records during the index period and was divided by the difference in time between the first and last blood gas reading to give a time-weighted average value. If patients had only 1 blood gas reading, this value was used instead of a calculated average. The number of blood gas tests on each day of the index period was determined as well.
Supplemental Oxygen Rate
To compute estimates of average supplemental oxygen rates, all nasal cannula flow rates and time on room air records within the index period were downloaded from a derived oxygen supplementation table in MIMIC-IV. Numbers of oxygen records of other device types were reviewed to determine that these were infrequent relative to nasal cannula for most patients and could reasonably be excluded. For example, if a patient had a nasal cannula record at hour 1, face tent at hour 2, and a nasal cannula again at hour 3, the patient was recorded as having a nasal cannula at the respective rates from hour 1 through hour 3. Supplemental oxygen area under the curve was calculated and was divided by the duration of time in seconds between the first and last recorded rate. For 100 patients for whom only 1 oxygen rate was recorded, this rate was used instead of the average.
Other Data Elements
The SOFA score is available in another derived table in MIMIC-IV. It includes 6 components based on organ systems (respiration, coagulation, liver, cardiovascular, central nervous system, and kidney), with a score of 1 to 4 for each system and an overall maximum score of 24 (most severe organ dysfunction).22 For each patient, the maximum of these scores during the index period was extracted. Vasopressor or inotrope dose rates of infusion were extracted from the fluid input table, and a binary variable was created to indicate whether any vasopressors or inotropes were required during the index period. Elixhauser comorbidity scores were computed from International Classification of Diseases, Ninth Revision (ICD-9) and International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) codes using the R “Comorbidity” package.23,24
Statistical Analysis
Continuous values were compared using a nonparametric unpaired Wilcoxon rank test. Multivariable linear regression was performed to determine the association between blood hemoglobin oxygen saturation and SpO2, and associations with oxygen delivery. Specifically, models with race and ethnicity as a covariate were developed to show associations, including (1) SpO2 as a function of criterion standard hemoglobin oxygen saturation to confirm different device performance between races and ethnicities; (2) supplemental oxygen rates for a given hemoglobin oxygen saturation to assess for differences in oxygen delivery by race and ethnicity; (3) similar to model 2 with an added variable for the difference between SpO2 and hemoglobin oxygen saturation to determine whether this discrepancy mediates the effect of race; and (4) supplemental oxygen rates, controlling for SpO2, to determine whether there are differences in oxygen rates when titrating to pulse oximeter readings. An additional model (5) used heparin rate as the outcome and the same independent variables as model 2 to rule out a spurious correlation.
To assess the SpO2–hemoglobin oxygen saturation discrepancy as a mediator of the effect of race and ethnicity, we performed a formal mediation analysis, with race and ethnicity as the treatment variable, using the R “Mediate” package.25 Summary results included the average causal mediation effect, the average direct effect, and the total effect.
For all models, clinically relevant covariates were selected for analysis. Nonsignificant covariates were excluded from the final regression models except to maintain comparability between related models. Given that patients missing key data points for regression analysis were excluded as described above, no imputation was performed. For all analyses, a 2-sided 95% CI was used, and a P value of less than .05 was considered statistically significant.
Results
Cohort Characteristics
There were 3069 patients included in this study. Patient baseline characteristics, stratified by race and ethnicity, are shown in Table 1. Additional variables relevant to the hospital course, including numbers of blood gas draws, eventual ventilation and high flow oxygen needs, and care units by race and ethnicity are shown in Table 2. The mean (SD) ages of Asian (n = 83), Black (n = 207), Hispanic (n = 112), and White (n = 2667)patients were 64.2 (14.9), 64.0 (13.8), 59.3 (14.5), and 67.6 (13.3)years, respectively. A lower proportion of Black patients were male compared with patients of other races and ethnicities (Asian, 59 [71.1%]; Black, 103 [49.8%]; Hispanic, 68 [60.7%]; White, 1751 [65.7%]). Mortality rates were 3.6% (3 of 83) for Asian patients, 8.2% (17 of 207) for Black patients, 7.1% (8 of 112) for Hispanic patients, and 6.5% (174 of 2667) for White patients. SOFA score components and laboratory values were overall well matched across races and ethnicities.
Table 1. Clinical and Demographic Characteristics of Patients by Race and Ethnicity.
Variable | Median (IQR) | |||
---|---|---|---|---|
Asian patients | Black patients | Hispanic patients | White patients | |
No. | 83 | 207 | 112 | 2667 |
Age, mean (SD), y | 64.2 (14.9) | 64.0 (13.8) | 59.3 (14.5) | 67.6 (13.3) |
Sex, No. (%) | ||||
Female | 24 (28.9) | 104 (50.2) | 44 (39.3) | 916 (34.3) |
Male | 59 (71.1) | 103 (49.8) | 68 (60.7) | 1751 (65.7) |
Died, No. (%) | 3 (3.6) | 17 (8.2) | 8 (7.1) | 174 (6.5) |
Elixhauser score (SD) | 3.9 (2.4) | 5.1 (2.3) | 4.6 (2.6) | 4.3 (2.3) |
Lactate, mg/dL | 19.8 (13.5 to 26.1) | 17.1 (11.7 to 24.3) | 18.9 (13.5 to 27.0) | 16.2 (11.7 to 22.5) |
Sodium, mEq/L | 136.0 (134.5 to 139.0) | 138.0 (135.0 to 141.0) | 135.5 (133.0 to 138.0) | 136.0 (134.0 to 139.0) |
Potassium, mEq/L | 4.3 (3.9 to 4.8) | 4.3 (3.8 to 5.0) | 4.3 (3.8 to 4.9) | 4.4 (3.9 to 4.9) |
Bicarbonate, mEq/L | 23.0 (21.0 to 25.0) | 23.0 (20.0 to 26.0) | 23.0 (21.0 to 25.0) | 23.0 (21.0 to 25.0) |
BUN, mg/dL | 16.0 (11.0 to 22.5) | 19.0 (13.0 to 37.0) | 17.5 (12.0 to 27.2) | 17.0 (13.0 to 25.0) |
Creatinine, mg/dL | 0.8 (0.6 to 1.1) | 1.1 (0.8 to 2.1) | 0.9 (0.7 to 1.3) | 0.9 (0.7 to 1.2) |
AST, IU/L | 40.5 (25.5 to 167.5) | 50.0 (29.0 to 105.0) | 44.0 (27.0 to 114.0) | 41.0 (26.0 to 84.0) |
ALT, IU/L | 33.5 (18.5 to 87.8) | 33.0 (18.5 to 67.2) | 28.0 (18.0 to 85.0) | 27.0 (15.0 to 57.0) |
Total bilirubin, mg/dL | 0.7 (0.4 to 2.1) | 0.7 (0.3 to 1.5) | 1.0 (0.6 to 2.0) | 0.7 (0.5 to 1.6) |
Albumin, g/dL | 3.1 (2.6 to 3.3) | 3.0 (2.5 to 3.4) | 3.3 (2.8 to 3.9) | 3.1 (2.6 to 3.5) |
White blood cells, K/uL | 10.3 (7.3 to 13.8) | 9.6 (7.3 to 13.8) | 10.9 (8.0 to 15.6) | 11.9 (8.7 to 15.4) |
Hemoglobin, g/dL | 9.8 (8.2 to 11.2) | 9.9 (8.4 to 11.5) | 9.9 (8.2 to 11.6) | 10.3 (8.9 to 11.8) |
Platelets, K/uL | 159.0 (122.0 to 223.0) | 165.0 (118.0 to 223.5) | 160.0 (112.8 to 235.5) | 161.0 (124.0 to 217.2) |
Initial pO2, mm Hg | 254.0 (87.5 to 344.5) | 109.0 (71.0 to 288.0) | 198.0 (90.5 to 321.2) | 231.0 (87.5 to 340.0) |
Initial pCO2, mm Hg | 39.0 (34.0 to 44.5) | 40.0 (34.5 to 45.0) | 40.0 (35.0 to 44.0) | 40.0 (36.0 to 45.0) |
Max SOFA, mean (SD) | ||||
Respiration | 1.1 (0.9) | 1.0 (1.0) | 1.0 (0.9) | 1.3 (0.9) |
Kidney | 0.6 (1.1) | 1.3 (1.4) | 0.8 (1.2) | 0.7 (1.1) |
Cardiovascular | 1.3 (1.0) | 1.4 (1.1) | 1.5 (1.1) | 1.5 (1.1) |
Liver | 0.3 (0.9) | 0.4 (1.0) | 0.4 (0.8) | 0.2 (0.7) |
Coagulation | 0.8 (0.9) | 0.8 (0.9) | 0.9 (1.0) | 0.8 (0.8) |
CNS | 1.2 (1.4) | 1.0 (1.1) | 0.9 (0.9) | 0.9 (1.1) |
Average hourly input, mean (SD), mL/h | 68.7 (108.6) | 78.6 (115.9) | 95.4 (117.5) | 91.5 (279.1) |
Data duration, mean (SD), d | 2.1 (1.4) | 2.1 (1.3) | 2.0 (1.4) | 2.0 (1.2) |
Weight, kg | 71.2 (60.5 to 81.7) | 86.7 (72.3 to 101.8) | 83.0 (71.6 to 95.5) | 88.0 (74.9 to 101.3) |
Average temperature, °C | 36.8 (36.7 to 37.1) | 36.8 (36.7 to 37.0) | 36.9 (36.6 to 37.2) | 36.8 (36.6 to 37.0) |
Average heart rate, bpm | 86.8 (79.1 to 96.4) | 86.4 (75.2 to 95.8) | 88.9 (80.3 to 95.9) | 83.5 (75.8 to 92.7) |
Average MAP, mm Hg | 75.4 (72.7 to 81.1) | 77.1 (71.8 to 83.6) | 74.5 (70.6 to 81.3) | 74.3 (70.1 to 79.0) |
Average respiratory rate, breaths/min | 19.0 (17.2 to 21.4) | 19.9 (17.3 to 22.3) | 19.3 (16.6 to 22.0) | 18.6 (16.6 to 21.0) |
Average SpO2, % | 97.3 (96.0 to 98.5) | 97.6 (96.1 to 98.6) | 97.2 (96.2 to 98.1) | 96.6 (95.5 to 97.7) |
Average blood oxygen saturation, % | 96.0 (93.2 to 97.9) | 95.0 (91.0 to 97.0) | 96.2 (92.5 to 98.0) | 96.0 (92.0 to 97.8) |
Average oxygen delivery, L/min | 2.6 (2.0 to 3.5) | 2.8 (2.0 to 3.6) | 2.8 (2.2 to 3.6) | 3.0 (2.3 to 3.9) |
SpO2-Hgb saturation delta | 1.5 (0.0 to 3.3) | 2.2 (0.6 to 4.9) | 1.2 (−0.2 to 3.6) | 0.8 (−0.7 to 4.1) |
Abbreviations: ALT, alanine aminotransferase; AST, aspartate aminotransferase; bpm, beats/min; BUN, blood urea nitrogen; CNS, central nervous system; Hgb, hemoglobin; MAP, mean arterial pressure; pCO2, partial pressure of carbon dioxide; SOFA, Sequential Organ Failure Assessment; SpO2, pulse oximetry.
SI conversion factors: To convert albumin to g/L, multiply by 10; ALT and AST to μkat/L, multiply by 0.0167; bilirubin to μmol/L, multiply by 17.104; BUN to mmol/L, multiply by 0.357; creatinine to μmol/L, multiply by 76.25; lactate to mmol/L, multiply by 0.111; pO2 and pCO2 to kPa, multiply by 0.133; sodium, potassium, and bicarbonate to mmol/L, multiply by 1.0.
Table 2. Numbers of Blood Gas Draws, Ventilation Outcomes, and Care Unit Details by Race and Ethnicity.
Variable | No. (%) | |||
---|---|---|---|---|
Asian patients | Black patients | Hispanic patients | White patients | |
No. | 83 | 207 | 112 | 2667 |
Total No. of blood gas tests days 1-5, mean (SD) | 3.4 (2.3) | 3.5 (2.9) | 3.9 (3.5) | 3.8 (3.3) |
Daily No. of blood gas tests, mean (SD) | ||||
Day 1 | 2.9 (2.2) | 2.9 (2.5) | 3.0 (2.8) | 3.1 (2.7) |
Day 2 | 0.4 (1.0) | 0.3 (1.0) | 0.3 (1.1) | 0.3 (1.2) |
Day 3 | 0.0 (0.2) | 0.2 (0.9) | 0.3 (1.0) | 0.2 (1.0) |
Day 4 | 0.1 (0.4) | 0.1 (0.4) | 0.2 (0.7) | 0.1 (0.6) |
Day 5 | 0.0 (0.3) | 0.0 (0.2) | 0.1 (0.8) | 0.0 (0.3) |
Required invasive ventilation | 2 (2.4) | 13 (6.3) | 6 (5.4) | 137 (5.1) |
Required noninvasive ventilation | 0 | 4 (1.9) | 2 (1.8) | 45 (1.7) |
Required high-flow NC | 1 (1.2) | 1 (0.5) | 0 | 36 (1.3) |
Required tracheostomy | 1 (1.2) | 1 (0.5) | 0 | 4 (0.1) |
Required pressors/inotropes | 45 (54.2) | 97 (46.9) | 59 (52.7) | 1611 (60.4) |
Care unit | ||||
Cardiac vascular ICU | 42 (50.6) | 85 (41.1) | 66 (58.9) | 1579 (59.2) |
Coronary care unit | 5 (6.0) | 23 (11.1) | 9 (8.0) | 299 (11.2) |
Medical/surgical ICU | 9 (10.8) | 33 (15.9) | 15 (13.4) | 214 (8.0) |
Medical ICU | 12 (14.5) | 34 (16.4) | 10 (8.9) | 276 (10.3) |
Neurology intermediate | 1 (1.2) | 1 (0.5) | 1 (0.9) | 5 (0.2) |
Neurology stepdown | 0 | 0 | 0 | 1 (<0.1) |
Neurosurgical ICU | 0 | 1 (0.5) | 0 | 9 (0.3) |
Surgical ICU | 12 (14.5) | 20 (9.7) | 9 (8.0) | 169 (6.3) |
Trauma surgical ICU | 2 (2.4) | 10 (4.8) | 2 (1.8) | 115 (4.3) |
Abbreviations: ICU, intensive care unit; NC, nasal cannula.
Pulse Oximetry and Hemoglobin Oxygen Saturation by Race and Ethnicity
Black patients had a lower median (IQR) blood hemoglobin oxygen saturation than White patients (95.0% [91.0%-97.0%] vs 96.0% [92.0%-97.8%]; P < .001). There were no overall differences in average blood hemoglobin oxygen saturation for Hispanic (96.2% [92.5%-98.0%]; P = .59) and White patients or Asian (96.0% [93.2%-97.9%]; P = .43) and White patients. Black patients had a higher SpO2 than White patients (97.6% [96.1%-98.6%] vs 96.6% [95.5%-97.7%]; P < .001). Similarly, Asian (97.3% [96.0%-98.5%]; P < .001) and Hispanic (97.2% [96.2%-98.1%]; P < .001) patients had a higher SpO2 than White patients.
In multivariable linear regression model 1 (Table 3), Black race was associated with a higher SpO2 for a given hemoglobin oxygen saturation, when controlling for average respiratory rate, initial pCO2, number of blood gas draws on day 1, hemoglobin level, need for vasopressors and inotropes, sex, time duration of index period, and Elixhauser score (coefficient, 0.919; 95% CI, 0.698-1.140; P < .001). The same was true for Hispanic (coefficient, 0.622; 95% CI, 0.329-0.915; P < .001) and Asian patients (coefficient, 0.602; 95% CI, 0.263-0.941; P = .001) patients. Age was not significant in this or any of the subsequent models and was excluded. Differences in SpO2 for a given hemoglobin oxygen saturation between races and ethnicities are shown in Figure 2A.
Table 3. Linear Regression Models of Associations Between Oxygenation Measures and Supplemental Oxygen Delivery (n = 3069).
Linear regression coefficients (95% CI) | |||||
---|---|---|---|---|---|
Model 1: SpO2, % | O2 rate, L/min | Model 5: heparin rate, U/h | |||
Model 2 | Model 3 | Model 4 | |||
Avg Hgb sat | 0.028 (0.020 to 0.036) | −0.001 (−0.007 to 0.005) | −0.230 (−0.254 to −0.207) | NA | −1.918 (−3.318 to −0.517) |
P value | <.001 | .67 | <.001 | NA | .007 |
Avg SpO2 | NA | NA | NA | −0.247 (−0.271 to −0.223) | NA |
P value | NA | NA | NA | <0.001 | NA |
SpO2-Hgb sat delta | NA | NA | −0.238 (−0.262 to −0.214) | NA | NA |
P value | NA | NA | <.001 | NA | NA |
Asian | 0.602 (0.263 to 0.941) | −0.291 (−0.546 to −0.035) | −0.144 (−0.386 to 0.098) | −0.131 (−0.373 to 0.111) | −44.840 (−106.002 to 16.322) |
P value | .001 | .03 | .24 | .29 | .15 |
Black | 0.919 (0.698 to 1.140) | −0.294 (−0.460 to −0.128) | −0.081 (−0.239 to 0.077) | −0.060 (−0.218 to 0.098) | 3.852 (−35.911 to 43.615) |
P value | <.001 | .001 | .31 | .46 | .85 |
Hispanic | 0.622 (0.329 to 0.915) | −0.242 (−0.463 to −0.020) | −0.092 (−0.301 to 0.118) | −0.084 (−0.293 to 0.125) | −29.573 (−82.504 to 23.359) |
P value | <.001 | .03 | .39 | .43 | .27 |
Avg resp rate | −0.094 (−0.109 to −0.079) | 0.056 (0.045 to 0.067) | 0.029 (0.018 to 0.040) | 0.030 (0.019 to 0.042) | 8.845 (6.135 to 11.556) |
P value | <.001 | <.001 | <.001 | <.001 | <.001 |
Initial pCO2 | −0.022 (−0.028 to −0.016) | 0.009 (0.005 to 0.014) | 0.004 (−0.000 to 0.008) | NA | −1.828 (−2.863 to −0.793) |
P value | <.001 | <.001 | .06 | NA | .001 |
No. of blood gas tests day 1 | −0.044 (−0.066 to −0.022) | 0.047 (0.031 to 0.064) | 0.041 (0.026 to 0.056) | 0.042 (0.027 to 0.057) | −6.857 (−10.788 to −2.926) |
P value | <.001 | <.001 | <.001 | <.001 | .001 |
No. of blood gas tests day 2 | NA | 0.044 (0.006 to 0.083) | 0.025 (−0.011 to 0.062) | 0.039 (0.002 to 0.076) | 20.723 (11.472 to 29.974) |
P value | NA | .03 | .18 | .04 | <.001 |
No. of blood gas tests day 3 | NA | 0.054 (0.009 to 0.100) | 0.041 (−0.002 to 0.084) | 0.058 (0.014 to 0.102) | 16.297 (5.455 to 27.140) |
P value | NA | .02 | .06 | .01 | .003 |
Hgb | −0.149 (−0.177 to −0.121) | 0.029 (0.009 to 0.049) | NA | NA | 11.018 (6.157 to 15.879) |
P value | <.001 | .005 | NA | NA | <.001 |
Required pressors/inotropes | 0.418 (0.301 to 0.535) | −0.097 (−0.185 to −0.009) | NA | NA | −46.403 (−67.431 to −25.375) |
P value | <.001 | .03 | NA | NA | <.001 |
Male sex | 0.126 (0.007 to 0.245) | NA | NA | 0.090 (0.007 to 0.172) | NA |
P value | .04 | NA | NA | .03 | NA |
Index period duration | −0.263 (−0.310 to −0.216) | NA | NA | −0.063 (−0.099 to −0.028) | NA |
P value | <.001 | NA | NA | <.001 | NA |
Elixhauser score | −0.038 (−0.064 to −0.012) | NA | NA | NA | NA |
P value | .004 | NA | NA | NA | NA |
Abbreviations: Avg, average; Hgb, hemoglobin; NA, not applicable; pCO2, partial pressure of carbon dioxide; resp, respiratory; sat, saturation; SpO2, pulse oximetry.
Factors Associated With Oxygen Delivery Rates
Oxygen delivery rates by race and ethnicity are shown in Figure 2B, the association between oxygen delivery rates and hemoglobin oxygen saturation is shown in Figure 2C, and the association between oxygen delivery rates and SpO2 is shown in Figure 2D. For clarity, Figures 2C and D are limited to average oxygen rates up to 10 L/min, excluding a small number of outliers. Black patients had lower median (IQR) oxygen delivery rates than White patients (2.8 L/min [2.0-3.6 L/min] vs 3.0 L/min [2.3-3.9 L/min]; P < .001). Asian patients also had lower median (IQR) oxygen delivery rates than White patients (2.6 L/min [2.0-3.5 L/min]; P = .009). There was a trend toward a difference between Hispanic patients (2.8 L/min [2.2-3.6 L/min] and White patients (P = .06).
Model 2 multivariable linear regression shows that Asian (coefficient, −0.291; 95% CI, −0.546 to −0.035; P = .03), Black (coefficient, −0.294; 95% CI, −0.460 to −0.128; P = .001), and Hispanic (coefficient, −0.242; 95% CI, −0.463 to −0.020; P = .03) race and ethnicity were associated with approximately 0.2 to 0.3 L/min lower average oxygen delivery rates when controlling for average respiratory rate; initial pCO2; number of blood gas tests on days 1, 2, and 3; hemoglobin level; and need for vasopressors or inotropes (Figure 2C). Sex, index period duration, and Elixhauser score were not significant in this model.
We performed another regression (model 3) using the same variables as model 2 regression and added a new variable representing the difference between average SpO2 and average blood hemoglobin oxygen saturation. In this model, a greater discrepancy was associated with lower supplemental oxygen delivery (coefficient, −0.238; 95% CI, −0.262 to −0.214; P < .001). Notably, Asian (coefficient, −0.144; 95% CI, −0.386 to 0.098; P = .24), Black (coefficient, −0.081; 95% CI, −0.239 to 0.077; P = .31), and Hispanic (coefficient, −0.092; 95% CI, −0.301 to 0.118; P = .39) race and ethnicity were not significant in model 3. Several variables from model 2, including hemoglobin level and use of vasopressors or inotropes, were excluded for nonsignificance.
We performed a mediation analysis to assess the effect of the SpO2–average blood hemoglobin oxygen saturation discrepancy as a mediator of differences in oxygen supplementation. The average causal mediation effect was −0.157; 95% CI, −0.250 to −0.057; P = .002. The average direct effect was −0.144; 95% CI, −0.340 to 0.062; P = .17. The total effect was −0.301; 95% CI, −0.506 to −0.092; P = .008.
In model 4, neither Asian (coefficient, −0.131; 95% CI, −0.373 to 0.111; P = .29), nor Black (coefficient, −0.060; 95% CI, −0.218 to 0.098; P = .46), nor Hispanic (coefficient, −0.084; 95% CI, −0.293 to 0.125; P = .43) race and ethnicity was associated with a difference in oxygen delivery rates when controlling for SpO2. Nonsignificant variables included age, hemoglobin level, pCO2, need for vasopressors or inotropes, and Elixhauser score.
Falsification Hypothesis
To rule out a spurious correlation by testing a falsification hypothesis,26 we performed a regression with the same variables as model 2, but replaced oxygen delivery with heparin rate, which would not be expected to differ across races and ethnicities, as the dependent variable (model 5). In this model, race and ethnicity were not significant (Asian coefficient, −44.840; 95% CI, −106.002 to 16.322; P = .15; Black coefficient, 3.852; 95% CI, −35.911 to 43.615; P = .85; Hispanic coefficient, −29.573; 95% CI, −82.504 to 23.359; P = .27).
Discussion
In this study, we show that Asian, Black, and Hispanic patients had higher average SpO2 readings than White patients for a given blood hemoglobin oxygen saturation. They also received less supplemental oxygen when adjusting for potential confounders, and these disparities appear to be mediated by larger discrepancies between SpO2 and blood hemoglobin oxygen saturation.
Our study is consistent with prior studies that have shown differences of several percentage points in SpO2 for a given hemoglobin oxygen saturation between Black and White patients,7,8,9,10 but in the past, the clinical significance of these findings has occasionally been discounted. For example, 1 article stated that for the purpose of replacing arterial blood gas, “oximetry need not have exact accuracy—we are likely to provide oxygen to a patient whether his oxygen saturation (SpO2) value is 88% or 91%, and we feel little need to give it if his SpO2 value is 97% or 100%.”27(p131)
However, by analyzing oxygen rates, we identify a clear association with an important treatment parameter, and although we cannot directly determine the causal association of differences in oxygen supplementation shown in this study with known disparities in outcomes, the finding that race and ethnicity could affect how much oxygen a patient receives is notable and concerning. This will become increasingly important with the advent of closed-loop circuits that automatically titrate supplemental oxygen according to SpO2.28,29 While patients of some races and ethnicities may be more likely to have diseases states in which a lower SpO2 is tolerated, no difference in supplemental oxygen in relation to SpO2 was observed to suggest intentional differences in titration practices. Notably, a recent study30 showed that hidden hypoxemia in patients with COVID-19 was associated with delayed or nonadministration of COVID-19 therapies.
Our findings present a unique and compelling opportunity to improve equity through device reengineering and by reevaluating how data are interpreted.31,32,33 However, this should be done with caution, as some past attempts to correct for race and ethnicity in algorithms have exacerbated disparities and are subject to ethical concerns, such as those leading to the removal of the African American race coefficient from the Chronic Kidney Disease Epidemiology Collaboration estimated glomerular filtration formula in 2021.34,35 Even though skin pigmentation is less nebulous than race and ethnicity–associated factors that have been previously hypothesized to affect GFR estimation, it would still be prudent to justify why a correction factor is appropriate. Ideally, the SpO2 reading would be corrected based on objective colorimetry, not reported race or ethnicity.36
Strengths and Limitations
Strengths of our study include a large cohort of patients and high-resolution data, which allowed us to control for multiple clinical confounders. By averaging oxygenation data over several days, we derived longer-term values to complement the temporal matching studies that we previously referenced.
Our study also has several limitations, including that it was based on data from 1 institution. We initially intended to perform this analysis with a multi-ICU research database, but marked regional differences in oxygen delivery rates combined with substantially different percentages of Black patients, as well as nonstandard reporting of oxygen delivery devices, precluded further analysis. Our study was also limited to nasal cannula oxygen delivery and did not study titration of fraction of inspired oxygen. Time off oxygen is likely underreported as well, because a room-air designation may not be documented with the vital signs.
We also did not temporally match hemoglobin oxygen saturation or SpO2 values with specific oxygen delivery data points for the reasons we have described and because it was not feasible to reliably determine if, how, and when specific blood gas laboratory results were used in minute-by-minute oxygen titration. We did not assess for variable effects at degrees of hypoxemia. Lastly, we were limited to self-reported race and ethnicity, and we did not assess skin tone. Hospital race and ethnicity are ideally documented as reported by the patient or proxy, and while agreement is generally good, this reporting is imperfect.37
We suggest several areas for further studies. Similar analyses should be performed at other institutions and should explore specific factors within a racial and ethnic group that could put some patients at particularly high risk of oxygenation disparities, including skin tone, degree of desaturation, exposure to specific oxygen delivery devices, comorbidities, and other sociodemographic factors. Differences in oxygen supplementation in patients receiving invasive or noninvasive positive pressure ventilation should be studied and a potential association of vasopressors and inotropes should be explored in further detail. Clinical decisions other than oxygen delivery that may be affected by pulse oximeter performance discrepancies should also be explored.
Conclusions
In this cohort study, Asian, Black, and Hispanic patients in the ICU received less supplemental oxygen than White patients, and this finding was associated with differences in pulse oximeter performance. Further research is needed to confirm these findings and explore other clinical factors associated with treatment disparities.
References
- 1.Miyasaka K, Shelley K, Takahashi S, et al. Tribute to Dr Takuo Aoyagi, inventor of pulse oximetry. J Anesth. 2021;35(5):671-709. doi: 10.1007/s00540-021-02967-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Chan ED, Chan MM, Chan MM. Pulse oximetry: understanding its basic principles facilitates appreciation of its limitations. Respir Med. 2013;107(6):789-799. doi: 10.1016/j.rmed.2013.02.004 [DOI] [PubMed] [Google Scholar]
- 3.Van de Louw A, Cracco C, Cerf C, et al. Accuracy of pulse oximetry in the intensive care unit. Intensive Care Med. 2001;27(10):1606-1613. doi: 10.1007/s001340101064 [DOI] [PubMed] [Google Scholar]
- 4.Webb RK, Ralston AC, Runciman WB. Potential errors in pulse oximetry. II. effects of changes in saturation and signal quality. Anaesthesia. 1991;46(3):207-212. doi: 10.1111/j.1365-2044.1991.tb09411.x [DOI] [PubMed] [Google Scholar]
- 5.Ries AL, Prewitt LM, Johnson JJ. Skin color and ear oximetry. Chest. 1989;96(2):287-290. doi: 10.1378/chest.96.2.287 [DOI] [PubMed] [Google Scholar]
- 6.Hakverdioğlu Yönt G, Akin Korhan E, Dizer B. The effect of nail polish on pulse oximetry readings. Intensive Crit Care Nurs. 2014;30(2):111-115. doi: 10.1016/j.iccn.2013.08.003 [DOI] [PubMed] [Google Scholar]
- 7.Sjoding MW, Dickson RP, Iwashyna TJ, Gay SE, Valley TS. Racial bias in pulse oximetry measurement. N Engl J Med. 2020;383(25):2477-2478. doi: 10.1056/NEJMc2029240 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Andrist E, Nuppnau M, Barbaro RP, Valley TS, Sjoding MW. Association of race with pulse oximetry accuracy in hospitalized children. JAMA Netw Open. 2022;5(3):e224584. doi: 10.1001/jamanetworkopen.2022.4584 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Henry NR, Hanson AC, Schulte PJ, et al. Disparities in hypoxemia detection by pulse oximetry across self-identified racial groups and associations with clinical outcomes. Crit Care Med. 2022;50(2):204-211. doi: 10.1097/CCM.0000000000005394 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Wong AI, Charpignon M, Kim H, et al. Analysis of discrepancies between pulse oximetry and arterial oxygen saturation measurements by race and ethnicity and association with organ dysfunction and mortality. JAMA Netw Open. 2021;4(11):e2131674. doi: 10.1001/jamanetworkopen.2021.31674 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Donaldson SV, Thomas AN, Gillum RF, Mehari A. Geographic variation in racial disparities in mortality from influenza and pneumonia in the United States in the pre-coronavirus disease 2019 era. Chest. 2021;159(6):2183-2190. doi: 10.1016/j.chest.2020.12.029 [DOI] [PubMed] [Google Scholar]
- 12.Boserup B, McKenney M, Elkbuli A. Disproportionate impact of COVID-19 pandemic on racial and ethnic minorities. Am Surg. 2020;86(12):1615-1622. doi: 10.1177/0003134820973356 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Nimgaonkar V, Thompson JC, Pantalone L, et al. Racial disparities in 30-day outcomes following index admission for COVID-19. Front Med (Lausanne). 2021;8:750650. doi: 10.3389/fmed.2021.750650 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Crooks CJ, West J, Morling JR, et al. Pulse oximeter measurements vary across ethnic groups: an observational study in patients with COVID-19. Eur Respir J. 2022;59(4):2103246. doi: 10.1183/13993003.03246-2021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hidalgo DC, Olusanya O, Harlan E. Critical care trainees call for pulse oximetry reform. Lancet Respir Med. 2021;9(4):e37. doi: 10.1016/S2213-2600(21)00102-8 [DOI] [PubMed] [Google Scholar]
- 16.Michard F, Shelley K, L’Her E. COVID-19: pulse oximeters in the spotlight. J Clin Monit Comput. 2021;35(1):11-14. doi: 10.1007/s10877-020-00550-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Eberly LA, Richterman A, Beckett AG, et al. Identification of racial inequities in access to specialized inpatient heart failure care at an academic medical center. Circ Heart Fail. 2019;12(11):e006214. doi: 10.1161/CIRCHEARTFAILURE.119.006214 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Johnson AEW, Pollard TJ, Shen L, et al. MIMIC-III, a freely accessible critical care database. Sci Data. 2016;3(1):160035. doi: 10.1038/sdata.2016.35 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Hirano Y, Shinmoto K, Okada Y, et al. Machine learning approach to predict positive screening of methicillin-resistant Staphylococcus aureus during mechanical ventilation using synthetic dataset from MIMIC-IV database. Front Med (Lausanne). 2021;8:694520. doi: 10.3389/fmed.2021.694520 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Vandenbroucke JP, von Elm E, Altman DG, et al. ; STROBE Initiative . Strengthening the Reporting of Observational Studies in Epidemiology (STROBE): explanation and elaboration. PLoS Med. 2007;4(10):e297. doi: 10.1371/journal.pmed.0040297 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Yan Y. MLmetrics: machine learning evaluation metrics. Published May 13, 2016. Accessed June 7, 2022. https://cran.r-project.org/web/packages/MLmetrics/index.html
- 22.Velho TR, Pereira RM, Paixão T, et al. Sequential organ failure assessment score in the ICU as a predictor of long-term survival after cardiac surgery. Crit Care Explor. 2022;4(5):e0682. doi: 10.1097/CCE.0000000000000682 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Gasparini A. Comorbidity: an R Package for computing comorbidity scores. J Open Source Softw. 2018;3(23):648. doi: 10.21105/joss.00648 [DOI] [Google Scholar]
- 24.Elixhauser A, Steiner C, Harris DR, Coffey RM. Comorbidity measures for use with administrative data. Med Care. 1998;36(1):8-27. doi: 10.1097/00005650-199801000-00004 [DOI] [PubMed] [Google Scholar]
- 25.Zhang Z, Zheng C, Kim C, Van Poucke S, Lin S, Lan P. Causal mediation analysis in the context of clinical research. Ann Transl Med. 2016;4(21):425-425. doi: 10.21037/atm.2016.11.11 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Prasad V, Jena AB. Prespecified falsification end points: can they validate true observational associations? JAMA. 2013;309(3):241-242. doi: 10.1001/jama.2012.96867 [DOI] [PubMed] [Google Scholar]
- 27.Witting MD, Scharf SM. Diagnostic room-air pulse oximetry: effects of smoking, race, and sex. Am J Emerg Med. 2008;26(2):131-136. doi: 10.1016/j.ajem.2007.04.002 [DOI] [PubMed] [Google Scholar]
- 28.L’Her E, Dias P, Gouillou M, et al. Automatic versus manual oxygen administration in the emergency department. Eur Respir J. 2017;50(1):1602552. doi: 10.1183/13993003.02552-2016 [DOI] [PubMed] [Google Scholar]
- 29.Denault M-H, Péloquin F, Lajoie A-C, Lacasse Y. Automatic versus manual oxygen titration in patients requiring supplemental oxygen in the hospital: a systematic review and meta-analysis. Respiration. 2019;98(2):178-188. doi: 10.1159/000499119 [DOI] [PubMed] [Google Scholar]
- 30.Fawzy A, Wu TD, Wang K, et al. Racial and ethnic discrepancy in pulse oximetry and delayed identification of treatment eligibility among patients with COVID-19. JAMA Intern Med. Published online May 31, 2022. doi: 10.1001/jamainternmed.2022.1906 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Danziger J, Ángel Armengol de la Hoz M, Li W, et al. Temporal trends in critical care outcomes in US minority-serving hospitals. Am J Respir Crit Care Med. 2020;201(6):681-687. doi: 10.1164/rccm.201903-0623OC [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.McGowan SK, Sarigiannis KA, Fox SC, Gottlieb MA, Chen E. Racial disparities in ICU outcomes: a systematic review. Crit Care Med. 2022;50(1):1-20. doi: 10.1097/CCM.0000000000005269 [DOI] [PubMed] [Google Scholar]
- 33.Soto GJ, Martin GS, Gong MN. Healthcare disparities in critical illness. Crit Care Med. 2013;41(12):2784-2793. doi: 10.1097/CCM.0b013e3182a84a43 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Obermeyer Z, Powers B, Vogeli C, Mullainathan S. Dissecting racial bias in an algorithm used to manage the health of populations. Science. 2019;366(6464):447-453. doi: 10.1126/science.aax2342 [DOI] [PubMed] [Google Scholar]
- 35.Inker LA, Eneanya ND, Coresh J, et al. ; Chronic Kidney Disease Epidemiology Collaboration . New creatinine- and cystatin C-based equations to estimate GFR without race. N Engl J Med. 2021;385(19):1737-1749. doi: 10.1056/NEJMoa2102953 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Ly BCK, Dyer EB, Feig JL, Chien AL, Del Bino S. Research techniques made simple: cutaneous colorimetry: a reliable technique for objective skin color measurement. J Invest Dermatol. 2020;140(1):3-12.e1. doi: 10.1016/j.jid.2019.11.003 [DOI] [PubMed] [Google Scholar]
- 37.Zingmond DS, Parikh P, Louie R, et al. Improving hospital reporting of patient race and ethnicity—approaches to data auditing. Health Serv Res. 2015;50(suppl 1):1372-1389. doi: 10.1111/1475-6773.12324 [DOI] [PMC free article] [PubMed] [Google Scholar]