Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2025 Dec 24.
Published in final edited form as: Clin Nurs Res. 2025 Oct 4;34(8):403–411. doi: 10.1177/10547738251374746

Do Differences in Skin Pigmentation Affect Detection of Hypoxemia by Pulse Oximetry: A Systematic Review of the Literature

Shannon A Cotton 1,2, Jung-ah Lee 2, Atul Malhotra 1, W Cameron McGuire 1
PMCID: PMC12723707  NIHMSID: NIHMS2116193  PMID: 41045137

Abstract

Pulse oximetry is a widely used, noninvasive method for estimating arterial oxygen saturation (SaO2). However, emerging evidence suggests that skin pigmentation may affect its accuracy, potentially leading to occult hypoxemia in individuals with darker skin tones. This systematic review examines the impact of skin pigmentation on pulse oximeter accuracy by comparing pulse oximetry (SpO2) readings with arterial blood gas-measured SaO2 across diverse populations. A systematic search of PubMed and Embase was conducted following PRISMA 2020 guidelines. Eligible studies included those comparing SpO2 to SaO2 while stratifying results by skin pigmentation or race/ethnicity. Data extraction focused on bias in SpO2 readings, study design, and population characteristics. Risk of bias was assessed using the QUADAS-2 tool. Forty-two studies met the inclusion criteria. Consistent evidence indicated that pulse oximeters overestimate SaO2 in individuals with darker skin tones, particularly at lower oxygen saturations. This overestimation may delay recognition of hypoxemia and critical interventions. Methodological variability was noted, including inconsistent racial classifications and skin tone assessment methods. Pulse oximeters exhibit a systematic bias in individuals with darker skin tones. Standardized skin pigmentation assessment and improved device calibration are needed to enhance accuracy and ensure equitable patient care.

Keywords: pulse oximetry accuracy, skin pigmentation bias, hypoxemia detection, racial disparities in healthcare, oxygen saturation measurement

Introduction

Pulse oximetry (SpO2) is a quick, inexpensive, noninvasive assessment of blood oxygenation that estimates the percentage of oxygen bound to hemoglobin (SaO2). The estimate is derived from a ratio of red to infrared light at a given point in time, which is driven by the amount of arterial versus venous blood, which changes with the pulse. As one of the five core vital signs, oxygen saturation often informs patient care, and a SpO2 of >92% is normal for most adults. While many biological processes can interfere with pulse oximetry readings (e.g., methemoglobinemia, anemia, hyperbilirubinemia), little is known about the extent to which skin pigmentation can affect the accuracy of the device.

Previous research has shown that pulse oximeters are less accurate in people who identify as non-white, including black (Fawzy et al., 2024; Jubran & Tobin, 1990; Sjoding et al., 2020; Valbuena et al., 2022). Concerningly, these studies demonstrated that the SpO2 value was often falsely high in non-white individuals, leading to a phenomenon termed “occult hypoxemia,” wherein the actual arterial saturation was low enough to warrant clinical action while the noninvasive saturation was reassuringly “normal.” In the study by Sjoding et al., for example, occult hypoxemia occurred three times as often in black patients as it did in white patients in data from an intensive care unit (ICU) database (Sjoding et al., 2020). These results prompted a letter to the United States Food and Drug Administration (FDA) (2021) and a collaborative working group (FDA, 2022) to address this problem. Despite these advances, little has been done to characterize what, specifically, about non-white individuals leads to these reproducible discrepancies. Some have hypothesized that differential melanin, the pigment responsible for skin color, expression, affects the absorbance of light used in pulse oximeters. However, there are scant prospective data that address this issue. Rather, most studies have focused on categorization of skin pigmentation using patient self-report, visual scales that lack nuance (Verkruysse et al., 2024) or subjective scales like the Fitzpatrick skin type (FST) that are misapplied to the context (Fitzpatrick, 1975). We have undertaken this systematic review to discover and categorize studies that reported skin pigmentation in addition to the difference between the noninvasively measured saturation (SpO2) and the invasively measured saturation (SaO2) through arterial blood gas (ABG) analysis.

This review encompasses articles that compared the SaO2 and the SpO2 and reported the difference between the two values. We searched for articles that included classification of skin tone and compared the difference between the two saturation values based on race, ethnicity, or skin tone. The authors believe that while much research has been done on SpO2 and race, it is important to review key literature that addresses skin color or skin pigmentation and how it relates to pulse oximetry. This systematic review includes articles validating the accuracy of pulse oximeters and those showing the difference between the two saturation values and the precision of the device. Based on this conceptual framework, this systematic review aims to test the hypothesis that, based on the literature, a discrepancy exists between SpO2 and SaO2 as a function of skin pigmentation or color.

Methods

Published, peer-reviewed journal articles were included if the following criteria were met: pulse oximeter bias was reported as a result of SpO2 minus SaO2, the study was written or translated into English, the full text of the study was available, the population was adults >18 years old, race and/or ethnicity was reported, and the method by which race/ethnicity was determined was reported. In the initial search, we included any articles, case reports, or conference presentations that met these criteria. For conference presentations, we included those that met the inclusion criteria, which could be interpreted from written and poster presentations.

We conducted a systematic literature search in two databases, including PubMed and Embase, to identify relevant studies examining the relationship between oximetry and skin pigmentation (Supplement 1). No restrictions were placed on language or publication date.

We searched for English-language articles published through October 2023, with subsequent updates to PubMed and Embase searches through February 2025. The original and subsequent search strategies yielded 1,575 articles. After duplicates were removed, titles and abstracts of the remaining 1,125 studies were independently reviewed by two authors (SC and WCM). Following the database search, four other systematic reviews and their included articles and citations were checked for inclusion criteria. To be included, articles must have investigated the effects of skin pigmentation on oximetry measurements; used spectrophotometry, colorimetry, or a skin classification system to assess skin pigmentation; compared pulse oximetry to the gold standard ABG measurement by co-oximetry; and reported quantitative or qualitative data on the interaction between skin pigment and oxygen saturations. Articles were subsequently excluded if no English translation was available, if study groups were pediatric or neonatal, focused solely on non-human subjects, did not include primary data (e.g., reviews, editorials), or lacked sufficient methodological detail for assessment.

Screening was done using Covidence (Covidence systematic review software, Veritas Health Innovation, Melbourne, Australia. Available at www.covidence.org, 2024). If there was disagreement between the two authors about study inclusion, a discussion was held, and if consensus could not be reached, a third author (AM) was consulted as the tiebreaker. After screening the 1,125 study titles and abstracts, 144 full-text articles were reviewed for inclusion. A similar approach was taken by the two authors (SC and WCM) with the 144 full-text articles, with a third author (AM) serving as a tiebreaker. The PRISMA diagram, as shown in Table 1 of the search results. Data were extracted by two reviewers (SC and WM) and managed using Covidence and Excel.

Table 1.

Pulse Oximetry and Skin Tone.

graphic file with name nihms-2116193-t0001.jpg

Results

Thirty-one studies were included from the search. Eleven additional studies were identified through citation investigation. Ultimately, 42 studies were included in the analysis (Table 1).

Studies from the literature search highlight the various pulse oximeter biases. The studies by Bickler et al. and Feiner et al. highlight this discrepancy (Bickler et al., 2005; Feiner et al., 2007), while Crooks et al. found that such inaccuracies led to delays in critical care interventions (Crooks et al., 2023). Seitz et al. and Gomaa et al. further confirmed the presence of racial bias in pulse oximetry, especially in critically ill patients, where unrecognized hypoxemia can have severe consequences (Gomaa et al., 2023; Kalra et al., 2023; Seitz et al., 2022). Clinical ramifications of these biases include delayed hypoxemia detection and treatment disparities, as demonstrated by Henry et al. and Sudat et al. (Henry et al., 2022; Sudat et al., 2023). Burnett et al. reported that occult hypoxemia was more prevalent among Black and Hispanic patients, raising concerns about potential inaccuracy (Burnett et al., 2022). Factors contributing to inaccuracy include probe technology and calibration errors, as examined by Mendelson et al. and Adler et al. (Adler et al., 1998; Mendelson et al., 1988), with Barker and Wilson showing device-specific variations in bias (Barker & Wilson, 2023). Blanchet et al. also identified inconsistencies among pulse oximeters, emphasizing the need for improved calibration (Blanchet et al., 2023). Errors persisted in clinical settings, as reported by Zeballos and Weisman during hypoxia and exercise (Zeballos & Weisman, 1991) and by Ebmeier et al. in ICU environments (Ebmeier et al., 2018). In addition, Ajizian et al. and Gudelunas et al. observed increased rates of missed hypoxemia under low perfusion conditions in darker-skinned individuals (Ajizian et al., 2023; Gudelunas et al., 2024). To address these issues, Khanna et al. and Bothma et al. advocate for recalibrating pulse oximeters using diverse skin pigmentation models (Bothma et al., 1996; Khanna et al., 2024). Wong et al. and Blanchet et al. emphasize the necessity of regulatory oversight to enhance accuracy across racial groups (Blanchet et al., 2023; Wong et al., 2023). Seitz et al. and Crooks et al. suggest incorporating ABG validation to counteract bias (Crooks et al., 2022, 2023; Seitz et al., 2022). Three studies relied on spectrophotometry to classify skin tone and concluded that, due to pulse oximeter biases at lower SaO2, the risk for occult hypoxemia could increase (Fawzy et al., 2024; Hao et al., 2024; Leeb et al., 2024). Ries and Lellouche grouped patients based on subjective “light” to “dark” and found bias was nearly equal across the studies (Blanchet et al., 2023; Ries et al., 1989). Both Balmaceda et al. and Wiles et al. utilized participant self-identified race and found that SpO2 and SaO2 were poorly correlated overall, with Balmaceda finding only a .814% SpO2 overestimation in Blacks (Balmaceda et al., 2022; Wiles et al., 2022). Using subjective skin scales, Marlar and Mastrototaro both found that while there was some bias in SpO2, the bias was within FDA guidelines (Marlar et al., 2025; Mastrototaro et al., 2024).

While not all studies stated the pulse oximeter device tested or utilized in the study, several studies included in this review specified the brands or types of pulse oximeters used in their analyses. Barker and Wilson (2023) conducted their laboratory study using Masimo pulse oximeters, while Blanchet et al. (2023) compared the performance of multiple devices, including Philips, Nonin, Nellcor, and Masimo, reporting variability in bias by device type. Bothma et al. (1996) tested Simed, Nihon Koden, and Ohmeda probes (ear and finger), demonstrating differing levels of agreement with arterial oxygen saturation. Gudelunas et al. (2024) evaluated Masimo and Nellcor oximeters across FSTs, identifying increasing bias in darker skin tones. Lellouche (2023) similarly compared Nonin and Philips devices, finding that Nonin underestimated and Philips overestimated arterial saturation. Leeb et al. (2024) tested 11 different fingertip oximeters in a controlled laboratory environment and found that five devices failed to meet FDA accuracy standards (ARMS <3%) in individuals with darker skin. Earlier studies also reported on device-specific performance: Mendelson et al. (1988) evaluated the Datascope ACCUSAT, Ries et al. (1989) tested Ohmeda Biox and Hewlett-Packard (HP) models, and Zeballos and Weisman (1991) compared HP and Biox IIA oximeters, observing higher error rates in black subjects under hypoxic conditions. Collectively, these findings underscore that pulse oximeter accuracy can vary not only by skin pigmentation but also by device manufacturer and model.

The reviewed studies consistently demonstrate racial and ethnic discrepancies in pulse oximetry accuracy, particularly underestimating hypoxemia in Black, Asian, and other non-White patients. Bangash et al. found that pulse oximeters overestimated SaO2 more frequently in Black, Asian, and mixed ethnic groups compared to White patients, increasing the risk of occult hypoxemia (Bangash et al., 2022). Fawzy et al. reported similar findings, highlighting delayed COVID-19 treatment eligibility due to overestimated oxygen levels in Black and Hispanic patients (Fawzy et al., 2022). Pilcher et al. confirmed these discrepancies in a multicenter study using the FST scale, showing bias related to skin pigmentation (Pilcher et al., 2020). Schallom et al. found racial differences in accuracy between nasal and forehead sensors (Schallom et al., 2018). Sjoding et al. and Valbuena et al. demonstrated that Black patients were more likely to have occult hypoxemia compared to White patients, with implications for delayed clinical interventions (Sjoding et al., 2020; Valbuena et al., 2022). Wong et al. linked these discrepancies to increased organ dysfunction and mortality risks (Wong et al., 2021). By contrast, earlier studies, such as Jubran and Lee (Jubran & Tobin, 1990; Lee et al., 1993), did not explicitly address racial bias, though Lee noted variability in accuracy among ethnic groups in Singapore. Finally, McGovern et al. and Thrush and Hodges only studied white subjects in a laboratory setting and only noted minimal bias during hypoxic states (McGovern et al., 1996; Thrush & Hodges, 1994). Collectively, these findings underscore the urgent need for improved pulse oximeter technologies that account for racial and ethnic differences to ensure equitable healthcare outcomes.

The research utilized various methods to determine skin pigmentation. In most studies, race and/or ethnicity was collected from the medical record or self-reported by the participant or the surrogate. A total of 17 studies used race or ethnicity as reported by the participant, surrogate, or recorded in the medical record (Balmaceda et al., 2022; Bangash et al., 2022; Burnett et al., 2022; Chesley et al., 2022; Crooks et al., 2022, 2023; Fawzy et al., 2022; Gomaa et al., 2023; Henry et al., 2022; Kalra et al., 2024; Schallom et al., 2018; Seitz et al., 2022; Sjoding et al., 2020; Sudat et al., 2023; Valbuena et al., 2022; Wiles et al., 2022; Wong et al., 2021). Some studies utilized subjective skin classification systems. Five studies utilized the FST (Blanchet et al., 2023; Ebmeier et al., 2018; Gudelunas et al., 2024; Leeb et al., 2024), three studies used the Massey-Martin scale (Barker & Wilson, 2023; Marlar et al., 2025; Wong et al., 2023), and two studies used both the FST and the Monk scale (Hao et al., 2024; Mastrototaro et al., 2024). Two studies used the Munsell color system and then classified participants into “light,” “intermediate,” or “dark” skin tone groups (Adler et al., 1998; Ries et al., 1989). Four studies utilized spectrophotometry to quantify light reflectance (Fawzy et al., 2024; Fong et al., 2024; Leeb et al., 2024). In six studies, the researchers classified the participants as “light” or “dark” pigmented based on observation (Ajizian et al., 2023; Bickler et al., 2005; Feiner et al., 2007; Jubran & Tobin, 1990; Khanna et al., 2024; Mendelson et al., 1988). Three studies enrolled only “lightly pigmented” or white men only (Bouvet et al., 2012; McGovern et al., 1996; Thrush & Hodges, 1994). One study enrolled only Black men (Zeballos & Weisman, 1991), and one study had patients self-identify ethnicity as Chinese, Malay, or Indian (Lee et al., 1993; Zeballos & Weisman, 1991). Sixteen studies enrolled only critically ill patients admitted to the ICU (Balmaceda et al., 2022; Blanchet et al., 2023; Bothma et al., 1996; Bouvet et al., 2012; Chesley et al., 2022; Crooks et al., 2023; Ebmeier et al., 2018; Fawzy et al., 2024; Hao et al., 2024; Henry et al., 2022; Jubran & Tobin, 1990; Lee et al., 1993; Marlar et al., 2025; Schallom et al., 2018; Seitz et al., 2022). Six studies included patients roomed in the hospital but outside the ICU (Adler et al., 1998; Bangash et al., 2022; Burnett et al., 2022; Crooks et al., 2022; Ebmeier et al., 2018; Fawzy et al., 2022). Eleven studies were completed in a pulmonary laboratory or clinic (Ajizian et al., 2023; Barker & Wilson, 2023; Bickler et al., 2005; Feiner et al., 2007; Gudelunas et al., 2024; Mastrototaro et al., 2024; McGovern et al., 1996; Mendelson et al., 1988; Ries et al., 1989; Thrush & Hodges, 1994; Zeballos & Weisman, 1991). Five studies utilized large databases (Kalra et al., 2023; Sjoding et al., 2020; Sudat et al., 2023; Valbuena et al., 2022; Wong et al., 2021). In three studies, the setting in which data were obtained was not reported.

Overestimation of SaO2 saturation by SpO2 in darker-skinned individuals was a consistent finding across numerous studies. Bias varied between devices, with levels ranging from minor (≤1%) to significant (+5%) overestimations depending on the population and device. A few studies identified that occult hypoxemia was more prevalent in Black patients when compared to White patients.

Studies performed in laboratory settings were able to titrate oxygen and other gases to achieve hypoxia and test oximeter accuracy in induced hypoxic states. These studies were performed on healthy volunteers. Bias reported from pulmonary laboratory studies ranged from 0% to 2%, staying within the FDA requirement for use in a healthcare setting. Studies performed in hospital settings or those that used large databases of hospitalized patients varied.

Discussion

This systematic review encompasses all studies through February 2025 that study the bias of pulse oximeters by race, ethnicity, or skin tone. A total of 42 studies were reviewed (Table 1). In November 2023, the FDA stated, “about 66% of SpO2 values were within 2 or 3% of blood gas values and about 95% of SpO2 values were within 4 to 6% of blood gas values, respectively.” However, the FDA goes on to comment that “current scientific evidence from lab desaturation studies suggests that there are some accuracy differences in pulse oximeter performance, especially in lower arterial saturations between lightly and darkly pigmented participants.”

During the height of the COVID-19 pandemic, there were reports that the pulse oximeter failed to accurately assess the saturation in patients who identified as Black or African American (Crooks et al., 2023; Fawzy et al., 2022; Sjoding et al., 2020; Sudat et al., 2023; Wiles et al., 2022). The reports highlighted that this group was at higher risk for unrecognized occult hypoxia, defined as a SpO2 in the normal range with a SaO2 of <88%. This failure of the device has been investigated retrospectively and prospectively in the studies found during this review. In the studies reported here, there is significant reliance on subjective, investigator-reported methods of race or self-reported races, which can introduce variability across the different studies. Objective tools like spectrophotometry were rarely used, but may be a potential area for standardization in future research on skin pigments and pulse oximetry. The studies that reported results by race/ethnicity found that SpO2 to SaO2 bias did vary by race, highlighting the fact that there may be less reporting and/or recognition of hypoxia in Black patients. In general, in patients with SpO2 >90%, the difference between SpO2 and SaO2 is generally less than 2 points (Jubran, 1999).

These reports of bias within the FDA guidelines are reassuring, but there is no reasoning in these studies as to why the bias is greater in some race categories when compared to White. In 2024, the FDA standard required devices to be tested with a minimum of 200 data points evenly distributed over an SaO2 range of 70% to 100% and that devices be tested on 10 or more healthy subjects who vary in age and gender and that the study has subjects with “a range of skin pigmentations, including at least 2 darkly pigmented subjects or 15% of the subject pool” (Goodman, 1988). However, many pulse oximeters were historically calibrated using data from light-skinned individuals. This can lead to less accurate measurements in individuals with darker skin tones because the algorithms may not adequately account for variations in light absorption due to pigmentation.

The authors acknowledge that there have been many published reviews focused on skin tone and pulse oximeter bias. This review is different because it only includes articles that compare the SpO2 to the gold standard SaO2 obtained by ABG sampling. In addition, this review encompasses both pulmonary laboratory and hospital setting studies and shows the differences between induced hypoxic state pulse oximetry in a pulmonary laboratory-induced and clinically relevant hypoxic states of patients admitted to hospitals. This is important because oftentimes perfusion during induced hypoxia remains normal in a healthy adult, while perfusion in a hospitalized patient may be decreased due to various factors such as sepsis, vasopressor administration, or other comorbid states like peripheral vascular disease.

The findings of our study are important for several reasons. First, we have observed that in aggregate, the discrepancy between SpO2 and SaO2 tends to be small, providing some reassurance that pulse oximeters are providing useful information for most patients. Second, there are situations where outliers are observed, such that in some patients a major discrepancy exists between the SpO2 and the SaO2. Third, the existing literature is somewhat flawed for a few reasons, including retrospective designs, non-concurrent measures of SaO2 and SpO2, and inadequate assessment of objective skin color. Thus, further work is clearly needed in this area.

The utility of pulse oximetry has been questioned in part because of an FDA warning regarding the accuracy of the recordings in people with dark skin pigment. Some providers rely on ABG measurement in the ICU because many of these patients have indwelling arterial lines for hemodynamic monitoring; thus, the risk of obtaining the gold standard measurements is minimal. On the other hand, some have questioned the need for indwelling arterial lines because of the availability of non-invasive blood pressure cuffs, which provide a reasonable estimate of arterial blood pressure (Li-wei et al., 2013). In addition, the definition of acute respiratory distress syndrome traditionally required the measurement of PaO2 to calculate the P/F ratio. However, increasingly the use of S/F ratio is being substituted as it provides a reasonable surrogate for the ratio of partial pressure of oxygen to the fraction of oxygen (P/F ratio; Cotton et al., 2022). Thus, the accuracy of SpO2 remains critically important, given the potential for misdiagnosis in this subset of patients.

The COVID-19 pandemic was particularly problematic since many patients were described as having “happy hypoxia” or “silent hypoxemia” (Simonson et al., 2021). The accuracy of SpO2 was critical during this period because some patients with occult hypoxemia may have been misclassified due to inaccurate readings. The COVID-19 pandemic affected communities of color preferentially, at least in some regions of the world, making the issues around skin color particularly important.

Despite our study’s strengths, we acknowledge several limitations. First, our analyses relied on published literature and thus we are limited by the design of the parent studies. Thus, many of the prospective studies were done in healthy participants in pulmonary laboratories, in whom estimation of SaO2 is not typically considered challenging. In addition, retrospectively designed studies did not always have simultaneous measurements of SpO2 and SaO2, which may be problematic with variables that are dynamically changing in critically ill patients. Second, the measurement of skin pigment is best done using spectrophotometry since self-report is subjective and may lead to reporting bias. Semi-quantitative scales, such as the FST, are considered by many to be culturally insensitive and thus better replaced by objective measurements. Third, prospective studies focused on clinically meaningful outcomes may be helpful to draw firmer conclusions (i.e., does aggressive treatment of occult hypoxemia lead to improved outcomes or are such abnormalities clinically inconsequential). Unfortunately, our findings suggest that there were few prospective studies among our 42 included references. Large-scale trials will be required to draw rigorous conclusions. Despite these limitations, we believe that our findings are important and hope that they drive further research in this area. For now, the findings are reassuring and may be clinically directive until more definitive data are available.

Conclusions and Nursing Implications

The variation in pulse oximetry accuracy based on skin pigmentation has significant implications for nursing practice. Nurses frequently rely on SpO2 values to assess oxygenation, guide clinical decision-making, and escalate care. However, overestimation of oxygen saturation in patients with darker skin tones may lead to missed or delayed recognition of hypoxemia, potentially resulting in undertreatment or poor outcomes. Nurses must remain vigilant, especially when caring for African American patients, by integrating additional assessment methods such as clinical signs of hypoxia, ABGs, and patient-reported symptoms to ensure equitable and accurate care. Awareness of device limitations and advocating for more inclusive technology are also essential components of nursing advocacy and patient safety.

Pulse oximetry should be interpreted in the context of patient presentation and symptoms. SpO2 alone may overestimate SaO2 in some patients, and this can be exacerbated by the patient’s race/ethnicity and perfusion. Care should be taken in hospitalized patients when deciding on and executing interventions based on SpO2 in isolation. Finally, it should be noted that while only a few studies in our review looked at occult hypoxia, overall, pulse oximeters become less accurate as the patient’s true SaO2 decreases. This finding underscores the need for inclusive research and development of practices to ensure equitable healthcare outcomes.

Supplementary Material

sup

Supplemental material for this article is available online.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the WIN/CANS Martha J. Lentz Dissertation Grant.

Biographies

Dr. Shannon A. Cotton is a critical care nurse and researcher in the intensive care unit at UC San Diego.

Dr. Jung-ah Lee is a nurse scientist and faculty at UC Irvine with expertise in gerontology and healthcare system. Her research focuses primarily on translational approach to improve care delivery and quality of life for patients and caregivers. Within this focus, a recurring theme of her work is to explore racial and ethnic disparities in quality of care.

Dr Atul Malhotra is a board-certified pulmonologist, intensivist and research chief of Pulmonary, Critical Care and Sleep Medicine at UC San Diego. He is active clinically in pulmonary, critical care and sleep medicine.

Dr. W. Cameron Mcguire completed his pulmonary and critical care medicine fellowship at UC San Diego School of Medicine. Dr. McGuire earned his medical degree from Tulane University School of Medicine. His research interests include pulmonary physiology and pulmonary vascular medicine, and their overlap in the intensive care unit.

Footnotes

Declaration of Conflicting Interests

The authors declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: Dr. Malhotra is funded by NIH. He reports income from Livanova, Zoll, Eli Lilly, and Powell Mansfield. He is the cofounder of Clairyon (formerly Healcisio), unrelated to the subject matter. ResMed gave a philanthropic donation to UCSD.

Registration

Registered PROSPERO systematic review #1006973

References

  1. Adler JN, Hughes LA, Vtvilecchia R, & Camargo CA Jr (1998). Effect of skin pigmentation on pulse oximetry accuracy in the emergency department. Academic Emergency Medicine, 5(10), 965–970. [DOI] [PubMed] [Google Scholar]
  2. Ajizian S, McGonigle S, Dove J, Ting Y-J, Sethi R, & Milkes D (2023). 1030: Occult hypoxemia and pulse oximetry performance across skin pigmentation groups. Critical Care Medicine, 51(1), 509. [Google Scholar]
  3. Balmaceda J, Gerber ED, Arnold MJ, Williams DA, Snyder K, & Pandya SM (2022). Racial bias in pulse oximetry in acute hypoxemic respiratory failure. CHEST, 162(4), A1168. [Google Scholar]
  4. Bangash MN, Hodson J, Evison F, Patel JM, Johnston AM, Gallier S, Sapey E, & Parekh D (2022). Impact of ethnicity on the accuracy of measurements of oxygen saturations: A retrospective observational cohort study. EClinicalMedicine, 48. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Barker SJ, & Wilson WC (2023). Racial effects on Masimo pulse oximetry: A laboratory study. Journal of Clinical Monitoring and Computing, 37(2), 567–574. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bickler PE, Feiner JR, & Severinghaus JW (2005). Effects of skin pigmentation on pulse oximeter accuracy at low saturation. Anesthesiology, 102(4), 715–719. [DOI] [PubMed] [Google Scholar]
  7. Blanchet M-A, Mercier G, Delobel A, Nayet E, Bouchard P-A, Simard S, L’Her E, Branson RD, & Lellouche F (2023). Accuracy of multiple pulse oximeters in stable critically ill patients. Respiratory Care, 68(5), 565–574. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bothma P, Joynt G, Upman J, Hon H, Mathala B, Scribante J, & Kromberg J (1996). Accuracy of pulse oximetry in pigmented patients. South African Medical Journal, 86(5), 594–596. [PubMed] [Google Scholar]
  9. Bouvet G, Voisin G, Berthiaume Y, Rizk A, Bacon S, Wardini R, Chan-Thim E, Lavoie K, de Lorimier M, & Pepin V (2012). Abstracts from the Conjoint Annual Meeting of L’Association des pneumologues de la province de Québec and le Réseau en santé respiratoire du FRSQ [The association of pulmonologists of the province of quebec and the FRSQ respiratory health network]. Canadian Respiratory Journal, 19(5), e35–e65. [Google Scholar]
  10. Burnett GW, Stannard B, Wax DB, Lin H-M, Pyram-Vincent C, DeMaria S, & Levin MA (2022). Self-reported race/ethnicity and intraoperative occult hypoxemia: A retrospective cohort study. Anesthesiology, 136(5), 688–696. [DOI] [PubMed] [Google Scholar]
  11. Chesley CF, Lane-Fall MB, Panchanadam V, Harhay MO, Wani AA, Mikkelsen ME, & Fuchs BD (2022). Racial disparities in occult hypoxemia and clinically based mitigation strategies to apply in advance of technological advancements. Respiratory Care, 67(12), 1499–1507. [DOI] [PubMed] [Google Scholar]
  12. Cotton SA, McGuire WC, Hussain A, Pearce AK, Zawaydeh Q, Meehan MD, & Malhotra A (2022). Proning in COVID-19 acute respiratory distress syndrome: Role of paralytics. Critical Care Explorations, 4(2), e0646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Crooks CJ, West J, Morling JR, Simmonds M, Juurlink I, Briggs S, Cruickshank S, Hammond-Pears S, Shaw D, & Card TR (2022). Pulse oximeter measurements vary across ethnic groups: An observational study in patients with COVID-19. European Respiratory Journal, 59(4), 63–67. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Crooks CJ, West J, Morling JR, Simmonds M, Juurlink I, Briggs S, Cruickshank S, Hammond-Pears S, Shaw D, & Card TR (2023). Differential pulse oximetry readings between ethnic groups and delayed transfer to intensive care units. QJM: An International Journal of Medicine, 116(1), 63–67. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Ebmeier S, Barker M, Bacon M, Beasley R, Bellomo R, Chong CK, Eastwood GM, Gilchrist J, Kagaya H, & Pilcher J (2018). A two centre observational study of simultaneous pulse oximetry and arterial oxygen saturation recordings in intensive care unit patients. Anaesthesia and Intensive Care, 46(3), 297–303. [DOI] [PubMed] [Google Scholar]
  16. Fawzy A, Ali H, Dziedzic PH, Potu N, Calvillo E, Golden SH, Iwashyna TJ, Suarez JI, Hager DN, & Garibaldi BT (2024). Skin pigmentation and pulse oximeter accuracy in the intensive care unit: A pilot prospective study. American Journal of Respiratory and Critical Care Medicine, 210(3), 355–358. 10.1164/rccm.202401-0036LE [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Fawzy A, Wu TD, Wang K, Robinson ML, Farha J, Bradke A, Golden SH, Xu Y, & Garibaldi BT (2022). Racial and ethnic discrepancy in pulse oximetry and delayed identification of treatment eligibility among patients with COVID-19. JAMA Internal Medicine, 182(7), 730–738. 10.1001/jamainternmed.2022.1906 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Feiner JR, Severinghaus JW, & Bickler PE (2007). Dark skin decreases the accuracy of pulse oximeters at low oxygen saturation: The effects of oximeter probe type and gender. Anesthesia & Analgesia, 105(6), S18–S23. [DOI] [PubMed] [Google Scholar]
  19. Fitzpatrick TB (1975). Soleil et peau [Sun and skin]. J. Med. Esthet, 2, 33–34. [Google Scholar]
  20. Fong N, Lipnick MS, Behnke E, Chou Y, Elmankabadi S, Ortiz L, Almond CS, Auchus I, Burnett GW, Bisegerwa R, Conrad DR, Hendrickson CM, Hooli S, Kopotic R, Leeb G, Martin D, McCollum ED, Monk EP, Moore KL Jr., . . . Law T (2024). Open access data repository and common data model for pulse oximeter performance data. medRxiv. 10.1101/2024.08.30.24312744 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Gomaa D, Smith M, Blakeman T, Goodman M, & Branson RD (2023). Racial differences in pulse oximetry accuracy: Paired measurements in mechanically ventilated patients. CHEST, 164(4), A1685–A1686. [Google Scholar]
  22. Goodman C (Ed.). (1988). Food and drug administration center for devices and radiological health. In Medical technology assessment directory: A pilot reference to organizations, assessments, and information resources. National Academies Press (US). [PubMed] [Google Scholar]
  23. Gudelunas MK, Lipnick M, Hendrickson C, Vanderburg S, Okunlola B, Auchus I, Feiner JR, & Bickler PE (2024). Low perfusion and missed diagnosis of hypoxemia by pulse oximetry in darkly pigmented skin: A prospective study. Anesthesia & Analgesia, 138(3), 552–561. 10.1213/ane.0000000000006755 [DOI] [PubMed] [Google Scholar]
  24. Hao S, Dempsey K, Matos J, Cox CE, Rotemberg V, Gichoya JW, Kibbe W, Hong C, & Wong A-KI (2024). Utility of skin tone on pulse oximetry in critically ill patients: A prospective cohort study. Critical Care Explorations, 6(9), e1133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Henry NR, Hanson AC, Schulte PJ, Warner NS, Manento MN, Weister TJ, & Warner MA (2022). Disparities in hypoxemia detection by pulse oximetry across self-identified racial groups and associations with clinical outcomes. Critical Care Medicine, 50(2), 204–211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Jubran A (1999). Pulse oximetry. Critical Care, 3, 1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Jubran A, & Tobin MJ (1990). Reliability of pulse oximetry in titrating supplemental oxygen therapy in ventilator-dependent patients. CHEST, 97(6), 1420–1425. 10.1378/chest.97.6.1420 [DOI] [PubMed] [Google Scholar]
  28. Kalra A, Wilcox C, Holmes SD, Tonna JE, Jeong IS, Rycus P, Anders MM, Zaaqoq AM, Lorusso R, Brodie D, Keller SP, Kim BS, Whitman GJR, & Cho SM (2023). Characterizing the racial discrepancy in hypoxemia detection in VV-ECMO: An ELSO registry analysis. Lung. 10.21203/rs.3.rs-3617237/v1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Kalra A, Wilcox C, Holmes SD, Tonna JE, Jeong IS, Rycus P, Anders MM, Zaaqoq AM, Lorusso R, Brodie D, Keller SP, Kim BS, Whitman GJR, & Cho SM (2024). Characterizing the racial discrepancy in hypoxemia detection in venovenous extracorporeal membrane oxygenation: An extracorporeal life support organization registry analysis. Lung, 202(4), 471–481. 10.1007/s00408-024-00711-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Khanna AK, Beard J, Lamminmäki S, Närväinen J, Antaki N, & Yapici HO (2024). Assessment of skin pigmentation-related bias in pulse oximetry readings among adults. Journal of Clinical Monitoring and Computing, 38(1), 113–120. 10.1007/s10877-023-01095-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Lee K, Hui K, Tan W, & Lim T (1993). Factors influencing pulse oximetry as compared to functional arterial saturation in multi-ethnic Singapore. Singapore Medical Journal, 34, 385–385. [PubMed] [Google Scholar]
  32. Leeb G, Auchus I, Law T, Bickler P, Feiner J, Hashi S, Monk E, Igaga E, Bernstein M, & Chou YC (2024). The performance of 11 fingertip pulse oximeters during hypoxemia in healthy human participants with varied, quantified skin pigment. EBioMedicine, 102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Lellouche F (2023). Clinical impact of the oximeters’ inaccuracy (bias and errors) - secondary analysis of the OXYGAP study. Canadian Journal of Respiratory, Critical Care, and Sleep Medicine, 7, 44. [Google Scholar]
  34. Li-wei HL, Saeed M, Talmor D, Mark R, & Malhotra A (2013). Methods of blood pressure measurement in the ICU. Critical Care Medicine, 41(1), 34–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Marlar AI, Knabe BK, Taghikhan Y, Applegate RL, & Fleming NW (2025). Performance of pulse oximeters as a function of race compared to skin pigmentation: A single center retrospective study. Journal of Clinical Monitoring and Computing, 39(1), 119–125. 10.1007/s10877-024-01211-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Mastrototaro JJ, Leabman M, Shumate J, & Tompkins KL (2024). Performance of a wearable ring in controlled hypoxia: A prospective observational study. JMIR Formative Research, 8, e54256. 10.2196/54256 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. McGovern JP, Sasse SA, Stansbury DW, Causing LA, & Light RW (1996). Comparison of oxygen saturation by pulse oximetry and co-oximetry during exercise testing in patients with COPD. CHEST, 109(5), 1151–1155. [DOI] [PubMed] [Google Scholar]
  38. Mendelson Y, Kent JC, Shahnarian A, Welch GW, & Giasi RM (1988). Evaluation of the Datascope ACCUSAT pulse oximeter in healthy adults. Journal of Clinical Monitoring, 4, 59–63. [DOI] [PubMed] [Google Scholar]
  39. Pilcher J, Ploen L, McKinstry S, Bardsley G, Chien J, Howard L, Lee S, Beckert L, Swanney M, & Weatherall M (2020). A multicentre prospective observational study comparing arterial blood gas values to those obtained by pulse oximeters used in adult patients attending Australian and New Zealand hospitals. BMC Pulmonary Medicine, 20, 1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Ries AL, Prewitt LM, & Johnson JJ (1989). Skin color and ear oximetry. CHEST, 96(2), 287–290. [DOI] [PubMed] [Google Scholar]
  41. Schallom M, Prentice D, Sona C, Arroyo C, & Mazuski J (2018). Comparison of nasal and forehead oximetry accuracy and pressure injury in critically ill patients. Heart & Lung, 47(2), 93–99. [DOI] [PubMed] [Google Scholar]
  42. Seitz KP, Wang L, Casey JD, Markus SA, Jackson KE, Qian ET, Self WH, Rice TW, & Semler MW (2022). Pulse oximetry and race in critically ill adults. Critical Care Explorations, 4(9), e0758. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Simonson TS, Baker TL, Banzett RB, Bishop T, Dempsey JA, Feldman JL, Guyenet PG, Hodson EJ, Mitchell GS, & Moya EA (2021). Silent hypoxaemia in COVID-19 patients. The Journal of Physiology, 599(4), 1057–1065. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Sjoding MW, Dickson RP, Iwashyna TJ, Gay SE, & Valley TS (2020). Racial bias in pulse oximetry measurement. The New England Journal of Medicine, 383(25), 2477– 2478. 10.1056/NEJMc2029240 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Sudat SE, Wesson P, Rhoads KF, Brown S, Aboelata N, Pressman AR, Mani A, & Azar KM (2023). Racial disparities in pulse oximeter device inaccuracy and estimated clinical impact on COVID-19 treatment course. American Journal of Epidemiology, 192(5), 703–713. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Thrush D, & Hodges MR (1994). Accuracy of pulse oximetry during hypoxemia. Southern Medical Journal, 87(4), 518–521. [DOI] [PubMed] [Google Scholar]
  47. United States Food and Drug Administration (FDA) (2021). Pulse oximeter accuracy and limitations: FDA safety communication. FDA. [Google Scholar]
  48. United States Food and Drug Administration (FDA) (2022). Pulse oximeter accuracy and limitations: FDA safety communication. FDA. [Google Scholar]
  49. Valbuena VSM, Seelye S, Sjoding MW, Valley TS, Dickson RP, Gay SE, Claar D, Prescott HC, & Iwashyna TJ (2022). Racial bias and reproducibility in pulse oximetry among medical and surgical inpatients in general care in the Veterans Health Administration 2013–2019: Multicenter, retrospective cohort study. BMJ, 378, e069775. 10.1136/bmj-2021-069775 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Verkruysse W, Jaffe MB, Lipnick M, & Zemouri C (2024). Challenges of subjective skin color scales: The case for the use of objective pigmentation measurement methods in regulatory pulse oximetry studies. Anesthesia & Analgesia, 139(2), e15– e17. 10.1213/ane.0000000000007086 [DOI] [PubMed] [Google Scholar]
  51. Wiles M, El-Nayal A, Elton G, Malaj M, Winterbottom J, Gillies C, Moppett I, & Bauchmuller K (2022). Effect of patient ethnicity on the accuracy of peripheral pulse oximetry in patients with COVID-19 pneumonitis requiring mechanical ventilation. Anaesthesia, 77(4). [DOI] [PubMed] [Google Scholar]
  52. Wong A-KI, Charpignon M, Kim H, Josef C, De Hond AA, Fojas JJ, Tabaie A, Liu X, Mireles-Cabodevila E, & Carvalho L (2021). Analysis of discrepancies between pulse oximetry and arterial oxygen saturation measurements by race and ethnicity and association with organ dysfunction and mortality. JAMA Network Open, 4(11), e2131674–e2131674. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Wong B, Yan C, Rivas B, & Fleming N (2023). Possible racial bias in pulse oximetry. Journal of Investigative Medicine, 71(1), 186–188. [Google Scholar]
  54. Zeballos RJ, & Weisman IM (1991). Reliability of noninvasive oximetry in black subjects during exercise and hypoxia. American Review of Respiratory Disease, 144, 1240–1244. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

sup

RESOURCES