Skip to main content
Journal of Clinical Sleep Medicine : JCSM : Official Publication of the American Academy of Sleep Medicine logoLink to Journal of Clinical Sleep Medicine : JCSM : Official Publication of the American Academy of Sleep Medicine
. 2023 Dec 1;19(12):2113–2116. doi: 10.5664/jcsm.10800

Don’t hold PAT: watch for and correct oximetry artifact

David T Plante 1,2,, Meredith E Rumble 1,2
PMCID: PMC10692927  PMID: 37677072

Abstract

This study evaluated the accuracy of the algorithmic oxygen saturation (SpO2) nadir detection of WatchPAT (Zoll/Itamar, Caesarea, Israel) compared with visual inspection in a real-world setting. SpO2 tracings for 209 consecutive adult WatchPAT recordings were reviewed for SpO2 artifact, with erroneous SpO2 data removed manually. Error rates for SpO2 minima were determined across all studies, and relationships between correct and erroneous studies examined. The overall error rate for SpO2 nadir was 22.5%. Erroneous studies had overall less time spent at SpO2 ≤ 88%, higher true SpO2 nadir, lower mean body mass index, and greater artifact time; however, these variables were not associated with the magnitude of discrepancy between manual and algorithmically derived SpO2 minima. These data demonstrate that SpO2 nadir determined by WatchPAT algorithms should not be considered universally accurate. Like other home sleep apnea tests, visual inspection and manual correction of the study data are often required to derive accurate clinical results.

Citation:

Plante DT, Rumble ME. Don’t hold PAT: watch for and correct oximetry artifact. J Clin Sleep Med. 2023;19(12):2113–2116.

Keywords: WatchPAT, tonometry, sleep apnea, hypoxemia

INTRODUCTION

The ability to reliably measure oxygenation is a prerequisite for successful sleep studies that assess for presence and severity of sleep-disordered breathing. With the rise of portable monitoring for obstructive sleep apnea (OSA), the need to have accurate and reliable pulse oximetry measurements during unattended studies is particularly important, as there is no technician to verify the signal in real time. Even during attended polysomnography (PSG), the veracity of the oximetry signal is an important issue, as it is subject to several sources of artifact such as motion and low perfusion states.1

While signal processing methods have been developed to automatically remove artifact from pulse oximetry data, visual inspection of the raw data is still required to verify the accuracy of measured hypoxemia during sleep testing.2 As the field moves toward increased automation of sleep stage and event scoring,3 evaluation of the real-world performance of automated algorithms becomes a critical aspect of clinical care and research. In the case of hypoxemia associated with OSA, accurately determining the oxygen saturation (SpO2) nadir is important for risk stratification and prediction of several health-related outcomes, including postoperative complications,4 insulin resistance/metabolic syndrome,5 and myocardial infarction6 in OSA.

The WatchPAT is a Food and Drug Administration–approved device that utilizes peripheral arterial tonometry, pulse oximetry, accelerometry, and other ancillary signals to diagnose OSA. Several studies have compared the accuracy of WatchPAT to PSG.7 In the last several years and during the course of the coronavirus disease 2019 (COVID-19) pandemic, WatchPAT has become increasingly utilized in the practice of sleep medicine, and is considered a viable option for home sleep testing in those for whom it is not contraindicated.8 WatchPAT uses automated algorithms to remove failed data from the recording to estimate sleep and breathing parameters, including data from the pulse oximeter. However, because of inherent limitations of all algorithmic approaches to artifact detection when compared with visual inspection, this study was conducted to evaluate the real-world accuracy of the WatchPAT in determining minimal SpO2 values in clinical patients evaluated for OSA.

METHODS

A total of 209 consecutive WatchPAT studies conducted in adults (age ≥ 18 years) between November 11, 2022, and February 15, 2023, at Wisconsin Sleep, the sleep laboratory and clinic of the University of Wisconsin–Madison, that were deemed interpretable for clinical purposes by the interpreting physician were evaluated. Study procedures were reviewed and approved by the University of Wisconsin Health Sciences Institutional Review Board.

WatchPAT studies were performed using WatchPAT 300 devices and analyzed using zzzPAT (Zoll/Itamar, Caesarea, Israel). Each study underwent prospective evaluation and visual examination of the pulse oximetry signal by a board-certified sleep medicine provider who was also a member of the American Academy of Sleep Medicine (AASM) Scoring Manual Committee (D.T.P.). SpO2 was considered probable artifact if the decline was considered too sudden to be physiological and/or there was probable motion causing artifact. Intervals of artifact were manually removed within the zzzPAT software prior to final study interpretation. Studies were dichotomized by the congruence of the SpO2 nadir generated automatically vs the value derived after second-order visual inspection. If these values were equivalent, the automated SpO2 nadir was considered accurate; if different, the automated SpO2 nadir was considered erroneous.

The SpO2 nadir automatically generated by the WatchPAT scoring algorithm, SpO2 nadir after visual inspection, time spent at SpO2 ≤ 88%, apnea-hypopnea index (AHI; using ≥ 3% oxygen desaturation), total sleep time, technically valid sleep time (upon which respiratory indices were calculated), age, sex, and body mass index (BMI) of each patient were recorded. The overall error rate for SpO2 nadir was calculated as the proportion of erroneous studies among all evaluated studies. Comparisons of age, sex, BMI, AHI, total time spent at SpO2 ≤ 88%, and the amount of artifactual time derived by the WatchPAT algorithm (total sleep time minus technically valid sleep time) between erroneous and accurate studies were compared using 2-sample t tests and tests of proportions, where appropriate. Additional relationships between severity measures and degree of SpO2 nadir measurement error were examined using Pearson’s correlation on an ad hoc basis. Statistics were performed using JMP Pro version 15.0.0 (SAS Institute Inc., Cary, NC, USA).

RESULTS

The overall error rate for SpO2 nadir was 22.5% (47 out of 209) of studies. The average difference between erroneous and corrected SpO2 values was 6.6% ± 5.3% (range: 1% to 25%) (Figure 1). This occurred despite a significant relationship between correct and algorithmically derived SpO2 minima across all participants (r = .88, P < .0001). Among erroneous studies, the amount of additional data removed was 21.8 ± 49.1 seconds per study: 48.9% (23 out of 47) resulted in a change in SpO2 nadir from a value at or below 88% to above 88%; 34% (16 out of 47) resulted in a change in SpO2 nadir from below 90% to above 90%.

Figure 1. Erroneous SpO2 minima detected by WatchPAT.

Figure 1

(A) Individual differences between erroneous and corrected SpO2 nadirs across all individual participants (n = 47). Participant denoted by blue asterisks is shown in detail in parts B and C as an example. (B) 100-s SpO2 tracing derived from the native WatchPAT algorithm. Note failure of the algorithm to fully exclude artifactual data, which resulted in a reported SpO2 nadir of 73%. (C) Manual expansion of the excluded artifactual segment (denoted by the white text box) after visual inspection resulted in a change in SpO2 nadir from 73% to 87% for the study. SpO2 = oxygen saturation.

There were no differences in AHI, age, or sex between studies with correct vs erroneous SpO2 minima (Table 1). BMI was significantly lower and artifact time was significantly higher among erroneous vs correct studies. On average, studies with SpO2 nadirs correctly determined by the WatchPAT algorithm also had lower true SpO2 minima and greater time spent at SpO2 ≤ 88% during the recording than studies with erroneous SpO2 minima. However, among erroneous studies, there were no significant relationships between the magnitude of SpO2 difference between erroneous and corrected values and the time spent at SpO2 ≤ 88% (r = .17, P = .27), true SpO2 nadir (r = .14, P = .27), BMI (r = .08, P = .57), or artifact time (r = .13, P = .39).

Table 1.

Study and participant characteristics for correct and erroneous studies based on SpO2 minima.

Measure Correct (n = 162) Erroneous (n = 47) P
AHI, events/h 20.2 ± 19.8 18.5 ± 15.3 .58
SpO2 ≤ 88%, min 9.5 ± 27.9 0.8 ± 1.5 .03
True SpO2 nadir, % 84.6 ± 8.1 88.1 ± 3.1 .004
Age, y 52.2 ± 17.3 55.4 ± 17.4 .27
Sex, % female 53.1 53.2 .99
BMI, kg/m2 33.2 ± 8.6 29.8 ± 7.4 .02
Artifact time, min 9.1 ± 13.6 20.3 ± 29.7 .0003

Artifact time calculated as total sleep time minus valid sleep time from the WatchPAT algorithm. AHI = apnea-hypopnea index, BMI = body mass index, SpO2 = oxygen saturation.

DISCUSSION

This study demonstrates that the automated SpO2 artifact detection utilized by WatchPAT results in inaccurate SpO2 nadir values in approximately 1 out of every 4.5 studies conducted at a single academic sleep center. Inaccurate SpO2 minima were more likely in those with an overall lower hypoxic burden of OSA and BMI, as well as overall greater artifact time, but were otherwise unrelated to AHI, age, or sex. These results have important implications for the clinical practice of sleep medicine.

The ease of patient use and automated scoring algorithms have likely contributed to the increasing and widespread use of WatchPAT to diagnose OSA. While peripheral arterial tonometry–based technologies are an important advance in the diagnosis and assessment of OSA, it is critical for practicing providers to recognize that, like all sleep technologies, the assumption that “out-of-the-box” use is accurate is likely to result in misdiagnoses and/or inadequate risk stratification. The AASM clinical guidelines on the use of unattended portable monitors to diagnose OSA in adults requires that the raw data be reviewed and edited by qualified sleep technologists and providers.9,10 In practice, however, it is not clear how often sleep centers and providers edit WatchPAT studies. While methods have been proposed to increase the congruence of WatchPAT sleep staging and respiratory event scoring relative to PSG,11 they do not describe methods for managing pulse oximetry artifact.

The design of this study does not compare the performance of the WatchPAT algorithm with PSG or other objective sleep tests, relying instead on visual removal of SpO2 artifact. However, it is noteworthy that, in prior validation studies, the WatchPAT both over- and underestimated SpO2 minima across participants.12 In fact, several studies performed in selected populations have demonstrated PSG-derived SpO2 nadirs to be, on average, below that derived from the WatchPAT.1315 This is despite the strong overall correlation between SpO2 minima derived from PSG and WatchPAT reported in validation studies, which is very similar to what was observed in our dataset.12,13,16

Many of the inaccurate SpO2 nadirs in this study occurred in conjunction with the WatchPAT algorithm identifying some degree of SpO2 artifact but failing to remove sufficient samples to lead to an accurate result (see Figure 1 for an example). Future modifications of the algorithm may enhance the overall accuracy of the technology. In the interim, this study demonstrates that careful review of the SpO2 signal will be required to deliver accurate results and optimal patient care.

DISCLOSURE STATEMENT

Both authors have reviewed and approved the manuscript. Work for this study was performed at the University of Wisconsin-Madison and was not funded. Dr. Plante has received grant support from the National Institute on Aging (NIA), National Institute of Nursing Research (NINR), National Institute of Mental Health (NIMH), American Sleep Medicine Foundation, the Brain and Behavior Research Foundation, Wisconsin Alumni Research Foundation, Alzheimer’s Association, and the University of Illinois at Chicago Occupational and Environmental Health and Safety Education and Research Center/National Institute for Occupational Safety and Health; has served as a consultant for Teva Australia, Harmony Biosciences, Aditum Bio LLC, and Jazz Pharmaceuticals; and served on medical advisory boards for Jazz Pharmaceuticals and Alkermes, all unrelated to the current study. Dr. Rumble has received grant support from Merck, unrelated to the current study.

ABBREVIATIONS

AHI

apnea-hypopnea index

BMI

body mass index

OSA

obstructive sleep apnea

SpO2

oxygen saturation

REFERENCES

  • 1. Jubran A . Pulse oximetry . Crit Care. 2015. ; 19 ( 1 ): 272 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Poets CF , Stebbens VA . Detection of movement artifact in recorded pulse oximeter saturation . Eur J Pediatr. 1997. ; 156 ( 10 ): 808 – 811 . [DOI] [PubMed] [Google Scholar]
  • 3. Goldstein CA , Berry RB , Kent DT , et al . Artificial intelligence in sleep medicine: an American Academy of Sleep Medicine position statement . J Clin Sleep Med. 2020. ; 16 ( 4 ): 605 – 607 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Suen C , Ryan CM , Mubashir T , et al . Sleep study and oximetry parameters for predicting postoperative complications in patients with OSA . Chest. 2019. ; 155 ( 4 ): 855 – 867 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Lee JY , Kim CW , Lee KC , et al . Effect of intermittent hypoxia on metabolic syndrome and insulin resistance in the general male population . Medicina (Kaunas). 2021. ; 57 ( 7 ): 668 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Huang Z , Wu Y , Huang K , Chen P , Chen J , Wang L . The nadir oxygen-specific heart rate response in sleep apnea links with the occurrence of acute myocardial infarction . Front Cardiovasc Med. 2022. ; 9 : 807436 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Yalamanchali S , Farajian V , Hamilton C , Pott TR , Samuelson CG , Friedman M . Diagnosis of obstructive sleep apnea by peripheral arterial tonometry: meta-analysis . JAMA Otolaryngol Head Neck Surg. 2013. ; 139 ( 12 ): 1343 – 1350 . [DOI] [PubMed] [Google Scholar]
  • 8. Kapur VK , Auckley DH , Chowdhuri S , et al . Clinical practice guideline for diagnostic testing for adult obstructive sleep apnea: an American Academy of Sleep Medicine Clinical Practice Guideline . J Clin Sleep Med. 2017. ; 13 ( 3 ): 479 – 504 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Collop NA , Anderson WM , Boehlecke B , et al. Portable Monitoring Task Force of the American Academy of Sleep Medicine . Clinical guidelines for the use of unattended portable monitors in the diagnosis of obstructive sleep apnea in adult patients . J Clin Sleep Med. 2007. ; 3 ( 7 ): 737 – 747 . [PMC free article] [PubMed] [Google Scholar]
  • 10. Rosen IM , Kirsch DB , Chervin RD , et al. American Academy of Sleep Medicine Board of Directors . Clinical use of a home sleep apnea test: an American Academy of Sleep Medicine position statement . J Clin Sleep Med. 2017. ; 13 ( 10 ): 1205 – 1207 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Zhang Z , Sowho M , Otvos T , et al . A comparison of automated and manual sleep staging and respiratory event recognition in a portable sleep diagnostic device with in-lab sleep study . J Clin Sleep Med. 2020. ; 16 ( 4 ): 563 – 573 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Pinto JA , Godoy LB , Ribeiro RC , Mizoguchi EI , Hirsch LA , Gomes LM . Accuracy of peripheral arterial tonometry in the diagnosis of obstructive sleep apnea . Braz J Otorhinolaryngol. 2015. ; 81 ( 5 ): 473 – 478 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. O’Brien LM , Bullough AS , Shelgikar AV , Chames MC , Armitage R , Chervin RD . Validation of Watch-PAT-200 against polysomnography during pregnancy . J Clin Sleep Med. 2012. ; 8 ( 3 ): 287 – 294 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Tanphaichitr A , Thianboonsong A , Banhiran W , Vathanophas V , Ungkanont K . Watch peripheral arterial tonometry in the diagnosis of pediatric obstructive sleep apnea . Otolaryngol Head Neck Surg. 2018. ; 159 ( 1 ): 166 – 172 . [DOI] [PubMed] [Google Scholar]
  • 15. Jen R , Orr JE , Li Y , DeYoung P , Smales E , Malhotra A , Owens RL . Accuracy of WatchPAT for the Diagnosis of Obstructive Sleep Apnea in Patients with Chronic Obstructive Pulmonary Disease . COPD. 2020. ; 17 ( 1 ): 34 – 39 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Özergin Coşkun ZO , Dursun E , Sahin U , et al . A evaluation of peripheral arterial tonometry for the diagnosis of obstructive sleep apnea . ENT Updates. 2018. ; 8 ( 1 ): 19 – 26 . [Google Scholar]

Articles from Journal of Clinical Sleep Medicine : JCSM : Official Publication of the American Academy of Sleep Medicine are provided here courtesy of American Academy of Sleep Medicine

RESOURCES