Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2020 Sep 11;159(2):724–732. doi: 10.1016/j.chest.2020.08.2104

Smartphone Biosensor With App Meets FDA/ISO Standards for Clinical Pulse Oximetry and Can Be Reliably Used by a Wide Range of Patients

Sara H Browne a,b,, Mike Bernstein c, Samuel C Pan a, Jonathan Gonzalez Garcia a, Craig A Easson d, Chung-Che Huang d, Florin Vaida a
PMCID: PMC7485544  PMID: 32926871

Abstract

Background

Millions of smartphones contain a photoplethysmography (PPG) biosensor (Maxim Integrated) that accurately measures pulse oximetry. No clinical use of these embedded sensors is currently being made, despite the relevance of remote clinical pulse oximetry to the management of chronic cardiopulmonary disease, and the triage, initial management, and remote monitoring of people affected by respiratory viral pandemics, such as severe acute respiratory syndrome coronavirus 2 or influenza. To be used for clinical pulse oximetry the embedded PPG system must be paired with an application (app) and meet US Food and Drug Administration (FDA) and International Organization for Standardization (ISO) requirements.

Research Question

Does this smartphone sensor with app meet FDA/ISO requirements? Are measurements obtained using this system comparable to those of hospital reference devices, across a wide range of people?

Study Design and Methods

We performed laboratory testing addressing ISO and FDA requirements in 10 participants using the smartphone sensor with app. Subsequently, we performed an open-label clinical study on 320 participants with widely varying characteristics, to compare the accuracy and precision of readings obtained by patients with those of hospital reference devices, using rigorous statistical methodology.

Results

“Breathe down” testing in the laboratory showed that the total root-mean-square deviation of oxygen saturation (Spo2) measurement was 2.2%, meeting FDA/ISO standards. Clinical comparison of the smartphone sensor with app vs hospital reference devices determined that Spo2 and heart rate accuracy were 0.48% points (95% CI, 0.38-0.58; P < .001) and 0.73 bpm (95% CI, 0.33-1.14; P < .001), respectively; Spo2 and heart rate precision were 1.25 vs reference 0.95% points (P < .001) and 5.99 vs reference 3.80 bpm (P < .001), respectively. These small differences were similar to the variation found between two FDA-approved reference instruments for Spo2: accuracy, 0.52% points (95% CI, 0.41-0.64; P < .001) and precision, 1.01 vs 0.86% points (P < .001).

Interpretation

Our findings support the application for full FDA/ISO approval of the smartphone sensor with app tested for use in clinical pulse oximetry. Given the immense and immediate practical medical importance of remote intermittent clinical pulse oximetry to both chronic disease management and the global ability to respond to respiratory viral pandemics, the smartphone sensor with app should be prioritized and fast-tracked for FDA/ISO approval to allow clinical use.

Trial Registry

ClinicalTrials.gov; No.: NCT04233827; URL: www.clinicaltrials.gov;

Key Words: chronic disease management, remote clinical pulse oximetry, respiratory viral pandemics, smartphone sensor with app


FOR EDITORIAL COMMENT, SEE PAGE 477

Millions of smartphones contain photoplethysmography (PPG) biosensors with applications (apps) that accurately measure heart rate (HR) and blood oxygen saturation (Spo2).1,2 High-grade PPG biosensors (Maxim Integrated) are currently integrated into Android smartphones worldwide, totaling more than 300 million smartphones.3 The PPG sensor measures the distension of arteries and arterioles in the subcutaneous tissue, due to blood flow with each cardiac cycle. As blood flows through the vessels, the pulse pressure is detected by illuminating the skin with the light from two light-emitting diodes. The amount of light transmitted, absorbed, or reflected to a photodiode is measured. A signal-processing app containing an algorithm then detects and interprets the PPG signal to determine Spo 2 and HR values. Finally, additional software within the app then accesses and displays individual biosensor readings on the smartphone screen.

Direct use of this technology for remote clinical pulse oximetry is highly relevant to the expanding integration of digital technologies into clinical care models. The usefulness of smartphones with embedded sensors and apps for intermittent clinical pulse oximetry is broad, potentially supporting the effective management of a wide range of chronic cardiopulmonary disease, including congestive heart failure, COPD, or acute disorders such as pneumonia, and postoperative recovery.4, 5, 6 Smartphone sensors with apps for clinical pulse oximetry would allow patients to gather and track their own data to inform outpatient clinic or telemedicine visits, and increase communication with health-care providers, which may lead to earlier outpatient interventions that potentially reduce both morbidity and hospitalization risks. Smartphone sensors with apps for clinical pulse oximetry may be of considerable importance in low- and middle-income countries (LMICs) with less developed health-care infrastructure, enabling management support of infections prevalent in these settings such as AIDS-defining pneumonias like pneumocystis or cryptococcosis, TB, as well as chronic cardiopulmonary diseases.7, 8, 9, 10, 11, 12, 13

Further, during events of pandemic respiratory viral spread, such as the current severe acute respiratory syndrome coronavirus 2 or influenza, remote clinical pulse oximetry may support not only triage, but also the initial treatment of symptomatic adults.14 , 15 Spo 2 correlates with lung injury15 and in conjunction with clinical examination may be used to differentiate those who require close monitoring and hospital admission from those with milder disease.6 In circumstances of home quarantine, remote clinical pulse oximetry could allow patients to monitor their symptoms and include objective reports of Spo 2 and HR. Smartphone sensors with apps could allow measurements collected, with patient permissions, to be connected to regional and national data hubs allowing large-scale regional data analysis in near-real time. This could allow estimates of resource distribution, such as hospital beds needed, and could provide means for medical centers to contact and communicate with patients in home quarantine. The availability of smartphone sensors with apps for clinical pulse oximetry under such circumstances could be critical to LMICs, where medical infrastructure is easily overwhelmed.

Despite the existence of such a large number of smartphones embedded with these sensors with apps, no clinical use is being made of these. The clinical pulse oximetry requirement by the US Food and Drug Administration/International Organization for Standardization (FDA/ISO) for PPG sensors is < 3.5% root-mean-square deviation (RMSD) of the Spo 2 value. Herein we present data obtained using a smartphone sensor with app (Maxim Integrated)16 , 17 and report laboratory testing addressing FDA/ISO requirements.

Subsequently, we report findings from an open-label clinical study across a varied population, using a protocol that obtains the internal measurement error for smartphone sensor with app readings and hospital standard of care reference instruments, and allows comparison of measurement sets to analyze both accuracy and precision. Data obtained using the smartphone sensor with app across a varied population help determine how accurately it can be used by the general population.

Methods

Laboratory Testing of Smartphones Embedded With PPG Biosensors Against FDA/ISO Standards

A “breathe down” test, a technique introduced by J. Severinghaus (University of California, San Francisco [UCSF]) and refined by P. Bicker (UCSF),18 , 19 was performed in the certified Bickler-Ye Laboratory (Shenzhen University, China) under IRB PN-2019-01 and international human subjects research standards. The smartphone sensor with app consisted of (1) manufactured part MAX 86100A and (2) an app containing a signal-processing determination algorithm that detects and interprets PPG signal (MaximFast) (Maxim Integrated; and was evaluated as implemented in the ZUK phone [Lenovo]). The test set consisted of 10 volunteers comprising black (three), white (three), and Asian (four) racial/ethnic origins. All volunteers provided written informed consent. In each test volunteers placed their index finger over the smartphone sensor system, and a strap was placed over their hand to maintain it in place. The volunteer breathed a mixture of gases with reduced O2 compared with room air. By manipulating this mixture, the volunteer’s Spo 2 was decreased in steps. At each step, the measurements were allowed to reach a stable plateau, which was held for at least 45 s, before proceeding to the next step. Only the last 30 s of each plateau was used in the data analysis. Data were collected at Spo 2 levels near 100% to about 70%. Spo 2 values were recorded continuously, and an FDA/ISO-approved Masimo Radical reference device (Masimo) recorded Spo 2 simultaneously.

HR measurements data were taken with a ProSim 8 Fluke instrument (Fluke Biomedical).

Laboratory Testing Data Analysis

Each plateau was checked for quality of data and was rejected if the plateau was not stable within ± 2% Spo 2; either the Masimo reference or the test device did not report a value; motion of the volunteer was observed; or perfusion, defined as optical modulation of the transmitted light, was less than 0.5%. Data were analyzed according to methods established by Bland and Altman,20 providing RMSD and a Bland-Altman plot with upper and lower limits of agreement per Bland Altman as described in ISO 80601-2-61:2017.16 A second-order polynomial fit of the data was calculated.

Clinical Study Conduct

An open-label clinical protocol was developed and approved by the University of California, San Diego (UCSD) Human Research Protections Program (#161573). Measures of HR and Spo 2 from “test” units, that is, Android smartphone model ZUK Z2 X (ZUK; Lenovo) with embedded PPG biosensor and app (MAX 86100A and signal-processing algorithm MaximFast) (Maxim Integrated). The phones were enclosed within plastic cases (Fig 1 ) containing a circular indent with raised grooves to guide and stabilize finger placement over the PPG sensor. Vital sign measures were obtained simultaneously using two “reference standard of care” Welch Allyn Spot Vital Signs (WASVS) units [models (91)42NTB and (93)2268; Welch Allyn]. Participants were adults over 18 years of age recruited from UCSD Healthcare or Clinical Trials Units and who provided written informed consent. Initially, participants were recruited from outpatient settings (n = 250). For each participant, four pairs of measurements (total, eight data points) were taken in succession, with no break between sets, in the order indicated in Table 1 (top). This is a repeated measure, nested-factorial design, with three factors (measurement system, left/right index finger, experimental step), instrument units nested within measurement system (two units for each system), and repeated measures over participants.21 This design allows statistical evaluation and comparison of test and reference measurement systems, while correcting for other possible sources of variation, including within-participant vital sign change over four experimental steps, and left and right index finger differences. The hypotheses tested the accuracy and precision between (1) test and reference units, (2) two test units, and (3) two reference units.

Figure 1.

Figure 1

Smartphone with embedded photoplethysmography sensor suite and plastic case with braille guide for digit placement used as the test instrument.

Table 1.

Order and Device Combination Used to Take Simultaneous Heart Rate and Oxygen Saturation Measurements in Outpatient Participants (n = 250) and Inpatient/ED Participants (n = 70)

Setting Step Left Index Finger Right Index Finger
Outpatient (n = 250) 1 Reference unit 1 Reference unit 2
2 Test unit 1 Test unit 2
3 Test unit 2 Reference unit 1
4 Reference unit 2 Test unit 1
Inpatient (n = 70) 1 Test unit 1 Reference unit
2 Reference unit 1 Test unit 1

Subsequently, participants were recruited from inpatient and ED settings (n = 70) and followed a simplified protocol (Table 1, bottom) and no distinction was made between the Welch Allyn reference units, which were used interchangeably. Each measurement period was 2 min in length and respiratory rate was determined. Participant demographic data, including age, height, weight, sex, ethnicity, and hand dominance, were collected. Patient participants were verbally screened for the following comorbidities as exclusion criteria: coarctation of the aorta, severe peripheral vascular disease, severe anemia, severe sickle cell disease, and methemoglobinemia. However, none of the patient participants screened reported having these conditions.

Clinical Study Statistical Methods

Participant demographics were summarized. Erroneous readings with a recorded value of zero indicating a measurement was not obtained were reported and excluded from further analysis. The measurement error of an instrument is summarized by the root-mean-square deviation of prediction, RMSD, which is the square root of the mean-square deviation (error) between the value reported by the instrument and the true value. The RMSD can be further decomposed into the bias and SD of the instrument:

RMSD2=Bias2+SD2 (1)

where bias refers to the systematic (average) deviation of the instrument from the true value, and SD summarizes the variation of the instrument in repeated measures.22 Thus, bias measures accuracy and SD measures precision. The design of our clinical studies allows statistical analyses that tease out separately the bias and the SD of the two measurement systems and four units.

For the outpatient setting, accuracy and precision hypotheses were addressed by fitting linear mixed-effects models to HR and Spo 2 outcomes. The model equation is

Yij=β1+bi+β2×Si+β3×Ti+β4×Ri+β5×Sideij+β6×Stepij+εij (2)

Yij is the outcome (HR or Spo 2) for participant i at measurement j (j = 1, 2, …, 8). S i indicates the measurement system (S i = 1 for test and S i = 0 for reference units); T i indicates which test unit was used, T i = –1/2 for test unit 1, T i = 1/2 if test unit 2 was used (for reference unit measurements, T i = 0); R i indicates the reference units used, R i = –1/2 for unit 1, R i = 1/2 for unit 2 (R i = 0 for test units); Sideij = –0.5 for left-hand and 0.5 for right-hand measurements; Stepij indicates the step of measurement, as in Table 1 (top), but centered at 0; b i is a subject-specific random effect, common to all measurements of participant i; and ε ij is within-participant error, assumed to have a mean of 0. Furthermore, the within-participant errors ε ij were allowed to be autocorrelated, with an autoregressive of order 1 (AR1) structure, and to have variances that differ between groups—specifically, test and reference systems or the four measurement units.23

The predicted biological value of outcome (HR or Spo 2) for generic subject i is β 1 + b i. This allows a “best guess” of outcome based on eight measurements, freed of biases of particular units and of other experimental conditions (side, step), and free of within-participant error ε ij. It corresponds to the reference system (S i = 0) taken as a gold standard. The parameters β 2, β 3, and β 4 address accuracy aspects of hypotheses: β 2 measures the bias of test relative to reference units (by definition the reference unit bias is zero and RMSD = SD); β 3 measures the bias of test units relative to each other; and β 4 measures the bias of reference units relative to each other. The precision aspects are captured by the SD of ε ij, which is allowed to differ between test and reference systems (when comparing the two), or between individual units (when comparing the units). The R i = ±1/2 parametrization was chosen so that the gold standard is the average of the two reference units, and so that β 4 measures the bias between them. Similarly, T i = ±1/2 so that the bias between systems compares the average of the two test units vs the average of the two reference units, and so that β 3 measures the bias between the two test units.

The additional experimental conditions Side and Step are not central to our hypotheses, but they act as potential confounders, and may help in estimating the bias and precision of the instruments. These two factors were excluded if not significant at the 0.20 level in backward model selection.24 The statistical comparisons and 95% CIs used the Wald test for the mean (accuracy) coefficients, and the likelihood ratio test for the SD (precision) parameters. The choice between AR1 and independence correlation structure for within-participant errors ε ij used the likelihood ratio test. A sensitivity analysis was performed, removing outlier observations defined as ≥ 3 times their system-specific SD. These large outliers are likely due to the incorrect use of the system (eg, finger placement), and the sensitivity analysis results reflect the accuracy of the two systems under appropriate use conditions.

For the inpatient setting respiratory rates were summarized, with accuracy and precision evaluation simplified to two steps, using a single test and reference unit based on a similar mixed-effects models as for the outpatient setting but without terms for units T i and R i. Because of the small number of outliers for this setting a similar sensitivity analysis is not reported.

Statistical significance is considered at the P < .05 level. All analyses were conducted with the R statistical language25 and used the nlme package.26

Results

Laboratory Testing of Biosensor Plus App

Test data were collected from 10 participants during “breathe down” testing (see Methods), comparing in-phone sensors and Masimo Radical pulse oximetry device Spo 2 values. A Bland-Altman plot of the entire data set with second-order polynomial line fit is shown in Figure 2 . In this study eight plateaus were rejected for lack of stability within two Spo 2 counts, rejecting the corresponding 16 data points. All 10 subjects completed the study, and there were no adverse events.

Figure 2.

Figure 2

Plot of data collected from 10 volunteers during a “breathe down test” (see Methods). Each point on the plot represents one pair of a phone reading and Masimo Radical 7 reading. The root-mean-square deviation of bias between the two instruments in this test was 2.2 oxygen saturation counts. The upper and lower limits of agreement per Bland and Altman20 are shown as dashed lines. A second-order polynomial fit of the data is shown as a solid line. Spo2 = oxygen saturation.

The RMSD total average for data from all 10 volunteer participants tested was 2.2%. Heart rate error based on analysis of Fluke simulation testing showed that the heart rate error was < 2 bpm RMSD.

Clinical Study of Test-vs-Reference Measurements

Participant demographics confirmed a broad range of age and race/ethnicity (Table 2 ).

Table 2.

Demographics of Participants in Both Outpatient and Hospital Settings

Demographic Characteristic All Participants (N = 320) Participants in Outpatient Setting (n = 250) Participants in Hospital Setting (n = 70)
Age, mean (SD), y 44.4 (14.3) 46.1 (13.1) 38.5 (16.5)
 Range 18-89 19-81 18-89
Sex, No. (%)
 Male 229 189 (75.6) 40 (57.1)
 Female 89 59 (23.6) 30 (42.9)
 Other 2 2 (0.8) 0 (0)
Race and ethnicity, No. (%)
 White 129 99 (39.6) 30 (42.9)
 Hispanic 104 80 (32.0) 24 (34.3)
 Black 41 37 (14.8) 4 (5.7)
 Asian 24 13 (5.2) 11 (15.7)
 Other 22 21 (8.4) 1 (1.4)
Hand dominance, No. (%)
 Right 260 204 (81.6) 56 (80)
 Left 42 35 (14.0) 7 (10)
 Ambidextrous 18 11 (4.4) 7 (10)

In the outpatient setting test units gave 960/1,000 (96.0%) valid HR and 955/1,000 (95.5%) valid Spo 2 readings. Reference units gave 997/1,000 (99.7%) valid HR and 997/1,000 (99.7%) valid Spo 2 readings. The range of valid HR readings was 41 to 131 bpm for test, and 38 to 126 bpm for reference units. The range of valid Spo 2 readings was 90% to 99% for test, and 92% to 100% for reference units. In the inpatient/ED setting test units gave 137/140 (97.9%) valid HR readings (range, 41-131 bpm) and 136/140 (97.1%) valid Spo 2 readings (range, 90%-99%). Reference units gave 138/140 (98.6%) valid HR readings (range, 38-126 bpm) and 138/140 (98.6%) valid Spo 2 readings (range, 92%-100%). The respiratory rate averaged 12 breaths/min (median, 10; SD, 4.1; range, 7-30 breaths/min).

Bias (Accuracy) of Measurement Systems

In the outpatient setting for HR, a significant bias was found for the test relative to reference measurement system (bias β 2 = 0.73 bpm [95% CI, 0.33-1.14]; P < .001) (Table 3 , Fig 3 ). Significant bias was found between the two reference units (β 4 = –0.33 bpm [95% CI, –0.66 to 0.00]; P = .049) but not between the two test units (β 3 = –0.36 bpm [95% CI, –0.99 to 0.27]; P = .27) (Table 4 ).

Table 3.

Comparison of Bias (Accuracy), SD (Precision), and Root-Mean-Square Deviation of Test and Reference Measurement Systems, for Heart Rate and Oxygen Saturation, in Outpatient (n = 250) and Inpatient (n = 70) Studiesa

Study Setting Bias (Accuracy)
Test vs Reference System
SD (Precision)
Each System
Root-Mean-Square Deviation
Each System
Testb Referenceb P Value Test Reference P Value Test Reference
Outpatient study
 Heart rate, bpm (95% CI) 0.73 (0.33 to 1.14) 0 (ref) < .001 5.99 (5.61 to 6.40) 3.80 (3.56 to 4.06) < .001 6.03 (5.47 to 6.60) 3.80 (3.56 to 4.06)
 Spo2, % points (95% CI) 0.48 (0.38 to 0.58) 0 (ref) < .001 1.25 (1.17 to 1.33) 0.95 (0.89 to 1.01) < .001 1.34 (1.21 to 1.47) 0.95 (0.89 to 1.01)
Inpatient study
 Heart rate, bpm (95% CI) –0.19 (–1.59 to 1.22) 0 (ref) .79 6.58 (5.78 to 7.49) 4.99 (4.39 to 5.69) .004 6.58 (4.94 to 8.23) 4.99 (4.39 to 5.69)
 Spo2, % points (95% CI) –0.94 (–1.41 to –0.47) 0 (ref) < .001 2.62 (2.31 to 2.97) 0.89 (0.78 to 1.01) < .001 2.78 (2.21 to 3.36) 0.89 (0.78 to 1.01)

The bias comparison used the Wald test of mixed-effects linear model. ref = reference; Spo2 = oxygen saturation.

a

The reference system by definition has no bias and RMSD = SD. Bias and SD correspond to β2 and SD (εij), respectively, in model Equation 2 (see Methods).

b

Test = in-phone measurement system; Reference = Spot Vital Signs measurement system (Welch Allyn).

Figure 3.

Figure 3

A and B, Comparison of root-mean-square deviation (RMSD) and SD of the test (in-phone) and reference (Spot Vital Signs unit; Welch Allyn) measurement systems, for heart rate (A) and Spo2 (B), in the outpatient study (n = 250) and inpatient study (n = 70). The error bars correspond to 95% CIs. RMSD decomposes into SD and bias (see Methods, Equation 1). The reference system has no bias, by definition, and thus RMSD equals SD. See Figure 2 legend for expansion of abbreviation.

Table 4.

Comparison of Bias (Accuracy) and SD (Precision) for Heart Rate and Oxygen Saturation Within Test Unit (Smartphone Sensor With App) and Within Reference Unit (Welch Allyn Spot Vital Signs) Measurement Systems in Outpatient Study (n = 250)a,b

Vital Sign Model Bias (Accuracy) Within System
SD (Precision) Within System
Unit 1-Unit 2 P Value Unit 1 Unit 2 P Value
Heart rate, bpm Test units –0.36 (–0.99 to 0.27) .27 6.92 (6.42 to 7.46) 4.95 (4.60 to 5.34) < .001
Reference units –0.33 (–0.66 to 0.00) .049 3.86 (3.58 to 4.16) 3.65 (3.39 to 3.94) .27
Spo2, % points Test units –0.06 (–0.22 to 0.10) .43 1.26 (1.04 to 1.52) 1.25 (1.03 to 1.51) .90
Reference units 0.52 (0.41 to 0.64) < .001 1.01 (0.84 to 1.22) 0.86 (0.71 to 1.04) .001

The bias comparison used the Wald test of mixed-effects linear model. The SD comparison used the likelihood ratio test of the mixed-effects linear model. See Table 3 legend for expansion of abbreviation.

a

Bias corresponds to β3 (test units) and to β4 (reference units), and SD corresponds to SD (εij) in model Equation 2 (see Methods).

b

Test units, smartphone sensor with app; reference units, Spot Vital Signs (Welch Allyn).

For Spo 2, the test units had a significant positive bias compared with reference units (β 2 = 0.48% points [95% CI, 0.38-0.58% points]; P < .001) (Table 3). No significant bias was found between the two test units (β 3 = –0.06% [95% CI, –0.22 to 0.10]; P = .43). However, a significant bias was found between the two reference units (β 4 = 0.52 points [95% CI, 0.41-0.64 points]; P < .001) (Table 4).

In the inpatient/ED setting for HR, no significant bias (accuracy) was found for the test relative to the reference (Table 3, Fig 3A). For Spo 2, test units had significantly lower readings than reference units (Table 3, Fig 3B).

Precision (Variation) of Measurement Systems

In the outpatient setting for HR, the test measurement system had significantly higher variation (worse precision) than the reference system (SD = 5.99 bpm [95% CI, 5.61-6.40 bpm] vs 3.80 bpm [95% CI, 3.56-4.06 bpm]; P < .001) (Table 3, Fig 3). The precision differed significantly between the two test units: SD = 6.92 bpm for test unit 1 and 4.95 bpm for test unit 2 (P < .001). The precision did not differ significantly between the two reference units (P = .27) (Table 4). Similar findings were observed in the inpatient/ED setting (Table 4).

For Spo 2, the test measurement system had significantly higher variation (worse precision) than the reference system (SD = 1.25% points [95% CI, 1.17-1.33 points] vs 0.95 points [95% CI, 0.89-1.01 points]; P < .001) (Table 3, Fig 3). The precision did not differ between the two test units (1.26 vs 1.25 points; P = .90); however, it differed significantly between the two reference units (SD = 1.01 and 0.86 points; P = .001) (Table 4). Similar findings were observed in the inpatient/ED setting (Tables 3 and 4, Figs 3A and 3B).

Sensitivity Analysis

  • 1.

    Test vs reference systems had the same accuracy in HR measurement, with higher precision for the reference, whereas a small significant positive bias in accuracy of Spo 2 measurement persisted, but with similar precision.

  • 2.

    Reference units had a small significant bias observed in both HR and Spo 2 measures relative to each other, with a small significant difference in precision found in both HR and Spo 2 between units.

  • 3.

    Test units had no difference in accuracy or precision of HR, but they had a small significant difference in precision for Spo 2 (e-Tables 1, 2).

Discussion

Results from certified laboratory testing indicate the smartphone sensor with app tested met FDA/ISO standards for pulse oximetry for reflective sensors16 , 17 compared with FDA/ISO-approved reference. The testing facility calibrates and tests professional medical pulse oximeters before being cleared for sale by the US FDA and other, worldwide regulatory bodies. Full FDA/ISO approval standards also prescribe the testing we report in addition to at least 200 data points referenced to blood sample analysis.

The smartphone sensor with app tested is embedded within millions of smartphones worldwide currently.3 To evaluate whether this system could be used by a wide range of people to provide reliable, robust clinical measurements, we used a low-risk, inexpensive, but rigorous protocol and statistical analysis to compare measurement accuracy and precision with that of hospital medical reference devices. Our methodology also allowed determination of accuracy and precision of measurements within each system. The model used a predicted biological value based on eight sets of measurements within the same individual to produce a “true biological reading” free of bias associated with any one instrument.

In both outpatient and inpatient settings, the error of measurement (RMSD) was driven by measurement precision, with bias having a minor role. In the outpatient setting we found a small positive bias in the accuracy of HR and Spo 2 measurement by the smartphone sensor with app, which disappeared in the sensitivity analysis of HR, but persisted in Spo 2 measurement. Precision of the smartphone sensor with app was slightly less than that of reference units for HR and Spo 2, a difference that persisted only for Spo 2 following sensitivity analysis. In the hospital/ED setting the accuracy for HR was the same for both “test” and “reference” systems. Spo 2 measurement, however, showed a small but consistent bias. However, our analysis revealed similar significant differences in accuracy and precision between the two Welch Allyn reference devices. This provides clarification that the consequence of small but significant test unit differences detected is of no clinical importance, and tolerable within high-level medical settings. The reference instruments include FDA-approved devices from two different manufacturers, Masimo and Nelcor, that noninvasively measure oxygen saturation by a red and infrared light source, photo detectors, and a probe to transmit light through a translucent, pulsating arterial bed in the digit; this may explain observed measurement differences.

Clinical investigations used an inexpensive plastic cell phone case to allow guidance of the digit over the smartphone sensor with app. For adequate measures of Spo 2, exposure of the sensor to maximal areas of capillary circulation in the finger pulp is essential and the digit needs to remain immobile for approximately 30 s. Our data indicate that the precision (measurement variation) of smartphone pulse oximetry was worse when participants had to hold their finger on the sensor for 2 min (inpatient/ED setting), likely explained by the difficulty associated with maintaining a digit in place for 2 min. In our experience, PPG signal should be reliably obtained within 30 to 60 s of finger placement. The smartphone sensor with app tested in this study had no refinements for assessing PPG quality or excluding meaningless readings. Improved app software, performing basic sensitivity analyses allowing evaluation of finger contact or outlier values necessitating repeated measurements, could ensure even greater ease of use and reading reliability. In the real world, people will use the smartphone with embedded biosensor plus app as a spot checker; it is designed for the user to hold their finger in place for approximately 30 s and obtain a single reading.

Smartphone sensors with apps enabling clinical pulse oximetry over multiple geographical locations on demand are of immense practical medical importance globally, particularly in LMICs. During respiratory viral pandemics, smartphone sensors with apps allowing clinical pulse oximetry may support medical practitioners in triage and initial clinical management.14 , 15 Moreover, smartphone sensors with apps could easily be connected to regional hospitals and national data hubs, allowing large-scale regional data analysis in near-real time to estimate resource requirements, and may empower patients by storing intermittent historical remote data to present to hospital triage or medical practitioners, and could support management of chronic cardiopulmonary diseases and postoperative clinical follow-up care, including lung transplantation.

Conclusion

Our data show that the smartphone sensor with app tested met laboratory FDA/ISO standards and could be used to obtain highly accurate and repeatable measurements across a varied population. Full FDA/ISO approval would require additional laboratory testing to incorporate at least 200 data points referenced to blood sample analysis, which could be completed in a few weeks. Given the immediate practical medical importance of remote intermittent clinical pulse oximetry, industry should be encouraged and supported to pursue full FDA/ISO approval of this smartphone sensor with app. This approval should be prioritized and fast-tracked by the FDA and ISO.

Data Availability

The data files are held by UCSD in a data repository. For access, please e-mail the Antiviral Research Center (AVRC) Regulatory Group: avrcregulatory@ucsd.edu.

Acknowledgments

Author contributions: S. H. B. takes responsibility for the content of the manuscript, including the data and analysis. Overall study concept design and acquisition of sponsorship: S. H. B., F. V., and M. B. Data acquisition, analysis, and/or interpretation of data: S. H. B., F. V., M. B., S. C. P., J. G. G., and C.-C. H. Drafting of the work: S. H. B., F. V., and M. B. C. A. E. and C.-C. H. contributed expertise and edits to the content of this manuscript.

Financial/nonfinancial disclosures: The authors have reported to CHEST the following: C.-C. H. and C. A. E. were employees of Maxim Integrated during the course of this study. None declared (S. H. B., M. B., S. C. P., J. G. G., F. V.).

Role of sponsors: The sponsor had no role in the design of the study, the collection and analysis of the data, or the preparation of the manuscript.

Other contributions: Specialists in Global Health (SiGH) (https://sigh.global) initiated this collaboration and brought together the team that produced this research, having identified the potential contribution to global health this work could make.

Additional information: The e-Tables can be found in the Supplemental Materials section of the online article.

Footnotes

FUNDING/SUPPORT: This work was funded by a National Institutes of Health grant supplement [R01MH110057-04S] to S. H. B., a grant from Maxim Integrated to cover S. C. P., and a Technology Award from nonprofit Specialists in Global Health (https://sigh.global/) to J. G. G.

Supplementary Data

e-Online Data
mmc1.pdf (216.1KB, pdf)

References

  • 1.Alafeef M., Fraiwan M. Smartphone-based respiratory rate estimation using photoplethysmographic imaging and discrete wavelet transform. J Ambient Intell Human Comput. 2020;11:693–703. [Google Scholar]
  • 2.Ansermino J.M. Universal access to essential vital signs monitoring. Anesth Analg. 2013;117(4):883–890. doi: 10.1213/ANE.0b013e3182a1f22f. [DOI] [PubMed] [Google Scholar]
  • 3.Kooistra J. Newzoo’s 2018 global mobile market report: insights into the world’s 3 billion smartphone users. September 11, 2018. https://newzoo.com/insights/articles/newzoos-2018-global-mobile-market-report-insights-into-the-worlds-3-billion-smartphone-users/
  • 4.Hervás R., Fontecha J., Ausín D., Castanedo F., Bravo J., López-de-Ipiña D. Mobile monitoring and reasoning methods to prevent cardiovascular diseases. Sensors (Basel) 2013;13(5):6524–6541. doi: 10.3390/s130506524. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Tomasic I., Tomasic N., Trobec R., Krpan M., Kelava T. Continuous remote monitoring of COPD patients: justification and explanation of the requirements and a survey of the available technologies. Med Biol Eng Comput. 2018;56(4):547–569. doi: 10.1007/s11517-018-1798-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Kakria P., Tripathi N.K., Kitipawang P. A real-time health monitoring system for remote cardiac patients using smartphone and wearable sensors. Int J Telemed Appl. 2015;2015:373474. doi: 10.1155/2015/373474. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Foran M., Ahn R., Novik J., et al. Prevalence of undiagnosed hypoxemia in adults and children in an under-resourced district hospital in Zambia. Int J Emerg Med. 2010;3(4):351–356. doi: 10.1007/s12245-010-0241-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Maartens G., Stewart A., Griesel R., et al. Development of a clinical prediction rule to diagnose Pneumocystis jirovecii pneumonia in the World Health Organization’s algorithm for seriously ill HIV-infected patients. South Afr J HIV Med. 2018;19(1):851. doi: 10.4102/sajhivmed.v19i1.851. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Chouaid C., Maillard D., Housset B., Febvre M., Zaoui D., Lebeau B. Cost Effectiveness of noninvasive oxygen saturation measurement during exercise for the diagnosis of Pneumocystis carinii pneumonia. Am J Respir Dis. 1993;147(6 Pt 1):1360–1363. doi: 10.1164/ajrccm/147.6_Pt_1.1360. [DOI] [PubMed] [Google Scholar]
  • 10.McCollum E.D., King C., Deula R., et al. Pulse oximetry for children with pneumonia treated as outpatients in rural Malawi. Bull World Health Organ. 2016;94(12):893–902. doi: 10.2471/BLT.16.173401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Lazzerini M., Sonego M., Pellegrin M.C. Hypoxaemia as a mortality risk factor in acute lower respiratory infections in children in low and middle-income countries: systematic review and meta-analysis. PLoS One. 2015;10(9) doi: 10.1371/journal.pone.0136166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Aggarwal D., Gupta A., Janmeja A.K., Bhardwaj M. Evaluation of tuberculosis-associated chronic obstructive pulmonary disease at a tertiary care hospital: a case-control study. Lung India. 2017;34(5):415–419. doi: 10.4103/lungindia.lungindia_522_16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Chin A.T., Rylance J., Makumbirofa S., et al. Chronic lung disease in adult recurrent tuberculosis survivors in Zimbabwe: a cohort study. Int J Tuberc Lung Dis. 2019;23(2):203–211. doi: 10.5588/ijtld.18.0313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Sala H., Segovia Roca J., Zerbo C., et al. Initial clinical management of symptomatic adult patients during influenza A (H1N1) epidemics. J Emerg Med. 2011;41(4):435–440. doi: 10.1016/j.jemermed.2010.05.093. [DOI] [PubMed] [Google Scholar]
  • 15.Verhoeven D., Teijaro J.R., Farber D.L. Pulse-oximetry accurately predicts lung pathology and the immune response during influenza infection. Virology. 2009;390(2):151–156. doi: 10.1016/j.virol.2009.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.International Organization for Standardization ISO 80601-2-61:2017: Medical electrical equipment—Part 2-61: particular requirements for basic safety and essential performance of pulse oximeter equipment. 2017. https://www.iso.org/standard/67963.html
  • 17.Center for Devices and Radiological Health, US Food and Drug Administration Pulse oximeters: premarket notification submissions [510(k)s]: guidance for industry and Food and Drug Administration staff. March 2013. https://www.fda.gov/regulatory-information/search-fda-guidance-documents/pulse-oximeters-premarket-notification-submissions-510ks-guidance-industry-and-food-and-drug
  • 18.Severinghaus J.W., Naifeh K.H., Koh S.O. Errors in 14 pulse oximeters during profound hypoxia. J Clin Monit. 1989;5(2):72–81. doi: 10.1007/BF01617877. [DOI] [PubMed] [Google Scholar]
  • 19.Severinghaus J.W. Gadgeteering for health care: the John W. Severinghaus lecture on translational science. Anesthesiology. 2009;110(4):721–728. doi: 10.1097/ALN.0b013e31819c44ae. [DOI] [PubMed] [Google Scholar]
  • 20.Bland J.M., Altman D.G. Agreement between methods of measurement with multiple observations per individual. J Biopharm Stat. 2007;17(4):571–582. doi: 10.1080/10543400701329422. [DOI] [PubMed] [Google Scholar]
  • 21.Montgomery D.C. 10th ed. John Wiley & Sons; New York: 2019. Design and Analysis of Experiments; p. 530. [Google Scholar]
  • 22.Hastie T., Tibshirani R., Friedman J. 2nd ed. Springer; New York: 2009. The Elements of Statistical Learning: Data Mining, Inference, and Prediction; p. 24. [Google Scholar]
  • 23.Pinheiro J.C., Bates D.M. Springer; New York: 2000. Mixed-Effects Models in S and S-PLUS; p. 201. [Google Scholar]
  • 24.Maldonado G., Greenland S. Simulation study of confounder-selection strategies. Am J Epidemiol. 1993;138(11):923–936. doi: 10.1093/oxfordjournals.aje.a116813. [DOI] [PubMed] [Google Scholar]
  • 25.R Core Team R . R Foundation for Statistical Computing; Vienna, Austria: 2018. A Language and Environment for Statistical Computing.https://www.R-project.org/ [Google Scholar]
  • 26.Pinheiro J, Bates D, DebRoy S, Sarkar D; R Core Team. nlme: Linear and Nonlinear Mixed Effects Models. R package version 3.1-148. 2020. https://CRAN.R-project.org/package=nlme. Accessed September 25, 2020.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

e-Online Data
mmc1.pdf (216.1KB, pdf)

Data Availability Statement

The data files are held by UCSD in a data repository. For access, please e-mail the Antiviral Research Center (AVRC) Regulatory Group: avrcregulatory@ucsd.edu.


Articles from Chest are provided here courtesy of Elsevier

RESOURCES