Fever is an important early sign of serious treatment-related adverse events, such as cytokine release syndrome (CRS) caused by chimeric antigen receptor T-cell (CAR-T) immunotherapy, and infection related to chemotherapy-induced neutropenia (Oved, Barrett and Teachey, 2019), commonly experienced by patients with cancer. The standard approach for detecting fever in hospitalized patients is intermittent temperature monitoring, typically every 4-8 hours, which could lead to inherent delays in diagnosis of febrile adverse events.
The availability of non-invasive, wireless, wearable sensors to “continuously” monitor body temperature raises the possibility of earlier detection and diagnosis of fever and its associated adverse events. Some studies have begun to investigate this possibility (Jordan et al., 2017; Sampson et al., 2019; Liu et al., 2020; Smarr et al., 2020), yet a systematic investigation in patients with cancer using FDA-approved devices and comparing to standard-of-care (SOC) monitoring is needed. Furthermore, the large volume of high-frequency temperature data that can be obtained from a wearable sensor opens the possibility of carrying out computational analysis to identify signals for anticipating fever before it occurs.
To investigate these possibilities, we conducted a prospective study in 68 patients receiving hematopoietic stem cell transplant (HCT) or CAR-T therapy in the inpatient setting (Figure S1A). After providing IRB-approved informed consent, patients were asked to wear a self-administered, non-invasive, and FDA-approved wearable sensor (TempTraq®, BlueSpark Technologies), applied as an axillary skin patch according to manufacturer’s instructions, to capture high-frequency temperature measurements (HFTM) every 2-minutes; data were wirelessly transmitted in real-time to a cloud-based server (Sampson et al., 2019). HFTM data from 62 patients (n=39 HCT, n=23 CAR-T) were available for analysis to compare timing of fever detection with SOC temperature measurements, which are typically taken every 4-8 hours by nursing staff as part of routine clinical care. During the monitoring period, we collected a total of 585 days of HFTM data across all 62 participants with a median data capture of 8.5 days/patient. When patients were wearing an HFTM patch, we collected ~90-fold more data points with HFTM (n = 421,367) than SOC (n = 4,816).
We first evaluated the timing of SOC- and HFTM-detected fevers and found that HFTM detected 89% (24/27) of these fevers a median 5.5 hours (h) earlier than SOC (Figure S1B). For three fevers detected earlier by SOC, the median time was 1.9 h earlier. Overall (n=27 fever events analyzed), HFTM showed a median 4.9 h earlier detection time than SOC.
As expected, most fevers detected in patients having received CAR-T therapy were related to CRS; whereas in HCT patients, infection-related fevers were more common (Figure S1C). Interestingly, we found that fevers caused by infections were detected by HFTM significantly earlier (median = 18.5 h) than by CRS (median = 4.4 h; p = 0.012, two-tailed t-test); examples are shown in Figures S1D–F.
We further investigated whether we could computationally identify potentially predictive signals that precede fever (i.e., before an HFTM-detected temperature rise to 38°C). We hypothesized that subtle perturbations in temperature dynamics may be discernible prior to fever and may manifest in circadian modeling analysis as deviations from baseline circadian pattern. To test our hypothesis, we fit a circadian profile based on 24-h of preceding data for every data point leading up to independent HFTM fever events that had sufficient data for circadian modeling (Figure S1G). This approach allowed for real-time updating of the circadian profile (magenta curves) with an average temperature measurement (green lines), and amplitude and phase (the time of the minimum of the circadian profile) estimates.
From the circadian fit, we computed circadian residuals (Figure S1H, blue dots), defined as the difference between the circadian fit and the data point recorded. To incorporate changes in average temperature that may have occurred across larger time scales, we also computed a standardized residual of the average temperature (orange dots). Finally, for our subsequent analysis, we computed a total residual, defined as the sum of the circadian and average temperature residuals (red dots). We separated the total residuals into two groups based on periods of time: pre-fever day residuals (i.e., residuals from 24-h immediately prior to fever, red shaded region) and a patient-specific baseline (i.e., residuals calculated prior to the pre-fever day, blue shaded region).
We then projected the patient-specific baseline residuals onto the pre-fever day, matching the phase estimates for each data point to the phase of the pre-fever day to account for specific measurement bias at certain times of the circadian cycle. We computed the residual difference (Figure S1I) between the pre-fever day and the patient-specific baseline period and predicted 95% confidence intervals after sampling with replacement 1000 times. In general, the residuals in the pre-fever day began to deviate positively from those of the patient-specific baseline ~12-h prior to fever. Moreover, this deviation was sustained and statistically significant from ~3.5-h pre-fever up to fever onset, with additional discrete spikes of statistical significance ~5-6 and 8-h before fever onset, collectively demonstrating signals in HFTM data that presaged the occurrence of fever.
Taken together, our results demonstrated the potential of an HFTM approach using wearable sensors to provide considerable lead time (4.9 h earlier than SOC) for early detection of febrile adverse events, with the potential to add 3.5 h or more lead time by circadian modeling. This duration of lead time is clinically significant for patients with cancer who are commonly immunocompromised and at risk for infection, because time-to-first antibiotics can play an important role in subsequent mortality in neutropenic fevers and sepsis (Mullen et al., 2000; Wingard, Hsu and Hiemenz, 2011), especially in the setting of septic shock where mortality increases with every hour of delay in antibiotic administration (Kumar et al., 2006). Our inpatient study provides a foundation for investigation in the outpatient setting. In particular, the impact of early detection of infection may be seen at a larger scale amongst outpatients with cancer receiving chemotherapy who are at-risk for febrile neutropenia. Our data was collected using an FDA-approved wearable sensor suitable for home use, making it readily implementable.
Furthermore, the lead time provided by HFTM is also clinically relevant to monitoring patients treated with CAR-T. It can enable earlier intervention in CRS through escalation of care, including earlier administration of anti-cytokine therapies [e.g., tocilizumab (an IL-6R antagonist)] (Oved, Barrett and Teachey, 2019), which may reduce life-threatening morbidity and mortality associated with CRS. This lead time could also facilitate the transition of extremely expensive inpatient CAR-T care to the outpatient setting, since the lead time could provide sufficient time to return to hospital in case of impending CRS.
We hope that our results will spur more in-depth investigation of the HFTM approach in patients with cancer, ultimately through prospective clinical trials. Elements to be investigated in future work include: optimizing patient education and support to minimize missing data; developing computational algorithms to probabilistically identify the cause and clinical actionability of a fever from temperature dynamics and additional clinical data; and increasingly individualizing prediction and detection of febrile adverse events using a patient’s own baseline temperature pattern as a reference, rather than a one-size-fits-all 38°C threshold approach.
Supplementary Material
Figure S1. High frequency temperature monitoring using a wearable device and computational analysis for early detection and anticipation of febrile adverse events in patients with cancer.
This shifts the phase of the time point ti to match the phase at fever onset. Finally, we generated the patient-specific baseline data set by taking the mean of ( in two-minute bins from 0 to 24 hours to match the time resolution of the fever data sets.
We ran the residual comparison on 17 fever and patient-specific baseline data sets. First, we fit a nonlinear profile to both the 24-h patient-specific baseline residuals and the 24-h pre-fever day residuals for each participant. We used the bs function in R to fit a b-spline with 22 internal knots and 2 boundary knots. From the nonlinear fits, we estimated the total residual at 15-minute intervals. Then, we took the difference between the fever residual estimate on the pre-fever day and the corresponding residual estimate from the 24-h patient-specific baseline estimate. Finally, we took the mean difference of the 17 fever events every fifteen minutes. We repeated this process 1000 times after sampling with replacement from the 17 fever events to generate a distribution of mean differences between the patient-specific baseline period and the pre-fever day every fifteen minutes from 24 to 0 h before fever onset. From the null distribution, we used the quantile function in R to compute a 95% quantile of the data with a probability range from 0.025 to 97.5. Then, any time point with a mean difference outside of this quantile was considered statistically significant (see I, shaded regions) with p-value <0.05.
Acknowledgements
CF and JT acknowledge support from a NIH Training Grant (T32 HL007622). We thank Erin Sandford, Annika Goicochea, Brittnie Cannon, Tracey Churay, Kristen Gilley, and Kirk Herman for assistance in research coordination and Greg Yanik for comments on the manuscript. This work was supported by a Taubman Medical Institute Grand Challenge grant and by a Taubman Institute Innovation Project grant. SWC is currently supported by NHLBI R01HL146354 and NCI R01CA249211 grants.
Declaration of Interests
DF is the CSO of Arcascope, a company that makes circadian rhythms software. DF and the University of Michigan are part owners of Arcascope. SWC and MT receive research funding from an Arcascope NIH SBIR grant for a different research project. However, Arcascope did not sponsor the research presented here. JT, CM, DF, CF, SWC and MT are inventors of intellectual property related to this work, for which the University of Michigan is pursuing intellectual property protections.
REFERENCES
- Freifeld AG et al. (2011) ‘Clinical practice guideline for the use of antimicrobial agents in neutropenic patients with cancer: 2010 update by the infectious diseases society of america’, Clinical infectious diseases: an official publication of the Infectious Diseases Society of America, 52(4), pp. e56–93. [DOI] [PubMed] [Google Scholar]
- Jordan J et al. (2017) ‘Statistical models for fever forecasting based on advanced body temperature monitoring’, Journal of critical care, 37, pp. 136–140. [DOI] [PubMed] [Google Scholar]
- Kumar A et al. (2006) ‘Duration of hypotension before initiation of effective antimicrobial therapy is the critical determinant of survival in human septic shock’, Critical care medicine, 34(6), pp. 1589–1596. [DOI] [PubMed] [Google Scholar]
- Lee DW et al. (2019) ‘ASTCT Consensus Grading for Cytokine Release Syndrome and Neurologic Toxicity Associated with Immune Effector Cells’, Biology of blood and marrow transplantation: journal of the American Society for Blood and Marrow Transplantation, 25(4), pp. 625–638. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu Y et al. (2020) ‘Evaluation of a wearable wireless device with artificial intelligence, iThermonitor WT705, for continuous temperature monitoring for patients in surgical wards: a prospective comparative study’, BMJ open, 10(11), p. e039474. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mullen CA et al. (2000) ‘Fever and neutropenia in pediatric hematopoietic stem cell transplant patients’, Bone marrow transplantation, 25(1), pp. 59–65. [DOI] [PubMed] [Google Scholar]
- Oved JH, Barrett DM and Teachey DT (2019) ‘Cellular therapy: Immune-related complications’, Immunological reviews, 290(1), pp. 114–126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sampson M et al. (2019) ‘Feasibility of continuous temperature monitoring in pediatric immunocompromised patients: A pilot study’, Pediatric blood & cancer, 66(6), p. e27723. [DOI] [PubMed] [Google Scholar]
- Smarr BL et al. (2020) ‘Feasibility of continuous fever monitoring using wearable devices’, Scientific reports, 10(1), p. 21640. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wingard JR, Hsu J and Hiemenz JW (2011) ‘Hematopoietic stem cell transplantation: an overview of infection risks and epidemiology’, Hematology/oncology clinics of North America, 25(1), pp. 101–116. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Figure S1. High frequency temperature monitoring using a wearable device and computational analysis for early detection and anticipation of febrile adverse events in patients with cancer.
This shifts the phase of the time point ti to match the phase at fever onset. Finally, we generated the patient-specific baseline data set by taking the mean of ( in two-minute bins from 0 to 24 hours to match the time resolution of the fever data sets.
We ran the residual comparison on 17 fever and patient-specific baseline data sets. First, we fit a nonlinear profile to both the 24-h patient-specific baseline residuals and the 24-h pre-fever day residuals for each participant. We used the bs function in R to fit a b-spline with 22 internal knots and 2 boundary knots. From the nonlinear fits, we estimated the total residual at 15-minute intervals. Then, we took the difference between the fever residual estimate on the pre-fever day and the corresponding residual estimate from the 24-h patient-specific baseline estimate. Finally, we took the mean difference of the 17 fever events every fifteen minutes. We repeated this process 1000 times after sampling with replacement from the 17 fever events to generate a distribution of mean differences between the patient-specific baseline period and the pre-fever day every fifteen minutes from 24 to 0 h before fever onset. From the null distribution, we used the quantile function in R to compute a 95% quantile of the data with a probability range from 0.025 to 97.5. Then, any time point with a mean difference outside of this quantile was considered statistically significant (see I, shaded regions) with p-value <0.05.