Abstract
Fluid strategy is the key to the successful management of patients with sepsis. However, previous studies failed to consider individualized treatment strategy, and clinical trials typically included patients with sepsis as a homogeneous study population. We aimed to develop sequential decision rules for managing fluid intake in patients with sepsis by using the dynamic treatment regimen (DTR) model. A retrospective analysis of the eICU Collaborative Research Database comprising highly granular data collected from 335 units at 208 hospitals was performed. The DTR model used a backward induction algorithm to estimate the sequence of optimal rules. 22,868 patients who had sepsis according to the Acute Physiology and Chronic Health Evaluation (APACHE) IV diagnosis group were included. Optimal fluid management (liberal [> 40 ml/kg/d] versus restricted [< 40 ml/kg/d]) strategy were developed on the Day 1, 3 and 5 after ICU admission according to current states and treatment history. Important determinants of optimal fluid strategy included mean blood pressure, heart rate, previous urine output, previous fluid strategy, ICU type and mechanical ventilation. Different functional forms such as quadratic function and interaction terms were used at different stages. The proportion of subjects being inappropriately treated with liberal fluid strategy (i.e. those actually received liberal fluid strategy, but could have longer survival time if they received restricted fluid strategy) increased from day 1 to 5 (19.3% to 29.5%). The survival time could be significantly prolonged had all patients been treated with optimal fluid strategy (5.7 [2.0, 5.9] vs. 4.1 [2.0, 5.0] days; p < 0.001). With a large volume of sepsis data, we successfully computed out a sequence of dynamic fluid management strategy for sepsis patients over the first 5 days after ICU admission. The decision rules generated by the DTR model predicted a longer survival time compared to the true observed strategy, which sheds light for improving patient outcome with the aim from computer-assisted algorithm.
Subject terms: Bacterial infection, Prognosis
Introduction
Management of critically ill patients with sepsis is imperative as sepsis can attack all human organs leading to a fatal consequence in a short period of time. Among multiple aspects for saving the life of patients with sepsis, we believe that fluid management strategy remains essential to the successful treatment because the cardiac and renal functions, which play important roles in the physiological regulation of fluid balance, are usually impaired in sepsis. Numerous studies have been conducted to explore an optimal fluid strategy. A trio of multinational trials named Protocolized Care for Early Septic Shock (ProCESS), Australasian Resuscitation in Sepsis Evaluation (ARISE), and Protocolized Management in Sepsis (ProMISe) aimed to investigate whether early goal directed therapy (EGDT) could improve the mortality outcome1–3. The results showed that EGDT did not significantly reduce mortality rate as compared with the usual care group. Thus, the optimal fluid strategy remains largely unknown. One important limitation of these trials is that they included all sepsis patients without considering between-subject heterogeneity. As a matter of fact, there has been large body of evidence showing that sepsis is highly heterogenous and different subphenotypes can have distinct responses to fluid administration4. Latent profile analysis and hierarchical clustering are two major techniques to identify subphenotypes of sepsis5, which however are limited by two factors. First, the capacity to establish a causal relationship between treatment and subphenotypes is insufficient. Second, most studies used cross-sectional data when modeling the subphenotypes. We should aware that fluid management in a sepsis patient is a dynamic process over entire treatment course; we should not solely focus on the initial fluid strategy but ignore the importance at later stages. There was evidence that de-resuscitation (negative fluid balance) at later phase of sepsis is beneficial6. Therefore, we need to adjust our fluid management strategy according to the patient’s changing state and treatment history.
Reinforcement learning (RL) is a well-developed algorithm of machine learning focusing on how a computing system gives a response based on environmental inputs so the system can maximize cumulative reward7. RL has been used to determine the optimal fluid treatment strategy in sepsis8–10. However, these results are difficult to interpret due to the black-box algorithms, which significantly prohibited their use in clinical practice. Dynamic treatment regimen (DTR) method borrows the idea of RL, but simplifies the functional forms of the multi-dimensional feature space, which can help clinicians to better understand the decision rules 11. This study aimed to optimize fluid treatment strategy (liberal versus restrictive) by using DTR during the course of sepsis. Instead of using cross-sectional data on one single phase, we applied DTR model on three different days (Day 1, Day 3, and Day 5) over the first week of treatment with a purpose of examine dynamic nature of the fluid management on sepsis treatment. We hypothesized that the optimal fluid treatment strategy recommended by DTR would be better than the treatment actually received in terms of survival outcome.
Methods
Database and study population
The present study utilized the eICU Collaborative Research Database, which is a multi-center intensive care unit (ICU) database with high granularity data for over 200,000 admissions to ICUs monitored by eICU Programs across the United States12. Patients with sepsis were included for our analysis. Sepsis was defined according to the sepsis-2.0 definition13. Sepsis-2.0 was defined when a patient had suspected or documented infection plus 2 of the SIRS criteria including temperature 36℃ or 38℃,heart rate 90 bpm, respiratory rate 20 breath/min or PaCO2 < 32 mmHg, and white blood cell count > 12,000 or < 4000 cells/mm3 or > 10% band. Sepsis was further categorized by infection sites including cutaneous, gastrointestinal, pulmonary, urinary tract, other location and unknown location14.
Some definitions
The primary outcome was patient status (alive versus expired) at hospital discharge, which was considered as time to event survival data. Subjects who were discharged alive were considered as censored. We further defined the survival time of the patient from ICU admission to expiration. Patients who discharged alive were considered as censored. The state (feature space) of patients at each stage (Day 1, 3 and 5 after ICU admission) was constructed by variables including heart rate (HR), mean blood pressure (mBP), respiratory rate (RR), Glasgow coma scale (GCS), body temperature, creatinine, lactate, hemoglobin, bilirubin, use of vasopressor, platelet count, PaO2/FiO2 (P/F) ratio, daily urine output and the use of mechanical ventilation (MV).
Total fluid intake was calculated as the sum of fluid intake for a 24-h interval. We assumed that the fluid strategy was determined at the beginning of each interval. Fluid intake was also normalized by the body weight. We defined liberal and restricted fluid administration as ≥ 40 ml/kg/day and < 40 ml/kg/day, respectively. This cutoff point was chosen according to the distribution of the fluid intake in the study population so that both categories would have balanced number of observations.
Statistical analysis
Descriptive statistics were analyzed conventionally using the CBCgrps package in R15.
The DTR model estimated a sequence of treatment strategy that maximized the survival time across stages of clinical intervention16. We defined three stages on day 1, 3 and 5 after ICU entry and recorded the survival times within each stage. For the first stage, the survival time corresponded to the time (in days) from the beginning of day 1 to day 3 or day 1 to death if the patient died before day 3. The survival times and were defined similarly.
The DTR model used a backward induction algorithm to estimate the sequence of optimal rules. In the first step, the optimal stage 3 decision rule (day 5) was estimated by modeling the counterfactual survival time in stage 3 () as a function of the treatment received on day 5 (restricted [] or liberal [] fluid administration) and of the feature variables measured on day 5 or before ( and ):
The term is the stage 3 blip function. It represents the effect of receiving liberal fluid administration instead of restricted fluid administration and its interaction with feature variables. The term is a vector of coefficients for feature variables and represents information (previous treatment, covariates and survival times) available prior to making the stage 3 treatment. We included age, mBP, unit type, HR, RR, use of vasopressor, MV, urine output, temperature, P/F ratio, and treatment strategies on days 3 and 4 as potential features to determine the optimal treatment. Variables were excluded from the final model if the statistical significance level was greater than 0.05. We retained some important variables such as age and mBP according to expertise. The optimal stage 3 treatment was identified for each subject who entered stage 3 by . If optimal rule recommends liberal fluid administration on day 5 if the condition is satisfied, and restricted fluid administration otherwise.
In the second step, we estimated the optimal stage 2 treatment strategy by modeling the counterfactual survival time . It represents the survival time from day 3 (stage 2) onwards had the patient received his optimal stage 3 treatment. It is equal to the observed survival time if the patient received his optimal stage 3 treatment and is larger that otherwise. A similar strategy as in the third stage was adopted to find the optimal stage 2 treatment rule .
In the third step, we proceeded to the optimization of the first stage treatment by modeling the counterfactual survival time (overall survival time had both the stage 2 and 3 treatments been optimal). The optimal stage 1 treatment rule was also of the form . The DTR model was built with the DTRreg package (v1.5) in R 17.
Ethics approval and consent to participate
Data were available on request. This study was an analysis of the third-party anonymized databases with pre-existing IRB approval.
Results
Baseline characteristics of included subjects
A total of 22,868 patients with sepsis were identified from the database (Fig. 1), comprising 19,040 survivors and 3,838 non-survivors at hospital discharge (Table 1). There was no significant difference between the two groups in sex, ethnicity, admission height and presence of AIDS. Survivors were significantly younger (median [IQR]: 65 [53, 77] vs. 72 [60,83] years; p < 0.001), had larger weight (77.7 [64.1, 96.2] vs. 74.3 [61, 90.9] kg; p < 0.001), higher mBP (58 [50, 68] vs. 51 [40, 60] mmHg; p < 0.001), and more likely to have renal/UTI (24% vs. 15%; p < 0.001) than the non-survivors (Table 2).
Table 1.
Variables | Total (n = 22,868) | Alive(n = 19,040) | Expired(n = 3828) | p |
---|---|---|---|---|
Gender, Male, No. (%) | 11,600(51) | 9,653(51) | 1,947(51) | 0.898 |
Age | 66(54,78) | 65(53,77) | 72(60,83) | < 0.001 |
Ethnicity, No. (%) | 0.91 | |||
Missing | 200(1) | 165(1) | 35(1) | |
African American | 2282(10) | 1887(10) | 395(10) | |
Asian | 438(2) | 365(2) | 73(2) | |
Caucasian | 17,714(77) | 14,760(78) | 2954(77) | |
Hispanic | 920(4) | 757(4) | 163(4) | |
Native American | 201(1) | 170(1) | 31(1) | |
Other/Unknown | 1113(5) | 936(5) | 177(5) | |
Admission height | 167.75(160,177.8) | 168(160,177.8) | 167.6(160,177.8) | 0.194 |
Admission weight | 77.1(63.5,95.3) | 77.7(64.1,96.2) | 74.3(61,90.9) | < 0.001 |
Admit source, No. (%) | < 0.001 | |||
Operating Room | 46(0) | 42(0) | 4(0) | |
Recovery Room | 31(0) | 28(0) | 3(0) | |
Chest Pain Center | 3(0) | 3(0) | 0(0) | |
Floor | 4564(21) | 3530(20) | 1034(28) | |
Other ICU | 127(1) | 98(1) | 29(1) | |
Other Hospital | 481(2) | 373(2) | 108(3) | |
Direct Admit | 1291(6) | 1058(6) | 233(6) | |
Emergency Department | 15,158(70) | 12,910(72) | 2248(61) | |
Unit type, No. (%) | < 0.001 | |||
CCU-CTICU | 1339(6) | 1119(6) | 220(6) | |
CSICU | 379(2) | 312(2) | 67(2) | |
CTICU | 162(1) | 126(1) | 36(1) | |
Cardiac ICU | 1347(6) | 1094(6) | 253(7) | |
MICU | 3068(13) | 2475(13) | 593(15) | |
Med-Surg ICU | 15,270(67) | 12,836(67) | 2434(64) | |
Neuro ICU | 433(2) | 346(2) | 87(2) | |
SICU | 870(4) | 732(4) | 138(4) |
Characteristics expressed as median [IQR] unless specified otherwise.
Abbreviations: IQR: interquartile range; CCU-CTICU: coronary care unit- cardiothoracic ICU; CSICU: cardiac surgery ICU; CTICU: cardiothoracic ICU; SICU: surgical ICU; MICU: medical ICU; GI: gastrointestinal; BUN: blood urea nitrogen.
Table 2.
Variables | Total (n = 22,868) | Alive(n = 19,040) | Expired(n = 3,828) | p |
---|---|---|---|---|
Infection sites, No. (%) | < 0.001 | |||
GI | 2843(12) | 2264(12) | 579(15) | |
Cutaneous/soft tissue | 1903(8) | 1684(9) | 219(6) | |
Gynecologic | 75(0) | 65(0) | 10(0) | |
Other | 1500(7) | 1192(6) | 308(8) | |
Pulmonary | 8751(38) | 7110(37) | 1641(43) | |
Renal/UTI (including bladder) | 5212(23) | 4621(24) | 591(15) | |
unknown | 2584(11) | 2104(11) | 480(13) | |
MV, No. (%) | 5448(24) | 3807(20) | 1641(43) | < 0.001 |
GCS | 13(9.81,15) | 13.72(10.12,15) | 10(7,13.8) | < 0.001 |
Bilirubin (mg/dl), median (IQR) | 1.23(0.59,3.2) | 1.2(0.5,3.08) | 1.6(0.7,3.8) | < 0.001 |
Creatinine (mg/dl), median (IQR) | 1.37(0.86,2.43) | 1.28(0.82,2.25) | 1.97(1.2,3.12) | < 0.001 |
Platelet (), median (IQR) | 178(119,248) | 181(126,250) | 157(85,234) | < 0.001 |
PaO2 (mmHg), median (IQR) | 85(63.26,115.57) | 86.33(64,116.28) | 79(60.36,110.94) | < 0.001 |
MBP (mmHg), median (IQR) | 57(49,67) | 58(50,68) | 51(40,60) | < 0.001 |
SOFA, median (IQR) | 8(6,10) | 7(6,9) | 10(8,12) | < 0.001 |
Urine Output (ml/d), median (IQR) | 50(0,1000) | 100(0,1100) | 0(0,575) | < 0.001 |
Temperature (℃), median (IQR) | 37.3(36.9,38.11) | 37.33(36.9,38.11) | 37.3(36.8,38.2) | < 0.001 |
RR (/min), median (IQR) | 29(24,35) | 28(24,34) | 32(26,38) | < 0.001 |
HR (/min), median (IQR) | 109(95,124.99) | 108(94,123) | 117(100.36,134) | < 0.001 |
pH, median (IQR) | 7.35(7.27,7.41) | 7.36(7.29,7.42) | 7.3(7.2,7.38) | < 0.001 |
Hematocrit (%), median (IQR) | 30.6(26.4,35) | 30.8(26.7,35.1) | 29.3(25.1,34.25) | < 0.001 |
Albumin (mg/dl), median (IQR) | 2.5(2.1,2.9) | 2.5(2.1,2.9) | 2.3(1.9,2.71) | < 0.001 |
BUN (mg/dl), median (IQR) | 28(17,46) | 26(16,43) | 40(26,59) | < 0.001 |
Glucose (mg/dl), median (IQR) | 132(93,206) | 132(94,204) | 134(86,215) | 0.011 |
Comorbidities, No. (%) | ||||
AIDS | 66(0) | 50(0) | 16(0) | 0.152 |
Hepatic failure | 463(2) | 320(2) | 143(4) | < 0.001 |
lymphoma | 211(1) | 160(1) | 51(1) | 0.006 |
Metastatic cancer | 748(3) | 530(3) | 218(6) | < 0.001 |
leukemia | 338(2) | 240(1) | 98(3) | < 0.001 |
immunosuppression | 1232(6) | 947(5) | 285(8) | < 0.001 |
Cirrhosis | 611(3) | 416(2) | 195(5) | < 0.001 |
Characteristics expressed as median [IQR] unless specified otherwise.
Abbreviations: AIDS: acquired immune deficiency syndrome; IQR: interquartile range; SOFA: sequential organ failure assessment; MBP: mean blood pressure; RR: respiratory rate; GCS: Glasgow coma scale; MV: mechanical ventilation.
Fluid intake
Survivors consistently received less fluid intake from day 1 to 10 than non-survivors (day 1: 38.07 [17.71, 68.96] vs. 43.93 [17.17, 83.97] ml/kg/day; p < 0.001). The significance level of 0.05 was not reached on day 4, but the survivors still had less fluid intake than non-survivors (27.9 [12.85, 59.86] vs. 30.44 [13.88, 66.41] ml/kg/day; p = 0.074). The amount of fluid intake decreased gradually from day 1 to 10 from 38.88 (IQR: 17.63–71.52) to 27.88 (IQR: 14.12–56.71) ml/kg/day (Table 3). However, it was not straightforward to improve survival outcome by simply reducing fluid intake because the association was not causality. Thus, we needed to employ DTR to prescribe optimal amount of fluid for each individual, based on their current status and treatment history.
Table 3.
Fluid intake (ml/kg/day) | Total (n = 22,868) | Alive(n = 19,040) | Expired(n = 3828) | p |
---|---|---|---|---|
Day 1 | 38.88(17.63,71.52) | 38.07(17.71,68.96) | 43.93(17.17,83.97) | < 0.001 |
Day 2 | 32.67(14.87,62.71) | 31.93(14.69,60.65) | 36.77(16.28,73.04) | < 0.001 |
Day 3 | 30(13.09,61.74) | 29.49(12.99,60.72) | 33(14.01,67.3) | 0.025 |
Day 4 | 28.4(13.05,60.85) | 27.9(12.85,59.86) | 30.44(13.88,66.41) | 0.074 |
Day 5 | 27.81(13.06,60.81) | 27.09(12.77,58.75) | 31.67(14.31,70.96) | 0.005 |
Day 6 | 27.86(12.81,57.58) | 26.35(12.38,53.09) | 33.7(14.22,77.18) | < 0.001 |
Day 7 | 26.74(13.03,54.93) | 26.35(12.9,53.09) | 29.49(13.58,61.18) | 0.043 |
Day 8 | 27.94(13.77,58.23) | 26.32(13.39,54.84) | 33.99(15.3,74.1) | 0.001 |
Day 9 | 27.75(13.48,57.54) | 26.86(13.33,53.74) | 32.61(13.96,67.25) | 0.021 |
Day 10 | 27.88(14.12,56.71) | 27.06(13.47,51.67) | 30.94(15.08,69.36) | 0.022 |
Fluid intake expressed as median [IQR] unless specified otherwise.
The DTR and its interpretation
The DTR model estimated a sequence of treatment rules to recommend liberal or restricted fluid intake across stages in order to obtain a better survival outcome. On day 1 (stage 1), subjects should receive liberal fluid administration if they satisfied the following condition:
Otherwise, they should receive restricted fluid administration in order to achieve a better survival outcome. Subscripts of variables in the equation denote the ICU days. Positive coefficient of a variable means that the presence or increase in the variable favors liberal fluid administration. The rule indicated that liberal fluid administration was more likely to be beneficial to patients from cardiac surgery ICU (CSICU) (coefficient: 0.413; 95% CI 0.370–0.455; p < 0.001), NeuroICU (1.485; 95% CI 1.438–1.532; p < 0.001) and surgical ICU (SICU) (0.44; 95% CI 0.357–0.522; p < 0.001); but was harmful to patients from medical ICU (MICU) (− 0.198 [− 0.307, − 0.088]; p = 0.001), Med-Surg ICU (− 0.074 [− 0.136, − 0.013]; p = 0.048). Higher HR mandated more fluid administration (0.085 [0.056, 0.115]; p < 0.001). Patients on MV also required more fluid intake (0.357 [0.066, 0.648]; p = 0.045). The impact of mBP was a parabola opens down with the line of symmetry at mBP = 62.1 mmHg. Clinical interpretation was that more fluid intake was warranted for patients with mBP < 62.1 mmHg; while more restricted fluid administration was preferred with increasing mBP if mBP > 62.1 mmHg.
The condition for liberal fluid administration on day 3 (stage 2) was:
This rule indicated that more urine output (coefficient: 0.175; 95% CI: 0.128–0.223; p < 0.001 for day 2) in previous days mandated liberal fluid administration on day 3. The mBP on day 3 did not follow a parabolic function (the quadratic term is not statistically significant). In this case, higher mBP mandated less fluid administration.
The condition for liberal fluid administration on day 5 (stage 3) was:
There was a significant interaction (coefficient: − 0.457; 95% CI − 0.873 to − 0.041; p = 0.039) between day 3 and day 4 treatment strategy for determining the day 5 treatment strategy. The interaction indicated that if liberal fluid administration was given on day 3, restricted fluid administration was more likely to be beneficial on day 5 if liberal fluid balance was also given on day 4 (Table 4).
Table 4.
Coefficient (95% CI) | P value | |
---|---|---|
Blip function on day 1 | ||
Intercept | − 1.248(− 2.5,0.004) | 0.118 |
Age (per 10-year increase) | − 0.067(− 0.163,0.029) | 0.314 |
Unit type (CCU-CTICU as reference) | ||
CSICU | 0.413(0.37,0.455) | < 0.001 |
CTICU | 0.078(− 0.072,0.228) | 0.477 |
Cardiac ICU | 0.215(0.184,0.245) | < 0.001 |
MICU | − 0.198(− 0.307,− 0.088) | 0.001 |
Med-SurgICU | − 0.074(− 0.136,− 0.013) | 0.048 |
Neuro ICU | 1.485(1.438,1.532) | < 0.001 |
SICU | 0.44(0.357,0.522) | < 0.001 |
HR1 (per 20-beat increase) | 0.085(0.056,0.115) | < 0.001 |
MBP1 (per 20-mmHg increase) | 0.707(0.227,1.188) | 0.012 |
− 0.114(− 0.215,− 0.013) | 0.041 | |
MV1 | 0.357(0.066,0.648) | 0.045 |
Blip function on day 3 | ||
Intercept | 0.116(− 0.249,0.481) | 0.657 |
Temperature3 | − 0.004(− 0.006,− 0.002) | < 0.001 |
MBP3 | − 0.163(− 0.281,− 0.045) | 0.021 |
Urine output2 | 0.175(0.128,0.223) | < 0.001 |
Urine output1 | 0.019(− 0.02,0.058) | 0.509 |
MV3 | 0.534(0.396,0.671) | < 0.001 |
Fluid strategy2 | 0.151(0.11,0.193) | < 0.001 |
Fluid strategy1 | 0.241(0.213,0.27) | < 0.001 |
Blip function on day 5 | ||
Intercept | − 0.557(− 0.807,− 0.308) | < 0.001 |
MV5 | 0.656(0.6,0.711) | < 0.001 |
HR5 | 0.002(0.001,0.002) | < 0.001 |
MBP5 | − 0.002(− 0.008,0.004) | 0.64 |
Fluid strategy3 | 0.115(− 0.136,0.366) | 0.535 |
Fluid strategy4 | 0.432(0.198,0.666) | 0.001 |
− 0.457(− 0.873,− 0.041) | 0.039 |
Abbreviations: MBP: mean blood pressure; HR: heart rate; MV: mechanical ventilation; CCU-CTICU: coronary care unit- cardiothoracic ICU; CSICU: cardiac surgery ICU; CTICU: cardiothoracic ICU; SICU: surgical ICU; MICU: medical ICU.
Table 5 compares the difference between optimal and observed treatments for each subject. The proportion of patients who actually received liberal fluid administration but who would have better survival outcome had they received restricted fluid increased from 19.3% on day 1 to 29.5% on day 5. This result indicated that patients were more likely to receive too much fluid at latter phase of sepsis than that at the early phase. If all patients had received the optimal treatment strategy at all stages as recommended by DTR, the survival time could be significantly prolonged (5.7 [2.0, 5.9] vs. 4.1 [2.0, 5.0] days; p < 0.001).
Table 5.
Actually-received treatment | Optimal treatment | |
---|---|---|
Restricted fluid administration | Liberal fluid administration | |
Day 1, No. (%) | ||
Restricted fluid administration | 10,728(47) | 5,248(23) |
Liberal fluid administration | 4,410(19.3) | 2,455(10.7) |
Day 3, No. (%) | ||
Restricted fluid administration | 6,994(57.8) | 2,374(19.6) |
Liberal fluid administration | 1,737(14.4) | 992(8.2) |
Day 5, No. (%) | ||
Restricted fluid administration | 2,549(43.1) | 1,029(17.4) |
Liberal fluid administration | 1,746(29.5) | 595(10.1) |
The cross table shows the difference between the optimal treatment and the treatment that was received by the patient. For example, 10,728 patients received restricted fluid administration on day 1, which was consistent with the optimal treatment strategy. However, there were 4,410 patients who received liberal fluid administration, but they were expected to have better clinical outcome (survive to discharge) had they been treated with restricted fluid administration. In contrast, 5,248 patients received restricted fluid administration but would have had a better outcome had they received liberal fluid administration. Data on days 3 and 5 are interpreted in the same way.
Discussion
This study developed a simple and interpretable algorithm for calculating fluid management strategy in critically ill patient with sepsis. The DTR model was modified from complex ML algorithm and taking patients' current characteristics and their treatment history into the calculation. The fluid management recommendation was report on the Day 1, 3, and 5 after ICU admission. We also showed that following the optimal treatment strategy at each stage significantly improved the survival time. In other words, our hypothesis was supported. Inspection over different stages, we observed discrepancy between calculated and actual fluid administration. Specifically, we found that sepsis patients were more likely to receive inappropriate liberal fluid administration at later stage than that at early stage (Table 4). Here, we will discuss why calculated fluid management strategy by the DTR model would predict a better clinical outcome, and why clinician tended to employ inappropriate (liberal) fluid administration at the later stage of sepsis.
Conventionally, fluid administration was guided by a variety of biomarkers reflecting the circulatory status such as serum lactate, ScvO2, and capillary refill time. The 2016 version of the surviving sepsis bundle recommended maintaining MAP > 65 mmHg and normalization of lactate18. However, most studies investigating the effectiveness of fluid resuscitation targeting these parameters showed neural effect on mortality19–21. For example, the ANDROMEDA-SHOCK trial investigated fluid strategy by targeting lactate clearance versus normalization of capillary refill time, which showed no difference between the two groups22. One important reason for the failure of these trials lied in the heterogeneity of the sepsis population. It has been shown that sepsis population was highly heterogenous and that it could be further categorized into subphenotypes based on routinely measured clinical characteristics4,5,23. It is important to give different fluid strategy to different patients at different stages24; however, it is almost impossible to fulfill this goal using conventional model based on physician's judgement over a few clinical variables. Aids from computing technology are needed for this assignment. RL is a novel technique to help an agent to select appropriate treatment to maximize final outcome based on current states. By feeding in highly granular electronic healthcare data, RL is capable to adopt clinical reasoning from experienced physicians and yield the optimal appropriate fluid management strategy based on current condition and the treatment history for each individual patient 8,9. However, the RL method based on deep learning algorithm is difficult to understand for ICU physician and cannot be easily implemented in clinical practice. Thus, we used a simplified RL algorithm by considering a binary variable space (liberal vs. restricted fluid administration) and modeling the decision rule (blip function) with generalized linear model. The resultant DTR model was clinically interpretable and could easily guide clinical practice, which was an improvement over other less accessible RL algorithms.
The appropriate selection of feature variables for the blip function was essential for applying the DTR model. Although the SSC guideline recommended targeting lactate clearance as the resuscitation guide, lactate was not statistically significant in the blip function, and thus we excluded this variable. CVP was not included in the DTR model in our study because it has been documented less useful for assessing fluid status25. The mBP was an important determinant of fluid strategy and it was shown to be statistically significant in our blip function. However, the mapping from (functional form of) mBP to fluid strategy was not similar in early versus later stage. On the first day, a parabolic function was fit with the turning point at 62.1 mmHg, which was very close to the 65 mmHg as recommended in the SSC guideline. At the later stages (Day 3 and 5), we found that previous urine output and treatment strategy were important determinants of current fluid strategy. This novel finding indicated that the optimal treatment strategy must take previous responses (e.g. urine output in response to fluid administration) into consideration.
Another strength of this study was that we calculated fluid strategy over 5 days after ICU admission. The majority of previous trials focused on the first 6 or 12 h to investigate the effect of fluid resuscitation strategy1,21,22. This may lead to unsatisfactory results of previous trials. We argued that fluid management should be carried out during the entire disease course. Our DTR model correctly captured the clinical variables over the dynamic process of sepsis and provide a sequential decision rules to maximize the survival time. Our result showed that the proportion of subjects being inappropriately treated with liberal fluid strategy increased from Day 1 to 5 (i.e. these patients can have longer survival time had they treated with restrictive fluid administration). More recently, the concept of de-resuscitation (active removal of fluid using diuretics or renal replacement therapy) after hemodynamic stabilization has received more and more attentions26. There was evidence that negative fluid balance achieved with de-resuscitative measures resulted in lower mortality6. These studies also highlighted the importance of careful fluid management in later phase after hemodynamic stabilization.
There are limitations in the current study. For example, the study population of sepsis was based on sepsis-2.0 definition, which may identify different population than that identified by using the most updated Sepsis-3.0 criteria27. However, the study was not a prospective study in which screening criteria could be prospectively collected. In the dataset, there was missing data on required items for the definition of sepsis-3.0. For example, the sepsis-3.0 definition requires an acute increase in the SOFA score, which means that we must have information for the baseline SOFA score to implement the Sepsis-3.0 definition. In fact, the database did not contain such complete information for the implementation of sepsis-3.0 criteria.
Conclusions
In conclusion, the study successfully computed out a sequence of dynamic fluid management strategy for sepsis patients over the first 5 days after ICU admission with a large volume of electronic healthcare data. The decision rules on day 1, 3 and 5 adopted different functions of covariates and treatment histories. The optimal treatment strategy generated by the DTR model could significantly improve the survival outcome as compared with the actual fluid strategy. The decision rules developed in the study require further validation in prospective cohorts.
Abbreviations
- SOFA
Sequential organ failure assessment
- DTR
Dynamic treatment regimen
- RL
Reinforcement learning
- ICU
Intensive care unit
- mBP
Mean blood pressure
- HR
Heart rate
- RR
Respiratory rate
- CCU
Coronary artery unit
- CSRU
Cardiac surgery recovery unit
- MICU
Medical ICU
- SICU
Surgical ICU
- TSICU
Trauma-neuro ICU
- RRT
Renal replacement therapy
- IQR
Interquartile range
Author contributions
Z.Z. conceived the idea and drafted the manuscript; Z.Z. performed data analysis; B.Z. helped interpret the results and table preparation; N.L. helped data management and analysis; Z.Z. is the guarantor of the paper, taking responsibility for the integrity of the work as a whole, from inception to published article.
Funding
Z.Z. received funding from The public welfare research project of Zhejiang province (LGF18H150005) and the National Natural Science Foundation of China (Grant No. 81901929).
Data availability
Data were available on request.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.ARISE Investigators et al. Goal-directed resuscitation for patients with early septic shock. N. Engl. J. Med.371, 1496–1506 (2014). [DOI] [PubMed]
- 2.Investigators ProCESS, et al. A randomized trial of protocol-based care for early septic shock. N. Engl. J. Med. 2014;370:1683–1693. doi: 10.1056/NEJMoa1401602. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Mouncey PR, et al. Trial of early, goal-directed resuscitation for septic shock. N. Engl. J. Med. 2015;372:1301–1311. doi: 10.1056/NEJMoa1500896. [DOI] [PubMed] [Google Scholar]
- 4.Zhang Z, Zhang G, Goyal H, Mo L, Hong Y. Identification of subclasses of sepsis that showed different clinical outcomes and responses to amount of fluid resuscitation: a latent profile analysis. Crit. Care. 2018;22:347. doi: 10.1186/s13054-018-2279-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gårdlund B, et al. Six subphenotypes in septic shock: latent class analysis of the PROWESS Shock study. J. Crit. Care. 2018;47:70–79. doi: 10.1016/j.jcrc.2018.06.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Silversides JA, et al. Deresuscitation of patients with iatrogenic fluid overload is associated with reduced mortality in critical illness. Crit. Care Med. 2018;46:1600–1607. doi: 10.1097/CCM.0000000000003276. [DOI] [PubMed] [Google Scholar]
- 7.Silver D, et al. Mastering the game of Go with deep neural networks and tree search. Nature. 2016;529:484–489. doi: 10.1038/nature16961. [DOI] [PubMed] [Google Scholar]
- 8.Raghu, A. et al. Deep Reinforcement Learning for Sepsis Treatment. arXiv.org cs.AI, (2017).
- 9.Komorowski M, Celi LA, Badawi O, Gordon AC, Faisal AA. The Artificial Intelligence Clinician learns optimal treatment strategies for sepsis in intensive care. Nat. Med. 2018;24:1716–1720. doi: 10.1038/s41591-018-0213-5. [DOI] [PubMed] [Google Scholar]
- 10.Raghu, A., Komorowski, M. & Singh, S. Model-based reinforcement learning for sepsis treatment. arXiv.org cs.LG, (2018).
- 11.Statistical Methods for Dynamic Treatment Regimes - Reinforcement Learning, Causal Inference, and Personalized Medicine | Bibhas Chakraborty. Springer. (2013).
- 12.Pollard TJ, et al. The eICU Collaborative Research Database, a freely available multi-center database for critical care research. Sci. Data. 2018;5:180178. doi: 10.1038/sdata.2018.178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Levy, M. M. et al. 2001 SCCM/ESICM/ACCP/ATS/SIS International Sepsis Definitions Conference. 29, 530–538 (2003). [DOI] [PubMed]
- 14.Zimmerman JE, Kramer AA, McNair DS, Malila FM. Acute Physiology and Chronic Health Evaluation (APACHE) IV: hospital mortality assessment for today's critically ill patients. Crit. Care Med. 2006;34:1297–1310. doi: 10.1097/01.CCM.0000215112.84523.F0. [DOI] [PubMed] [Google Scholar]
- 15.Zhang Z, Gayle AA, Wang J, Zhang H, Cardinal-Fernández P. Comparing baseline characteristics between groups: an introduction to the CBCgrps package. Ann. Transl. Med. 2017;5:484–484. doi: 10.21037/atm.2017.09.39. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Simoneau G, Moodie EEM, Nijjar JS, Platt RW, the Scottish Early Rheumatoid Arthr. Estimating optimal dynamic treatment regimes with survival outcomes. J. Am. Stat. Assoc. 2019;35:1–9. [Google Scholar]
- 17.Wallace, M. P., Moodie, E. E. M. & Stephens, D. A. Dynamic treatment regimen estimation via regression-based techniques: Introducing R package reg. J. Stat. Softw.80, (2017).
- 18.Rhodes A, et al. Surviving sepsis campaign: international guidelines for management of sepsis and septic shock: 2016. Intensive Care Med. 2017 doi: 10.1007/s00134-017-4683-6. [DOI] [PubMed] [Google Scholar]
- 19.Yu J, et al. Global end-diastolic volume index vs CVP goal-directed fluid resuscitation for COPD patients with septic shock: a randomized controlled trial. Am. J. Emerg. Med. 2017;35:101–105. doi: 10.1016/j.ajem.2016.10.015. [DOI] [PubMed] [Google Scholar]
- 20.Su L, et al. P(v-a)CO2/C(a-v)O2-directed resuscitation does not improve prognosis compared with SvO2 in severe sepsis and septic shock: A prospective multicenter randomized controlled clinical study. J. Crit. Care. 2018;48:314–320. doi: 10.1016/j.jcrc.2018.09.009. [DOI] [PubMed] [Google Scholar]
- 21.Andrews B, et al. Effect of an early resuscitation protocol on in-hospital mortality among adults with sepsis and hypotension: a randomized clinical trial. JAMA. 2017;318:1233–1240. doi: 10.1001/jama.2017.10913. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Ospina-Tascón GA, et al. Effect of a resuscitation strategy targeting peripheral perfusion status vs serum lactate levels on 28-day mortality among patients with septic shock: the ANDROMEDA-SHOCK Randomized Clinical Trial. JAMA. 2019;321:654–664. doi: 10.1001/jama.2019.0071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Seymour CW, et al. Derivation, validation, and potential treatment implications of novel clinical phenotypes for sepsis. JAMA. 2019;321:2003–2017. doi: 10.1001/jama.2019.5791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Jozwiak M, Hamzaoui O, Monnet X, Teboul J-L. Fluid resuscitation during early sepsis: a need for individualization. Minerva Anestesiol. 2018 doi: 10.23736/S0375-9393.18.12422-9. [DOI] [PubMed] [Google Scholar]
- 25.Reddi B, et al. The relationship between the change in central venous pressure and intravenous fluid volume in patients presenting to the emergency department with septic shock. Intensive Care Med. 2018;44:1591–1592. doi: 10.1007/s00134-018-5314-6. [DOI] [PubMed] [Google Scholar]
- 26.Silversides JA, et al. Conservative fluid management or deresuscitation for patients with sepsis or acute respiratory distress syndrome following the resuscitation phase of critical illness: a systematic review and meta-analysis. Intensive Care Med. 2017;43:155–170. doi: 10.1007/s00134-016-4573-3. [DOI] [PubMed] [Google Scholar]
- 27.Singer, M. et al. The Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3). in 315, 801–810 (American Medical Association, 2016). [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Data were available on request.