Table 1.
Study characteristics
Study (year) | Clinical setting and data source | Sample sizea | Cohort criteria infection definition | Task and objective |
---|---|---|---|---|
Horng et al.47 (2017) |
|
230 936 patient visits
|
Angus Sepsis ICD-9-CM abstraction criteria79 | Identify patients with suspected infection to demonstrate benefits of using clinical text with structured data for detecting ED patients with suspected infection. |
Apostolova and Velez48 (2017) |
|
634 369 nursing notes
|
Notes describing patient taking or being prescribed antibiotics for treating infection | Identify notes with suspected or presence of infection to develop a system for detecting infection signs and symptoms in free-text nursing notes. |
Culliton et al.49 (2017) |
|
203 000 adult inpatient admission encounters
Test set: 2016 data:
|
Modified Baystate clinical definition of severe sepsis (8 structured variables) and severe sepsis ICD codes | Predict severe sepsis 4, 8, and 24 h before the earliest time structured variables meet the severe sepsis definition to compare accuracy of predicting patients that will meet the clinical definition of sepsis when using unstructured data only, structured data only, or both types. |
Delahanty et al.51 (2019) |
|
2 759 529 patient encounters
Train: 1 839 503 E; 66.7%
Test: 920 026 E; 33.3%
|
Rhee’s modified Sepsis-3 definition80 | Predict sepsis risk in patients 1, 3, 6, 12, and 24 h after the first vital sign or laboratory result is recorded in the EHR to develop a new sepsis screening tool comparable to benchmark screening tools. |
Liu et al.50 (2019) |
|
|
Sepsis-3 definition1 | Predict septic shock in sepsis patients before the earliest time septic shock criteria are met to demonstrate an approach using NLP features for septic shock prediction. |
Amrollahi et al.53 (2020) |
|
40 175 adult patients
|
Sepsis-3 definition1 | Predict sepsis onset hours in advance using a deep learning approach to show a pre-trained neural language representation model can improve early sepsis detection. |
Hammoud et al.54 (2020) |
|
17 763 patients
5-fold cross validation |
Sepsis definition based on what Henry et al78 used | Predict early septic shock in ICU patients using a model that can be optimized based on user preference or performance metrics. |
Goh et al.52 (2021) |
|
Train and validation: 3722 P (80 162 N)
Test: 1595 P (34 440 N)
|
ICU admission with an ICD-10 code for sepsis, severe sepsis, or sepsis shock | Identify if a patient has sepsis at consultation time or predict sepsis 4, 6, 12, 24, and 48 h after consultation to develop an algorithm that uses structured and unstructured data to diagnose and predict sepsis. |
Qin et al.55 (2021) |
|
Train: 33 434 P
Validation: 8358 P
Test: 7376 P
|
PhysioNet Challenge restrictive Sepsis-3 definition81 | Predict if a patient will develop sepsis to explore how numerical and textual features can be used to build a predictive model for early sepsis prediction. |
ED: emergency department; ICU: intensive care unit; ICD: International Classification of Diseases; ICD-9 CM: ICD Clinical Modification, 9th revision; ICD-10: ICD 10th revision; MIMIC-II: Multiparameter Intelligent Monitoring in Intensive Care II database; MIMIC-III: Medical Information Mart for Intensive Care dataset.
Sample size unit abbreviations: P: patients; N: notes; E: encounters.