Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2025 Jan 21.
Published in final edited form as: Pediatr Res. 2022 Sep 20;93(2):350–356. doi: 10.1038/s41390-022-02274-7

Artificial and Human Intelligence for Early Identification of Neonatal Sepsis

Brynne A Sullivan 1,*, Sherry L Kausch 1, Karen Fairchild 1
PMCID: PMC11749885  NIHMSID: NIHMS2045068  PMID: 36127407

Abstract

Artificial intelligence has a role in the early detection of sepsis in neonates. Machine learning can identify patterns that predict high or increasing risk for clinical deterioration from a sepsis-like illness with careful consideration of the data and methods used to develop, validate, and evaluate prediction models. Sepsis AI systems can alert clinicians to a change in a patient’s condition that warrants a bedside evaluation before signs and symptoms of sepsis become obvious. At this point, human intelligence and experience come into play to determine an appropriate course of action: evaluate and treat or wait and watch closely. With intelligently developed, validated, and implemented AI sepsis systems, both clinicians and patients stand to benefit.

Scenario 1:

An experienced neonatologist is rounding on 15 very low birth weight (VLBW) preterm infants in a level IV NICU. Nurses are concerned about five infants; one with more apnea than usual, one with feeding intolerance, one with a low platelet count on routine testing, one requiring more incubator heat to maintain temperature, and one who, according to her mother, is less active than usual. The neonatologist examines each of the five infants and reviews the vital signs, laboratory tests, and clinical data from the past several days. A sepsis workup is performed, antibiotics are started for one infant, and the others undergo close observation. One of these infants has clinical deterioration requiring intubation the next day, at which point antibiotics are started.

Scenario 2:

An artificial intelligence (AI) system for early detection of neonatal sepsis has been implemented in a level IV NICU. An automated algorithm incorporates analysis of heart rate and oxygen saturation patterns to give a score that is the fold increased chance that the infant’s condition will deteriorate due to a sepsis-like illness in the next 24 hours. A clinician is rounding on 15 VLBW infants, and the Sepsis Score is elevated for 7 of them. On further review, some infants have a chronically elevated score. After examining the infants and their available data, a sepsis workup is performed, and antibiotics are started for two infants. A point-of-care biomarker blood test is ordered for two other infants, and when the test result is abnormal for one a sepsis workup is performed, and antibiotics are initiated.

Which scenario is better?

As AI becomes a reality in healthcare, a more appropriate question is “How can computer intelligence combine with human intelligence to improve patient outcomes?” In this narrative review, we will address a number of questions related to AI for sepsis prediction and detection in NICU patients. First, we address aspects of neonatal sepsis that make it a tractable problem for machine learning (ML) predictive models. Next, we cover technical aspects of ML model development and validation, including variable selection using both static and dynamic data. We will then review some existing early warning and ML systems. And finally, we will discuss the benefits of and barriers to implementing sepsis prediction systems in the NICU, with the goal of “right timing” antibiotics to improve patient outcomes.

Q1: How suitable is the problem of neonatal sepsis for AI solutions?

Premature infants in the NICU are an ideal population for AI-based sepsis monitoring. They are a relatively homogeneous group with the same reason for needing ICU-level care: they were born too early and need medical support while they grow and develop. They are by default immune-compromised, with immaturity of both innate and adaptive arms of host defense1, and they require life-sustaining invasive devices that create a high risk for infections that can lead to sepsis. In contrast to adult and pediatric ICU patients, many preterm infants are relatively healthy for most of their weeks spent in the NICU. Therefore it is possible to derive models that detect the transition from “well” to “ill.” Moreover, neonatal sepsis AI models have a greater return on investment compared to those developed for adults due to the potential for improving health for a long life expectancy. The ultimate goal is for clinicians and data scientists to collaborate to develop and implement AI models that facilitate earlier detection and treatment of true sepsis. Ideally, models will be equitable (free of bias) and explainable (understood by clinicians). An important goal when implementing sepsis AI is to limit antibiotic overuse which disrupts the developing microbiome and negatively impacts health outcomes26.

Timing and gestational age impact the use of predictive models for neonatal sepsis. Early-onset sepsis (EOS) is generally transmitted perinatally and diagnosis occurs shortly after birth whereas late-onset sepsis is typically due to hospital-acquired pathogens and usually occurs beyond the first week of age7,8. Risk of both conditions increases with decreasing gestational age. For EOS, a static prediction model can be applied for term and late preterm infants not undergoing continuous monitoring9. In contrast, prediction of LOS for preterm NICU patients can incorporate continuous risk prediction from streaming vital sign data10.

Prediction models will perform best if the targeted outcome is well-defined and validated. For sepsis, this requires careful medical record review rather than simply relying on ICD codes since many studies have shown that diagnostic codes for sepsis are inaccurate 1111,12. A challenge with regard to neonatal sepsis is the lack of a consensus definition13,14. The diagnosis of sepsis in adults and older children is based on standard criteria that include measures of organ dysfunction. Because of the lack of consensus definitions for neonatal sepsis, studies vary widely in how they define sepsis as an outcome, making it difficult to compare and interpret results across studies15,16. Some prediction models train only on culture-positive sepsis, while many include cases of “clinical sepsis” in which an infant has significant signs of illness and clinicians opt to prolong antibiotic treatment despite negative cultures. Experts argue that in the setting of modern laboratory equipment and sufficient inoculation volume, the likelihood of a false negative blood culture is extremely low17,18. Nonetheless, as discussed later in this review, many prediction models have been developed using both clinical and culture-positive sepsis cases, and it is therefore important for clinicians to use judgment to decide on duration of therapy in the face of a high or rising risk score, since misuse of antibiotics can lead to adverse outcomes19, 6.

Another challenge to consider in developing neonatal sepsis AI is variability in illness severity. The clinical spectrum of culture-positive neonatal sepsis is broad, from a minimally symptomatic infant with rapid recovery and no apparent sequelae to an infant with fatal or life-threatening multi-organ dysfunction and permanent neurologic impairment. Consensus definitions not only of sepsis but of its severity would be useful both for clinical care and for research, including predictive model development13,14. The neonatal Sequential Organ Failure Assessment (nSOFA) assigns points for increasing signs of organ dysfunction and was built to discriminate sepsis survivors from non-survivors 24. Multicenter analysis of nSOFA showed that it predicts fatal sepsis with high discrimination accross centers and gestational age ranges25, making it a promising metric for defining sepsis severity.

A final consideration in model development is whether to include significant, physiologically overlapping conditions such as necrotizing enterocolitis (NEC). One approach is to develop models specific for NEC20,21, though developing robust models would require a very large population of preterm infants since NEC is an uncommon event. Another approach is to include cases of both NEC and sepsis as outcomes in model development, and then evaluate the model’s ability to predict each condition separately22. And finally, models may be trained on cases of sepsis and then tested for their ability to predict NEC, as was done for the HRC index discussed later in this review23.

Q2: What are the important AI and Machine Learning model development concepts?

Figures 1 and 2 provide a conceptual overview of aspects of AI relevant to healthcare applications, from algorithm development through clinical implementation and integration. The term “artificial intelligence” refers to many methodologies that use computers to “think” like humans. In some circumstances, properly developed AI may be superior to human intelligence because it is not susceptible to fatigue, distractions, inexperience, and understaffing that may lead to human errors.

Figure 1:

Figure 1:

Schema of artificial intelligence (AI), machine learning (ML), and clinical prediction models.

Figure 2:

Figure 2:

A conceptual diagram illustrating the process and key steps of developing sepsis AI technology.

Machine learning (ML) is a type of AI that involves “teaching” computers to predict or detect specific outcomes by finding patterns in data26. ML methods can be thought of in several general categories. Supervised methods include classification and regression, using algorithms to find structure in labeled data, while unsupervised ML involves clustering and dimension reduction of unlabeled data. Reinforcement ML involves an iterative trial-and-error process whereby penalties are assessed for model errors and rewards given for accurate predictions. Generally, sepsis prediction models use supervised ML with various modeling methods, including regression, tree-based methods, neural networks, and others. In some studies, a variety of modeling methods were shown to have similar predictive performance27,28, while in other studies, a specific method is found to have better performance than others 29.

Many methods exist to assess model performance when developing and reporting a predictive model, the most common being the area under the receiver operating characteristics curve (AUC). The AUC summarizes the model’s ability to discriminate cases from controls over all possible thresholds32. In models designed for clinical use, the AUC value alone is insufficient to evaluate model performance since it does not consider prior probability, does not provide information about the distribution of errors, and weights omission and commission errors equally33,34. Moreover, even a model with good discrimination may provide risk estimates that are unreliable35.

Another way to evaluate model performance is by calculating sensitivity, specificity, and negative and positive predictive values (NPV, PPV)36. Although late-onset sepsis occurs in approximately 15% of very preterm infants, the chance of an infant developing sepsis on any particular day is quite low. Thus, the PPV of non-static models (continuously evaluating the risk of imminent sepsis as vital sign patterns and clinical variables change) will be low in order to have acceptably high sensitivity. For a high morbidity and mortality condition such as sepsis, and one for which an action (antibiotics) can be life-saving, this is a fair tradeoff.

In addition to AUC and measures like sensitivity and PPV, ML model performance should be evaluated using qualitative methods. Figure 3 illustrates a ROC curve as well as two qualitative analyses, a calibration plot and a time-to-event plot. A model can have good discrimination (i.e., a high AUC) yet still have estimated risk predictions that are unreliable if it is poorly calibrated37. The calibration of a model’s risk predictions can be visualized by plotting the observed risk as a function of the predicted risk38. Figure 3b shows an example of a well-calibrated model in red, for which the predicted risk plotted in deciles on the x-axis is very close to the estimated event rate on the y-axis and models that over- or under-predict risk in blue and green, respectively. The reliability of these plots depends on a sufficiently large sample size and the number of events in the data39.

Figure 3:

Figure 3:

Examples of Model Performance Metrics. The left panel displays an area under the receiver operating characteristic curve for a population of patients. The AUC is created by plotting the sensitivity against the specificity across all thresholds of the model output. The middle panel is an example of three model calibration curves for a population of patients. Predicted risk relative to average is on the abscissa and observed risk relative to average is on the ordinate. Each point represents one decile of predicted risk. The line of identity is shown as a dashed line and represents a well-calibrated model. The other two lines represent models that either over- or under-predict risk. The right panel is an example of a “Time to Event” plot for an individual patient. In this example, the model output (risk of sepsis score) is rising steeply 4 hours before a patient had a blood culture sent that diagnosed sepsis at time zero. Theoretically, if clinicians could see and interpret the rising score, antibiotics might have been given 4 hours earlier in this example.

Another important way to evaluate model performance is with time-to-event plots, as shown in Figure 3c. These show the average model output in a cohort relative to the time of the event and illustrate the horizon or lead time for sepsis prediction. This qualitative model assessment provides valuable information about its clinical utility since a score without a rise before clinicians recognize illness is not likely to benefit patients. Evaluating models based on how much time in advance of sepsis the model predicts a rising risk of sepsis will improve clinical utility by facilitating the administration of antibiotics hours earlier than with conventional monitoring and clinical observation.

Once a model is developed or trained, testing or validation is an essential next step. A validation data set can be internal (a subset of the original dataset) or external, from a new cohort at a different center. Validation in external datasets with similar patient characteristics and practice patterns compared to the test data provides evidence for reproducibility of model performance, while external validation using data from cohorts with different characteristics (for example, different centers, patient demographics, level of illness, or clinical practices), provides evidence of model transportability40. In an example from our prior work, we showed that differences in invasive versus non-invasive respiratory support across NICUs impacted the performance of a sepsis prediction model that incorporated features to detect apnea22.

In addition to external validation, ongoing evaluation of ML models is essential to ensure adequate performance after implementation. Data shift or drift may occur over time as practices, hospital systems, and patient populations change41. Examples that could impact NICU sepsis model performance include a change in bedside monitors with differences in HR or SpO2 averaging times, change in practices for obtaining specific laboratory tests that serve as model inputs, or changes in the use of medications or respiratory support that may impact vital sign patterns.

The ultimate step in model evaluation is conducting randomized clinical trials to determine whether displaying the output of an AI algorithm leads to meaningfully improved outcomes for patients or for care providers. Only through well-designed large RCTs will sepsis AI systems be trusted, implemented, and routinely used for patient care. And finally, since many research groups are developing sepsis AI, it is important that results and algorithms be shared with other researchers. In order to better interpret results across studies, models should be reported using a standardized format such as TRIPOD (Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis)30 or a subsequent format specific for AI (TRIPOD-AI)31.

Q3: What data can be used in neonatal sepsis prediction models?

A first step in ML model development is to determine which data variables to use for modeling. This is done through various methods, foremost of which is using domain knowledge of clinical associations with neonatal sepsis. Other factors often considered involve the accessibility of the data and the feasibility of its use in prospective implementation. When predicting imminent late-onset sepsis, other decisions include whether to use physiologic data derived from high-resolution data (e.g., the electrocardiogram waveform signal sampled at 250 Hz), low-resolution data (e.g., demographics, clinical risks or signs, intermittently sampled vital signs, or laboratory tests), or a combination of both.

Vital sign data for sepsis AI algorithms:

Physiological monitoring data for sepsis prediction models can be continuously or intermittently sampled. Continuous bedside monitoring data available for NICU patients include heart rate or inter-heartbeat intervals from the electrocardiogram, SpO2 and pulse rate from the pulse oximeter, and respiratory rate from chest impedance signals. Neonates have irregular respirations, so the displayed breathing rate on the monitor, averaged over a short time interval, often oscillates from high to low when the actual respiratory rate counted over 30–60 seconds is somewhere in between. NICU monitors are also notoriously inaccurate for detecting apnea42. For these reasons, respiratory rate from current NICU bedside monitors may not be particularly useful in sepsis detection algorithms.

Hypotension is an important marker of sepsis but often presents later in the course of illness. Many infants do not have an invasive arterial catheter at the time they develop sepsis, and cuff measurements can be unreliable43,44, but new technology for continuous non-invasive blood pressure monitoring is under development45. With regard to temperature, premature infants are cared for in incubators, thus considering both patient and incubator temperature would be needed to detect temperature instability (hypothermia or fever) as a sign of sepsis. Detecting a difference between core and peripheral temperature has been proposed as a warning sign of poor perfusion related to sepsis, and research is ongoing to determine how to incorporate this into early warning systems4648. Other new technologies for continuous monitoring of ICU patients are under development and may enable greater depth and breadth of vital signs and other physiologic signals that can be incorporated into sepsis AI models. Wireless sensors are being developed for PICU and NICU patients that can measure not only vital signs but also other clinical variables such as motion and vocalization that might provide clues to changes in patient status49.

With existing monitoring technology, a pathologic change in cardiorespiratory patterns can serve as a physiomarker of sepsis in the early stages as the body mounts a systemic inflammatory response50,51. Low variability of HR52 accompanied by HR decelerations was recognized as a signature of illness due to neonatal sepsis in premature infants 53. The mechanism of abnormal heart rate characteristics in sepsis is not completely understood but likely involves both cytokine effects and autonomic nervous system activation with increased vagus nerve firing5457. The heart rate characteristics (HRC) index, a continuous sepsis prediction model, was developed to capture these abnormal patterns and is discussed later in this review 10.

In searching beyond HR patterns for other physiomarkers of neonatal sepsis, a logical place to look is in the respiratory data. While respiratory rate has limitations due to inaccuracies in measurement and sampling, SpO2 data contain information that may add to HR analysis. An increase in central apnea is one of the major signs of sepsis in preterm NICU patients, 5660 due at least in part to cytokine-triggered release of endogenous prostaglandins 5658. Apnea detection through chest impedance waveform analysis is complicated, while detection of a decline in HR and SpO2 that often accompany apnea is simpler. One analytic that serves this purpose is cross-correlation of HR and SpO2 which measures the degree to which the two signals co-trend within a set lag time. An increase in this metric captures deceleration-desaturation events, which have been shown to correlate with increased central apnea or exaggerated pathologic periodic breathing in preterm infants22,59.

Although continuous physiologic monitoring data are important for sepsis prediction, there are limitations. First, differences in equipment sampling rate, averaging time, or artifact filtering may impact predictive model performance when applied at different centers or in different epochs. Second, clinical practices that impact vital signs vary widely among NICUs, including medications, respiratory support, and monitor alarm settings. Our group conducted a three-NICU comparison of vital signs measured every two seconds from VLBW infants and found small differences in SpO2 patterns and in number of bradycardia episodes between sites. These were likely due to differences in equipment and in site-specific clinical practices such as use of mechanical ventilation 61. Another important consideration is that cardiorespiratory instability in preterm infants is not a specific sign of sepsis. For example we have shown that infants with severe intraventricular hemorrhage and severe acute or chronic lung disease have abnormal heart rate characteristics in absence of sepsis 62,63.

Demographic, Laboratory, and Clinical Data for sepsis AI algorithms:

Gestational age and birth weight are inversely proportional to the rate of neonatal sepsis and can be incorporated into prediction models 10,77, 7,8,78. Postnatal and postmenstrual age also provide information on sepsis risk, since late-onset sepsis incidence peaks at 1–3 weeks of age and decreases with maturation as immunity becomes more robust and invasive lines and tubes are no longer needed79,80. Male sex has been associated with a higher risk of sepsis, with the hypothesis that sex hormones impact immune system development 81,82. Finally, race and ethnicity may be associated with increased risk, and there is some evidence this may be due not to biological but rather to socioeconomic factors 83,84.

Laboratory tests that measure components of the host response to infection, such as immature neutrophils or serial c-reactive protein (CRP) values, are the most commonly used tests for sepsis screening6468. These tests do not have adequate sensitivity to stand on their own, but may serve as decision support for either starting or withholding antibiotics in conjunction with other clinical variables. Cytokines, such as interleukin-6, are likely to be more useful than CRP for sepsis prediction6972 or sepsis severity and mortality risk prediction73, but are not yet readily available in most clinical laboratories, and research into other biomarkers is ongoing74,75. Salivary diagnostics are increasingly used for non-invasive screening and might lead to testing strategies to identify infants with sepsis prior to clinical deterioration 76.

Clinical risk factors for LOS include the presence of a central vascular catheter or the need for mechanical ventilation. Certain medications may be associated with increased sepsis risk, either because they weaken the immune system or because they were prescribed due to physiologic instability. For example, hydrocortisone may be given to preterm infants with severe chronic lung disease and might slightly increase risk of late-onset sepsis by immune suppression, but hydrocortisone is also commonly prescribed for hypotension 85 and therefore may indicate presence of (and not simply risk for) sepsis. Variables known to decrease sepsis risk include an exclusive human milk diet and probiotics. While such variables might improve the predictive performance of a sepsis ML model, they generally report on things that clinicians already know and are not particularly useful in developing early warning systems.

Clinical signs of sepsis are the main way that humans (in contrast to machines) assess whether a preterm infant should be evaluated and treated. Incorporating clinical signs into sepsis algorithms generally requires user input. For example, the EOS calculator, described later in this review, incorporates risk factors but also allows users to input their clinical assessment of the infant as asymptomatic, equivocal, or having clinical illness 9. With regard to LOS, common signs include increased apnea, respiratory distress, feeding intolerance (ileus), poor perfusion (delayed capillary refill, oliguria), temperature instability, hypotension, and lethargy. Several studies have developed logistic regression models to assess the sensitivity or predictive value of clinical signs documented as prompting sepsis evaluation60,86,87. These signs may be captured from EHR documentation, but often not reliably or in a time frame that could go into an AI algorithm and help a patient. For example, text mining from clinician notes might extract specific words or word combinations like “lethargic” or “poor perfusion”, but once an ICU clinician writes this in the EHR they typically have already ordered the blood culture and antibiotics. One solution to automatically measure lethargy is monitoring for decreased movements, either with an actimeter 49 or by detecting changes in motion artifact in waveforms from standard bedside monitors.88

Q4: What advanced warning systems for sepsis exist, and what lies in the future?

Before discussing developed and future AI systems for sepsis, consideration should be given to “Early Warning Scores” (EWS). EWS and ML models are both designed to alert the medical team to concerning vital signs, laboratory values, or other clinical changes that might otherwise go unnoticed. Both can be integrated into the EHR or displayed at the bedside, and both can incorporate information from a mix of static and dynamic clinical variables. EWS employ a “track and trigger” approach, whereas AI models use math and the data to learn temporal trends and correlation among parameters 89. For example, the Pediatric Early Warning Score (PEWS) is calculated based on periodic observations of multiple physiological parameters and designed to predict clinical deterioration (including but not limited to sepsis) in hospitalized children 90,91. Thresholds are set and points assigned for behavioral, respiratory, and cardiovascular compromise. An RCT of PEWS compared to standard care, performed across multiple centers in Canada, found no mortality reduction but fewer significant clinical deterioration events92. In a retrospective single-center review of 72 hospitalized children that required emergent transfer to the ICU (mostly for respiratory distress), a barrier to earlier identification of patient deterioration was lack of continuous telemetry or pulse oximetry monitoring in about one-third of cases.93 Furthermore, in that study PEWS was calculated incorrectly in nearly half of cases in the 24h period before ICU transfer. These studies highlight the potential benefits of continuous rather than intermittent vital sign monitoring for high risk non-ICU patients, as well as the importance of having a system for correct input of data into EWS algorithms.

Several EWS scores for adult patients have been implemented, including the National Early Warning Score (NEWS)94 in the United Kingdom and the Modified Early Warning Score (MEWS)95. Like PEWS, these scores track patient data and trigger an alert based on points assigned by thresholds. In contrast, a recently described Targeted Real-time Early Warning System (TREWS) uses a machine learning model, rather than threshold-based points, to predict septic shock96. TREWS is calculated using 27 vital signs and laboratory inputs from the EHR. A comparison of TREWS versus MEWS for predicting septic shock in hospitalized adults showed improved performance, demonstrated by AUC of 0.83 compared to 0.73. In a recent report, adoption of TREWS was reported to be high, with 89% of alerts linked to a clinician response, and about one-third of alerts associated with septic shock97. AI models would be expected to perform better than EWS because they can use the data as continuous rather than categorical values determined by thresholds. Additionally, modeling, rather than empirically derived cutoffs, can detect more subtle and complex patterns in the data associated with the target outcome.

One ML model for adults, the Advance Alert Monitor98, uses EHR data to predict clinical deterioration (not limited to sepsis) in hospitalized patients. This system sends an alert to a centralized team of specially trained nurses and was tested using a staggered implementation clinical trial design in 21 hospitals99. For about 15,000 patients at hospitals using the system, mortality was 16% lower compared to about 30,000 similar patients at hospitals not using the system. Another consideration is that ML models that display illness trajectories over time are useful for determining whether a patient’s condition is improving or worsening. One such system, Continuous Monitoring of Event Trajectories (CoMET), analyzes vital sign data to predict clinical deterioration and is undergoing a cluster-randomized clinical trial to determine impact on important clinical outcomes100.

ML models have been developed and implemented for pediatric sepsis. In a study of children 2–17 years of age presenting to an Emergency Department, a model developed using boosted ensemble decision trees incorporated hourly vital signs and (if available) platelet count, white blood cell count, and Glascow Coma Scale. The model was trained on data from nearly 10,000 patient encounters to detect severe sepsis which occurred in 101 patients, and had AUC of 0.916 at the time of diagnosis and (more importantly) AUC 0.718 four hours prior to clinical diagnosis101. Another group used data from 1,425 PICU patients with 187 cases of sepsis to develop a sepsis ML model incorporating continuous vital sign data, 11 laboratory values, and 4 clinical variables (age, sex, central line and mechanical ventilation)102. Both logistic regression and random forest modeling were tested and had similar AUC, but the latter captured a relationship between age and sepsis better.

Neonatal sepsis AI

A model for predicting EOS has now been widely implemented as a web-based calculator 9,103,104. The model was developed from a large dataset generated by a network of newborn hospitals, the Northern California Kaiser Permanente System and deployed as an online calculator. It uses perinatal risk factors known at the time of birth in a logistic regression model to derive prior probability and then incorporates the clinicians’ assessment (asymptomatic, equivocal, or clinically ill) using Bayes’ theorem. The risk per 1,000 live births is displayed for each of three categories of illness9,103. Clinical decision support is provided, allowing for clinical judgment to guide the application of the AI technology, which is likely a factor in the widespread adoption of this model. Studies of the impact of the EOS calculator have shown it reduces the number of asymptomatic or equivocal infants with sepsis risk factors undergoing laboratory evaluations and exposure to antibiotics.103

A number of models for prediction of LOS in NICU patients before they are obviously sick have been published113. Table 1 summarizes models using continuous or intermittently sampled data to make predictions about imminent sepsis, with the goal of early detection and treatment which may improve outcomes10,88,107,111,112. Several other studies have used data at the time of blood culture to predict whether sepsis will be ruled in or out (positive versus negative culture) which may be useful for determining when to start and stop antibiotics114,115. And finally, several studies have used vital sign data shortly after birth to predict risk of developing sepsis later in the NICU course, which might identify highest risk infants in need of enhanced vigilance or preventive strategies not suitable for the entire preterm population116,117.

Table 1.

A summary of select studies reporting the development and performance of machine learning (ML) models to predict imminent late-onset sepsis.

First author (Model name) Population Late-onset sepsis cases used for modeling Data Input Sepsis training window (hours) External validation Modeling method
Griffin et al. (HRC index) 7,74,89 VLBW culture positive and clinical ECG heart rate characteristics (low variability, decels) −24 to 0 Yes Logistic regression
Gur et al. (RALIS) 67,90,91 VLBW culture positive and clinical HR, RR, temp, weight, desaturation, bradycardia episodes Not reported Not reported Not specified
Joshi, et al.60; Cabrera-Quiros, et al.92 <32 weeks GA culture positive 4 feature subsets: (1) HRV, (2) movement, (3) respiration, and (4) combination of above −24 to 0 and
−3 to 0
No Logistic regression, naive Bayes, nearest mean classifier
Mani et al.69 All NICU patients culture positive and clinical EHR data: 71 laboratory 30 demographic + clinical variables −48 to +12 No 9 ML models
Masino et al. 93 All NICU patients culture positive and clinical 36 features from EHR, including VS and clinical status −48 to −4 No 8 ML models
Song et al.68 All NICU patients culture positive and clinical Vital signs, blood gas data, CBC −24 to 0 No 8 ML models

To date, the only commercially available system for NICU predictive monitoring using continuous bedside monitor data is the HRC index, or HeRO Score. This is also the only ML model for neonatal sepsis that has been tested in a randomized clinical trial and shown to improve important clinical outcomes105. The HRC algorithm uses electrocardiogram data from standard NICU bedside monitors to calculate the fold-increased risk of a clinical deterioration due to sepsis (culture-proven) or a sepsis-like illness (clinical sepsis) in the next 24 hours. The algorithm uses mathematical calculations that report on decreased HR variability and transient HR decelerations, patterns shown in pre-clinical models to reflect pathogen-induced inflammatory cytokine release and vagus nerve firing55,118. In a randomized clinical trial of 3003 VLBW infants at 9 NICUs105, display of this risk score was associated with significantly lower sepsis-associated mortality(12% versus 20%), presumably due to earlier treatment119. Importantly, display of the score was associated with only a small increase in the number of blood cultures and antibiotic days, perhaps indicating that clinicians also used the score for its negative predictive value to decide not to start antibiotics or to discontinue antibiotics in some patients with non-specific, mild clinical signs.

The HRC index provides an example of AI development and translation from the bench to bedside, and it uses relatively limited input data. Adding respiratory analytics to heart rate characteristics might lead to better prediction of LOS during the pre-clinical (non-obvious) phase of infection. The major sign of sepsis in preterm infants is apnea, which may manifest as spells of bradycardia and oxygen desaturation. High or increasing cross-correlation of HR and SpO2, reflecting concurrent transient decelerations and desaturations, was shown to be the best predictor of impending sepsis or necrotizing enterocolitis in a 2-NICU study of more than 1,000 VLBW infants22. In a 3-NICU study of LOS compared with sepsis ruled out, cross-correlation HR-SpO2 increased more for LOS cases, and combining vital sign data with clinical data improved discrimination and classification of events as sepsis ruled-in or ruled-out86. Another group developed a ML model incorporating HR, SpO2, and RR features as well as ECG waveform analysis to detect decreased movement that may occur in sepsis.88 These examples represent the power of combining domain expertise of neonatologists, mathematicians, engineers, and data scientists to optimize sepsis AI.

Q5: What are some benefits and barriers for sepsis ML model implementation and clinical integration?

Much has been written recently about the potential benefits of AI implementation in healthcare120. On the cautionary side, hype about “AI solutions” implies there is a relatively simple way to implement AI and improve outcomes of complex medical problems, and this is not the case. Even optimal AI models will not replace the hard work of clinicians deciding which tests to order and which therapies to administer. 121 That said, with regard to sepsis the obvious benefit of implementing an automated prediction system is earlier antibiotic treatment and supportive care leading to improved outcomes. The 20% reduction in sepsis-associated mortality with continuous HRC index display in the HeRO RCT is an example. For survivors of neonatal sepsis, earlier treatment may not improve neurodevelopmental outcomes 122 but might reduce NICU length of stay 123.

Beyond direct patient benefits, other potential benefits of using AI risk models for sepsis include resource allocation and risk stratification, which can be useful for cost-effectiveness analyses, classification for research, and benchmarking across hospitals. Also, AI might reduce bias in healthcare, but only if care is taken to avoid introducing biased data into algorithms. An example of racial bias in a NICU algorithm was recently discovered in a risk calculator for bronchopulmonary dysplasia. The algorithm did not account for the competing outcome of death, which was higher among Black infants leading to a lower risk of being diagnosed with BPD. This might create inequity in clinical care, since the calculator has been promoted as a decision support tool for administering glucocorticoids to reduce the severity of chronic lung disease. A first step in addressing bias in AI is to develop and test algorithms for performance across the spectrum of patient sex, race, ethnicity, and socioeconomic status. With regard to sepsis, a potential advantage of physiology-based algorithms is that heart rate patterns of neonates tend to be similar across the spectrum of patient diversity. Adding pulse oximetry data to heart rate characteristics will need close scrutiny since racial differences in accuracy of pulse oximetry data have recently been described in adults124, children125, and neonates126. Regardless of what sources of data serve as model input, AI algorithms should be developed and validated in large, diverse patient populations with efforts made to minimize bias of all types.

Although there are many potential benefits of AI, there are also many barriers. We developed the acronym “BARRIERS” to summarize some major challenges in this field: Babies, Analytics, Reactors, Reassurance, Integration, Equipment, Reeducation, and Space127. Babies themselves can complicate the development and deployment of early warning systems for sepsis since they cannot announce that they feel sick, and their signs of sepsis are non-specific and overlap with normal preterm physiology. This creates the problem of false alarms in a unit already prone to alarm fatigue128,129. Another barrier, “Analytics,” refers to the difficulty in creating models due to heterogeneity of event identification, variable selection, and modeling techniques, as previously discussed in this review. “Reactors” are the model users, NICU clinicians with varied education, experience, and responsibilities. The barrier, in this case, is the difficulty in displaying data and clinical decision support in a way that is effective for a broad range of clinicians. “Reassurance” can be a problem with AI models if a low-risk score falsely reassures the clinical team faced with an infant with significant signs of illness, leading to a delay in treatment. “Integration” refers to the challenge of introducing an AI model without creating too many distracting false alarms. One way to mitigate alarm fatigue yet assure that critical information is transmitted to the right person is to have a centralized clinical team that reviews alerts and determines which ones should be transmitted to the care team99, an approach that may not be broadly feasible. The “E” in BARRIERS is equipment that must be integrated into the clinical workflow. Once the system is integrated, education and “Re-education” for users are critically important to assure proper implementation. And finally, “Space” can be a barrier since the NICU bedside may already be crowded with equipment and monitors. A new sepsis prediction system needs to be positioned in such a way to be noticeable but not overwhelming.

In the case of an AI system that is shown to improve patient outcomes, implementation relies on hospital administrators and clinicians “buying in.” A survey-based study of continuous predictive monitoring reported that users had positive engagement with the system if they trusted the data used in the model and if they understood the science behind the model outputs130. This is the basis for the term “explainable AI” which some view as essential for clinicians to utilize the system, although others argue that methods to make models explainable sacrifice their performance131. A final consideration is that AI systems may introduce unintended consequences such as inappropriate testing and therapies. Other sources of harm if AI algorithms are not appropriately developed and utilized is the creation of distractions which can lead to medical errors and contribute to burnout which is an increasing problem among healthcare workers.

CONCLUSION

Sepsis AI is a way to analyze and present data to clinicians for earlier detection and treatment leading to improved patient outcomes. If properly developed and implemented, AI systems can alert clinicians to a change in a patient’s condition that warrants a bedside evaluation. At that point, human intelligence (and experience) kick in. Initiating a workup and antibiotics for all patients with a bad score would constitute malpractice. Rather, clinicians should consider the AI result and then incorporate what they see and what they know to make the best decisions for individual patients.

Impact Statement:

  • This narrative review highlights the application of AI in neonatal sepsis prediction. It describes issues in clinical prediction model development specific to this population.

  • This article reviews the methods, considerations, and literature on neonatal sepsis model development and validation.

  • Challenges of AI technology and potential barriers to using sepsis AI systems in the NICU are discussed.

Bibliography

  • 1.Wynn J, Cornell TT, Wong HR, Shanley TP & Wheeler DS The host response to sepsis and developmental impact. Pediatrics 125, 1031–1041 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Gasparrini AJ et al. Antibiotic perturbation of the preterm infant gut microbiome and resistome. Gut Microbes 7, 443–449 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Dardas M et al. The impact of postnatal antibiotics on the preterm intestinal microbiome. Pediatr. Res. 76, 150–158 (2014). [DOI] [PubMed] [Google Scholar]
  • 4.Pammi M et al. Intestinal dysbiosis in preterm infants preceding necrotizing enterocolitis: a systematic review and meta-analysis. Microbiome 5, 31 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Cantey JB, Pyle AK, Wozniak PS, Hynan LS & Sánchez PJ Early antibiotic exposure and adverse outcomes in preterm, very low birth weight infants. J. Pediatr. 203, 62–67 (2018). [DOI] [PubMed] [Google Scholar]
  • 6.Ting JY et al. Association Between Antibiotic Use and Neonatal Mortality and Morbidities in Very Low-Birth-Weight Infants Without Culture-Proven Sepsis or Necrotizing Enterocolitis. JAMA Pediatr. 170, 1181–1187 (2016). [DOI] [PubMed] [Google Scholar]
  • 7.Köstlin-Gille N et al. Epidemiology of early and late onset neonatal sepsis in very low birthweight infants: data from the german neonatal network. Pediatr. Infect. Dis. J. 40, 255–259 (2021). [DOI] [PubMed] [Google Scholar]
  • 8.Shane AL, Sánchez PJ & Stoll BJ Neonatal sepsis. Lancet 390, 1770–1780 (2017). [DOI] [PubMed] [Google Scholar]
  • 9.Escobar GJ et al. Stratification of risk of early-onset sepsis in newborns ≥ 34 weeks’ gestation. Pediatrics 133, 30–36 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Griffin MP et al. Abnormal heart rate characteristics preceding neonatal sepsis and sepsis-like illness. Pediatr. Res. 53, 920–926 (2003). [DOI] [PubMed] [Google Scholar]
  • 11.Iwashyna TJ et al. Identifying patients with severe sepsis using administrative claims: patient-level validation of the angus implementation of the international consensus conference definition of severe sepsis. Med. Care 52, e39–43 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Ramanathan R et al. Validity of International Classification of Diseases, Ninth Revision, Clinical Modification (ICD-9-CM) screening for sepsis in surgical mortalities. Surg Infect (Larchmt) 15, 513–516 (2014). [DOI] [PubMed] [Google Scholar]
  • 13.Wynn JL & Polin RA Progress in the management of neonatal sepsis: the importance of a consensus definition. Pediatr. Res. 83, 13–15 (2018). [DOI] [PubMed] [Google Scholar]
  • 14.Molloy EJ et al. Neonatal sepsis: need for consensus definition, collaboration and core outcomes. Pediatr. Res. 88, 2–4 (2020). [DOI] [PubMed] [Google Scholar]
  • 15.Henry CJ et al. Neonatal sepsis: a systematic review of core outcomes from randomised clinical trials. Pediatr. Res. (2022) doi: 10.1038/s41390-021-01883-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Hayes R et al. Neonatal sepsis definitions from randomised clinical trials. Pediatr. Res. (2021) doi: 10.1038/s41390-021-01749-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Cantey JB & Baird SD Ending the Culture of Culture-Negative Sepsis in the Neonatal ICU. Pediatrics 140, (2017). [DOI] [PubMed] [Google Scholar]
  • 18.Cantey JB & Prusakov P A proposed framework for the clinical management of neonatal “culture-negative” sepsis. J. Pediatr. (2022) doi: 10.1016/j.jpeds.2022.01.006. [DOI] [PubMed] [Google Scholar]
  • 19.Mukhopadhyay S & Puopolo KM Antibiotic Use and Mortality Among Premature Infants Without Confirmed Infection-Perpetrator or Innocent Bystander? JAMA Pediatr. 170, 1144–1146 (2016). [DOI] [PubMed] [Google Scholar]
  • 20.Meister AL et al. Vagal tone and proinflammatory cytokines predict feeding intolerance and necrotizing enterocolitis risk. Adv. Neonatal Care 21, 452–461 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Gephart SM et al. Discrimination of GutCheck(NEC): a clinical risk index for necrotizing enterocolitis. J. Perinatol. 34, 468–475 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Fairchild KD et al. Vital signs and their cross-correlation in sepsis and NEC: a study of 1,065 very-low-birth-weight infants in two NICUs. Pediatr. Res. 81, 315–321 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Stone ML et al. Abnormal heart rate characteristics before clinical diagnosis of necrotizing enterocolitis. J. Perinatol. 33, 847–850 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Wynn JL & Polin RA A neonatal sequential organ failure assessment score predicts mortality to late-onset sepsis in preterm very low birth weight infants. Pediatr. Res. 88, 85–90 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Fleiss N et al. Evaluation of the Neonatal Sequential Organ Failure Assessment and Mortality Risk in Preterm Infants With Late-Onset Infection. JAMA Netw. Open 4, e2036518 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Liu Y, Chen P-HC, Krause J & Peng L How to read articles that use machine learning: users’ guides to the medical literature. JAMA 322, 1806–1816 (2019). [DOI] [PubMed] [Google Scholar]
  • 27.Beaulieu-Jones BK et al. Machine learning for patient risk stratification: standing on, or looking over, the shoulders of clinicians? npj Digital Med. 4, 62 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Beam AL & Kohane IS Translating artificial intelligence into clinical care. JAMA 316, 2368–2369 (2016). [DOI] [PubMed] [Google Scholar]
  • 29.Spaeder MC et al. Perioperative Near-Infrared Spectroscopy Monitoring in Neonates With Congenital Heart Disease: Relationship of Cerebral Tissue Oxygenation Index Variability With Neurodevelopmental Outcome. Pediatr. Crit. Care Med. 18, 213–218 (2017). [DOI] [PubMed] [Google Scholar]
  • 30.Collins GS, Reitsma JB, Altman DG & Moons KGM Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): the TRIPOD statement. Ann. Intern. Med. 162, 55–63 (2015). [DOI] [PubMed] [Google Scholar]
  • 31.Collins GS et al. Protocol for development of a reporting guideline (TRIPOD-AI) and risk of bias tool (PROBAST-AI) for diagnostic and prognostic prediction model studies based on artificial intelligence. BMJ Open 11, e048008 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Hastie T, Tibshirani R & Friedman J The Elements of Statistical Learning. 106–119 (Springer New York, 2009). doi: 10.1007/978-0-387-84858-7. [DOI] [Google Scholar]
  • 33.Justice AC, Covinsky KE & Berlin JA Assessing the generalizability of prognostic information. Ann. Intern. Med. 130, 515–524 (1999). [DOI] [PubMed] [Google Scholar]
  • 34.Harrell FE, Lee KL & Mark DB Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat. Med. 15, 361–387 (1996). [DOI] [PubMed] [Google Scholar]
  • 35.Diamond GA What price perfection? Calibration and discrimination of clinical prediction models. J. Clin. Epidemiol. 45, 85–89 (1992). [DOI] [PubMed] [Google Scholar]
  • 36.Pinker E Reporting accuracy of rare event classifiers. npj Digital Med. 1, 56 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Van Calster B et al. Calibration: the Achilles heel of predictive analytics. BMC Med. 17, 230 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Harrell F Regression Modeling Strategies: With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis (Springer Series in Statistics). (Springer International Publishing; :, 2015). [Google Scholar]
  • 39.Van Calster B et al. A calibration hierarchy for risk models was defined: from utopia to empirical data. J. Clin. Epidemiol. 74, 167–176 (2016). [DOI] [PubMed] [Google Scholar]
  • 40.Nevin L & PLOS Medicine Editors. Advancing the beneficial use of machine learning in health care and medicine: Toward a community understanding. PLoS Med. 15, e1002708 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Subbaswamy A, Adams R & Saria S Evaluating Model Robustness and Stability to Dataset Shift. arXiv (2020) doi: 10.48550/arxiv.2010.15100. [DOI] [Google Scholar]
  • 42.Vergales BD et al. Accurate automated apnea analysis in preterm infants. Am. J. Perinatol. 31, 157–162 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Alonzo CJ et al. Blood pressure ranges via non-invasive and invasive monitoring techniques in premature neonates using high resolution physiologic data. J. Neonatal Perinatal Med. 13, 351–358 (2020). [DOI] [PubMed] [Google Scholar]
  • 44.Werther T et al. Discrepancy between invasive and non-invasive blood pressure readings in extremely preterm infants in the first four weeks of life. PLoS ONE 13, e0209831 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Quan X et al. Advances in Non-Invasive Blood Pressure Monitoring. Sensors 21, (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Ralphe JL, Silva SG, Dail RB & Brandon DH Body temperature instability and respiratory morbidity in the very low birth weight infant: a multiple case, intensive longitudinal study. BMC Pediatr. 20, 485 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Dail RB et al. Predicting infection in very preterm infants: A study protocol. Nurs. Res. 70, 142–149 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Knobel-Dail RB, Sloane R, Holditch-Davis D & Tanaka DT Negative temperature differential in preterm infants less than 29 weeks gestational age: associations with infection and maternal smoking. Nurs. Res. 66, 442–453 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Chung HU et al. Skin-interfaced biosensors for advanced wireless physiological monitoring in neonatal and pediatric intensive-care units. Nat. Med. 26, 418–429 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Sullivan BA & Fairchild KD Vital signs as physiomarkers of neonatal sepsis. Pediatr. Res. (2021) doi: 10.1038/s41390-021-01709-x. [DOI] [PubMed] [Google Scholar]
  • 51.Kumar N, Akangire G, Sullivan B, Fairchild K & Sampath V Continuous vital sign analysis for predicting and preventing neonatal diseases in the twenty-first century: big data to the forefront. Pediatr. Res. 87, 210–220 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Sullivan BA & Fairchild KD Predictive monitoring for sepsis and necrotizing enterocolitis to prevent shock. Semin. Fetal Neonatal Med. 20, 255–261 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Griffin MP & Moorman JR Toward the early diagnosis of neonatal sepsis and sepsis-like illness using novel heart rate analysis. Pediatrics 107, 97–104 (2001). [DOI] [PubMed] [Google Scholar]
  • 54.Tracey KJ Physiology and immunology of the cholinergic antiinflammatory pathway. J. Clin. Invest. 117, 289–296 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Fairchild KD, Srinivasan V, Moorman JR, Gaykema RPA & Goehler LE Pathogen-induced heart rate changes associated with cholinergic nervous system activation. Am. J. Physiol. Regul. Integr. Comp. Physiol. 300, R330–9 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Fairchild K et al. Clinical associations of immature breathing in preterm infants: part 1-central apnea. Pediatr. Res. 80, 21–27 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Siljehav V, Hofstetter AM, Leifsdottir K & Herlenius E Prostaglandin E2 Mediates Cardiorespiratory Disturbances during Infection in Neonates. J. Pediatr. 167, 1207–13.e3 (2015). [DOI] [PubMed] [Google Scholar]
  • 58.Herlenius E An inflammatory pathway to apnea and autonomic dysregulation. Respir. Physiol. Neurobiol. 178, 449–457 (2011). [DOI] [PubMed] [Google Scholar]
  • 59.Fairchild KD & Lake DE Cross-Correlation of Heart Rate and Oxygen Saturation in Very Low Birthweight Infants: Association with Apnea and Adverse Events. Am. J. Perinatol. 35, 463–469 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Das A, Shukla S, Rahman N, Gunzler D & Abughali N Clinical Indicators of Late-Onset Sepsis Workup in Very Low-Birth-Weight Infants in the Neonatal Intensive Care Unit. Am. J. Perinatol. 33, 856–860 (2016). [DOI] [PubMed] [Google Scholar]
  • 61.Zimmet AM et al. Vital sign metrics of VLBW infants in three NICUs: implications for predictive algorithms. Pediatr. Res. (2021) doi: 10.1038/s41390-021-01428-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Fairchild KD et al. Abnormal heart rate characteristics are associated with abnormal neuroimaging and outcomes in extremely low birth weight infants. J. Perinatol. 34, 375–379 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Sullivan BA, Grice SM, Lake DE, Moorman JR & Fairchild KD Infection and other clinical correlates of abnormal heart rate characteristics in preterm infants. J. Pediatr. 164, 775–780 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Ohlin A, Björkqvist M, Montgomery SM & Schollin J Clinical signs and CRP values associated with blood culture results in neonates evaluated for suspected sepsis. Acta Paediatr. 99, 1635–1640 (2010). [DOI] [PubMed] [Google Scholar]
  • 65.Coggins SA et al. Use of a computerized C-reactive protein (CRP) based sepsis evaluation in very low birth weight (VLBW) infants: a five-year experience. PLoS ONE 8, e78602 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Rønnestad A, Abrahamsen TG, Gaustad P & Finne PH C-reactive protein (CRP) response patterns in neonatal septicaemia. APMIS 107, 593–600 (1999). [DOI] [PubMed] [Google Scholar]
  • 67.Benitz WE, Han MY, Madan A & Ramachandra P Serial serum C-reactive protein levels in the diagnosis of neonatal infection. Pediatrics 102, E41 (1998). [DOI] [PubMed] [Google Scholar]
  • 68.Brown JVE, Meader N, Wright K, Cleminson J & McGuire W Assessment of C-Reactive Protein Diagnostic Test Accuracy for Late-Onset Infection in Newborn Infants: A Systematic Review and Meta-analysis. JAMA Pediatr. 174, 260–268 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Raynor LL et al. Cytokine screening identifies NICU patients with Gram-negative bacteremia. Pediatr. Res. 71, 261–266 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Angurana SK, Bansal A, Muralidharan J, Aggarwal R & Singhi S Cytokine levels in critically ill children with severe sepsis and their relation with the severity of illness and mortality. J. Intensive Care Med. 885066620912989 (2020). [DOI] [PubMed] [Google Scholar]
  • 71.Ng PC et al. Proinflammatory and anti-inflammatory cytokine responses in preterm infants with systemic infections. Arch. Dis. Child. Fetal Neonatal Ed. 88, F209–13 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Küster H et al. Interleukin-1 receptor antagonist and interleukin-6 for early diagnosis of neonatal sepsis 2 days before clinical manifestation. Lancet 352, 1271–1277 (1998). [DOI] [PubMed] [Google Scholar]
  • 73.Kurul Ş et al. Association of inflammatory biomarkers with subsequent clinical course in suspected late onset sepsis in preterm neonates. Crit. Care 25, 12 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Cantey JB & Lee JH Biomarkers for the diagnosis of neonatal sepsis. Clin. Perinatol. 48, 215–227 (2021). [DOI] [PubMed] [Google Scholar]
  • 75.Ruoss JL & Wynn JL Biomarkers in the Diagnosis of Neonatal Sepsis. in Infectious Disease and Pharmacology: Neonatology Questions and Controversies (eds. Benitz WE & Smith PB) 103–112 (Elsevier, 2019). [Google Scholar]
  • 76.Iyengar A & Maron JL Detecting infection in neonates: promises and challenges of a salivary approach. Clin. Ther. 37, 523–528 (2015). [DOI] [PubMed] [Google Scholar]
  • 77.Lake DE, Fairchild KD & Moorman JR Complex signals bioinformatics: evaluation of heart rate characteristics monitoring as a novel risk marker for neonatal sepsis. J. Clin. Monit. Comput. 28, 329–339 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Shane AL & Stoll BJ Neonatal sepsis: progress towards improved outcomes. J. Infect. 68 Suppl 1, S24–32 (2014). [DOI] [PubMed] [Google Scholar]
  • 79.Stoll BJ et al. Late-onset sepsis in very low birth weight neonates: the experience of the NICHD Neonatal Research Network. Pediatrics 110, 285–291 (2002). [DOI] [PubMed] [Google Scholar]
  • 80.Hornik CP et al. Early and late onset sepsis in very-low-birth-weight infants from a large group of neonatal intensive care units. Early Hum. Dev. 88 Suppl 2, S69–74 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Stevenson DK et al. Sex differences in outcomes of very low birthweight infants: the newborn male disadvantage. Arch. Dis. Child. Fetal Neonatal Ed. 83, F182–5 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.O’Driscoll DN, McGovern M, Greene CM & Molloy EJ Gender disparities in preterm neonatal outcomes. Acta Paediatr. (2018) doi: 10.1111/apa.14390. [DOI] [PubMed] [Google Scholar]
  • 83.Travers CP et al. Racial/ethnic disparities among extremely preterm infants in the united states from 2002 to 2016. JAMA Netw. Open 3, e206757 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Wallace ME et al. Racial/ethnic differences in preterm perinatal outcomes. Am. J. Obstet. Gynecol. 216, 306.e1–306.e12 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Rios DR, Moffett BS & Kaiser JR Trends in pharmacotherapy for neonatal hypotension. J. Pediatr. 165, 697–701.e1 (2014). [DOI] [PubMed] [Google Scholar]
  • 86.Sullivan BA et al. Clinical and vital sign changes associated with late-onset sepsis in very low birth weight infants at 3 NICUs. J. Neonatal Perinatal Med. 14, 553–561 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Fanaroff AA et al. Incidence, presenting features, risk factors and significance of late onset septicemia in very low birth weight infants. The National Institute of Child Health and Human Development Neonatal Research Network. Pediatr. Infect. Dis. J. 17, 593–598 (1998). [DOI] [PubMed] [Google Scholar]
  • 88.Joshi R et al. Predicting Neonatal Sepsis Using Features of Heart Rate Variability, Respiratory Characteristics, and ECG-Derived Estimates of Infant Motion. IEEE J. Biomed. Health Inform. 24, 681–692 (2020). [DOI] [PubMed] [Google Scholar]
  • 89.Gao H et al. Systematic review and evaluation of physiological track and trigger warning systems for identifying at-risk patients on the ward. Intensive Care Med. 33, 667–679 (2007). [DOI] [PubMed] [Google Scholar]
  • 90.Duncan H, Hutchison J & Parshuram CS The Pediatric Early Warning System score: a severity of illness score to predict urgent medical need in hospitalized children. J. Crit. Care 21, 271–278 (2006). [DOI] [PubMed] [Google Scholar]
  • 91.Lambert V, Matthews A, MacDonell R & Fitzsimons J Paediatric early warning systems for detecting and responding to clinical deterioration in children: a systematic review. BMJ Open 7, e014497 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Parshuram CS et al. Effect of a Pediatric Early Warning System on All-Cause Mortality in Hospitalized Pediatric Patients: The EPOCH Randomized Clinical Trial. JAMA 319, 1002–1012 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Kowalski RL, Lee L, Spaeder MC, Moorman JR & Keim-Malpass J Accuracy and monitoring of pediatric early warning score (PEWS) scores prior to emergent pediatric intensive care unit (ICU) transfer: retrospective analysis. JMIR Pediatr. Parent. 4, e25991 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.The Royal College of Physicians. The National Early Warning Score (NEWS) thresholds and triggers. 1 (2012). [Google Scholar]
  • 95.Stenhouse C, Coates S, Tivey M, Allsop P & Parker T Prospective evaluation of a modified Early Warning Score to aid earlier detection of patients developing critical illness on a general surgical ward. Br. J. Anaesth. 84, 663P (2000). [Google Scholar]
  • 96.Henry KE, Hager DN, Pronovost PJ & Saria S A targeted real-time early warning score (TREWScore) for septic shock. Sci. Transl. Med. 7, 299ra122 (2015). [DOI] [PubMed] [Google Scholar]
  • 97.Henry KE et al. Evaluating Adoption, Impact, and Factors Driving Adoption for TREWS, a Machine Learning-Based Sepsis Alerting System. medRxiv (2021). [Google Scholar]
  • 98.Kipnis P et al. Development and validation of an electronic medical record-based alert score for detection of inpatient deterioration outside the ICU. J. Biomed. Inform. 64, 10–19 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Escobar GJ et al. Automated Identification of Adults at Risk for In-Hospital Clinical Deterioration. N. Engl. J. Med. 383, 1951–1960 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Keim-Malpass J et al. Predictive Monitoring-Impact in Acute Care Cardiology Trial (PM-IMPACCT): Protocol for a Randomized Controlled Trial. JMIR Res. Protoc. 10, e29631 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Le S et al. Pediatric severe sepsis prediction using machine learning. Front. Pediatr. 7, 413 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Spaeder MC et al. Predictive analytics in the pediatric intensive care unit for early identification of sepsis: capturing the context of age. Pediatr. Res. 86, 655–661 (2019). [DOI] [PubMed] [Google Scholar]
  • 103.Kuzniewicz MW et al. A Quantitative, Risk-Based Approach to the Management of Neonatal Early-Onset Sepsis. JAMA Pediatr. 171, 365–371 (2017). [DOI] [PubMed] [Google Scholar]
  • 104.Puopolo KM et al. Estimating the probability of neonatal early-onset infection on the basis of maternal risk factors. Pediatrics 128, e1155–63 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Moorman JR et al. Mortality reduction by heart rate characteristic monitoring in very low birth weight neonates: a randomized trial. J. Pediatr. 159, 900–6.e1 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Fairchild KD & O’Shea TM Heart rate characteristics: physiomarkers for detection of late-onset neonatal sepsis. Clin. Perinatol. 37, 581–598 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Gur I et al. Pilot study of a new mathematical algorithm for early detection of late-onset sepsis in very low-birth-weight infants. Am. J. Perinatol. 32, 321–330 (2015). [DOI] [PubMed] [Google Scholar]
  • 108.Gur I et al. A mathematical algorithm for detection of late-onset sepsis in very-low birth weight infants: a preliminary diagnostic test evaluation. Indian Pediatr. 51, 647–650 (2014). [DOI] [PubMed] [Google Scholar]
  • 109.Mithal LB, Yogev R, Palac H, Gur I & Mestan KK Computerized vital signs analysis and late onset infections in extremely low gestational age infants. J. Perinat. Med. 44, 491–497 (2016). [DOI] [PubMed] [Google Scholar]
  • 110.Masino AJ et al. Machine learning models for early sepsis recognition in the neonatal intensive care unit using readily available electronic health record data. PLoS ONE 14, e0212665 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111.Mani S et al. Medical decision support using machine learning for early detection of late-onset neonatal sepsis. J. Am. Med. Inform. Assoc. 21, 326–336 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Song W et al. A Predictive Model Based on Machine Learning for the Early Detection of Late-Onset Neonatal Sepsis: Development and Observational Study. JMIR Med. Inform. 8, e15965 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Persad E et al. Neonatal sepsis prediction through clinical decision support algorithms: A systematic review. Acta Paediatr. 110, 3201–3226 (2021). [DOI] [PubMed] [Google Scholar]
  • 114.Goldberg O et al. Can we improve early identification of neonatal late-onset sepsis? A validated prediction model. J. Perinatol. 40, 1315–1322 (2020). [DOI] [PubMed] [Google Scholar]
  • 115.Sweeney TE et al. Validation of the sepsis metascore for diagnosis of neonatal sepsis. J. Pediatric Infect. Dis. Soc. 7, 129–135 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Saria S, Rajani AK, Gould J, Koller D & Penn AA Integration of early physiological responses predicts later illness severity in preterm infants. Sci. Transl. Med. 2, 48ra65 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Sullivan BA et al. Early pulse oximetry data improves prediction of death and adverse outcomes in a two-center cohort of very low birth weight infants. Am. J. Perinatol. 35, 1331–1338 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.Fairchild KD et al. Endotoxin depresses heart rate variability in mice: cytokine and steroid effects. Am. J. Physiol. Regul. Integr. Comp. Physiol. 297, R1019–27 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Fairchild KD et al. Septicemia mortality reduction in neonates in a heart rate characteristics monitoring trial. Pediatr. Res. 74, 570–575 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120.James CA, Wachter RM & Woolliscroft JO Preparing clinicians for a clinical world influenced by artificial intelligence. JAMA (2022) doi: 10.1001/jama.2022.3580. [DOI] [PubMed] [Google Scholar]
  • 121.Emanuel EJ & Wachter RM Artificial intelligence in health care: will the value match the hype? JAMA 321, 2281–2282 (2019). [DOI] [PubMed] [Google Scholar]
  • 122.King WE, Carlo WA, O’Shea TM, Schelonka RL & HRC neurodevelopmental follow-up investigators. Multivariable predictive models of death or neurodevelopmental impairment among extremely low birth weight infants using heart rate characteristics. J. Pediatr. (2021) doi: 10.1016/j.jpeds.2021.11.026. [DOI] [PubMed] [Google Scholar]
  • 123.Swanson JR et al. Neonatal intensive care unit length of stay reduction by heart rate characteristics monitoring. J. Pediatr. 198, 162–167 (2018). [DOI] [PubMed] [Google Scholar]
  • 124.Sjoding MW, Dickson RP, Iwashyna TJ, Gay SE & Valley TS Racial bias in pulse oximetry measurement. N. Engl. J. Med. 383, 2477–2478 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Andrist E, Nuppnau M, Barbaro RP, Valley TS & Sjoding MW Association of race with pulse oximetry accuracy in hospitalized children. JAMA Netw. Open 5, e224584 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Vesoulis Z, Tims A, Lodhi H, Lalos N & Whitehead H Racial discrepancy in pulse oximeter accuracy in preterm infants. J. Perinatol. 42, 79–85 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 127.Sullivan BA & Keim-Malpass J BARRIERS to early detection of deterioration in hospitalized infants using predictive analytics. Hosp. Pediatr. (2021) doi: 10.1542/hpeds.2020-004382. [DOI] [PubMed] [Google Scholar]
  • 128.Winters BD et al. Technological distractions (part 2): A summary of approaches to manage clinical alarms with intent to reduce alarm fatigue. Crit. Care Med. 46, 130–137 (2018). [DOI] [PubMed] [Google Scholar]
  • 129.Joshi R et al. Pattern discovery in critical alarms originating from neonates under intensive care. Physiol. Meas. 37, 564–579 (2016). [DOI] [PubMed] [Google Scholar]
  • 130.Keim-Malpass J et al. Advancing Continuous Predictive Analytics Monitoring: Moving from Implementation to Clinical Action in a Learning Health System. Crit. Care Nurs. Clin. North Am. 30, 273–287 (2018). [DOI] [PubMed] [Google Scholar]
  • 131.Ghassemi M, Oakden-Rayner L & Beam AL The false hope of current approaches to explainable artificial intelligence in health care. Lancet Digit. Health 3, e745–e750 (2021). [DOI] [PubMed] [Google Scholar]

RESOURCES