Skip to main content
Journal of the American Medical Informatics Association : JAMIA logoLink to Journal of the American Medical Informatics Association : JAMIA
. 2021 Feb 24;28(6):1242–1251. doi: 10.1093/jamia/ocab006

Healthcare Process Modeling to Phenotype Clinician Behaviors for Exploiting the Signal Gain of Clinical Expertise (HPM-ExpertSignals): Development and evaluation of a conceptual framework

Sarah Collins Rossetti 1,2,, Chris Knaplund 1, Dave Albers 1,3, Patricia C Dykes 4,5, Min Jeoung Kang 4,5, Tom Z Korach 4,5, Li Zhou 4,5, Kumiko Schnock 4,5, Jose Garcia 4, Jessica Schwartz 2, Li-Heng Fu 1, Jeffrey G Klann 5, Graham Lowenthal 4, Kenrick Cato 2
PMCID: PMC8200261  PMID: 33624765

Abstract

Objective

There are signals of clinicians’ expert and knowledge-driven behaviors within clinical information systems (CIS) that can be exploited to support clinical prediction. Describe development of the Healthcare Process Modeling Framework to Phenotype Clinician Behaviors for Exploiting the Signal Gain of Clinical Expertise (HPM-ExpertSignals).

Materials and Methods

We employed an iterative framework development approach that combined data-driven modeling and simulation testing to define and refine a process for phenotyping clinician behaviors. Our framework was developed and evaluated based on the Communicating Narrative Concerns Entered by Registered Nurses (CONCERN) predictive model to detect and leverage signals of clinician expertise for prediction of patient trajectories.

Results

Seven themes—identified during development and simulation testing of the CONCERN model—informed framework development. The HPM-ExpertSignals conceptual framework includes a 3-step modeling technique: (1) identify patterns of clinical behaviors from user interaction with CIS; (2) interpret patterns as proxies of an individual’s decisions, knowledge, and expertise; and (3) use patterns in predictive models for associations with outcomes. The CONCERN model differentiated at risk patients earlier than other early warning scores, lending confidence to the HPM-ExpertSignals framework.

Discussion

The HPM-ExpertSignals framework moves beyond transactional data analytics to model clinical knowledge, decision making, and CIS interactions, which can support predictive modeling with a focus on the rapid and frequent patient surveillance cycle.

Conclusions

We propose this framework as an approach to embed clinicians’ knowledge-driven behaviors in predictions and inferences to facilitate capture of healthcare processes that are activated independently, and sometimes well before, physiological changes are apparent.

Keywords: electronic health records, predictive modeling, clinical informatics, conceptual framework

INTRODUCTION

Clinical and physiological measurement data have limited predictive power for early detection of deterioration in hospitalized patients,1 prompting the search for additional, complementary predictive data sources. Clinicians are continuously engaged in complex decision making and pattern matching, informed by their knowledge and expertise, to drive skilled judgments (eg, prediction of patient trajectories) and subsequent actions (eg, increasing the frequency and type of clinical assessments needed).2 Measurement of the knowledge and expertise that drive clinical actions, particularly at scale, is inherently challenging. However, clinical actions are captured continuously through interactions with clinical information systems (CIS), including the electronic health record (EHR), medical devices (eg, intravenous pumps), and other healthcare equipment (eg, Pyxis); these actions provide an opportunity for scalable measures that can be exploited as signals of clinician knowledge and expertise (eg, the decision to order a test or further assess a patient).

System log files have been utilized to model patient states3 and can also be used to model clinician behavior patterns and workflows within and across institutions. Judgments and subsequent actions of clinicians may differ across individuals, specialties, settings, and expertise levels; yet, the care process is conducted in a consistent manner. The nursing process consistently drives nursing practice, is adapted from the scientific method, and is defined as the cyclical process: assess, diagnose, plan, intervene, and evaluate.4 Similar care process patterns drive the practice of other health professionals. While the correct action within each part of the care process may not always be executed (ie, care gaps and disparities exist), the care process is widely accepted and utilized across healthcare domains and can be used as an analytical framework.

OBJECTIVE

To describe our development of the Healthcare Process Modeling Framework to Phenotype Clinician Behaviors for Exploiting the Signal Gain of Clinical Expertise (HPM-ExpertSignals). This framework illustrates (1) how to use CIS interactions to characterize behaviors, (2) how behavioral actions form proxies for clinical expertise and knowledge, and (3) how these proxies provide context that increase clinical predictive power of patient trajectories. This framework, which applies to all types of clinicians (eg, nurses, physicians, therapists), was developed based on our analytical approaches to phenotyping nurses’ behaviors related to clinical concerns to inform a predictive early warning score (EWS).5 While healthcare processes have been modeled previously,6–8 there has been no unified framework to date for systematically exploiting signals produced by the healthcare process in a way that can directly lead to changes in decisions and action at the bedside using clinical decision support (CDS).

MATERIALS AND METHODS

We employed an iterative framework development approach that combined data-driven modeling and simulation testing with subject matter experts (SMEs) as part of the CONCERN (Communicating Narrative Concerns Entered by RNs) study (1R01NR016941-01), which includes 2 large academic medical centers in the northeastern United States. Our approach included iterative cycles of model refinement, engagement with SMEs, and triangulation of themes (see Figure 1). Institutional review board approval was obtained from both study sites.

Figure 1.

Figure 1.

Approach to iterative conceptual framework development leveraging thematic analyses of processes and findings from data driven modeling and simulation testing for triangulation of themes. Thematic analysis of the iterative processes and contextual information that informed development of the Communicating Narrative Concerns Entered by Registered Nurses (CONCERN) model were triangulated with thematic analysis of clinical subject matter expert perceptions of the CONCERN model during simulation testing. These triangulated findings were used to define a conceptual framework for phenotyping clinician behaviors to detect and leverage signals of clinician expertise for prediction of patient trajectories.

Data driven modeling use case: The CONCERN model

The CONCERN predictive model uses EHR data to identify signals of nurses’ concern that a hospitalized patient may be entering a risky state. The model is being implemented as an EWS in a user-centered designed CDS tool to foster communication and increase situational awareness among the interprofessional care team, particularly physicians and nurses. Nurses continuously monitor patients and document concerns throughout the EHR, such as in flowsheet comments; yet, this documentation is rarely viewed,9–11 despite evidence that a nurse’s concern is a valid reason to intervene without waiting for physiological changes.2,12,13 Unfortunately, if a nurse is concerned about a patient, there are a lack of visual functionality in the EHR that stand out in a similar way to visual cues used in a paper-based chart, such as “putting a note or colored paper in the front of the chart” or “circling concerning values in red” so that the chart or value can be readily identified among all the other patient charts or values. Our approach is to identify signals of equivalent types of behaviors in the EHR, validate which signals are predictive of patient trajectories, and surface those signals to the care team. A clinical trial will prospectively evaluate if the CONCERN CDS increases situational awareness among the care team and impacts outcomes (in-hospital mortality, length of stay, cardiac arrest, unanticipated transfer to the intensive care unit [ICU], and 30-day hospital readmission rates).

Other EWS use different approaches to identify at risk patients. The Modified Early Warning Score (MEWS),14 National Early Warning Score (NEWS),15–18 and the Rothman Index19,20 use changes in physiological data, such as abnormal vital signs, known to be late indicators of deterioration. The Rothman Index predicts death within 24 hours but not ICU transfer21—a clinically important marker of deterioration and an outcome in our study. The “worry factor”22 and the DENWIS (Dutch Early Nurse Worry Indicator Score)23,24 in their current state add to documentation burden. The CONCERN CDS does not require any new documentation or workflows by the nurse because our modeling approach uses behavior patterns to predict patient outcomes.

We achieve this by implicitly measuring expertise (nurse’s concern about patient status) and by explicitly measuring behavior-phenotypic differences between documentation patterns for deteriorating vs nondeteriorating patients.5 We incorporate information such as documentation patterns beyond standards of care and hospital requirements as environmental and system modifiers. Choosing to document beyond minimum requirements indicates a nurse likely determined an observed patient state was clinically significant enough to assess and record.5,25,26 This approach enables modeling when assessments and interventions are activated independent of standards of care or documentation requirements to infer that the nurse’s decision making reflected an increased level of concern.

We used nurses’ EHR interaction data, which are high volume, curated using the standardized ontology that we developed within the i2b2 (Informatics for Integrating Biology and the Bedside; https://www.i2b2.org) system of all inpatient encounter information, including patient demographics, flowsheets, notes, diagnoses, significant events, laboratory tests, orders, medication administrations, clinician information, and admission, discharge, and transfer information.27,28 We previously published about this sharable ontology, which leverages the Unified Medical Language System, the Clinical Care Classification System, and nurse SME validation.29,30 This ontology is the conceptual data model for the CONCERN study’s databases of hospitalizations from 2015 to 2018 (site A = 123 981 patients and 697 928 178 data points; site B = 188 512 patients and 191 904 328 data points). The 2 sites each used a different EHR, accounting for the wide range of data points generated per patient.

We targeted general medical-surgical acute care units, intermediate care (ie, stepdown units), and intensive care units (ie, critical care) to analyze data for patients that spent more than 24 hours on a study unit, excluding patients younger than 18 years of age, receiving hospice and palliative care, or with a hospital stay exceeding 60 days.14

We capture metadata associated with each structured clinical data point, such as unique patient and hospital visit identifiers and timestamps. Our analyses focused on the number of entries (not the value, but rather the frequencies of data entry such as vital signs, notes, or comments). Model refinement used longitudinal logistic and hazard regressions with time-varying covariates.31–33 Our data-driven analyses includes natural language processing of unstructured, narrative data from nursing notes and free-text comments.30,34 Methods for data cleaning and validation are previously reported,5,34,35 as are the development and performance of CONCERNv1.0 model.5 We report on the CONCERNv2.0 model in this study. The output of the CONCERNv2.0 model was translated into a categorical score of red (high risk), yellow (moderate risk), and green (low risk) based on user specified feedback, reduction of score variability, and corrections for demographic biases. A description of our score development will be published separately.

We measured the performance of the CONCERNv2.0 model by using the Cox time-varying proportional hazards model with our composite outcome as the endpoint, defined as the first occurrence of in-hospital mortality, cardiac arrest, unanticipated transfer to the intensive care unit, rapid response, or sepsis. We selected the well-published MEWS and NEWS as comparators because they are based solely on a patient’s physiological data and are the most widely used and validated EWS models.14–18 Specifically, we used as a feature set each patient’s hourly CONCERN score (low = green, moderate = yellow, high = red), MEWS score (low = 0-2, moderate = 3-4, high = 5+), and NEWS score (low = 0-3, moderate = 4-6, high = 7+) to quantify whether the presence of an increasing score predicted our composite outcome. The thresholds used to determine low, moderate, and high risk for MEWS and NEWS are described elsewhere.36,37 Further, as part of our evaluation, we constructed a model using the CONCERN, MEWS, and NEWS scores, with low risk scores as the control. We also computed a “lead time” statistic which describes how well an observed risk level differentiates between events and nonevents at various hours in the future. The statistic is based on the likelihood ratio of the 2 probability measures induced by events and nonevents and is a natural choice for sequential hypothesis testing.

Thematic analysis from development of the CONCERN model

Thematic analysis from our data-driven modeling was performed by Sarah Collins Rossetti by analyzing the processes and contextual information that informed feature selection and grouping of the types of features that emerged. Group sessions were conducted during team meetings to achieve consensus related to the identified themes.

Thematic analysis from simulation testing of the CONCERN model

We conducted simulation studies with nurses and physicians to understand their perspective of the CONCERN model and CDS tool using the Situation Awareness Global Assessment Technique method.38 CDS prototypes5 were presented to 17 nurses and 6 physicians as part of 6 simulated patient cases with varied risk levels to evaluate impact on situational awareness and provide user-centered design feedback. Data were analyzed by Sarah Collins Rossetti, Min Jeoung Kang, and GL for themes related to nurses’ and physicians’ perspectives of the model. Group sessions were conducted during team meetings to achieve consensus related to the identified themes.

Triangulation of themes for iterative conceptual framework development

Themes identified from our data-driven modeling and simulation testing were triangulated to define a conceptual framework for phenotyping clinician behaviors to detect and leverage signals of clinician expertise for prediction of patient trajectories (see Figure 1). Specifically, the themes were compared and evaluated to understand how they align with each other and were integrated into a conceptual framework by Sarah Collins Rossetti with group consensus sessions for refinements conducted during team meetings.

RESULTS

Data driven modeling use case: The CONCERN model

The CONCERNv2.0 model included 15 measurement features, which were expanded to 30 measurement features based on whether they were measured at common or uncommon times, 21 nursing note content features, and 4 temporal features (Table 1). The integration of additional data types (eg, demographics, location in hospital, time features) were modifiers that enriched predictive power.

Table 1.

CONCERN model features

Features
Measurements and Temporal Clustereda Note Content Clustereda
Heart rate measurementb Yes Abdominal painb No
Respiratory rate measurementb Yes Abnormal heart rhythmb No
Blood pressure measurementb Yes Abnormal mental stateb No
Temperature measurementb Yes Abnormal rate, rhythm, depth and effort of respirationsb No
SpO2 measurementb Yes Abnormal temperatureb No
All 5 vital measurements taken at same timeb Yes Back painb No
Only 1 vital measurement takenb Yes Chest painb No
Heart rate commentb Yes Communication problemb No
Respiratory rate commentb Yes Diagnosis related with infectionb No
Blood pressure commentb Yes Deficit of circulationb No
Temperature commentb Yes Fall riskb No
SpO2 commentb Yes Fluid volume alterationb No
PRN medication administeredb Yes General concernb No
Scheduled medication withheldb Yes Headacheb No
Nursing note writtenb Yes Improper renal functionb No
Month No Medication related with infectionb No
Day of week No Monitoringb No
Hour No Mood disorderb No
Patient Hour No Musculoskeletal painb No
Pain levelb No
Violence gestureb No

CONCERN: Communicating Narrative Concerns Entered by Registered Nurses; PRN: as needed; SpO2: oxygen saturation.

a

Whether the feature is clustered into times it is commonly measured or uncommonly measured.

b

Feature is aggregated over the past 12 hours.

Our model evaluation, using the Cox time-varying proportional hazards model, showed that, with CONCERN low risk as the control, the hazard ratio for CONCERN moderate risk is 3.42 (95% confidence interval, 3.28-3.57; P < .0001), meaning that a moderate risk implies 2.42 greater hazard of cardiac arrest above low risk. Similarly, the hazard ratio for CONCERN high risk is 13.32 (95% confidence interval, 11.02-16.1; P < .0001), meaning that a high risk implies 12.32 greater hazard of cardiac arrest above low risk. In our model using CONCERN, MEWS, and NEWS low risk scores as the control, the CONCERN high risk score implied greater hazard than both the MEWS and NEWS high risk scores (6.69 vs 1.74 and 1.59). The CONCERN moderate risk score also implied greater hazard than both the MEWS and the NEWS moderate risk scores (1.88 vs 1.08 and 1.09) (see Figure 2). All 4 comparisons (CONCERN high vs NEWS high; CONCERN high vs MEWS high; CONCERN moderate vs NEWS moderate; CONCERN moderate vs MEWS moderate) are significant at the P < .0001 level. We also demonstrated that the CONCERNv2.0 model had a better “lead time” compared with MEWS and NEWS in the sense that CONCERN was able to better differentiate between a patient’s probability of an event earlier (see Figure 3). For instance, the likelihood of an event occurring 48 hours after observing a CONCERN high risk score is comparable to the likelihood of an event occurring 6 hours after observing a high risk MEWS or NEWS score—a difference of 42 hours.

Figure 2.

Figure 2.

Time-varying survival regression. Forest plot of the covariates used in Cox time-varying proportional hazards model and associated statistics. HR: hazard ratio; MEWS: Modified Early Warning Score; NEWS: National Early Warning Score.

Figure 3.

Figure 3.

Comparison of log likelihood ratios at various hours before event. The likelihood ratio, defined as L(x,h) = P(x | patient has an event h hours in the future) / P(x | patient does not have an event h hours in the future). For example, L(‘CONCERN score = yellow’, 6) quantifies how well the Communicating Narrative Concerns Entered by Registered Nurses (CONCERN) algorithm separate the probability measures induced by whether the patient has an event 6 hours in the future after observing a “yellow” score. Larger values represent more weight given to the numerator vs the denominator, while smaller values represent more weight given to the denominator. MEWS: Modified Early Warning Score; NEWS: National Early Warning Score.

Themes from development of the CONCERN model

We identified 3 themes described subsequently based on our findings from our approach to developing the CONCERN model.

Theme 1: Predictive signals may be derived from clinical behaviors

The first theme was that user interaction data contain patterns of clinician behaviors that can be interpreted as proxies of an individual’s decisions, knowledge, and expertise and used in predictive models and visualizations for associations with outcomes. This theme is supported in our CONCERNv2.0 model features, such as “documentation at uncommon times” and “withheld scheduled medication.” These types of actions reflect nursing decisions that are consistent with increased clinical concern and can predict patients’ states. The predictive signals are derived from analysis of complex patterns of clinician behaviors (ie, behavioral phenotypes),39 which are distilled and displayed to clinicians in the CONCERN CDS. These novel patterns identified through machine learning are in contrast to MEWS and NEWS that use physiological data thresholds (eg, abnormal vital signs) that are already applied by clinicians as part of routine clinical decision making.

Theme 2: Clinical domain expertise are essential for interpretations

The second theme is that clinical domain expertise is essential for accurate and comprehensive data integrations and interpretations. For example, in investigating our findings, we identified that the frequency of documenting respiratory rate and oxygen saturation (SpO2) comments consistently had a strong signal. We previously highlighted how understanding that respiratory rate is the only vital sign manually measured in the hospital setting for nonventilated patients and provides important contextual information when interpreting the frequency with which it is documented.5 We also previously reported how nurses explained that SpO2 comments are typically used to highlight to the physician when the nurse continues to titrate the supplemental oxygen in order to keep the SpO2 reading normal for the patient—an indicator of deteriorating status that can be lost among EHR data points.11

Theme 3: Temporal focus drives clinical utility

The third theme is that time-based features, and other types of modifiers, drive utility of healthcare process modeling in the clinical setting. Most EWS models use vital sign abnormalities to predict deterioration but have shown limited impact on clinical outcomes because vital signs are a late indicator of deterioration.1,40,41 The CONCERN CDS tool aims to surface early changes in a patient state for the care team’s situational awareness of patient risk. In order to do so, we identify when nurses were concerned that a patient may be entering a risky state—as opposed to when physiological values indicate they are already in a risky state—by modeling with a simulated real-time prospective analytical approach that accounts for the temporal signal of each data point.

The themes described previously are aligned with an approach to feature selection based on user interaction with the machinery that supports and captures the timely delivery of care and clinical interventions (eg, EHR data/metadata, smart pumps, communication devices) and is informed by clinical domain knowledge.

Themes from simulation testing of the CONCERN model

We also identified 4 themes from the simulation study, each with 2 to 4 subthemes. The 4 themes were clinical decision making, paradigm shift, believability, and CIS interactions. Subthemes were coded with positive and negative perspectives as shown in Table 2.

Table 2.

Themes derived through CONCERN model simulation testing

Theme Subtheme
Clinical decision making (+) Use of model drives improved critical thinking
(+) Use of model drives improved patient prioritization
(+) Use of model drives improved team-based care and communication
(+) Features from the model synthesize the chart to decrease information overload
Paradigm shift (+) Model is an innovative method for recognizing increased patient risk above baseline
(-) Model does not use clinical data values
Believability (+) Model’s concerning surveillance patterns reflect clinical practice
(+) Model’s use of nursing documentation validates the value of nurses’ documentation efforts
CIS interactions (-) Missing and back-charted EHR data may limit predictions
(-) Model’s focus on CIS interactions may perpetuate an over-reliance on clinical systems and decreased patient interactions

CIS: clinical information systems; CONCERN: Communicating Narrative Concerns Entered by Registered Nurses; EHR: electronic health record.

(+) indicates a positively perceived theme by simulation testing participants and (-) indicates a negatively perceived theme by simulation testing participants.

Identified themes demonstrated the relevance of the CONCERN model from the perspective of practicing clinicians. The perceived benefits to clinical decision-making were positive, including improved prioritization, team-based communication, critical thinking, and EHR information overload. Clinicians recognized the important paradigm shift and potential in detecting features not readily observable from physiological measures, while acknowledging dissemination challenges for a prediction model in the clinical setting that does not rely on physiological values.42,43 The model was believable by clinicians, with several noting that it reflected their observed practice patterns. However, some noted that “back-charting” (ie, data entered retrospectively in-batch) may limit real-time CDS interventions (although back-charting can be computationally detected). Some cautioned that a system-derived model may further decrease face-to-face patient time; yet, others recognized the utility in driving more efficient patient prioritization through rank ordering patients by risk level.

Triangulated themes and HPM-ExpertSignals conceptual framework development

Figure 4 conveys our HPM-ExpertSignals Conceptual Framework, which leverages an adaptation of Donabedian’s structure-process-outcome framework, 44 and is focused on information that can be mined from clinical data structures, generated by clinician processes, and driven by knowledge-based behaviors. The developed framework includes a 3-step modeling technique to phenotype clinician behaviors as proxies of clinician knowledge and expertise to inform predictive models: (1) identify features from user interaction with clinical systems that are patterns of clinical behaviors; (2) interpret patterns as proxies of an individual’s decisions, knowledge, and expertise; and (3) use patterns in predictive models for associations with outcomes. We found that when observing patterns of clinical activity and behavior, domain experts interpret those patterns in the context of clinical knowledge and expertise, deducing the sources of those behaviors. The framework conveys that by observing clinician activity pattern distributions for subgroups of patients, we can quantify and systemize the observed behavioral expertise such that it can be leveraged to predict patient trajectories. Based on our iterative data-driven modeling and simulation testing with SMEs, we identified 4 main challenges in measuring clinician expertise and 4 main challenges in analyzing CIS interactions Table 3). These challenges are surmountable when informed by appropriate domain expertise, data-driven methodologies, and awareness of the limitations for inferencing.

Figure 4.

Figure 4.

Healthcare Process Modeling Framework to Phenotype Clinician Behaviors for Exploiting the Signal Gain of Clinical Expertise (HPM-ExpertSignals). The framework is focused on information that can be mined from clinical data structures, is generated by clinician processes, and is driven by knowledge-based behaviors in order to identify features from user interaction with clinical systems, which are patterns of clinical behaviors and can be interpreted and used in predictions.

Table 3.

Main challenges in measuring clinician expertise and analyzing CIS interactions

Challenges in measuring clinician expertise
1. Expert judgments may derive from unconscious or unrecorded observations.
2. Experts are often unable to articulate guiding cues.45,46
3. Differentiating expert-driven actions from inexperienced actions is not always possible.
4. Expertise changes overtime and is context dependent.
Challenges in analyzing CIS interactions
1. Health professionals’ care processes include shared and divergent activities and reasoning, requiring appropriate domain knowledge to identify and interpret.
2. Factors that modify system interactions (eg, configurations, standards of care, policies) vary within and across institutions.
3. Individuals do not always behave rationally—noise and diversity exist among the consistency of clinical processes.
4. All possible actions and best practices are not known or captured.

CIS: clinical information systems.

Several factors modify CISs interactions (see Figure 4). At the individual level, clinician characteristics (eg, experience, patient load), patient characteristics (eg, acuity, code status), and physiological and disease processes may modify CIS interactions. Environmental and system modifiers also impact CIS interactions, such as system configurations (eg, hard stops, documentation by exception), standards of care, policy or regulatory requirements, and protocols, all of which may vary by setting (eg, academic medical center, rural hospital) or specialty. Some factors are challenging to explicitly incorporate, including local environmental factors, such as clinical standards adherence, variable clinical roles and responsibilities, and specialist availability. Traditional predictive modeling can detect some environmental and system modifiers through latent factors, but HPM-ExpertSignals encourage explicit modeling of difficult-to-detect modifiers using a top-down modeling approach.6,47,48 For example, our team has demonstrated the implicit identification of contextual factors and data-driven covariates for modeling laboratory data47 and survival models of chronic kidney disease48 for healthcare process biases and effects.

DISCUSSION

HPM-ExpertSignals moves beyond transactional data analytics to model clinical knowledge, decision making, and behavior to make predictions about patients whose risk is unexpectedly higher than it appears from physiological data alone, in contrast to patients whose risk state is obvious to clinicians based on abnormal data values. An evaluation of the CONCERN model, or any EWS model, based on clinical outcomes may underrepresent its impact because some “at-risk” patients are intervened on in the clinical setting and therefore do not experience a negative outcome. The CONCERN use case verifies our ability to phenotype nursing behaviors, associate behaviors to outcomes, and use those associations to make actionable predictions. This approach to health data modeling, broadly, can advance how we understand clinical observational skills and clinician-entered data, and can inform phenotyping studies.

“Surveillance” is characterized within the nursing process by the rapid and frequent cycle of assess, diagnose, plan, intervene, and evaluate, and includes the iterative relationship between decision making and care processes. In applying these concepts, the CONCERN model captured when (1) nurses change surveillance and interventions patterns (eg, PRN [as needed] medications administered, scheduled medications withheld) and documentation frequency and (2) nurses use notes and short narrative comments to contextualize and highlight specific flowsheet values. These concerning patterns include features consistent with implicit (eg, increased number of documentation entries during uncommon times) and explicit (eg, EHR comments to highlight inadequate physiological response) models of expertise. Used in context, a nurse may convey increased surveillance of SpO2 and supplemental oxygen by entering these data more frequently, and use comments associated with those structured data to annotate and highlight the patient’s inadequate SpO2 response to increased supplemental oxygen.25 In other words, documentation patterns contain predictive signals that can be detected before signals that are derived from the data values within documentation. Increased streaming of continuous monitoring data into EHRs will decrease manual data entries; however, nurses will likely increase annotations and highlighting of data. Annotation and highlighting of data are the types of EHR workflows HPM-ExpertSignals targets. The CONCERN model will be adapted to new EHR workflows as they emerge.

Our prior work confirmed nurses’ intentions to highlight deteriorating status to physicians.25 Patient deterioration is a complex patient safety problem with dependencies on effective team communication and intervention activation.2,24,49,50 In the CONCERN example, the nurse is already aware that the patient is at risk. The CONCERN CDS surfaces predictions of nurses’ concern so that they are visible to the care team.

Our model development demonstrated improvement when inferences on EHR data were informed by clinical knowledge, data generation processes, and clinical practice; omitting these inputs is usually negative. For example, medication administration data have patterns and signals associated with outcomes that are distinct from medication orders27,51; when interpretation is not informed by clinical knowledge, medication administration data might be assumed to be redundant with orders, potentially leading to inaccurate assumptions.52 In contrast to unsupervised approaches, which miss the signal gain of clinical expertise,52 HPM-ExpertSignals provides a framework for making clinical domain knowledge inferences explicit. For example, analyses of flowsheet53 and medication data51 must account for workarounds and the shift-based nature of nursing work to overcome data incompleteness and timestamp issues that may not be apparent to nondomain experts.11,25,51,53 Phenotyping studies in the absence of HPM-ExpertSignals may miss the greater context of clinical expertise, workflows, and decision making (eg, differential diagnoses, nursing surveillance) driving data capture and the characteristics of clinical data being used to define phenotypes.54–60

Features that are proxies of clinical judgment are, by definition, interpretable and may advance predictive power.2,61 Clinical observations and clinician-entered data, when analyzed in the context of standards of care and policy or regulatory requirements, provide insight into the decision-making processes that drive data capture.54–60 To accelerate phenotyping of clinician behaviors, we recommend that common data models incorporate additional data and metadata (eg, user, time of day, system configuration) of CIS interactions, including EHR flowsheets. Further, phenotypes of system interactions and clinician-entered data could expose problematic documentation burden patterns.

Future work includes application and validation of the HPM-ExpertSignals framework to use cases in other clinical domains, such as concern for patients’ self-management capability or identifying implicit and explicit biases linked to outcome disparities.62 Characterizing system interactions from a team perspective, and as potential intervention points for CDS or decreasing documentation burden, are also additional future research areas. Future methodological advancements include approaches to handle temporal data and broader incorporation of healthcare process variables, including physiology.

CONCLUSION

In this study our CONCERN model use case demonstrated that focusing only on clinical values may miss healthcare processes and interventions that are activated independently, and in some cases, long before physiological changes are apparent. Physiological values contain different information than documentation patterns. We developed the HPM-ExpertSignals framework as an approach to predict information that cannot be inferred from physiological values. Phenotyping clinician behaviors through characterizations of system interactions may advance how we characterize and utilize clinical observations and clinician-entered data for knowledge generation and modeling of decision making, and to embed clinician knowledge in predictions.

Health data volume increases every day; our HPM-ExpertSignals framework illustrates that the value trapped within these data extends far beyond explicitly recorded physiological values and treatments. Novel signals can be exploited to provide more information than is present in explicit data elements alone.5,26,42,63 CIS metadata are widely underutilized and our work demonstrates that these data contain unique, implicit behavioral markers of bedside observation and decisions about care intensity that cannot be acquired through other data collection methods. Understanding clinicians’ choice to interact with CIS, and how those choices impact data signals or statistical biases, is essential in predictive modeling.

FUNDING

This work was supported by the National Institute of Nursing Research: 1R01NR016941-01, Communicating Narrative Concerns Entered by RNs (CONCERN): Clinical Decision Support Communication for Risky Patient States and the National Institute of Nursing Research Reducing Health Disparities Through Informatics T32NR007969. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

AUTHOR CONTRIBUTIONS

SCR and KC conceived and designed the study. All authors contributed to collection and analysis of the data. SCR drafted the manuscript, and all authors contributed substantially to its revision. SCR and KC take responsibility for the paper as a whole.

COMPETING INTERESTS STATEMENT

The authors have no competing interests to declare.

DATA AVAILABILITY STATEMENT

The data underlying this article were accessed from Columbia University Irving Medical Center and Mass General Brigham. The derived data generated in this research will be shared on reasonable request to the corresponding author.

References

  • 1. Fu L, Schwartz J, Moy A, et al. Development and Validation of Early Warning Score System: A Systematic Literature Review. J Biomed Inform 2020; 105: 103410. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Odell M, Victor C, Oliver D.. Nurses’ role in detecting deterioration in ward patients: systematic literature review. J. Adv. Nurs 2009; 65: 1992–2006. [DOI] [PubMed] [Google Scholar]
  • 3. Fauber J, Shelton CR.. Modeling ‘Presentness’ of Electronic Health Record Data to Improve Patient State Estimation. Proc. Mach. Learn. Res 2018; 85: 1–13. [Google Scholar]
  • 4.American Nurses Association. The Nursing Process. nursingworld.org Available at: https://www.nursingworld.org/practice-policy/workforce/what-is-nursing/the-nursing-process/. (Accessed: 25th November 2020)
  • 5. Rossetti SC, Knaplund C, Albers D, et al. Leveraging clinical expertise as a feature—not an outcome—of predictive models: evaluation of an early warning system use case. In: proceeding of the American Medical Informatics Association Annual Fall Symposium; November 16–20, 2019: 323–32; Washington, DC. [PMC free article] [PubMed] [Google Scholar]
  • 6. Hripcsak G, Albers DJ.. Next-generation phenotyping of electronic health records. J Am Med Inform Assoc 2013; 20 (1): 117–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Hripcsak G, Albers DJ.. Correlating electronic health record concepts with healthcare process events. J Am Med Inform Assoc 2013; 20 (e2): e311–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Agniel D, Kohane IS, Weber GM.. Biases in electronic health record data due to processes within the healthcare system: retrospective observational study. BMJ 2018; 361: k1479. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Fukui S, Salyers MP, Matthias MS, et al. Predictors of shared decision making and level of agreement between consumers and providers in psychiatric care. Community Ment Heal J 2013; 50: 375–82. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Collins SA, Cato K, Albers D, et al. Relationship between nursing documentation and mortality. Am J Crit Care 2013; 22 (4): 306–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Collins SA, Vawdrey DK.. Reading between the lines’ of flowsheet data: Nurses’ optional documentation associated with cardiac arrest outcomes. Appl Nurs Res 2012; 25 (4): 251–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Cioffi J. Recognition of patients who require emergency assistance: a descriptive study. Hear Lung 2000; 29 (4): 262–8. [DOI] [PubMed] [Google Scholar]
  • 13. Jones L, King L, Wilson C.. A literature review: factors that impact on nurses’ effective use of the Medical Emergency Team (MET). J Clin Nurs 2009; 18 (24): 3379–90. [DOI] [PubMed] [Google Scholar]
  • 14. Subbe CP, Kruger M, Rutherford P, Gemmel L.. Validation of a modified early warning score in medical admissions. QJM 2001; 94 (10): 521–6. [DOI] [PubMed] [Google Scholar]
  • 15. Tirkkonen J, Karlsson S, Skrifvars MB.. National early warning score (NEWS) and the new alternative SpO2 scale during rapid response team reviews: a prospective observational study. Scand. J Trauma Resusc Emerg Med 2019; 27: 111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Pimentel MAF, Redfern OC, Gerry S, et al. A comparison of the ability of the National Early Warning Score and the National Early Warning Score 2 to identify patients at risk of in-hospital mortality: a multi-centre database study. Resuscitation 2019; 134: 147–56. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Chen J. Does the national early warning score 2 system serve its purpose? Resuscitation 2019; 134: 145–6. [DOI] [PubMed] [Google Scholar]
  • 18. Gerry S, Bonnici T, Birks J, et al. Early warning scores for detecting deterioration in adult hospital patients: systematic review and critical appraisal of methodology. BMJ 2020; 369: m1501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Finlay GD, Rothman MJ, Smith RA.. Measuring the modified early warning score and the Rothman index: advantages of utilizing the electronic medical record in an early warning system. J Hosp Med 2014; 9 (2): 116–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Rothman MJ, Solinger AB, Rothman SI, Finlay GD.. Clinical implications and validity of nursing assessments: a longitudinal measure of patient condition from analysis of the Electronic Medical Record. BMJ Open 2012; 2 (4): e000646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Linnen DT, Escobar GJ, Hu X, Scruth E, Liu V, Stephens C.. Statistical modeling and aggregate-weighted scoring systems in prediction of mortality and ICU transfer: a systematic review. J Hosp Med 2019; 14 (3): 161–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Romero-Brufau S, Gaines K, Nicolas CT, et al. The fifth vital sign? Nurse worry predicts inpatient deterioration within 24 hours. JAMIA Open 2019; 2 (4): 465–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Douw G, Huisman-de Waal G, van Zanten ARH, van der Hoeven JG, Schoonhoven L.. Nurses’ ‘worry’ as predictor of deteriorating surgical ward patients: A prospective cohort study of the Dutch-Early-Nurse-Worry-Indicator-Score. Int J Nurs Stud 2016; 59: 134–40. [DOI] [PubMed] [Google Scholar]
  • 24. Douw G, Schoonhoven L, Holwerda T, et al. Nurses’ worry or concern and early recognition of deteriorating patients on general wards in acute care hospitals: a systematic review. Crit Care 2015; 19: 230. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Collins SA, Fred MR, Wilcox L, Vawdrey DK.. Workarounds used by nurses to overcome design constraints of electronic health records. NI 2012 2012; 2012: 93–7. [PMC free article] [PubMed] [Google Scholar]
  • 26. Collins SA. Relationship between nursing documentation and patients’ mortality. Am J Crit Care 2013; 22: 306–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Klann JG, Collins S, Cato K, Waitman L, Westra B. Nursing documentation and the clinical research informatics pipeline. In: AMIA 2018 Informatics Summit; March 12, 2018; San Francisco, CA.
  • 28. Klann J, Kang MJ, Schnock K, et al. A data warehousing design for multi-domain inpatient nursing data in i2b2. In: Abstract at AMIA 2019 Clinical Informatics Conference; April 30–May 2, 2019; Atlanta, GA.
  • 29. Kang M-J, Dykes PC, Korach TZ, et al. Identifying and grading nurses’ concern concepts about patient deterioration using a standard nursing terminology. In: AMIA 2019 Informatics Summit; March 27, 2019; San Francisco, CA. [DOI] [PMC free article] [PubMed]
  • 30. Kang M-J, Dykes PC, Korach TZ, et al. Identifying nurses’ concern concepts about patient deterioration using a standard nursing terminology. Int J Med Inform 2020; 133: 104016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Kalbfleisch JD, Prentice RL.. The Statistical Analysis of Failure Time Data. Hoboken, NJ: Wiley; 2002. [Google Scholar]
  • 32. Cox DR. Regression models and life-tables. J R Stat Soc Series B Stat Methodol 1972; 34 (2): 187–220. [Google Scholar]
  • 33. Fisher LD, Lin DY.. Time-dependent covariates in the Cox proportional-hazards regression model. Annu Rev Public Health 1999; 20 (1): 145–57. [DOI] [PubMed] [Google Scholar]
  • 34. Korach ZT, Yang J, Rossetti SC, et al. Mining clinical phrases from nursing notes to discover risk factors of patient deterioration. Int J Med Inform 2020; 135: 104053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Korach ZT, Cato KD, Collins SA, et al. Unsupervised machine learning of topics documented by nurses about hospitalized patients prior to a rapid-response event. Appl Clin Inform 2019; 10 (05): 952–63. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Royal College of Physicians. National Early Warning Score (NEWS) 2: Standardising the Assessment of Acute-Illness Severity in the NHS. Updated Report of a Working Party. London, United Kingdom: Royal College of Physicians; 2017. [Google Scholar]
  • 37. Kim WY, Shin YJ, Lee JM, et al. Modified early warning score changes prior to cardiac arrest in general wards. PLoS One 2015; 10 (6): e0130523. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Endsley MR. Direct measurement of situation awareness: validity and use of SAGAT development of queries. In: Endsley MR, Garland DJ, eds. Situation Awareness Analysis and Measurement. Hillsdale, NJ: Erlbaum; 2000; 1–21. [Google Scholar]
  • 39. Levine ME, Albers DJ, Burgermaster M, Davidson PG, Smaldone AM, Mamykina L. Behavioral-clinical phenotyping with type 2 diabetes self-monitoring data. arXiv, doi: https://arxiv.org/abs/1802.08761, 23 Feb 2018, preprint: not peer reviewed.
  • 40. Le Lagadec MD, Dwyer T.. Scoping review: The use of early warning systems for the identification of in-hospital patients at risk of deterioration. Aust Crit Care 2017; 30 (4): 211–8. [DOI] [PubMed] [Google Scholar]
  • 41. Alam N, Hobbelink EL, van Tienhoven AJ, et al. The impact of the use of the Early Warning Score (EWS) on patient outcomes: a systematic review. Resuscitation 2014; 85 (5): 587–94. [DOI] [PubMed] [Google Scholar]
  • 42. Pivovarov R, Albers DJ, Sepulveda JL, Elhadad N.. Identifying and mitigating biases in EHR laboratory tests. J Biomed Inform 2014; 51: 24–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Pivovarov R, Albers DJ, Hripcsak G, Sepulveda JL, Elhadad N.. Temporal trends of hemoglobin A1c testing. J Am Med Inform Assoc 2014; 21 (6): 1038–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Donabedian A. The quality of care. How can it be assessed? JAMA 1988; 260 (12): 1743–8. [DOI] [PubMed] [Google Scholar]
  • 45. Benner P, Tanner C, Chesla C.. Expertise in Nursing Practice, Second Edition: Caring, Clinical Judgment, and Ethics. New York, NY: Springer; 2009. [Google Scholar]
  • 46. Kahneman D, Klein G.. Conditions for intuitive expertise: A failure to disagree. Am Psychol 2009; 64 (6): 515–26. [DOI] [PubMed] [Google Scholar]
  • 47. Albers DJ, Elhadad N, Claassen J, et al. Estimating summary statistics for electronic health record laboratory data for use in high-throughput phenotyping algorithms. J Biomed Inform 2018; 78: 87–101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Hagar Y, Albers D, Pivovarov R, et al. Survival analysis with electronic health record data: Experiments with chronic kidney disease. Stat Anal Data Min 2014; 7 (5): 385–403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Chan PS, Jain R, Nallmothu BK, Berg R, Sasson C.. Rapid response teams: a systematic review and meta-analysis. Arch Intern Med 2010; 170 (1): 18–26. [DOI] [PubMed] [Google Scholar]
  • 50. McGaughey J, O'Halloran P, Porter S, Blackwood B.. Early warning systems and rapid response to the deteriorating patient in hospital: A systematic realist review. J Adv Nurs 2017; 73 (12): 2877–91. [DOI] [PubMed] [Google Scholar]
  • 51. Cato K. Nursing medication administration data quality: assessment for reuse in predictive analytics. In: AMIA 2019 Clinical Informatics Conference; May 2, 2019; Atlanta, GA.
  • 52. Rajkomar A, Oren E, Chen K, et al. Scalable and accurate deep learning for electronic health records. arXiv, doi: https://arxiv.org/abs/1801.07860, 26 Jan 2001, preprint: not peer reviewed.
  • 53. Collins S, Couture B, Kang MJ, et al. Quantifying and visualizing nursing flowsheet documentation burden in acute and critical care. AMIA Annu Symp Proc 2018; 2018: 348–57; San Francisco, CA. [PMC free article] [PubMed] [Google Scholar]
  • 54. Thoroddsen A, Ehnfors M, Ehrenberg A.. Content and completeness of care plans after implementation of standardized nursing terminologies and computerized records. Comput Inform Nurs 2011; 29 (10): 599–607. [DOI] [PubMed] [Google Scholar]
  • 55. Ehrenberg A, Ehnfors M.. The accuracy of patient records in Swedish nursing homes: congruence of record content and nurses’ and patients’ descriptions. Scand J Caring Sci 2001; 15 (4): 303–10. [DOI] [PubMed] [Google Scholar]
  • 56. Törnvall E, Wilhelmsson S.. Nursing documentation for communicating and evaluating care. J Clin Nurs 2008; 17 (16): 2116–24. [DOI] [PubMed] [Google Scholar]
  • 57. Tornvall E, Wilhelmsson S, Wahren LK.. Electronic nursing documentation in primary health care. Scand J Caring Sci 2004; 18 (3): 310–7. [DOI] [PubMed] [Google Scholar]
  • 58. Keenan GM, Yakel E, Tschannen D, Mandeville M.. Documentation and the nurse care planning process. In: Hughes R, ed. Patient Safety and Quality: An Evidence-Based Handbook for Nurses. Rockville, NJ: Agency for Healthcare Research and Quality; 2008: 175–206. [PubMed] [Google Scholar]
  • 59. Taylor C. Assessing patients’ needs: does the same information guide expert and novice nurses? Int Nurs Rev 2002; 49 (1): 11–9. [DOI] [PubMed] [Google Scholar]
  • 60. Poissant L, Perfeira J, Tamblyn R, Kawasumi Y.. The impact of electronic health records on time efficiency of physicians and nurses: a systematic review. J Am Med Inform Assoc 2005; 12 (5): 505–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Shickel B, Tighe PJ, Bihorac A, Rashidi P.. Deep EHR: a survey of recent advances in deep learning techniques for electronic health record (EHR) analysis. IEEE J Biomed Health Inform 2018; 22: 1589–1604. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Singhal A, Tien Y, Hsia RY.. Racial-ethnic disparities in opioid prescriptions at emergency department visits for conditions commonly associated with prescription drug abuse. PLoS One 2016; 11 (8): e0159224. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Albers DJ, Hripcsak G.. A statistical dynamics approach to the study of human health data: resolving population scale diurnal variation in laboratory data. Phys. Lett. A 2010; 374 (9): 1159–64. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data underlying this article were accessed from Columbia University Irving Medical Center and Mass General Brigham. The derived data generated in this research will be shared on reasonable request to the corresponding author.


Articles from Journal of the American Medical Informatics Association : JAMIA are provided here courtesy of Oxford University Press

RESOURCES