Skip to main content
. 2019 Mar 19;21(3):e11732. doi: 10.2196/11732

Table 2.

Table comparing different types of clinical data on some points important to clinical decision support systems.

Clinical decision support issues Electronic health record free-text/unstructured data (eg, clinical notes) Registry/trial data (eg, case record forms case record forms and questionnaires) Structured data/electronic health record (eg, lab values and smoking status)
Context completeness Excellent: contextual information can be included. Poor: context is essentially absent as a priori interpretation is an integral part of recording data in case record forms. Depends on implementation. Context may be lost because of predetermined categorization.
Machine readability Poor: information is mostly useful for case-specific usage by humans. May require text mining/text retrieval to convert to a machine-readable format. Good: data are uniformly formatted and can be parsed by computers. Excellent: data can be parsed or directly used by computers.
Translatability (between institutions) Poor: free text contains jargon-specific, ambiguous abbreviations (eg, PCI: percutaneous coronary intervention/prophylactic cranial irradiation). Excellent: trial data are usually collected using a standardized protocol, allowing for interoperability between institutions. Good: lab values can be converted using reference values. Structured data, such as smoking and hypertensive status, can be reformatted for interoperability.
Noise resistance Very poor: These type of data are very sensitive to interobserver noise (eg, personal abbreviations, spelling mistakes, and personal focus in recording certain types of information). Excellent: data are recorded in a standardized way, designed to prevent noise. Good: data are often machine-derived or recorded in a standardized way. However, bias because of differences in information-recording habits among health care professionals may arise.
Availability for reuse/general applicability Excellent: these type of data are readily available, contain a lot of context (see Context completeness), and can thus be repurposed for a variety of applications. Limited: trials are designed and conducted for one specific research question. Excellent: these type of data are readily available and can thus be used for a plethora of purposes.
Design flexibility Excellent: study design can be revisited if unanticipated bias effects arise. In this sense, bias could be corrected by altering the data selection. Poor: study design is hit-or-miss. Bias cannot be corrected after the data recording process. Excellent: study design can be revisited if unanticipated bias effects arise. In this sense, bias could be corrected by altering the data selection.