Context completeness |
Excellent: contextual information can be included. |
Poor: context is essentially absent as a priori interpretation is an integral part of recording data in case record forms. |
Depends on implementation. Context may be lost because of predetermined categorization. |
Machine readability |
Poor: information is mostly useful for case-specific usage by humans. May require text mining/text retrieval to convert to a machine-readable format. |
Good: data are uniformly formatted and can be parsed by computers. |
Excellent: data can be parsed or directly used by computers. |
Translatability (between institutions) |
Poor: free text contains jargon-specific, ambiguous abbreviations (eg, PCI: percutaneous coronary intervention/prophylactic cranial irradiation). |
Excellent: trial data are usually collected using a standardized protocol, allowing for interoperability between institutions. |
Good: lab values can be converted using reference values. Structured data, such as smoking and hypertensive status, can be reformatted for interoperability. |
Noise resistance |
Very poor: These type of data are very sensitive to interobserver noise (eg, personal abbreviations, spelling mistakes, and personal focus in recording certain types of information). |
Excellent: data are recorded in a standardized way, designed to prevent noise. |
Good: data are often machine-derived or recorded in a standardized way. However, bias because of differences in information-recording habits among health care professionals may arise. |
Availability for reuse/general applicability |
Excellent: these type of data are readily available, contain a lot of context (see Context completeness), and can thus be repurposed for a variety of applications. |
Limited: trials are designed and conducted for one specific research question. |
Excellent: these type of data are readily available and can thus be used for a plethora of purposes. |
Design flexibility |
Excellent: study design can be revisited if unanticipated bias effects arise. In this sense, bias could be corrected by altering the data selection. |
Poor: study design is hit-or-miss. Bias cannot be corrected after the data recording process. |
Excellent: study design can be revisited if unanticipated bias effects arise. In this sense, bias could be corrected by altering the data selection. |