Semantic DQ Design Principles |
Clinical Data Factors |
Expresses clinical concept for which data quality (DQ) must be measured
Considers the ways in which underlying workflow affects potential variables
Connects clinical concepts and data provenance
|
|
|
Analytic Uses |
|
|
|
DQ Principles |
Addresses the combination of established DQ theory with current needs
Develops roadmap to determine appropriate DQ method
Focuses the results of variable testing
|
Benchmarking hypertension metrics across institutions for face validity requires a different set of tools than attempting to use external sources to test the plausibility of blood pressure values
Common DQ principles include outlier detection, completeness of records, variable concordance, and plausible distribution of facts
|
Semantic DQ Practice |
Representation |
|
|
|
Assessment Lenses |
|
Common lenses to consider in clinical research are epidemiology, diagnoses, clinical care, and health care utilization.
|
|
DQ Methods |
|
Methods can range from simple (eg, proportions or frequency distributions) to complex (eg, PCA, clustering, or other machine learning)
Results can be categorical or can rely on visualization.
Thresholds for acceptable DQ can be pre‐determined or part of the applied methodology.
|