Limited Data Capture |
-
■
Complex HSCT procedure with numerous post-transplant complications
-
■
Lack of continuous and real-time capture of various data streams involved
-
■
Mix of automated and manual data capture
|
|
Data Quality Issues |
|
|
High Dimensional Data |
|
|
Data Privacy Issues |
-
■
Large amount of sensitive patient data is required in building predictive models due to numerous factors involved
-
■
Combining multiple data streams from disperse data stores leads to potential data privacy issues
|
-
■
Developing appropriate privacy measures, such as data anonymization techniques to ensure complete privacy of patients’ data
-
■
Using technique such as “federated learning” [47] that trains a shared global model via a centralized aggregation server, while keeping sensitive data in local institutions of their origin
-
■
Enabling some form of privacy access control to different data streams that can ensure that only those with proper authorization can access a patient’s data streams
|
Obsolete Predictive Models |
|
|
Diverse Data Types |
|
|
Data Integration issues |
-
■
Most of the captured data are typically dispersed among various data stores (e.g., cloud storage, EHR, individually-managed databases)
|
|