Skip to main content
. Author manuscript; available in PMC: 2018 Mar 20.
Published in final edited form as: IEEE Trans Biomed Eng. 2016 Oct 10;64(2):263–273. doi: 10.1109/TBME.2016.2573285

TABLE VIII.

Selected Methods for EHR Data Mining

Method Advantages Limitations
Logistic regression, cox
regression, local regression
(LOESS)* [133]
Simple to implement and
interpret; direct estimates of
relevant hazards for Cox
regression
Sensitive to outliers
Logistic regression with
LASSO regularization [134]
Reduces feature space Prone to overfitting
Hidden Markov models
[135]
Simultaneous detection,
segmentation, and
classification in a waveform
Sensitive to the design of
the Markov model being
trained
Conditional random fields
[136]
Supports temporal analysis;
resistant to differences in
class prevalence
Sensitive to regularization
and feature space size
Relational subgroup
discovery, episode rule
mining, windowing* [137]
Valid sequential techniques
for some clinical
applications
Tradeoffs between
simplicity, complexity,
and temporal resolution
Rule mining, Allen’s
interval algebra, directed
acyclic graph* [138]
Temporal mining/modeling
capabilities
Requires specific
experimental design
*

Highly impactful method with more than 50,000 relevant papers.