Table 6.
Performance without customized dictionary on gold standard corpus.
PHI Type | PHI sub-type | Count | # FNs | Per Category Recall | Per Category Precision |
Name | Patient Name | 54 | 1 | 0.981 | |
Patient Name Initial | 2 | 2 | 0.00 | ||
Relative/Proxy Name | 175 | 5 | 0.971 | ||
Clinician Name | 593 | 24 | 0.973 | 0.731 | |
Date | Date (not year) | 482 | 26 | 0.946 | |
Year | 46 | 11 | 0.761 | 0.712 | |
Location | 367 | 231 | 0.371 | 0.840 | |
Phone | 53 | 0 | 1.00 | 0.898 | |
Age over 89 | 4 | 1 | 0.750 | 0.600 | |
Undefined | 3 | 2 | 0.333 | N/A | |
Overall | 1779 | 295 | 0.834 | 0.725 |
(FNs are false negatives and N/A indicates not applicable.)