Abstract
We present an original system for locating and removing personally-identifying information in patient records. In this experiment, anonymization is seen as a particular case of knowledge extraction. We use natural language processing tools provided by the MEDTAG framework: a semantic lexicon specialized in medicine, and a toolkit for word-sense and morpho-syntactic tagging. The system finds 98-99% of all personally-identifying information.
Full text
PDFSelected References
These references are in PubMed. This may not be the complete list of references from this article.
- Ruch P., Wagner J., Bouillon P., Baud R. H., Rassinoux A. M., Scherrer J. R. MEDTAG: tag-like semantics for medical document indexing. Proc AMIA Symp. 1999:137–141. [PMC free article] [PubMed] [Google Scholar]