ANONIMYTEXT: ANONIMIZATION OF UNSTRUCTURED DOCUMENTS
Rebeca Perez-Lainez, Ana Iglesias, Cesar de Pablo-Sanchez
2009
Abstract
The anonymization of unstructured texts is a task of great importance in several text mining applications. Anonymizing medical records is necessary both to preserve the privacy of personal health information and to enable further data mining efforts. The ANONYMITEXT system described here is designed to de-identify sensitive data in unstructured documents. It has been applied to Spanish clinical notes to recognize sensitive concepts that would need to be removed if the notes were used beyond their original scope. The system combines several medical knowledge resources with semantic dictionaries induced from the clinical notes. The semi-automatic process has been evaluated on a subset of the clinical notes, covering the most frequent attributes.
Paper Citation
in Harvard Style
Perez-Lainez R., Iglesias A. and de Pablo-Sanchez C. (2009). ANONIMYTEXT: ANONIMIZATION OF UNSTRUCTURED DOCUMENTS. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009) ISBN 978-989-674-011-5, pages 284-287. DOI: 10.5220/0002297102840287
in Bibtex Style
@conference{kdir09,
author={Rebeca Perez-Lainez and Ana Iglesias and Cesar de Pablo-Sanchez},
title={ANONIMYTEXT: ANONIMIZATION OF UNSTRUCTURED DOCUMENTS},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009)},
year={2009},
pages={284-287},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002297102840287},
isbn={978-989-674-011-5},
}
in EndNote Style
TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009)
TI - ANONIMYTEXT: ANONIMIZATION OF UNSTRUCTURED DOCUMENTS
SN - 978-989-674-011-5
AU - Perez-Lainez R.
AU - Iglesias A.
AU - de Pablo-Sanchez C.
PY - 2009
SP - 284
EP - 287
DO - 10.5220/0002297102840287