Automatic Removal of Identifying

The European MAPA (Multilingual Anonymisation for Public Admin-istrations) project aims at developing an open-source solution for automatic de-identification of medical and legal documents. We introduce here the context, partners and aims of the project, and report on preliminary results.

[1]  知秀 柴田 5分で分かる!? 有名論文ナナメ読み:Jacob Devlin et al. : BERT : Pre-training of Deep Bidirectional Transformers for Language Understanding , 2020 .

[2]  Christian Lovis,et al.  Use and Understanding of Anonymization and De-Identification in the Biomedical Literature: Scoping Review , 2019, Journal of medical Internet research.

[3]  E. Hyvönen,et al.  Anonymization Service for Finnish Case Law: Opening Data without Sacrificing Data Protection and Privacy of Citizens , 2018 .

[4]  Iryna Gurevych,et al.  The INCEpTION Platform: Machine-Assisted and Knowledge-Oriented Interactive Annotation , 2018, COLING.

[5]  S. Meystre,et al.  Automatic de-identification of textual documents in the electronic health record: a review of recent research , 2010, BMC medical research methodology.

[6]  Peter Szolovits,et al.  Evaluating the state-of-the-art in automatic de-identification. , 2007, Journal of the American Medical Informatics Association : JAMIA.

[7]  Erik F. Tjong Kim Sang,et al.  Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition , 2003, CoNLL.

[8]  Erik F. Tjong Kim Sang,et al.  Introduction to the CoNLL-2002 Shared Task: Language-Independent Named Entity Recognition , 2002, CoNLL.

[9]  R. Grishman,et al.  Design of the MUC-6 Evaluation , 1995, TIPSTER.

[10]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[11]  Satoshi Sekine,et al.  A survey of named entity recognition and classification , 2007 .

[12]  G. Lapalme,et al.  Anonymisation de décisions de justice , 2004, JEPTALNRECITAL.

[13]  Natalia Grabar,et al.  Building a Text Corpus for Representing the Variety of Medical Language , 2001, MedInfo.