Machine Learning for Automatic Annotation of References in DH scholarly papers

Introduction In this paper, we present Bilbo, new software for automatic annotation of references that employs a machine learning approach, and the three CLEO’s OpenEdition corpora in the DH fields we annotated to train it. Our aim is to allow a reliable and language independent detection and annotation of common references in papers and to provide users with a dynamic reference database and crosslinking facilities including for scientific blogs and research notebooks.