Automatic indexing of scientific papers Presentation and results of DEFT 2016 text mining challenge

This paper presents the 2016 edition of the DEFT text mining challenge. This edition adresses the keyword-based indexing of scientific papers with the aim of simulating a professional indexer. The corpus is composed of French bibliographic records from four domains : linguistics, information science, archaeology and chemisty. The results have been evaluated in terms of precision, recall and f-measure computed on stemmed texts against a reference manual indexation.