Extraction of the Useful Words from a Decisional Corpus. Contribution of Correspondence Analysis

In the framework of the JuriSent case study, carried out within the European NEMIS thematic network, we analyze how text mining techniques can improve the consultation of textual jurisprudence databases. We focus mainly on correspondence analysis (CA), but also provide some insights into similar visualization techniques, such as self-organizing maps (Kohonen maps), and review the potential impact of various natural language pre-processing techniques. CA is described in more detail, along with its use in each step of the analysis. A concrete example illustrates the value of the results obtained with CA for enhanced access to the studied jurisprudence corpus.