INTENSIVE USE OF FACTORIAL CORRESPONDENCE ANALYSIS FOR TEXT MINING: APPLICATION WITH STATISTICAL EDUCATION PUBLICATIONS

Textual data are found in any survey or study and can be easily transform in frequency tables. Any method working on contingency tables can be used to process them. Besides, with the important amount of available textual data, we need to find convenient ways to process the data and to get invaluable information. It appears that the use of factorial correspondence analysis(CA) allows to get most of the information included in the data. CA produces a visual representation of the relationships between the row categories and the column categories in the same space. But there are several problems: the first one is the interpretation of the results. And even after the data processing, we still have a big amount of material and we need visualization tools to display it. In this paper, we present some methods to process the data and to get invaluable information. We also show how to use correspondence analysis in a sensible way and we give results of studies of publications dealing with statistical education.