论文信息 - Using the probability of readability to order Swedish texts

Using the probability of readability to order Swedish texts

In this study we present a new approach to rank readability in Swedish texts based on lexical, morpho-syntactic and syntactic analysis of text as well as machine learning. The basic premise and theory is presented as well as a small experiment testing the feasibility, but not actual performance, of the approach. The experiment shows that it is possible to implement a system based on the approach, however, the actual performance of such a system has not been evaluated as the necessary resources for such an evaluation does not yet exist for Swedish. The experiment also shows that a classifier based on the aforementioned linguistic analysis, on our limited test set, outperforms classifiers based on established metrics used to assess readability such as LIX, OVIX and Nominal Ratio.

Katarina Heimann Mühlenbock | Johan Falkenjack

[1] Zeng Zhi-qiang. Sequential minimal optimization algorithm based on parallel processing , 2009 .

[2] Kevyn Collins-Thompson,et al. A Language Modeling Approach to Predicting Reading Difficulty , 2004, NAACL.

[3] Chutima Boonthum-Denecke,et al. Natural Language Processing Tools , 2012 .

[4] Lijun Feng,et al. Cognitively Motivated Features for Readability Assessment , 2009, EACL.

[5] Mari Ostendorf,et al. Natural language processing tools for reading level assessment and text simplification for bilingual education , 2007 .

[6] J. Platt. Sequential Minimal Optimization : A Fast Algorithm for Training Support Vector Machines , 1998 .

[7] Ani Nenkova,et al. Revisiting Readability: A Unified Framework for Predicting Text Quality , 2008, EMNLP.

[8] Simonetta Montemagni,et al. READ–IT: Assessing Readability of Italian Texts with a View to Text Simplification , 2011, SLPAT.