Feature extraction of the “Tourism English Proficiency Test” using data mining
According to the 2018 White Paper on Tourism, 17.89 million Japanese people travelled abroad and 28.69 million foreigners came to Japan for sightseeing in 2017, so tourism can fairly be said to be booming. Knowledge of tourism has therefore become increasingly important, and so has the need to use English, which serves as a common world language. The “Tourism English Proficiency Test” was launched in 1989 to measure the English communication competence needed at tourism sites. In this study, English sentences from the “Tourism English Proficiency Test” were examined and compared, in terms of metrical linguistics, with other proficiency tests and with English textbooks for junior high and high school students. Specifically, the frequency characteristics of character and word appearance were investigated using a program written in C++, and these characteristics were approximated by an exponential function. Furthermore, the percentages of Japanese junior high school required vocabulary and American basic vocabulary were calculated to obtain the difficulty level, as well as the K-characteristic, of each material.
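The measurements described above can be illustrated with a minimal C++ sketch. It is not the authors' program. It rests on several assumptions: the K-characteristic is taken to be Yule's K, computed as 10^4 (Σ f_i² − N) / N² over word frequencies f_i and total word count N; the exponential approximation is taken to be y = c·exp(−b·r) fitted to the rank-frequency curve by least squares on ln y; and vocabulary coverage is counted per word token. All function names (`tokenize`, `countWords`, `yuleK`, `coveragePercent`, `fitExponential`) are hypothetical, and only word frequencies are handled, not character frequencies.

```cpp
#include <algorithm>
#include <cctype>
#include <cmath>
#include <cstddef>
#include <functional>
#include <map>
#include <set>
#include <string>
#include <vector>

// Hypothetical tokenizer: lowercase runs of ASCII letters; everything else separates words.
std::vector<std::string> tokenize(const std::string& text) {
    std::vector<std::string> words;
    std::string cur;
    for (unsigned char ch : text) {
        if (std::isalpha(ch)) {
            cur.push_back(static_cast<char>(std::tolower(ch)));
        } else if (!cur.empty()) {
            words.push_back(cur);
            cur.clear();
        }
    }
    if (!cur.empty()) words.push_back(cur);
    return words;
}

// Count how often each word occurs.
std::map<std::string, std::size_t> countWords(const std::vector<std::string>& words) {
    std::map<std::string, std::size_t> freq;
    for (const auto& w : words) ++freq[w];
    return freq;
}

// Yule's K (assumed meaning of "K-characteristic"): 1e4 * (sum f_i^2 - N) / N^2.
double yuleK(const std::map<std::string, std::size_t>& freq) {
    double n = 0.0, sumSq = 0.0;
    for (const auto& kv : freq) {
        const double f = static_cast<double>(kv.second);
        n += f;
        sumSq += f * f;
    }
    if (n == 0.0) return 0.0;
    return 1e4 * (sumSq - n) / (n * n);
}

// Percentage of word tokens found in a reference vocabulary (token-based by assumption).
double coveragePercent(const std::vector<std::string>& words,
                       const std::set<std::string>& vocab) {
    if (words.empty()) return 0.0;
    std::size_t hits = 0;
    for (const auto& w : words) {
        if (vocab.count(w) > 0) ++hits;
    }
    return 100.0 * static_cast<double>(hits) / static_cast<double>(words.size());
}

struct ExpFit {
    double c;  // scale of y = c * exp(-b * r)
    double b;  // decay rate
};

// Fit y = c * exp(-b * r) to frequency y versus rank r (1 = most frequent)
// by ordinary least squares on ln y.
ExpFit fitExponential(const std::map<std::string, std::size_t>& freq) {
    std::vector<double> f;
    for (const auto& kv : freq) f.push_back(static_cast<double>(kv.second));
    std::sort(f.begin(), f.end(), std::greater<double>());
    const double n = static_cast<double>(f.size());
    if (f.size() < 2) return ExpFit{f.empty() ? 0.0 : f[0], 0.0};
    double sx = 0.0, sy = 0.0, sxx = 0.0, sxy = 0.0;
    for (std::size_t i = 0; i < f.size(); ++i) {
        const double x = static_cast<double>(i + 1);
        const double y = std::log(f[i]);
        sx += x;
        sy += y;
        sxx += x * x;
        sxy += x * y;
    }
    const double slope = (n * sxy - sx * sy) / (n * sxx - sx * sx);
    const double intercept = (sy - slope * sx) / n;
    return ExpFit{std::exp(intercept), -slope};
}
```

In use, each material would be passed through `tokenize` and `countWords` once, and the resulting table handed to `yuleK` and `fitExponential`; `coveragePercent` would be called once per reference word list. A larger K indicates more repetition of the same words, that is, a less varied vocabulary.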