A Study on Utilization of Wikipedia Contents for Automatic Construction of Linguistic Resources

Abstract Various linguistic knowledge resources are required in order that machine can understand diverse variation in natural languages. This paper aims to devise an automatic construction method of linguistic resources by reflecting characteristics of online contents toward continuous expansion. Especially we focused to build NE(Named-Entity) dictionary because the applicability of NEs is very high in linguistic analysis processes. Based on the investigation on Korean Wikipedia, we suggested an efficient construction method of NE dictionary using the syntactic patterns and structural features such as metadatas. Key Words : Linguistic Resource Construction, Wikipedia, Named-Entity Dictionary, Knowledge Construction, Utilization of online contents * 본 연구는 2014년도 전북대학교 연구기반 조성비 지원에 의하여 연구되었음Received 17 March 2015, Revised 22 April 2015Accepted 20 May 2015Corresponding Author: Bo-Hyun Yun (Dept. of Computer Science Education, Mokwon University) Email: ybh@mokwon.ac.krⒸ The Society of Digital Policy & Management. All rights reserved. This is an open-access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is ISSN: 1738-1916 properly cited.