Relation extraction from wikipedia articles by entities clustering

Wikipedia is an encyclopedia based on wiki technology. It is multilingual high quality knowledge base. In this work a episode based extraction method are proposed to extract relations from Wikipedia articles. The entities are clustered and labeled. The relation extraction is benefited by the information redundancy provided by the clusters. A strict Wikipedia entities clustering algorithm based on the category system and first sentence of the article is approached. This work required less manual assist. And the relations are abundant. The results are comparable with other works [1, 2].

[1]  Zhongzhi Shi,et al.  Intelligent Science , 2009, RSFDGrC.

[2]  Fuji Ren,et al.  From Cloud Computing to Language Engineering, Affective Computing and Advanced Intelligence ∗ , 2010 .

[3]  Ian H. Witten,et al.  Mining Meaning from Wikipedia , 2008, Int. J. Hum. Comput. Stud..

[4]  Yau-Hwang Kuo,et al.  Automated ontology construction for unstructured text documents , 2007, Data & Knowledge Engineering.

[5]  Naoaki Okazaki,et al.  Unsupervised Relation Extraction by Mining Wikipedia Texts Using Information from the Web , 2009, ACL.

[6]  Daniel Jurafsky,et al.  Distant supervision for relation extraction without labeled data , 2009, ACL.

[7]  Gerhard Weikum,et al.  YAGO: A Large Ontology from Wikipedia and WordNet , 2008, J. Web Semant..

[8]  Joakim Nivre,et al.  Characterizing the Errors of Data-Driven Dependency Parsing Models , 2007, EMNLP.

[9]  Simone Paolo Ponzetto,et al.  Knowledge Derived From Wikipedia For Computing Semantic Relatedness , 2007, J. Artif. Intell. Res..

[10]  Daniel S. Weld,et al.  Information extraction from Wikipedia: moving down the long tail , 2008, KDD.

[11]  Satoshi Sekine,et al.  Preemptive Information Extraction using Unrestricted Relation Discovery , 2006, NAACL.

[12]  Heikki Mannila,et al.  Levelwise Search and Borders of Theories in Knowledge Discovery , 1997, Data Mining and Knowledge Discovery.

[13]  Heikki Mannila,et al.  Discovery of Frequent Episodes in Event Sequences , 1997, Data Mining and Knowledge Discovery.

[14]  Simone Paolo Ponzetto,et al.  Deriving a Large-Scale Taxonomy from Wikipedia , 2007, AAAI.