Support Vector Machines for Semantic Relation Extraction in Spanish Language

Relation Extraction (RE) is one of the most important topics in Natural Language Processing (NLP). Many tasks, such as semantic relation extraction, sentiment analysis, opinion mining, question answering, and text summarization, are supported by RE. This paper presents a semantic relation classifier that incorporates lexical features, named entity features, and syntactic structures. Relations between two entities are classified based on the Datasets for Generic Relation Extraction (reACE). We translate the reACE corpus into Spanish for all relation types and subtypes. The results show an F-score of 75.25%, a significant improvement of 11.5% over the baseline model. Finally, we discuss the results with respect to the model and the information they provide to support the forecasting process.
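
To make the feature description concrete, the following is a minimal sketch of how lexical and named entity features for one entity pair might be assembled before being passed to an SVM. It is not the feature set used in the paper: the function `extract_features`, the feature names, the entity type labels `PER` and `ORG`, and the example sentence are all assumptions chosen for illustration. Syntactic structures are omitted, and the first entity is assumed to precede the second.

```python
from typing import Dict, List, Tuple

Span = Tuple[int, int]  # (start, end) token offsets, end exclusive


def extract_features(
    tokens: List[str],
    e1: Span,
    e1_type: str,
    e2: Span,
    e2_type: str,
) -> Dict[str, float]:
    """Build a sparse feature dict for one entity pair (hypothetical scheme).

    Assumes e1 ends before e2 starts.
    """
    features: Dict[str, float] = {}

    # Lexical features: the words of each mention and the words between them.
    for tok in tokens[e1[0]:e1[1]]:
        features[f"e1_word={tok.lower()}"] = 1.0
    for tok in tokens[e2[0]:e2[1]]:
        features[f"e2_word={tok.lower()}"] = 1.0
    between = tokens[e1[1]:e2[0]]
    for tok in between:
        features[f"between_word={tok.lower()}"] = 1.0
    features["num_between"] = float(len(between))

    # Named entity features: each entity type and their ordered combination.
    features[f"e1_type={e1_type}"] = 1.0
    features[f"e2_type={e2_type}"] = 1.0
    features[f"type_pair={e1_type}-{e2_type}"] = 1.0
    return features


# Illustrative Spanish sentence (not taken from the reACE corpus).
tokens = "María trabaja en Telefónica".split()
feats = extract_features(tokens, (0, 1), "PER", (3, 4), "ORG")
```

A sparse dictionary of this kind can be converted into a numeric vector by any standard feature indexer and then used to train a multiclass SVM over the relation types and subtypes.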
