Grammatical Relations Identification of Korean Parsed Texts Using Support Vector Machines

This study aims to improve the performance of identifying grammatical relations between a noun phrase and a verb phrase in Korean sentences. The key task is to determine the relation between the two constituents in terms of such grammatical relational categories as subject, object, complement, and adverbial. To tackle this problem, we propose to employ the Support Vector Machines (SVM) in determining the grammatical relations. Through an experiment with a tagged corpus for training SVMs, we found the proposed model to be more useful than both the Maximum Entropy model and the backed-off method.

[1]  Adwait Ratnaparkhi,et al.  Learning to Parse Natural Language with Maximum Entropy Models , 1999, Machine Learning.

[2]  Jungyun Seo,et al.  The Grammatical Function Analysis between Adnoun Clause and Noun Phrase in Korean , 2001, NLPRS.

[3]  Jungyun Seo,et al.  Analysis of the grammatical functions between adnoun and noun phrases in Korean using Support Vector Machines , 2003, Nat. Lang. Eng..

[4]  Timothy W. Finin,et al.  The KERNEL Text Understanding System , 1993, Artif. Intell..

[5]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[6]  Wojciech Skut,et al.  Tagging Grammatical Functions , 1997, EMNLP.

[7]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[8]  Eugene Charniak,et al.  Assigning Function Tags to Parsed Text , 2000, ANLP.

[9]  Slava M. Katz,et al.  Estimation of probabilities from sparse data for the language model component of a speech recognizer , 1987, IEEE Trans. Acoust. Speech Signal Process..

[10]  Gregory Grefenstette Short Query Linguistic Expansion Techniques: Palliating One-Word Queries by Providing Intermediate Structure to Text , 1997, SCIE.

[11]  Jungyun Seo,et al.  Analysis System of Speech Acts and Discourse Structures Using Maximum Entropy Model , 1999, ACL.

[12]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[13]  Michael Collins,et al.  A New Statistical Parser Based on Bigram Lexical Dependencies , 1996, ACL.

[14]  Ted Briscoe,et al.  High Precision Extraction of Grammatical Relations , 2001 .

[15]  Alexander S. Yeh,et al.  Using Existing Systems to Supplement Small Amounts of Annotated Grammatical Relations Training Data , 2000, ACL.

[16]  Ian H. Witten,et al.  The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression , 1991, IEEE Trans. Inf. Theory.

[17]  Eric Sven Ristad,et al.  Maximum Entropy Modeling Toolkit , 1996, ArXiv.

[18]  Vladimir Vapnik,et al.  The Nature of Statistical Learning , 1995 .