Improving Indian Language Dependency Parsing by Combining Transition-based and Graph-based Parsers

our dependency parsing experiments on two Indian Languages, Telugu and Hindi. We first explore two most popular dependency parsers namely, Malt parser and MST parser. Considering pros of both these parsers, we develop a hybrid approach combining the output of these two parsers in an intuitive manner. For Hindi, we report our results on test data provided in the for gold standard track of Hindi Shared Task on Parsing at workshop on Machine Translation and parsing in Indian Languages, Coling 2012. Our system secured unlabeled attachment score of 95.2% and labelled attachment score 90.7%. For Telugu, we report our results on test data provided in the ICON 2010 Tools Contest on Indian Languages Dependency Parsing. Our system secured unlabeled attachment score of 92.0% and labelled attachment score 69.5%.

[1]  Joakim Nivre,et al.  Pseudo-Projective Dependency Parsing , 2005, ACL.

[2]  Dipti Misra Sharma,et al.  Dependency Annotation Scheme for Indian Languages , 2008, IJCNLP.

[3]  Fernando Pereira,et al.  Multilingual Dependency Analysis with a Two-Stage Discriminative Parser , 2006, CoNLL.

[4]  Daniel Zeman,et al.  Maximum Spanning Malt: Hiring World's Leading Dependency Parsers to Plant Indian Trees , 2009 .

[5]  Sivaji Bandyopadhyay,et al.  Dependency Parser for Bengali: the JU System at ICON 2009 , 2009 .

[6]  Sebastian Riedel,et al.  The CoNLL 2007 Shared Task on Dependency Parsing , 2007, EMNLP.

[7]  Akshar Bharati,et al.  Natural language processing : a Paninian perspective , 1996 .

[8]  Ruken Cakici,et al.  Multi-lingual Dependency Parsing with Incremental Integer Linear Programming , 2006, CoNLL.

[9]  Joakim Nivre,et al.  Algorithms for Deterministic Incremental Dependency Parsing , 2008, CL.

[10]  Dependency Parsers for Indian Languages , 2009 .

[11]  Samar Husain A two-stage constraint based dependency parser for free word order languages , 2011 .

[12]  Dipti Misra Sharma,et al.  Two stage constraint based hybrid approach to free word order language dependency parsing , 2009, IWPT.

[13]  Meher Vijay Yeleti,et al.  Constraint based Hindi dependency parsing , 2022 .

[14]  Maria Simi,et al.  Dependency Parsing of Indian Languages with DeSR. , 2010 .

[15]  Samar Husain,et al.  A Two Stage Constraint Based Hybrid Dependency Parser for Telugu , 2010 .

[16]  Yuji Matsumoto MaltParser: A language-independent system for data-driven dependency parsing , 2005 .

[17]  Prashanth Mannem,et al.  The ICON-2010 tools contest on Indian language dependency parsing , 2010 .

[18]  Sabine Buchholz,et al.  CoNLL-X Shared Task on Multilingual Dependency Parsing , 2006, CoNLL.

[19]  Joakim Nivre,et al.  An Efficient Algorithm for Projective Dependency Parsing , 2003, IWPT.

[20]  Joakim Nivre,et al.  Characterizing the Errors of Data-Driven Dependency Parsing Models , 2007, EMNLP.

[21]  Joakim Nivre,et al.  Parsing Indian Languages with MaltParser , 2009 .

[22]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.