Dependency Parsing in Bangla

This paper presents a grammar-driven approach to dependency parsing for Bangla (Bengali). The free word order of the language makes it very difficult to develop an accurate parser, so the Paninian grammatical model is used to handle it. The approach first simplifies complex and compound sentences, then parses the resulting simple sentences by satisfying the Karaka demands of the Demand Groups (Verb Groups). Finally, the parsed structures are rejoined with appropriate links and Karaka labels. The parser was trained on a treebank of 1000 annotated sentences and evaluated on 150 unannotated test sentences. The evaluation shows that the proposed approach achieves an accuracy of 90.32% for unlabeled attachment and 79.81% for labeled attachment.
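The two reported figures are attachment scores. The abstract does not describe the exact scoring procedure, so the following is only a minimal sketch under the assumption of the usual token-level definitions: unlabeled attachment counts tokens whose predicted head matches the gold head, and labeled attachment additionally requires the dependency (Karaka) label to match. The function name `attachment_scores`, the `(head, label)` representation, and the toy labels are illustrative assumptions, not taken from the paper.

```python
from typing import List, Tuple

# One arc per token: (index of the head token, dependency label).
# Head index 0 is assumed to denote the artificial root.
Arc = Tuple[int, str]


def attachment_scores(gold: List[List[Arc]],
                      pred: List[List[Arc]]) -> Tuple[float, float]:
    """Return (unlabeled, labeled) attachment accuracy as percentages."""
    if len(gold) != len(pred):
        raise ValueError("gold and predicted data differ in sentence count")

    total = 0            # tokens scored
    correct_heads = 0    # tokens with the correct head
    correct_labeled = 0  # tokens with the correct head and label

    for gold_sent, pred_sent in zip(gold, pred):
        if len(gold_sent) != len(pred_sent):
            raise ValueError("sentence lengths differ between gold and predicted")
        for (g_head, g_label), (p_head, p_label) in zip(gold_sent, pred_sent):
            total += 1
            if g_head == p_head:
                correct_heads += 1
                if g_label == p_label:
                    correct_labeled += 1

    if total == 0:
        raise ValueError("no tokens to score")
    return 100.0 * correct_heads / total, 100.0 * correct_labeled / total


# Toy three-token sentence with hypothetical labels: tokens 1 and 2 depend
# on the verb (token 3), which attaches to the root.
gold = [[(3, "k1"), (3, "k2"), (0, "root")]]
pred = [[(3, "k1"), (3, "k1"), (0, "root")]]  # one label is wrong

uas, las = attachment_scores(gold, pred)
print(f"unlabeled: {uas:.2f}%  labeled: {las:.2f}%")
```

In this toy case every head is correct but one label is not, so the labeled score falls below the unlabeled one, which is the same kind of gap as between the 90.32% and 79.81% figures above.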
