Error Patterns and Analysis of Hindi Shallow Parser
暂无分享,去创建一个
Simplification is an integral part of Machine translation, however, it still remains the most complex part of the process. A sentence in Hindi can be written in multiple ways. They can be complex sentences or simple ones. These sentences then need to be translated into English language. For this, the complex sentences need to be converted into simple sentences before being translated. This paper concerns Sentence Simplification of Complex Hindi Sentences for the Hindi shallow parser that was developed by IIIT. Complexity of a sentence can occur due to presence of clauses, multiple verbs and usage of conjunctions. So splitting the sentence works if the above mentioned properties of the sentence can be eliminated. To automate the process, a simplification algorithm has been formed. The paper shall talk about errors and patterns that have been analyzed, so as to improve the accuracy of the simplified sentence, and preserving the sense of the original sentence. General Terms Computational Linguistics, Natural Language Processing, Sentence Simplification, Shallow Parser, Translation.
[1] Dipti Misra Sharma,et al. Exploring the effects of Sentence Simplification on Hindi to English Machine Translation System , 2014 .
[2] Walter Daelemans,et al. Automatic Sentence Simplification for Subtitling in Dutch and English , 2004, LREC.
[3] Sambhav Jain,et al. Exploring Verb Frames for Sentence Simplification in Hindi , 2013, IJCNLP.