Approaches for Improving Hindi to English Machine Translation System

Objectives: To provide approaches for effective Hindi-to-English Machine Translation (MT) that can be helpful in inexpensive and ease implementation of and MT systems. Methods/Statistical Analysis: Structure of the Hindi and English languages have been studied thoroughly. The possible steps towards the Natural languages have also been studied. The methods, rules, approaches, tools, resources etc. related to MT have been discussed in detail. Findings: MT is an idea for automatic translation of a language. India is the country with full of diversity in culture and languages. More than 20 regional languages are spoken along with several dialects. Hindi is a widely spoken language in all the states of country. A lot of literature, poetries and valuable texts are available in Hindi which gives opportunities to retranslate into English. However, new generation is learning English rapidly and also showing keenness to learn it in simplified lucid manner. Several efforts have been made in this direction. A large number of approaches and solutions exist for MT still there is a huge scope. The paper addresses the challenges of MT and solution efforts made in this direction. This motivates researchers to implement new Hindi-to-English Machine translation systems. Application/Improvements: Efficient, inexpensive and ease translation for available Hindi literature, poetries and other valuable texts into English. Children can easily learn the culture through the poetries and literatures hence the Machine Translation of these will bring wonderful impact.

[1]  Chandra Shekhar,et al.  MaTra: A Practical Approach to Fully-Automatic Indicative English- Hindi Machine Translation , .

[2]  P. J. Antony,et al.  Machine Translation Approaches and Survey for Indian Languages , 2013, ROCLING/IJCLCLP.

[3]  Pushpak Bhattacharyya,et al.  Natural Language Processing : A Perspective from Computation in Presence of Ambiguity , Resource Constraint and Multilinguality , 2012 .

[4]  Mahmud Hasan,et al.  A Novel Spatial Domain Lossless Image Compression Scheme , 2012 .

[5]  Sugata Sanyal,et al.  Machine Translation Systems in India , 2013, ArXiv.

[6]  Pushpak Bhattacharyya,et al.  Interlingua-based English–Hindi Machine Translation and Language Divergence , 2001, Machine Translation.

[7]  J. Novak The Theory Underlying Concept Maps and How To Construct Them , 2004 .

[8]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.

[9]  Ola Mohammad Ali,et al.  Word Sense Disambiguation in Machine Translation using Monolingual Corpus , 2008 .

[10]  Parul Rastogi,et al.  Critical analysis of WSD algorithms , 2009, ICAC3 '09.

[11]  Ю М Бегла PROBLEMS IN TRANSLATING POETRY , 2012 .

[12]  Latha R. Nair,et al.  Machine Translation Systems for Indian Languages , 2012 .

[13]  Hemant Darbari,et al.  Computer assisted translation system – an Indian perspective , 1999, MTSUMMIT.

[14]  Sivaji Bandyopadhyay Teaching MT - an Indian pespective , 2002, EAMT.

[15]  Pushpak Bhattacharyya,et al.  The IIT Bombay Hindi-English Translation System at WMT 2014 , 2014, WMT@ACL.

[16]  Rada Mihalcea,et al.  Word Sense Disambiguation , 2015, Encyclopedia of Machine Learning.

[17]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[18]  Xiaomei Zhang,et al.  Research on Chinese-Japanese Sign Language Translation System , 2010, 2010 Fifth International Conference on Frontier of Computer Science and Technology.

[19]  Alberto J. Cañas,et al.  Using WordNet for Word Sense Disambiguation to Support Concept Map Construction , 2003, SPIRE.

[20]  Vishal Gupta,et al.  A Survey of Word-sense Disambiguation Effective Techniques and Methods for Indian Languages , 2013 .

[21]  Pushpak Bhattacharyya,et al.  Some Issues in Automatic Evaluation of English-Hindi MT: More Blues for BLEU , 2006 .

[22]  Eneko Agirre,et al.  Word Sense Disambiguation: Algorithms and Applications , 2007 .

[23]  Sneha Tripathi,et al.  Approaches to machine translation , 2010 .

[24]  A. Jain,et al.  ANGLABHARTI: a multilingual machine aided translation project on translation from English to Indian languages , 1995, 1995 IEEE International Conference on Systems, Man and Cybernetics. Intelligent Systems for the 21st Century.

[25]  Akshar Bharati,et al.  Anusaaraka: Overcoming the Language Barrier in India , 2003, ArXiv.

[26]  S. Naskar,et al.  Use of Machine Translation in India: Current Status , 2005, MTSUMMIT.

[27]  D. W. Barron Machine Translation , 1968, Nature.

[28]  Ayan Das,et al.  Word Sense Disambiguation in Bengali applied to Bengali-Hindi Machine Translation , 2013 .

[29]  Sanjay K. Dwivedi,et al.  Machine Translation System in Indian Perspectives , 2010 .

[30]  G V Garje,et al.  Survey of Machine Translation Systems in India , 2013 .

[31]  Mirella Lapata,et al.  An Experimental Study of Graph Connectivity for Unsupervised Word Sense Disambiguation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  R. Sinha,et al.  Machine Translation of Bi-lingual Hindi-English (Hinglish) Text , 2005, MTSUMMIT.

[33]  Ajai Kumar Jain,et al.  AnglaHindi: an English to Hindi machine-aided translation system , 2003, MTSUMMIT.