The TTS technology helps easy learning of the written material as it comes in audio form. TTS technique comprises of getting the particular text as an input then converts the graphemes into the phonemes and finally converting those Phonemes into actual speech. Graphemes are the actual text. Phonemes are the smallest meaningful sound element of the language. There are numerous approaches and algorithms available for TTS conversion. Also there are number of automated tools in the market for the TTS conversion of any written material. This work puts forth the various research papers and the novel approach regarding the TTS technology. The basic idea behind the proposed tool, “MathsSays”, is helping legally blind students, also the normal users; understand mathematics better via audio form. The written mathematics material is scanned as an image. Then that image is processed for the text detection. The detected text is then extracted from that image. The extracted text will be provided to the TTS conversion component and as a result, the audio file will be generated which speaks the text in that image. The challenge in this work is how to tackle the formulae which are not in proper English form. For this obstacle, the table will be maintained which has each formula and its respective text. Whenever the formula is encountered, the same will be searched in the table and the respective text will be extracted and spoken out loud. In this way, this system works for the betterment of understanding the mathematics.
[1]
Haiyan Li,et al.
Research on Mathematical Formulas Extraction from Chinese Document
,
2006,
2006 6th World Congress on Intelligent Control and Automation.
[2]
Dennis H. Klatt,et al.
The klattalk text-to-speech conversion system
,
1982,
ICASSP.
[3]
Md Monirul Islam,et al.
Bangla text to speech conversion: A syllabic unit selection approach
,
2013,
2013 International Conference on Informatics, Electronics and Vision (ICIEV).
[4]
Ralph Ewerth,et al.
A robust algorithm for text detection in images
,
2003,
3rd International Symposium on Image and Signal Processing and Analysis, 2003. ISPA 2003. Proceedings of the.
[5]
Nick Cercone,et al.
Better access to math for visually impaired
,
2009,
2009 IEEE Toronto International Conference Science and Technology for Humanity (TIC-STH).
[6]
Iain Murray,et al.
Mathspeak: An Audio Method for Presenting Mathematical Formulae to Blind Students
,
2012,
2012 5th International Conference on Human System Interactions.
[7]
Edward M. Riseman,et al.
TextFinder: An Automatic System to Detect and Recognize Text In Images
,
1999,
IEEE Trans. Pattern Anal. Mach. Intell..