Hybrid baseform builder for phonetic languages

We present a novel technique of automatically building baseforms from the spelling for languages that are phonetic. For such languages, although rule-based techniques give fairly accurate baseforms, they have some ambiguities depending upon the language. To handle these, we apply a statistical method to improve the correctness of phonetic spelling builders. The rule-based baseforms are used as a training corpus for improving the system. We also present an alternative method of building decision trees over the phone context to modify the rule-based baseforms. The novel framework of generating the baseforms using both, spelling-to-sound rules and statistics, one after the other, requires very small amount of training data. Correction results and recognition results are presented by using the Hindi language baseform builder and by using the baseforms generated in a Hindi speech recognition task.

[1]  J. Makhoul,et al.  Automatic modeling for adding new words to a large-vocabulary continuous speech recognition system , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[2]  T. Holter,et al.  A comparison of lexicon-building methods for subword-based speech recognisers , 1996, Proceedings of Digital Processing Applications (TENCON '96).

[3]  L. Venkata Subramaniam,et al.  On deriving a phoneme model for a new language , 2000, INTERSPEECH.

[4]  Bhuvana Ramabhadran,et al.  Acoustics-only based automatic phonetic baseform generation , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[5]  Joseph Picone,et al.  An advanced system to generate pronunciations of proper nouns , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Ashish Verma,et al.  A large-vocabulary continuous speech recognition system for Hindi , 2004, IBM J. Res. Dev..

[7]  Michael Picheny,et al.  Automatic phonetic baseform determination , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[8]  L. Venkata Subramaniam,et al.  Adapting phonetic decision trees between languages for continuous speech recognition , 2000, INTERSPEECH.