Data collection of Japanese dialects and its influence into speech recognition

Reports the successful completion of a Japanese POLYPHONE project-the Voice Across Japan (VAJ) data collection project. The database has the following characteristics: (1) a large speaker database (8,866 speakers) through a telephone line, (2) gathering of the participants' personal information such as gender, age, place where they grew up, and so on, and (3) data segmented by phone or word boundaries. This paper describes several aspects of Japanese dialects and also reports the results of experiments. How much does dialect influence speech recognition? In our results, dialect influences the speech recognition rate by 2-4%. The results are useful information for building practical speech recognition systems as well as for data collection.

[1]  Joseph Picone,et al.  Voice across America: Toward robust speaker-independent speech recognition for telecommunications applications , 1991, Digit. Signal Process..

[2]  Tomoko Watanabe,et al.  An estimation of speaker sampling in Voice Across Japan database , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[3]  Shuichi Itahashi,et al.  A method of classification among Japanese dialects , 1993, EUROSPEECH.

[4]  Joseph Picone,et al.  The voice across Japan database-the Japanese language contribution to Polyphone , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[5]  Shuichi Itahashi,et al.  A discrimination method between Japanese dialects , 1992, ICSLP.

[6]  Takao Nakama,et al.  The data collection of voice across Japan (VAJ) project , 1994, ICSLP.