Cross-Lingual Approaches: The Basque Case

Cross-lingual speech recognition could be relevant for Multilingual Automatic Speech Recognition (ASR) systems which work with under-resourced languages and appropriately equipped languages. In the Basque Country, the interest on Multilingual Automatic Speech Recognition systems comes from the fact that there are three official languages in use (Basque, Spanish, and French). Multilingual Basque speakers tend to mix words and sentences in the three languages in their discourse, and there are strong acoustic interactions among languages and among the Basque dialects. Moreover Basque has fewer resources and in order to decrease the negative impact that the lack of resources could generate the alternative surges in the form of cross-lingual alternatives, graphemes and data optimization methods.

[1]  Joseph Picone,et al.  Hybrid SVM/HMM architectures for speech recognition , 2000, INTERSPEECH.

[2]  Alberto Sanfeliu,et al.  Progress in Pattern Recognition, Speech and Image Analysis , 2003, Lecture Notes in Computer Science.

[3]  Mark J. F. Gales,et al.  Speech Recognition using SVMs , 2001, NIPS.

[4]  Laurent Besacier,et al.  Automatic Speech Recognition for Under-Resourced Languages: Application to Vietnamese Language , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  João Paulo da Silva Neto,et al.  The COST278 Pan-European Broadcast News Database , 2004, LREC.

[6]  J. Friedman Regularized Discriminant Analysis , 1989 .

[7]  Antonio Moreno-Sandoval,et al.  Acoustic-phonetic decoding of different types of spontaneous speech in Spanish , 2005, DiSS.

[8]  Karmele López de Ipiña,et al.  Development of multimodal resources for multilingual information retrieval in the basque context , 2007, INTERSPEECH.

[9]  Tanja Schultz,et al.  Multilingual and Crosslingual Speech Recognition , 1998 .

[10]  Neil D. Lawrence,et al.  Advances in Neural Information Processing Systems 14 , 2002 .

[11]  Avinash C. Kak,et al.  PCA versus LDA , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  David A. Landgrebe,et al.  Covariance Matrix Estimation and Classification With Limited Training Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Hermann Ney,et al.  Context-dependent acoustic modeling using graphemes for large vocabulary speech recognition , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[14]  Kazuhiro Kondo,et al.  An evaluation of cross-language adaptation for rapid HMM development in a new language , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Tanja Schultz,et al.  Multilingual Speech Processing , 2006 .

[16]  King-Sun Fu,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence Publication Information , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Laurent Besacier,et al.  Which units for acoustic and language modeling for Khmer automatic speech recognition? , 2008, SLTU.

[18]  David A. Landgrebe,et al.  Covariance estimation with limited training samples , 1999, IEEE Trans. Geosci. Remote. Sens..

[19]  Fernando Díaz-de-María,et al.  Support Vector Machines for continuous speech recognition , 2006, 2006 14th European Signal Processing Conference.

[20]  Eliathamby Ambikairajah,et al.  Robust language identification based on fused phonotactic information with MLKSFM pre-classifier , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[21]  Manuel Graña,et al.  Selection of Lexical Units for Continuous Speech Recognition of Basque , 2003, CIARP.