Phoneme classification based on supervised manifold learning

This paper proposes an approach for phoneme classification based on supervised manifold learning. It has been shown that speech sounds exist on a low dimensional manifold nonlinearly embedded in high dimensional space and the manifold learning technique can get high phoneme classification accuracy. To improve the performance of phoneme classification, the proposed algorithm calculates the supervised geodesic distance using the minimum distance and the set distance of different class points to enhance the discriminability of low-dimensional embedded data. Experiments show that the proposed algorithm can significantly improve phoneme classification compared to the baseline features.

[1]  Kap Luk Chan,et al.  Face recognition based on discriminative manifold learning , 2004, ICPR 2004.

[2]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[3]  Aren Jansen,et al.  Intrinsic Fourier Analysis on the Manifold of Speech Sounds , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[4]  Mikhail Belkin,et al.  Semi-Supervised Learning on Riemannian Manifolds , 2004, Machine Learning.

[5]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[6]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[7]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[8]  Kap Luk Chan,et al.  An extended Isomap algorithm for learning multi-class manifold , 2004, Proceedings of 2004 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.04EX826).

[9]  Kuldip K. Paliwal,et al.  Feature extraction and dimensionality reduction algorithms and their applications in vowel recognition , 2003, Pattern Recognit..

[10]  Andrew Errity,et al.  Manifold Learning-Based Feature Transformation for Phone Classification , 2007, NOLISP.