Fine-tuning SVM for Enhancing Speech/Music Classification

Support vector machines (SVMs) have been studied and applied extensively in pattern recognition for years. One interesting application of this technique is speech/music classification for a standardized codec such as the 3GPP2 selectable mode vocoder (SMV). In this paper, we propose a novel approach that improves the speech/music classification performance of support vector machines. Whereas conventional SVM optimization techniques are applied during the training phase, the proposed technique is applied in the classification phase. The proposed approach can therefore be developed and employed alongside conventional optimizations, and the two can be combined to further improve classification performance. We first analyze the impact of the kernel width parameter on the classifications made by support vector machines. From this analysis, we observe that the outputs of support vector machines can be fine-tuned through the kernel width parameter. To make the most of this capability, we identify a strong correlation among neighboring input frames and use this correlation as a guide for adjusting the kernel width parameter. Experimental results indicate that the proposed algorithm has the potential to improve the performance of support vector machines.
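To make the role of the kernel width concrete, the following Python sketch evaluates an SVM decision function at classification time with different widths, and then applies a simple neighbor-guided choice of width. It is only an illustration under stated assumptions, not the algorithm of the paper: a Gaussian (RBF) kernel is assumed, the support vectors, coefficients, and candidate widths are made-up toy values, and the rule in `classify_with_neighbor_hint` (prefer a width whose decision agrees with the previous frame) is a hypothetical stand-in for the correlation-based adjustment described above.

```python
import math


def rbf_decision(x, support_vectors, coeffs, bias, sigma):
    """Decision value f(x) = sum_i c_i * exp(-||x - s_i||^2 / (2 sigma^2)) + bias.

    Each c_i stands for alpha_i * y_i from a trained SVM. Only sigma is
    varied here, so no retraining is involved.
    """
    total = bias
    for s, c in zip(support_vectors, coeffs):
        sq_dist = sum((a - b) ** 2 for a, b in zip(x, s))
        total += c * math.exp(-sq_dist / (2.0 * sigma ** 2))
    return total


def label_of(value):
    """Map a decision value to a class label (+1 or -1)."""
    return 1 if value >= 0 else -1


def classify_with_neighbor_hint(x, prev_label, support_vectors, coeffs, bias, sigmas):
    """Hypothetical rule (an assumption, not the paper's method):
    try the candidate widths in order and return the first decision that
    agrees with the previous frame's label; otherwise fall back to the
    decision made with the first width.
    """
    default = label_of(rbf_decision(x, support_vectors, coeffs, bias, sigmas[0]))
    for sigma in sigmas:
        label = label_of(rbf_decision(x, support_vectors, coeffs, bias, sigma))
        if label == prev_label:
            return label, sigma
    return default, sigmas[0]


# Toy model (made-up values): one support vector per class.
SUPPORT_VECTORS = [(0.0, 0.0), (2.0, 0.0)]
COEFFS = [1.0, -1.5]
BIAS = 0.0
FRAME = (0.8, 0.0)  # feature vector of the frame being classified

if __name__ == "__main__":
    for sigma in (0.5, 2.0):
        value = rbf_decision(FRAME, SUPPORT_VECTORS, COEFFS, BIAS, sigma)
        print(f"sigma={sigma}: f(x)={value:+.4f}, label={label_of(value)}")
    print(classify_with_neighbor_hint(FRAME, -1, SUPPORT_VECTORS, COEFFS, BIAS, [0.5, 2.0]))
```

With these toy values, the same frame falls on opposite sides of the decision boundary for a narrow and a wide kernel, which is the kind of sensitivity that makes classification-time adjustment of the width possible.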
