Support vector machines (SVMs) have been recognised as a promising technique in the field of pattern recognition, and one of the interesting applications of this technique is speech/music classification. In this study, the authors propose a novel approach to improve the SVM-based speech/music classification using the second-order conditional maximum a posteriori (CMAP). To do this, the authors first devise a method to estimate a posteriori probability to select between speech and music from the SVM output. This is achieved by employing the sigmoid function, obtained by optimised data training. A final speech/music classification is then acquired using the second-order CMAP with a maximum a posteriori probability depending not only on the current observation, but also on the classification results of two previous frames, incorporating substantial inter-frame correlations. While conventional SVM optimisation techniques are used during the training phase, the proposed technique can be inherently adopted in the classification phase. In this regard, the proposed approach can be developed and employed in parallel with other optimisation techniques. Experimental results show that the proposed algorithm yields better results than the speech/music classification rule in SVM.
[1]
Volume Assp,et al.
ACOUSTICS. SPEECH. AND SIGNAL PROCESSING
,
1983
.
[2]
Nam Soo Kim,et al.
Voice Activity Detection Based on Conditional MAP Criterion
,
2008,
IEEE Signal Processing Letters.
[3]
John Platt,et al.
Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods
,
1999
.
[4]
Joon-Hyuk Chang,et al.
Discriminative Weight Training for Support Vector Machine-Based Speech/Music Classification in 3GPP2 SMV Codec
,
2010,
IEICE Trans. Fundam. Electron. Commun. Comput. Sci..
[5]
Joseph Picone,et al.
Applications of support vector machines to speech recognition
,
2004,
IEEE Transactions on Signal Processing.
[6]
Joon-Hyuk Chang,et al.
Speech/Music Classification Enhancement for 3GPP2 SMV Codec Based on Support Vector Machine
,
2009,
IEICE Trans. Fundam. Electron. Commun. Comput. Sci..
[7]
Vladimir Vapnik,et al.
An overview of statistical learning theory
,
1999,
IEEE Trans. Neural Networks.