Audio classification utilizing a rule-based approach and the support vector machine classifier

The evaluation of two classification architectures utilizing the rule-based approach and the one-against-one support vector machine (OAO-SVM) is presented in this paper. The classification of the audio stream is carried out in two steps. At first, the rule-based speech/non-speech and music/environment sound discrimination is conducted. The set of adopted features, with a high efficiency in separation of speech and music signals, is implemented in order to find the best discriminator. Consequently, speech segments are classified into pure speech, speech with music and speech with env. sound using the OAO-SVM multi-class classification scheme. Experimental results show that the used classification architecture can decrease the classification error in comparison with OAO-SVM by using MFCC features only.

[1]  Shigeo Abe,et al.  Support Vector Machines for Pattern Classification (Advances in Pattern Recognition) , 2005 .

[2]  Wen Gao,et al.  A fast and robust speech/music discrimination approach , 2003, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint.

[3]  Nima Mesgarani,et al.  Discrimination of speech from nonspeech based on multiscale spectro-temporal Modulations , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[4]  Chin Kim On,et al.  Mel-frequency cepstral coefficient analysis in speech recognition , 2006, 2006 International Conference on Computing & Informatics.

[5]  Muhammad Saeid Muhammad Alnadabi,et al.  Speech/music discrimination : novel features in time domain , 2010 .

[6]  Lie Lu,et al.  Content analysis for audio classification and segmentation , 2002, IEEE Trans. Speech Audio Process..

[7]  Matej Grasic,et al.  Online Speech/Music Segmentation Based on the Variance Mean of Filter Bank Energy , 2009, EURASIP J. Adv. Signal Process..