Chinese opera genre classification based on multi-feature fusion and extreme learning machine

Chinese traditional opera plays an important role in Chinese traditional culture, it reflects the customs and value tendency of different areas. Though researchers have already gained some achievements, studies on this field are scarce and the existing achievements still need to be improved. This paper proposes a system based on multi-feature fusion and extreme learning machine (ELM) to classify Chinese traditional opera genre. Inspired by music genre classification, each aria is split into multiple segments. 19 features are then extracted and fused to generate the fusion feature. Finally, we use ELM and majority voting methods to determine the genre of the whole aria. The research data are 800 arias of 8 typical genres collected from Internet. This system achieves a mean classification accuracy of 92% among 8 famous Chinese traditional opera genres. The experimental results demonstrated that multi-feature fusion improves classification accuracy of Chinese traditional opera genres. Feature fusion is more effective than decision fusion in dealing with this problem.

[1]  Yiqiang Chen,et al.  RECOGNIZING TRANSPORTATION MODE ON MOBILE PHONE USING PROBABILITY FUSION OF EXTREME LEARNING MACHINES , 2013 .

[2]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[3]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[4]  Kun-Ming Yu,et al.  Automatic Music Genre Classification using Modulation Spectral Contrast Feature , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[5]  Thomas Sikora,et al.  MPEG-7 Audio and Beyond: Audio Content Indexing and Retrieval , 2005 .

[6]  Ming-Hsuan Yang,et al.  Fast sparse representation with prototypes , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7]  Liang Bao-er A Literature Review on the Protection and Utilization of China's Intangible Cultural Heritage , 2008 .

[8]  Gregory H. Wakefield,et al.  Audio thumbnailing of popular music using chroma-based representations , 2005, IEEE Transactions on Multimedia.

[9]  Beth Logan,et al.  Mel Frequency Cepstral Coefficients for Music Modeling , 2000, ISMIR.

[10]  Les E. Atlas,et al.  Modulation-scale analysis for content identification , 2004, IEEE Transactions on Signal Processing.

[11]  Sukhendu Das,et al.  A Survey of Decision Fusion and Feature Fusion Strategies for Pattern Classification , 2010, IETE Technical Review.

[12]  Mohan S. Kankanhalli,et al.  Multimodal fusion for multimedia analysis: a survey , 2010, Multimedia Systems.

[13]  Douglas Eck,et al.  Aggregate features and ADABOOST for music classification , 2006, Machine Learning.

[14]  Jian Yang,et al.  Feature fusion: parallel strategy vs. serial strategy , 2003, Pattern Recognit..

[15]  Mohan S. Kankanhalli,et al.  Music Key Detection for Musical Audio , 2005, 11th International Multimedia Modelling Conference.

[16]  Yin-Fu Huang,et al.  Music genre classification based on local feature selection using a self-adaptive harmony search algorithm , 2014, Data Knowl. Eng..

[17]  Kun-Ming Yu,et al.  Automatic Music Genre Classification Based on Modulation Spectral Analysis of Spectral and Cepstral Features , 2009, IEEE Transactions on Multimedia.

[18]  Xinwei Wang,et al.  Structure analysis of Chinese Peking Opera , 2011, 2011 Seventh International Conference on Natural Computation.

[19]  Xie Wang,et al.  A study on Chinese traditional opera , 2008, 2008 International Conference on Machine Learning and Cybernetics.

[20]  Zong Woo Geem,et al.  A New Heuristic Optimization Algorithm: Harmony Search , 2001, Simul..

[21]  Chang D. Yoo,et al.  Music information retrieval using novel features and a weighted voting method , 2009, 2009 IEEE International Symposium on Industrial Electronics.

[22]  Thomas Sikora,et al.  Audio classification based on MPEG-7 spectral basis representations , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[23]  Kuldip K. Paliwal,et al.  Identity verification using speech and face information , 2004, Digit. Signal Process..

[24]  Benoît Maison,et al.  Joint processing of audio and visual information for multimedia indexing and human-computer interaction , 2000, RIAO.

[25]  Guang-Bin Huang,et al.  Extreme learning machine: a new learning scheme of feedforward neural networks , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[26]  Lie Lu,et al.  Music type classification by spectral contrast feature , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[27]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[28]  Johan Sundberg,et al.  Acoustical study of classical Peking opera singing , 2012 .

[29]  HU Zhao-liang THE CHARACTERISTICS OF CHINESE TRADITIONAL OPERAS GEOGRAPHY , 2000 .