Music fingerprinting based on Bhattacharyya distance for song and cover song recognition

People often have trouble recognizing a song, especially when it is performed by someone other than the original artist, that is, a cover song. An identification system can therefore help to recognize a song or to detect copyright violations. In this study, we recognize songs and cover songs using fingerprints built from features extracted with MPEG-7. The fingerprint of a song is represented by the Audio Signature Type, while the fingerprint of a cover song is represented by Audio Spectrum Flatness and Audio Spectrum Projection. We propose a sliding algorithm and k-Nearest Neighbor (k-NN) with Bhattacharyya distance for both song recognition and cover song recognition. The experimental results show that the proposed fingerprint technique achieves an accuracy of 100% for song recognition and 85.3% for cover song recognition.
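The abstract does not spell out the sliding algorithm or how the Bhattacharyya distance is applied to the MPEG-7 features, so the following is only a minimal sketch of one plausible reading. It assumes each fingerprint is an array of non-negative per-frame feature vectors (rows are frames), that each frame is normalized to sum to 1 before comparison, and that a query is slid frame by frame over every reference. The function names (bhattacharyya_distance, sliding_min_distance, knn_identify) and the database layout are hypothetical, introduced for illustration only.

```python
from collections import Counter

import numpy as np


def bhattacharyya_distance(p, q, eps=1e-12):
    """Bhattacharyya distance between two non-negative feature vectors.

    Assumption: each vector is treated as a discrete distribution,
    so both are normalized to sum to 1 before comparison.
    """
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    p = p / (p.sum() + eps)
    q = q / (q.sum() + eps)
    bc = np.sum(np.sqrt(p * q))  # Bhattacharyya coefficient
    # Clip to avoid log(0) and tiny negative distances from rounding.
    return float(-np.log(np.clip(bc, eps, 1.0)))


def sliding_min_distance(query, reference):
    """Slide the query over the reference one frame at a time and
    return the smallest mean per-frame distance over all offsets."""
    query = np.asarray(query, dtype=float)
    reference = np.asarray(reference, dtype=float)
    n = len(query)
    if n > len(reference):
        raise ValueError("query must not be longer than the reference")
    best = np.inf
    for start in range(len(reference) - n + 1):
        window = reference[start:start + n]
        dist = np.mean([bhattacharyya_distance(a, b)
                        for a, b in zip(query, window)])
        best = min(best, dist)
    return float(best)


def knn_identify(query, database, k=1):
    """Return the majority label among the k nearest references.

    database is a list of (label, reference_fingerprint) pairs
    (a hypothetical layout chosen for this sketch).
    """
    scored = sorted((sliding_min_distance(query, ref), label)
                    for label, ref in database)
    nearest = [label for _, label in scored[:k]]
    return Counter(nearest).most_common(1)[0][0]
```

For example, with database = [("song_a", fp_a), ("song_b", fp_b)] and a query cut from fp_a, knn_identify(query, database, k=1) should return "song_a", because the sliding search finds the offset at which the query matches its source almost exactly. Features that can take negative values would need to be shifted or otherwise mapped to non-negative values before this distance applies.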
