Machine Learning Approaches for Music Information Retrieval

The rapid growth of the Internet and the advancements of Internet technologies have made it possible for music listeners to have access to a large amount of on-line music data, including music sound signals, lyrics, biographies, and discographies. Music artists in the 21st century are promoted through various kinds of websites that are managed by themselves, by their fans, or by their record companies. Also, they are subjects of discussions in Internet newsgroups and bulletin boards. This raises the question of whether computer programs can enrich the experience of music listeners by enabling the listeners to have access to such a large volume of on-line music data. Multimedia conferences, e.g. ISMIR (International Conference on Music Information Retrieval) and WEDELMUSIC (Web Delivery of Music), have a focus on the development of computational techniques for analyzing, summarizing, indexing, and classifying music data. In [Huron, 2000] Huron points out that since the preeminent functions of music are social and psychological, the most useful characterizationwould be based on four types of information:

[1]  Nuria Oliver,et al.  PAPA: Physiology and Purpose-Aware Automatic Playlist Generation , 2006, ISMIR.

[2]  Roger Mitton,et al.  Spelling checkers, spelling correctors and the misspellings of poor spellers , 1987, Inf. Process. Manag..

[3]  Beth Logan,et al.  A Content-Based Music Similarity Function , 2001 .

[4]  K. Hevner Experimental studies of the elements of expression in music , 1936 .

[5]  Shingo Uchihashi,et al.  The beat spectrum: a new approach to rhythm analysis , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[6]  Tao Li,et al.  Content-based music similarity search and emotion detection , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Tao Li,et al.  Detecting emotion in music , 2003, ISMIR.

[8]  Yao-Chang Huang,et al.  An audio recommendation system based on audio signature description scheme in MPEG-7 Audio , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[9]  Charles J. Burkhart,et al.  The Social Psychology of Music , 1958 .

[10]  John Riedl,et al.  An algorithmic framework for performing collaborative filtering , 1999, SIGIR '99.

[11]  Glenn Fung,et al.  Multicategory Proximal Support Vector Machine Classifiers , 2005, Machine Learning.

[12]  Wei-Ying Ma,et al.  Learning an image manifold for retrieval , 2004, MULTIMEDIA '04.

[13]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[14]  Thomas S. Huang,et al.  Optimizing learning in image retrieval , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[15]  David W. Aha,et al.  Weighting Features , 1995, ICCBR.

[16]  Daniel P. W. Ellis,et al.  The Quest for Ground Truth in Musical Artist Similarity , 2002, ISMIR.

[17]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[18]  Chris H. Q. Ding,et al.  Orthogonal nonnegative matrix t-factorizations for clustering , 2006, KDD '06.

[19]  Sharon L. Oviatt,et al.  Multimodal Integration - A Statistical View , 1999, IEEE Trans. Multim..

[20]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[21]  Chris H. Q. Ding,et al.  A learning framework using Green's function and kernel regularization with application to recommender system , 2007, KDD '07.

[22]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[23]  Tao Li,et al.  Semisupervised learning from different information sources , 2005, Knowledge and Information Systems.

[24]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[25]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[26]  Haruhiro Katayose,et al.  Music Genre Classification of Audio Signals from Bass Part , 2008 .

[27]  Raymond J. Mooney,et al.  Content-boosted collaborative filtering for improved recommendations , 2002, AAAI/IAAI.

[28]  Jonathan Foote,et al.  Audio Retrieval by Rhythmic Similarity , 2002, ISMIR.

[29]  Tao Li,et al.  Toward intelligent music information retrieval , 2006, IEEE Transactions on Multimedia.

[30]  Joydeep Ghosh,et al.  Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions , 2002, J. Mach. Learn. Res..

[31]  Jill P. Mesirov,et al.  Consensus Clustering: A Resampling-Based Method for Class Discovery and Visualization of Gene Expression Microarray Data , 2003, Machine Learning.

[32]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[33]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[34]  Peter Knees,et al.  Combining audio-based similarity with web-based data to accelerate automatic music playlist generation , 2006, MIR '06.

[35]  Arbee L. P. Chen,et al.  A music recommendation system based on music data grouping and user interests , 2001, CIKM '01.

[36]  Tao Li,et al.  Music Recommendation Based on Acoustic Features and User Access Patterns , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[37]  William W. Cohen,et al.  Web-collaborative filtering: recommending music by crawling the Web , 2000, Comput. Networks.

[38]  David Huron Perceptual and Cognitive Applications in Music Information Retrieval , 2000, ISMIR.

[39]  David M. Pennock,et al.  Probabilistic Models for Unified Collaborative and Content-Based Recommendation in Sparse-Data Environments , 2001, UAI.

[40]  Cormac Herley,et al.  Inferring similarity between music objects with application to playlist generation , 2005, MIR '05.

[41]  Shenghuo Zhu,et al.  Integrating Features from Different Sources for Music Information Retrieval , 2006, Sixth International Conference on Data Mining (ICDM'06).

[42]  John C. Platt,et al.  Learning a Gaussian Process Prior for Automatically Generating Music Playlists , 2001, NIPS.

[43]  Tommi S. Jaakkola,et al.  Partially labeled classification with Markov random walks , 2001, NIPS.

[44]  Wei-Ying Ma,et al.  Scalable music recommendation by search , 2007, ACM Multimedia.

[45]  Masataka Goto,et al.  Hybrid Collaborative and Content-based Music Recommendation Using Probabilistic Model with Latent User Preferences , 2006, ISMIR.

[46]  Jung-Hyun Lee,et al.  Hybrid Collaborative Filtering and Content-Based Filtering for Improved Recommender System , 2004, International Conference on Computational Science.

[47]  Marc Leman,et al.  Prediction of Musical Affect Using a Combination of Acoustic Structural Cues , 2005 .

[48]  Alexander J. Smola,et al.  Kernels and Regularization on Graphs , 2003, COLT.

[49]  Steffen Pauws,et al.  Fast Generation of Optimal Music Playlists using Local Search , 2006, ISMIR.

[50]  Tao Li,et al.  Factors in automatic musical genre classification of audio signals , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[51]  Michael I. Jordan,et al.  Distance Metric Learning with Application to Clustering with Side-Information , 2002, NIPS.

[52]  Beth Logan,et al.  Music Recommendation from Song Sets , 2004, ISMIR.

[53]  Tao Li,et al.  A comparative study on content-based music genre classification , 2003, SIGIR.

[54]  Donghai Guan,et al.  A music recommender based on audio features , 2004, SIGIR '04.

[55]  Aristides Gionis,et al.  Clustering Aggregation , 2005, ICDE.

[56]  John D. Lafferty,et al.  Diffusion Kernels on Graphs and Other Discrete Input Spaces , 2002, ICML.

[57]  George Karypis,et al.  Empirical and Theoretical Comparisons of Selected Criterion Functions for Document Clustering , 2004, Machine Learning.