Content-based Music Access: Combining Audio Features and Semantic Information for Music Search Engines

[1]  Beth Logan,et al.  Mel Frequency Cepstral Coefficients for Music Modeling , 2000, ISMIR.

[2]  George Tzanetakis,et al.  MARSYAS: a framework for audio analysis , 1999, Organised Sound.

[3]  Emilia Gómez Gutiérrez,et al.  Tonal description of music audio signals , 2006 .

[4]  Freddy Y. Y. Choi Advances in domain independent linear text segmentation , 2000, ANLP.

[5]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[6]  Daniel P. W. Ellis,et al.  Identifying `Cover Songs' with Chroma Features and Dynamic Programming Beat Tracking , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[7]  Malcolm Slaney,et al.  Semantic-audio retrieval , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  Ning Hu,et al.  The MUSART Testbed for Query-by-Humming Evaluation , 2004, Computer Music Journal.

[9]  Perry R. Cook,et al.  Content-Based Musical Similarity Computation using the Hierarchical Dirichlet Process , 2008, ISMIR.

[10]  Gert R. G. Lanckriet,et al.  Smarter than Genius? Human Evaluation of Music Recommender Systems , 2009, ISMIR.

[11]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[12]  Takuya Fujishima,et al.  Realtime Chord Recognition of Musical Sound: a System Using Common Lisp Music , 1999, ICMC.

[13]  Maurizio Omologo,et al.  Use of Hidden Markov Models and Factored Language Models for Automatic Chord Recognition , 2009, ISMIR.

[14]  J. Stephen Downie,et al.  The music information retrieval evaluation exchange (2005-2007): A window into music information retrieval research , 2008, Acoustical Science and Technology.

[15]  Masataka Goto,et al.  A chorus section detection method for musical audio signals and its application to a music listening station , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[16]  Tie-Yan Liu,et al.  Listwise approach to learning to rank: theory and algorithm , 2008, ICML '08.

[17]  Gregory H. Wakefield,et al.  Audio thumbnailing of popular music using chroma-based representations , 2005, IEEE Transactions on Multimedia.

[18]  Ton Kalker,et al.  A Highly Robust Audio Fingerprinting System , 2002, ISMIR.

[19]  Tim Pohle,et al.  Combining Features Reduces Hubness in Audio Similarity , 2010, ISMIR.

[20]  Daniel P. W. Ellis,et al.  Multiple-Instance Learning for Music Information Retrieval , 2008, ISMIR.

[21]  Riccardo Miotto,et al.  Improving Auto-tagging by Modeling Semantic Co-occurrences , 2010, ISMIR.

[22]  T. Minka Estimating a Dirichlet distribution , 2012 .

[23]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[24]  Shivani Agarwal,et al.  Ranking on graph data , 2006, ICML.

[25]  E. Chew Towards a mathematical model of tonality , 2000 .

[26]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[27]  Gert R. G. Lanckriet,et al.  Semantic Annotation and Retrieval of Music and Sound Effects , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[28]  François Pachet,et al.  Signal + Context = Better Classification , 2007, ISMIR.

[29]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[30]  Gert R. G. Lanckriet,et al.  Heterogeneous Embedding for Subjective Artist Similarity , 2009, ISMIR.

[31]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[32]  Yi-Hsuan Yang,et al.  Improving Musical Concept Detection by Ordinal Regression and Context Fusion , 2009, ISMIR.

[33]  Gert R. G. Lanckriet,et al.  Five Approaches to Collecting Tags for Music , 2008, ISMIR.

[34]  Nando de Freitas,et al.  A Statistical Model for General Contextual Object Recognition , 2004, ECCV.

[35]  Christopher Raphael,et al.  Automatic Segmentation of Acoustic Musical Signals Using Hidden Markov Models , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[36]  Geoffroy Peeters Chroma-based estimation of musical key from audio-signal analysis , 2006, ISMIR.

[37]  Gustavo Carneiro,et al.  Supervised Learning of Semantic Classes for Image Annotation and Retrieval , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Petri Toiviainen,et al.  MIR in Matlab (II): A Toolbox for Musical Feature Extraction from Audio , 2007, ISMIR.

[39]  Jyh-Shing Roger Jang,et al.  On the Use of Anti-Word Models for Audio Music Annotation and Retrieval , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[40]  William P. Birmingham,et al.  HMM-based musical query retrieval , 2002, JCDL '02.

[41]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[42]  Òscar Celma,et al.  Music recommendation and discovery in the long tail , 2008 .

[43]  Juan Pablo Bello,et al.  Audio-Based Cover Song Retrieval Using Approximate Chord Sequences: Testing Shifts, Gaps, Swaps and Beats , 2007, ISMIR.

[44]  Douglas Turnbull,et al.  Using Regression to Combine Data Sources for Semantic Music Discovery , 2009, ISMIR.

[45]  Thierry Bertin-Mahieux,et al.  Clustering Beat-Chroma Patterns in a Large Music Database , 2010, ISMIR.

[46]  Tao Li,et al.  Are Tags Better Than Audio? The Effect of Joint Use of Tags and Audio Content Features for Artistic Style Clustering , 2010, ISMIR.

[47]  Rainer Typke,et al.  Music Retrieval based on Melodic Similarity , 2007 .

[48]  Nicola Orio,et al.  A scalable cover identification engine , 2010, ACM Multimedia.

[49]  David A. Forsyth,et al.  Learning the semantics of words and pictures , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[50]  Thierry Bertin-Mahieux,et al.  Autotagger: A Model for Predicting Social Tags from Acoustic Features on Large Music Databases , 2008 .

[51]  Daniel P. W. Ellis,et al.  Automatic Record Reviews , 2004, ISMIR.

[52]  Masataka Goto,et al.  Recent studies on music information processing , 2004 .

[53]  Nuno Vasconcelos,et al.  From Pixels to Semantic Spaces: Advances in Content-Based Image Retrieval , 2007, Computer.

[54]  Edith Law,et al.  Input-agreement: a new mechanism for collecting data using human computation games , 2009, CHI.

[55]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[56]  George Tzanetakis,et al.  Improving automatic music tag annotation using stacked generalization of probabilistic SVM outputs , 2009, ACM Multimedia.

[57]  David A. Forsyth,et al.  Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary , 2002, ECCV.

[58]  Michael I. Jordan,et al.  Modeling annotated data , 2003, SIGIR.

[59]  M. Slaney,et al.  Locality-Sensitive Hashing for Finding Nearest Neighbors [Lecture Notes] , 2008, IEEE Signal Processing Magazine.

[60]  Klaus Seyerlehner,et al.  FRAME LEVEL AUDIO SIMILARITY - A CODEBOOK APPROACH , 2008 .

[61]  Matthias Mauch,et al.  Recognising Classical Works in Historical Recordings , 2010, ISMIR.

[62]  Gert R. G. Lanckriet,et al.  Combining audio content and social context for semantic music discovery , 2009, SIGIR.

[63]  Riccardo Miotto,et al.  A Probabilistic Approach to Merge Context and Content Information for Music Retrieval , 2010, ISMIR.

[64]  James Ze Wang,et al.  Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[65]  Thierry Bertin-Mahieux,et al.  Automatic Generation of Social Tags for Music Recommendation , 2007, NIPS.

[66]  Youngmoo E. Kim,et al.  Beat-Sync-Mash-Coder: A web application for real-time creation of beat-synchronous music mashups , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[67]  Riccardo Miotto,et al.  A Methodology for the Segmentation and Identification of Music Works , 2007, ISMIR.

[68]  Gert R. G. Lanckriet,et al.  Design and development of a semantic music discovery engine , 2008 .

[69]  Riccardo Miotto,et al.  Statistical Music Modeling Aimed at Identification and Alignment , 2010, Advances in Music Information Retrieval.

[70]  Nicola Orio,et al.  Music Retrieval: A Tutorial and Review , 2006, Found. Trends Inf. Retr..

[71]  Xavier Serra,et al.  Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[72]  Gert R. G. Lanckriet,et al.  Towards musical query-by-semantic-description using the CAL500 data set , 2007, SIGIR.

[73]  Ryan M. Rifkin,et al.  Musical query-by-description as a multiclass learning problem , 2002, 2002 IEEE Workshop on Multimedia Signal Processing..

[74]  Elias Pampalk,et al.  Content-based organization and visualization of music archives , 2002, MULTIMEDIA '02.

[75]  Pedro Cano,et al.  A Review of Audio Fingerprinting , 2005, J. VLSI Signal Process..

[76]  Jr. G. Forney,et al.  The viterbi algorithm , 1973 .

[77]  Daniel P. W. Ellis,et al.  A Large-Scale Evaluation of Acoustic and Subjective Music-Similarity Measures , 2004, Computer Music Journal.

[78]  Nuno Vasconcelos,et al.  Bridging the Gap: Query by Semantic Example , 2007, IEEE Transactions on Multimedia.

[79]  Gerhard Widmer,et al.  Improvements of Audio-Based Music Similarity and Genre Classificaton , 2005, ISMIR.

[80]  Chun Chen,et al.  Music recommendation by unified hypergraph: combining social media information and music content , 2010, ACM Multimedia.

[81]  George Tzanetakis,et al.  An experimental comparison of audio tempo induction algorithms , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[82]  Peter Knees,et al.  Augmenting Text-based Music Retrieval with Audio Similarity: Advantages and Limitations , 2009, ISMIR.

[83]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[84]  Perry R. Cook,et al.  Easy As CBA: A Simple Probabilistic Model for Tagging Music , 2009, ISMIR.

[85]  Antoni B. Chan,et al.  Automatic Music Tagging With Time Series Models , 2010, ISMIR.

[86]  Chihli Hung and Chih-Fong Tsai,et al.  Automatically Annotating Images with Keywords: A Review of Image Annotation Systems , 2008 .

[87]  Òscar Celma,et al.  Foafing the Music: Bridging the Semantic Gap in Music Recommendation , 2006, SEMWEB.

[88]  R. Manmatha,et al.  Multiple Bernoulli relevance models for image and video annotation , 2004, CVPR 2004.

[89]  C. Harte,et al.  Detecting harmonic change in musical audio , 2006, AMCMM '06.

[90]  Nuno Vasconcelos,et al.  Holistic context modeling using semantic co-occurrences , 2009, CVPR.

[91]  Nicola Orio,et al.  A Discrete Filter Bank Approach to Audio to Score Matching for Polyphonic Music , 2009, ISMIR.

[92]  Kilian Q. Weinberger,et al.  ISMIR 2008 – Session 3a – Content-Based Retrieval, Categorization and Similarity 1 LEARNING A METRIC FOR MUSIC SIMILARITY , 2022 .

[93]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[94]  Daniel P. W. Ellis,et al.  Song-Level Features and Support Vector Machines for Music Classification , 2005, ISMIR.

[95]  Marc Leman,et al.  Content-Based Music Information Retrieval: Current Directions and Future Challenges , 2008, Proceedings of the IEEE.

[96]  Riccardo Miotto,et al.  A Music Identification System Based on Chroma Indexing and Statistical Modeling , 2008, ISMIR.

[97]  Ning Hu,et al.  Understanding Search Performance in Query-by-Humming Systems , 2004, ISMIR.

[98]  Òscar Celma,et al.  Annotating Music Collections: How Content-Based Similarity Helps to Propagate Labels , 2007, ISMIR.

[99]  Peter Knees,et al.  A music search engine built upon audio-based and web-based similarity measures , 2007, SIGIR.

[100]  Riccardo Miotto,et al.  Automatic Identification of Music Works Through Audio Matching , 2007, ECDL.

[101]  Youngmoo E. Kim,et al.  Exploring automatic music annotation with "acoustically-objective" tags , 2010, MIR '10.

[102]  Paul Lamere,et al.  Social Tagging and Music Information Retrieval , 2008 .