Direct access in content-based Audio Information Retrieval: A state of the art and challenges

This paper surveys Audio Information Retrieval (AIR) using a literature review and classification of articles from 1994 to 2010 with a keyword index and article abstract in order to explore how AIR methodologies and applications have developed during this period. Based on the scope of many papers and journals of AIR, this paper surveys and classifies AIR problem domains into five domains: AIR framework, audio feature extraction, audio classification, audio/music similarity, and audio tools/applications with their applications for different research and problem domains. Based on the current state of the art, we discuss the major challenges for the future.

[1]  Margaret Cahill,et al.  Melodic Similarity Algorithms -- Using Similarity Ratings for Development and Early Evaluation , 2005, ISMIR.

[2]  Gert R. G. Lanckriet,et al.  Combining Feature Kernels for Semantic Music Retrieval , 2008, ISMIR.

[3]  Jamie Bullock,et al.  Libxtract: a Lightweight Library for audio Feature Extraction , 2007, ICMC.

[4]  Avery Wang,et al.  An Industrial Strength Audio Search Algorithm , 2003, ISMIR.

[5]  Peter Knees,et al.  A HIGH-LEVEL AUDIO FEATURE FOR MUSIC RETRIEVAL AND SORTING , 2010 .

[6]  Meinard Müller,et al.  The Cyclic Beat Spectrum: Tempo-Related Audio Features for Time-Scale Invariant Audio Identification , 2006, ISMIR.

[7]  O. Lartillot,et al.  A MATLAB TOOLBOX FOR MUSICAL FEATURE EXTRACTION FROM AUDIO , 2007 .

[8]  Petri Toiviainen,et al.  MIR in Matlab (II): A Toolbox for Musical Feature Extraction from Audio , 2007, ISMIR.

[9]  Remco C. Veltkamp,et al.  A Survey of Music Information Retrieval Systems , 2005, ISMIR.

[10]  Christian Breiteneder,et al.  Features for Content-Based Audio Retrieval , 2010, Adv. Comput..

[11]  Jeroen Breebaart,et al.  Features for audio and music classification , 2003, ISMIR.

[12]  Ian Taylor,et al.  DART: A Framework for Distributed Audio Analysis and Music Information Retrieval , 2008 .

[13]  Kunio Kashino,et al.  A Robust Musical Audio Search Method Based on Diagonal Dynamic Programming Matching of Self-Similarity Matrices , 2008, ISMIR.

[14]  George Tzanetakis,et al.  MARSYAS: a framework for audio analysis , 1999, Organised Sound.

[15]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[16]  Masataka Goto,et al.  A Stochastic Representation of the Dynamics of Sung Melody , 2007, ISMIR.

[17]  Ramesh C. Jain,et al.  ACM SIGMM retreat report on future directions in multimedia research , 2005, TOMCCAP.

[18]  L. Deng Ieee Transactions on Speech and Audio Processing, Speech Trajectory Discrimination Using the Minimum Classiication Error Learning , 1997 .

[19]  Douglas Keislar,et al.  Content-Based Classification, Search, and Retrieval of Audio , 1996, IEEE Multim..

[20]  Sergios Theodoridis,et al.  Music Retrieval by Rhythmic Similarity Applied on Greek and African Traditional Music , 2007, ISMIR.

[21]  Igor Vatolkin,et al.  AMUSE (Advanced MUSic Explorer) - A Multitool Framework for Music Data Analysis , 2010, ISMIR.

[22]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[23]  G. Widmer Mirage - High-Performance Music Similarity Computation and Automatic Playlist Generation , 2007 .

[24]  Mohd Afizi Mohd Shukran,et al.  A new hybrid audio classification algorithm based on SVM weight factor and Euclidean distance , 2007 .

[25]  Nicola Orio,et al.  A Measure of Melodic Similarity based on a Graph Representation of the Music Structure , 2009, ISMIR.

[26]  Torben Bach Pedersen,et al.  High-Level Audio Features: Distributed Extraction and Similarity Search , 2008, ISMIR.

[27]  Laurent Imbert,et al.  Accelerating Query-by-Humming on GPU , 2009, ISMIR.

[28]  Guodong Guo,et al.  Content-based audio classification and retrieval by support vector machines , 2003, IEEE Trans. Neural Networks.

[29]  George Tzanetakis,et al.  Pitch Histograms in Audio and Symbolic Music Information Retrieval , 2003, ISMIR.

[30]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Chunru Wan,et al.  Content-based audio classification and retrieval using a fuzzy logic system: towards multimedia search engines , 2002, Soft Comput..

[32]  Remco C. Veltkamp,et al.  Muugle: A Modular Music Information Retrieval Framework , 2006, ISMIR.

[33]  Sea Ling,et al.  Bio-inspired Audio Content-Based Retrieval Framework (B-ACRF) , 2009 .

[34]  Gert R. G. Lanckriet,et al.  Learning Similarity from Collaborative Filters , 2010, ISMIR.

[35]  David García,et al.  CLAM: an OO framework for developing audio and music applications , 2002, OOPSLA '02.

[36]  John P. Eakins,et al.  Shape Feature Matching for Trademark Image Retrieval , 2003, CIVR.

[37]  Makoto P. Kato RhythMiXearch: Searching for Unknown Music by Mixing Known Music , 2009, ISMIR.

[38]  Guerino Mazzola,et al.  The RUBATO Performance Workstation on NeXTSTEP , 1994, ICMC.

[39]  J. Stephen Downie,et al.  The International Music Information Retrieval Systems Evaluation Laboratory: Governance, Access and Security , 2004, ISMIR.

[40]  Masataka Goto,et al.  Query-by-conducting: An Interface to Retrieve Classical-music Interpretations by Real-time Tempo Input , 2010, ISMIR.

[41]  Remco C. Veltkamp,et al.  Applying Rhythmic Similarity Based on Inner Metric Analysis to Folksong Research , 2007, ISMIR.

[42]  Simon J. Godsill,et al.  A Probabilistic Framework for Matching Music Representations , 2007, ISMIR.

[43]  Lei Chen,et al.  Searching musical audio datasets by a batch of multi-variant tracks , 2008, MIR '08.

[44]  Chin-Hui Lee,et al.  A Study on Music Genre Classification Based on Universal Acoustic Models , 2006, ISMIR.

[45]  Peter Hlavac,et al.  Map-based music interfaces for mobile devices , 2008, ACM Multimedia.

[46]  Jonathan Foote,et al.  An overview of audio information retrieval , 1999, Multimedia Systems.

[47]  Fraunhofer IDMT Langewiesener Two Note Based Approaches to Query by Singing / Humming Christian Sailer , 2006 .