An Evaluation of Audio Feature Extraction Toolboxes

Audio feature extraction underpins a large proportion of audio processing, music information retrieval, audio effect design and audio synthesis. Design, analysis, synthesis and evaluation often rely on audio features, but a large and diverse range of feature extraction tools has been presented to the community. An evaluation of existing audio feature extraction libraries was undertaken. Ten libraries and toolboxes were evaluated with the Cranfield Model for the evaluation of information retrieval systems, which reviews the coverage, effort, presentation and time lag of a system. The tools are compared, and example use cases are presented to show when each toolbox is most suitable. This paper allows a software engineer or researcher to quickly and easily select a suitable audio feature extraction toolbox.
