Sound Art - Synthesis Based on Rhythm and Fea- ture Extraction

This paper presents a software tool that was developed for the transformation of audio signals, be it short sounds or whole songs. This can subsequently be used to synthesize music from audio signal fragments (“Concatenative Sound Synthesis“). It is attempted to use timbre-related features to re-synthesize audio signals. As in a mosaic, an existing song is newly constructed from small parts (“frames”) of other songs already stored in a database. Suitable database frames are found by feature similarity analysis. The length of the frames corresponds to musically meaningful units. For the implementation suitable onset- and beat tracking methods are evaluated. The selection of suitable parameters describing the subjective semantic similarities is determined by listening tests.

[1]  Yoichi Muraoka,et al.  A Real-Time Beat Tracking System for Audio Signals , 1996, ICMC.

[2]  Mark B. Sandler,et al.  A tutorial on onset detection in music signals , 2005, IEEE Transactions on Speech and Audio Processing.

[3]  Nick Collins BBCut2: Integrating beat tracking and on-the-fly event analysis , 2006 .

[4]  Laurent Daudet,et al.  Transients modelling by pruned wavelet trees , 2001, ICMC.

[5]  Yoichi Muraoka,et al.  Real-time Rhythm Tracking for Drumless Audio Signals --- Chord Change Detection for Musical Decision , 1997, International Joint Conference on Artificial Intelligence.

[6]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[7]  Jaakko Astola,et al.  Analysis of the meter of acoustic musical signals , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  George Tzanetakis,et al.  Automatic Musical Genre Classification of Audio Signals , 2001, ISMIR.

[9]  James Harley,et al.  Xenakis: His Life in Music , 2004 .

[10]  Jonathan Foote,et al.  Automatic audio segmentation using a measure of audio novelty , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[11]  Bob L. Sturm Adaptive Concatenative Sound Synthesis and Its Application to Micromontage Composition , 2006, Computer Music Journal.

[12]  F. Mörchen,et al.  MusicMiner : Visualizing timbre distances of music as topographical maps , 2005 .

[13]  Teresa H. Y. Meng,et al.  Transient Modeling Synthesis: a flexible analysis/synthesis tool for transient signals , 1998, ICMC.

[14]  Michael M. Goodwin,et al.  Frequency-Domain Algorithms for Audio Signal Enhancement Based on Transient Modification , 2006 .

[15]  Juan Pablo Bello,et al.  A Robust Mid-Level Representation for Harmonic Content in Music Signals , 2005, ISMIR.

[16]  Diemo Schwarz Concatenative sound synthesis: The early years , 2006 .

[17]  Perry R. Cook,et al.  MOSIEVIUS: FEATURE DRIVEN INTERACTIVE AUDIO MOSAICING , 2003 .

[18]  G. H. Wakefield,et al.  To catch a chorus: using chroma-based representations for audio thumbnailing , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[19]  Mark B. Sandler,et al.  On the use of phase and energy for musical onset detection in the complex domain , 2004, IEEE Signal Processing Letters.

[20]  Stephen Cox,et al.  Features and classifiers for the automatic classification of musical audio signals , 2004, ISMIR.

[21]  Anssi Klapuri,et al.  Sound onset detection by applying psychoacoustic knowledge , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[22]  Yoichi Muraoka,et al.  Real-time beat tracking for drumless audio signals: Chord change detection for musical decisions , 1999, Speech Commun..

[23]  Ulas Bagci,et al.  Automatic Classification of Musical Genres Using Inter-Genre Similarity , 2007, IEEE Signal Processing Letters.

[24]  Bernhard Feiten,et al.  A Sound-Retrieval Index Based on Two-Dimensional Similarity Maps , 1993 .

[25]  Simon Dixon,et al.  Automatic Extraction of Tempo and Beat From Expressive Performances , 2001 .

[26]  M. Davies,et al.  A HYBRID APPROACH TO MUSICAL NOTE ONSET DETECTION , 2002 .

[27]  Diemo Schwarz,et al.  A SYSTEM FOR DATA-DRIVEN CONCATENATIVE SOUND SYNTHESIS , 2000 .

[28]  Stephen McAdams,et al.  Instrument Sound Description in the Context of MPEG-7 , 2000, ICMC.

[29]  Udo Zoelzer,et al.  DAFX: Digital Audio Effects , 2011 .

[30]  Mark D. Plumbley,et al.  PROBABILITY AS METADATA: EVENT DETECTION IN MUSIC USING ICA AS A CONDITIONAL DENSITY MODEL , 2003 .

[31]  David Salesin,et al.  Audio Analogies: Creating New Music from an Existing Performance by concatenative synthesis , 2005, ICMC.

[32]  Kaare Brandt Petersen,et al.  Mel Frequency Cepstral Coefficients: An Evaluation of Robustness of MP3 Encoded Music , 2006, ISMIR.

[33]  Nick Collins A Comparison of Sound Onset Detection Algorithms with Emphasis on Psychoacoustically Motivated Detection Functions , 2005 .

[34]  Fabien Gouyon,et al.  Determination of the meter of musical audio signals: Seeking recurrences in beat segment descriptors , 2003 .

[35]  Mark Sandler,et al.  Automatic Chord Identifcation using a Quantised Chromagram , 2005 .