Interactive Music with Active Audio CDs

With a standard compact disc (CD) audio player, the only possibility for the user is to listen to the recorded track, passively: the interaction is limited to changing the global volume or the track. Imagine now that the listener can turn into a musician, playing with the sound sources present in the stereo mix, changing their respective volumes and locations in space. For example, a given instrument or voice can be either muted, amplified, or more generally moved in the acoustic space. This will be a kind of generalized karaoke, useful for disc jockeys and also for music pedagogy (when practicing an instrument). Our system shows that this dream has come true, with active CDs fully backward compatible while enabling interactive music. The magic is that "the music is in the sound": the structure of the mix is embedded in the sound signal itself, using audio watermarking techniques, and the embedded information is exploited by the player to perform the separation of the sources (patent pending) used in turn by a spatializer.

[1]  Harald Viste,et al.  Binaural localization and separation techniques , 2004 .

[2]  Laurent Girin,et al.  A high-capacity watermarking technique for audio signals based on MDCT-domain quantization , 2010 .

[3]  A. Lodge,et al.  On the Acoustic Shadow of a Sphere. With an Appendix, Giving the Values of Legendre's Functions from P$_{0}$ to P$_{20}$ at Intervals of 5 Degrees , 1904 .

[4]  John Princen,et al.  Analysis/Synthesis filter bank design based on time domain aliasing cancellation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[5]  Laurent Girin,et al.  Informed source separation of underdetermined instantaneous stereo mixtures using source index embedding , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  Barak A. Pearlmutter,et al.  Survey of sparse and non‐sparse methods in source separation , 2005, Int. J. Imaging Syst. Technol..

[7]  Christof Faller,et al.  Improved Time Delay Analysis/Synthesis for Parametric Stereo Audio Coding , 2006 .

[8]  Boris Mansencal,et al.  RetroSpat: a Perception-Based System for Semi-Automatic Diffusion of Acousmatic Music , 2008 .

[9]  Hiroshi Sawada,et al.  Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors , 2007, Signal Process..

[10]  Lord Rayleigh,et al.  LXI. Acoustical observations , 1877 .

[11]  Gregory W. Wornell,et al.  Quantization index modulation: A class of provably good methods for digital watermarking and information embedding , 2001, IEEE Trans. Inf. Theory.

[12]  R. G. Scurlock,et al.  Nuclear orientation of praseodymium 142 , 1958 .

[13]  C. Avendano,et al.  The CIPIC HRTF database , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[14]  John M. Chowning,et al.  THE SIMULATION OF MOVING SOUND SOURCES , 1970 .

[15]  Laurent Girin,et al.  Informed Source Separation of Linear Instantaneous Under-Determined Audio Mixtures by Source Index Embedding , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[16]  Rémi Gribonval,et al.  Oracle estimators for the benchmarking of source separation algorithms , 2007, Signal Process..

[17]  Rémi Gribonval,et al.  Sparse Representations in Audio and Music: From Coding to Source Separation , 2010, Proceedings of the IEEE.

[18]  John William Strutt Scientific Papers: On the Acoustic Shadow of a Sphere , 2009 .

[19]  A. J. Zuckerwar,et al.  Atmospheric absorption of sound: Further developments , 1995 .

[20]  Scott Rickard,et al.  Blind separation of speech mixtures via time-frequency masking , 2004, IEEE Transactions on Signal Processing.

[21]  G. F. Kuhn Model for the interaural time differences in the azimuthal plane , 1977 .

[22]  Michael Zibulevsky,et al.  Underdetermined blind source separation using sparse representations , 2001, Signal Process..

[23]  Thomas Sporer,et al.  PEAQ - The ITU Standard for Objective Measurement of Perceived Audio Quality , 2000 .

[24]  Hiroshi Sawada,et al.  K-means Based Underdetermined Blind Speech Separation , 2007, Blind Speech Separation.