Adaptive whitening for Improved Real-Time audio onset Detection

We describe a new method for preprocessing STFT phasevocoder frames for improved performance in real-time onset detection, which we term "adaptive whitening". The procedure involves normalising the magnitude of each bin according to a recent maximum value for that bin, with the aim of allowing each bin to achieve a similar dynamic range over time, which helps to mitigate against the influence of spectral roll-off and strongly-varying dynamics. Adaptive whitening requires no training, is relatively lightweight to compute, and can run in real-time. Yet it can improve onset detector performance by more than ten percentage points (peak F-measure) in some cases, and improves the performance of most of the onset detectors tested. We present results demonstrating that adaptive whitening can significantly improve the performance of various STFT-based onset detection functions, including functions based on the power, spectral flux, phase deviation, and complex deviation measures. Our results find the process to be especially beneficial for certain types of audio signal (e.g. complex mixtures such as pop music).

[1]  David A. Patterson,et al.  Computer Architecture: A Quantitative Approach , 1969 .

[2]  E. Owens,et al.  An Introduction to the Psychology of Hearing , 1997 .

[3]  Anssi Klapuri,et al.  Sound onset detection by applying psychoacoustic knowledge , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[4]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[5]  Laurent Daudet,et al.  Transients modelling by pruned wavelet trees , 2001, ICMC.

[6]  M. Davies,et al.  A HYBRID APPROACH TO MUSICAL NOTE ONSET DETECTION , 2002 .

[7]  James McCartney,et al.  Rethinking the Computer Music Language: SuperCollider , 2002, Computer Music Journal.

[8]  Mark D. Plumbley,et al.  PROBABILITY AS METADATA: EVENT DETECTION IN MUSIC USING ICA AS A CONDITIONAL DENSITY MODEL , 2003 .

[9]  Mark D. Plumbley,et al.  Real-time temporal segmentation of note objects in music signals , 2004, ICMC.

[10]  Gaël Richard,et al.  Methodology and Tools for the evaluation of automatic onset detection algorithms in music , 2004, ISMIR.

[11]  Mark D. Plumbley,et al.  Fast labelling of notes in music signals , 2004, ISMIR.

[12]  Avi Pfeffer,et al.  A Hierarchical Approach to Onset Detection , 2004, ICMC.

[13]  Nick Collins Using a Pitch Detector for Onset Detection , 2005, ISMIR.

[14]  Mark B. Sandler,et al.  A tutorial on onset detection in music signals , 2005, IEEE Transactions on Speech and Audio Processing.

[15]  Paul M. Brossier,et al.  Automatic annotation of musical audio for interactive applications , 2006 .

[16]  S. Dixon ONSET DETECTION REVISITED , 2006 .

[17]  Douglas Eck,et al.  A Supervised Classification Algorithm for Note Onset Detection , 2006, EURASIP J. Adv. Signal Process..

[18]  N. M. Collins,et al.  Towards autonomous agents for live computer music : realtime machine listening and interactive music systems , 2007 .