Low-Latency Audio Pitch Tracking: A Multi-Modal Sensor-Assisted Approach

This paper presents a multi-modal approach to musical instrument pitch tracking combining audio and position sensor data. Finger location on a violin fingerboard is measured using resistive sensors, allowing rapid detection of approximate pitch. The initial pitch estimate is then used to restrict the search space of an audio pitch tracking algorithm. Most audio pitch tracking algorithms face a fundamental tradeoff between accuracy and latency, with longer analysis windows producing better pitch estimates at the cost of noticeable lag in a live performance environment. Conversely, sensor-only strategies struggle to achieve the fine pitch accuracy a human listener would expect. While this paper is violin centric, it demonstrates a more general concept for augmented instruments that by combining the two differing approaches, high accuracy and low latency pitch can be simultaneiously achieved.

[1]  Hideki Kawahara,et al.  Comparative evaluation of F0 estimation algorithms , 2001, INTERSPEECH.

[2]  Judith C. Brown Musical fundamental frequency tracking using a pattern recognition method , 1992 .

[3]  Adrian Freed,et al.  "Old" is the New "New": a Fingerboard Case Study in Recrudescence as a NIME Development Strategy , 2013, NIME.

[4]  Emmanouil Benetos,et al.  Automatic Transcription of Polyphonic Music Exploiting Temporal Evolution , 2012 .

[5]  Andrew P. McPherson,et al.  Near-Field Optical Reflective Sensing for Bow Tracking , 2013, NIME.

[6]  A. Tveit,et al.  The Overtone Fiddle: an Actuated Acoustic Instrument , 2011 .

[7]  David A. Krubsack,et al.  A spectral autocorrelation method for measurement of the fundamental frequency of noise-corrupted speech , 1987, IEEE Trans. Acoust. Speech Signal Process..

[8]  Adrian Freed,et al.  Novel and Forgotten Current-steering Techniques for Resistive Multitouch, Duotouch, and Polytouch Position Sensing with Pressure , 2009, NIME.

[9]  Amar Chaudhary,et al.  Operating Systems Latency Measurement and Analysis for Sound Synthesis and Processing Applications , 1997, ICMC.

[10]  Jordi Bonada,et al.  Acquisition of Violin instrumental gestures using a Commercial EMF tracking Device , 2007, ICMC.

[11]  Frédéric Bevilacqua,et al.  Gesture Analysis of Violin Bow Strokes , 2005, Gesture Workshop.

[12]  Nicolas Courty,et al.  Gesture in Human-Computer Interaction and Simulation , 2006 .

[13]  Lawrence R. Rabiner,et al.  On the use of autocorrelation analysis for pitch detection , 1977 .

[14]  Perttu Hämäläinen,et al.  Latency Tolerance for Gesture Controlled Continuous Sound Instrument without Tactile Feedback , 2004, ICMC.

[15]  A. Noll Cepstrum pitch determination. , 1967, The Journal of the Acoustical Society of America.

[16]  Joseph A. Paradiso,et al.  Musical Applications of Electric Field Sensing , 1997 .

[17]  Durand R. Begault,et al.  Sensitivity to haptic-audio asynchrony , 2003, ICMI '03.

[18]  Dan Overholt,et al.  The Overtone Violin: a New Computer Music Instrument , 2005, ICMC.

[19]  Anders Askenfelt,et al.  Measurement of bow motion and bow force in violin playing , 1986 .

[20]  Alexander Refsum Jensenius,et al.  Performing the Electric Violin in a Sonic Space , 2012, Computer Music Journal.

[21]  Mark B. Sandler,et al.  The Sonic Visualiser: A Visualisation Platform for Semantic Descriptors from Musical Signals , 2006, ISMIR.

[22]  Hideki Kawahara,et al.  YIN, a fundamental frequency estimator for speech and music. , 2002, The Journal of the Acoustical Society of America.

[23]  Erwin Schoonderwaldt,et al.  Extraction of bowing parameters from violin performance combining motion capture and sensors. , 2009, The Journal of the Acoustical Society of America.

[24]  Cornelius Pöpel,et al.  Recent Developments in Violin-related Digital Musical Instruments: Where Are We and Where Are We Going? , 2006, NIME.

[25]  Ajay Kapur,et al.  The Electronic Sitar Controller , 2004, NIME.

[26]  M. Mathews,et al.  Electronic simulation of violin resonances , 1973 .