Improved Auditory-Inspired Signal Processing Algorithm Design for Tracking Multiple Frequency Components

An adaptive synchrony capture filterbank (SCFB) signal processing architecture for tracking signal frequency components was proposed in a previous paper. Detailed analysis and further improvements are presented here. The original SCFB signal processing architecture consisted of a fixed array of traditional, passive linear, gammatone filters in cascade with a bank of three adaptively tunable bandpass filters that form part of a frequency discriminator loop (FDL). The SCFB exhibits many desirable properties for processing speech, music, and other complex sounds. In this article, an improved SCFB architecture is presented along with detailed specification of parameters that yield better frequency tracking. This architecture combines a FDL with a phase-locked loop, a concept borrowed from communication systems, to acquire and track frequency components in a time-varying signal. This modified algorithm estimates frequencies with higher accuracy even in the presence of closely spaced tones in the input. The improved performance of the algorithm is demonstrated by comparing the mean square error in the frequency estimates of synthetic time-varying signals.

[1]  R. Kumaresan,et al.  Model-based approach to envelope and positive instantaneous frequency estimation of signals with speech applications , 1999 .

[2]  Campbell L. Searle,et al.  Time‐domain analysis of auditory‐nerve fiber firing rates , 1989 .

[3]  Ramdas Kumaresan,et al.  RISC: An Improved Costas Estimator-Predictor Filter Bank For Decomposing Multicomponent Signals , 1994, IEEE Seventh SP Workshop on Statistical Signal and Array Processing.

[4]  Francis D. Natali,et al.  AFC Tracking Algorithms , 1984, IEEE Trans. Commun..

[5]  Floyd M. Gardner Properties of Frequency Difference Detectors , 1985, IEEE Trans. Commun..

[6]  B. Delgutte,et al.  Speech coding in the auditory nerve: I. Vowel-like sounds. , 1984, The Journal of the Acoustical Society of America.

[7]  Ramdas Kumaresan,et al.  Multiple pitch identification using cochlear-like frequency capture and harmonic grouping , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[8]  Patricia A. Pelle A Robust Pitch Extraction System Based on Phase Locked Loops , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[9]  M. Sachs,et al.  Encoding of steady-state vowels in the auditory nerve: representation in terms of discharge rate. , 1979, The Journal of the Acoustical Society of America.

[10]  B. Widrow,et al.  Adaptive noise cancelling: Principles and applications , 1975 .

[11]  Ian C. Bruce,et al.  Robust Formant Tracking for Continuous Speech With Speaker Variability , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  L. Jackson,et al.  Parameter estimation by linear prediction in cascade form , 1977 .

[13]  Steven Kay,et al.  A Fast and Accurate Single Frequency Estimator , 2022 .

[14]  Floyd M. Gardner,et al.  Phaselock techniques , 1984, IEEE Transactions on Systems, Man, and Cybernetics.

[15]  Patrick E. Mantey,et al.  Automatic frequency control via digital filtering , 1968 .

[16]  Ray Meddis,et al.  Auditory Periphery: From Pinna to Auditory Nerve , 2010 .

[17]  Ramdas Kumaresan,et al.  On decomposing speech into modulated components , 2000, IEEE Trans. Speech Audio Process..

[18]  Michael Rice,et al.  Digital Communications: A Discrete-Time Approach , 2008 .

[19]  Peter Atkinson,et al.  Phase-Locked and Frequency Feedback Systems , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[20]  Jacob Klapper,et al.  CHAPTER 2 – Review of Concepts , 1972 .

[21]  Vijay Kumar Peddinti,et al.  Synchrony capture filterbank: auditory-inspired signal processing for tracking individual frequency components in speech. , 2013, The Journal of the Acoustical Society of America.

[22]  David G. Messerschmitt Frequency Detectors for PLL Acquisition in Timing and Carrier Recovery , 1979, IEEE Trans. Commun..

[23]  Boualem Boashash,et al.  Instantaneous Frequency Of Signals: Concepts, Estimation Techniques And Applications , 1989, Optics & Photonics.

[24]  M. Padmanabhan A hyperstable adaptive line enhancer for fast tracking of sinusoidal inputs , 1996 .

[25]  Avery Li-Chun Wang,et al.  Instantaneous and frequency-warped signal processing techniques for auditory source separation , 1994 .