Least-Squares Estimation of the Extrema in the Narrow-Band Music Data

The aim of this study is to estimate the maximum and minimum data of the 10-channel music waveforms, each of which contains the spectro-temporal information with a pass-band of one octave at a conventional rate of 44.1 kHz. The least-squares estimation by using a parabola function has approximated the upper or lower half of a period of the filtered waveform, so that 88.1% of the extrema measured in ad- vance has significantly reduced phase error at a bandwidth of channels 3 to 9 (80-10,240 Hz) and also the quantization error for channels 7, 8 and 9 (2,560-10240 Hz). Because the small amplitude for 9.44% of the measured extrema is the primary impediment to the estimation, the neighboring data series around the peaks and/or valleys of the narrow-band waveform is locally regarded as a simple and symmetric structure, which is supported by the auto-regression (AR) process. It is suggested that the appearance of the extrema in the filtered waveform is plausibly predictable in the human auditory system, and the extremal informa- tion plays a key role in encoding the acoustic structure, i.e., the phase-lock discharge in the acoustic nerve.

[1]  R. M. Warren,et al.  Spectral redundancy: Intelligibility of sentences heard through narrow spectral slits , 1995, Perception & psychophysics.

[2]  B. Atal,et al.  Speech analysis and synthesis by linear prediction of the speech wave. , 1971, The Journal of the Acoustical Society of America.

[3]  J. Makhoul,et al.  Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.

[4]  Michael Elad,et al.  Superresolution restoration of an image sequence: adaptive filtering approach , 1999, IEEE Trans. Image Process..

[5]  J. Kelso,et al.  Multifunctionality and switching in the coordination dynamics of reaching and grasping , 1994 .

[6]  F. Attneave,et al.  Pitch as a medium: a new approach to psychophysical scaling. , 1971, The American journal of psychology.

[7]  A. Palmer,et al.  Phase-locking in the cochlear nerve of the guinea-pig and its relation to the receptor potential of inner hair-cells , 1986, Hearing Research.

[8]  Biing-Hwang Juang,et al.  Discriminative learning for minimum error classification [pattern recognition] , 1992, IEEE Trans. Signal Process..

[9]  A. Hudspeth,et al.  Sensitivity, polarity, and conductance change in the response of vertebrate hair cells to controlled mechanical stimuli. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[10]  E. Somersalo,et al.  Visualization of Magnetoencephalographic Data Using Minimum Current Estimates , 1999, NeuroImage.

[11]  D. D. Greenwood A cochlear frequency-position function for several species--29 years later. , 1990, The Journal of the Acoustical Society of America.

[12]  Keinosuke Fukunaga,et al.  Bias of Nearest Neighbor Error Estimates , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  H. Yoshida,et al.  The Extremal Sampling Technique , 2008, 2008 3rd International Conference on Innovative Computing Information and Control.

[14]  R V Shannon,et al.  Speech Recognition with Primarily Temporal Cues , 1995, Science.

[15]  Albert S. Bregman,et al.  The Auditory Scene. (Book Reviews: Auditory Scene Analysis. The Perceptual Organization of Sound.) , 1990 .

[16]  M. R. Schroeder,et al.  Adaptive predictive coding of speech signals , 1970, Bell Syst. Tech. J..

[17]  Max Wertheimer,et al.  Untersuchungen zur Lehre von der Gestalt , .

[18]  Y. Fujiwara,et al.  The Visualization Technique of Daily Corridor Noises , 2007, Second International Conference on Innovative Computing, Informatio and Control (ICICIC 2007).

[19]  R. Ilmoniemi,et al.  Magnetoencephalography-theory, instrumentation, and applications to noninvasive studies of the working human brain , 1993 .

[20]  William T. Freeman,et al.  Example-Based Super-Resolution , 2002, IEEE Computer Graphics and Applications.

[21]  Tomaso Poggio,et al.  Computational vision and regularization theory , 1985, Nature.

[22]  R. M. Warren Perceptual Restoration of Missing Speech Sounds , 1970, Science.

[23]  P M Sellick,et al.  Intracellular studies of hair cells in the mammalian cochlea. , 1978, The Journal of physiology.

[24]  James L. Flanagan,et al.  Adaptive quantization in differential PCM coding of speech , 1973 .

[25]  J. C. Middlebrooks,et al.  Two-dimensional sound localization by human listeners. , 1990, The Journal of the Acoustical Society of America.

[26]  Peter Dallos,et al.  Cochlear Inner and Outer Hair Cells: Functional Differences , 1972, Science.