Feature dependence in the automatic identification of musical woodwind instruments.

The automatic identification of musical instruments is a relatively unexplored and potentially very important field for its promise to free humans from time-consuming searches on the Internet and indexing of audio material. Speaker identification techniques have been used in this paper to determine the properties (features) which are most effective in identifying a statistically significant number of sounds representing four classes of musical instruments (oboe, sax, clarinet, flute) excerpted from actual performances. Features examined include cepstral coefficients, constant-Q coefficients, spectral centroid, autocorrelation coefficients, and moments of the time wave. The number of these coefficients was varied, and in the case of cepstral coefficients, ten coefficients were sufficient for identification. Correct identifications of 79%-84% were obtained with cepstral coefficients, bin-to-bin differences of the constant-Q coefficients, and autocorrelation coefficients; the latter have not been used previously in either speaker or instrument identification work. These results depended on the training sounds chosen and the number of clusters used in the calculation. Comparison to a human perception experiment with sounds produced by the same instruments indicates that, under these conditions, computers do as well as humans in identifying woodwind instruments.

[1]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[2]  Keith Dana Martin,et al.  Sound-source recognition: a theory and computational model , 1999 .

[3]  Roger A. Kendall,et al.  The role of acoustic signal partitions in listener categorization of musical phrases. , 1986 .

[4]  James Anderson Moorer,et al.  On the segmentation and analysis of continuous musical sound by digital computer , 1975 .

[5]  James W. Beauchamp,et al.  Synthesis by Spectral Amplitude and "Brightness" Matching of Analyzed Musical Instrument Tones , 1981 .

[6]  W. Strong,et al.  Synthesis of Wind‐Instrument Tones , 1967 .

[7]  Lynn Wilcox,et al.  Audio indexing using speaker identification , 1994, Optics & Photonics.

[8]  S. Handel,et al.  Chapter 12 – Timbre Perception and Auditory Object Identification , 1995 .

[9]  S. McAdams,et al.  Discrimination of musical instrument sounds resynthesized with simplified spectrotemporal parameters. , 1999, The Journal of the Acoustical Society of America.

[10]  Halson Vashon Eagleson,et al.  Identification of Musical Instruments When Heard Directly and over a Public‐Address System , 1947 .

[11]  Melville Clark,et al.  Dependence of Timbre on the Tonal Loudness Produced by Musical Instruments , 1964 .

[12]  Janet Marques,et al.  An automatic annotation system for audio data containing music , 1999 .

[13]  S. McAdams,et al.  Auditory Cognition. (Book Reviews: Thinking in Sound. The Cognitive Psychology of Human Audition.) , 1993 .

[14]  J. F. Corso,et al.  Timbre Cues and the Identification of Musical Instruments , 1962 .

[15]  Xavier Rodet,et al.  Timbre Recognition with Combined Stationary and Temporal Features , 1998, ICMC.

[16]  Judith C. Brown Cluster‐based probability model for musical instrument identification , 1997 .

[17]  Climent Nadeu,et al.  Comparison of time & frequency filtering and cepstral-time matrix approaches in ASR , 1999, EUROSPEECH.

[18]  William J. Strong,et al.  Perturbations of Synthetic Orchestral Wind‐Instrument Tones , 1967 .

[19]  J C Brown Computer identification of musical instruments using pattern recognition with cepstral coefficients as features. , 1999, The Journal of the Acoustical Society of America.

[20]  K. Berger Some Factors in the Recognition of Timbre , 1964 .

[21]  Douglas A. Reynolds,et al.  Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..

[22]  Laveen N. Kanal,et al.  Patterns in pattern recognition: 1968-1974 , 1974, IEEE Trans. Inf. Theory.

[23]  Shlomo Dubnov,et al.  Polyspectra as measures of sound texture and timbre , 1997 .

[24]  Judith C. Brown Computer identification of wind instruments using cepstral coefficients , 1998 .