论文信息 - Monaural room acoustic parameters from music and speech.

Monaural room acoustic parameters from music and speech.

This paper compares two methods for extracting room acoustic parameters from reverberated speech and music. An approach which uses statistical machine learning, previously developed for speech, is extended to work with music. For speech, reverberation time estimations are within a perceptual difference limen of the true value. For music, virtually all early decay time estimations are within a difference limen of the true value. The estimation accuracy is not good enough in other cases due to differences between the simulated data set used to develop the empirical model and real rooms. The second method carries out a maximum likelihood estimation on decay phases at the end of notes or speech utterances. This paper extends the method to estimate parameters relating to the balance of early and late energies in the impulse response. For reverberation time and speech, the method provides estimations which are within the perceptual difference limen of the true value. For other parameters such as clarity, the estimations are not sufficiently accurate due to the natural reverberance of the excitation signals. Speech is a better test signal than music because of the greater periods of silence in the signal, although music is needed for low frequency measurement.

Yonggang Zhang | Paul Kendrick | Trevor J Cox | Francis F Li | Jonathon A Chambers

[1] F F Li,et al. Speech transmission index from running speech: a neural network approach. , 2003, The Journal of the Acoustical Society of America.

[2] Mohammad Bagher Menhaj,et al. Training feedforward networks with the Marquardt algorithm , 1994, IEEE Trans. Neural Networks.

[3] T.,et al. Training Feedforward Networks with the Marquardt Algorithm , 2004 .

[4] P. Mahalanobis. On the generalized distance in statistics , 1936 .

[5] Henrik Møller,et al. The acoustic conditions in Finnish concert spaces - Preliminary results , 2001 .

[6] Francis F. Li,et al. Blind estimation of reverberation parameters for non-diffuse rooms , 2007 .

[7] Heinrich Kuttruff,et al. Room acoustics , 1973 .

[8] Francis F. Li,et al. A neural network for blind identification of speech transmission index , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[9] Douglas L. Jones,et al. Blind estimation of reverberation time. , 2003, The Journal of the Acoustical Society of America.

[10] J. Aldrich. R.A. Fisher and the making of maximum likelihood 1912-1922 , 1997 .

[11] Wj Davies,et al. The sensitivity of listeners to early sound field changes in auditoriums , 1993 .