论文信息 - Computing auditory perception

Computing auditory perception

In this paper the ingredients of computing auditory perception are reviewed. On the basic level there is neurophysiology, which is abstracted to artificial neural nets (ANNs) and enhanced by statistics to machine learning. There are high-level cognitive models derived from psychoacoustics (especially Gestalt principles). The gap between neuroscience and psychoacoustics has to be filled by numerics, statistics and heuristics. Computerised auditory models have a broad and diverse range of applications: hearing aids and implants, compression in audio codices, automated music analysis, music composition, interactive music installations, and information retrieval from large databases of music samples.

[1] Richard F. Lyon,et al. Auditory model inversion for sound separation , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[2] Ian Whalley,et al. Emotion, Theme And Structure: Enhancing Computer Music Through System Dynamics Modelling , 2000, ICMC.

[3] O. S. Marin,et al. Neurological Aspects of Music Perception and Performance , 1999 .

[4] R. Shepard. Geometrical approximations to the structure of musical pitch. , 1982, Psychological review.

[5] G. Soete,et al. Perceptual scaling of synthesized musical timbres: Common dimensions, specificities, and latent subject classes , 1995, Psychological research.

[6] R. Shepard. The analysis of proximities: Multidimensional scaling with an unknown distance function. II , 1962 .

[7] Tomohiro Nakatani,et al. Residue-Driven Architecture for Computational Auditory Scene Analysis , 1995, IJCAI.

[8] S. Lakatos. A common perceptual space for harmonic and percussive timbres , 2000, Perception & psychophysics.

[9] W Singer,et al. Role of the temporal domain for response selection and perceptual binding. , 1997, Cerebral cortex.

[10] Klaus Obermayer,et al. A new method for tracking modulations in tonal music in audio data format , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[11] Cornelius Weber,et al. Maximum a posteriori models for cortical modeling: feature detectors, topography and modularity , 2008 .

[12] Jonathan Berger,et al. A Neural Network Model of Metric Perception and Cognition in the Audition of Functional Tonal Music , 1997, ICMC.

[13] David Cope,et al. Experiments In Musical Intelligence , 1996 .

[14] M. Alexander,et al. Principles of Neural Science , 1981 .

[15] Gary L. Dannenbring,et al. The effect of continuity on auditory stream segregation , 1973 .

[16] Ian Whalley,et al. Applications of system dynamics modelling to computer music , 2000, Organised Sound.

[17] E. Terhardt,et al. Algorithm for extraction of pitch and pitch salience from complex tonal signals , 1982 .

[18] R. Benjamin Knapp,et al. A Bioelectric Controller for Computer Music Applications , 1990 .

[19] M. P. Friedman,et al. ACADEMIC PRESS SERIES IN COGNITION AND PERCEPTION , 1982 .

[20] Judith C. Brown. Calculation of a constant Q spectral transform , 1991 .

[21] Guy J. Brown,et al. Temporal synchronization in a neural oscillator model of primitive auditory stream segregation , 1998 .

[22] Johannes Feulner,et al. Neural Networks that Learn and Reproduce Various Styles of Harmonization , 1993, ICMC.