There are lots of different music recommendation systems to help users to get relevant items among the enormous amount of digital music items on all different purposes. The quality of a recommendation system is critically based on what the system can understand about the music objects. This has stimulated the research on automatic music information retrieval. Numerous approaches have been proposed for instrument recognition in terms of feature extraction and selection. Moving Picture Experts Group (MPEG) standardized a set of features based on the digital audio content data for the purpose of interpretation of the information meaning. Features investigated so far are intended to describe a frame window, the whole sound segment, or arbitrarily split bins of the sound segment. Sound vibration in the transient state is known to be significantly different from the one in the quasi-steady state, while information in the transient state is important for instrument recognition by human. However, the boundary of the transient state and feature behaviors in the transient state has been barely investigated. We proposed a differentiated analysis to harmonic features with transient duration boundary detection by instantaneous fundamental frequency in each frame.
[1]
S. Cessie,et al.
Ridge Estimators in Logistic Regression
,
1992
.
[2]
J. Ross Quinlan,et al.
C4.5: Programs for Machine Learning
,
1992
.
[3]
Andrew W. Moore,et al.
Locally Weighted Learning for Control
,
1997,
Artificial Intelligence Review.
[4]
James R. Glass,et al.
Hidden feature models for speech recognition using dynamic Bayesian networks
,
2003,
INTERSPEECH.
[5]
G. Balzano.
What Are Musical Pitch and Timbre
,
1986
.
[6]
Andrew W. Moore,et al.
Locally Weighted Learning
,
1997,
Artificial Intelligence Review.
[7]
Claude Cadoz.
Timbre et causalité
,
1987
.
[8]
Q. Summerfield.
Book Review: Auditory Scene Analysis: The Perceptual Organization of Sound
,
1992
.
[9]
Geoffrey Zweig,et al.
Speech Recognition with Dynamic Bayesian Networks
,
1998,
AAAI/IAAI.
[10]
Piotr Synak,et al.
Application of Temporal Descriptors to Musical Instrument Sound Recognition
,
2003,
Journal of Intelligent Information Systems.