论文信息 - Automatic Transcription of Polyphonic Music Exploiting Temporal Evolution

Automatic Transcription of Polyphonic Music Exploiting Temporal Evolution

This work was funded by a Queen Mary University of London Westfield Trust Research Studentship.

[1] D. Wang,et al. Computational Auditory Scene Analysis: Principles, Algorithms, and Applications , 2008, IEEE Trans. Neural Networks.

[2] Nicolás Ruiz-Reyes,et al. A Joint Approach to Extract Multiple Fundamental Frequency in Polyphonic Signals Minimizing Gaussian Spectral Distance , 2009 .

[3] Mert Bay,et al. Evaluation of Multiple-F0 Estimation and Tracking Systems , 2009, ISMIR.

[4] José M. Iñesta,et al. MULTIPLE FUNDAMENTAL FREQUENCY ESTIMATION BASED ON SPECTRAL PATTERN LOUDNESS AND SMOOTHNESS , 2007 .

[5] Dirk T. M. Slock,et al. Perceptually motivated quasi-periodic signal selection for polyphonic music transcription , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[6] Hirokazu Kameoka,et al. Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms , 2010, LVA/ICA.

[7] Hirokazu Kameoka,et al. Specmurt Analysis of Polyphonic Music Signals , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[8] David J. Field,et al. Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[9] J. Barbour. Tuning and Temperament: A Historical Survey , 2004 .

[10] Paris Smaragdis. Polyphonic pitch tracking by example , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[11] Isabelle Guyon,et al. What Size Test Set Gives Good Error Rate Estimates? , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[12] Anssi Klapuri,et al. Automatic Music Transcription: Breaking the Glass Ceiling , 2012, ISMIR.

[13] Paris Smaragdis,et al. Relative pitch estimation of multiple instruments , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[14] Anssi Klapuri,et al. Shift-variant non-negative matrix deconvolution for music transcription , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[15] Malcolm D. Macleod,et al. The Automated Music Transcription Problem , 2004 .

[16] Patrick J. Wolfe,et al. High Time-Resolution Estimation of Multiple Fundamental Frequencies , 2007, ISMIR.

[17] Ana Paula Rocha,et al. Fragmentation and Frontier Evolution for Genetic Algorithms Optimization in Music Transcription , 2008, IBERAMIA.

[18] Tuomas Virtanen,et al. Musical Instrument Sound Multi-Excitation Model for Non-Negative Spectrogram Factorization , 2011, IEEE Journal of Selected Topics in Signal Processing.

[19] Mark D. Plumbley,et al. Automatic Music Transcription and Audio Source Separation , 2002, Cybern. Syst..

[20] Matthew Brand,et al. Pattern discovery via entropy minimization , 1999, AISTATS.

[21] Emmanuel Vincent,et al. Multiple Pitch Transcription using DBN-based Musicological Models , 2010, ISMIR.

[22] Mark D. Plumbley,et al. Unsupervised analysis of polyphonic music by sparse coding , 2006, IEEE Transactions on Neural Networks.

[23] Judith C. Brown. Calculation of a constant Q spectral transform , 1991 .

[24] Judith C. Brown. Musical fundamental frequency tracking using a pattern recognition method , 1992 .

[25] A. Röbel,et al. A NEW SCORE FUNCTION FOR JOINT EVALUATION OF MULTIPLEF0 HYPOTHESES , 2004 .

[26] Ana M. Barbancho,et al. SIC receiver for polyphonic piano music , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[27] Masataka Goto,et al. A Nonparametric Bayesian Multipitch Analyzer Based on Infinite Latent Harmonic Allocation , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[28] Simon J. Godsill,et al. Transcription of Musical Audio Using Poisson Point Processes and Sequential MCMC , 2010, CMMR.

[29] Simon J. Godsill,et al. Generative Spectrogram Factorization Models for Polyphonic Piano Transcription , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[30] Shyh-Kang Jeng,et al. Automatic Transcription for Music with Two Timbres from Monaural Sound Source , 2010, 2010 IEEE International Symposium on Multimedia.

[31] Hideki Kawahara,et al. YIN, a fundamental frequency estimator for speech and music. , 2002, The Journal of the Acoustical Society of America.

[32] Anssi Klapuri,et al. Score-informed transcription for automatic piano tutoring , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[33] Corey Cheng,et al. Multiple F0 Estimation in the Transform Domain , 2009, ISMIR.

[34] Guy J. Brown,et al. Multiple F0 Estimation , 2006 .

[35] Bhiksha Raj,et al. Adobe Systems , 1998 .

[36] Anssi Klapuri. A Method for Visualizing the Pitch Content of Polyphonic Music Signals , 2009, ISMIR.

[37] Anssi Klapuri,et al. Signal Processing Methods for Music Transcription , 2006 .

[38] Simon Dixon,et al. A temporally-constrained convolutive probabilistic model for pitch detection , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[39] S. Dixon,et al. MULTIPLE FUNDAMENTAL FREQUENCY ESTIMATION USING SPECTRAL STRUCTURE AND TEMPORAL EVOLUTION RULES , 2010 .

[40] Ruohua Zhou,et al. Feature extraction of musical content for automatic music transcription , 2006 .

[41] Matti Karjalainen,et al. A computationally efficient multipitch analysis model , 2000, IEEE Trans. Speech Audio Process..

[42] Simon Dixon,et al. Joint Multi-Pitch Detection Using Harmonic Envelope Estimation for Polyphonic Music Transcription , 2011, IEEE Journal of Selected Topics in Signal Processing.

[43] Mark D. Plumbley,et al. Polyphonic transcription by non-negative sparse coding of power spectra , 2004, ISMIR.

[44] H. Sebastian Seung,et al. Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[45] Nicolás Ruiz-Reyes,et al. Polyphonic transcription based on temporal evolution of spectral similarity of gaussian mixture models , 2009, 2009 17th European Signal Processing Conference.

[46] Anssi Klapuri,et al. Multiple fundamental frequency estimation based on harmonicity and spectral smoothness , 2003, IEEE Trans. Speech Audio Process..

[47] Masataka Goto,et al. RWC Music Database: Music genre database and musical instrument sound database , 2003, ISMIR.

[48] Anssi Klapuri,et al. Separation of harmonic sounds using linear models for the overtone series , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[49] Mark B. Sandler,et al. Automatic Piano Transcription Using Frequency and Time-Domain Information , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[50] Anssi Klapuri,et al. Multiple Fundamental Frequency Estimation by Summing Harmonic Amplitudes , 2006, ISMIR.

[51] Christopher Raphael,et al. Automatic Transcription of Piano Music , 2002, ISMIR.

[52] Roland Badeau,et al. NMF With Time–Frequency Activations to Model Nonstationary Audio Events , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[53] Matija Marolt. Gaussian Mixture Models For Extraction Of Melodic Lines From Audio Recordings , 2004, ISMIR.

[54] Andres Kwasinski,et al. Automatic real-time electric guitar audio transcription , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[55] Matti Karjalainen,et al. Multi-pitch and periodicity analysis model for sound separation and auditory scene analysis , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[56] Dan Stowell,et al. Adaptive whitening for Improved Real-Time audio onset Detection , 2007, ICMC.

[57] Andreas Jakobsson,et al. Subspace-based fundamental frequency estimation , 2004, 2004 12th European Signal Processing Conference.

[58] Daniel P. W. Ellis,et al. A Discriminative Model for Polyphonic Piano Transcription , 2007, EURASIP J. Adv. Signal Process..

[59] Hirokazu Kameoka,et al. Explicit beat structure modeling for non-negative matrix factorization-based multipitch analysis , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[60] Hirokazu Kameoka,et al. Probabilistic Approach to Automatic Music Transcription from Audio Signals , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[61] J C Brown. Computer identification of musical instruments using pattern recognition with cepstral coefficients as features. , 1999, The Journal of the Acoustical Society of America.

[62] Juan Pablo,et al. Towards the automated analysis of simple polyphonic music : a knowledge-based approach , 2003 .

[63] Daniel P. W. Ellis,et al. Classification-based melody transcription , 2006, Machine Learning.

[64] Roland Badeau,et al. NMF With Time-Frequency Activations to Model Nonstationary Audio Events , 2011, IEEE Trans. Speech Audio Process..

[65] Roland Badeau,et al. Expectation-maximization algorithm for multi-pitch estimation and separation of overlapping harmonic spectra , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[66] Andreas Jakobsson,et al. The Multi-Pitch Estimation Problem: some New Solutions , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[67] Michael Elad,et al. K-SVD and its non-negative variant for dictionary design , 2005, SPIE Optics + Photonics.

[68] Emmanuel Vincent,et al. Fast bayesian nmf algorithms enforcing harmonicity and temporal continuity in polyphonic music transcription , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[69] Tuomas Virtanen,et al. Separation of sound sources by convolutive sparse coding , 2004, SAPA@INTERSPEECH.

[70] Roland Badeau,et al. Multipitch Estimation of Piano Sounds Using a New Probabilistic Spectral Smoothness Principle , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[71] Nello Cristianini,et al. An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[72] P. Smaragdis,et al. Shift-Invariant Probabilistic Latent Component Analysis , 2007 .

[73] C.-C. Jay Kuo,et al. Sparse Music Representation With Source-Specific Dictionaries and Its Application to Signal Separation , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[74] Shunzheng Yu,et al. Hidden semi-Markov models , 2010, Artif. Intell..

[75] David Gunawan,et al. Identification of Partials in Polyphonic Mixtures Based on Temporal Envelope Similarity , 2007 .

[76] Hirokazu Kameoka,et al. A Multipitch Analyzer Based on Harmonic Temporal Structured Clustering , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[77] Simon Dixon,et al. A Shift-Invariant Latent Variable Model for Automatic Music Transcription , 2012, Computer Music Journal.

[78] F. J. Cañadas Quesada,et al. A Multiple-F0 Estimation Approach Based on Gaussian Spectral Modelling for Polyphonic Music Transcription , 2010 .

[79] Md. Al Mehedi Hasan,et al. Template music transcription for different types of musical instruments , 2010, 2010 The 2nd International Conference on Computer and Automation Engineering (ICCAE).

[80] Daniel P. W. Ellis,et al. Spectral vs. spectro-temporal features for acoustic event detection , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[81] Francisco Fernández de Vega,et al. Hybrid Genetic Algorithm Based on Gene Fragment Competition for Polyphonic Music Transcription , 2008, EvoWorkshops.

[82] Ali Taylan Cemgil,et al. Probabilistic Models for Real-time Acoustic Event Detection with Application to Pitch Tracking , 2011 .

[83] Simon Dixon,et al. Accurate Real-time Windowed Time Warping , 2010, ISMIR.

[84] Simon J. Godsill,et al. Point process MCMC for sequential music transcription , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[85] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[86] José Manuel Iñesta Quereda,et al. Multiple fundamental frequency estimation using Gaussian smoothness , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[87] Emmanuel Vincent,et al. Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix Factorization Applied to Polyphonic Music Transcription , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[88] Francisco Javier Casajús-Quirós,et al. Multi-pitch estimation for polyphonic musical signals , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[89] Eamonn Keogh. Exact Indexing of Dynamic Time Warping , 2002, VLDB.

[90] Antonio Pertusa Ibáñez,et al. Computationally efficient methods for polyphonic music transcription , 2010 .

[91] D. Chazan,et al. Automatic transcription of piano polyphonic music , 2005, ISPA 2005. Proceedings of the 4th International Symposium on Image and Signal Processing and Analysis, 2005..

[92] Anssi Klapuri,et al. Signal Processing Methods for the Automatic Transcription of Music , 2004 .

[93] Michael I. Jordan,et al. Discriminative training of hidden Markov models for multiple pitch tracking [speech processing examples] , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[94] Christopher Raphael,et al. Automatic Transcription of Music Audio Through Continuous Parameter Tracking , 2007, ISMIR.

[95] Anssi Klapuri,et al. Automatic Transcription of Melody, Bass Line, and Chords in Polyphonic Music , 2008, Computer Music Journal.

[96] Mark Sandler,et al. A Partial Searching Algorithm and Its Application for Polyphonic Music Transcription , 2005, ISMIR.

[97] R. Meddis,et al. A unitary model of pitch perception. , 1997, The Journal of the Acoustical Society of America.

[98] José Manuel Iñesta Quereda,et al. Efficient methods for joint estimation of multiple fundamental frequencies in music signals , 2012, EURASIP Journal on Advances in Signal Processing.

[99] Simon J. Godsill,et al. Bayesian harmonic models for musical pitch estimation and analysis , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[100] Daniel P. W. Ellis,et al. Multi-voice polyphonic music transcription using eigeninstruments , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[101] Brian E Anderson,et al. The effect of inharmonic partials on pitch of piano tones. , 2002, The Journal of the Acoustical Society of America.

[102] B. Shinn-Cunningham,et al. Latent variable framework for modeling and separating single-channel acoustic sources , 2008 .

[103] Jaakko Astola,et al. Analysis of the meter of acoustic musical signals , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[104] Simon Godsill,et al. Poisson point process modeling for polyphonic music transcription. , 2007, The Journal of the Acoustical Society of America.

[105] Roland Badeau,et al. Weighted maximum likelihood autoregressive and moving average spectrum modeling , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[106] Guillaume Lemaitre,et al. Real-time Polyphonic Music Transcription with Non-negative Matrix Factorization and Beta-divergence , 2010, ISMIR.

[107] Fabrizio Argenti,et al. AUTOMATIC TRANSCRIPTION OF POLYPHONIC MUSIC BASED ON CONSTANT-Q BISPECTRAL ANALYSIS FOR MIREX 2009 , 2009 .

[108] Ye Wang,et al. Low Level Descriptors for Automatic Violin Transcription , 2006, ISMIR.

[109] Masataka Goto,et al. Unsupervised music understanding based on nonparametric Bayesian models , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[110] Liming Chen,et al. Use of Continuous Wavelet-like Transform in automated music transcription , 2006, 2006 14th European Signal Processing Conference.

[111] Anssi Klapuri,et al. Automatic Transcription of Guitar Chords and Fingering From Audio , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[112] Mark B. Sandler,et al. A tutorial on onset detection in music signals , 2005, IEEE Transactions on Speech and Audio Processing.

[113] Bhiksha Raj,et al. Non-negative Hidden Markov Modeling of Audio with Application to Source Separation , 2010, LVA/ICA.

[114] Xavier Rodet,et al. MULTIPLE-F0 TRACKING BASED ON A HIGH-ORDER HMM MODEL , 2008 .

[115] Nicola Orio,et al. An HMM-based pitch tracker for audio queries , 2003, ISMIR.

[116] Shyh-Kang Jeng,et al. An automatic transcription system with octave detection , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[117] Roland Badeau,et al. Score informed audio source separation using a parametric model of non-negative spectrogram , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[118] Tuomas Virtanen,et al. Combining pitch-based inference and non-negative spectrogram factorization in separating vocals from polyphonic music , 2008, SAPA@INTERSPEECH.

[119] José Manuel Iñesta Quereda,et al. Pattern Recognition Algorithms for Polyphonic Music Transcription , 2004, PRIS.

[120] A. Noll. Cepstrum pitch determination. , 1967, The Journal of the Acoustical Society of America.

[121] Thomas Hofmann,et al. Probabilistic Latent Semantic Analysis , 1999, UAI.

[122] Anssi Klapuri. A CLASSIFICATION APPROACH TO MULTIPITCH ANALYSIS , 2009 .

[123] Roland Badeau,et al. Adaptive harmonic time-frequency decomposition of audio using shift-invariant PLCA , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[124] Mark F. Bocko,et al. Polyphonic music transcription employing max-margin classification of spectrograhic features , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[125] Claus Weihs,et al. Parameter Optimization in Automatic Transcription of Music , 2005, GfKl.

[126] Shigeki Sagayama,et al. Note detection with dynamic bayesian networks as a postanalysis step for NMF-based multiple pitch estimation techniques , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[127] Jun Wu,et al. Polyphonic Pitch Estimation and Instrument Identification by Joint Modeling of Sustained and Attack Sounds , 2011, IEEE Journal of Selected Topics in Signal Processing.

[128] Fabrizio Argenti,et al. Automatic Transcription of Polyphonic Music Based on the Constant-Q Bispectral Analysis , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[129] Michael Johansson April. Automatic Transcription of Polyphonic Music Using Harmonic Relations , 2003 .

[130] Judith C. Brown,et al. An efficient algorithm for the calculation of a constant Q transform , 1992 .

[131] R. Badeau,et al. Multipitch estimation of quasi-harmonic sounds in colored noise , 2007 .

[132] Daniel P. W. Ellis,et al. IMPROVING GENERALIZATION FOR POLYPHONIC PIANO TRANSCRIPTION , 2007 .

[133] Kunio Kashino,et al. Application of the Bayesian probability network to music scene analysis , 1998 .

[134] Yi-Hsuan Yang,et al. Automatic transcription of piano music by sparse representation of magnitude spectra , 2011, 2011 IEEE International Conference on Multimedia and Expo.

[135] Simon Dixon,et al. Automatically detecting key modulations in J.S. Bach chorale recordings , 2011 .

[136] Rémi Gribonval,et al. Harmonic decomposition of audio signals with matching pursuit , 2003, IEEE Trans. Signal Process..

[137] Bhiksha Raj,et al. Probabilistic Latent Variable Models as Nonnegative Factorizations , 2008, Comput. Intell. Neurosci..

[138] Hirokazu Kameoka,et al. Infinite-state spectrum model for music signal analysis , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[139] Jean-Pierre Martens,et al. Assessment of State-of-the-Art Meter Analysis Systems with an Extended Meter Description Model , 2007, ISMIR.

[140] Simon J. Godsill,et al. Bayesian harmonic models for musical signal analysis , 2003 .

[141] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[142] Paris Smaragdis. Relative-pitch tracking of multiple arbitrary sounds. , 2009, The Journal of the Acoustical Society of America.

[143] Randal J. Leistikow,et al. A New Probabilistic Spectral Pitch Estimator: Exact and MCMC-approximate Strategies , 2004, CMMR.

[144] Mark B. Sandler,et al. Techniques for Automatic Music Transcription , 2000, ISMIR.

[145] Daniel P. W. Ellis,et al. Signal Processing for Music Analysis , 2011, IEEE Journal of Selected Topics in Signal Processing.

[146] Amara Lynn Graps,et al. An introduction to wavelets , 1995 .

[147] Andreas Rauber,et al. Improving Genre Classification by Combination of Audio and Symbolic Descriptors Using a Transcription Systems , 2007, ISMIR.

[148] DeLiang Wang,et al. Pitch Detection in Polyphonic Music using Instrument Tone Models , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[149] Simon Dixon,et al. On the Computer Recognition of Solo Piano Music , 2000 .

[150] Yi-Hsuan Yang,et al. Multipitch Estimation of Piano Music by Exemplar-Based Sparse Representation , 2012, IEEE Transactions on Multimedia.

[151] Ana M. Barbancho,et al. Polyphony Number Estimator for Piano Recordings Using Different Spectral Patterns , 2010 .

[152] Dan Zhang,et al. Multi-Pitch Estimation Based on Partial Event and Support Transfer , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[153] Shigeki Sagayama,et al. Extending Nonnegative Matrix Factorization—A discussion in the context of multiple frequency estimation of musical signals , 2009, 2009 17th European Signal Processing Conference.

[154] M.G. Christensen,et al. Multi-Pitch Estimation Using Harmonic Music , 2006, 2006 Fortieth Asilomar Conference on Signals, Systems and Computers.

[155] Geoffroy Peeters,et al. Music Pitch Representation by Periodicity Measures Based on Combined Temporal and Spectral Representations , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[156] Jun Wu,et al. Multipitch estimation by joint modeling of harmonic and transient sounds , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[157] Karin Dressler. MULTIPLE FUNDAMENTAL FREQUENCY EXTRACTION FOR MIREX 2012 , 2011 .

[158] Samer A. Abdallah,et al. Towards music perception by redundancy reduction and unsupervised learning in probabilistic models , 2002 .

[159] Nicolás Ruiz-Reyes,et al. Polyphonic Piano Transcription Based on Spectral Separation , 2008 .

[160] Matija Marolt,et al. Automatic Transcription of Bell Chiming Recordings , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[161] Tetsuya Ogata,et al. Initialization-robust multipitch estimation based on latent harmonic allocation using overtone corpus , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[162] Bernhard Niedermayer. Non-Negative Matrix Division for the Automatic Transcription of Polyphonic Music , 2008, ISMIR.

[163] Paris Smaragdis,et al. Mitsubishi Electric Research Laboratories , 1994 .

[164] Derry Fitzgerald,et al. GENERALISED PRIOR SUBSPACE ANALYSIS FOR POLYPHONIC PITCH TRANSCRIPTION , 2005 .

[165] M.P. Ryynanen,et al. Polyphonic music transcription using note event modeling , 2005, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005..

[166] Vesa T. Peltonen,et al. Audio-based context recognition , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[167] Masataka Goto,et al. A robust predominant-F0 estimation method for real-time detection of melody and bass lines in CD recordings , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[168] Mark F. Bocko,et al. A Real-Time Signal Processing Framework of Musical Expressive Feature Extraction Using Matlab , 2011, ISMIR.

[169] Roland Badeau,et al. Scale-invariant probabilistic latent component analysis , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[170] Axel Röbel,et al. Multiple Fundamental Frequency Estimation and Polyphony Inference of Polyphonic Music Signals , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[171] Francisco Javier Casajus-Quiros,et al. Approaching polyphonic transcription of piano sounds , 2007 .

[172] Siegmund Levarie,et al. A theory of harmony , 1985 .

[173] Nicolás Ruiz-Reyes,et al. Overlapped event-note separation based on partials amplitude and phase estimation for polyphonic music transcription , 2009, 2009 17th European Signal Processing Conference.

[174] Roland Badeau,et al. Automatic transcription of piano music based on HMM tracking of jointly-estimated pitches , 2008, 2008 16th European Signal Processing Conference.

[175] Changshui Zhang,et al. Multiple Fundamental Frequency Estimation by Modeling Spectral Peaks and Non-Peak Regions , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[176] Raul Kompass,et al. A Generalized Divergence Measure for Nonnegative Matrix Factorization , 2007, Neural Computation.

[177] Xavier Rodet,et al. Fundamental frequency estimation and tracking using maximum likelihood harmonic matching and HMMs , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[178] Anssi Klapuri,et al. Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[179] Matija Marolt,et al. A connectionist approach to automatic transcription of polyphonic piano music , 2004, IEEE Transactions on Multimedia.

[180] Anssi Klapuri,et al. Latent semantic analysis in sound event detection , 2011, 2011 19th European Signal Processing Conference.

[181] Matti Ryynänen,et al. Automatic Transcription of Pitch Content in Music and Selected Applications , 2008 .

[182] Anssi Klapuri,et al. Multipitch estimation and sound separation by the spectral smoothness principle , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[183] Michael I. Jordan,et al. Factorial Hidden Markov Models , 1995, Machine Learning.

[184] Dan Tidhar,et al. Estimation of harpsichord inharmonicity and temperament from musical recordings. , 2012, The Journal of the Acoustical Society of America.

[185] Christian Schörkhuber. CONSTANT-Q TRANSFORM TOOLBOX FOR MUSIC PROCESSING , 2010 .

[186] M. Davy,et al. Bayesian analysis of polyphonic western tonal music. , 2006, The Journal of the Acoustical Society of America.

[187] Dan Tidhar,et al. High precision frequency estimation for harpsichord tuning classification , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[188] Hirokazu Kameoka,et al. Extraction of Multiple Fundamental Frequencies from Polyphonic Music Using Harmonic Clustering , 2003 .

[189] Björn Schuller,et al. Automatic Transcription of Recorded Music , 2012 .

[190] Bryan Pardo,et al. Song-level multi-pitch tracking by heavily constrained clustering , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[191] David A. Krubsack,et al. A spectral autocorrelation method for measurement of the fundamental frequency of noise-corrupted speech , 1987, IEEE Trans. Acoust. Speech Signal Process..

[192] Gert Cauwenberghs,et al. Monaural separation of independent acoustical components , 1999, ISCAS'99. Proceedings of the 1999 IEEE International Symposium on Circuits and Systems VLSI (Cat. No.99CH36349).

[193] P. Smaragdis,et al. Non-negative matrix factorization for polyphonic music transcription , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[194] Daniel P. W. Ellis,et al. A Probabilistic Subspace Model for Multi-instrument Polyphonic Transcription , 2010, ISMIR.

[195] Hirokazu Kameoka,et al. Bayesian nonparametric spectrogram modeling based on infinite factorial infinite hidden Markov model , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[196] Joshua D. Reiss,et al. A REAL-TIME POLYPHONIC MUSIC TRANSCRIPTION SYSTEM , 2008 .

[197] Xuejing Sun. A pitch determination algorithm based on subharmonic-to-harmonic ratio , 2000, INTERSPEECH.

[198] Richard F. Lyon,et al. A perceptual pitch detector , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[199] Mark D. Plumbley,et al. Structure-aware dictionary learning with harmonic atoms , 2011, 2011 19th European Signal Processing Conference.

[200] Giovanni Saggio,et al. A sensor interface based on sparse NMF for piano musical transcription , 2011, 2011 4th IEEE International Workshop on Advances in Sensors and Interfaces (IWASI).

[201] Graham E. Poliner,et al. Improving Generalization for Classification-Based Polyphonic Piano Transcription , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[202] Giovanni Costantini,et al. Event based transcription system for polyphonic piano music , 2009, Signal Process..

[203] Vesa Välimäki,et al. The effect of inharmonicity on pitch in string instrument sounds , 2000, ICMC.

[204] Julius O. Smith,et al. A non-negative framework for joint modeling of spectral structure and temporal dynamics in sound mixtures , 2010 .

[205] Emmanuel Vincent,et al. Harmonic and inharmonic Nonnegative Matrix Factorization for Polyphonic Pitch transcription , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[206] Xavier Rodet,et al. Spectral Envelope Estimation and Representation for Sound Analysis-Synthesis , 1999, ICMC.

[207] Bhiksha Raj,et al. A Probabilistic Latent Variable Model for Acoustic Modeling , 2006 .

[208] Kuansan Wang,et al. Auditory representations of acoustic signals , 1992, IEEE Trans. Inf. Theory.

[209] David Barber,et al. Generative model based polyphonic music transcription , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[210] Mathieu Lagrange,et al. Characterisation of acoustic scenes using a temporally-constrained shift-invariant model , 2012 .

[211] Francisco Cañadas,et al. MULTIPLE FUNDAMENTAL FREQUENCY ESTIMATION & TRACKING IN POLYPHONIC MUSIC FOR MIREX 2010 , 2010 .

[212] Markus Schedl,et al. Polyphonic piano note transcription with recurrent neural networks , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[213] Simon Dixon,et al. Polyphonic music transcription using note onset and offset detection , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[214] David Barber,et al. A generative model for music transcription , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[215] Mark D. Plumbley,et al. Polyphonic music transcription by non-negative sparse coding of power spectra , 2004 .

[216] David Wessel,et al. Realtime Multiple-pitch and Multiple-instrument Recognition For Music Signals using Sparse Non-negative Constraints , 2007 .

[217] E. Vincent,et al. TWO NONNEGATIVE MATRIX FACTORIZATION METHODS FOR POLYPHONIC PITCH TRANSCRIPTION , 2007 .

[218] Daniel P. W. Ellis,et al. Transcribing Multi-Instrument Polyphonic Music With Hierarchical Eigeninstruments , 2011, IEEE Journal of Selected Topics in Signal Processing.

[219] Paris Smaragdis,et al. Non-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic Inputs , 2004, ICA.

[220] Marc Moonen,et al. A Robust and Computationally Efficient Subspace-Based Fundamental Frequency Estimator , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[221] Graham E. Poliner,et al. Melody Transcription From Music Audio: Approaches and Evaluation , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[222] Paul M. Brossier,et al. Automatic annotation of musical audio for interactive applications , 2006 .

[223] Shigeki Sagayama,et al. Multipitch Analysis with Harmonic Nonnegative Matrix Approximation , 2007, ISMIR.

[224] John A. Nelder,et al. A Simplex Method for Function Minimization , 1965, Comput. J..

[225] S. Schwerman,et al. The Physics of Musical Instruments , 1991 .

[226] Lawrence R. Rabiner,et al. On the use of autocorrelation analysis for pitch detection , 1977 .

[227] Anssi Klapuri,et al. Automatic Music Transcription as We Know it Today , 2004 .

[228] Andreas Jakobsson,et al. Multi-Pitch Estimation , 2009, Multi-Pitch Estimation.

[229] J. Stephen Downie,et al. How Significant is Statistically Significant? The case of Audio Music Similarity and Retrieval , 2012, ISMIR.

[230] Roy D. Patterson,et al. An instantaneous-frequency-based pitch extraction method for high-quality speech transformation: revised TEMPO in the STRAIGHT-suite , 1998, ICSLP.

[231] Stephen W. Hainsworth,et al. Techniques for the Automated Analysis of Musical Audio , 2004 .

[232] David Lu. Automatic Music Transcription Using Genetic Algorithms and Electronic Synthesis David Lu ! ! April , 2006 .

[233] R Meddis,et al. Modeling the identification of concurrent vowels with different fundamental frequencies. , 1992, The Journal of the Acoustical Society of America.

[234] Olivier Derrien. Multi-Scale Frame-Based Analysis of Audio Signals for Musical Transcription Using a Dictionary of Chromatic Waveforms , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[235] Juhan Nam,et al. A Classification-Based Polyphonic Piano Transcription Approach Using Learned Feature Representations , 2011, ISMIR.

[236] Xavier Rodet,et al. Discrete Cepstrum Coefficients as Perceptual Features , 2003, ICMC.

[237] Mike E. Davies,et al. Unsupervised learning of sparse and shift-invariant decompositions of polyphonic music , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[238] Umut Şimşekli,et al. A COMPARISON OF PROBABILISTIC MODELS FOR ONLINE PITCH TRACKING , 2010 .

[239] DeLiang Wang,et al. Monaural Musical Sound Separation Based on Pitch and Common Amplitude Modulation , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[240] R. O. Schmidt,et al. Multiple emitter location and signal Parameter estimation , 1986 .

[241] Simon Dixon,et al. Temporally-Constrained Convolutive Probabilistic Latent Component Analysis for Multi-pitch Detection , 2012, LVA/ICA.

[242] Liu Sheng,et al. Automatic Transcription Method for Polyphonic Music Based on Adaptive Comb Filter and Neural Network , 2007, 2007 International Conference on Mechatronics and Automation.

[243] Ye Wang,et al. Music transcription using an instrument model , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[244] A.P. Klapuri,et al. A perceptually motivated multiple-F0 estimation method , 2005, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005..

[245] Roland Badeau,et al. Blind Signal Decompositions for Automatic Transcription of Polyphonic Music: NMF and K-SVD on the Benchmark , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[246] José Manuel Iñesta Quereda,et al. Polyphonic monotimbral music transcription using dynamic networks , 2005, Pattern Recognit. Lett..

[247] Masataka Goto,et al. Infinite Latent Harmonic Allocation: A Nonparametric Bayesian Approach to Multipitch Analysis , 2010, ISMIR.

[248] Mark R. Every,et al. Separation of synchronous pitched notes by spectral filtering of harmonics , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[249] Ana M. Barbancho,et al. Transcription of piano recordings , 2004 .

[250] Joshua D. Reiss,et al. A Computationally Efficient Method for Polyphonic Pitch Estimation , 2009, EURASIP J. Adv. Signal Process..

[251] M. Ross,et al. Average magnitude difference function pitch extractor , 1974 .

[252] Meinard Müller,et al. Estimating note intensities in music recordings , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[253] Nicolás Ruiz-Reyes,et al. Music Scene-Adaptive Harmonic Dictionary for Unsupervised Note-Event Detection , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[254] Masataka Goto,et al. A real-time music-scene-description system: predominant-F0 estimation for detecting melody and bass lines in real-world audio signals , 2004, Speech Commun..

[255] Ali Taylan Cemgil,et al. Probabilistic latent tensor factorization framework for audio modeling , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[256] Patrick Susini,et al. Perceptual study of soundscapes in train stations , 2008 .

[257] Emmanuel Vincent,et al. Adaptive Harmonic Spectral Decomposition for Multiple Pitch Estimation , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[258] Christian Uhle. An Investigation of Low-Level Signal Descriptors Characterizing the Noise-Like Nature of an Audio Signal , 2010 .

[259] Dan Tidhar,et al. The Temperament Police: The Truth, the Ground Truth, and Nothing but the Truth , 2011, ISMIR.

[260] Simon Dixon,et al. Multiple-instrument polyphonic music transcription using a convolutive probabilistic model , 2011 .

[261] Simon J. Godsill,et al. Multiple Pitch Estimation Using Non-Homogeneous Poisson Processes , 2011, IEEE Journal of Selected Topics in Signal Processing.

[262] Michael Groble. MULTIPLE FUNDAMENTAL FREQUENCY ESTIMATION , 2008 .

[263] Mark D. Plumbley,et al. Structured sparsity for automatic music transcription , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[264] Arshia Cont. Realtime Multiple Pitch Observation using Sparse Non-negative Constraints , 2006, ISMIR.

[265] Eamonn J. Keogh,et al. Searching and Mining Trillions of Time Series Subsequences under Dynamic Time Warping , 2012, KDD.

[266] Mikkel N. Schmidt,et al. Sparse Non-negative Matrix Factor 2-D Deconvolution , 2006 .

[267] Yonghong Yan,et al. MULTIPLE F0 ESTIMATION IN POLYPHONIC MUSIC (MIREX 2007) , 2007 .

[268] A. de Cheveigné. Cancellation model of pitch perception. , 1998, The Journal of the Acoustical Society of America.

[269] Nicolás Ruiz-Reyes,et al. Improving multiple-F0 estimation by onset detection for polyphonic music transcription , 2010, 2010 IEEE International Workshop on Multimedia Signal Processing.

[270] Jens WELLHAUSEN. Towards Automatic Music Transcription : Extraction of MIDI-Data out of Polyphonic Piano Music , 2013 .

[271] Axel Röbel,et al. Multiple fundamental frequency estimation of polyphonic music signals , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[272] Grosvenor W. Cooper,et al. The Rhythmic Structure of Music , 1971 .

[273] Yannis Stylianou,et al. Three Dimensions of Pitched Instrument Onset Detection , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[274] Xavier Rodet,et al. Music Transcription with ISA and HMM , 2004, ICA.

[275] Andreas Jakobsson,et al. Joint High-Resolution Fundamental Frequency and Order Estimation , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[276] François Pachet,et al. The bag-of-frames approach to audio pattern recognition: a sufficient model for urban soundscapes but not for polyphonic music. , 2007, The Journal of the Acoustical Society of America.

[277] Joseph Tabrikian,et al. Maximum A Posteriori Probability Multiple-Pitch Tracking Using the Harmonic Model , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[278] Peng Li,et al. Multipitch Detection Based on Weighted Summary Correlogram , 2008, 2008 6th International Symposium on Chinese Spoken Language Processing.

[279] Arnold Schoenberg. Theory of Harmony , 1948 .

[280] P. Stoica,et al. Cyclic minimizers, majorization techniques, and the expectation-maximization algorithm: a refresher , 2004, IEEE Signal Process. Mag..

[281] Simon Dixon,et al. Multiple-F0 estimation of piano sounds exploiting spectral structure and temporal evolution , 2010, SAPA@INTERSPEECH.

[282] Anssi Klapuri,et al. Accompaniment separation and karaoke application based on automatic melody transcription , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[283] Luis I. Ortiz-Berenguer,et al. PIANO TRANSCRIPTION USING PATTERN RECOGNITION: ASPECTS ON PARAMETER EXTRACTION , 2004 .

[284] Mikkel N. Schmidt,et al. Shift Invariant Sparse Coding of Image and Music Data , 2007 .

[285] Ali Taylan Cemgil,et al. Bayesian Music Transcription , 1997 .

[286] Aníbal Ferreira,et al. Measuring music transcription results based on a hybrid decay/sustain evaluation , 2009 .

[287] Gaël Richard,et al. A Musically Motivated Mid-Level Representation for Pitch Estimation and Musical Audio Source Separation , 2011, IEEE Journal of Selected Topics in Signal Processing.

[288] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[289] L. Lathauwer,et al. Signal Processing based on Multilinear Algebra , 1997 .

[290] Bingjun Zhang,et al. Application-Specific Music Transcription for Tutoring , 2008, IEEE MultiMedia.

[291] William P. Birmingham,et al. Algorithms for Chordal Analysis , 2002, Computer Music Journal.