Automatic Transcription of Polyphonic Music Exploiting Temporal Evolution

This work was funded by a Queen Mary University of London Westfield Trust Research Studentship.

[1]  D. Wang,et al.  Computational Auditory Scene Analysis: Principles, Algorithms, and Applications , 2008, IEEE Trans. Neural Networks.

[2]  Nicolás Ruiz-Reyes,et al.  A Joint Approach to Extract Multiple Fundamental Frequency in Polyphonic Signals Minimizing Gaussian Spectral Distance , 2009 .

[3]  Mert Bay,et al.  Evaluation of Multiple-F0 Estimation and Tracking Systems , 2009, ISMIR.

[4]  José M. Iñesta,et al.  MULTIPLE FUNDAMENTAL FREQUENCY ESTIMATION BASED ON SPECTRAL PATTERN LOUDNESS AND SMOOTHNESS , 2007 .

[5]  Dirk T. M. Slock,et al.  Perceptually motivated quasi-periodic signal selection for polyphonic music transcription , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  Hirokazu Kameoka,et al.  Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms , 2010, LVA/ICA.

[7]  Hirokazu Kameoka,et al.  Specmurt Analysis of Polyphonic Music Signals , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  David J. Field,et al.  Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[9]  J. Barbour Tuning and Temperament: A Historical Survey , 2004 .

[10]  Paris Smaragdis Polyphonic pitch tracking by example , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[11]  Isabelle Guyon,et al.  What Size Test Set Gives Good Error Rate Estimates? , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Anssi Klapuri,et al.  Automatic Music Transcription: Breaking the Glass Ceiling , 2012, ISMIR.

[13]  Paris Smaragdis,et al.  Relative pitch estimation of multiple instruments , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[14]  Anssi Klapuri,et al.  Shift-variant non-negative matrix deconvolution for music transcription , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[15]  Malcolm D. Macleod,et al.  The Automated Music Transcription Problem , 2004 .

[16]  Patrick J. Wolfe,et al.  High Time-Resolution Estimation of Multiple Fundamental Frequencies , 2007, ISMIR.

[17]  Ana Paula Rocha,et al.  Fragmentation and Frontier Evolution for Genetic Algorithms Optimization in Music Transcription , 2008, IBERAMIA.

[18]  Tuomas Virtanen,et al.  Musical Instrument Sound Multi-Excitation Model for Non-Negative Spectrogram Factorization , 2011, IEEE Journal of Selected Topics in Signal Processing.

[19]  Mark D. Plumbley,et al.  Automatic Music Transcription and Audio Source Separation , 2002, Cybern. Syst..

[20]  Matthew Brand,et al.  Pattern discovery via entropy minimization , 1999, AISTATS.

[21]  Emmanuel Vincent,et al.  Multiple Pitch Transcription using DBN-based Musicological Models , 2010, ISMIR.

[22]  Mark D. Plumbley,et al.  Unsupervised analysis of polyphonic music by sparse coding , 2006, IEEE Transactions on Neural Networks.

[23]  Judith C. Brown Calculation of a constant Q spectral transform , 1991 .

[24]  Judith C. Brown Musical fundamental frequency tracking using a pattern recognition method , 1992 .

[25]  A. Röbel,et al.  A NEW SCORE FUNCTION FOR JOINT EVALUATION OF MULTIPLEF0 HYPOTHESES , 2004 .

[26]  Ana M. Barbancho,et al.  SIC receiver for polyphonic piano music , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[27]  Masataka Goto,et al.  A Nonparametric Bayesian Multipitch Analyzer Based on Infinite Latent Harmonic Allocation , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[28]  Simon J. Godsill,et al.  Transcription of Musical Audio Using Poisson Point Processes and Sequential MCMC , 2010, CMMR.

[29]  Simon J. Godsill,et al.  Generative Spectrogram Factorization Models for Polyphonic Piano Transcription , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[30]  Shyh-Kang Jeng,et al.  Automatic Transcription for Music with Two Timbres from Monaural Sound Source , 2010, 2010 IEEE International Symposium on Multimedia.

[31]  Hideki Kawahara,et al.  YIN, a fundamental frequency estimator for speech and music. , 2002, The Journal of the Acoustical Society of America.

[32]  Anssi Klapuri,et al.  Score-informed transcription for automatic piano tutoring , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[33]  Corey Cheng,et al.  Multiple F0 Estimation in the Transform Domain , 2009, ISMIR.

[34]  Guy J. Brown,et al.  Multiple F0 Estimation , 2006 .

[35]  Bhiksha Raj,et al.  Adobe Systems , 1998 .

[36]  Anssi Klapuri A Method for Visualizing the Pitch Content of Polyphonic Music Signals , 2009, ISMIR.

[37]  Anssi Klapuri,et al.  Signal Processing Methods for Music Transcription , 2006 .

[38]  Simon Dixon,et al.  A temporally-constrained convolutive probabilistic model for pitch detection , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[39]  S. Dixon,et al.  MULTIPLE FUNDAMENTAL FREQUENCY ESTIMATION USING SPECTRAL STRUCTURE AND TEMPORAL EVOLUTION RULES , 2010 .

[40]  Ruohua Zhou,et al.  Feature extraction of musical content for automatic music transcription , 2006 .

[41]  Matti Karjalainen,et al.  A computationally efficient multipitch analysis model , 2000, IEEE Trans. Speech Audio Process..

[42]  Simon Dixon,et al.  Joint Multi-Pitch Detection Using Harmonic Envelope Estimation for Polyphonic Music Transcription , 2011, IEEE Journal of Selected Topics in Signal Processing.

[43]  Mark D. Plumbley,et al.  Polyphonic transcription by non-negative sparse coding of power spectra , 2004, ISMIR.

[44]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[45]  Nicolás Ruiz-Reyes,et al.  Polyphonic transcription based on temporal evolution of spectral similarity of gaussian mixture models , 2009, 2009 17th European Signal Processing Conference.

[46]  Anssi Klapuri,et al.  Multiple fundamental frequency estimation based on harmonicity and spectral smoothness , 2003, IEEE Trans. Speech Audio Process..

[47]  Masataka Goto,et al.  RWC Music Database: Music genre database and musical instrument sound database , 2003, ISMIR.

[48]  Anssi Klapuri,et al.  Separation of harmonic sounds using linear models for the overtone series , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[49]  Mark B. Sandler,et al.  Automatic Piano Transcription Using Frequency and Time-Domain Information , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[50]  Anssi Klapuri,et al.  Multiple Fundamental Frequency Estimation by Summing Harmonic Amplitudes , 2006, ISMIR.

[51]  Christopher Raphael,et al.  Automatic Transcription of Piano Music , 2002, ISMIR.

[52]  Roland Badeau,et al.  NMF With Time–Frequency Activations to Model Nonstationary Audio Events , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[53]  Matija Marolt Gaussian Mixture Models For Extraction Of Melodic Lines From Audio Recordings , 2004, ISMIR.

[54]  Andres Kwasinski,et al.  Automatic real-time electric guitar audio transcription , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[55]  Matti Karjalainen,et al.  Multi-pitch and periodicity analysis model for sound separation and auditory scene analysis , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[56]  Dan Stowell,et al.  Adaptive whitening for Improved Real-Time audio onset Detection , 2007, ICMC.

[57]  Andreas Jakobsson,et al.  Subspace-based fundamental frequency estimation , 2004, 2004 12th European Signal Processing Conference.

[58]  Daniel P. W. Ellis,et al.  A Discriminative Model for Polyphonic Piano Transcription , 2007, EURASIP J. Adv. Signal Process..

[59]  Hirokazu Kameoka,et al.  Explicit beat structure modeling for non-negative matrix factorization-based multipitch analysis , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[60]  Hirokazu Kameoka,et al.  Probabilistic Approach to Automatic Music Transcription from Audio Signals , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[61]  J C Brown Computer identification of musical instruments using pattern recognition with cepstral coefficients as features. , 1999, The Journal of the Acoustical Society of America.

[62]  Juan Pablo,et al.  Towards the automated analysis of simple polyphonic music : a knowledge-based approach , 2003 .

[63]  Daniel P. W. Ellis,et al.  Classification-based melody transcription , 2006, Machine Learning.

[64]  Roland Badeau,et al.  NMF With Time-Frequency Activations to Model Nonstationary Audio Events , 2011, IEEE Trans. Speech Audio Process..

[65]  Roland Badeau,et al.  Expectation-maximization algorithm for multi-pitch estimation and separation of overlapping harmonic spectra , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[66]  Andreas Jakobsson,et al.  The Multi-Pitch Estimation Problem: some New Solutions , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[67]  Michael Elad,et al.  K-SVD and its non-negative variant for dictionary design , 2005, SPIE Optics + Photonics.

[68]  Emmanuel Vincent,et al.  Fast bayesian nmf algorithms enforcing harmonicity and temporal continuity in polyphonic music transcription , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[69]  Tuomas Virtanen,et al.  Separation of sound sources by convolutive sparse coding , 2004, SAPA@INTERSPEECH.

[70]  Roland Badeau,et al.  Multipitch Estimation of Piano Sounds Using a New Probabilistic Spectral Smoothness Principle , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[71]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[72]  P. Smaragdis,et al.  Shift-Invariant Probabilistic Latent Component Analysis , 2007 .

[73]  C.-C. Jay Kuo,et al.  Sparse Music Representation With Source-Specific Dictionaries and Its Application to Signal Separation , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[74]  Shunzheng Yu,et al.  Hidden semi-Markov models , 2010, Artif. Intell..

[75]  David Gunawan,et al.  Identification of Partials in Polyphonic Mixtures Based on Temporal Envelope Similarity , 2007 .

[76]  Hirokazu Kameoka,et al.  A Multipitch Analyzer Based on Harmonic Temporal Structured Clustering , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[77]  Simon Dixon,et al.  A Shift-Invariant Latent Variable Model for Automatic Music Transcription , 2012, Computer Music Journal.

[78]  F. J. Cañadas Quesada,et al.  A Multiple-F0 Estimation Approach Based on Gaussian Spectral Modelling for Polyphonic Music Transcription , 2010 .

[79]  Md. Al Mehedi Hasan,et al.  Template music transcription for different types of musical instruments , 2010, 2010 The 2nd International Conference on Computer and Automation Engineering (ICCAE).

[80]  Daniel P. W. Ellis,et al.  Spectral vs. spectro-temporal features for acoustic event detection , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[81]  Francisco Fernández de Vega,et al.  Hybrid Genetic Algorithm Based on Gene Fragment Competition for Polyphonic Music Transcription , 2008, EvoWorkshops.

[82]  Ali Taylan Cemgil,et al.  Probabilistic Models for Real-time Acoustic Event Detection with Application to Pitch Tracking , 2011 .

[83]  Simon Dixon,et al.  Accurate Real-time Windowed Time Warping , 2010, ISMIR.

[84]  Simon J. Godsill,et al.  Point process MCMC for sequential music transcription , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[85]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[86]  José Manuel Iñesta Quereda,et al.  Multiple fundamental frequency estimation using Gaussian smoothness , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[87]  Emmanuel Vincent,et al.  Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix Factorization Applied to Polyphonic Music Transcription , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[88]  Francisco Javier Casajús-Quirós,et al.  Multi-pitch estimation for polyphonic musical signals , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[89]  Eamonn Keogh Exact Indexing of Dynamic Time Warping , 2002, VLDB.

[90]  Antonio Pertusa Ibáñez,et al.  Computationally efficient methods for polyphonic music transcription , 2010 .

[91]  D. Chazan,et al.  Automatic transcription of piano polyphonic music , 2005, ISPA 2005. Proceedings of the 4th International Symposium on Image and Signal Processing and Analysis, 2005..

[92]  Anssi Klapuri,et al.  Signal Processing Methods for the Automatic Transcription of Music , 2004 .

[93]  Michael I. Jordan,et al.  Discriminative training of hidden Markov models for multiple pitch tracking [speech processing examples] , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[94]  Christopher Raphael,et al.  Automatic Transcription of Music Audio Through Continuous Parameter Tracking , 2007, ISMIR.

[95]  Anssi Klapuri,et al.  Automatic Transcription of Melody, Bass Line, and Chords in Polyphonic Music , 2008, Computer Music Journal.

[96]  Mark Sandler,et al.  A Partial Searching Algorithm and Its Application for Polyphonic Music Transcription , 2005, ISMIR.

[97]  R. Meddis,et al.  A unitary model of pitch perception. , 1997, The Journal of the Acoustical Society of America.

[98]  José Manuel Iñesta Quereda,et al.  Efficient methods for joint estimation of multiple fundamental frequencies in music signals , 2012, EURASIP Journal on Advances in Signal Processing.

[99]  Simon J. Godsill,et al.  Bayesian harmonic models for musical pitch estimation and analysis , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[100]  Daniel P. W. Ellis,et al.  Multi-voice polyphonic music transcription using eigeninstruments , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[101]  Brian E Anderson,et al.  The effect of inharmonic partials on pitch of piano tones. , 2002, The Journal of the Acoustical Society of America.

[102]  B. Shinn-Cunningham,et al.  Latent variable framework for modeling and separating single-channel acoustic sources , 2008 .

[103]  Jaakko Astola,et al.  Analysis of the meter of acoustic musical signals , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[104]  Simon Godsill,et al.  Poisson point process modeling for polyphonic music transcription. , 2007, The Journal of the Acoustical Society of America.

[105]  Roland Badeau,et al.  Weighted maximum likelihood autoregressive and moving average spectrum modeling , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[106]  Guillaume Lemaitre,et al.  Real-time Polyphonic Music Transcription with Non-negative Matrix Factorization and Beta-divergence , 2010, ISMIR.

[107]  Fabrizio Argenti,et al.  AUTOMATIC TRANSCRIPTION OF POLYPHONIC MUSIC BASED ON CONSTANT-Q BISPECTRAL ANALYSIS FOR MIREX 2009 , 2009 .

[108]  Ye Wang,et al.  Low Level Descriptors for Automatic Violin Transcription , 2006, ISMIR.

[109]  Masataka Goto,et al.  Unsupervised music understanding based on nonparametric Bayesian models , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[110]  Liming Chen,et al.  Use of Continuous Wavelet-like Transform in automated music transcription , 2006, 2006 14th European Signal Processing Conference.

[111]  Anssi Klapuri,et al.  Automatic Transcription of Guitar Chords and Fingering From Audio , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[112]  Mark B. Sandler,et al.  A tutorial on onset detection in music signals , 2005, IEEE Transactions on Speech and Audio Processing.

[113]  Bhiksha Raj,et al.  Non-negative Hidden Markov Modeling of Audio with Application to Source Separation , 2010, LVA/ICA.

[114]  Xavier Rodet,et al.  MULTIPLE-F0 TRACKING BASED ON A HIGH-ORDER HMM MODEL , 2008 .

[115]  Nicola Orio,et al.  An HMM-based pitch tracker for audio queries , 2003, ISMIR.

[116]  Shyh-Kang Jeng,et al.  An automatic transcription system with octave detection , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[117]  Roland Badeau,et al.  Score informed audio source separation using a parametric model of non-negative spectrogram , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[118]  Tuomas Virtanen,et al.  Combining pitch-based inference and non-negative spectrogram factorization in separating vocals from polyphonic music , 2008, SAPA@INTERSPEECH.

[119]  José Manuel Iñesta Quereda,et al.  Pattern Recognition Algorithms for Polyphonic Music Transcription , 2004, PRIS.

[120]  A. Noll Cepstrum pitch determination. , 1967, The Journal of the Acoustical Society of America.

[121]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[122]  Anssi Klapuri A CLASSIFICATION APPROACH TO MULTIPITCH ANALYSIS , 2009 .

[123]  Roland Badeau,et al.  Adaptive harmonic time-frequency decomposition of audio using shift-invariant PLCA , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[124]  Mark F. Bocko,et al.  Polyphonic music transcription employing max-margin classification of spectrograhic features , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[125]  Claus Weihs,et al.  Parameter Optimization in Automatic Transcription of Music , 2005, GfKl.

[126]  Shigeki Sagayama,et al.  Note detection with dynamic bayesian networks as a postanalysis step for NMF-based multiple pitch estimation techniques , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[127]  Jun Wu,et al.  Polyphonic Pitch Estimation and Instrument Identification by Joint Modeling of Sustained and Attack Sounds , 2011, IEEE Journal of Selected Topics in Signal Processing.

[128]  Fabrizio Argenti,et al.  Automatic Transcription of Polyphonic Music Based on the Constant-Q Bispectral Analysis , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[129]  Michael Johansson April Automatic Transcription of Polyphonic Music Using Harmonic Relations , 2003 .

[130]  Judith C. Brown,et al.  An efficient algorithm for the calculation of a constant Q transform , 1992 .

[131]  R. Badeau,et al.  Multipitch estimation of quasi-harmonic sounds in colored noise , 2007 .

[132]  Daniel P. W. Ellis,et al.  IMPROVING GENERALIZATION FOR POLYPHONIC PIANO TRANSCRIPTION , 2007 .

[133]  Kunio Kashino,et al.  Application of the Bayesian probability network to music scene analysis , 1998 .

[134]  Yi-Hsuan Yang,et al.  Automatic transcription of piano music by sparse representation of magnitude spectra , 2011, 2011 IEEE International Conference on Multimedia and Expo.

[135]  Simon Dixon,et al.  Automatically detecting key modulations in J.S. Bach chorale recordings , 2011 .

[136]  Rémi Gribonval,et al.  Harmonic decomposition of audio signals with matching pursuit , 2003, IEEE Trans. Signal Process..

[137]  Bhiksha Raj,et al.  Probabilistic Latent Variable Models as Nonnegative Factorizations , 2008, Comput. Intell. Neurosci..

[138]  Hirokazu Kameoka,et al.  Infinite-state spectrum model for music signal analysis , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[139]  Jean-Pierre Martens,et al.  Assessment of State-of-the-Art Meter Analysis Systems with an Extended Meter Description Model , 2007, ISMIR.

[140]  Simon J. Godsill,et al.  Bayesian harmonic models for musical signal analysis , 2003 .

[141]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[142]  Paris Smaragdis Relative-pitch tracking of multiple arbitrary sounds. , 2009, The Journal of the Acoustical Society of America.

[143]  Randal J. Leistikow,et al.  A New Probabilistic Spectral Pitch Estimator: Exact and MCMC-approximate Strategies , 2004, CMMR.

[144]  Mark B. Sandler,et al.  Techniques for Automatic Music Transcription , 2000, ISMIR.

[145]  Daniel P. W. Ellis,et al.  Signal Processing for Music Analysis , 2011, IEEE Journal of Selected Topics in Signal Processing.

[146]  Amara Lynn Graps,et al.  An introduction to wavelets , 1995 .

[147]  Andreas Rauber,et al.  Improving Genre Classification by Combination of Audio and Symbolic Descriptors Using a Transcription Systems , 2007, ISMIR.

[148]  DeLiang Wang,et al.  Pitch Detection in Polyphonic Music using Instrument Tone Models , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[149]  Simon Dixon,et al.  On the Computer Recognition of Solo Piano Music , 2000 .

[150]  Yi-Hsuan Yang,et al.  Multipitch Estimation of Piano Music by Exemplar-Based Sparse Representation , 2012, IEEE Transactions on Multimedia.

[151]  Ana M. Barbancho,et al.  Polyphony Number Estimator for Piano Recordings Using Different Spectral Patterns , 2010 .

[152]  Dan Zhang,et al.  Multi-Pitch Estimation Based on Partial Event and Support Transfer , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[153]  Shigeki Sagayama,et al.  Extending Nonnegative Matrix Factorization—A discussion in the context of multiple frequency estimation of musical signals , 2009, 2009 17th European Signal Processing Conference.

[154]  M.G. Christensen,et al.  Multi-Pitch Estimation Using Harmonic Music , 2006, 2006 Fortieth Asilomar Conference on Signals, Systems and Computers.

[155]  Geoffroy Peeters,et al.  Music Pitch Representation by Periodicity Measures Based on Combined Temporal and Spectral Representations , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[156]  Jun Wu,et al.  Multipitch estimation by joint modeling of harmonic and transient sounds , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[157]  Karin Dressler MULTIPLE FUNDAMENTAL FREQUENCY EXTRACTION FOR MIREX 2012 , 2011 .

[158]  Samer A. Abdallah,et al.  Towards music perception by redundancy reduction and unsupervised learning in probabilistic models , 2002 .

[159]  Nicolás Ruiz-Reyes,et al.  Polyphonic Piano Transcription Based on Spectral Separation , 2008 .

[160]  Matija Marolt,et al.  Automatic Transcription of Bell Chiming Recordings , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[161]  Tetsuya Ogata,et al.  Initialization-robust multipitch estimation based on latent harmonic allocation using overtone corpus , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[162]  Bernhard Niedermayer Non-Negative Matrix Division for the Automatic Transcription of Polyphonic Music , 2008, ISMIR.

[163]  Paris Smaragdis,et al.  Mitsubishi Electric Research Laboratories , 1994 .

[164]  Derry Fitzgerald,et al.  GENERALISED PRIOR SUBSPACE ANALYSIS FOR POLYPHONIC PITCH TRANSCRIPTION , 2005 .

[165]  M.P. Ryynanen,et al.  Polyphonic music transcription using note event modeling , 2005, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005..

[166]  Vesa T. Peltonen,et al.  Audio-based context recognition , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[167]  Masataka Goto,et al.  A robust predominant-F0 estimation method for real-time detection of melody and bass lines in CD recordings , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[168]  Mark F. Bocko,et al.  A Real-Time Signal Processing Framework of Musical Expressive Feature Extraction Using Matlab , 2011, ISMIR.

[169]  Roland Badeau,et al.  Scale-invariant probabilistic latent component analysis , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[170]  Axel Röbel,et al.  Multiple Fundamental Frequency Estimation and Polyphony Inference of Polyphonic Music Signals , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[171]  Francisco Javier Casajus-Quiros,et al.  Approaching polyphonic transcription of piano sounds , 2007 .

[172]  Siegmund Levarie,et al.  A theory of harmony , 1985 .

[173]  Nicolás Ruiz-Reyes,et al.  Overlapped event-note separation based on partials amplitude and phase estimation for polyphonic music transcription , 2009, 2009 17th European Signal Processing Conference.

[174]  Roland Badeau,et al.  Automatic transcription of piano music based on HMM tracking of jointly-estimated pitches , 2008, 2008 16th European Signal Processing Conference.

[175]  Changshui Zhang,et al.  Multiple Fundamental Frequency Estimation by Modeling Spectral Peaks and Non-Peak Regions , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[176]  Raul Kompass,et al.  A Generalized Divergence Measure for Nonnegative Matrix Factorization , 2007, Neural Computation.

[177]  Xavier Rodet,et al.  Fundamental frequency estimation and tracking using maximum likelihood harmonic matching and HMMs , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[178]  Anssi Klapuri,et al.  Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[179]  Matija Marolt,et al.  A connectionist approach to automatic transcription of polyphonic piano music , 2004, IEEE Transactions on Multimedia.

[180]  Anssi Klapuri,et al.  Latent semantic analysis in sound event detection , 2011, 2011 19th European Signal Processing Conference.

[181]  Matti Ryynänen,et al.  Automatic Transcription of Pitch Content in Music and Selected Applications , 2008 .

[182]  Anssi Klapuri,et al.  Multipitch estimation and sound separation by the spectral smoothness principle , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[183]  Michael I. Jordan,et al.  Factorial Hidden Markov Models , 1995, Machine Learning.

[184]  Dan Tidhar,et al.  Estimation of harpsichord inharmonicity and temperament from musical recordings. , 2012, The Journal of the Acoustical Society of America.

[185]  Christian Schörkhuber CONSTANT-Q TRANSFORM TOOLBOX FOR MUSIC PROCESSING , 2010 .

[186]  M. Davy,et al.  Bayesian analysis of polyphonic western tonal music. , 2006, The Journal of the Acoustical Society of America.

[187]  Dan Tidhar,et al.  High precision frequency estimation for harpsichord tuning classification , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[188]  Hirokazu Kameoka,et al.  Extraction of Multiple Fundamental Frequencies from Polyphonic Music Using Harmonic Clustering , 2003 .

[189]  Björn Schuller,et al.  Automatic Transcription of Recorded Music , 2012 .

[190]  Bryan Pardo,et al.  Song-level multi-pitch tracking by heavily constrained clustering , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[191]  David A. Krubsack,et al.  A spectral autocorrelation method for measurement of the fundamental frequency of noise-corrupted speech , 1987, IEEE Trans. Acoust. Speech Signal Process..

[192]  Gert Cauwenberghs,et al.  Monaural separation of independent acoustical components , 1999, ISCAS'99. Proceedings of the 1999 IEEE International Symposium on Circuits and Systems VLSI (Cat. No.99CH36349).

[193]  P. Smaragdis,et al.  Non-negative matrix factorization for polyphonic music transcription , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[194]  Daniel P. W. Ellis,et al.  A Probabilistic Subspace Model for Multi-instrument Polyphonic Transcription , 2010, ISMIR.

[195]  Hirokazu Kameoka,et al.  Bayesian nonparametric spectrogram modeling based on infinite factorial infinite hidden Markov model , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[196]  Joshua D. Reiss,et al.  A REAL-TIME POLYPHONIC MUSIC TRANSCRIPTION SYSTEM , 2008 .

[197]  Xuejing Sun A pitch determination algorithm based on subharmonic-to-harmonic ratio , 2000, INTERSPEECH.

[198]  Richard F. Lyon,et al.  A perceptual pitch detector , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[199]  Mark D. Plumbley,et al.  Structure-aware dictionary learning with harmonic atoms , 2011, 2011 19th European Signal Processing Conference.

[200]  Giovanni Saggio,et al.  A sensor interface based on sparse NMF for piano musical transcription , 2011, 2011 4th IEEE International Workshop on Advances in Sensors and Interfaces (IWASI).

[201]  Graham E. Poliner,et al.  Improving Generalization for Classification-Based Polyphonic Piano Transcription , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[202]  Giovanni Costantini,et al.  Event based transcription system for polyphonic piano music , 2009, Signal Process..

[203]  Vesa Välimäki,et al.  The effect of inharmonicity on pitch in string instrument sounds , 2000, ICMC.

[204]  Julius O. Smith,et al.  A non-negative framework for joint modeling of spectral structure and temporal dynamics in sound mixtures , 2010 .

[205]  Emmanuel Vincent,et al.  Harmonic and inharmonic Nonnegative Matrix Factorization for Polyphonic Pitch transcription , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[206]  Xavier Rodet,et al.  Spectral Envelope Estimation and Representation for Sound Analysis-Synthesis , 1999, ICMC.

[207]  Bhiksha Raj,et al.  A Probabilistic Latent Variable Model for Acoustic Modeling , 2006 .

[208]  Kuansan Wang,et al.  Auditory representations of acoustic signals , 1992, IEEE Trans. Inf. Theory.

[209]  David Barber,et al.  Generative model based polyphonic music transcription , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[210]  Mathieu Lagrange,et al.  Characterisation of acoustic scenes using a temporally-constrained shift-invariant model , 2012 .

[211]  Francisco Cañadas,et al.  MULTIPLE FUNDAMENTAL FREQUENCY ESTIMATION & TRACKING IN POLYPHONIC MUSIC FOR MIREX 2010 , 2010 .

[212]  Markus Schedl,et al.  Polyphonic piano note transcription with recurrent neural networks , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[213]  Simon Dixon,et al.  Polyphonic music transcription using note onset and offset detection , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[214]  David Barber,et al.  A generative model for music transcription , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[215]  Mark D. Plumbley,et al.  Polyphonic music transcription by non-negative sparse coding of power spectra , 2004 .

[216]  David Wessel,et al.  Realtime Multiple-pitch and Multiple-instrument Recognition For Music Signals using Sparse Non-negative Constraints , 2007 .

[217]  E. Vincent,et al.  TWO NONNEGATIVE MATRIX FACTORIZATION METHODS FOR POLYPHONIC PITCH TRANSCRIPTION , 2007 .

[218]  Daniel P. W. Ellis,et al.  Transcribing Multi-Instrument Polyphonic Music With Hierarchical Eigeninstruments , 2011, IEEE Journal of Selected Topics in Signal Processing.

[219]  Paris Smaragdis,et al.  Non-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic Inputs , 2004, ICA.

[220]  Marc Moonen,et al.  A Robust and Computationally Efficient Subspace-Based Fundamental Frequency Estimator , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[221]  Graham E. Poliner,et al.  Melody Transcription From Music Audio: Approaches and Evaluation , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[222]  Paul M. Brossier,et al.  Automatic annotation of musical audio for interactive applications , 2006 .

[223]  Shigeki Sagayama,et al.  Multipitch Analysis with Harmonic Nonnegative Matrix Approximation , 2007, ISMIR.

[224]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[225]  S. Schwerman,et al.  The Physics of Musical Instruments , 1991 .

[226]  Lawrence R. Rabiner,et al.  On the use of autocorrelation analysis for pitch detection , 1977 .

[227]  Anssi Klapuri,et al.  Automatic Music Transcription as We Know it Today , 2004 .

[228]  Andreas Jakobsson,et al.  Multi-Pitch Estimation , 2009, Multi-Pitch Estimation.

[229]  J. Stephen Downie,et al.  How Significant is Statistically Significant? The case of Audio Music Similarity and Retrieval , 2012, ISMIR.

[230]  Roy D. Patterson,et al.  An instantaneous-frequency-based pitch extraction method for high-quality speech transformation: revised TEMPO in the STRAIGHT-suite , 1998, ICSLP.

[231]  Stephen W. Hainsworth,et al.  Techniques for the Automated Analysis of Musical Audio , 2004 .

[232]  David Lu Automatic Music Transcription Using Genetic Algorithms and Electronic Synthesis David Lu ! ! April , 2006 .

[233]  R Meddis,et al.  Modeling the identification of concurrent vowels with different fundamental frequencies. , 1992, The Journal of the Acoustical Society of America.

[234]  Olivier Derrien Multi-Scale Frame-Based Analysis of Audio Signals for Musical Transcription Using a Dictionary of Chromatic Waveforms , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[235]  Juhan Nam,et al.  A Classification-Based Polyphonic Piano Transcription Approach Using Learned Feature Representations , 2011, ISMIR.

[236]  Xavier Rodet,et al.  Discrete Cepstrum Coefficients as Perceptual Features , 2003, ICMC.

[237]  Mike E. Davies,et al.  Unsupervised learning of sparse and shift-invariant decompositions of polyphonic music , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[238]  Umut Şimşekli,et al.  A COMPARISON OF PROBABILISTIC MODELS FOR ONLINE PITCH TRACKING , 2010 .

[239]  DeLiang Wang,et al.  Monaural Musical Sound Separation Based on Pitch and Common Amplitude Modulation , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[240]  R. O. Schmidt,et al.  Multiple emitter location and signal Parameter estimation , 1986 .

[241]  Simon Dixon,et al.  Temporally-Constrained Convolutive Probabilistic Latent Component Analysis for Multi-pitch Detection , 2012, LVA/ICA.

[242]  Liu Sheng,et al.  Automatic Transcription Method for Polyphonic Music Based on Adaptive Comb Filter and Neural Network , 2007, 2007 International Conference on Mechatronics and Automation.

[243]  Ye Wang,et al.  Music transcription using an instrument model , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[244]  A.P. Klapuri,et al.  A perceptually motivated multiple-F0 estimation method , 2005, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005..

[245]  Roland Badeau,et al.  Blind Signal Decompositions for Automatic Transcription of Polyphonic Music: NMF and K-SVD on the Benchmark , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[246]  José Manuel Iñesta Quereda,et al.  Polyphonic monotimbral music transcription using dynamic networks , 2005, Pattern Recognit. Lett..

[247]  Masataka Goto,et al.  Infinite Latent Harmonic Allocation: A Nonparametric Bayesian Approach to Multipitch Analysis , 2010, ISMIR.

[248]  Mark R. Every,et al.  Separation of synchronous pitched notes by spectral filtering of harmonics , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[249]  Ana M. Barbancho,et al.  Transcription of piano recordings , 2004 .

[250]  Joshua D. Reiss,et al.  A Computationally Efficient Method for Polyphonic Pitch Estimation , 2009, EURASIP J. Adv. Signal Process..

[251]  M. Ross,et al.  Average magnitude difference function pitch extractor , 1974 .

[252]  Meinard Müller,et al.  Estimating note intensities in music recordings , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[253]  Nicolás Ruiz-Reyes,et al.  Music Scene-Adaptive Harmonic Dictionary for Unsupervised Note-Event Detection , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[254]  Masataka Goto,et al.  A real-time music-scene-description system: predominant-F0 estimation for detecting melody and bass lines in real-world audio signals , 2004, Speech Commun..

[255]  Ali Taylan Cemgil,et al.  Probabilistic latent tensor factorization framework for audio modeling , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[256]  Patrick Susini,et al.  Perceptual study of soundscapes in train stations , 2008 .

[257]  Emmanuel Vincent,et al.  Adaptive Harmonic Spectral Decomposition for Multiple Pitch Estimation , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[258]  Christian Uhle An Investigation of Low-Level Signal Descriptors Characterizing the Noise-Like Nature of an Audio Signal , 2010 .

[259]  Dan Tidhar,et al.  The Temperament Police: The Truth, the Ground Truth, and Nothing but the Truth , 2011, ISMIR.

[260]  Simon Dixon,et al.  Multiple-instrument polyphonic music transcription using a convolutive probabilistic model , 2011 .

[261]  Simon J. Godsill,et al.  Multiple Pitch Estimation Using Non-Homogeneous Poisson Processes , 2011, IEEE Journal of Selected Topics in Signal Processing.

[262]  Michael Groble MULTIPLE FUNDAMENTAL FREQUENCY ESTIMATION , 2008 .

[263]  Mark D. Plumbley,et al.  Structured sparsity for automatic music transcription , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[264]  Arshia Cont Realtime Multiple Pitch Observation using Sparse Non-negative Constraints , 2006, ISMIR.

[265]  Eamonn J. Keogh,et al.  Searching and Mining Trillions of Time Series Subsequences under Dynamic Time Warping , 2012, KDD.

[266]  Mikkel N. Schmidt,et al.  Sparse Non-negative Matrix Factor 2-D Deconvolution , 2006 .

[267]  Yonghong Yan,et al.  MULTIPLE F0 ESTIMATION IN POLYPHONIC MUSIC (MIREX 2007) , 2007 .

[268]  A. de Cheveigné Cancellation model of pitch perception. , 1998, The Journal of the Acoustical Society of America.

[269]  Nicolás Ruiz-Reyes,et al.  Improving multiple-F0 estimation by onset detection for polyphonic music transcription , 2010, 2010 IEEE International Workshop on Multimedia Signal Processing.

[270]  Jens WELLHAUSEN Towards Automatic Music Transcription : Extraction of MIDI-Data out of Polyphonic Piano Music , 2013 .

[271]  Axel Röbel,et al.  Multiple fundamental frequency estimation of polyphonic music signals , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[272]  Grosvenor W. Cooper,et al.  The Rhythmic Structure of Music , 1971 .

[273]  Yannis Stylianou,et al.  Three Dimensions of Pitched Instrument Onset Detection , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[274]  Xavier Rodet,et al.  Music Transcription with ISA and HMM , 2004, ICA.

[275]  Andreas Jakobsson,et al.  Joint High-Resolution Fundamental Frequency and Order Estimation , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[276]  François Pachet,et al.  The bag-of-frames approach to audio pattern recognition: a sufficient model for urban soundscapes but not for polyphonic music. , 2007, The Journal of the Acoustical Society of America.

[277]  Joseph Tabrikian,et al.  Maximum A Posteriori Probability Multiple-Pitch Tracking Using the Harmonic Model , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[278]  Peng Li,et al.  Multipitch Detection Based on Weighted Summary Correlogram , 2008, 2008 6th International Symposium on Chinese Spoken Language Processing.

[279]  Arnold Schoenberg Theory of Harmony , 1948 .

[280]  P. Stoica,et al.  Cyclic minimizers, majorization techniques, and the expectation-maximization algorithm: a refresher , 2004, IEEE Signal Process. Mag..

[281]  Simon Dixon,et al.  Multiple-F0 estimation of piano sounds exploiting spectral structure and temporal evolution , 2010, SAPA@INTERSPEECH.

[282]  Anssi Klapuri,et al.  Accompaniment separation and karaoke application based on automatic melody transcription , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[283]  Luis I. Ortiz-Berenguer,et al.  PIANO TRANSCRIPTION USING PATTERN RECOGNITION: ASPECTS ON PARAMETER EXTRACTION , 2004 .

[284]  Mikkel N. Schmidt,et al.  Shift Invariant Sparse Coding of Image and Music Data , 2007 .

[285]  Ali Taylan Cemgil,et al.  Bayesian Music Transcription , 1997 .

[286]  Aníbal Ferreira,et al.  Measuring music transcription results based on a hybrid decay/sustain evaluation , 2009 .

[287]  Gaël Richard,et al.  A Musically Motivated Mid-Level Representation for Pitch Estimation and Musical Audio Source Separation , 2011, IEEE Journal of Selected Topics in Signal Processing.

[288]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[289]  L. Lathauwer,et al.  Signal Processing based on Multilinear Algebra , 1997 .

[290]  Bingjun Zhang,et al.  Application-Specific Music Transcription for Tutoring , 2008, IEEE MultiMedia.

[291]  William P. Birmingham,et al.  Algorithms for Chordal Analysis , 2002, Computer Music Journal.