Pitch Analysis for Active Music Discovery

A significant proportion of commercial music is comprised of pitched content: a melody, a bass line, a famous guitar solo, etc. Consequently, algorithms that are capable of extracting and understanding this type of pitched content open up numerous opportunities for active music discovery, ranging from query-by-humming to musicalfeature-based exploration of Indian art music or recommendation based on singing style. In this talk I will describe some of my work on algorithms for pitch content analysis of music audio signals and their application to music discovery, the role of machine learning in these algorithms, and the challenge posed by the scarcity of labeled data and how we may address it.

[1]  Roland Badeau,et al.  Multipitch Estimation of Piano Sounds Using a New Probabilistic Spectral Smoothness Principle , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Emilia Gómez,et al.  Melody Extraction From Polyphonic Music Signals Using Pitch Contour Characteristics , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Axel Röbel,et al.  Statistical Characterisation of Melodic Pitch Contours and its Application for Melody Extraction , 2012, ISMIR.

[4]  Xavier Serra,et al.  Dunya: a system for browsing audio music collections exploiting cultural context , 2013, ISMIR 2013.

[5]  Daniel P. W. Ellis,et al.  Melody Extraction from Polyphonic Music Signals: Approaches, applications, and challenges , 2014, IEEE Signal Processing Magazine.

[6]  Daniel P. W. Ellis,et al.  Classification-based melody transcription , 2006, Machine Learning.

[7]  Emilia Gómez,et al.  Musical genre classification using melody features extracted from polyphonic music signals , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[8]  Shigeki Sagayama,et al.  Melody line estimation in homophonic music audio signals based on temporal-variability of melodic source , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[9]  Peter Knees,et al.  Music Recommender Systems , 2015, Recommender Systems Handbook.

[10]  Xavier Serra,et al.  A Multipitch Approach to Tonic Identification in Indian Classical Music , 2012, ISMIR.

[11]  Slim Essid,et al.  Melody Extraction by Contour Classification , 2015, ISMIR.

[12]  Jordi Bonada WIDE-BAND HARMONIC SINUSOIDAL MODELING , 2008 .

[13]  György Fazekas,et al.  The Open Multitrack Testbed , 2014 .

[14]  Matthias Mauch,et al.  MedleyDB: A Multitrack Dataset for Annotation-Intensive MIR Research , 2014, ISMIR.

[15]  José Miguel Díaz-Báñez,et al.  Tracking Melodic Patterns in Flamenco Singing by Analyzing Polyphonic Music Recordings , 2012, ISMIR.

[16]  Gaël Richard,et al.  Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[17]  Yi-Hsuan Yang,et al.  Escaping from the Abyss of Manual Annotation: New Methodology of Building Polyphonic Datasets for Automatic Music Transcription , 2015, CMMR.