Competence-based song recommendation

Singing is a popular social activity and a good way of expressing one's feelings. One important reason for unsuccessful singing performance is because the singer fails to choose a suitable song. In this paper, we propose a novel singing competence-based song recommendation framework. It is distinguished from most existing music recommendation systems which rely on the computation of listeners' interests or similarity. We model a singer's vocal competence as singer profile, which takes voice pitch, intensity, and quality into consideration. Then we propose techniques to acquire singer profiles. We also present a song profile model which is used to construct a human annotated song database. Finally, we propose a learning-to-rank scheme for recommending songs by singer profile. The experimental study on real singers demonstrates the effectiveness of our approach and its advantages over two baseline methods. To the best of our knowledge, our work is the first to study competence-based song recommendation.

[1]  Jaana Kekäläinen,et al.  IR evaluation methods for retrieving highly relevant documents , 2000, SIGIR '00.

[2]  T. Baer,et al.  Harmonics-to-noise ratio as an index of the degree of hoarseness. , 1982, The Journal of the Acoustical Society of America.

[3]  Gang Chen,et al.  myDJ: recommending karaoke songs from one's own voice , 2012, SIGIR '12.

[4]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.

[5]  P H DeJonckere,et al.  Efficacy of voice therapy assessed with the Voice Range Profile (Phonetogram). , 2003, Revue de laryngologie - otologie - rhinologie.

[6]  W Seidner,et al.  Recommendation by the Union of European Phoniatricians (UEP): standardizing voice area measurement/phonetography. , 1983, Folia phoniatrica.

[7]  Dimitar D. Deliyski,et al.  Acoustic model and evaluation of pathological voice production , 1993, EUROSPEECH.

[8]  P. Van cauwenberge,et al.  Toward improved ecological validity in the acoustic measurement of overall voice quality: combining continuous speech and sustained vowels. , 2010, Journal of voice : official journal of the Voice Foundation.

[9]  Keiichiro Hoashi,et al.  Personalization of user profiles for content-based music retrieval based on relevance feedback , 2003, ACM Multimedia.

[10]  J. P. Pabon,et al.  Objective acoustic voice-quality parameters in the computer phonetogram , 1991 .

[11]  Wei-Ho Tsai,et al.  Automatic Evaluation of Karaoke Singing Based on Pitch, Volume, and Rhythm Features , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Brian Christopher Smith,et al.  Query by humming: musical information retrieval in an audio database , 1995, MULTIMEDIA '95.

[13]  H. K. Schutte,et al.  Differences in phonetogram features between male and female subjects with and without vocal training. , 1995, Journal of voice : official journal of the Voice Foundation.

[14]  J. Estis,et al.  The singing power ratio as an objective measure of singing voice quality in untrained talented and nontalented singers. , 2006, Journal of voice : official journal of the Voice Foundation.

[15]  R Plomp,et al.  Automatic phonetogram recording supplemented with acoustical voice-quality parameters. , 1988, Journal of speech and hearing research.

[16]  P. Boersma Praat : doing phonetics by computer (version 4.4.24) , 2006 .

[17]  Douglas B. Terry,et al.  Using collaborative filtering to weave an information tapestry , 1992, CACM.

[18]  F L Wuyts,et al.  Normative voice range profiles of male and female professional voice users. , 2002, Journal of voice : official journal of the Voice Foundation.

[19]  S. K. Wolf,et al.  Quantitative Studies on the Singing Voice , 1935 .

[20]  Tie-Yan Liu,et al.  Learning to rank: from pairwise approach to listwise approach , 2007, ICML '07.

[21]  Berit Schneider,et al.  Normative voice range profiles in vocally trained and untrained children aged between 7 and 10 years. , 2010, Journal of voice : official journal of the Voice Foundation.

[22]  F L Wuyts,et al.  Phonetography in voice diagnoses. , 1996, Acta oto-rhino-laryngologica Belgica.

[23]  Anssi Klapuri,et al.  Automatic Music Transcription: Breaking the Glass Ceiling , 2012, ISMIR.

[24]  Anssi Klapuri,et al.  Multi-Template Shift-Variant Non-Negative Matrix Deconvolution for Semi-Automatic Music Transcription , 2012, ISMIR.