Predicting the Geographical Origin of Music

Traditional research into the arts has almost always been based around the subjective judgment of human critics. The use of data mining tools to understand art has great promise as it is objective and operational. We investigate the distribution of music from around the world: geographical ethnomusicology. We cast the problem as training a machine learning program to predict the geographical origin of pieces of music. This is a technically interesting problem as it has features of both classification and regression, and because of the spherical geometry of the surface of the Earth. Because of these characteristics of the representation of geographical positions, most standard classification/regression methods cannot be directly used. Two applicable methods are K-Nearest Neighbors and Random forest regression, which are robust to the non-standard structure of data. We also investigated improving performance through use of bagging. We collected 1,142 pieces of music from 73 countries/areas, and described them using 2 different sets of standard audio descriptors using MARSYAS. 10-fold cross validation was used in all experiments. The experimental results indicate that Random forest regression produces significantly better results than KNN, and the use of bagging improves the performance of KNN. The best performing algorithm achieved a mean great circle distance error of 3,113 km.

[1]  Jens Grivolla,et al.  Multimodal Music Mood Classification Using Audio and Lyrics , 2008, 2008 Seventh International Conference on Machine Learning and Applications.

[2]  Zhe Jiang,et al.  Spatial Statistics , 2013 .

[3]  Tim Pohle,et al.  The ISMIR Cloud: A Decade of ISMIR Conferences at Your Fingertips , 2009, ISMIR.

[4]  George Tzanetakis,et al.  MARSYAS: a framework for audio analysis , 1999, Organised Sound.

[5]  Emilia Gómez,et al.  Music and Geography: Content Description of Musical Audio from Different Parts of the World , 2009, ISMIR.

[6]  Donald Adjeroh,et al.  Random knn modeling and variable selection for high dimensional data , 2009 .

[7]  Erik Duval,et al.  A Web-based Approach to Determine the Origin of an Artist , 2009, ISMIR.

[8]  J. R. Quinlan Learning With Continuous Classes , 1992 .

[9]  Jeroen Breebaart,et al.  Features for audio and music classification , 2003, ISMIR.

[10]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[11]  Neil Davey,et al.  Prediction of Skin Penetration Using Machine Learning Methods , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[12]  L. Torgo,et al.  Inductive learning of tree-based regression models , 1999 .

[13]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[14]  Lianhong Cai,et al.  Cultural style based music classification of audio signals , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  R. Lewis An Introduction to Classification and Regression Tree (CART) Analysis , 2000 .

[16]  Peter Knees,et al.  A music search engine built upon audio-based and web-based similarity measures , 2007, SIGIR.

[17]  G. Widmer,et al.  From Sound to ‘ Sense ’ via Feature Extraction and Machine Learning : Deriving High-Level Descriptors for Characterising Music , 2005 .

[18]  J. Grey Multidimensional perceptual scaling of musical timbres. , 1977, The Journal of the Acoustical Society of America.

[19]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[20]  Gert R. G. Lanckriet,et al.  Combining audio content and social context for semantic music discovery , 2009, SIGIR.