Finding geographically representative music via social media

People can draw a myriad of semantic associations with music. The semantics can be geographical, ethnographical, society- or time-driven, or simply personal. For certain types of music, however, this semantic association is more prominent and coherent across most peoples. Such music can often serve as an ideal accompaniment for a user activity or setting (that shares the semantics of the music), especially in media authoring applications. Among the strongest associations a piece of music can have is with the geographical area from which it generates. With video-sharing in sites such as YouTube having become a norm, one would expect that music videos tagged with a geographic location keyword are representative of the respective geographical theme. However, in the past few years, the proliferance of western pop culture throughout the world has resulted in popularity of ethnic pop (resembling Western pop) that sounds quite distinct from traditional regional music. While a human expert may easily distinguish between such ethnic pop and traditional regional music, the problem of automatically differentiating between them is still new. The problem becomes more challenging with similarities in music from many different regions. In this paper, we attempt to automatically identify music with strong geographical semantics (that is, "traditional-sounding music" for different geographical regions), using only music gathered from social media sources as our training and testing data. We also explore the use of hierarchical clustering to discover relationships between the music of different cultures, again using only social media.

[1]  Tao Jin,et al.  Automatic Generation of Music Slide Show Using Personal Photos , 2008, 2008 Tenth IEEE International Symposium on Multimedia.

[2]  Thierry Bertin-Mahieux,et al.  The Million Song Dataset , 2011, ISMIR.

[3]  O. Lartillot,et al.  A MATLAB TOOLBOX FOR MUSICAL FEATURE EXTRACTION FROM AUDIO , 2007 .

[4]  Jiebo Luo,et al.  Photo-centric multimedia authoring enhanced by cross-media indexing , 2005, Visual Communications and Image Processing.

[5]  Petri Toiviainen,et al.  MIR in Matlab (II): A Toolbox for Musical Feature Extraction from Audio , 2007, ISMIR.

[6]  P. Mermelstein,et al.  Distance measures for speech recognition, psychological and instrumental , 1976 .

[7]  Jiebo Luo,et al.  Geotagging in multimedia and computer vision—a survey , 2010, Multimedia Tools and Applications.

[8]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[9]  Keinosuke Fukunaga,et al.  Introduction to statistical pattern recognition (2nd ed.) , 1990 .

[10]  Thierry Bertin-Mahieux,et al.  Automatic Generation of Social Tags for Music Recommendation , 2007, NIPS.

[11]  Mark D. Wood Matching Songs to Events in Image Collections , 2009, 2009 IEEE International Conference on Semantic Computing.

[12]  Marc Leman,et al.  Content-Based Music Information Retrieval: Current Directions and Future Challenges , 2008, Proceedings of the IEEE.

[13]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[14]  Alexandra L. Uitdenbogerd,et al.  Combining Demographic Data with Collaborative Filtering for Automatic Music Recommendation , 2005, KES.

[15]  Jiebo Luo,et al.  Suggesting Songs for Media Creation Using Semantics , 2010, 2010 20th International Conference on Pattern Recognition.

[16]  N. Scaringella,et al.  Automatic genre classification of music content: a survey , 2006, IEEE Signal Process. Mag..

[17]  Jun-Cheng Chen,et al.  Tiling slideshow , 2006, MM '06.

[18]  Yuen-Hsien Tseng,et al.  Content-based retrieval for music collections , 1999, SIGIR '99.

[19]  Daniel P. W. Ellis,et al.  A Large-Scale Evaluation of Acoustic and Subjective Music-Similarity Measures , 2004, Computer Music Journal.

[20]  J. Wade Davis,et al.  Statistical Pattern Recognition , 2003, Technometrics.