Automatically Detecting Members and Instrumentation of Music Bands Via Web Content Mining

In this paper, we present an approach to automatically detecting the members of music bands and their instrumentation using web content mining techniques. To this end, we combine a named entity detection method with a rule-based linguistic text analysis approach extended by a rule filtering step. We report the results of several evaluation experiments carried out on two test collections of bands covering a wide range of popularity levels. The performance of the proposed approach is evaluated using precision and recall. We further investigate the influence of three factors: the query scheme used for web page retrieval, a critical parameter used in the rule filtering step, and the string matching function applied to deal with inconsistent spellings of band member names.
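To illustrate how the choice of string matching function can affect precision and recall, the following is a minimal sketch and not the implementation evaluated in this paper. The function names, the similarity threshold of 0.85, the use of Python's `difflib.SequenceMatcher` as the similarity measure, and the band member names are all assumptions made for illustration only.

```python
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lowercase a name and collapse runs of whitespace."""
    return " ".join(name.lower().split())


def exact_match(a: str, b: str) -> bool:
    """Strict matching: names must be identical after normalization."""
    return normalize(a) == normalize(b)


def fuzzy_match(a: str, b: str, threshold: float = 0.85) -> bool:
    """Tolerant matching (assumed stand-in for the paper's matching functions):
    accept names whose SequenceMatcher ratio reaches the hypothetical threshold."""
    ratio = SequenceMatcher(None, normalize(a), normalize(b)).ratio()
    return ratio >= threshold


def precision_recall(predicted, truth, match=exact_match):
    """Precision: share of predicted members that match some true member.
    Recall: share of true members matched by some predicted member."""
    correct_pred = sum(1 for p in predicted if any(match(p, t) for t in truth))
    found_truth = sum(1 for t in truth if any(match(p, t) for p in predicted))
    precision = correct_pred / len(predicted) if predicted else 0.0
    recall = found_truth / len(truth) if truth else 0.0
    return precision, recall


# Hypothetical ground truth and extraction output (invented names).
truth = ["Anna Meier", "Jon Olafsson", "Pete Brown"]
predicted = ["Anna Meier", "Jon Olafson", "Sam Green"]

print(precision_recall(predicted, truth, exact_match))
print(precision_recall(predicted, truth, fuzzy_match))
```

With these invented lists, exact matching credits only one of the three predictions, whereas the tolerant function also accepts the misspelled "Jon Olafson", raising both precision and recall from 1/3 to 2/3. A lower threshold would accept more spelling variants at the risk of matching distinct names.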
