论文信息 - The impact of author ranking in a library catalogue

The impact of author ranking in a library catalogue

The field of information retrieval has witnessed over 50 years of research on retrieval methods for metadata descriptions and controlled indexing languages, the prototypical example being the library catalogue. It seems only natural to resort to additional data for improving book retrieval, such as the text of the book in whole or in part (table of contents, abstract) or contributed social data acquired through crowdsourcing social cataloguing sites like LibraryThing. Without denying the potential value of such additional data, we want to challenge the underlying assumption that applying novel retrieval methods to traditional book descriptions cannot improve book retrieval. Specifically, this paper investigates the effectiveness of author rankings in a library catalogue. We show that a standard retrieval model results in a book ranking that meets and exceeds the effectiveness of catalogue systems. We show that using expert finding methods we also can obtain effective author rankings that complement the traditional book rankings. Moreover, ranking books on author scores leads to substantial and significant improvements over the original book rankings. If we base our book ranking on the combination of the author scores and the book scores we see no further improvements. Hence our results clearly demonstrate the importance of author ranking for retrieving library catalogue records: authors capture an important aspect of relevance and one that is not obvious to those unfamiliar with specific area of interest.

Jaap Kamps

[1] Christine L. Borgman,et al. Why are Online Catalogs Hard to Use? Lessons Learned from Information=Retrieval Studies , 1986 .

[2] Nick Craswell,et al. Overview of the TREC 2005 Enterprise Track , 2005, TREC.

[3] M. de Rijke,et al. A language modeling framework for expert finding , 2009, Inf. Process. Manag..

[4] Christine L. Borgman,et al. Why are online catalogs still hard to use , 1996 .

[5] Mounia Lalmas,et al. Overview of the INEX 2007 Entity Ranking Track , 2008, INEX.

[6] Karen Markey,et al. The Online Library Catalog: Paradise Lost and Paradise Regained? , 2007, D Lib Mag..

[7] Djoerd Hiemstra,et al. PFTijah: text search in an XML database system , 2006 .

[8] Christine L. Borgman,et al. Why are online catalogs hard to use? Lessons learned from information-retrieval studies , 1986, J. Am. Soc. Inf. Sci..

[9] Karen Coyle,et al. Resource Description and Access (RDA): Cataloging Rules for the 20th Century , 2007 .

[10] Peter Bailey,et al. The CSIRO enterprise search test collection , 2007, SIGF.

[11] Djoerd Hiemstra,et al. Modeling Documents as Mixtures of Persons for Expert Finding , 2008, ECIR.