LogCLEF 2009: the CLEF 2009 Multilingual Logfile Analysis Track Overview

Log data are an important resource for evaluating the quality of a search engine and of a multilingual search service: they can be used to study how a search engine is used and to adapt it better to the objectives users expect to reach. The Cross Language Evaluation Forum (CLEF) promoted multilingual log analysis for the first time with a track named LogCLEF. LogCLEF is an evaluation initiative for the analysis of queries and other logged activities as expressions of user behavior. The goal is to analyze and classify queries in order to understand search behavior, especially in multilingual contexts, and ultimately to improve search systems. Two tasks were defined: Log Analysis and Geographic Query Identification (LAGI), which aimed at identifying queries for geographic content, and Log Analysis for Digital Societies (LADS), which was based on analyzing user behavior in the search logs of The European Library service. Five groups submitted experiments using a variety of approaches. The data for the track, the evaluation methodology, and the results are presented and discussed.
