Paper Information - TIRA: Configuring, Executing, and Disseminating Information Retrieval Experiments

With its close ties to the Web, the information retrieval community is destined to leverage the dissemination and collaboration capabilities that the Web provides today. Especially with the advent of the software-as-a-service principle, one can envision an information retrieval community that publishes experiments executable by anyone over the Web. A review of recent SIGIR papers shows that we are far from this vision of collaboration. Publishing information retrieval experiments as a service offers striking benefits for the community as a whole, including the potential to boost research profiles and reputation. However, for this paradigm to become accepted practice, the additional work must be kept to a minimum and sensitive data must be kept private. To foster experiments as a service in information retrieval, we present the TIRA (Testbed for Information Retrieval Algorithms) web framework, which addresses the outlined challenges and offers a unique set of compelling features compared to existing web-based solutions. To describe TIRA in a practical setting, we explain how it is currently used as an official evaluation platform for the well-established PAN international plagiarism detection competition. We also describe how it can be used in future scenarios: for search result clustering of non-static collections of web query results, and within a simulation data mining setting to support interactive structural design in civil engineering.
