Overview of INEX 2007 Link the Wiki Track

Wikipedia is becoming ever more popular. Linking between documents is typically provided in similar environments in order to achieve collaborative knowledge sharing. However, this functionality in Wikipedia is not integrated into the document creation process and the quality of automatically generated links has never been quantified. The Link the Wiki (LTW) track at INEX in 2007 aimed at producing a standard procedure, metrics and a discussion forum for the evaluation of link discovery. The tasks offered by the LTW track as well as its evaluation present considerable research challenges. This paper briefly described the LTW task and the procedure of evaluation used at LTW track in 2007. Automated link discovery methods used by participants are outlined. An overview of the evaluation results is concisely presented and further experiments are reported.

[1]  Linda Schamber Relevance and Information Behavior. , 1994 .

[2]  Ludovic Denoyer,et al.  The XML Wikipedia Corpus , 2006 .

[3]  Li Xiong,et al.  NNexus: Towards an Automatic Linker for a Massively-Distributed Collaborative Corpus , 2006, 2006 International Conference on Collaborative Computing: Networking, Applications and Worksharing.

[4]  Jihong Zeng,et al.  From keywords to links: an automatic approach , 2004, International Conference on Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004..

[5]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[6]  Alan F. Smeaton,et al.  Automatic link generation , 1999, CSUR.

[7]  David Ellis,et al.  On the measurement of inter-linker consistency and retrieval effectiveness in hypertext databases , 1994, SIGIR '94.

[8]  Stephen J. Green,et al.  Automated Link Generation: Can we do Better than Term Repetition? , 1998, Comput. Networks.

[9]  Jaana Kekäläinen,et al.  Using graded relevance assessments in IR evaluation , 2002, J. Assoc. Inf. Sci. Technol..

[10]  Pierre Senellart,et al.  Finding Related Pages Using Green Measures: An Illustration with Wikipedia , 2007, AAAI.

[11]  Monika Henzinger,et al.  Finding Related Pages in the World Wide Web , 1999, Comput. Networks.

[12]  M. de Rijke,et al.  Discovering missing links in Wikipedia , 2005, LinkKDD '05.

[13]  Andrew Trotman,et al.  Passage Retrieval and other XML-Retrieval Tasks , 2006, SIGIR 2006.

[14]  Shlomo Geva,et al.  The Methodology of Manual Assessment in the Evaluation of Link Discovery , 2009 .

[15]  Stephen J. Green,et al.  Building Hypertext Links By Computing Semantic Similarity , 1999, IEEE Trans. Knowl. Data Eng..

[16]  Ravi Kumar,et al.  Trawling the Web for Emerging Cyber-Communities , 1999, Comput. Networks.

[17]  Simone Paolo Ponzetto,et al.  WikiRelate! Computing Semantic Relatedness Using Wikipedia , 2006, AAAI.

[18]  Ludovic Denoyer,et al.  The Wikipedia XML Corpus , 2006, INEX.

[19]  Pertti Vakkari,et al.  Changes in relevance criteria and problem stages in task performance , 2000, J. Documentation.

[20]  Charles L. A. Clarke,et al.  University of Waterloo at INEX2007: Adhoc and Link-the-Wiki Tracks , 2007, INEX.

[21]  Andrew Trotman,et al.  Experiments and evaluation of link discovery in the Wikipedia , 2008 .

[22]  Aaron Phillip Krowne,et al.  An Architecture for Collaborative Math and Science Digital Libraries , 2003 .

[23]  Jennifer Widom,et al.  SimRank: a measure of structural-context similarity , 2002, KDD.

[24]  Péter Schönhofen,et al.  Identifying Document Topics Using the Wikipedia Category Network , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[25]  M. M. Kessler Bibliographic coupling between scientific papers , 1963 .

[26]  James Allan Building Hypertext Using Information Retrieval , 1997, Inf. Process. Manag..