The University of Amsterdam at TREC 2012

We describe our participation in the TREC 2002 Novelty, Question answering, and Web tracks. We provide a detailed account of the ideas underlying our approaches to these tasks. All our runs used the FlexIR information retrieval system.

[1]  Jaap Kamps,et al.  Web-centric language models , 2005, CIKM '05.

[2]  Donna K. Harman,et al.  Overview of the TREC 2002 Novelty Track , 2002, TREC.

[3]  M. de Rijke,et al.  Approaches to Robust and Web Retrieval , 2003, TREC.

[4]  Maarten de Rijke,et al.  Shallow Morphological Analysis in Monolingual Information Retrieval for Dutch, German, and Italian , 2001, CLEF.

[5]  Dekang Lin,et al.  PRINCIPAR - An Efficient, Broad-coverage, Principle-based Parser , 1994, COLING.

[6]  Gilad Mishne,et al.  How frogs built the Berlin Wall , 2004 .

[7]  Eduard H. Hovy,et al.  Offline Strategies for Online Question Answering: Answering Questions Before They Are Asked , 2003, ACL.

[8]  Valentin Jijkoun,et al.  Recognizing Textual Entailment: Is Word Similarity Enough? , 2005, MLCW.

[9]  Jimmy J. Lin,et al.  Web question answering: is more always better? , 2002, SIGIR '02.

[10]  M. de Rijke,et al.  Credibility Improves Topical Blog Post Retrieval , 2008, ACL.

[11]  Edward A. Fox,et al.  Combination of Multiple Searches , 1993, TREC.

[12]  David Hawking,et al.  Overview of the TREC-2001 Web track , 2002 .

[13]  Christopher D. Manning,et al.  Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[14]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[15]  Jungyun Seo,et al.  SiteQ: Engineering High Performance QA System Using Lexico-Semantic Pattern Matching and Shallow NLP , 2001, TREC.

[16]  M. de Rijke,et al.  Type Checking in Open-Domain Question Answering , 2004, ECAI.

[17]  M. de Rijke,et al.  Adding semantics to microblog posts , 2012, WSDM '12.

[18]  Gilad Mishne,et al.  Language Models for Searching in Web Corpora , 2004, TREC.

[19]  Valentin Jijkoun,et al.  Answer Selection in a Multi-stream Open Domain Question Answering System , 2004, ECIR.

[20]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[21]  Stephen E. Robertson,et al.  On Collection Size and Retrieval Effectiveness , 2004, Information Retrieval.

[22]  Valentin Jijkoun,et al.  Data-driven type checking in open domain question answering , 2007, J. Appl. Log..

[23]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[24]  W. Bruce Croft,et al.  A Markov random field model for term dependencies , 2005, SIGIR '05.

[25]  Jennifer Chu-Carroll,et al.  Use of WordNet Hypernyms for Answering What-Is Questions , 2001, TREC.

[26]  Oren Tsur,et al.  BioGrapher: Biography Questions as a Restricted Domain Question Answering Task , 2004 .

[27]  Djoerd Hiemstra,et al.  The Importance of Prior Probabilities for Entry Page Search , 2002, SIGIR '02.

[28]  Gilad Mishne,et al.  The University of Amsterdam at the TREC 2003 Question Answering Track , 2003, TREC.

[29]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents , 2004, Inf. Process. Manag..

[30]  Yoram Singer,et al.  Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..

[31]  Stephen E. Robertson,et al.  Experimentation as a way of life: Okapi at TREC , 2000, Inf. Process. Manag..

[32]  Kenney Ng A Maximum Likelihood Ratio Information Retrieval Model , 1999, TREC.

[33]  Eugene Charniak,et al.  A Maximum-Entropy-Inspired Parser , 2000, ANLP.

[34]  Gilad Mishne,et al.  Making Stone Soup: Evaluating a Recall-Oriented Multi-stream Question Answering System for Dutch , 2004, CLEF.

[35]  Alberto H. F. Laender,et al.  The effectiveness of automatically structured queries in digital libraries , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[36]  Christof Monz,et al.  From document retrieval to question answering , 2003 .

[37]  Ellen M. Voorhees,et al.  The Tenth Text REtrieval Conference, TREC 2001 | NIST , 2002 .

[38]  Gilad Mishne,et al.  Preprocessing documents to answer Dutch questions , 2003 .

[39]  Bernardo Magnini,et al.  Is It the Right Answer? Exploiting Web Redundancy for Answer Validation , 2002, ACL.

[40]  M. de Rijke,et al.  A few examples go a long way: constructing query models from elaborate query formulations , 2008, SIGIR '08.

[41]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .

[42]  Rada Mihalcea,et al.  Wikify!: linking documents to encyclopedic knowledge , 2007, CIKM '07.

[43]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[44]  Djoerd Hiemstra,et al.  Using language models for information retrieval , 2001 .

[45]  Loren G. Terveen,et al.  Does “authority” mean quality? predicting expert quality ratings of Web documents , 2000, SIGIR '00.

[46]  Frank van Harmelen,et al.  A semantic web primer , 2004 .

[47]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[48]  Pushpak Bhattacharyya,et al.  Is question answering an acquired skill? , 2004, WWW '04.

[49]  M. de Rijke,et al.  Mapping queries to the Linking Open Data cloud: A case study using DBpedia , 2011, J. Web Semant..

[50]  Joel Tetreault,et al.  A Corpus-Based Evaluation of Centering and Pronoun Resolution , 2001, Computational Linguistics.

[51]  Valentin Jijkoun,et al.  Information Extraction for Question Answering: Improving Recall Through Syntactic Patterns , 2004, COLING.

[52]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1999, SODA '98.

[53]  M. de Rijke,et al.  Generating links to background knowledge: a case study using narrative radiology reports , 2011, CIKM '11.

[54]  Joon Ho Lee,et al.  Combining multiple evidence from different properties of weighting schemes , 1995, SIGIR '95.

[55]  M. de Rijke,et al.  Tequesta: The University of Amsterdam's Textual Question Answering System , 2001, TREC.

[56]  Suresh Manandhar,et al.  The Use of Sentence Similarity as a Semantic Relevance Metric for Question Answering , 2003, New Directions in Question Answering.

[57]  Wouter Weerkamp,et al.  Microblog language identification: overcoming the limitations of short, unedited and idiomatic text , 2012, Language Resources and Evaluation.

[58]  Julian Kupiec,et al.  MURAX: a robust linguistic approach for question answering using an on-line encyclopedia , 1993, SIGIR.

[59]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[60]  Garrison W. Cottrell,et al.  Predicting the performance of linearly combined IR systems , 1998, SIGIR '98.

[61]  W. Bruce Croft,et al.  Quantifying query ambiguity , 2002 .

[62]  Niranjan Balasubramanian,et al.  Topic Pages: An Alternative to the Ten Blue Links , 2010, 2010 IEEE Fourth International Conference on Semantic Computing.

[63]  Chris Buckley,et al.  New Retrieval Approaches Using SMART: TREC 4 , 1995, TREC.

[64]  Ellen M. Voorhees,et al.  Overview of the TREC 2004 Novelty Track. , 2005 .

[65]  Michalis Faloutsos,et al.  On power-law relationships of the Internet topology , 1999, SIGCOMM '99.

[66]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[67]  Charles L. A. Clarke,et al.  Exploiting redundancy in question answering , 2001, SIGIR '01.

[68]  Sanda M. Harabagiu,et al.  Performance Issues and Error Analysis in an Open-Domain Question Answering System , 2002, ACL.

[69]  William John Teahan,et al.  Bangor at TREC 2003: Q&A and Genomics Tracks , 2003, TREC.

[70]  Stephen E. Robertson,et al.  Effective site finding using link anchor information , 2001, SIGIR '01.

[71]  Gilad Mishne,et al.  The University of Amsterdam at QA@CLEF 2004 , 2003, CLEF.

[72]  M. de Rijke,et al.  Ranking related entities: components and analyses , 2010, CIKM.

[73]  L. Buckland UvA-DARE (Digital Academic Repository) The University of Amsterdam at TREC 2012 , 2013 .

[74]  John D. Lafferty,et al.  Model-based feedback in the language modeling approach to information retrieval , 2001, CIKM '01.

[75]  Paul Rayson,et al.  Comparing Corpora using Frequency Profiling , 2000, Proceedings of the workshop on Comparing corpora -.