Proceedings of the Third Dutch-Belgian Information Retrieval Workshop (DIR 2002)

Statistically based approaches to language and information processing (LIP) were novelties in the 1960s. They were hard to implement in practice, for lack of data and machine power, and hard for many to accept in theory. But they were successfully applied to document retrieval, showing how LIP tasks could be viewed in a productive new way. In the last decade there have been striking further developments, extending statistically based LIP theory and practice in novel and challenging directions. The talk will illustrate some of the possibilities and problems involved.
