Charting a new course : natural language processing and information retrieval : essays in honour of Karen Spärck Jones

Preface.- A Retrospective View of Synonymy and Semantic Classification.- On the Early History of Evaluation in IR.- The Emergence of Probabilistic Accounts of Information Retrieval.- Lovins Revisited.- The History of IDF and its Influences on IR and Other Fields.- Beyond English Text: Multilingual and Multimedia Information Retrieval.- Summarization.- Question Answering.- Noun Compounds Revisited.- Lexical Decomposition: For and Against.- The Importance of Focused Evaluations: a Case Study of TREC and DUC.- Mice from a Mountain: Reflections on Current Issues in Evaluation of Written Language Technology.- The Evaluation of Retrieval Effectiveness in Chemical Database Searching.- Unhappy Bedfellows: the Relationship of AI and IR.- Index.

[1]  R. Manmatha,et al.  A search engine for historical manuscript images , 2004, SIGIR '04.

[2]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[3]  Karen Sparck Jones Automatic keyword classification for information retrieval , 1971 .

[4]  J. Stephen Downie,et al.  Evaluation of a simple and effective music information retrieval method , 2000, SIGIR '00.

[5]  K. Sparck Jones,et al.  INFORMATION RETRIEVAL TEST COLLECTIONS , 1976 .

[6]  K. Sparck Jones,et al.  KEYWORDS AND CLUMPS , 1964 .

[7]  Karen Spärck Jones,et al.  A Natural Language Front End to Databases with Evaluative Feedback , 1983, ICOD-2 Workshop on New Applications of Data Bases.

[8]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[9]  Cyril W. Cleverdon,et al.  Factors determining the performance of indexing systems , 1966 .

[10]  Michael Johnston,et al.  Qualia Structure and the Compositional Interpretation of Compounds , 1999 .

[11]  David Yarowsky,et al.  Hierarchical Decision Lists for Word Sense Disambiguation , 2000, Comput. Humanit..

[12]  Michael E. Lesk,et al.  Computer Evaluation of Indexing and Text Processing , 1968, JACM.

[13]  Hsin-Hsi Chen,et al.  Cross-Language Chinese Text Retrieval in NTCIR Workshop: towards Cross-Language multilingual Text Retrieval , 2001, SIGF.

[14]  Jian-Yun Nie,et al.  Cross-language information retrieval based on parallel texts and automatic mining of parallel texts from the Web , 1999, SIGIR '99.

[15]  Rosemary Leonard,et al.  The Interpretation of English Noun Sequences on the Computer , 1984 .

[16]  Tetsuya Sakai,et al.  Toshiba BRIDJE at NTCIR-4 CLIR: Monolingual/Bilingual IR and Flexible Feedback , 2004, NTCIR.

[17]  C. J. van Rijsbergen,et al.  Report on the need for and provision of an 'ideal' information retrieval test collection , 1975 .

[18]  Karen Spärck Jones,et al.  Experiments in Spoken Document Retrieval , 1996, Inf. Process. Manag..

[19]  M. E. Maron,et al.  On Relevance, Probabilistic Indexing and Information Retrieval , 1960, JACM.

[20]  J. Scott McCarley Should we Translate the Documents or the Queries in Cross-language Information Retrieval? , 1999, ACL.

[21]  Karen Sparck Jones A statistical interpretation of term specificity and its application in retrieval , 1972 .

[22]  Dan Flickinger,et al.  Minimal Recursion Semantics: An Introduction , 2005 .

[23]  Kenneth Ward Church,et al.  Coping with Syntactic Ambiguity or How to Put the Block in the Box on the Table , 1982, CL.

[24]  Karen Spärck Jones,et al.  Readings in natural language processing , 1986 .

[25]  Alexander M. Fraser,et al.  TREC 2001 Cross-lingual Retrieval at BBN , 2001, TREC.

[26]  Sanda M. Harabagiu,et al.  Open-domain textual question answering techniques , 2003, Natural Language Engineering.

[27]  Karen Spärck Jones,et al.  Retrieving spoken documents by combining multiple index sources , 1996, SIGIR '96.

[28]  Mark T. Maybury Toward a Question Answering Roadmap , 2003, New Directions in Question Answering.

[29]  Robert Krovetz,et al.  Word sense disambiguation for large text databases , 1996 .

[30]  Jacques Savoy,et al.  Combining Multiple Strategies for Effective Monolingual and Cross-Language Retrieval , 2004, Information Retrieval.

[31]  Gilbert Harman,et al.  Semantics of natural language , 2004, Synthese.

[32]  Ted Briscoe,et al.  The Derivation of a Grammatically Indexed Lexicon from the Longman Dictionary of Contemporary English , 1987, ACL.

[33]  Mirella Lapata,et al.  Detecting Novel Compounds: The Role of Distributional Evidence , 2003, EACL.

[34]  Martin Kay,et al.  The MIND System , 1970 .

[35]  Judith N. Levi,et al.  The syntax and semantics of complex nominals , 1978 .

[36]  Karen Spärck Jones Document Retrieval: Shallow Data, Deep Theories; Historical Reflections, Potential Directions , 2003, ECIR.

[37]  Barbara Rosario,et al.  Classifying the Semantic Relations in Noun Compounds via a Domain-Specific Lexical Hierarchy , 2001, EMNLP.

[38]  Dan Flickinger,et al.  An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG , 2000, LREC.

[39]  Peter Bailey,et al.  Engineering a multi-purpose test collection for Web retrieval experiments , 2003, Inf. Process. Manag..

[40]  Paul Over,et al.  The TREC VIdeo Retrieval Evaluation (TRECVID): A Case Study and Status Report , 2004, RIAO.

[41]  Philip Ball Snap, crackle and pop , 2000 .

[42]  Graeme Hirst,et al.  Semantic interpretation against ambiguity , 1984 .

[43]  Jean Paul Ballerini,et al.  Experiments in multilingual information retrieval using the SPIDER system , 1996, SIGIR '96.

[44]  Cyril W. Cleverdon,et al.  Aslib Cranfield research project: report on the testing and analysis of an investigation into the comparative efficiency of indexing systems , 1962 .

[45]  Karen Spärck Jones,et al.  Generic summaries for indexing in information retrieval , 2001, SIGIR '01.

[46]  James Pustejovsky,et al.  The Generative Lexicon , 1995, CL.

[47]  F. W. Lancaster,et al.  Vocabulary control for information retrieval , 1972 .

[48]  Donna K. Harman,et al.  The Importance of Focused Evaluations: a Case Study of TREC and DUC" , 2001, NTCIR.

[49]  Julie Beth Lovins,et al.  Development of a stemming algorithm , 1968, Mech. Transl. Comput. Linguistics.

[50]  Manabu Okumura,et al.  Text summarization challenge 2: text summarization evaluation at NTCIR workshop 3 , 2001, HLT-NAACL 2003.

[51]  M. Liberman,et al.  The Stress and Structure of Modified Noun Phrases in English , 1992 .

[52]  Karen Spärck Jones Experiments in relevance weighting of search terms , 1979, Inf. Process. Manag..

[53]  Smaranda Muresan,et al.  GIST-IT: Combining Linguistic and Machine Learning Techniques for Email Summarization , 2001 .

[54]  John Tait,et al.  On the Generality of Thesaurally derived Lexical Links , 2000 .

[55]  Thijs Westerveld,et al.  Multimedia Retrieval Using Multiple Examples , 2004, CIVR.

[56]  Mark T. Maybury,et al.  Planning multisentential English text using communicative acts , 1991 .

[57]  F. R. Palmer,et al.  A linguistic study of the English verb , 1968 .

[58]  Paul Procter,et al.  Longman Dictionary of Contemporary English , 1978 .

[59]  Karen Spärck Jones Index term weighting , 1973, Inf. Storage Retr..

[60]  Mirella Lapata,et al.  A Probabilistic Account of Logical Metonymy , 2003, Computational Linguistics.

[61]  K. Sparck Jones,et al.  What makes an automatic keyword classification effective , 1971 .

[62]  Richard Tucker,et al.  Automatic summarising and the CLASP system , 2000 .

[63]  Ellen M. Voorhees,et al.  Building a question answering test collection , 2000, SIGIR '00.

[64]  Gareth J. F. Jones,et al.  An Investigation of Mixed-Media Information Retrieval , 2002, ECDL.

[65]  S. Robertson The probability ranking principle in IR , 1997 .

[66]  Martin Hassel,et al.  Evaluation of Automatic Text Summarization , 2004 .

[67]  Edward A. Fox,et al.  Characterization of Two New Experimental Collections in Computer and Information Science Containing Textual and Bibliographic Concepts , 1983 .

[68]  Donna K. Harman,et al.  Overview of the First Text REtrieval Conference (TREC-1) , 1992, TREC.

[69]  Thierry Pun,et al.  Efficient access methods for content-based image retrieval with inverted files , 1999, Optics East.

[70]  Richard A. Posner,et al.  Reasoning by Analogy , 1886, The Indian medical gazette.

[71]  I. A. Richards English Through Pictures , 2005 .

[72]  Branimir Konstantinov Boguraev,et al.  Automatic resolution of linguistic ambiguities , 1979 .

[73]  Mark T. Maybury,et al.  Advances in Automatic Text Summarization , 1999 .

[74]  Karen Sparck Jones Natural Language Processing: A Historical Review , 1994 .

[75]  Gregory Grefenstette,et al.  Querying across languages: a dictionary-based approach to multilingual information retrieval , 1996, SIGIR '96.

[76]  Karen Sparck Jones,et al.  Spoken Document Retrieval for TREC-8 at Cambridge University , 1998, TREC.

[77]  Gareth J. F. Jones,et al.  Experiments in Japanese text retrieval and routing using the NEAT system , 1998, SIGIR '98.

[78]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[79]  Mark T. Maybury Universal multimedia information access , 2003, Universal Access in the Information Society.

[80]  David R. Dowty,et al.  Word Meaning and Montague Grammar , 1979 .

[81]  Wessel Kraaij,et al.  Evaluation of a Dutch stemming algorithm , 1994 .

[82]  Ted Briscoe,et al.  Robust Accurate Statistical Annotation of General Text , 2002, LREC.

[83]  Eugene A. Nida,et al.  A System for the Description of Semantic Elements , 1951 .

[84]  Stephen E. Robertson,et al.  Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval , 1994, SIGIR '94.

[85]  Gerard Salton,et al.  On the Specification of Term Values in Automatic Indexing , 1973 .

[86]  Paul Over,et al.  The Effects of Human Variation in DUC Summarization Evaluation , 2004 .

[87]  李幼升,et al.  Ph , 1989 .

[88]  Jamie Callan,et al.  DISTRIBUTED INFORMATION RETRIEVAL , 2002 .

[89]  Regina Barzilay,et al.  Towards Multidocument Summarization by Reformulation: Progress and Prospects , 1999, AAAI/IAAI.

[90]  Karen Sparck Jones User Models, Discourse Models, and Some Others , 1988, CL.

[91]  Karen Spärck Jones,et al.  Information Retrieval from Unsegmented Broadcast News Audio , 2001, Int. J. Speech Technol..

[92]  Pamela A. Downing On the Creation and Use of English Compound Nouns. , 1977 .

[93]  Mary Elizabeth Stevens,et al.  Automatic indexing : a state-of-the art report , 1965 .

[94]  Karen Spärck Jones,et al.  Inference in Natural Language Front Ends , 1986, DS-2.

[95]  Jacques Savoy,et al.  Cross-language information retrieval: experiments based on CLEF 2000 corpora , 2003, Inf. Process. Manag..

[96]  Arthur W. S. Cater Analysis and inference for English , 1981 .

[97]  Karen Spärck Jones Shifting Meaning Representations , 1983, IJCAI.

[98]  Ellen M. Voorhees,et al.  The TREC Spoken Document Retrieval Track: A Success Story , 2000, TREC.

[99]  G. Lakoff Linguistics and natural logic , 1970, Synthese.

[100]  Yi Su,et al.  TREC-9 CLIR Experiments at MSRCN , 2000, TREC.

[101]  John Tait,et al.  Automatic summarising of English texts , 1982 .

[102]  Peter Willett,et al.  An evaluation of some conflation algorithms for information retrieval , 1981 .

[103]  John M. Carroll,et al.  Computer selection of keywords using word-frequency analysis , 1969 .

[104]  W. Bruce Croft,et al.  Using Probabilistic Models of Document Retrieval without Relevance Information , 1979, J. Documentation.

[105]  Peter Willett,et al.  Readings in information retrieval , 1997 .

[106]  Ellen M. Voorhees,et al.  Overview of the TREC 2004 Novelty Track. , 2005 .

[107]  Karen Sparck Jones Computer security – a layperson’s guide, from the bottom up , 2002 .

[108]  Fabio Crestani,et al.  Information Retrieval: Uncertainty and Logics , 1998, The Kluwer International Series on Information Retrieval.

[109]  Jinxi Xu,et al.  Evaluation of an extraction-based approach to answering definitional questions , 2004, SIGIR '04.

[110]  Karen Sparck Jones,et al.  Statistical bases of relevance assessment for the ideal information retrieval test collection , 1979 .

[111]  W. Bruce Croft,et al.  Resolving ambiguity for cross-language retrieval , 1998, SIGIR '98.

[112]  F. W. Lancaster,et al.  MEDLARS: Report on the Evaluation of Its Operating Efficiency. , 1997 .

[113]  Paul Thompson,et al.  Subjective probability, combination of expert opinion, and probabilistic approaches to information retrieval , 1987 .

[114]  Hans Peter Luhn,et al.  A Statistical Approach to Mechanized Encoding and Searching of Literary Information , 1957, IBM J. Res. Dev..

[115]  Gareth J. F. Jones,et al.  Exeter at CLEF 2001: Experiments with Machine Translation for Bilingual Retrieval , 2001, CLEF.

[116]  Gareth J. F. Jones,et al.  EXETER at CLEF 2003: Experiments with Machine Translation for Monolingual, Bilingual and Multilingual Retrieval , 2003, CLEF.

[117]  Jerry R. Hobbs,et al.  Interpretation as Abduction , 1993, Artif. Intell..

[118]  Stephen Pulman,et al.  Shallow processing and automatic summarising: a first study , 1991 .

[119]  Uwe Reyle,et al.  From discourse to logic , 1993 .

[120]  Eugene Charniak,et al.  A Maximum-Entropy-Inspired Parser , 2000, ANLP.

[121]  Djoerd Hiemstra,et al.  Relating the new language models of information retrieval to the traditional retrieval models , 2000 .

[122]  Daniel Marcu,et al.  The automatic construction of large-scale corpora for summarization research , 1999, SIGIR '99.

[123]  Stephen E. Robertson,et al.  A probabilistic model of information retrieval: development and comparative experiments - Part 1 , 2000, Inf. Process. Manag..

[124]  Alexander H. Waibel,et al.  DIASUMM: Flexible Summarization of Spontaneous Dialogues in Unrestricted Domains , 2000, COLING.

[125]  Ellen M. Voorhees,et al.  Evaluation by highly relevant documents , 2001, SIGIR '01.

[126]  Harold Borko,et al.  Automatic indexing , 1981, ACM '81.

[127]  Wendy Grace Lehnert,et al.  The Process of Question Answering , 2022 .

[128]  P. Pietroski Actions, adjuncts, and agency , 1998 .

[129]  Masaru Tomita,et al.  An Efficient Augmented-Context-Free Parsing Algorithm , 1987, Comput. Linguistics.

[130]  Mitchell P. Marcus,et al.  A theory of syntactic recognition for natural language , 1979 .

[131]  Ellen M. Voorhees,et al.  Evaluating Evaluation Measure Stability , 2000, SIGIR 2000.

[132]  S. Laurence,et al.  Radical concept nativism , 2002, Cognition.

[133]  Stephen E. Robertson,et al.  Application of probabilistic methods to Chinese , 1997, J. Documentation.

[134]  Karen Spärck Jones Search Term Relevance Weighting given Little Relevance Information , 1997, J. Documentation.

[135]  Karen Spärck Jones Natural language processing: she needs something old and something new (maybe something borrowed and something blue, too) , 1995, ArXiv.

[136]  H. P. Edmundson,et al.  Automatic abstracting and indexing—survey and recommendations , 1961, CACM.

[137]  Stephen E. Robertson,et al.  Relevance weighting of search terms , 1976, J. Am. Soc. Inf. Sci..

[138]  C F Overhage Plans for project intrex. , 1966, Science.

[139]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[140]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[141]  Michael Collins,et al.  Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.

[142]  Victor Poznański A relevance-based utterance processing system , 1992 .

[143]  Peter Schäuble,et al.  A system for retrieving speech documents , 1992, SIGIR '92.

[144]  Kathleen R. McKeown,et al.  Summarization Evaluation Methods: Experiments and Analysis , 1998 .

[145]  Julia Galliers,et al.  Evaluating natural language processing systems , 1995 .

[146]  Karen Spärck Jones,et al.  Natural language interfaces to databases , 1990, The Knowledge Engineering Review.

[147]  Xuanjing Huang,et al.  FDU at TREC-9: CLIR, Filtering and QA Tasks , 2000, TREC.

[148]  Herbert A. Simon,et al.  The Sciences of the Artificial , 1970 .

[149]  Sanda M. Harabagiu,et al.  FALCON: Boosting Knowledge for Answer Engines , 2000, TREC.

[150]  Timothy Baldwin,et al.  Multiword expressions: linguistic precision and reusability , 2002, LREC.

[151]  Karen Spärck Jones,et al.  Open-vocabulary speech indexing for voice and video mail retrieval , 1997, MULTIMEDIA '96.

[152]  Ted Briscoe,et al.  High Precision Extraction of Grammatical Relations , 2001, COLING.

[153]  Karen Sparck Jones Privacy: what's different now? , 2003 .

[154]  Karen Spärck Jones Automatic summarising: factors and directions , 1998, ArXiv.

[155]  Ii Gerald Francis Dejong Skimming stories in real time: an experiment in integrated understanding. , 1979 .