Language Modeling for Information Retrieval

[1]  Garrison W. Cottrell,et al.  Fusion Via a Linear Combination of Scores , 1999, Information Retrieval.

[2]  Yiming Yang,et al.  An Evaluation of Statistical Approaches to Text Categorization , 1999, Information Retrieval.

[3]  William T. Morgan,et al.  The role of variance in term weighting for probabilistic information retrieval , 2002, CIKM '02.

[4]  W. Bruce Croft,et al.  Cross-lingual relevance models , 2002, SIGIR '02.

[5]  Djoerd Hiemstra,et al.  The Importance of Prior Probabilities for Entry Page Search , 2002, SIGIR '02.

[6]  W. Bruce Croft,et al.  Predicting query performance , 2002, SIGIR '02.

[7]  R. Manmatha,et al.  A formal approach to score normalization for meta-search , 2002 .

[8]  James Allan,et al.  Relevance models for topic detection and tracking , 2002 .

[9]  E. Caglioti,et al.  Language trees and zipping. , 2001, Physical review letters.

[10]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[11]  W. Bruce Croft Combining Approaches to Information Retrieval , 2002 .

[12]  Wessel Kraaij,et al.  Unsupervised Event Clustering in Multilingual News Streams , 2002 .

[13]  Javed A. Aslam,et al.  Relevance score normalization for metasearch , 2001, CIKM '01.

[14]  Yi Zhang,et al.  Maximum likelihood estimation for filtering thresholds , 2001, SIGIR '01.

[15]  R. Manmatha,et al.  Modeling score distributions for combining the outputs of search engines , 2001, SIGIR '01.

[16]  W. Bruce Croft,et al.  Relevance-Based Language Models , 2001, SIGIR '01.

[17]  Javed A. Aslam,et al.  Models for metasearch , 2001, SIGIR '01.

[18]  W. Bruce Croft,et al.  Workshop on language modeling and information retrieval , 2001, SIGF.

[19]  William John Teahan,et al.  Combining PPM models using a text mining approach , 2001, Proceedings DCC 2001. Data Compression Conference.

[20]  Warren R. Greiff,et al.  Fine-Grained Hidden Markov Modeling for Broadcast-News Story Segmentation , 2001, HLT.

[21]  Nigel G. Ward Machine Translation: Past, Present, Future , 2001 .

[22]  James P. Callan,et al.  Experiments Using the Lemur Toolkit , 2001, TREC.

[23]  Yi Zhang,et al.  The Bias Problem and Language Models in Adaptive Filtering , 2001, TREC.

[24]  Wessel Kraaij,et al.  Using language models for tracking events of interest over time , 2001 .

[25]  John D. Lafferty,et al.  A study of smoothing methods for language models applied to Ad Hoc information retrieval , 2001, SIGIR '01.

[26]  Dmitry V. Khmelev Disputed Authorship Resolution through Using Relative Empirical Entropy for Markov Chains of Letters in Human Language Texts , 2000, J. Quant. Linguistics.

[27]  Efstathios Stamatatos,et al.  Automatic Text Categorization In Terms Of Genre and Author , 2000, CL.

[28]  Stephen E. Robertson,et al.  A probabilistic model of information retrieval: development and comparative experiments - Part 1 , 2000, Inf. Process. Manag..

[29]  Daniel Marcu,et al.  Statistics-Based Summarization - Step One: Sentence Compression , 2000, AAAI/IAAI.

[30]  Byoung-Tak Zhang,et al.  Text filtering by boosting naive Bayes classifiers , 2000, SIGIR '00.

[31]  Stephen E. Robertson,et al.  Threshold setting in adaptive filtering , 2000, J. Documentation.

[32]  Charles L. Wayne Multilingual Topic Detection and Tracking: Successful Research Enabled by Corpora and Evaluation , 2000, LREC.

[33]  William John Teahan,et al.  Text classification and segmentation using minimum cross-entropy , 2000, RIAO.

[34]  Ian H. Witten,et al.  Text categorization using compression models , 2000, Proceedings DCC 2000. Data Compression Conference.

[35]  Yi Zhang,et al.  YFilter at TREC-9 , 2000, TREC.

[36]  Djoerd Hiemstra,et al.  Relating the new language models of information retrieval to the traditional retrieval models , 2000 .

[37]  Stephen E. Robertson,et al.  Microsoft Cambridge at TREC-9: Filtering Track , 2000, TREC.

[38]  Richard M. Schwartz,et al.  Topic tracking for radio, TV broadcast, and newswire , 1999, EUROSPEECH.

[39]  Richard M. Schwartz,et al.  A hidden Markov model information retrieval system , 1999, SIGIR '99.

[40]  John D. Lafferty,et al.  Information retrieval as statistical translation , 1999, SIGIR '99.

[41]  Christoph Baumgarten,et al.  A probabilistic solution to the selection and fusion problem in distributed information retrieval , 1999, SIGIR '99.

[42]  Jade Goldstein-Stewart,et al.  Selecting Text Spans for Document Summaries: Heuristics and Metrics , 1999, AAAI/IAAI.

[43]  William W. Cohen,et al.  Context-sensitive learning methods for text categorization , 1999, TOIS.

[44]  Djoerd Hiemstra,et al.  Twenty-One at TREC-8: using Language Technology for Information Retrieval , 1999, TREC.

[45]  Kenney Ng A Maximum Likelihood Ratio Information Retrieval Model , 1999, TREC.

[46]  Donna K. Harman,et al.  Overview of the Eighth Text REtrieval Conference (TREC-8) , 1999, TREC.

[47]  Ian H. Witten,et al.  Using language models for generic entity extraction , 1999 .

[48]  Stephen E. Robertson,et al.  The TREC-8 Filtering Track Final Report , 1999, TREC.

[49]  Marc Light,et al.  Hiding a Semantic Hierarchy in a Markov Model , 1999, ACL 1999.

[50]  Susan T. Dumais,et al.  Inductive learning algorithms and representations for text categorization , 1998, CIKM '98.

[51]  Douglas W. Oard,et al.  A comparative study of query and document translation for cross-language information retrieval , 1998, AMTA.

[52]  Djoerd Hiemstra,et al.  A Linguistically Motivated Probabilistic Model of Information Retrieval , 1998, ECDL.

[53]  Garrison W. Cottrell,et al.  Predicting the performance of linearly combined IR systems , 1998, SIGIR '98.

[54]  Ari Pirkola,et al.  The effects of query structure and dictionary setups in dictionary-based cross-language information retrieval , 1998, SIGIR '98.

[55]  W. Bruce Croft,et al.  Resolving ambiguity for cross-language retrieval , 1998, SIGIR '98.

[56]  Yoram Singer,et al.  Boosting and Rocchio applied to text filtering , 1998, SIGIR '98.

[57]  Philip Resnik,et al.  Parallel strands: a preliminary investigation into mining the Web for bilingual text , 1998, AMTA.

[58]  Larry Gillick,et al.  A hidden Markov model approach to text segmentation and event tracking , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[59]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[60]  David D. Lewis,et al.  Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval , 1998, ECML.

[61]  Peter Jansen,et al.  Threshold Calibration in CLARIT Adaptive Filtering , 1998, TREC.

[62]  Djoerd Hiemstra,et al.  Twenty-One at TREC7: Ad-hoc and Cross-Language Track , 1998, TREC.

[63]  W. Bruce Croft,et al.  Corpus-based stemming using cooccurrence of word variants , 1998, TOIS.

[64]  S. Robertson The probability ranking principle in IR , 1997 .

[65]  Ronald Rosenfeld,et al.  Statistical language modeling using the CMU-cambridge toolkit , 1997, EUROSPEECH.

[66]  Trevor J. Hastie,et al.  Discriminative vs Informative Learning , 1997, KDD.

[67]  Yiming Yang,et al.  A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.

[68]  Hinrich Schütze,et al.  Automatic Detection of Text Genre , 1997, ACL.

[69]  Christoph Baumgarten,et al.  A probabilistic model for distributed information retrieval , 1997, Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.

[70]  Jong-Hak Lee,et al.  Analyses of multiple evidence combination , 1997, SIGIR '97.

[71]  Eduard Hovy,et al.  Automated Text Summarization in SUMMARIST , 1997, ACL 1997.

[72]  Gerald J. Kowalski,et al.  Information Retrieval Systems , 1997, The Information Retrieval Series.

[73]  David A. Hull Using Structured Queries for Disambiguation in Cross-Language Information Retrieval , 1997 .

[74]  Frederick Jelinek,et al.  Statistical methods for speech recognition , 1997 .

[75]  Daniel Marcu,et al.  From discourse structures to text summaries , 1997 .

[76]  Therese Firmin Hand,et al.  A Proposal for Task-based Evaluation of Text Summarization Systems , 1997, Workshop On Intelligent Scalable Text Summarization.

[77]  James Allan,et al.  Incremental relevance feedback for information filtering , 1996, SIGIR '96.

[78]  Chris Buckley,et al.  Pivoted Document Length Normalization , 1996, SIGIR Forum.

[79]  Joon Ho Lee,et al.  Combining multiple evidence from different properties of weighting schemes , 1995, SIGIR '95.

[80]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[81]  Dragomir R. Radev,et al.  Generating summaries of multiple news articles , 1995, SIGIR '95.

[82]  Andreas S. Weigend,et al.  A neural network approach to topic spotting , 1995 .

[83]  Stephen E. Robertson,et al.  Okapi at TREC-4 , 1995, TREC.

[84]  Stephen E. Robertson,et al.  Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval , 1994, SIGIR '94.

[85]  Chris Buckley,et al.  OHSUMED: an interactive retrieval evaluation and new large test collection for research , 1994, SIGIR '94.

[86]  W. Bruce Croft,et al.  Document Retrieval and Routing Using the INQUERY System , 1994, TREC.

[87]  Alan T. Sherman,et al.  Statistical Techniques for Language Recognition: an Introduction and Guide for Cryptanalysts , 1993, Cryptologia.

[88]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[89]  Paul G. Howard,et al.  The design and analysis of efficient lossless data compression systems , 1993 .

[90]  Edward A. Fox,et al.  Combination of Multiple Searches , 1993, TREC.

[91]  Norbert Fuhr,et al.  Probabilistic Models in Information Retrieval , 1992, Comput. J..

[92]  Robert L. Mercer,et al.  An Estimate of an Upper Bound for the Entropy of English , 1992, CL.

[93]  Yiyu Yao,et al.  An Information-Theoretic Measure of Term Specificity , 1992, J. Am. Soc. Inf. Sci..

[94]  Alistair Moffat,et al.  Implementing the PPM data compression scheme , 1990, IEEE Trans. Commun..

[95]  John Cocke,et al.  A Statistical Approach to Machine Translation , 1990, CL.

[96]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[97]  W. Bruce Croft,et al.  Inference networks for document retrieval , 1989, SIGIR '90.

[98]  Ian H. Witten,et al.  Data Compression Using Adaptive Coding and Partial String Matching , 1984, IEEE Trans. Commun..

[99]  Frederick Mosteller,et al.  Applied Bayesian and classical inference : the case of the Federalist papers , 1984 .

[100]  W. Nelson Francis,et al.  FREQUENCY ANALYSIS OF ENGLISH USAGE: LEXICON AND GRAMMAR , 1983 .

[101]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[102]  Stephen E. Robertson,et al.  Relevance weighting of search terms , 1976, J. Am. Soc. Inf. Sci..

[103]  Stephen P. Harter,et al.  A probabilistic approach to automatic keyword indexing. Part II. An algorithm for probabilistic indexing , 1975, J. Am. Soc. Inf. Sci..

[104]  Stephen P. Harter,et al.  A probabilistic approach to automatic keyword indexing. Part I. On the Distribution of Specialty Words in a Technical Literature , 1975, J. Am. Soc. Inf. Sci..

[105]  James E. Rush,et al.  Improvement of automatic abstracts by the use of structural analysis , 1973, J. Am. Soc. Inf. Sci..

[106]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .

[107]  M. E. Maron,et al.  On Relevance, Probabilistic Indexing and Information Retrieval , 1960, JACM.