论文信息 - A Language Model Approach to Keyphrase Extraction

A Language Model Approach to Keyphrase Extraction

We present a new approach to extracting keyphrases based on statistical language models. Our approach is to use pointwise KL-divergence between multiple language models for scoring both phraseness and informativeness, which can be unified into a single score to rank extracted phrases.

Matthew Hurst | Takashi Tomokiyo | Matthew F. Hurst | T. Tomokiyo | Takashi Tomokiyo

[1] Ramakrishnan Srikant,et al. Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[2] Hinrich Schütze,et al. Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[3] Kenneth Ward Church,et al. Using Statistics in Lexical Analysis , 2003, Lexical Acquisition: Exploiting On-Line Resources to Build a Lexicon.

[4] Kenneth Ward Church,et al. Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus , 2001, Computational Linguistics.

[5] F ChenStanley,et al. An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.

[6] Rakesh Agarwal,et al. Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[7] Fred J. Damerau,et al. Generating and Evaluating Domain-Oriented Multi-Word Terms from Texts , 1993, Inf. Process. Manag..

[8] Patrick Pantel,et al. A Statistical Corpus-Based Term Extractor , 2001, Canadian Conference on AI.

[9] J. Cleary,et al. \self-organized Language Modeling for Speech Recognition". In , 1997 .

[10] Peter D. Turney. Learning Algorithms for Keyphrase Extraction , 2000, Information Retrieval.

[11] Ted Dunning,et al. Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[12] Frank Smadja,et al. Retrieving Collocations from Text: Xtract , 1993, CL.

[13] Yi-fang Brook Wu,et al. Domain-specific keyphrase extraction , 2005, CIKM '05.

[14] Thomas M. Cover,et al. Elements of Information Theory , 2005 .

[15] Kenneth Ward Church,et al. Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.