Using lexical chains for keyword extraction

Keywords can be considered as condensed versions of documents and short forms of their summaries. In this paper, the problem of automatic extraction of keywords from documents is treated as a supervised learning task. A lexical chain holds a set of semantically related words of a text and it can be said that a lexical chain represents the semantic content of a portion of the text. Although lexical chains have been extensively used in text summarization, their usage for keyword extraction problem has not been fully investigated. In this paper, a keyword extraction technique that uses lexical chains is described, and encouraging results are obtained.

[1]  Gonenc Ercan AUTOMATED TEXT SUMMARIZATION AND KEYPHRASE EXTRACTION , 2006 .

[2]  Christopher D. Manning,et al.  Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger , 2000, EMNLP.

[3]  Simone Teufel,et al.  Sentence extraction as a classification task , 1997 .

[4]  Kathleen F. McCoy,et al.  Efficient text summarization using lexical chains , 2000, IUI '00.

[5]  Marie-Francine Moens,et al.  The use of topic segmentation for automatic summarization , 2002, ACL 2002.

[6]  Peter D. Turney Learning Algorithms for Keyphrase Extraction , 2000, Information Retrieval.

[7]  Regina Barzilay,et al.  Lexical Chains for Summarization , 1997 .

[8]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[9]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[10]  Kathleen F. McCoy,et al.  Efficiently Computed Lexical Chains as an Intermediate Representation for Automatic Text Summarization , 2002, CL.

[11]  Regina Barzilay,et al.  Using Lexical Chains for Text Summarization , 1997 .

[12]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[13]  Graeme Hirst,et al.  Lexical Cohesion Computed by Thesaural relations as an indicator of the structure of text , 1991, CL.

[14]  Dekang Lin,et al.  WordNet: An Electronic Lexical Database , 1998 .

[15]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[16]  Anette Hulth,et al.  Enhancing Linguistically Oriented Automatic Keyword Extraction , 2004, NAACL.

[17]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[18]  Peter D. Turney Learning to Extract Keyphrases from Text , 2002, ArXiv.

[19]  Carl Gutwin,et al.  KEA: practical automatic keyphrase extraction , 1999, DL '99.

[20]  Carl Gutwin,et al.  Domain-Specific Keyphrase Extraction , 1999, IJCAI.

[21]  Kathleen McKeown,et al.  Improving Word Sense Disambiguation in Lexical Chaining , 2003, IJCAI.