Keyword and Keyphrase Extraction Techniques: A Literature Review

In this paper we present a survey of various techniques available in text mining for keyword and keyphrase extraction.

[1]  Clement T. Yu,et al.  A theory of term importance in automatic text analysis , 1974, J. Am. Soc. Inf. Sci..

[2]  Richard K. Belew,et al.  Exporting phrases: a statistical analysis of topical language , 1991 .

[3]  Slava M. Katz,et al.  Technical terminology: some linguistic properties and an algorithm for identification in text , 1995, Natural Language Engineering.

[4]  Jonathan D. Cohen,et al.  Highlights: Language- and Domain-Independent Automatic Indexing Terms for Abstracting , 1995, J. Am. Soc. Inf. Sci..

[5]  Jonathan D. Cohen Highlights: language- and domain-independent automatic indexing terms for abstracting , 1995 .

[6]  Bruce Krulwich,et al.  Learning user information interests through extraction of semantically significant phrases , 1996 .

[7]  Alberto Muòoz,et al.  Compound Key Word Generation from Document Databases Using A Hierarchical Clustering ART Model , 1997 .

[8]  Ken Barker,et al.  Using Noun Phrase Heads to Extract Document Keyphrases , 2000, Canadian Conference on AI.

[9]  Pedro Carpena,et al.  Keyword detection in natural languages and DNA , 2002 .

[10]  Min Song,et al.  KPSpotter: a flexible information gain-based keyphrase extraction system , 2003, WIDM '03.

[11]  Anette Hulth,et al.  Improved Automatic Keyword Extraction Given More Linguistic Knowledge , 2003, EMNLP.

[12]  Peter D. Turney Coherent Keyphrase Extraction via Web Mining , 2003, IJCAI.

[13]  Matthew Hurst,et al.  A Language Model Approach to Keyphrase Extraction , 2003, ACL 2003.

[14]  Peter D. Turney Learning Algorithms for Keyphrase Extraction , 2000, Information Retrieval.

[15]  Juan-Zi Li,et al.  Loss Minimization Based Keyword Distillation , 2004, APWeb.

[16]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[17]  Helen J. Seaton,et al.  International Encyclopedia of Information and Library Science , 2004 .

[18]  F. Ren,et al.  Multilingual single document keyword extraction for information retrieval , 2005, 2005 International Conference on Natural Language Processing and Knowledge Engineering.

[19]  Yi-fang Brook Wu,et al.  Domain-specific keyphrase extraction , 2005, CIKM '05.

[20]  Ian H. Witten,et al.  Thesaurus based automatic keyphrase indexing , 2006, Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '06).

[21]  Juan-Zi Li,et al.  Keyword Extraction Using Support Vector Machine , 2006, WAIM.

[22]  Pedro A. Pury,et al.  Statistical keyword detection in literary corpora , 2007, ArXiv.

[23]  Min-Yen Kan,et al.  Keyphrase Extraction in Scientific Publications , 2007, ICADL.

[24]  Chengzhi Zhang,et al.  Automatic Keyword Extraction from Documents Using Conditional Random Fields , 2008 .

[25]  P. Carpena,et al.  Level statistics of words: finding keywords in literary texts and symbolic sequences. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[26]  Zhiyuan Liu,et al.  Clustering to Find Exemplar Terms for Keyphrase Extraction , 2009, EMNLP.

[27]  Weiguang Qu,et al.  A Semi-Supervised Key Phrase Extraction Approach: Learning from Title Phrases through a Document Semantic Network , 2010, ACL.

[28]  Christian Wartena,et al.  Thesaurus Based Term Ranking for Keyword Extraction , 2010, 2010 Workshops on Database and Expert Systems Applications.

[29]  Xindong Wu,et al.  Keyword extraction based on sequential pattern mining , 2011, ICIMCS '11.

[30]  Sujian Li,et al.  Hypergraph-based inductive learning for generating implicit key phrases , 2011, WWW.

[31]  Abraham Kandel,et al.  DegExt - A Language-Independent Graph-Based Keyphrase Extractor , 2011, AWIC.

[32]  A. Mehri,et al.  Keyword extraction by nonextensivity measure. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[33]  Bao Hong,et al.  An Extended Keyword Extraction Method , 2012 .