Turkish keyphrase extraction using multi-criterion ranking

Keyphrases have been extensively used for indexing and searching in databases and information retrieval systems. In addition, they provide useful information about semantic content of a document. In this paper, we propose an algorithm for automating Turkish keyphrase extraction. Several features of candidate phrases are exploited and form the extraction task as a problem of finding optimal set of candidate phrases. We use multi-criterion ranking to tackle this problem.

[1]  Carl Gutwin,et al.  KEA: practical automatic keyphrase extraction , 1999, DL '99.

[2]  I. Cicekli,et al.  Turkish keyphrase extraction using KEA , 2007, 2007 22nd international symposium on computer and information sciences.

[3]  I. Cicekli,et al.  TurKeyX: Turkish keyphrase extractor , 2008, 2008 23rd International Symposium on Computer and Information Sciences.

[4]  Ilyas Cicekli,et al.  A Rule-Based Morphological Disambiguator for Turkish , 2007 .

[5]  Shmuel T. Klein,et al.  Clumping properties of content-bearing words , 1998 .

[6]  Ken Barker,et al.  Using Noun Phrase Heads to Extract Document Keyphrases , 2000, Canadian Conference on AI.

[7]  Ian H. Witten,et al.  Thesaurus based automatic keyphrase indexing , 2006, Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '06).

[8]  Peter D. Turney Learning Algorithms for Keyphrase Extraction , 2000, Information Retrieval.

[9]  G. P. Patil,et al.  Multiple indicators, partially ordered sets, and linear extensions: Multi-criterion ranking and prioritization , 2004, Environmental and Ecological Statistics.

[10]  Alice M. Agogino,et al.  Automating keyphrase extraction with multi-objective genetic algorithms , 2004, 37th Annual Hawaii International Conference on System Sciences, 2004. Proceedings of the.