Selectivity-Based Keyword Extraction Method

In this work the authors propose a novel Selectivity-Based Keyword Extraction (SBKE) method, which extracts keywords from the source text represented as a network. The node selectivity value is calculated from a weighted network as the average weight distributed on the links of a single node and is used in the procedure of keyword candidate ranking and extraction. The authors show that selectivity-based keyword extraction slightly outperforms an extraction based on the standard centrality measures: in/out-degree, betweenness and closeness. Therefore, they include selectivity and its modification – generalized selectivity as node centrality measures in the SBKE method. Selectivity-based extraction does not require linguistic knowledge as it is derived purely from statistical and structural information of the network. The experimental results point out that selectivity-based keyword extraction has a great potential for the collection-oriented keyword extraction task.

[1]  Aditi Sharan,et al.  Keyword and Keyphrase Extraction Techniques: A Literature Review , 2015 .

[2]  Loll N. Rolling Indexing consistency, quality and efficiency , 1981, Inf. Process. Manag..

[3]  G. J. Rodgers,et al.  Network properties of written human language. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Yuzhong Qu,et al.  Searching Linked Objects with Falcons: Approach, Implementation and Evaluation , 2009, Int. J. Semantic Web Inf. Syst..

[5]  Leandro Nunes de Castro,et al.  A keyword extraction method from twitter messages represented as graphs , 2014, Appl. Math. Comput..

[6]  Siegfried Handschuh,et al.  Towards Controlled Natural Language for Semantic Annotation , 2010, Int. J. Semantic Web Inf. Syst..

[7]  Sanda Martinčić-Ipšić,et al.  An Overview of Graph-Based Keyword Extraction Methods and Approaches , 2015 .

[8]  Gordon W. Paynter,et al.  Automatic extraction of document keyphrases for use in digital libraries: Evaluation and applications , 2002, J. Assoc. Inf. Sci. Technol..

[9]  John Skvoretz,et al.  Node centrality in weighted networks: Generalizing degree and shortest paths , 2010, Soc. Networks.

[10]  U. Brandes A faster algorithm for betweenness centrality , 2001 .

[11]  Arash Joorabchi,et al.  Automatic keyphrase annotation of scientific documents using Wikipedia and genetic algorithms , 2013, J. Inf. Sci..

[12]  Haitao Liu,et al.  What role does syntax play in a language network , 2008 .

[13]  Hai Zhuge,et al.  Topological centrality and its e-Science applications , 2010 .

[14]  G. J. Rodgers,et al.  Differences between Normal and Shuffled Texts: Structural Properties of Weighted Networks , 2008, Adv. Complex Syst..