A graph based representative keywords extraction model from news articles

In an age of the deluge of information, a blizzard of documents such as news articles is being generated in a real-time. To grasp the contents of documents, keyword extraction methods have researched actively. In this paper, we propose a model to extract representative keywords of news articles based on graph model. We evaluate the accuracy of the proposed model compared with TextRank and TFIDF. The results show that proposed model's accuracy is improved to 40% and 90% respectively without increasing computational time.

[1]  Han-Joon Kim,et al.  News Keyword Extraction for Topic Tracking , 2008, 2008 Fourth International Conference on Networked Computing and Advanced Information Management.

[2]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[3]  Mingyong Liu,et al.  An improvement of TFIDF weighting in text categorization , .

[4]  이성직Sungjick Lee,et al.  Keyword Extraction from News Corpus using Modified TF-IDF , 2009 .

[5]  Seung-Hee Han,et al.  A Study on Keyword Extraction From a Single Document Using Term Clustering , 2010 .

[6]  Gang Chen,et al.  Large-scale documents reduction based on domain ontology and E2LSH , 2014, Proceedings of the 11th IEEE International Conference on Networking, Sensing and Control.

[7]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[8]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[9]  Leandro Nunes de Castro,et al.  A keyword extraction method from twitter messages represented as graphs , 2014, Appl. Math. Comput..

[10]  Bernard Harris,et al.  Graph theory and its applications , 1970 .

[11]  Juan Wang,et al.  An optimized features extraction algorithm on VSM , 2012, 2012 9th International Conference on Fuzzy Systems and Knowledge Discovery.

[12]  Jonathan L. Gross,et al.  Graph Theory and Its Applications, Second Edition (Discrete Mathematics and Its Applications) , 2005 .