Bio-inspired approaches for extractive document summarization: A comparative study

Abstract With the exponential growth of information on the World Wide Web, extracting relevant information from huge amounts of data has become a critical task. Text summarization has emerged as one solution to this problem. Because the main objective is to retrieve a condensed document that retains the original information, summarization can be treated as an optimization problem. In this paper, a comparative analysis of several meta-heuristic approaches, namely Cuckoo Search (CS), Cat Swarm Optimization (CSO), Particle Swarm Optimization (PSO), Harmony Search (HS), and Differential Evolution (DE), is presented for the single-document summarization problem. The performance of these algorithms is compared in terms of evaluation metrics such as F score, true positive rate, and positive predictive value to validate summary relevancy and non-redundancy over traditional and standard Document Understanding Conference (DUC) datasets.
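
As an illustration of the evaluation metrics named in the abstract, the following minimal sketch computes true positive rate (recall), positive predictive value (precision), and F score for an extractive summary. It assumes sentence-level matching against a reference summary, which may differ from the evaluation protocol actually used in the paper; the function name `summary_scores` and the sentence indices are hypothetical.

```python
def summary_scores(selected, reference):
    """Score an extractive summary against a reference summary.

    Both arguments are iterables of sentence indices. Sentence-level
    matching is an assumption made for illustration only.
    """
    selected, reference = set(selected), set(reference)
    true_positives = len(selected & reference)

    # Positive predictive value: fraction of selected sentences that are relevant.
    ppv = true_positives / len(selected) if selected else 0.0
    # True positive rate: fraction of reference sentences that were selected.
    tpr = true_positives / len(reference) if reference else 0.0
    # F score: harmonic mean of the two, defined as 0 when both are 0.
    f_score = 2 * ppv * tpr / (ppv + tpr) if (ppv + tpr) else 0.0

    return {"tpr": tpr, "ppv": ppv, "f_score": f_score}


# Hypothetical example: a candidate summary of four sentences
# compared with a three-sentence reference summary.
scores = summary_scores(selected=[0, 2, 5, 7], reference=[0, 2, 3])
print(scores)
```

In a meta-heuristic setting, a score of this kind could serve either as the fitness that each candidate sentence selection is optimized against or only as a post hoc evaluation measure; the abstract specifies only the latter use.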
