An evolutionary framework for multi document summarization using Cuckoo search approach: MDSCSA

Abstract In today's scenario the rate of growth of information is expanding exponentially in the World Wide Web. As a result, extracting valid and useful information from a huge data has become a challenging issue. Recently text summarization is recognized as one of the solution to extract relevant information from large documents. Based on number of documents considered for summarization, the summarization task is categorized as single document or multi-document summarization. Rather than single document, multi-document summarization is more challenging for the researchers to find accurate summary from multiple documents. Hence in this study, a novel Cuckoo search based multi-document summarizer (MDSCSA) is proposed to address the problem of multi-document summarization. The proposed MDSCSA is also compared with two other nature inspired based summarization techniques such as Particle Swarm Optimization based summarization (PSOS) and Cat Swarm Optimization based summarization (CSOS). With respect to the benchmark dataset Document Understanding Conference (DUC) datasets, the performance of all algorithms are compared in terms of ROUGE score, inter sentence similarity and readability metric to validate non-redundancy, cohesiveness and readability of the summary respectively. The experimental analysis clearly reveals that the proposed approach outperforms the other summarizers included in this study.

[1]  Rafael Dueire Lins,et al.  Assessing shallow sentence scoring techniques and combinations for single and multi-document summarization , 2016, Expert Syst. Appl..

[2]  Rasim M. Alguliyev,et al.  MCMR: Maximum coverage and minimum redundant text summarization model , 2011, Expert Syst. Appl..

[3]  Samrat L. Sabat,et al.  Optimal chiller loading for energy conservation using a new differential cuckoo search approach , 2014 .

[4]  Abbas Khosravi,et al.  Intelligent cuckoo search optimized traffic signal controllers for multi-intersection network , 2015, Expert Syst. Appl..

[5]  Enrique Herrera-Viedma,et al.  Clustering of web search results based on the cuckoo search algorithm and Balanced Bayesian Information Criterion , 2014, Inf. Sci..

[6]  P. Balasubramanie,et al.  Clustering based optimal summary generation using Genetic Algorithm , 2010, 2010 International Conference on Communication and Computational Intelligence (INCOCCI).

[7]  Behrooz Masoumi,et al.  Automatic text summarization based on multi-agent particle swarm optimization , 2014, 2014 Iranian Conference on Intelligent Systems (ICIS).

[8]  Jun Tang,et al.  Query-focused Summarization Based on Genetic Algorithm , 2010, 2010 International Conference on Measuring Technology and Mechatronics Automation.

[9]  Rasim M. Alguliyev,et al.  Effective summarization method of text documents , 2005, The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05).

[10]  Pak Kin Wong,et al.  Modeling and optimization of biodiesel engine performance using kernel-based extreme learning machine and cuckoo search , 2015 .

[11]  Rasim M. Alguliyev,et al.  Multiple documents summarization based on evolutionary optimization algorithm , 2013, Expert Syst. Appl..

[12]  Thang Trung Nguyen,et al.  Cuckoo search algorithm for short-term hydrothermal scheduling , 2014 .

[13]  Naomie Salim,et al.  Swarm Based Text Summarization , 2009, 2009 International Association of Computer Science and Information Technology - Spring Conference.

[14]  Rakesh Chandra Balabantaray,et al.  Cat swarm optimization based evolutionary framework for multi document summarization , 2017 .

[15]  Kumaresh Nandhini,et al.  Extracting easy to understand summary using differential evolution algorithm , 2014, Swarm Evol. Comput..

[16]  N. Jawahar,et al.  Reliability-based total cost of ownership approach for supplier selection using cuckoo-inspired hybrid algorithm , 2014 .

[17]  Seyed Hossein Mirshojaei,et al.  Text Summarization Using Cuckoo Search Optimization Algorithm , 2015 .

[18]  Ahmed Elkeran,et al.  A new approach for sheet nesting problem using guided cuckoo search and pairwise clustering , 2013, Eur. J. Oper. Res..

[19]  Vicente P. Guerrero-Bote,et al.  Order-based Fitness Functions for Genetic Algorithms Applied to Relevance Feedback , 2003, J. Assoc. Inf. Sci. Technol..

[20]  Niladri Chatterjee,et al.  Discrete Differential Evolution for Text Summarization , 2014, 2014 International Conference on Information Technology.

[21]  Xiaohui Liu,et al.  Parameter estimation of Takagi-Sugeno fuzzy system using heterogeneous cuckoo search algorithm , 2015, Neurocomputing.

[22]  Lalit Chandra Saikia,et al.  Comparison of performances of several Cuckoo search algorithm based 2DOF controllers in AGC of multi-area thermal system , 2014 .

[23]  S. G. Ponnambalam,et al.  Cuckoo Search Algorithm for Optimization of Sequence in PCB Holes Drilling Process , 2012 .

[24]  Rasim M. Alguliyev,et al.  GenDocSum + MCLR: Generic document summarization based on maximum coverage and less redundancy , 2012, Expert Syst. Appl..

[25]  Rakesh Chandra Balabantaray,et al.  Document Summarization Using Sentence Features , 2015, Int. J. Inf. Retr. Res..

[26]  Michael D. Gordon Probabilistic and genetic algorithms in document retrieval , 1988, CACM.

[27]  Naomie Salim,et al.  Differential evolution cluster-based text summarization methods , 2013, 2013 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRICAL AND ELECTRONIC ENGINEERING (ICCEEE).

[28]  Eduard H. Hovy,et al.  Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.

[29]  Rasim M. Alguliyev,et al.  DESAMC+DocSum: Differential evolution with self-adaptive mutation and crossover parameters for multi-document summarization , 2012, Knowl. Based Syst..

[30]  Ashish Kumar Bhandari,et al.  Cuckoo search algorithm and wind driven optimization based study of satellite image segmentation for multilevel thresholding using Kapur's entropy , 2014, Expert Syst. Appl..

[31]  Xiaojun Wan,et al.  Exploiting neighborhood knowledge for single document summarization and keyphrase extraction , 2010, TOIS.

[32]  Rasim M. Alguliyev,et al.  Sentence selection for generic document summarization using an adaptive differential evolution algorithm , 2011, Swarm Evol. Comput..

[33]  Xin-She Yang,et al.  Engineering optimisation by cuckoo search , 2010 .

[34]  Tarun Kumar Rawat,et al.  Optimal design of FIR fractional order differentiator using cuckoo search algorithm , 2015, Expert Syst. Appl..

[35]  Aouatif Amine,et al.  A hybrid mobile object tracker based on the modified Cuckoo Search algorithm and the Kalman Filter , 2014, Pattern Recognit..

[36]  Thang Trung Nguyen,et al.  Modified cuckoo search algorithm for short-term hydrothermal scheduling , 2015 .

[37]  Seif-Eddeen K. Fateen,et al.  A note on effective phase stability calculations using a Gradient Based Cuckoo Search algorithm , 2014 .

[38]  Lalit Chandra Saikia,et al.  Comparison of performances of several FACTS devices using Cuckoo search algorithm optimized 2DOF controllers in multi-area AGC , 2015 .

[39]  Dong-Hong Ji,et al.  MSBGA: A Multi-Document Summarization System Based on Genetic Algorithm , 2006, 2006 International Conference on Machine Learning and Cybernetics.

[40]  Xin-She Yang,et al.  Cuckoo search: recent advances and applications , 2013, Neural Computing and Applications.

[41]  Josef Steinberger,et al.  Automatic Text Summarization (The state of the art 2007 and new challenges) , 2008 .

[42]  Rutuparna Panda,et al.  Edge magnitude based multilevel thresholding using Cuckoo search technique , 2013, Expert Syst. Appl..

[43]  Mhamed Zineddine,et al.  Vulnerabilities and mitigation techniques toning in the cloud: A cost and vulnerabilities coverage optimization approach using Cuckoo search algorithm with Lévy flights , 2015, Comput. Secur..

[44]  Xin-She Yang,et al.  Cuckoo Search via Lévy flights , 2009, 2009 World Congress on Nature & Biologically Inspired Computing (NaBIC).

[45]  Fuji Ren,et al.  GA, MR, FFNN, PNN and GMM based models for automatic text summarization , 2009, Comput. Speech Lang..

[46]  Jianzhou Wang,et al.  Forecasting solar radiation using an optimized hybrid model by Cuckoo Search algorithm , 2015 .

[47]  Rakesh Chandra Balabantaray,et al.  Comparative Study of DE and PSO over Document Summarization , 2015 .

[48]  Azlan Mohd Zain,et al.  Cuckoo Search Algorithm for Optimization Problems—A Literature Review and its Applications , 2014, Appl. Artif. Intell..

[49]  Seif-Eddeen K. Fateen,et al.  Cuckoo Search: A new nature-inspired optimization method for phase equilibrium calculations , 2013 .

[50]  Félix de Moya Anegón,et al.  A GA-P algorithm to automatically formulate extended Boolean queries for a fuzzy information retrieval system , 2000 .

[51]  Jagadeesh Kondru Using part of speech structure of text in the prediction of its readability , 2006 .

[52]  Elizabeth León Guzman,et al.  Extractive single-document summarization based on genetic operators and guided local search , 2014, Expert Syst. Appl..

[53]  Ramiz M. Aliguliyev,et al.  A new sentence similarity measure and sentence based extractive technique for automatic text summarization , 2009, Expert Syst. Appl..

[54]  Bestoun S. Ahmed,et al.  Achievement of minimized combinatorial test suite for configuration-aware software functional testing using the Cuckoo Search algorithm , 2015, Inf. Softw. Technol..

[55]  Elena Lloret,et al.  Text summarisation in progress: a literature review , 2011, Artificial Intelligence Review.

[56]  Almoataz Y. Abdelaziz,et al.  Cuckoo Search algorithm based load frequency controller design for nonlinear interconnected power system , 2015 .

[57]  Anh Viet Truong,et al.  Distribution network reconfiguration for power loss minimization and voltage profile improvement using cuckoo search algorithm , 2015 .

[58]  Mostafa Zamanian,et al.  Readability of Texts: State of the Art , 2012 .

[59]  Rasim M. Alguliyev,et al.  CDDS: Constraint-driven document summarization models , 2013, Expert Syst. Appl..

[60]  Ramasamy Alagirusamy,et al.  Performance analysis and feasibility study of ant colony optimization, particle swarm optimization and cuckoo search algorithms for inverse heat transfer problems , 2015 .