Particle Swarm Optimization for Punjabi Text Summarization

Particle swarm optimization (PSO) algorithm is proposed to deal with text summarization for the Punjabi language. PSO is based on intelligence that predicts among a given set of solutions which is the best solution. The search is carried out by extremely high-speed particles. It updates particle position and velocity at the end of iteration so that during the development of generations, the personal best solution and global best solution are updated. Calculation within PSO is performed using fitness function which looks into various statistical and linguistic features of the Punjabi datasets. Two Punjabi datasets—monolingual Punjabi corpus from Indian Languages Corpora Initiative Phase-II and Punjabi-Hindi parallel corpus—are considered. The parallel corpus comprises 1,000 Punjabi sentences from the tourism domain while monolingual corpus contains 30,000 Punjabi sentences of the general domain. ROUGE measures evaluate summary where the highest measure, ROUGE-1, is achieved for parallel corpus with precision, recall, and F-measure as 0.7836, 0.7957, and 0.7896, respectively.

[1]  Hassan Ismail Abdalla,et al.  PSO-Based Feature Selection for Arabic Text Summarization , 2015, J. Univers. Comput. Sci..

[2]  Rahmat Budiarto,et al.  Automatic Text Summarization for Indonesian Language Using TextTeaser , 2017 .

[3]  Ahmad T. Al-Taani,et al.  Arabic Single-Document Text Summarization Using Particle Swarm Optimization Algorithm , 2017, ACLING.

[4]  Haitao Huang,et al.  Abstractive text summarization using LSTM-CNN based deep learning , 2018, Multimedia Tools and Applications.

[5]  Parminder Singh,et al.  Punjabi Dialects Conversion System for Malwai and Doabi Dialects , 2015 .

[6]  Kamaldeep Kaur,et al.  TOPIC TRACKING FOR PUNJABI LANGUAGE , 2011 .

[7]  Jing Yang,et al.  Multiobjective particle swarm community discovery arithmetic based on representation learning , 2020, Concurr. Comput. Pract. Exp..

[8]  Sanjeev Kumar Sharma Sentence Reduction for Syntactic Analysis of Compound Sentences in Punjabi Language , 2019, EAI Endorsed Trans. Scalable Inf. Syst..

[9]  Miguel Jimeno,et al.  A Tabu Search Method for Load Balancing in Fog Computing , 2018 .

[10]  Naomie Salim,et al.  Pseudo genetic and probabilistic-based feature selection method for extractive single document summarization , 2011 .

[11]  Anuja Arora,et al.  OntoHindi NER - An Ontology Based Novel Approach for Hindi Named Entity Recognition , 2018 .

[12]  Khurram Shahzad,et al.  Named Entity Recognition and Classification for Punjabi Shahmukhi , 2020, ACM Trans. Asian Low Resour. Lang. Inf. Process..

[13]  Vishal Gupta,et al.  Recent automatic text summarization techniques: a survey , 2016, Artificial Intelligence Review.

[14]  Gupta Vishal,et al.  Named Entity Recognition for Punjabi Language Text Summarization , 2011 .

[15]  Anuja Arora,et al.  Named Entity System for Tweets in Hindi Language , 2018, Int. J. Intell. Inf. Technol..

[16]  Dharam Veer Sharma,et al.  Recognition of Isolated Handwritten Characters in Gurmukhi Script , 2010 .

[17]  Laiali Almazaydeh Automatic Arabic text summarisation system (AATSS) based on morphological analysis , 2018, Int. J. Intell. Syst. Technol. Appl..

[18]  Vishal Gupta,et al.  A Novel Hybrid Text Summarization System for Punjabi Text , 2015, Cognitive Computation.

[19]  J. P. Gupta,et al.  A TENGRAM method based part-of-speech tagging of multi-category words in Hindi language , 2011, Expert Syst. Appl..

[20]  Vishal Gupta,et al.  Proposed Algorithm of Sentiment Analysis for Punjabi Text , 2014 .