Advances in Meta-Heuristic Optimization Algorithms in Big Data Text Clustering

This paper presents a comprehensive survey of the meta-heuristic optimization algorithms on the text clustering applications and highlights its main procedures. These Artificial Intelligence (AI) algorithms are recognized as promising swarm intelligence methods due to their successful ability to solve machine learning problems, especially text clustering problems. This paper reviews all of the relevant literature on meta-heuristic-based text clustering applications, including many variants, such as basic, modified, hybridized, and multi-objective methods. As well, the main procedures of text clustering and critical discussions are given. Hence, this review reports its advantages and disadvantages and recommends potential future research paths. The main keywords that have been considered in this paper are text, clustering, meta-heuristic, optimization, and algorithm.

[1]  Zahir Tari,et al.  A Survey of Clustering Algorithms for Big Data: Taxonomy and Empirical Analysis , 2014, IEEE Transactions on Emerging Topics in Computing.

[2]  Nicoletta Del Buono,et al.  Non-negative Matrix Tri-Factorization for co-clustering: An analysis of the block matrix , 2015, Inf. Sci..

[3]  Ahmad M. Khasawneh,et al.  Feature selection method using improved CHI Square on Arabic text classifiers: analysis and application , 2020, Multimedia Tools and Applications.

[4]  Cong Wang,et al.  Data clustering using bacterial foraging optimization , 2011, Journal of Intelligent Information Systems.

[5]  Soon Cheol Park,et al.  Less-redundant Text Summarization Using Ensemble Clustering Algorithm Based on GA and PSO , 2017 .

[6]  Anupam Joshi,et al.  Low-complexity fuzzy relational clustering algorithms for Web mining , 2001, IEEE Trans. Fuzzy Syst..

[7]  Jagatheeshkumar Gopal,et al.  Text Clustering Algorithm Using Fuzzy Whale Optimization Algorithm , 2019 .

[8]  Jian Zhuang,et al.  Novel soft subspace clustering with multi-objective evolutionary approach for high-dimensional data , 2013, Pattern Recognit..

[9]  Scott J. Peters,et al.  Schoolwide Mathematics Achievement Within the Gifted Cluster Grouping Model , 2012 .

[10]  Lemin Li,et al.  High performance genetic algorithm based text clustering using parts of speech and outlier elimination , 2012, Applied Intelligence.

[11]  Salina Dangol,et al.  Analysis of Document Clustering Using K-means Algorithm with Cosine Similarity for Large Scale Text Documents With and Without Hadoop , 2016 .

[12]  Ron Shamir,et al.  Clustering Gene Expression Patterns , 1999, J. Comput. Biol..

[13]  Wang Chun-hong,et al.  Research on the text clustering algorithm based on latent semantic analysis and optimization , 2011, 2011 IEEE International Conference on Computer Science and Automation Engineering.

[14]  Suresh Kumar,et al.  An Extensive Study of Similarity and Dissimilarity Measures Used for Text Document Clustering using K-means Algorithm , 2018 .

[15]  Beakcheol Jang,et al.  Bi-LSTM Model to Increase Accuracy in Text Classification: Combining Word2vec CNN and Attention Mechanism , 2020, Applied Sciences.

[16]  Chao Shen,et al.  Sentiment word co-occurrence and knowledge pair feature extraction based LDA short text clustering algorithm , 2020, Journal of Intelligent Information Systems.

[17]  Gengui Zhou,et al.  A Clustering Algorithm for Chinese Text Based on SOM Neural Network and Density , 2005, ISNN.

[18]  Bo Xing,et al.  Gravitational Search Algorithm , 2014 .

[19]  R. Janani,et al.  Text document clustering using Spectral Clustering algorithm with Particle Swarm Optimization , 2019, Expert Syst. Appl..

[20]  Nur Evin Özdemirel,et al.  Ant Colony Optimization based clustering methodology , 2015, Appl. Soft Comput..

[21]  Tansel Özyer,et al.  Parallel clustering of high dimensional data by integrating multi-objective genetic algorithm with divide and conquer , 2009, Applied Intelligence.

[22]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[23]  Angel Cobo,et al.  Document Management with Ant Colony Optimization Metaheuristic: A Fuzzy Text Clustering Approach Using Pheromone Trails , 2011 .

[24]  Laith Mohammad Abualigah,et al.  An Improved B-hill Climbing Optimization Technique for Solving the Text Documents Clustering Problem. , 2020, Current medical imaging.

[25]  Jamuna Kanta Sing,et al.  Local contextual information and Gaussian function induced fuzzy clustering algorithm for brain MR image segmentation and intensity inhomogeneity estimation , 2018, Appl. Soft Comput..

[26]  G. Sahoo,et al.  A hybrid approach using genetic algorithm and the differential evolution heuristic for enhanced initialization of the k-means algorithm with applications in text clustering , 2018, Soft Comput..

[27]  G. Loshma SEMANTIC ANALYSIS BASED TEXT CLUSTERING BY THE FUSION OF BISECTING K-MEANS AND UPGMA ALGORITHM , 2016 .

[28]  D. Sornette,et al.  Apparent clustering and apparent background earthquakes biased by undetected seismicity , 2005, physics/0501049.

[29]  Nishant Agarwal A Real-time Temporal Clustering Algorithm for short text, and its applications , 2017 .

[30]  Hui Wang,et al.  Design and Application of a Text Clustering Algorithm Based on Parallelized K-Means Clustering , 2019, Rev. d'Intelligence Artif..

[31]  Bin He,et al.  A Text Clustering Method Based on Two-Dimensional OTSU and PSO Algorithm , 2009, 2009 International Symposium on Computer Network and Multimedia Technology.

[32]  Laith Mohammad Abualigah,et al.  Hybrid clustering analysis using improved krill herd algorithm , 2018, Applied Intelligence.

[33]  Arindam Roy,et al.  A Comparative Analysis of Particle Swarm Optimization and K-means Algorithm For Text Clustering Using Nepali Wordnet , 2014 .

[34]  Jing Liu,et al.  A multi-objective memetic algorithm based on decomposition for big optimization problems , 2016, Memetic Comput..

[35]  Essam Said Hanandeh,et al.  A novel hybridization strategy for krill herd algorithm applied to clustering techniques , 2017, Appl. Soft Comput..

[36]  Majid Sarrafzadeh,et al.  Optimal Energy Aware Clustering in Sensor Networks , 2002 .

[37]  M. K. Tiwari,et al.  Clustering Indian stock market data for portfolio management , 2010, Expert Syst. Appl..

[38]  Youjin Rong,et al.  Staged text clustering algorithm based on K-means and hierarchical agglomeration clustering , 2020, 2020 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA).

[39]  Yanchun Liang,et al.  An incremental affinity propagation algorithm and its applications for text clustering , 2009, 2009 International Joint Conference on Neural Networks.

[40]  Aishan Wumaier,et al.  Study and Implementing K-mean Clustering Algorithm on English Text and Techniques to Find the Optimal Value of K , 2018, International Journal of Computer Applications.

[41]  Guillermo Ricardo Simari,et al.  Non-commercial Research and Educational Use including without Limitation Use in Instruction at Your Institution, Sending It to Specific Colleagues That You Know, and Providing a Copy to Your Institution's Administrator. All Other Uses, Reproduction and Distribution, including without Limitation Comm , 2022 .

[42]  Pramod Kumar Singh,et al.  A three-stage unsupervised dimension reduction method for text clustering , 2014, J. Comput. Sci..

[43]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[44]  Alex Alves Freitas,et al.  A Survey of Evolutionary Algorithms for Clustering , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[45]  Vivek Kumar Singh,et al.  Document Clustering Using K-Means, Heuristic K-Means and Fuzzy C-Means , 2011, 2011 International Conference on Computational Intelligence and Communication Networks.

[46]  Dervis Karaboga,et al.  A novel clustering approach: Artificial Bee Colony (ABC) algorithm , 2011, Appl. Soft Comput..

[47]  Alhareth Mohammed Abu Hussein,et al.  Sentiment Analysis in Healthcare: A Brief Review , 2019 .

[48]  Turdi Tohti Combined algorithm of GAAC and K-means for Uyghur text clustering , 2013 .

[49]  Yuefeng Li,et al.  Effective Pattern Discovery for Text Mining , 2012, IEEE Transactions on Knowledge and Data Engineering.

[50]  Martin J. Oates,et al.  PESA-II: region-based selection in evolutionary multiobjective optimization , 2001 .

[51]  Spyros Sioutas,et al.  CSMR: A Scalable Algorithm for Text Clustering with Cosine Similarity and MapReduce , 2014, AIAI Workshops.

[52]  Nadjet Kamel,et al.  High-Dimensional Text Datasets Clustering Algorithm Based on Cuckoo Search and Latent Semantic Indexing , 2018, J. Inf. Knowl. Manag..

[53]  Laith Mohammad Abualigah,et al.  Unsupervised text feature selection technique based on hybrid particle swarm optimization algorithm with genetic operators for the text clustering , 2017, The Journal of Supercomputing.

[54]  M. Ali Fauzi,et al.  Optimizing K-means text document clustering using latent semantic indexing and pillar algorithm , 2017, 2017 5th International Symposium on Computational and Business Intelligence (ISCBI).

[55]  Mohammed Azmi Al-Betar,et al.  Feature Selection with β-Hill Climbing Search for Text Clustering Application , 2017, 2017 Palestinian International Conference on Information and Communication Technology (PICICT).

[56]  Meghana P. Lokhande,et al.  TEXT SUMMARIZATION USING HIERARCHICAL CLUSTERING ALGORITHM AND EXPECTATION MAXIMIZATION CLUSTERING ALGORITHM , 2015 .

[57]  Mehran Sahami,et al.  Text Mining: Classification, Clustering, and Applications , 2009 .

[58]  Laith Mohammad Abualigah,et al.  APPLYING GENETIC ALGORITHMS TO INFORMATION RETRIEVAL USING VECTOR SPACE MODEL , 2015 .

[59]  Joydeep Ghosh,et al.  Under Consideration for Publication in Knowledge and Information Systems Generative Model-based Document Clustering: a Comparative Study , 2003 .

[60]  Jonathan Timmis,et al.  A resource limited artificial immune system for data analysis , 2001, Knowl. Based Syst..

[61]  Xiangwei Liu,et al.  An Improved K-Means Text Clustering Algorithm Based on Local Search , 2008, 2008 4th International Conference on Wireless Communications, Networking and Mobile Computing.

[62]  Rizwan Patan,et al.  Optimization of Routing-Based Clustering Approaches in Wireless Sensor Network: Review and Open Research Issues , 2020, Electronics.

[63]  Maosheng Huang,et al.  Study on Chinese Text Clustering Algorithm Based on K-mean and Evaluation Method on Effect of Clustering for Software-intensive System , 2020, 2020 International Conference on Computer Engineering and Application (ICCEA).

[64]  Dong Yue-hu Text clustering algorithm with improved weighting factor and feature vector , 2015 .

[65]  Aboul Ella Hassanien,et al.  Bat Algorithm (BA) , 2015, Swarm Intelligence.

[66]  Hui-hui Liu,et al.  Research and Application of Improved K-means Algorithm in Text Clustering , 2018 .

[67]  K. Arun Prabha,et al.  Improved Particle Swarm Optimization Based K-Means Clustering , 2014, 2014 International Conference on Intelligent Computing Applications.

[68]  Alok Chakrabarty,et al.  Text Clustering using a WordNet-based Knowledge-Base and the Lesk Algorithm , 2012 .

[69]  B. Janet,et al.  Clustering Quality Improvement using a hybrid Social spider optimization , 2017 .

[70]  Pramod Kumar Singh,et al.  Chaotic gradient artificial bee colony for text clustering , 2016, Soft Comput..

[71]  Osama Abedl Fattah Ghanem Evaluating the Effect of Preprocessing in Arabic Documents Clustering , 2014 .

[72]  Seyed Abolghasem Mirroshandel,et al.  A novel combinatorial merge-split approach for automatic clustering using imperialist competitive algorithm , 2019, Expert Syst. Appl..

[73]  Li Xinwu Research on Text Clustering Algorithm Based on K_means and SOM , 2008, 2008 International Symposium on Intelligent Information Technology Application Workshops.

[74]  Vivek Sharma,et al.  Multi-label text categorization based on feature optimization using ant colony optimization and relevance clustering technique , 2015, 2015 International Conference on Computers, Communications, and Systems (ICCCS).

[75]  Mohammed Azmi Al-Betar,et al.  A krill herd algorithm for efficient text documents clustering , 2016, 2016 IEEE Symposium on Computer Applications & Industrial Electronics (ISCAIE).

[76]  Hassan Abolhassani,et al.  Harmony K-means algorithm for document clustering , 2009, Data Mining and Knowledge Discovery.

[77]  Z. Zenn Bien,et al.  Effective learning system techniques for human-robot interaction in service environment , 2007, Knowl. Based Syst..

[78]  Wei Song,et al.  Genetic algorithm for text clustering using ontology and evaluating the validity of various semantic similarity measures , 2009, Expert Syst. Appl..

[79]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[80]  Mohammed Azmi Al-Betar,et al.  Gene selection for cancer classification by combining minimum redundancy maximum relevancy and bat-inspired algorithm , 2017, Int. J. Data Min. Bioinform..

[81]  Malik Braik,et al.  A Grey Wolf Optimizer for Text Document Clustering , 2018, J. Intell. Syst..

[82]  An Enhanced Image Retrieval Using K-Mean Clustering Algorithm in Integrating Text and Visual Features , 2014 .

[83]  Frank Hoeppner,et al.  Fuzzy shell clustering algorithms in image processing: fuzzy C-rectangular and 2-rectangular shells , 1997, IEEE Trans. Fuzzy Syst..

[84]  Nilanjan Dey,et al.  MEDLINE Text Mining: An Enhancement Genetic Algorithm Based Approach for Document Clustering , 2016, Applications of Intelligent Optimization in Biology and Medicine.

[85]  Ahmad M. Khasawneh,et al.  A parallel hybrid krill herd algorithm for feature selection , 2020, Int. J. Mach. Learn. Cybern..

[86]  Weiguo Sheng,et al.  A Niching Memetic Algorithm for Simultaneous Clustering and Feature Selection , 2008, IEEE Transactions on Knowledge and Data Engineering.

[87]  Sung-Sam Hong,et al.  The Feature Selection Method based on Genetic Algorithm for Efficient of Text Clustering and Text Classification , 2015 .

[88]  K. R. Harrigan An Application of Clustering for Strategic Group Analysis , 1985 .

[89]  Laith Mohammad Abualigah,et al.  A Novel Weighting Scheme Applied to Improve the Text Document Clustering Techniques , 2018 .

[90]  Ahmad Taher Azar,et al.  A novel hybrid feature selection method based on rough set and improved harmony search , 2015, Neural Computing and Applications.

[91]  Andrew Lewis,et al.  The Whale Optimization Algorithm , 2016, Adv. Eng. Softw..

[92]  Mehrnoush Shamsfard,et al.  An improved bee colony optimization algorithm with an application to document clustering , 2015, Neurocomputing.

[93]  Sam Kwong,et al.  Semi-Supervised Non-Negative Matrix Factorization With Dissimilarity and Similarity Regularization , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[94]  Pramod Kumar H. Kulkarni,et al.  Multipath data transmission in WSN using exponential cat swarm and fuzzy optimisation , 2019, IET Communications.

[95]  Mingyan Jiang,et al.  Novel Clustering Algorithms Based on Improved Artificial Fish Swarm Algorithm , 2009, 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery.

[96]  Divya D. Dev,et al.  A NOVEL APPROACH FOR TEXT CLUSTERING USING MUST LINK AND CANNOT LINK ALGORITHM , 2014 .

[97]  Ganapati Panda,et al.  A survey on nature inspired metaheuristic algorithms for partitional clustering , 2014, Swarm Evol. Comput..

[98]  Ali Diabat,et al.  A Comprehensive Survey of the Harmony Search Algorithm in Clustering Applications , 2020, Applied Sciences.

[99]  De Vries,et al.  Document clustering algorithms, representations and evaluation for information retrieval , 2014 .

[100]  Lipeng Yang,et al.  A Text Clustering Algorithm based on Weeds and Differential Optimization , 2016 .

[101]  Omprakash Kaiwartya,et al.  Green Computing in Underwater Wireless Sensor Networks Pressure Centric Energy Modeling , 2020, IEEE Systems Journal.

[102]  Husniza Husni,et al.  GF-CLUST: A nature-inspired algorithm for automatic text clustering , 2016 .

[103]  Sheng Chen,et al.  A clustering technique for digital communications channel equalization using radial basis function networks , 1993, IEEE Trans. Neural Networks.

[104]  Mohammad Reza Meybodi,et al.  Efficient stochastic algorithms for document clustering , 2013, Inf. Sci..

[105]  Emanuel Falkenauer,et al.  Genetic Algorithms and Grouping Problems , 1998 .

[106]  Wang XiaohuaWang Rongbo Lu Guoli Text Clustering Research on the Max Term Contribution Dimension Reduction and Simulated Annealing Algorithm , 2008 .

[107]  Mohammed Azmi Al-Betar,et al.  Text feature selection with a robust weight scheme and dynamic dimension reduction to text document clustering , 2017, Expert Syst. Appl..

[108]  Victor J. Rayward-Smith,et al.  Metaheuristics for clustering in KDD , 2005, 2005 IEEE Congress on Evolutionary Computation.

[109]  K. Srinivasa Rao,et al.  Text Independent Speaker Identification with Finite Multivariate Generalized Gaussian Mixture Model and Hierarchical Clustering Algorithm , 2010 .

[110]  Li Cui-xia Study and Simulation of Text Clustering Using Attribute Weighted Fuzzy C-means Algorithm , 2011 .

[111]  Li Wang,et al.  Improved Text Clustering Algorithm and Application in Microblogging Public Opinion Analysis , 2013, 2013 Fourth World Congress on Software Engineering.

[112]  N. Sandhya,et al.  Particle Grey Wolf Optimizer (PGWO) Algorithm and Semantic Word Processing for Automatic Text Clustering , 2019, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[113]  Wilfrido Gómez-Flores,et al.  Automatic clustering using nature-inspired metaheuristics: A survey , 2016, Appl. Soft Comput..

[114]  S. P. Rajagopalan,et al.  Hybridizing Gray Wolf Optimization (GWO) with Grasshopper Optimization Algorithm (GOA) for text feature selection and clustering , 2020, Appl. Soft Comput..

[115]  Hossein Nezamabadi-pour,et al.  GSA: A Gravitational Search Algorithm , 2009, Inf. Sci..

[116]  Amir H. Gandomi,et al.  The Arithmetic Optimization Algorithm , 2021, Computer Methods in Applied Mechanics and Engineering.

[117]  Dino Isa,et al.  An enhanced Support Vector Machine classification framework by using Euclidean distance function for text document categorization , 2011, Applied Intelligence.

[118]  Amir Hossein Alavi,et al.  Krill herd: A new bio-inspired optimization algorithm , 2012 .

[119]  Xiao Wang,et al.  Research on a New Automatic Generation Algorithm of Concept Map Based on Text Clustering and Association Rules Mining , 2018, ICIC.

[120]  C. A. Murthy,et al.  A similarity assessment technique for effective grouping of documents , 2015, Inf. Sci..

[121]  Chao Shen,et al.  BTM and GloVe Similarity Linear Fusion-Based Short Text Clustering Algorithm for Microblog Hot Topic Discovery , 2020, IEEE Access.

[122]  Husniza Husni,et al.  Integrated bisect K-means and firefly algorithm for hierarchical text clustering , 2016 .

[123]  Laurence T. Yang,et al.  PPHOPCM: Privacy-Preserving High-Order Possibilistic c-Means Algorithm for Big Data Clustering with Cloud Computing , 2017, IEEE Transactions on Big Data.

[124]  Wang Yong-gu Hybrid text clustering algorithm based on dual particle swarm optimization and K-means algorithm , 2014 .

[125]  Laith Mohammad Abualigah,et al.  A new feature selection method to improve the document clustering using particle swarm optimization algorithm , 2017, J. Comput. Sci..

[126]  Muhammad Ihsan Jambak,et al.  Comparison of dimensional reduction using the Singular Value Decomposition Algorithm and the Self Organizing Map Algorithm in clustering result of text documents , 2019, IOP Conference Series: Materials Science and Engineering.

[127]  L. Abualigah,et al.  MRMR BA : A HYBRID GENE SELECTION ALGORITHM FOR CANCER CLASSIFICATION , 2017 .

[128]  Jia Shi-jie Text Clustering Algorithm Based on Ant Colony Algorithm , 2010 .

[129]  Laurence T. Yang,et al.  High-order possibilistic c-means algorithms based on tensor decompositions for big data in IoT , 2018, Inf. Fusion.

[130]  Anima Naik,et al.  Data Clustering Based on Teaching-Learning-Based Optimization , 2011, SEMCCO.

[131]  Naitong Zhang,et al.  Improved GA-based text clustering algorithm , 2011, 2011 4th IEEE International Conference on Broadband Network and Multimedia Technology.

[132]  Shengwu Xiong,et al.  Multiobjective big data optimization based on a hybrid salp swarm algorithm and differential evolution , 2020 .

[133]  Kehua Yang,et al.  Research and application of MapReduce-based MST text clustering algorithm , 2012, 2012 IEEE International Conference on Information Science and Technology.

[134]  Abdelaziz Bouroumi,et al.  A multipopulation cultural algorithm using fuzzy clustering , 2007, Appl. Soft Comput..

[135]  M. Narasimha Murty,et al.  Clustering with evolution strategies , 1994, Pattern Recognit..

[136]  Laith Mohammad Abualigah,et al.  A novel bat algorithm with dynamic membrane structure for optimization problems , 2020, Applied Intelligence.

[137]  Fu Zhi-chao Study on Text Categorization Based on Genetic Algorithm and Fuzzy Clustering , 2009 .

[138]  Mina Mirhosseini A clustering approach using a combination of gravitational search algorithm and k-harmonic means and its application in text document clustering , 2017 .

[139]  Neha Garg,et al.  Performance Evaluation of New Text Mining Method Based on GA and K -Means Clustering Algorithm , 2018 .

[140]  Mohammad Shehab,et al.  Text Summarization: A Brief Review , 2019, Studies in Computational Intelligence.

[141]  Jiayin Kang,et al.  Combination of Fuzzy C-Means and Particle Swarm Optimization for Text Document Clustering , 2012 .

[142]  J. Li,et al.  Market Segmentation by Travel Motivations under a Transforming Economy: Evidence from the Monte Carlo of the Orient , 2018, Sustainability.

[143]  Laith Abualigah,et al.  Improved binary gray wolf optimizer and SVM for intrusion detection system in wireless sensor networks , 2020, Journal of Ambient Intelligence and Humanized Computing.

[144]  Licheng Jiao,et al.  A granular agent evolutionary algorithm for classification , 2011, Appl. Soft Comput..

[145]  Laith Mohammad Abualigah,et al.  A combination of objective functions and hybrid Krill herd algorithm for text document clustering analysis , 2018, Eng. Appl. Artif. Intell..

[146]  Pramod Kumar Singh,et al.  Hybrid dimension reduction by integrating feature selection with feature extraction method for text clustering , 2015, Expert Syst. Appl..

[147]  Recent Advances in NLP: The Case of Arabic Language , 2020, Studies in Computational Intelligence.

[148]  Lei Chen A novel clustering algorithm for large-scale text collection and its incremental version , 2016, Inf. Technol. Control..

[149]  Ji-Wei Wu,et al.  A hybrid linear text segmentation algorithm using hierarchical agglomerative clustering and discrete particle swarm optimization , 2014, Integr. Comput. Aided Eng..

[150]  Rana Forsati,et al.  Web Text Mining Using Harmony Search , 2010, Recent Advances In Harmony Search Algorithm.

[151]  B. Janet,et al.  A social spider optimization approach for clustering text documents , 2016, 2016 2nd International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB).

[152]  Yongping Huang,et al.  A Text Classification Algorithm Based on Rocchio and Hierarchical Clustering , 2011, ICIC.

[153]  Mohammed Azmi Al-Betar,et al.  Multi-objectives-based text clustering technique using K-mean algorithm , 2016, 2016 7th International Conference on Computer Science and Information Technology (CSIT).

[155]  Xu Jia-nin The Two-stage Text Clustering Algorithm Based on K-mesans and aiNet , 2009 .

[156]  Laith Mohammad Abualigah,et al.  Feature Selection and Enhanced Krill Herd Algorithm for Text Document Clustering , 2018, Studies in Computational Intelligence.

[157]  Mohammed Azmi Al-Betar,et al.  Unsupervised Text Feature Selection Technique Based on Particle Swarm Optimization Algorithm for Improving the Text Clustering , 2017 .

[158]  Lalit Kumar,et al.  A novel hybrid BPSO–SCA approach for feature selection , 2019, Natural Computing.

[159]  Pramod Kumar Singh,et al.  Opposition chaotic fitness mutation based adaptive inertia weight BPSO for feature selection in text clustering , 2016, Appl. Soft Comput..

[160]  Yuliang Shi,et al.  Research on Hadoop-based massive short text clustering algorithm , 2019, International Workshop on Pattern Recognition.