Web Pages Classification with Parliamentary Optimization Algorithm

In recent years, data on the Internet has grown exponentially, attaining enormous dimensions. This situation makes it difficult to obtain useful information from such data. Web mining is the process of using data mining techniques such as association rules, classification, clustering, and statistics to discover and extract information from Web documents. Optimization algorithms play an important role in such techniques. In this work, the parliamentary optimization algorithm (POA), which is one of the latest social-based metaheuristic algorithms, has been adopted for Web page classification. Two different data sets (Course and Student) were selected for experimental evaluation, and HTML tags were used as features. The data sets were tested using different classification algorithms implemented in WEKA, and the results were compared with those of the POA. The POA was found to yield promising results compared to the other algorithms. This study is the first to propose the POA for effective Web page classifica...

[1]  Lu Liu,et al.  Clustering-Based Topical Web Crawling for Topic-Specific Information Retrieval Guided by Incremental Classifier , 2015, Int. J. Softw. Eng. Knowl. Eng..

[2]  B. Alatas,et al.  Overlapping Community Detection in Social Networks Using Parliamentary Optimization Algorithm , 2015 .

[3]  Selma Ayse Ozel,et al.  Web page classification using firefly optimization , 2013, 2013 IEEE INISTA.

[4]  Selma Ayse Ozel A Web page classification system based on a genetic algorithm using tagged-terms as features , 2011 .

[5]  Reha Uzsoy,et al.  Experimental Evaluation of Heuristic Optimization Algorithms: A Tutorial , 2001, J. Heuristics.

[6]  Rafael Corchuelo,et al.  CALA: An unsupervised URL-based web page classification system , 2014, Knowl. Based Syst..

[7]  Jie Qin,et al.  A Web Page Classification Algorithm Based on Link Information , 2011, 2011 10th International Symposium on Distributed Computing and Applications to Business, Engineering and Science.

[8]  A. Borji,et al.  A NEW APPROACH TO GLOBAL OPTIMIZATION MOTIVATED BY PARLIAMENTARY POLITICAL COMPETITIONS , 2008 .

[9]  Ahmet Bedri Özer,et al.  CIDE: Chaotically Initialized Differential Evolution , 2010, Expert Syst. Appl..

[10]  Bilal Alatas,et al.  ACROA: Artificial Chemical Reaction Optimization Algorithm for global optimization , 2011, Expert Syst. Appl..

[11]  Alex Alves Freitas,et al.  Web Page Classification with an Ant Colony Algorithm , 2004, PPSN.

[12]  Kin Fun Li,et al.  On-Chip Hardware Support for Similarity Measures , 2007, 2007 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing.

[13]  Ali Borji,et al.  A New Global Optimization Algorithm Inspired by Parliamentary Political Competitions , 2007, MICAI.

[14]  Christian Blum,et al.  Metaheuristics in combinatorial optimization: Overview and conceptual comparison , 2003, CSUR.

[15]  Bilal Alatas,et al.  Automatic Mining of Numerical Classification Rules with Parliamentary Optimization Algorithm , 2015 .

[16]  Erhan Akin,et al.  Mining Fuzzy Classification Rules Using an Artificial Immune System with Boosting , 2005, ADBIS.

[17]  Junghoo Cho,et al.  A fast regular expression indexing engine , 2002, Proceedings 18th International Conference on Data Engineering.

[18]  Juan Carlos Gomez,et al.  PCA document reconstruction for email classification , 2012, Comput. Stat. Data Anal..

[19]  Shrish Verma,et al.  A Comparative Study of Bug Classification Algorithms , 2014, Int. J. Softw. Eng. Knowl. Eng..

[20]  Li Xiaoping,et al.  Mixture Models for Web Page Classification , 2012 .

[21]  Oren Etzioni,et al.  The World-Wide Web: quagmire or gold mine? , 1996, CACM.

[22]  Damianos Gavalas,et al.  An Effective fuzzy Clustering Algorithm for Web Document Classification: a Case Study in Cultural Content Mining , 2013, Int. J. Softw. Eng. Knowl. Eng..