Clustering-Based Learning Approach for Ant Colony Optimization Model to Simulate Web User Behavior

In this paper we propose a novel methodology for analyzing web user behavior based on session simulation by using an Ant Colony Optimization algorithm which incorporates usage, structure and content data originating from a real web site. In the first place, artificial ants learn from a clustered web user session set through the modification of a text preference vector. Then, trained ants are released through a web graph and the generated artificial sessions are compared with real usage. The main result is that the proposed model explains approximately 80% of real usage in terms of a predefined similarity measure.

[1]  V. Palade,et al.  Adaptive Web Sites - A Knowledge Extraction from Web Data Approach , 2008, Frontiers in Artificial Intelligence and Applications.

[2]  Pablo E. Román,et al.  Stochastic Simulation of Web Users , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[3]  Corso Elvezia,et al.  Ant colonies for the traveling salesman problem , 1997 .

[4]  Ajith Abraham,et al.  Web usage mining using artificial ant colony clustering and linear genetic programming , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[5]  Debashis Ganguly,et al.  A Novel Approach for Determination of Optimal Number of Cluster , 2009, 2009 International Conference on Computer and Automation Engineering.

[6]  Tony White,et al.  On How Ants Put Advertisements on the Web , 2010, IEA/AIE.

[7]  Manuel López-Ibáñez,et al.  Ant colony optimization , 2010, GECCO '10.

[8]  Dan Gusfield,et al.  Algorithms on Strings, Trees, and Sequences - Computer Science and Computational Biology , 1997 .

[9]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[10]  Chang-Chun Lin,et al.  Website reorganization using an ant colony system , 2010, Expert Syst. Appl..

[11]  Myra Spiliopoulou,et al.  A Framework for the Evaluation of Session Reconstruction Heuristics in Web-Usage Analysis , 2003, INFORMS J. Comput..

[12]  김동규,et al.  [서평]「Algorithms on Strings, Trees, and Sequences」 , 2000 .

[13]  Juan D. Velásquez,et al.  Web site keywords: A methodology for improving gradually the web site text content , 2012, Intell. Data Anal..

[14]  M Dorigo,et al.  Ant colonies for the travelling salesman problem. , 1997, Bio Systems.

[15]  Bing Liu,et al.  Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data , 2006, Data-Centric Systems and Applications.