User Behavior Analysis Using Alignment Based Grammatical Inference from Web Server Access Log

Application of data mining technique to the World Wide Web refers to as Web mining. Web based origination collects large volume of data for their operation. Analysis of such data can help the organization for better working (Marketing strategy, services, evaluation of effectiveness, promotional campaigns etc). This type of analysis require discovery of meaningful relationships from the large collection of primarily unstructured data stored in Web server access logs. We propose a new approach for automatically learning (context-free) grammar rules form server access log text (positive set) samples, based on the alignments between the sentences. Our approach works on pairs of unstructured sentences that have one or more words common.

[1]  Yasubumi Sakakibara,et al.  Learning context-free grammars from structural data in polynomial time , 1988, COLT '88.

[2]  Yasubumi Sakakibara,et al.  Learning context-free grammars using tabular representations , 2005, Pattern Recognit..

[3]  Menno van Zaanen Implementing Alignment-Based Learning , 2002, ICGI.

[4]  Jaideep Srivastava,et al.  Web mining: information and pattern discovery on the World Wide Web , 1997, Proceedings Ninth IEEE International Conference on Tools with Artificial Intelligence.

[5]  Narendra S. Chaudhari,et al.  Incremental Discovery of Sequential Pattern from Semi-structured Document Using Grammatical Inference , 2012, ICDCIT.

[6]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[7]  Yasubumi Sakakibara,et al.  Simple recurrent networks as generalized hidden Markov models with distributed representations , 1995, Proceedings of ICNN'95 - International Conference on Neural Networks.

[8]  Enrique Vidal,et al.  What Is the Search Space of the Regular Inference? , 1994, ICGI.

[9]  Ulf Leser,et al.  High-performance information extraction with AliBaba , 2009, EDBT '09.

[10]  Narendra S. Chaudhari,et al.  Language Structure Using Fuzzy Similarity , 2009, IEEE Transactions on Fuzzy Systems.

[11]  E. Mark Gold,et al.  Language Identification in the Limit , 1967, Inf. Control..

[12]  Dana Angluin,et al.  Queries and concept learning , 1988, Machine Learning.

[13]  L. De Raedt,et al.  Logical Hidden Markov Models , 2011, J. Artif. Intell. Res..

[14]  Pieter W. Adriaans,et al.  The EMILE 4.1 Grammar Induction Toolbox , 2002, ICGI.