Dynamic Web Document Classification in E-CRM Using Neuro-Fuzzy Approach

Internet technology enables companies to capture new customers, track their performances and online behavior, and customize communications, products, services, and price. The analysis of customers and customer interactions for electronic customer relationship management (eCRM) can be performed by data-mining (DM), optimization methods, or combined approaches. Web mining is defined as the discovery and analysis of useful information from World Wide Web (WWW). Some of web mining techniques include analyses of user access patterns, web document clustering and classification. Most existing methods of classification are based on a model that assumes a fixed-size collection of keywords or key terms with predefined set of categories. This assumption is not realistic in large and diverse document collections such as World Wide Web. The researchers here propose a new approach to obtain category-keyword sets with unknown number of categories. On the basis of the training set of Web documents, the approach is used to classify test documents into a set of initial categories. Finally evolutionary rules are applied to these new sets of keywords and training documents to update the categorykeyword sets to realize dynamic document classification.

[1]  Lihui Chen,et al.  Using Web structure and summarisation techniques for Web content mining , 2005, Inf. Process. Manag..

[2]  Zhiqiang Zheng,et al.  Personalization from incomplete data: what you don't know can hurt , 2001, KDD '01.

[3]  Jaideep Srivastava,et al.  Data Preparation for Mining World Wide Web Browsing Patterns , 1999, Knowledge and Information Systems.

[4]  Carla E. Brodley,et al.  KDD-Cup 2000 organizers' report: peeling the onion , 2000, SKDD.

[5]  Gregory Piatetsky-Shapiro,et al.  Web Content Mining , 2009, Encyclopedia of Database Systems.

[6]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[7]  C. Dussart Transformative Power of e-Business Over Consumer Brands , 2001 .

[8]  X. Drèze,et al.  To Pester or Leave Alone: Lifetime Value Maximization through Optimal Communication Timing , 2003 .

[9]  C. Theusinger,et al.  Analyzing the footsteps of your customers , 2000 .

[10]  Zhiqiang Zheng,et al.  On the Existence and Significance of Data Preprocessing Biases in Web-Usage Mining , 2003, INFORMS J. Comput..

[11]  Gediminas Adomavicius,et al.  Expert-Driven Validation of Rule-Based User Models in Personalization Applications , 2004, Data Mining and Knowledge Discovery.

[12]  David W. Corne,et al.  The BankSearch web document dataset: investigating unsupervised clustering and category similarity , 2005, J. Netw. Comput. Appl..

[13]  Robert C. Blattberg,et al.  Interactive Marketing: Exploiting the Age of Addressability , 1991 .

[14]  Chris H. Q. Ding,et al.  Web document clustering using hyperlink structures , 2001, Comput. Stat. Data Anal..

[15]  M. Shamim Khan,et al.  Web document clustering using a hybrid neural network , 2004, Appl. Soft Comput..

[16]  Krithi Ramamritham,et al.  Enabling scalable online personalization on the Web , 2000, EC '00.

[17]  Thomas E. Potok,et al.  A flocking based algorithm for document clustering analysis , 2006, J. Syst. Archit..

[18]  Ronald R. Yager,et al.  Implementing fuzzy logic controllers using a neural network framework , 1992 .

[19]  Balaji Padmanabhan,et al.  On the Use of Optimization for Data Mining: Theoretical Interactions and eCRM Opportunities , 2003, Manag. Sci..

[20]  David C. Schmittlein,et al.  Counting Your Customers: Who-Are They and What Will They Do Next? , 1987 .