Improving Customer Relationship Management throughIntegrated Mining of Heterogeneous Data

The volume of information available on the Internet and corporate intranets continues to increase along with the corresponding increase in the data (structured and unstructured) stored by many organizations. In customer relationship management, information is the raw material for decision making. For this to be effective, there is need to discover knowledge from the seamless integration of structured and unstructured data for completeness and comprehensiveness which is the main focus of this paper. In the integration process, the structured component is selected based on the resulting keywords from the unstructured text preprocessing process, and association rules is generated based on the modified GARW (Generating Association Rules Based on Weighting Scheme) Algorithm. The main contribution of this technique is that the unstructured component of the integration is based on Information retrieval technique which is based on content similarity of XML (Extensible Markup Language) document. This similarity is based on the combination of syntactic and semantic relevance. Experiments carried out revealed that the extracted association rules contain important features which form a worthy platform for making effective decisions as regards customer relationship management. The performance of the integration approach is also compared with a similar approach which uses just syntactic relevance in its information extraction process to reveal a significant reduction in the large itemsets and execution time. This leads to reduction in rules generated to more interesting ones due to the semantic clustering of XML documents introduced into the improved integrated mining technique.

[1]  Sergio Greco,et al.  Semantic clustering of XML documents , 2010, TOIS.

[2]  Gerhard Weikum,et al.  The Index-Based XXL Search Engine for Querying XML Data with Relevance Ranking , 2002, EDBT.

[3]  Raymond D. Frost,et al.  E-marketing , 2005 .

[4]  Fawzy A. Torkey,et al.  A Text Mining Technique Using Association Rules Extraction , 2008 .

[5]  PREDICTIVE DATA MINING ON WEB-BASED E-COMMERCE STORE , 2002 .

[6]  Ophir Frieder,et al.  Integrating structured data and text: a relational approach , 1997 .

[7]  Watson,et al.  Integrating Structured and Unstructured Data Using Text Tagging and Annotation 8 , 2006 .

[8]  Jayram S. Thathachar,et al.  AVATAR : Using text analytics to bridge the structured – unstructured divide , 2004 .

[9]  Chia-Hui Chang,et al.  Exploring Evolutionary Technical Trends from Academic Research Papers , 2010 .

[10]  W. Bruce Croft,et al.  A loosely-coupled integration of a text retrieval system and an object-oriented database system , 1992, SIGIR '92.

[11]  Vagelis Hristidis,et al.  DISCOVER: Keyword Search in Relational Databases , 2002, VLDB.

[12]  Ronen Feldman,et al.  Book Reviews: The Text Mining Handbook: Advanced Approaches to Analyzing Unstructured Data by Ronen Feldman and James Sanger , 2008, CL.

[13]  Xiaoshan Du Data Mining Analysis and Modeling for Marketing Based on Attributes of Customer Relationship , 2006 .

[14]  T. Breur,et al.  Integration of Analytical CRM in Business Processes : An Application , 2003 .

[15]  Norbert Fuhr,et al.  XIRQL: a query language for information retrieval in XML documents , 2001, SIGIR '01.

[16]  Surajit Chaudhuri,et al.  DBXplorer: a system for keyword-based search over relational databases , 2002, Proceedings 18th International Conference on Data Engineering.

[17]  W. Scott Spangler,et al.  The integration of business intelligence and knowledge management , 2002, IBM Syst. J..

[18]  Kristof Coussement,et al.  Improving Customer Complaint Management by Automatic Email Classification Using Linguistic Style Features as Predictors , 2007 .

[19]  Tetsuya Nasukawa,et al.  Text analysis and knowledge mining system , 2001, IBM Syst. J..

[20]  Michael J. Shaw,et al.  Knowledge management and data mining for marketing , 2001, Decis. Support Syst..

[21]  Ashish Verma,et al.  Enabling analysts in managed services for CRM analytics , 2009, KDD.

[22]  Un Yong Nahm and Raymond J. Mooney,et al.  Using Information Extraction to Aid the Discovery of Prediction Rules from Text , 2000 .

[23]  Chia-Hui Chang,et al.  Exploring Evolutionary Technical Trends from Academic Research Papers , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[24]  Dirk Van den Poel,et al.  Predicting customer loyalty using the internal transactional database , 2007, Expert Syst. Appl..

[25]  Cong Yu,et al.  Querying structured text in an XML database , 2003, SIGMOD '03.