Design of fusion technique-based mining engine for smart business

Keys to successful implementation of smart business require a wide spectrum of domain knowledge, experts, and their correlated experiences. Excluding those external factors—which can be collected by well-deployed sensors—being aware of user (or consumer) has the highest priority on the to-do-list. The more user is understood, the more user can be satisfied from an intuitive point of view, and thus, data plays a rather essential role in the scenario. However, it is never easy to achieve comprehensive understanding as the data requires further processing before its values can be extracted and used. So how the data can be properly transformed into something useful for smart business development is exactly what we pursue in this study. As a pioneer, three major tasks are focused. First, a data mining engine based on the concept of the KID model is designed and developed to be responsible for the universal collection of data and mining valuable information which is primarily from real world, cyber world, and social world. Second, we go further into the fusion process of the collected data and meaningful information extracted and interpreted by algorithms or fused algorithms in the data mining engine (e.g., the consumer purchase data shared by real-world company) and turn them into valuable knowledge about the situation of customers and business situations based on the concept of knowledge, information, and data. A three-layer analysis and mining procedure is designed to enhance the mining engine through conventional RFM (Recency, Frequency, and Monetary Value) model and a set of fusion techniques. And in the end, we make planning-based predictions for a real-world company for expansion of the business interests.

[1]  I. Jolliffe Principal Component Analysis , 2002 .

[2]  Jay Lee,et al.  Smart products and service systems for e-business transformation , 2003, Int. J. Technol. Manag..

[3]  Jianhua Ma,et al.  Cyber-Individual Meets Brain Informatics , 2011, IEEE Intelligent Systems.

[4]  Simon Parsons,et al.  Addendum to "Current Approaches to Handling Imperfect Information in Data and Knowledge Bases" , 1996, IEEE Trans. Knowl. Data Eng..

[5]  Chaim Zins,et al.  Conceptual approaches for defining data, information, and knowledge , 2007, J. Assoc. Inf. Sci. Technol..

[6]  Izak Benbasat,et al.  E-Commerce Product Recommendation Agents: Use, Characteristics, and Impact , 2007, MIS Q..

[7]  Seyed Mohammad Seyedhosseini,et al.  Cluster analysis using data mining approach to develop CRM methodology to assess the customer loyalty , 2010, Expert Syst. Appl..

[8]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[9]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[10]  Sotiris B. Kotsiantis,et al.  Machine learning: a review of classification and combining techniques , 2006, Artificial Intelligence Review.

[11]  James Llinas,et al.  Handbook of Multisensor Data Fusion : Theory and Practice, Second Edition , 2008 .

[12]  Eric van Heck,et al.  Smart business networks: how the network wins , 2007, CACM.

[13]  Rajesh Parekh,et al.  Lessons and Challenges from Mining Retail E-Commerce Data , 2004, Machine Learning.

[14]  Barbara Wixom,et al.  The Current State of Business Intelligence , 2007, Computer.

[15]  Jianhua Ma,et al.  Smart Business Services via Consumer Purchasing Behaviour Modeling , 2013, 2013 IEEE International Conference on Green Computing and Communications and IEEE Internet of Things and IEEE Cyber, Physical and Social Computing.

[16]  Runhe Huang,et al.  A Generic Formulated KID Model for Pragmatic Processing of Data, Information, and Knowledge , 2015, 2015 IEEE 12th Intl Conf on Ubiquitous Intelligence and Computing and 2015 IEEE 12th Intl Conf on Autonomic and Trusted Computing and 2015 IEEE 15th Intl Conf on Scalable Computing and Communications and Its Associated Workshops (UIC-ATC-ScalCom).

[17]  Sanjiv Kumar,et al.  Google Correlate Whitepaper , 2011 .

[18]  W. Hoeffding A Class of Statistics with Asymptotically Normal Distribution , 1948 .

[19]  Jaana Kekäläinen,et al.  IR evaluation methods for retrieving highly relevant documents , 2000, SIGIR Forum.

[20]  Duen-Ren Liu,et al.  Integrating AHP and data mining for product recommendation based on customer lifetime value , 2005, Inf. Manag..

[21]  L. Moutinho Consumer Behaviour in Tourism , 1987 .

[22]  Markus Grünwald,et al.  Business Intelligence , 2009, Informatik-Spektrum.

[23]  Scott D. Johnson,et al.  Influences on consumer use of word-of-mouth recommendation sources , 1997 .

[24]  Veda C. Storey,et al.  Business Intelligence and Analytics: From Big Data to Big Impact , 2012, MIS Q..

[25]  Pin Luarn,et al.  Predicting consumer intention to use mobile service , 2006, Inf. Syst. J..

[26]  X. Zhu,et al.  iCARE: A framework for big data-based banking customer analytics , 2014, IBM J. Res. Dev..

[27]  Wu He,et al.  Internet of Things in Industries: A Survey , 2014, IEEE Transactions on Industrial Informatics.

[28]  Roger A. Sugden,et al.  Multiple Imputation for Nonresponse in Surveys , 1988 .

[29]  Efraim Turban,et al.  Business Intelligence: Second European Summer School, eBISS 2012, Brussels, Belgium, July 15-21, 2012, Tutorial Lectures , 2013 .

[30]  Ephraim R. McLean,et al.  Measuring e-Commerce Success: Applying the DeLone & McLean Information Systems Success Model , 2004, Int. J. Electron. Commer..

[31]  John Elder,et al.  Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications , 2012 .

[32]  James A. Narus,et al.  Business marketing: understand what customers value. , 1998, Harvard business review.

[33]  A. Kimber,et al.  Subset Selection in Regression , 1992 .

[34]  Klaus Pawelzik,et al.  Quantifying the neighborhood preservation of self-organizing feature maps , 1992, IEEE Trans. Neural Networks.

[35]  W. Scott Spangler,et al.  The integration of business intelligence and knowledge management , 2002, IBM Syst. J..

[36]  Susanne Rässler,et al.  Data Fusion: Identification Problems, Validity, and Multiple Imputation , 2004 .

[37]  Alan J. Miller Subset Selection in Regression , 1992 .

[38]  D. Boyd,et al.  CRITICAL QUESTIONS FOR BIG DATA , 2012 .

[39]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[40]  Wenbin Li,et al.  WaaS: Wisdom as a Service , 2014, IEEE Intell. Syst..

[41]  Wagner A. Kamakura,et al.  Statistical Data Fusion for Cross-Tabulation , 1997 .