Parallel Clustering for Data Mining in CRM

In modern business conditions, characterized by increasing globalization, uncertainty, risk, and competition, companies must struggle every day to maintain market share and achieve better business results. To do so, a company must always stay a step ahead of the competition: it must anticipate the needs of its clients and approach each client individually. This work addresses that goal. Because the volume of data involved is large, manual analysis is simply impossible. Analysis is instead left to specially developed programs, a new kind of technology whose goal is precisely to solve the problems faced in Business Intelligence. Business Intelligence (BI) refers to a broad set of applications and technologies for collecting data, accessing data, and analyzing data expertly in order to provide adequate support to the decision-making process. BI represents a family of products that includes data mining algorithms and data mining products for creating reports. This work discusses how to improve efficiency in this process. The M-Clustering algorithm conceived in this work provides a twofold, cluster-based solution to data mining: setting boundary limits during filtering, and processing historical data. A training dataset is defined by filtering on various attributes and fields from the given classification set. Data processing is then carried out on this training dataset to obtain the expected result, which is evaluated either for processing the actual dataset or for further execution to prepare a provisional trained dataset. This work presents a high-level view of the proposed system along with its processing steps. It also covers an experimental evaluation carried out with a customized implementation of the algorithm in the WEKA tool, comparing its processing efficiency on the experimental data with that of k-means.
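The general workflow outlined above (filter records by boundary limits to form a training set, cluster the training set, then process the actual dataset against the result) can be illustrated with a minimal sketch. This is not the M-Clustering algorithm itself, whose details are not given here; it uses plain k-means, the baseline named in the evaluation, and the customer records, the `limits` mapping, and all function names are hypothetical assumptions introduced only for illustration.

```python
import random

def within_limits(record, limits):
    """True if every attribute listed in `limits` lies inside its [low, high] bounds."""
    return all(low <= record[i] <= high for i, (low, high) in limits.items())

def nearest(point, centroids):
    """Index of the centroid closest to `point` (squared Euclidean distance)."""
    return min(range(len(centroids)),
               key=lambda c: sum((p - q) ** 2 for p, q in zip(point, centroids[c])))

def kmeans(points, k, iterations=20, seed=0):
    """Plain Lloyd's k-means (the baseline, not M-Clustering); returns the centroids."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centroids)].append(p)
        for c, members in enumerate(groups):
            if members:  # keep the previous centroid if a cluster is empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids

# Hypothetical customer records: (annual spend, visits per month)
records = [(120, 2), (130, 3), (125, 2), (900, 12), (950, 11), (920, 13), (5000, 1)]

# Assumed boundary limits: attribute index -> (low, high)
limits = {0: (0, 1000)}

# Step 1: filter to build the training dataset
training = [r for r in records if within_limits(r, limits)]

# Step 2: cluster the training dataset
centroids = kmeans(training, k=2)

# Step 3: assign every record of the actual dataset to its nearest learned centroid
labels = [nearest(r, centroids) for r in records]
```

In this sketch the outlying record is excluded from training by the boundary limit but still receives a cluster label in the final step, which is one plausible reading of separating training-set preparation from processing of the actual dataset.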
