Clustering Method Study on High-Dimensional Trading Data

Existing clustering algorithms are not designed specially for the features of trading data s and most clustering analyses lack scalability for large-scale transactions. Therefore, a rapid and scalable clustering algorithm using little space is proposed by us, to effectively process high-dimensional trading data without setting parameters manually. The improved method introduces weighted coverage density as similarity metrics of data. On this basis, the clustering criterion function is established for clustering analysis. We assume further implementation is to find association rules in clustering rules. Then two transaction-oriented evaluation measures for clustering quality are put forward. The large item size ratio is based on the concept of big data, which is used to measure the percentage in clustering; the average pair-clusters merging index is adopted to indicate the difference among clustering results with coverage density. The experimental results of artificial data and real data sets have shown that the improved method for clustering analysis can generate high-grade clustering results on most of the experimental data sets, compared to traditional algorithms

[1]  J. K. Yates Construction Decision Support System for Delay Analysis , 1993 .

[2]  Qinping Zhao,et al.  A survey on virtual reality , 2009, Science in China Series F: Information Sciences.

[3]  Efendi N. Nasibov,et al.  Comparative clustering analysis of bispectral index series of brain activity , 2010, Expert Syst. Appl..

[4]  Rafal A. Angryk,et al.  GDClust: A Graph-Based Document Clustering Technique , 2007 .

[5]  HalkidiMaria,et al.  Cluster validity methods , 2002 .

[6]  Liang Kai-jian A New Algorithm of Mining Exceptional Association Pattern , 2005 .

[7]  NI Zhi-we Effective algorithm to cluster customers'actions , 2010 .

[8]  Michalis Vazirgiannis,et al.  Cluster validity methods: part I , 2002, SGMD.

[9]  Zhi Chao Ma,et al.  Based on the Method of Fuzzy Clustering Analysis of the Smartphone Product Market , 2012 .

[10]  Chen Xue-jin Research of Cluster Analysis in Data Mining , 2006 .

[11]  Minia Manteiga,et al.  Hierarchical Clustering Analysis with SOM Networks , 2010 .

[12]  Phuc Do,et al.  Applying Data Mining in Money Laundering Detection for the Vietnamese Banking Industry , 2012, ACIIDS.

[13]  Xin-Xin Weng,et al.  [Rapid determination of hypoglycemic tablets by handheld Raman spectrometer and KPCA-clustering analysis]. , 2010, Guang pu xue yu guang pu fen xi = Guang pu.

[14]  K. Thangavel,et al.  Evaluation of socio-economic patterns of SHG members in Kerala using clustering analysis , 2012 .

[15]  Yupin Luo,et al.  Saliency Detection by Selective Strategy for Salient Object Segmentation , 2012, J. Multim..

[16]  William Marsh,et al.  Decision support system for Warfarin therapy management using Bayesian networks , 2013, Decis. Support Syst..

[17]  Haiqiao Huang,et al.  A robust adaptive clustering analysis method for automatic identification of clusters , 2012, Pattern Recognit..

[18]  A. Monreal-Ibero,et al.  A study of the interplay between ionized gas and star clusters in the central region of NGC 5253 with 2D spectroscopy , 2010, 1003.5329.

[19]  Mohammed Yakoob Siyal,et al.  Overcoming the ill-balanced data problem in functional MRI clustering analysis , 2009, 2009 7th International Conference on Information, Communications and Signal Processing (ICICS).