An Improved Affinity Propagation Clustering Algorithm Based on Entropy Weight Method and Principal Component Analysis

Traditional affinity propagation algorithm has inefficient results when conducting clustering analysis of high dimensional data because "dimension effect" lead to difficult find the proper class structure .In view of this, the author proposes an improved algorithm on the basis of Entropy Weight Method and Principal Component Analysis (EWPCA-AP). EWPCA-AP algorithm empowers the sample data by Entropy Weight Method, eliminate data irrelevant attributes by Principal Component Analysis, and travel with neighbor clustering algorithm, realization of high-dimensional data clustering in low dimension space. The numerical result of simulation experiment shows that the new EWPCA-AP algorithm can effectively eliminate the redundancy and irrelevant attributes of data and improve the performance of clustering. In addition, the proposed algorithm is applied in the area of the economy in our country and the clustering result is consistent with the real one. This algorithm provides a new intelligent evaluation method for Chinese economy.

[1]  Age K. Smilde,et al.  Principal Component Analysis , 2003, Encyclopedia of Machine Learning.

[2]  Mathias Drehmann,et al.  The Credit-to-GDP Gap and Countercyclical Capital Buffers: Questions and Answers , 2014 .

[3]  Shen Chunhui,et al.  Distributed Affinity Propagation Clustering Based on MapReduce , 2012 .

[4]  V. K. Bhuvaneswari,et al.  A Comparative Study of Various Clustering Algorithms in Data Mining , 2014 .

[5]  Li Shaomei,et al.  Parallel Affinity Propagation Clustering Algorithm Based on Hybrid Measure , 2013 .

[6]  Li Xiaobo The space evolution research of economic competition in Golden Delta Counties of the Yellow River , 2014 .

[7]  Yanfu Zhang TOPSIS Method Based on Entropy Weight for Supplier Evaluation of Power Grid Enterprise , 2015 .

[8]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[9]  Zongming Ma Sparse Principal Component Analysis and Iterative Thresholding , 2011, 1112.2432.

[10]  Darong Luo,et al.  New Uncertainty Measure of Rough Fuzzy Sets and Entropy Weight Method for Fuzzy-Target Decision-Making Tables , 2014, J. Appl. Math..

[11]  Zhang Jin-song Research on K-means algorithm based on density , 2011 .

[12]  Lin Fan,et al.  An Efficient Clustering Algorithm Based on Local Optimality of K -Means: An Efficient Clustering Algorithm Based on Local Optimality of K -Means , 2008 .

[13]  I. Johnstone,et al.  Augmented sparse principal component analysis for high dimensional data , 2012, 1202.1242.

[14]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[15]  Manish Verma,et al.  A Comparative Study of Various Clustering Algorithms in Data Mining , 2012 .

[16]  Yang He Feature Selection and Sample Classification for SELDI-TOF Mass Spectrometry Data Based on Affinity Propagation Clustering , 2013 .

[17]  Maoguo Gong,et al.  Fuzzy C-Means Clustering With Local Information and Kernel Metric for Image Segmentation , 2013, IEEE Transactions on Image Processing.

[18]  Michael R. Anderberg,et al.  Cluster Analysis for Applications , 1973 .