Clustering Stock Market Companies via K- Means Algorithm

One of the main concepts in pattern recognition is clustering. This technique is used as important knowledge discovery tools in modern machine learning process. Clustering of high-performance companies is very important not only for investors, but also for the creditors, financial creditors, stockholders, etc. Hence, firms’ clustering is considered as one of the important issues in Tehran Stock Exchange (TSE). To this end, we have used financial statement data of three industries in TSE for the year 2012. After selecting profit criteria (attributes) and prioritizing them using AHP, k means clustering algorithm is used to classify these companies. Also, to obtain the optimal number of clusters, different validity measures are presented. The identification of clusters of companies of TSE can be exploited to improve planning and get to more comprehensive decision making about companies.

[1]  Jianhong Wu,et al.  Data clustering - theory, algorithms, and applications , 2007 .

[2]  M. K. Tiwari,et al.  Clustering Indian stock market data for portfolio management , 2010, Expert Syst. Appl..

[3]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Sushil Kumar,et al.  Analytic hierarchy process: An overview of applications , 2006, Eur. J. Oper. Res..

[5]  Michalis Vazirgiannis,et al.  Clustering validity checking methods: part II , 2002, SGMD.

[6]  Stephanie Boehm,et al.  Applied Multivariate Techniques , 2016 .

[7]  Michael Q. Zhang,et al.  Evaluation and comparison of clustering algorithms in analyzing es cell gene expression data , 2002 .

[8]  Ujjwal Maulik,et al.  Performance Evaluation of Some Clustering Algorithms and Validity Indices , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Guojun Gan,et al.  Data Clustering: Theory, Algorithms, and Applications (ASA-SIAM Series on Statistics and Applied Probability) , 2007 .

[10]  Kainaz Bomi Sherdiwala Data Mining Techniques in Stock Market , 2011 .

[11]  Ricardo J. G. B. Campello,et al.  Relative clustering validity criteria: A comparative overview , 2010, Stat. Anal. Data Min..

[12]  Roberto Bellotti,et al.  Clustering stock market companies via chaotic map synchronization , 2005 .

[13]  Michalis Vazirgiannis,et al.  Clustering algorithms and validity measures , 2001, Proceedings Thirteenth International Conference on Scientific and Statistical Database Management. SSDBM 2001.

[14]  Thomas L. Saaty,et al.  Models, Methods, Concepts & Applications of the Analytic Hierarchy Process , 2012 .

[15]  So Young Sohn,et al.  Segmentation of stock trading customers according to potential value , 2004, Expert Syst. Appl..

[16]  Ujjwal Maulik,et al.  Validity index for crisp and fuzzy clusters , 2004, Pattern Recognit..

[17]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..

[18]  M. Suresh Babu,et al.  Clustering Approach to Stock Market Prediction , 2011 .

[19]  Hui Xiong,et al.  Understanding of Internal Clustering Validation Measures , 2010, 2010 IEEE International Conference on Data Mining.

[20]  K. Doherty,et al.  Hierarchical topological clustering learns stock market sectors , 2005, 2005 ICSC Congress on Computational Intelligence Methods and Applications.

[21]  James C. Bezdek,et al.  Some new indexes of cluster validity , 1998, IEEE Trans. Syst. Man Cybern. Part B.