A Practical Evaluation of Dynamic Time Warping in Financial Time Series Clustering

Selecting the portfolio of huge stocks is the main issue for investors to reduce a risk investment. This problem could be solved by employing a clustering as a data mining task. The technique groups together the stocks with less distance. A method to measure those dissimilarities is crucial to produce a good clustering quality. This paper is to confirm the improved distance method, dynamic time warping (DTW), stated as powerful compared to the classical way, Euclidean. Then, hierarchical clustering (HC) was used to examine the sequences from two sources of the dataset. It includes 25 stocks indexed by Indonesia Sharia Stock Index (ISSI) and 30 stocks indexed by the Jakarta Islamic Index (JII) during the year 2000-2020. The effectiveness of the distance method in clustering was evaluated by calculating the Silhouette index and running time. According to the value of the average Silhouette index, DTW-based HC gave a higher output which was not significantly different from Euclidean-based HC. In the other hand, the type of dataset contributes significantly. The Silhouette Index for JII dataset (homogeneous) is better than that for ISSI (heterogeneous). Therefore, it prefers to use similarity method with faster processing time to cluster the data if its quality is merely similar to any similarity methods.

[1]  Eamonn J. Keogh,et al.  On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration , 2002, Data Mining and Knowledge Discovery.

[2]  Eric R. Ziegel,et al.  Analysis of Financial Time Series , 2002, Technometrics.

[3]  Daniel T. Larose,et al.  Data Mining and Predictive Analytics , 2015 .

[4]  Xiaozhe Wang,et al.  Characteristic-Based Clustering for Time Series Data , 2006, Data Mining and Knowledge Discovery.

[5]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[6]  Ying Wah Teh,et al.  Time-series clustering - A decade review , 2015, Inf. Syst..

[7]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[8]  N. Powell,et al.  Supervised and Unsupervised Methods for Stock Trend Forecasting , 2008 .

[9]  Dimitrios Gunopulos,et al.  Iterative Incremental Clustering of Time Series , 2004, EDBT.

[10]  Philip Chan,et al.  Toward accurate dynamic time warping in linear time and space , 2007, Intell. Data Anal..

[11]  T. Mathew,et al.  Clustering and Regression Techniques for Stock Prediction , 2016 .

[12]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[13]  Sachin P. Patel,et al.  Euclidean distance based feature ranking and subset selection for bearing fault diagnosis , 2020, Expert Syst. Appl..

[14]  Eamonn J. Keogh,et al.  Scaling up dynamic time warping for datamining applications , 2000, KDD '00.

[15]  M. K. Tiwari,et al.  Clustering Indian stock market data for portfolio management , 2010, Expert Syst. Appl..

[16]  Jan Korbel,et al.  Modeling Financial Time Series , 2013 .

[17]  Chi-Ming Chen,et al.  Visualizing the clustering of financial networks and profitability of stocks , 2015, J. Complex Networks.

[18]  Walid G. Aref,et al.  Incremental, online, and merge mining of partial periodic patterns in time-series databases , 2004, IEEE Transactions on Knowledge and Data Engineering.

[19]  Yuliang Shi,et al.  Time Series Clustering Based on Dynamic Time Warping , 2018, 2018 IEEE 9th International Conference on Software Engineering and Service Science (ICSESS).

[20]  Mitsuyoshi Imamura,et al.  Stock Price Prediction using k-Medoids Clustering with Indexing Dynamic Time Warping , 2018, IEEJ Transactions on Electronics Information and Systems.

[21]  Witold Pedrycz,et al.  Clustering of interval-valued time series of unequal length based on improved dynamic time warping , 2019, Expert Syst. Appl..

[22]  Yanru Guo Stock Trading Based on Principal Component Analysis and Clustering Analysis , 2020 .

[23]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[24]  Indranil Ghosh,et al.  Using Clustering Method to Understand Indian Stock Market Volatility , 2016, ArXiv.

[25]  Prodromos Tsinaslanidis,et al.  Subsequence dynamic time warping for charting: Bullish and bearish class predictions for NYSE stocks , 2018, Expert Syst. Appl..

[26]  Vladimir Filimonov,et al.  On the Modeling of Financial Time Series , 2015 .

[27]  Kyoji Kawagoe,et al.  New Time Series Data Representation ESAX for Financial Applications , 2006, 22nd International Conference on Data Engineering Workshops (ICDEW'06).

[28]  Geeta Sikka,et al.  Recent Techniques of Clustering of Time Series Data: A Survey , 2012 .

[29]  Eamonn J. Keogh,et al.  Three Myths about Dynamic Time Warping Data Mining , 2005, SDM.

[30]  Anil K. Jain,et al.  A self-organizing network for hyperellipsoidal clustering (HEC) , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[31]  P. Nagabhushan,et al.  Time Series as a Point - A Novel Approach for Time Series Cluster Visualization , 2006, DMIN.