Reinforcement learning based distance metric filtering approach in clustering

Conventional feature selection methods may not provide sufficient means to deal with the diverse growing amount of data nowadays. Evolutionary Distance Metric Learning (EDML) relies on an evolutionary approach in its distance metric learning process; this process in case of diagonal EDML can be viewed as an embedded feature weighting one. However, such process is done simultaneously on all features and does not explicitly select the features. This paper introduces a new hybrid system R-EDML, combining the sequential decision making of Reinforcement Learning (RL) with the evolutionary feature prioritizing process of EDML in clustering. The goal is to create a feature selection control strategy that aims to optimize the input space by reducing the number of selected features while maintaining the clustering performance. This can lead to future data collection time and cost reduction. In the proposed method, features represented by the elements of EDML distance transformation matrices are prioritized by a differential evolution algorithm. Then a selection control strategy using reinforcement learning is learned by sequentially inserting and evaluating the prioritized elements. This process is repeated with the aim to optimize the matrices by filtering the elements used in them. The outcome is the selection of the best R-EDML generation matrices with the least number of elements possible. R-EDML was compared to normal EDML in terms of feature selection and accuracy. Results show a decrease in the number of features compared to EDML, while maintaining a similar accuracy level.

[1]  福井 健一,et al.  Semi-supervised Evolutionary Distance Metric Learning for Clustering , 2015 .

[2]  Yingjie Tian,et al.  A Comprehensive Survey of Clustering Algorithms , 2015, Annals of Data Science.

[3]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[4]  Fei Wang,et al.  Survey on distance metric learning and dimensionality reduction in data mining , 2014, Data Mining and Knowledge Discovery.

[5]  Ludovic Denoyer,et al.  Datum-Wise Classification: A Sequential Approach to Sparsity , 2011, ECML/PKDD.

[6]  Janez Brest,et al.  Self-Adapting Control Parameters in Differential Evolution: A Comparative Study on Numerical Benchmark Problems , 2006, IEEE Transactions on Evolutionary Computation.

[7]  Michael I. Jordan,et al.  Distance Metric Learning with Application to Clustering with Side-Information , 2002, NIPS.

[8]  Thomas Frank Rückstieß,et al.  Reinforcement Learning in Supervised Problem Domains , 2016 .

[9]  Tomer Hertz,et al.  Boosting margin based distance functions for clustering , 2004, ICML.

[10]  Majid Nili Ahmadabadi,et al.  Attention control with reinforcement learning for face recognition under partial occlusion , 2011, Machine Vision and Applications.

[11]  Masayuki Numao,et al.  Evolutionary Distance Metric Learning Approach to Semi-supervised Clustering with Neighbor Relations , 2013, 2013 IEEE 25th International Conference on Tools with Artificial Intelligence.

[12]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[13]  Masayuki Numao,et al.  Neighborhood-Based Smoothing of External Cluster Validity Measures , 2012, PAKDD.