A new Unsupervised Clustering-based Feature Extraction Method

In manipulating data such as in supervised or unsupervised learning, we need to extract new features from the original features for the purpose of reducing the dimension of feature space and achieving better performance. In this paper, we investigate a novel schema for unsupervised feature extraction for classification problems. We based our method on clustering to achieve feature extraction. A new similarity measure based on trend analysis is devised to identify redundant information in the data. Clustering is then performed on the feature space. Once groups of similar features are formed, linear transformation is realized to extract a new set of features. The simulation results on classification problems for experimental data sets from UCI machine learning repository and face recognition problem show that the proposed method is effective in almost cases when compared to conventional unsupervised methods like PCA and ICA. General Terms Pattern Recognition, Machine Learning, Feature Extraction, and Data mining.

[1]  Kilian Q. Weinberger,et al.  Spectral Methods for Dimensionality Reduction , 2006, Semi-Supervised Learning.

[2]  Deniz Erdogmus,et al.  Feature extraction using information-theoretic learning , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Yvan Saeys,et al.  Feature Extraction Using Clustering of Protein , 2006, CIARP.

[4]  Kari Torkkola,et al.  Feature Extraction by Non-Parametric Mutual Information Maximization , 2003, J. Mach. Learn. Res..

[5]  Brian D. Ripley,et al.  Pattern Recognition and Neural Networks , 1996 .

[6]  Jun Liu,et al.  A Semi-Supervised Relief Based Feature Extraction Algorithm , 2008, 2008 Second International Conference on Future Generation Communication and Networking Symposia.

[7]  Chong-Ho Choi,et al.  Feature Extraction Based on ICA for Binary Classification Problems , 2003, IEEE Trans. Knowl. Data Eng..

[8]  Andrew McCallum,et al.  Distributional clustering of words for text classification , 1998, SIGIR '98.

[9]  Xiaoli Z. Fern,et al.  Cluster Ensembles for High Dimensional Clustering: An Empirical Study , 2006 .

[10]  Yoshua Bengio,et al.  Pattern Recognition and Neural Networks , 1995 .

[11]  Ulrike von Luxburg,et al.  Consistent Minimization of Clustering Objective Functions , 2007, NIPS.

[12]  Naftali Tishby,et al.  The Power of Word Clusters for Text Classification , 2006 .

[13]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[14]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[15]  Terry R. Payne,et al.  Implicit Feature Selection with the Value Difference Metric , 1998, ECAI.

[16]  Sylvie Charbonnier,et al.  A trend-based alarm system to improve patient monitoring in intensive care units , 2007 .