Comparative Analysis of Dimensionality Reduction Techniques

Datasets are most important for performing all the type of data mining tasks. Every dataset has many numbers of attributes and instances. Dimensionality reduction (DR) is one of the preprocessing steps which is used to reduce the dimensions (attributes or features) without losing the data. There are two divisions of reduction they are feature extraction and feature reduction. Feature extraction is the process of decomposition of attributes of the original data (i.e.) merging the attributes of the data Feature selection is the process of selecting the subset of attributes by eliminating features with little or no predictive information. Feature extraction techniques are more adequate than the feature selection. Reduction is done to the larger dataset to decrease the curse of dimensionality. The main objective of this paper is to provide a systematic comparative analysis on feature reduction algorithms such as PCA, LDA and FA to medical dataset (Thyroid, Oesophagal).The performance factor considered are number of attributes reduced and time is observed.

[1]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[2]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[3]  I K Fodor,et al.  A Survey of Dimension Reduction Techniques , 2002 .

[4]  Christopher J. C. Burges,et al.  Geometric Methods for Feature Extraction and Dimensional Reduction , 2005 .

[5]  Anil K. Jain,et al.  Dimensionality reduction using genetic algorithms , 2000, IEEE Trans. Evol. Comput..

[6]  S. Vijayarani Partitioning Clustering Algorithms for DataStream Outlier Detection , 2014 .

[7]  Aapo Hyvärinen,et al.  Survey on Independent Component Analysis , 1999 .

[8]  Kilian Q. Weinberger,et al.  Spectral Methods for Dimensionality Reduction , 2006, Semi-Supervised Learning.

[9]  Erkki Oja,et al.  Independent component analysis: algorithms and applications , 2000, Neural Networks.

[10]  Rosina Kharal,et al.  Semidefinite Embedding for the Dimensionality Reduction of DNA Microarray Data , 2006 .

[11]  Miguel Á. Carreira-Perpiñán,et al.  A Review of Dimension Reduction Techniques , 2009 .

[12]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[13]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[14]  Iryna Gurevych,et al.  A Comparative Study of Feature Extraction Algorithms in Customer Reviews , 2008, 2008 IEEE International Conference on Semantic Computing.