Parallel Rough Set: Dimensionality Reduction and Feature Discovery of Multi-Dimensional Data in Visualization

Attempts to visualize high-dimensional datasets typically encounter overplotting and a decline in visual comprehension, which make knowledge discovery and feature subset analysis difficult. Reshaping the datasets with a dimensionality reduction technique that removes superfluous attributes is therefore essential to improving visual analytics. In this work, we apply rough set theory as a dimensionality reduction and feature selection method for visualization, to facilitate knowledge discovery in multi-dimensional datasets. We present a case study using real datasets and a comparison against other methods to demonstrate the effectiveness of our approach.
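
To make the idea of rough-set attribute reduction concrete, the following is a minimal sketch of a greedy reduct search driven by the rough-set dependency degree (the fraction of objects whose condition-attribute values determine the decision unambiguously). It is an illustration under stated assumptions, not the procedure used in this work: the function names (`partition`, `dependency`, `quick_reduct`), the greedy forward-selection strategy, and the toy decision table are all hypothetical.

```python
from collections import defaultdict


def partition(rows, attrs):
    """Group row indices into equivalence classes by their values on attrs."""
    blocks = defaultdict(list)
    for i, row in enumerate(rows):
        blocks[tuple(row[a] for a in attrs)].append(i)
    return list(blocks.values())


def dependency(rows, attrs, decision):
    """Share of rows in the positive region: rows whose equivalence class
    under attrs carries a single decision value."""
    if not rows:
        return 0.0
    positive = 0
    for block in partition(rows, attrs):
        if len({rows[i][decision] for i in block}) == 1:
            positive += len(block)
    return positive / len(rows)


def quick_reduct(rows, conditions, decision):
    """Greedy forward selection (an assumed strategy): keep adding the
    attribute that raises the dependency degree most, until the subset is
    as informative as the full set of condition attributes."""
    target = dependency(rows, conditions, decision)
    reduct = []
    while dependency(rows, reduct, decision) < target:
        candidates = [a for a in conditions if a not in reduct]
        best = max(candidates,
                   key=lambda a: dependency(rows, reduct + [a], decision))
        reduct.append(best)
    return reduct


# Hypothetical toy decision table: attribute "b" alone determines "d".
toy_rows = [
    {"a": 0, "b": 0, "c": 1, "d": "no"},
    {"a": 0, "b": 1, "c": 1, "d": "yes"},
    {"a": 1, "b": 0, "c": 0, "d": "no"},
    {"a": 1, "b": 1, "c": 0, "d": "yes"},
]
selected = quick_reduct(toy_rows, ["a", "b", "c"], "d")
```

In a visualization setting, only the attributes in `selected` would be kept as axes (for example in a parallel coordinates plot), which is how removing superfluous attributes can reduce overplotting. A greedy search of this kind yields a subset that preserves the dependency degree but is not guaranteed to be a minimal reduct.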
