Visual Exploration of Feature-Class Matrices for Classification Problems

When a classification algorithm does not work on a data set, it is a non-trivial problem to figure out what went wrong on a technical level. It is even more challenging to communicate findings to domain experts who can interpret the data set but do not understand the algorithms. We propose a method for the interactive visual exploration of the feature-class matrix used to represent data sets for classification purposes. This method combines a novel matrix reordering algorithm revealing patterns of interest with an interactive visualization application. It facilitates the investigation of feature-class matrices and the identification of reasons for failure or success of a classifier on the feature level. We discuss results obtained by applying the method to the Reuters text collection.

[1]  Innar Liiv,et al.  Seriation and matrix reordering methods: An historical overview , 2010, Stat. Anal. Data Min..

[2]  Jacques Bertin,et al.  Matrix theory of graphics , 2001 .

[3]  Erkki Mäkinen,et al.  Reordering the Reorderable Matrix as an Algorithmic Problem , 2000, Diagrams.

[4]  Jacques Bertin,et al.  Graphics and graphic information-processing , 1981 .

[5]  Erkki Mäkinen,et al.  Constructing and Reconstructing the Reorderable Matrix , 2005, Inf. Vis..

[6]  Yiming Yang,et al.  Text categorization , 2008, Scholarpedia.

[7]  Jean-Daniel Fekete,et al.  ZAME: Interactive Large-Scale Graph Visualization , 2008, 2008 IEEE Pacific Visualization Symposium.

[8]  Salim Hariri,et al.  A new dependency and correlation analysis for features , 2005, IEEE Transactions on Knowledge and Data Engineering.

[9]  Erkki Mäkinen,et al.  The Barycenter Heuristic and the Reorderable Matrix , 2005, Informatica.

[10]  Ramana Rao,et al.  The table lens: merging graphical and symbolic representations in an interactive focus + context visualization for tabular information , 1994, CHI '94.

[11]  Jean-Daniel Fekete,et al.  MatrixExplorer: a Dual-Representation System to Explore Social Networks , 2006, IEEE Transactions on Visualization and Computer Graphics.

[12]  Harri Siirtola,et al.  Interaction with the Reorderable Matrix , 1999, 1999 IEEE International Conference on Information Visualization (Cat. No. PR00210).