Knowledge Visualization Techniques for Machine Learning

Researchers in machine learning primarily use decision trees, production rules, and decision graphs for visualizing classification data, with the graphic form in which a structure is portrayed as having a strong influence on comprehensibility. We analyze the questions that, in our experience, end users of machine learning tend to ask of the structures inferred from their empirical data. By mapping these questions onto visualization tasks, we have created new graphical representations that show the flow of examples through a decision structure. These knowledge visualization techniques are particularly appropriate in helping to answer the questions that users typically ask, and we describe their use in discovering new properties of a data set. In the case of decision trees, an automated software tool has been developed to construct the visualizations.

[1]  R. P. Halverson An empirical investigation comparing IF-THEN rules and decision tables for programming rule-based expert systems , 1993, [1993] Proceedings of the Twenty-sixth Hawaii International Conference on System Sciences.

[2]  Stephen M. Casner,et al.  Task-analytic approach to the automated design of graphic presentations , 1991, TOGS.

[3]  H. Stone Discrete Mathematical Structures and Their Applications , 1973 .

[4]  Art Lew Decision tables for general‐purpose scientific programming , 1983, Softw. Pract. Exp..

[5]  John J. Bertin,et al.  The semiology of graphics , 1983 .

[6]  Steven F. Roth,et al.  Data characterization for intelligent graphics presentation , 1990, CHI '90.

[7]  Rik Maes,et al.  On the Role of Ambiguity and Incompleteness in the Design of Decision Tables and Rule-Based Systems , 1988, Comput. J..

[8]  Ron Kohavi,et al.  The Power of Decision Tables , 1995, ECML.

[9]  John B. Goodenough,et al.  Toward a theory of test data selection , 1975 .

[10]  Mark D. Apperley,et al.  E3: Towards the Metrication of Graphical Presentation Techniques for Large Data Sets , 1993, EWHCI.

[11]  John A. Sparrow,et al.  Graphical displays in information systems: some data properties influencing the effectiveness of alternative forms , 1989 .

[12]  Richard A. Becker,et al.  Brushing scatterplots , 1987 .

[13]  Matthew C. Humphrey A graphical notation for the design of information visualizations , 1999, Int. J. Hum. Comput. Stud..

[14]  Art Lew,et al.  Proof of Correctness of Decision Table Programs , 1984, Comput. J..

[15]  Girish H. Subramanian,et al.  A comparison of the decision table and tree , 1992, CACM.

[16]  Jason Catlett,et al.  Rule Induction as Exploratory Data Analysis , 1995, AISTATS.