Building an EDA Assistant: A Progress Report

Since 1993 we have been working on a system to help people with exploratory data analysis (EDA). AIDE, an Assistant for Intelligent Data Exploration, is a knowledge-based planning system that incrementally explores a dataset, guided by user directives and its own evaluation of indications in the data. Its plan library contains strategies for generating and interpreting indications in data, selecting techniques to build appropriate descriptions of data, carrying out relevant procedures, and combining individual results into a coherent larger picture. The system is mixed-initiative: it autonomously pursues high- and low-level goals while still allowing the user to inform or override its decisions. Elsewhere we have described AIDE's operations and primitive data structures [22], its planning representation [23], its user interface [25, 24], and the system as a whole [21]. This progress report discusses a recent evaluation we conducted with AIDE and explains why we believe that this line of research is important to AI and statistics researchers. We begin with a very brief overview of the system. The bulk of the paper describes the evaluation, our analysis of the results, and the lessons we learned from building and evaluating AIDE. We end with a discussion of the generality of our results and the potential for future work.

[1] Frederick Mosteller, et al. Data Analysis and Regression, 1978.

[2] D. F. Andrews, et al. Data: a collection of problems from many fields for the student and research worker, 1985.

[3] David J. Hand, et al. Patterns in statistical strategy, 1986.

[4] William A. Gale, et al. REX review, 1986.

[5] John W. Tukey, et al. An alphabet for statisticians' expert systems, 1986.

[6] Daryl Pregibon. A DIY guide to statistical strategy, 1986.

[7] Jürg Nievergelt, et al. Sites, modes, and trails: Telling the user of an interactive system where he is, what he can do, and how to get to places (excerpt), 1987.

[8] David J. Hand, et al. A Statistical Knowledge Enhancement System, 1987.

[9] William B. Rouse, et al. An Architecture for Intelligent Interfaces: Outline of an Approach to Supporting Operators of Complex Systems, 1987, Hum. Comput. Interact.

[10] Eric R. Ziegel, et al. Data: A Collection of Problems From Many Fields for the Student and Research Worker, 1987.

[11] M. Braga, et al. Exploratory Data Analysis, 2018, Encyclopedia of Social Network Analysis and Mining, 2nd Ed.

[12] W. Rouse, et al. An Architecture for Intelligent Interfaces: Outline of an Approach to Supporting Operators of Complex Systems, 1987, SGCH.

[13] Daryl Pregibon, et al. Data analysis as search, 1988.

[14] Jakob Nielsen, et al. Hypertext and hypermedia, 1990.

[15] Loren Terveen, et al. Intelligent Systems as Cooperative Systems, 1993.

[16] William A. Gale, et al. Statistical Applications of Artificial Intelligence, 1993.

[17] Hanga C. Galfalvy, et al. Omega-Stat: An Environment for Implementing Intelligent Modeling Strategies, 1995, AISTATS.

[18] Loren G. Terveen, et al. Overview of human-computer collaboration, 1995, Knowl. Based Syst.

[19] David L. Westbrook, et al. A Toolbox for Analyzing Programs, 1995, Int. J. Artif. Intell. Tools.

[20] S.J.J. Smith, et al. Empirical Methods for Artificial Intelligence, 1995.

[21] Paul R. Cohen, et al. Control Representation in an EDA Assistant, 1995, AISTATS.

[22] Paul R. Cohen, et al. A Mixed-Initiative Planning Approach to Exploratory Data Analysis, 1996.

[23] Paul R. Cohen, et al. A Planner for Exploratory Data Analysis, 1997, AIPS.

[24] Paul R. Cohen, et al. Interaction with a mixed-initiative system for exploratory data analysis, 1997, IUI '97.

[25] Paul R. Cohen, et al. Evaluation of a semi-autonomous assistant for exploratory data analysis, 1997, AGENTS '97.