On the Use of Perceptual Cues & Data Mining for Effective Visualization of Scientific Datasets

Scientific datasets are often difficult to analyse or visualize, due to their large size and high dimensionality. We propose a twostep approach to address this problem. We begin by using data mining algorithms to identify areas of interest within the dataset. This allows us to reduce a dataset’s size and dimensionality, and to estimate missing values or correct erroneous entries. We display the results of the data mining step using visualization techniques based on perceptual cues. Our visualization tools are designed to exploit the power of the low-level human visual system. The result is a set of displays that allow users to perform rapid and accurate exploratory data analysis. In order to demonstrate our techniques, we visualized an environmental dataset being used to model salmon growth and migration patterns. Data mining was used to identify significant attributes and to provide accurate estimates of plankton density. We used colour and texture to visualize the significant attributes and estimated plankton densities for each month for the years 1956 to 1964. Experiments run in our laboratory showed that the colours and textures we chose support rapid and accurate element identification, boundary detection, region tracking, and estimation. The result is a visualization tool that allows users to quickly locate specific plankton densities and the boundaries they form. Users can compare plankton densities to other environmental conditions like sea surface temperature and current strength. Finally, users can track changes in any of the dataset’s attributes on a monthly or yearly basis. CR Categories: H.5.2 [Information Interfaces and Presentation]: User Interfaces—ergonomics, screen design, theory and methods; I.3.6 [Computer Graphics]: Methodology and Techniques— ergonomics, interaction techniques; J.2 [Physical Sciences and Engineering]: Earth and Atmospheric Sciences

[1]  Chen Li Knowledge Discovery in Database , 1999 .

[2]  Paul R. Cohen,et al.  Intelligent Support for Exploratory Data Analysis , 1998 .

[3]  B Julesz,et al.  Experiments in the visual perception of texture. , 1975, Scientific American.

[4]  Johanna D. Moore,et al.  Decomposition and Causality in Partial-order Planning , 1994, AIPS.

[5]  James T. Enns,et al.  Visualizing real-time multivariate data using preattentive processing , 1995, TOMC.

[6]  Hideyuki Tamura,et al.  Textural Features Corresponding to Visual Perception , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[7]  Georges G. Grinstein,et al.  Iconographic Displays For Visualizing Multidimensional Data , 1988, Proceedings of the 1988 IEEE International Conference on Systems, Man, and Cybernetics.

[8]  Robert St. Amant,et al.  User Interface Affordances in a Planning Representation , 1999, Hum. Comput. Interact..

[9]  Fang Liu,et al.  Periodicity, directionality, and randomness: Wold features for perceptual pattern recognition , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[10]  Tomasz Imielinski,et al.  An Interval Classifier for Database Mining Applications , 1992, VLDB.

[11]  Thomas A. DeFanti,et al.  Visualization in Scientific Computing-A Synopsis , 1987, IEEE Computer Graphics and Applications.

[12]  Paul H. Smith,et al.  Data and Visualization Corridors: Report on the 1998 DVC Workshop Series , 1998 .

[13]  Johanna D. Moore,et al.  Towards a Principled Representation of Discourse Plans , 1994, Proceedings of the Sixteenth Annual Conference of the Cognitive Science Society.

[14]  James D. Foley,et al.  Effective software systems for scientific data visualization , 1989, SIGGRAPH '89.

[15]  Wojciech Ziarko,et al.  The Discovery, Analysis, and Representation of Data Dependencies in Databases , 1991, Knowledge Discovery in Databases.

[16]  R. Michael Young,et al.  Cooperative Plan Identification: Constructing Concise and Effective Plan Descriptions , 1999, AAAI/IAAI.

[17]  Michael Stonebraker,et al.  Tioga: A database-oriented visualization tool , 1993, Proceedings Visualization '93.

[18]  Andrew K. C. Wong,et al.  Statistical Technique for Extracting Classificatory Knowledge from Databases , 1991, Knowledge Discovery in Databases.

[19]  James T. Enns,et al.  Effective visualization of large multidimensional datasets , 1996 .

[20]  Colin Ware,et al.  Color sequences for univariate maps: theory, experiments and principles , 1988, IEEE Computer Graphics and Applications.

[21]  David J. Hand,et al.  Patterns in statistical strategy , 1986 .

[22]  Christopher G. Healey,et al.  Choosing effective colours for data visualization , 1996, Proceedings of Seventh Annual IEEE Visualization '96.

[23]  Philip K. Robertson,et al.  Towards Perceptual Control of Markov Random Field Textures , 1995, Perceptual Issues in Visualization.

[24]  A. Ravishankar Rao,et al.  Identifying High Level Features of Texture Perception , 1993, CVGIP Graph. Model. Image Process..

[25]  P. Leblond,et al.  Computer Simulations of the Influence of Ocean Currents on Fraser River Sockeye Salmon (Oncorhynchus nerka) Return Times , 1994 .

[26]  Drew McDermott,et al.  Issues in the Development of Human-Computer Mixed Initiative Planning , 1994 .

[27]  Johanna D. Moore,et al.  AutoBrief: a multimedia presentation system for assisting data analysis , 1997, Comput. Stand. Interfaces.

[28]  Lawrence J. Rosenblum,et al.  Research issues in scientific visualization , 1994, IEEE Computer Graphics and Applications.

[29]  William Knight,et al.  Using visual texture for information display , 1995, TOGS.

[30]  J. M. Hans du Buf,et al.  A review of recent texture segmentation and feature extraction techniques , 1993 .

[31]  Steven K. Feiner,et al.  Generating Multimedia Briefings: Coordinating Language and Illustration , 1998, Artif. Intell..