Knowledge Discovery in an Oceanographic Database

Knowledge discovery from image data is a multi-step iterative process. This paper describes the procedure we have used to develop a knowledge discovery system that classifies regions of the ocean floor based on textural features extracted from acoustic imagery. The image is subdivided into rectangular cells called texture elements (texels); a gray-level co-occurence matrix (GLCM) is computed for each texel in four directions. Secondary texture features are then computed from the GLCM resulting in a feature vector representation of each texel instance. Alternatively, a region-growing approach is used to identify irregularly shaped regions of varying size which have a homogenous texture and for which the texture features are computed. The Bayesian classifier Autoclass is used to cluster the instances. Feature extraction is one of the major tasks in knowledge discovery from images. The initial goal of this research was to identify regions of the image characterized by sand waves. Experiments were designed to use expert judgements to select the most effective set of features, to identify the best texel size, and to determine the number of meaningful classes in the data. The region-growing approach has proven to be more successful than the texel-based approach. This method provides a fast and accurate method for identifying provinces in the ocean floor of interest to geologists.

[1]  James Kelly,et al.  AutoClass: A Bayesian Classification System , 1993, ML.

[2]  Peter C. Cheeseman,et al.  Bayesian Classification (AutoClass): Theory and Results , 1996, Advances in Knowledge Discovery and Data Mining.

[3]  Philip S. Yu,et al.  Data Mining: An Overview from a Database Perspective , 1996, IEEE Trans. Knowl. Data Eng..

[4]  Alberto Malinverno,et al.  Segmentation of topographic profiles of the seafloor based on a self-affine model , 1989 .

[5]  Daniel Sauter,et al.  Spatial filtering for speckle reduction, contrast enhancement, and texture analysis of GLORIA images , 1994 .

[6]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[7]  Ronald J. Brachman,et al.  The Process of Knowledge Discovery in Databases , 1996, Advances in Knowledge Discovery and Data Mining.

[8]  Thomas H. Jordan,et al.  Stochastic modeling of seafloor morphology; Resolution of topographic parameters by sea beam data , 1989 .

[9]  Steven J. Fenves,et al.  Applying AI clustering to engineering tasks , 1993, IEEE Expert.

[10]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[11]  David Haussler,et al.  KDD for Science Data Analysis: Issues and Examples , 1996, KDD.

[12]  Richard R. Muntz,et al.  Scalable Exploratory Data Mining of Distributed Geoscientific Data , 1996, KDD.

[13]  M.,et al.  Statistical and Structural Approaches to Texture , 2022 .

[14]  T. Reed,et al.  Digital image processing techniques for enhancement and classification of SeaMARC II side scan sonar imagery , 1989 .

[15]  Casimir A. Kulikowski,et al.  Computer Systems That Learn: Classification and Prediction Methods from Statistics, Neural Nets, Machine Learning and Expert Systems , 1990 .