Data Mining Opportunities in Very Large Object Oriented Databases

Information overload can hamper extracting hidden knowledge from a database. Data mining techniques ooer automated exploratory data analysis of databases. The mining process reveals knowledge buried in the data and provides insights into these data. Conventional mining techniques such as neural nets, regression analysis, and the discovering of rules uncover hidden information from user-provided training examples. A user indicates several of objects which are of interest to him (positive training examples) and few objects which are of no interest to him (negative training examples). The system then automatically discovers all interesting objects. In this position paper we present data mining opportunities that arise in very large object oriented databases, wherein the users no longer need to provide training examples. Data mining is achieved by user set priorities, user speciied guessed queries, and the notion of complete known information (i.e. information closure) about an object. From these inputs, hidden knowledge buried among huge amounts of information can be extracted in a user-driven manner.

[1]  Jiawei Han,et al.  Discovery of Data Evolution Regularities in Large Databases , 1994 .

[2]  Beat Wüthrich,et al.  On the learning of rule uncertainties and their integration into probabilistic knowledge bases , 1993, Journal of Intelligent Information Systems.

[3]  Luc De Raedt,et al.  Multiple Predicate Learning , 1993, IJCAI.

[4]  Saso Dzeroski,et al.  Inductive Learning in Deductive Databases , 1993, IEEE Trans. Knowl. Data Eng..

[5]  Philip K. Chan,et al.  Systems for Knowledge Discovery in Databases , 1993, IEEE Trans. Knowl. Data Eng..

[6]  Peter Schäuble,et al.  On the expressive power of query languages , 1994, TOIS.

[7]  Steven Salzberg,et al.  A Weighted Nearest Neighbor Algorithm for Learning with Symbolic Features , 2004, Machine Learning.

[8]  Kevin Wilkinson,et al.  The Iris Architecture and Implementation , 1990, IEEE Trans. Knowl. Data Eng..

[9]  Jiawei Han,et al.  Data-Driven Discovery of Quantitative Rules in Relational Databases , 1993, IEEE Trans. Knowl. Data Eng..

[10]  Oren Etzioni,et al.  Representation design and brute-force induction in a Boeing manufacturing domain , 1994, Appl. Artif. Intell..

[11]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[12]  Beat Wüthrich Probabilistic Knowledge Bases , 1995, IEEE Trans. Knowl. Data Eng..

[13]  Jörg Rech,et al.  Knowledge Discovery in Databases , 2001, Künstliche Intell..

[14]  Harris Drucker,et al.  Capacity and Complexity Control in Predicting the Spread Between Borrowing and Lending Interest Rates , 1995, KDD.

[15]  Beat Wüthrich,et al.  A Temporal and Probabilistic, Deductive and Object-Oriented Query Language , 1995, KDOOD/TDOOD.