Generalized Affinity-Based Association Rule Mining for Multimedia Database Queries

Abstract. The recent progress in high-speed communication networks and large-capacity storage devices has led to a tremendous increase in the number of databases and the volume of data in them. This has created a need to discover structural equivalence relationships from the databases since queries tend to access information from structurally equivalent media objects residing in different databases. The more databases there are, the more query-processing performance improvement can be achieved when the structural equivalence relationships are automatically discovered. In response to such a demand, association rule mining has emerged and proven to be a highly successful technique for discovering knowledge from large databases. In this paper, we explore a generalized affinity-based association rule mining approach to discover the quasi-equivalence relationships from a network of databases. The algorithm is implemented and two empirical studies on real databases are conducted. The results show that the proposed generalized affinity-based association rule mining approach not only correctly exploits the set of quasi-equivalent media objects from the databases, but also outperforms the basic association rule mining approach in the discovery of the quasi-equivalent media object pairs.

[1]  Hongjun Lu,et al.  NeuroRule: A Connectionist Approach to Data Mining , 1995, VLDB.

[2]  Tian Zhang,et al.  BIRCH: an efficient data clustering method for very large databases , 1996, SIGMOD '96.

[3]  Peter C. Cheeseman,et al.  Bayesian Classification (AutoClass): Theory and Results , 1996, Advances in Knowledge Discovery and Data Mining.

[4]  Daryl Pregibon,et al.  A Statistical Perspective on Knowledge Discovery in Databases , 1996, Advances in Knowledge Discovery and Data Mining.

[5]  Rangasami L. Kashyap,et al.  Information Retrieval Using Markov Model Mediators In Multimedia Database Systems , 1998 .

[6]  Evangelos Simoudis,et al.  Integrating Inductive and Deductive Reasoning for Data Mining , 1996, Advances in Knowledge Discovery and Data Mining.

[7]  Ramez Elmasri,et al.  Fundamentals of database systems (2nd ed.) , 1994 .

[8]  W. H. Inmon,et al.  Building the data warehouse , 1992 .

[9]  James A. Larson,et al.  A Theory of Attribute Equivalence in Databases with Application to Schema Integration , 1989, IEEE Trans. Software Eng..

[10]  Thomas G. Dietterich,et al.  Readings in Machine Learning , 1991 .

[11]  Pat Langley,et al.  Elements of Machine Learning , 1995 .

[12]  James A. Larson,et al.  Integrating User Views in Database Design , 1986, Computer.

[13]  Hans-Peter Kriegel,et al.  Knowledge Discovery in Large Spatial Databases: Focusing Techniques for Efficient Class Identification , 1995, SSD.

[14]  Ramakrishnan Srikant,et al.  Mining generalized association rules , 1995, Future Gener. Comput. Syst..

[15]  Ramakrishnan Srikant,et al.  Mining quantitative association rules in large relational tables , 1996, SIGMOD '96.

[16]  Garrett L. Gleason Semantic Query Optimization in an Object-Oriented Semantic Association Model (OSAM) , 1990 .

[17]  Rangasami L. Kashyap,et al.  Discovering quasi-equivalence relationships from database systems , 1999, CIKM '99.

[18]  Sandra Heiler,et al.  Querying Part Hierarchies: A Knowledge-Based Approach , 1987, 24th ACM/IEEE Design Automation Conference.

[19]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[20]  P. Venkat Rangan,et al.  Collaborative Multimedia Systems: Synthesis of Media Objects , 1998, IEEE Trans. Knowl. Data Eng..

[21]  Vidette Poe Building a Data Warehouse for Decision Support , 1995 .

[22]  Philip S. Yu,et al.  Data Mining: An Overview from a Database Perspective , 1996, IEEE Trans. Knowl. Data Eng..

[23]  Rangasami L. Kashyap,et al.  Temporal And Spatial Semantic Models For Multimedia Presentations , 1997 .

[24]  Hing-Yan Lee,et al.  Exploiting Visualization in Knowledge Discovery , 1995, KDD.

[25]  Ramez Elmasri,et al.  Fundamentals of Database Systems , 1989 .

[26]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.