Knowledge Discovery in Multiple Databases

Summary form is only given. Knowledge discovery in multiple databases is an important research area because (1) there is an urgent need for analyzing data in different sources, (2) there are essential differences between mono and multidatabase mining, and (3) there are limitations in existing multidatabase mining efforts. This talk describe a multidatabase mining process, and review some research issues related to multidatabase mining, including database clustering and local pattern analysis.

[1]  Xiaodong Chen,et al.  A Framework for Temporal Data Mining , 1998, DEXA.

[2]  Moustafa Ghanem,et al.  Large Scale Data Mining: Challenges and Responses , 1997, KDD.

[3]  Shichao Zhang,et al.  Association Rule Mining: Models and Algorithms , 2002 .

[4]  Shichao Zhang,et al.  Database clustering for mining multi-databases , 2002, 2002 IEEE World Congress on Computational Intelligence. 2002 IEEE International Conference on Fuzzy Systems. FUZZ-IEEE'02. Proceedings (Cat. No.02CH37291).

[5]  Ali R. Hurson,et al.  Multidatabase Systems: An Advance Solution for Global Information Sharing , 1993 .

[6]  Xindong Wu,et al.  Mining Both Positive and Negative Association Rules , 2002, ICML.

[7]  Joseph Albert,et al.  Theoretical foundations of schema restructuring in heterogeneous multidatabase systems , 2000, CIKM '00.

[8]  Salvatore J. Stolfo,et al.  An extensible meta-learning approach for scalable and accurate inductive learning , 1996 .

[9]  Srinivasan Parthasarathy,et al.  Parallel Data Mining for Association Rules on Shared-memory Systems , 1998 .

[10]  Gregory Piatetsky-Shapiro,et al.  Knowledge discovery workbench for exploring business databases , 1992, Int. J. Intell. Syst..

[11]  Hongjun Lu,et al.  Exception Rule Mining with a Relative Interestingness Measure , 2000, PAKDD.

[12]  Hongjun Lu,et al.  Toward Multidatabase Mining: Identifying Relevant Databases , 2001, IEEE Trans. Knowl. Data Eng..

[13]  Rajeev Motwani,et al.  Scalable Techniques for Mining Causal Structures , 1998, Data Mining and Knowledge Discovery.

[14]  Jiawei Han,et al.  GeoMiner: a system prototype for spatial data mining , 1997, SIGMOD '97.

[15]  Mounia Lalmas,et al.  Merging techniques for performing data fusion on the web , 2001, CIKM '01.

[16]  Padhraic Smyth,et al.  Rule Induction Using Information Theory , 1991, Knowledge Discovery in Databases.

[17]  Jiawei Han,et al.  Attribute-Oriented Induction in Relational Databases , 1991, Knowledge Discovery in Databases.

[18]  Abraham Silberschatz,et al.  Reliable transaction management in a multidatabase system , 1990, SIGMOD '90.

[19]  Hillol Kargupta,et al.  Distributed Clustering Using Collective Principal Component Analysis , 2001, Knowledge and Information Systems.

[20]  Philip S. Yu,et al.  Data Mining: An Overview from a Database Perspective , 1996, IEEE Trans. Knowl. Data Eng..

[21]  Bryan Horling,et al.  A Next Generation Information Gathering Agent , 1998 .

[22]  Chengqi Zhang,et al.  MINING DEPENDENT PATTERNS IN PROBABILISTIC DATABASES , 2004, Cybern. Syst..

[23]  Victor R. Lesser,et al.  BIG: An agent for resource-bounded information gathering and decision making , 2000, Artif. Intell..

[24]  Ali R. Hurson,et al.  A taxonomy and current issues in multidatabase systems , 1992, Computer.

[25]  Philip S. Yu,et al.  A new framework for itemset generation , 1998, PODS '98.

[26]  Chengqi Zhang,et al.  Discovering causality in large databases , 2002, Appl. Artif. Intell..

[27]  Nupur Bhatnagar Spatial Data Mining , 2006 .

[28]  Jiawei Han,et al.  Maintenance of discovered association rules in large databases: an incremental updating technique , 1996, Proceedings of the Twelfth International Conference on Data Engineering.

[29]  Shonali Krishnaswamy,et al.  An architecture to support distributed data mining services in e-commerce environments , 2000, Proceedings Second International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems. WECWIS 2000.

[30]  Saul A. Kripke,et al.  Semantical Analysis of Modal Logic I Normal Modal Propositional Calculi , 1963 .

[31]  Shamkant B. Navathe,et al.  Mining for strong negative associations in a large database of customer transactions , 1998, Proceedings 14th International Conference on Data Engineering.

[32]  Foster J. Provost,et al.  A Survey of Methods for Scaling Up Inductive Algorithms , 1999, Data Mining and Knowledge Discovery.

[33]  Jiawei Han,et al.  Knowledge Discovery in Databases: An Attribute-Oriented Approach , 1992, VLDB.

[34]  Jian Tang,et al.  Mining exception instances to facilitate workflow exception handling , 1999, Proceedings. 6th International Conference on Advanced Systems for Advanced Applications.

[35]  Chengqi Zhang,et al.  Mining small databases by collecting knowledge , 2001, Proceedings Seventh International Conference on Database Systems for Advanced Applications. DASFAA 2001.

[36]  Gregory F. Cooper,et al.  A Simple Constraint-Based Algorithm for Efficiently Mining Observational Databases for Causal Relationships , 1997, Data Mining and Knowledge Discovery.

[37]  Xindong Wu,et al.  Large scale data mining based on data partitioning , 2001, Appl. Artif. Intell..

[38]  Yiyu Yao,et al.  Peculiarity Oriented Multi-database Mining , 1999, PKDD.

[39]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD '00.

[40]  Rajeev Motwani,et al.  Dynamic itemset counting and implication rules for market basket data , 1997, SIGMOD '97.

[41]  Chengqi Zhang,et al.  Post-mining: maintenance of association rules by weighting , 2003, Inf. Syst..

[42]  Roberto J. Bayardo,et al.  Efficiently mining long patterns from databases , 1998, SIGMOD '98.

[43]  Hongjun Lu,et al.  Identifying Relevant Databases for Multidatabase Mining , 1998, PAKDD.

[44]  Stefan Wrobel,et al.  An Algorithm for Multi-relational Discovery of Subgroups , 1997, PKDD.

[45]  Shichao Zhang A nearest neighborhood algebra for probabilistic databases , 2000, Intell. Data Anal..

[46]  Masato Oguchi,et al.  Dynamic Remote Memory Acquiring for Parallel Data Mining on PC Cluster: Prliminary Performance Results , 1999, HPCN Europe.

[47]  Joseph Y. Halpern,et al.  A Guide to Completeness and Complexity for Modal Logics of Knowledge and Belief , 1992, Artif. Intell..

[48]  Hillol Kargupta,et al.  Collective Principal Component Analysis from Distributed, Heterogeneous Data , 2000, PKDD.

[49]  Xindong Wu,et al.  Identifying Quality Knowledge , 2004 .

[50]  Haym Hirsh,et al.  Incremental batch learning , 1989, ICML 1989.

[51]  Rajeev Motwani,et al.  Beyond market baskets: generalizing association rules to correlations , 1997, SIGMOD '97.

[52]  Xindong Wu,et al.  Multi-layer Incremental Induction , 1998, PRICAI.

[53]  Jinxin Lin Frameworks for dealing with conflicting information and applications , 1996 .