A Methodology for Learning Across Application Domains for Database Design Systems

Although database design tools have been developed that attempt to automate (or semiautomate) the design process, these tools cannot capture common sense knowledge about business applications and store it in a context-specific manner. As a result, they rely on the user to provide a great deal of "trivial" detail and do not perform as well as a human designer, who usually has some general idea of how an application might work based on his or her common sense knowledge of the real world. A database design system could use common sense knowledge to validate and improve the quality of an existing design, or even to generate new designs. This requires that context-specific information about different database design applications be stored and generalized into information about specific application domains (e.g., pharmacy, daycare, hospital, university, manufacturing). Such information should be stored at the appropriate level of generality in a hierarchically structured knowledge base so that it can be inherited by the subdomains below. For this to occur, two types of learning must take place. First, knowledge about a particular application domain that is acquired from specific applications within that domain is generalized into a domain node (e.g., entities, relationships, and attributes from various hospital applications are generalized to a hospital node). This is referred to as within-domain learning. Second, the information common to two (or more) related application domain nodes is generalized to a higher-level node; for example, knowledge from the car rental and video rental domains may be generalized to a rental node. This is called across-domain learning. This paper presents a methodology for learning across different application domains based on a distance measure. The parameters used in this methodology were refined by testing on a set of representative cases; empirical testing provided further validation.
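
The two types of learning can be illustrated with a minimal sketch. It is not the methodology of the paper: the abstract does not specify the distance measure or its parameters, so the sketch assumes a Jaccard distance over sets of concept names, a hypothetical cutoff `max_distance`, and a simple majority rule for within-domain generalization. The names `DomainNode`, `within_domain_learning`, `jaccard_distance`, and `across_domain_learning` are illustrative only.

```python
from dataclasses import dataclass, field


@dataclass
class DomainNode:
    """A node in a hypothetical domain hierarchy; concepts are plain strings."""
    name: str
    concepts: set = field(default_factory=set)
    children: list = field(default_factory=list)


def within_domain_learning(name, applications):
    """Generalize several application designs into one domain node.

    Assumption (not from the paper): a concept is kept if it appears in
    at least half of the applications.
    """
    counts = {}
    for app in applications:
        for concept in app:
            counts[concept] = counts.get(concept, 0) + 1
    cutoff = len(applications) / 2
    return DomainNode(name, {c for c, n in counts.items() if n >= cutoff})


def jaccard_distance(a, b):
    """Stand-in distance measure: 1 - |a & b| / |a | b|."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def across_domain_learning(parent_name, left, right, max_distance=0.6):
    """Create a higher-level node from two domain nodes if they are close.

    Shared concepts move up to the parent so the children inherit them;
    each child keeps only its domain-specific concepts. Returns None when
    the domains are too far apart under the assumed cutoff.
    """
    if jaccard_distance(left.concepts, right.concepts) > max_distance:
        return None
    shared = left.concepts & right.concepts
    parent = DomainNode(parent_name, set(shared), [left, right])
    left.concepts -= shared
    right.concepts -= shared
    return parent


# Usage with made-up concept sets for the rental example in the text.
car = DomainNode("car rental", {"customer", "vehicle", "rental_agreement", "branch"})
video = DomainNode("video rental", {"customer", "video", "rental_agreement", "branch"})
rental = across_domain_learning("rental", car, video)
```

In this sketch the shared concepts end up on the `rental` node, while `car` and `video` retain only `vehicle` and `video` respectively. A real system would also have to compare relationships and attributes and resolve naming differences between domains, which plain set intersection ignores.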
