Database Semantic Interoperability based on Information Flow Theory and Formal Concept Analysis

As databases become widely used, there is a growing need to translate information between multiple databases. Semantic interoperability and integration has been a long standing challenge for the database community and has now become a prominent area of database research. In this paper, we aim to answer the question how semantic interoperability between two databases can be achieved by using Formal Concept Analysis (FCA for short) and Information Flow (IF for short) theories. For our purposes, firstly we discover knowledge from different databases by using FCA, and then align what is discovered by using IF and FCA. The development of FCA has led to some software systems such as TOSCANA and TUPLEWARE, which can be used as a tool for discovering knowledge in databases. A prototype based on the IF and FCA has been developed. Our method is tested and verified by using this prototype and TUPLEWARE.

[1]  Yun-Heh Chen-Burger,et al.  FCA in Knowledge Technologies: Experiences and Opportunities , 2004, ICFCA.

[2]  Robert Godin,et al.  Design of a browsing interface for information retrieval , 1989, SIGIR '89.

[3]  Robert E. Kent Distributed Conceptual Structures , 2001, RelMiCS.

[4]  Craig A. Knoblock,et al.  Learning domain-independent string transformation weights for high accuracy object identification , 2002, KDD.

[5]  Roger King,et al.  Report of the Workshop on Semantic Heterogeneity and Interpolation in multidatabase Systems , 1993, SGMD.

[6]  Dennis McLeod,et al.  The design and experimental evaluation of an information discovery mechanism for networks of autonomous database systems , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[7]  Rudolf Wille,et al.  Restructuring Lattice Theory: An Approach Based on Hierarchies of Concepts , 2009, ICFCA.

[8]  Salvatore J. Stolfo,et al.  The merge/purge problem for large databases , 1995, SIGMOD '95.

[9]  Gerd Stumme,et al.  Conceptual Knowledge Discovery in Databases Using Formal Concept Analysis Methods , 1998, PKDD.

[10]  John F. Sowa,et al.  Conceptual Structures: Fulfilling Peirce's Dream , 1997, Lecture Notes in Computer Science.

[11]  Yannis Kalfoglou,et al.  Using Information-Flow Theory to Enable Semantic Interoperability , 2003 .

[12]  Jan Komorowski,et al.  Principles of Data Mining and Knowledge Discovery , 2001, Lecture Notes in Computer Science.

[13]  DoanAnHai,et al.  Semantic-integration research in the database community , 2005 .

[14]  William M. Tepfenhart,et al.  Conceptual Structures: Standards and Practices , 1999, Lecture Notes in Computer Science.

[15]  Alon Y. Halevy,et al.  Semantic Integration Research in the Database Community : A Brief Survey , 2005 .

[16]  Rudolf Wille,et al.  The Lattice of Concept Graphs of a Relationally Scaled Context , 1999, ICCS.

[17]  Rudolf Wille,et al.  Conceptual Graphs and Formal Concept Analysis , 1997, ICCS.

[18]  Raymond J. Mooney,et al.  Adaptive duplicate detection using learnable string similarity measures , 2003, KDD '03.

[19]  Laks V. S. Lakshmanan,et al.  SchemaSQL - A Language for Interoperability in Relational Multi-Database Systems , 1996, VLDB.

[20]  Uta Priss Formal concept analysis in information science , 2006 .

[21]  Yannis Kalfoglou,et al.  The Information Flow Approach to Ontology-Based Semantic Alignment , 2010 .

[22]  Yannis Kalfoglou,et al.  Centre for Intelligent Systems and Their Applications , 2006 .

[23]  Jon Barwise,et al.  Information Flow: The Logic of Distributed Systems , 1997 .

[24]  R. Wille Concept lattices and conceptual knowledge systems , 1992 .

[25]  Chris Clifton,et al.  Database Integration Using Neural Networks: Implementation and Experiences , 2000, Knowledge and Information Systems.