Instance-based domain ontological view creation towards semantic integration

In many domains today there are very limited explicit ontologies established for implementing information systems. Traditional ontology-driven semantic integration approaches cannot be directly applied in integrating these information systems. Usually, the information systems have schemas, a type of formal information model, for their information repositories which to some extent imply the semantics of the information. Each schema actually reflects a specific view of the domain conceptualization. This paper investigates the theoretical foundation of ontologies and extends the traditional ontology concept to the ontological view concept. It proposes to use ontological views to address the challenge of semantic integration. The proposed approach adopts the schemas to create local ontological views, uses data instances of the information systems to discover semantic relationships between the concepts within the ontological views, and builds a domain ontological view based on the discovered equivalence mappings. It applies the hierarchical clustering technique on the data instances and, in the further analysis, uses the clusters to reduce the cost of processing a large amount of data. The matching of concept properties is based on the probability distribution of the data instances. The experimental results have demonstrated the effectiveness of this approach.

[1]  Weiming Shen,et al.  Multi-Agent Systems for Concurrent Intelligent Design and Manufacturing , 2000 .

[2]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.

[3]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[4]  Marvin Minsky,et al.  A framework for representing knowledge , 1974 .

[5]  Andrew McCallum,et al.  A unified approach for schema matching, coreference and canonicalization , 2008, KDD.

[6]  Amit P. Sheth,et al.  Changing Focus on Interoperability in Information Systems:From System, Syntax, Structure to Semantics , 1999 .

[7]  Mark A. Musen,et al.  Mediating Knowledge between Application Components , 2003 .

[8]  Michael F. Goodchild,et al.  Interoperating Geographic Information Systems , 2012 .

[9]  David A. Bell,et al.  A Novel Clustering-Based Approach to Schema Matching , 2006, ADVIS.

[10]  C. Tsokos,et al.  Developments in Nonparametric Density-Estimation , 1980 .

[11]  Dieter Fensel,et al.  Knowledge Engineering: Principles and Methods , 1998, Data Knowl. Eng..

[12]  Yuh-Min Chen,et al.  Developing a semantic-enable information retrieval mechanism , 2010, Expert Syst. Appl..

[13]  Willem Jonker,et al.  Using Element Clustering to Increase the Efficiency of XML Schema Matching , 2006, 22nd International Conference on Data Engineering Workshops (ICDEW'06).

[14]  Jacques-François Thisse,et al.  The Minimax-min Location Problem , 1986 .

[15]  Soe-Tsyr Yuan,et al.  A synthesis of semantic social network and attraction theory for innovating community-based e-service , 2010, Expert Syst. Appl..

[16]  Patrick Henry Winston,et al.  The psychology of computer vision , 1976, Pattern Recognit..

[17]  Stephan R. Sain,et al.  Multi-dimensional Density Estimation , 2004 .

[18]  Heiner Stuckenschmidt,et al.  Ontology-Based Integration of Information - A Survey of Existing Approaches , 2001, OIS@IJCAI.

[19]  Pedro M. Domingos,et al.  Reconciling schemas of disparate data sources: a machine-learning approach , 2001, SIGMOD '01.

[20]  Matthew Davidson,et al.  Essays In The Metaphysics Of Modality , 2003 .

[21]  N. Guarino,et al.  Formal Ontology in Information Systems : Proceedings of the First International Conference(FOIS'98), June 6-8, Trento, Italy , 1998 .

[22]  Wenfei Fan,et al.  Putting context into schema matching , 2006, VLDB.

[23]  Natalya F. Noy,et al.  Semantic integration: a survey of ontology-based approaches , 2004, SGMD.