Probabilistic-Logical Web Data Integration

The integration of both distributed schemas and data repositories is a major challenge in data and knowledge management applications. Instances of this problem range from mapping database schemas to object reconciliation in the linked open data cloud. We present a novel approach to several important data integration problems that combines logical and probabilistic reasoning. We first provide a brief overview of some of the basic formalisms such as description logics and Markov logic that are used in the framework. We then describe the representation of the different integration problems in the probabilistic-logical framework and discuss efficient inference algorithms. For each of the applications, we conducted extensive experiments on standard data integration and matching benchmarks to evaluate the efficiency and performance of the approach. The positive results of the evaluation are quite promising and the flexibility of the framework makes it easily adaptable to other realworld data integration problems.

[1]  Dean Allemang,et al.  The Semantic Web - ISWC 2006, 5th International Semantic Web Conference, ISWC 2006, Athens, GA, USA, November 5-9, 2006, Proceedings , 2006, SEMWEB.

[2]  J. Euzenat,et al.  Ontology Matching , 2007, Springer Berlin Heidelberg.

[3]  Heiner Stuckenschmidt,et al.  Results of the Ontology Alignment Evaluation Initiative , 2007 .

[4]  Heiner Stuckenschmidt,et al.  Analyzing Mapping Extraction Approaches , 2007, OM.

[5]  Mansur R. Kabuka,et al.  Ontology matching with semantic verification , 2009, J. Web Semant..

[6]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[7]  Alon Y. Halevy,et al.  P-CLASSIC: A Tractable Probablistic Description Logic , 1997, AAAI/IAAI.

[8]  Manuel Ferre Haptics: Perception, Devices and Scenarios, 6th International Conference, EuroHaptics 2008, Madrid, Spain, June 10-13, 2008, Proceedings , 2008, EuroHaptics.

[9]  Li Ding,et al.  Characterizing the Semantic Web on the Web , 2006, SEMWEB.

[10]  Ian Horrocks,et al.  The OWL Instance Store: System Description , 2005, CADE.

[11]  Heiner Stuckenschmidt,et al.  Results of the Ontology Alignment Evaluation Initiative 2007 , 2006, OM.

[12]  Jacques Calmet,et al.  OntoBayes: An Ontology-Driven Uncertainty Model , 2005 .

[13]  Ben Taskar,et al.  Learning structured prediction models: a large margin approach , 2005, ICML.

[14]  Heiner Stuckenschmidt,et al.  Repairing Ontology Mappings , 2007, AAAI.

[15]  Enrico Motta,et al.  The Semantic Web - ISWC 2005, 4th International Semantic Web Conference, ISWC 2005, Galway, Ireland, November 6-10, 2005, Proceedings , 2005, SEMWEB.

[16]  Alexander Schrijver,et al.  Theory of linear and integer programming , 1986, Wiley-Interscience series in discrete mathematics and optimization.

[17]  Paulo Cesar G. da Costa,et al.  PR-OWL: A Framework for Probabilistic Ontologies , 2006, FOIS.

[18]  Eero Hyvönen,et al.  Modeling Uncertainty in Semantic Web Taxonomies , 2006 .

[19]  Cosmin Stroe,et al.  Efficient Selection of Mappings and Automatic Quality-driven Combination of Matching Methods , 2009, OM.

[20]  Steffen Staab,et al.  The Semantic Web - ISWC 2008, 7th International Semantic Web Conference, ISWC 2008, Karlsruhe, Germany, October 26-30, 2008. Proceedings , 2008, SEMWEB.

[21]  Feng Shi,et al.  RiMOM Results for OAEI 2009 , 2008, OM.

[22]  Graham Steel,et al.  Deduction with XOR Constraints in Security API Modelling , 2005, CADE.

[23]  Sriraam Natarajan,et al.  Speeding Up Inference in Markov Logic Networks by Preprocessing to Reduce the Size of the Resulting Grounded Network , 2009, IJCAI.

[24]  Daniel S. Weld,et al.  Automatically refining the wikipedia infobox ontology , 2008, WWW.

[25]  Yun Peng,et al.  BayesOWL: Uncertainty Modeling in Semantic Web Ontologies , 2006 .

[26]  Yun Peng,et al.  A Bayesian Network Approach to Ontology Mapping , 2005, SEMWEB.

[27]  Alfio Ferrara,et al.  Towards a Benchmark for Instance Matching , 2008, OM.

[28]  Heiner Stuckenschmidt A Semantic Similarity Measure for Ontology-Based Information , 2009, FQAS.

[29]  Philipp M. Yelland An Alternative Combination of Bayesian Networks and Description Logics , 2000, KR.

[30]  Heiner Stuckenschmidt,et al.  Log-Linear Description Logics , 2011, IJCAI.

[31]  References , 1971 .

[32]  Heiko Stoermer,et al.  Results of OKKAM Feature based Entity Matching Algorithm for Instance Matching Contest of OAEI 2009 , 2009, OM.

[33]  Ian Horrocks,et al.  A Software Framework for Matchmaking Based on Semantic Web Technology , 2004, Int. J. Electron. Commer..

[34]  Jeffrey M. Bradshaw,et al.  Applying KAoS Services to Ensure Policy Compliance for Semantic Web Services Workflow Composition and Enactment , 2004, SEMWEB.

[35]  Paulo Cesar G. da Costa,et al.  Of Starships and Klingons: Bayesian Logic for the 23rd Century , 2005, UAI.

[36]  Mathias Niepert A Delayed Column Generation Strategy for Exact k-Bounded MAP Inference in Markov Logic Networks , 2010, UAI.

[37]  Mansur R. Kabuka,et al.  ASMOV Results for OAEI 2007 , 2007, OM.

[38]  Nathalie Pernelle,et al.  Combining a Logical and a Numerical Method for Data Reconciliation , 2009, J. Data Semant..

[39]  Solomon Eyal Shimony,et al.  Markov Network Based Ontology Matching , 2009, IJCAI.

[40]  Frank Wolter,et al.  Semi-qualitative Reasoning about Distances: A Preliminary Report , 2000, JELIA.

[41]  Lawrence B. Holder,et al.  Mining Graph Data , 2006 .

[42]  Manfred Jaeger,et al.  Probabilistic Reasoning in Terminological Logics , 1994, KR.

[43]  Thomas Lukasiewicz,et al.  P-SHOQ(D): A Probabilistic Extension of SHOQ(D) for Probabilistic Ontologies in the Semantic Web , 2002, JELIA.

[44]  Rudolf Kruse,et al.  Symbolic and Quantitative Approaches to Uncertainty , 1991, Lecture Notes in Computer Science.

[45]  Sebastian Riedel Improving the Accuracy and Efficiency of MAP Inference for Markov Logic , 2008, UAI.

[46]  Heiner Stuckenschmidt,et al.  Leveraging Terminological Structure for Object Reconciliation , 2010, ESWC.

[47]  Heiner Stuckenschmidt,et al.  Benchmarking Matching Applications on the Semantic Web , 2011, ESWC.

[48]  C.J.H. Mann,et al.  Information Sharing on the Semantic web , 2005 .

[49]  Dan Roth,et al.  Integer linear programming inference for conditional random fields , 2005, ICML.

[50]  Iván V. Meza,et al.  Multilingual Semantic Role Labelling with Markov Logic , 2009, CoNLL Shared Task.

[51]  Ivan P. Fellegi,et al.  A Theory for Record Linkage , 1969 .

[52]  Heiner Stuckenschmidt,et al.  A Probabilistic-Logical Framework for Ontology Matching , 2010, AAAI.

[53]  Ian Horrocks,et al.  Using Vampire to Reason with OWL , 2004, SEMWEB.

[54]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[55]  Jochen Heinsohn,et al.  A Hybrid Approach for Modeling Uncertainty in Terminological Logics , 1991, ECSQARU.

[56]  Lise Getoor,et al.  Query-time entity resolution , 2006, KDD '06.

[57]  Erhard Rahm,et al.  Similarity flooding: a versatile graph matching algorithm and its application to schema matching , 2002, Proceedings 18th International Conference on Data Engineering.

[58]  Heiner Stuckenschmidt,et al.  An Efficient Method for Computing Alignment Diagnoses , 2009, RR.

[59]  Alexander Borgida,et al.  On the Relative Expressiveness of Description Logics and Predicate Logics , 1996, Artif. Intell..

[60]  Jan Nößner,et al.  CODI: Combinatorial Optimization for Data Integration: results for OAEI 2011 , 2010, OM.

[61]  Yuzhong Qu,et al.  ObjectCoref & Falcon-AO: results for OAEI 2010 , 2010, OM.

[62]  Jérôme David,et al.  Matching directories and OWL ontologies with AROMA , 2006, CIKM '06.

[63]  Cosmin Stroe,et al.  Using AgreementMaker to align ontologies for OAEI 2010 , 2010, OM.

[64]  Zongmin Ma Soft computing in ontologies and semantic web , 2006 .

[65]  V. Svátek,et al.  OntoFarm : Towards an Experimental Collection of Parallel Ontologies , 2005 .

[66]  Lise Getoor,et al.  Entity Resolution in Graphs , 2005 .

[67]  Mansur R. Kabuka,et al.  ASMOV: results for OAEI 2010 , 2010, OM.

[68]  Yarden Katz,et al.  Pellet: A practical OWL-DL reasoner , 2007, J. Web Semant..