TRC-Matcher and enhanced TRC-Matcher. New Tools for Automatic XML Schema Matching

Modern society depends on the access to a wide range of information that is located in heterogeneous data sources. Schema matching is a task of finding relationships among data source elements automatically. However, most of the existing schema matching software are semi-automatic meaning that they need a lot of interaction from an expert familiar with the systems being integrated. In this work, we propose a new hybrid matcher algorithm, called TRC-matcher, that is targeted for matching business oriented XML schemas with none or minor user assistance. When compared to previously published schema matching methods, the efficiency of the new algorithm is based on a new content profiling algorithm and on intelligent combination of matching results of multiple matching algorithms. In addition, an enhanced version of the TRC-Matcher is introduced that combines machine learning methods together with few new matching algorithms.

[2]  Sadok Ben Yahia,et al.  XMap results for OAEI 2017 , 2014, OM@ISWC.

[3]  Qilin Sun,et al.  GMap: results for OAEI 2015 , 2015, OM.

[4]  Jérôme Euzenat,et al.  A Survey of Schema-Based Matching Approaches , 2005, J. Data Semant..

[5]  Timo Knuutila,et al.  Current State of Ontology Matching. A Survey of Ontology and Schema Matching , 2015 .

[6]  Heiner Stuckenschmidt,et al.  Results of the Ontology Alignment Evaluation Initiative 2007 , 2006, OM.

[7]  Bill Swartout,et al.  Description Logic Specification from the KRSS Effort (Draft) , 1993 .

[8]  Yuzhong Qu,et al.  FalconAO: Aligning Ontologies with Falcon , 2005, Integrating Ontologies.

[9]  Strother H. Walker,et al.  Estimation of the probability of an event as a function of several independent variables. , 1967, Biometrika.

[10]  Serena Sorrentino,et al.  Automatic generation of probabilistic relationships for improving schema matching , 2011, Inf. Syst..

[11]  Achille Fokoue,et al.  One Size Does Not Fit All: Customizing Ontology Alignment Using User Feedback , 2010, International Semantic Web Conference.

[12]  Aynaz Taheri,et al.  AML results for OAEI 2015 , 2015, OM.

[13]  Jennifer Widom,et al.  Database Systems: The Complete Book , 2001 .

[14]  Alvaro E. Monge,et al.  The WEBFIND tool for finding scientific papers over the worldwide web , 2007 .

[15]  Jiaheng Lu,et al.  String similarity measures and joins with synonyms , 2013, SIGMOD '13.

[16]  William E. Winkler,et al.  String Comparator Metrics and Enhanced Decision Rules in the Fellegi-Sunter Model of Record Linkage. , 1990 .

[17]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[18]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[19]  Cosmin Stroe,et al.  AgreementMaker: Efficient Matching for Large Real-World Schemas and Ontologies , 2009, Proc. VLDB Endow..

[20]  Heiner Stuckenschmidt,et al.  Results of the Ontology Alignment Evaluation Initiative , 2007 .

[21]  Dan Gusfield Algorithms on Strings, Trees, and Sequences - Computer Science and Computational Biology , 1997 .

[22]  Erhard Rahm,et al.  Generic schema matching, ten years later , 2011, Proc. VLDB Endow..

[23]  Jérôme Euzenat,et al.  Ontology Matching: State of the Art and Future Challenges , 2013, IEEE Transactions on Knowledge and Data Engineering.

[24]  Imants Zarembo,et al.  Assessment of Name Based Algorithms for Land Administration Ontology Matching , 2015 .

[25]  Robert W. Irving,et al.  The Stable marriage problem - structure and algorithms , 1989, Foundations of computing series.

[26]  Christian Meilicke MAMBA - results for the OAEI 2015 , 2015, OM.