String Similarity Metrics for Ontology Alignment

Ontology alignment is an important part of enabling the Semantic Web to reach its full potential. The vast majority of ontology alignment systems use one or more string similarity metrics, but the choice of metrics often receives little attention. In this work we evaluate a wide range of such metrics, together with string pre-processing strategies such as removing stop words and considering synonyms, on different types of ontologies. We also present a set of guidelines on when to use which metric. Furthermore, we show that optimally chosen string similarity metrics can, on their own, produce alignments that are competitive with state-of-the-art ontology alignment systems. Finally, we examine how much an existing ontology alignment system can be improved by an automated string metric selection strategy based on the characteristics of the ontologies to be aligned.
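
To make the ingredients named above concrete, the following is a minimal sketch of one pre-processing strategy (stop word removal) combined with two common string similarity metrics (token-set Jaccard and normalized edit distance). It is an illustration under stated assumptions, not the evaluation code behind this work: the stop word list, the tokenization rule, and all function names are hypothetical choices made only for this example.

```python
import re

# Hypothetical stop word list, chosen only for illustration.
STOP_WORDS = {"a", "an", "the", "of", "has", "is", "in", "for"}

def tokenize(label):
    """Split an entity label on camelCase boundaries, underscores, hyphens, and spaces."""
    spaced = re.sub(r"(?<=[a-z0-9])(?=[A-Z])", " ", label)
    return [t.lower() for t in re.split(r"[\s_\-]+", spaced) if t]

def preprocess(label, remove_stop_words=True):
    """Tokenize a label and optionally drop stop words."""
    tokens = tokenize(label)
    if remove_stop_words:
        kept = [t for t in tokens if t not in STOP_WORDS]
        tokens = kept or tokens  # keep the original tokens if everything was a stop word
    return tokens

def jaccard(tokens_a, tokens_b):
    """Token-set Jaccard similarity in [0, 1]."""
    set_a, set_b = set(tokens_a), set(tokens_b)
    if not set_a and not set_b:
        return 1.0
    return len(set_a & set_b) / len(set_a | set_b)

def levenshtein(s, t):
    """Edit distance with unit-cost insertions, deletions, and substitutions."""
    prev = list(range(len(t) + 1))
    for i, cs in enumerate(s, 1):
        curr = [i]
        for j, ct in enumerate(t, 1):
            cost = 0 if cs == ct else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]

def edit_similarity(s, t):
    """Edit distance normalized by the longer string, mapped to a similarity in [0, 1]."""
    longest = max(len(s), len(t))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(s, t) / longest

if __name__ == "__main__":
    # With stop word removal, "hasAuthor" and "Author" share the same token set.
    print(jaccard(preprocess("hasAuthor"), preprocess("Author")))
    # Without it, the extra token "has" lowers the score.
    print(jaccard(preprocess("hasAuthor", False), preprocess("Author", False)))
    print(edit_similarity("kitten", "sitting"))
```

The same pair of labels scores differently depending on the pre-processing applied, which is the kind of effect a metric selection strategy has to account for.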
