A Clustering-Based Approach to Ontology Alignment

Ontology alignment is an important problem for the linked data web, as more and more ontologies and ontology instances get published for specific domains such as government and healthcare. A number of (semi-)automated alignment systems have been proposed in recent years. Most combine a set of similarity functions on lexical, semantic and structural features to align ontologies. Although these functions work well in many cases of ontology alignments, they fail to capture alignments when terms or structure varies vastly across ontologies. In this case, one is forced to rely on manual alignment. In this paper, we study whether it is feasible to re-use such expert provided ontology alignments for new alignment tasks. We focus in particular on many-to-one alignments, where the opportunity for re-use is feasible if alignments are stable. Specifically, we define the notion of a cluster as being made of multiple entities in the source ontology S that are mapped to the same entity in the target ontology τ. We test the stability hypothesis that the formed clusters of source ontology are stable across alignments to different target ontologies. If this hypothesis is valid, the clusters of an ontology S, built from an existing alignment with an ontology τ, can be effectively exploited to align S with a new ontology τ′. Evaluation on both manual and automated high-quality alignments show remarkable stability of clusters across ontology alignments in the financial domain and the healthcare and life sciences domain. Experimental evaluation also demonstrates the effectiveness of utilizing the stability of clusters in improving the alignment process in terms of precision and recall.

[1]  Mansur R. Kabuka,et al.  Ontology matching with semantic verification , 2009, J. Web Semant..

[2]  Steffen Staab,et al.  Handbook on Ontologies in Information Systems , 2003 .

[3]  Masaki Aono,et al.  Anchor-Flood: Results for OAEI 2009 , 2009, OM.

[4]  Pedro M. Domingos,et al.  Ontology Matching: A Machine Learning Approach , 2004, Handbook on Ontologies.

[5]  Ian Horrocks,et al.  The Semantic Web – ISWC 2010: 9th International Semantic Web Conference, ISWC 2010, Shanghai, China, November 7-11, 2010, Revised Selected Papers, Part I , 2010, SEMWEB.

[6]  Mansur R. Kabuka,et al.  ASMOV Results for OAEI 2007 , 2007, OM.

[7]  Cosmin Stroe,et al.  AgreementMaker: Efficient Matching for Large Real-World Schemas and Ontologies , 2009, Proc. VLDB Endow..

[8]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[9]  Yi Li,et al.  RiMOM: A Dynamic Multistrategy Ontology Alignment Framework , 2009, IEEE Transactions on Knowledge and Data Engineering.

[10]  Achille Fokoue,et al.  One Size Does Not Fit All: Customizing Ontology Alignment Using User Feedback , 2010, International Semantic Web Conference.

[11]  Abraham Bernstein,et al.  The Semantic Web - ISWC 2009, 8th International Semantic Web Conference, ISWC 2009, Chantilly, VA, USA, October 25-29, 2009. Proceedings , 2009, SEMWEB.

[12]  Enrico Motta,et al.  The Semantic Web - ISWC 2005, 4th International Semantic Web Conference, ISWC 2005, Galway, Ireland, November 6-10, 2005, Proceedings , 2005, SEMWEB.

[13]  Hyoil Han,et al.  A survey on ontology mapping , 2006, SGMD.

[14]  Baowen Xu,et al.  Lily: Ontology Alignment Results for OAEI 2008 , 2008, OM.

[15]  Lora Aroyo,et al.  The Semantic Web: Research and Applications , 2009, Lecture Notes in Computer Science.

[16]  Natalya F. Noy,et al.  Semantic integration: a survey of ontology-based approaches , 2004, SGMD.

[17]  Steffen Staab,et al.  Bootstrapping Ontology Alignment Methods with APFEL , 2005, International Semantic Web Conference.

[18]  Cosmin Stroe,et al.  BLOOMS on AgreementMaker: results for OAEI 2010 , 2010, OM.

[19]  Heiner Stuckenschmidt,et al.  Improving Ontology Matching Using Meta-level Learning , 2009, ESWC.

[20]  Mark A. Musen,et al.  What Four Million Mappings Can Tell You about Two Hundred Ontologies , 2009, SEMWEB.

[21]  Erhard Rahm,et al.  On Matching Large Life Science Ontologies in Parallel , 2010, DILS.

[22]  Erhard Rahm,et al.  Mapping Composition for Matching Large Life Science Ontologies , 2011, ICBO.