Using Memetic Algorithm for Instance Coreference Resolution

Instance coreference resolution is an essential problem in Semantic Web research, and it is critical for realizing the Web of Data and for the future integration and application of semantic data. In this paper, we propose using a Memetic Algorithm (MA) to solve the instance coreference problem in a sequential manner, i.e., instance-level matching is carried out using the result of schema-level matching. We first give the optimization models for schema-level matching and instance-level matching. We then present, for each level, the profile similarity measures and the rough evaluation metrics, under the assumption that the golden alignment for both schema-level and instance-level matching is one-to-one. Furthermore, we give the details of the MA. Finally, we compare our approach with state-of-the-art systems on OAEI benchmarks and real-world datasets, and the results demonstrate that our approach is effective.
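To make the general idea concrete, the following is a minimal sketch of a memetic algorithm (an evolutionary search combined with local search) applied to a one-to-one alignment problem. It is not the operators or objective of this paper, which the abstract does not specify. The sketch assumes a precomputed similarity matrix `sim`, takes the sum of matched similarities as the objective, and represents an alignment as a permutation; all function names and parameters (`memetic_align`, `pop_size`, `generations`, `rate`) are hypothetical.

```python
import random

def fitness(perm, sim):
    """Sum of similarities for the one-to-one alignment i -> perm[i]."""
    return sum(sim[i][j] for i, j in enumerate(perm))

def local_search(perm, sim):
    """Hill climbing: apply improving pairwise swaps until none remains."""
    perm = list(perm)
    improved = True
    while improved:
        improved = False
        for a in range(len(perm)):
            for b in range(a + 1, len(perm)):
                old = sim[a][perm[a]] + sim[b][perm[b]]
                new = sim[a][perm[b]] + sim[b][perm[a]]
                if new > old + 1e-12:
                    perm[a], perm[b] = perm[b], perm[a]
                    improved = True
    return perm

def crossover(p1, p2, rng):
    """Keep a random slice of p1 and fill the rest in p2's order,
    so the child is still a permutation (still one-to-one)."""
    n = len(p1)
    lo, hi = sorted(rng.sample(range(n + 1), 2))
    kept = set(p1[lo:hi])
    rest = [g for g in p2 if g not in kept]
    return rest[:lo] + p1[lo:hi] + rest[lo:]

def mutate(perm, rng, rate=0.2):
    """With probability `rate`, swap the targets of two random sources."""
    perm = list(perm)
    if rng.random() < rate:
        a, b = rng.sample(range(len(perm)), 2)
        perm[a], perm[b] = perm[b], perm[a]
    return perm

def memetic_align(sim, pop_size=20, generations=30, seed=0):
    """Assumed setup: square `sim` with at least two rows; higher is better."""
    rng = random.Random(seed)
    n = len(sim)
    # Every individual is refined by local search (the "memetic" step).
    pop = [local_search(rng.sample(range(n), n), sim) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda p: fitness(p, sim), reverse=True)
        parents = pop[: pop_size // 2]  # elitist selection of the better half
        children = []
        while len(parents) + len(children) < pop_size:
            p1, p2 = rng.sample(parents, 2)
            child = mutate(crossover(p1, p2, rng), rng)
            children.append(local_search(child, sim))
        pop = parents + children
    return max(pop, key=lambda p: fitness(p, sim))
```

In a sequential setting like the one described above, such a routine could in principle be run once on schema-level similarities and then again on instance-level similarities computed with the help of the first result; how the paper actually couples the two stages is not stated in the abstract.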
