Entity Identification on the Semantic Web

At the core of every information integration and data exchange effort lies the ability to determine whether two pieces of information refer to the same real-world entity. This ability is of paramount importance for applications and systems operating in the highly heterogeneous web environment. Research in data management has long exploited features such as keys and schema constraints to deal with this issue, but the reality of the web has brought new challenges. In this work we survey a number of entity disambiguation and identification techniques and tools that can be used in semantic web applications and, more specifically, in an entity management system for the semantic
