Combining Semantic Information and Information Quality on the Enrichment of Web Data Integration Systems

The emergence of the Web and its permanent growth has caused a big impact on the database research community. Thereby, Database research areas have evolved in order to consider the new problems arising from the need of managing the huge volume of data available on the Web. One of such areas is Data Integration (DI), which is considered a pervasive challenge faced by applications that need to query across multiple autonomous and heterogeneous data sources. To help matters, we argue that semantic information like ontological and contextual information, combined with Information Quality (IQ) provided by IQ measures, may be employed together in order to enrich processes in DI (e.g., schema matching and query answering). In this paper, we present our ideas regarding what we mean by semantic information and IQ and why and how they may be combined in order to produce semantic knowledge to be used in Web Data Integration Systems. Furthermore, we propose a preliminary version of a metamodel, which presents a formal description of relationships between concepts associated with semantic information and IQ.

[1]  Vaninha Vieira,et al.  Investigating the Specifics of Contextual Elements Management: The CEManTIKA Approach , 2007, CONTEXT.

[2]  Damires Yluska de Souza Fernandes Using semantics to enhance query reformulation in dynamic distributed environments , 2009 .

[3]  Fausto Giunchiglia,et al.  S-Match: an Algorithm and an Implementation of Semantic Matching , 2004, ESWS.

[4]  Markus Helfert,et al.  A Context Aware Information Quality Framework , 2009, 2009 Fourth International Conference on Cooperation and Promotion of Information Resources in Science and Technology.

[5]  Ana Carolina Salgado,et al.  Towards a Context Ontology to Enhance Data Integration Processes , 2008, ODBIS.

[6]  Mouzhi Ge,et al.  A Review of Information Quality Research - Develop a Research Agenda , 2007, ICIQ.

[7]  Federica Mandreoli,et al.  Flexible query answering on graph-modeled data , 2009, EDBT '09.

[8]  Ana Carolina Salgado,et al.  A Context-Based Schema Integration Process Applied to Healthcare Data Sources , 2010, OTM Workshops.

[9]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[10]  Mohamed A. Soliman,et al.  A Survey of Data Management in Peer-to-Peer Systems , 2005 .

[11]  Carlo Curino,et al.  And what can context do for data? , 2009, Commun. ACM.

[12]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[13]  Frank van Harmelen,et al.  Query Processing in Ontology-Based Peer-to-Peer Systems , 2005 .

[14]  Jianing Wang,et al.  A Quality Framework for Data Integration , 2010, BNCOD.

[15]  Luis Olsina,et al.  Assessing Web Applications Consistently: A Context Information Approach , 2008, 2008 Eighth International Conference on Web Engineering.

[16]  Michael Berger,et al.  A metamodel approach to context information , 2005, Third IEEE International Conference on Pervasive Computing and Communications Workshops.

[17]  Tao Gu,et al.  Ontology based context modeling and reasoning using OWL , 2004, IEEE Annual Conference on Pervasive Computing and Communications Workshops, 2004. Proceedings of the Second.

[18]  Ana Carolina Salgado,et al.  A Semantic-Based Approach for Data Management in a P2P System , 2011, Trans. Large Scale Data Knowl. Centered Syst..

[19]  Ana Carolina Salgado,et al.  Data Integration Schema Analysis: An Approach With Information Quality , 2007, ICIQ.

[20]  Isabel F. Cruz,et al.  Query processing for heterogeneous data integration using ontologies , 2006 .

[21]  Felix Naumann,et al.  Benefit and Cost of Query Answering in PDMS , 2005, DBISP2P.

[22]  Anind K. Dey,et al.  Understanding and Using Context , 2001, Personal and Ubiquitous Computing.

[23]  Kimberly Keeton,et al.  Do you know your IQ?: a research agenda for information quality in systems , 2010, PERV.

[24]  Joann J. Ordille,et al.  Data integration: the teenage years , 2006, VLDB.

[25]  Yolande Berbers,et al.  When efficiency matters: Towards quality of context-aware peers for adaptive communication in VANETs , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[26]  Zohra Bellahsene,et al.  Measuring the Quality of an Integrated Schema , 2010, ER.

[27]  Ana Carolina Salgado,et al.  A Semantic-Based Ontology Matching Process for PDMS , 2009, Globe.