Incorporating the Timeliness Quality Dimension in Internet Query Systems

Internet Query Systems (IQS) are information systems used to query the World Wide Web by finding data sources relevant to a given query and retrieving data taking into account issues such as the unpredictability of access and transfer rates, infinite streams of data, and the ability to produce partial results. Despite the wide availability of research focusing on query processing for IQS, there are surprisingly few contributions addressing data quality issues such as timeliness and accuracy of data resulting from Internet query processing. This paper provides an overview of an ongoing research effort to extend IQS with a data quality component to ensure timeliness of data resulting from Internet query processing. In particular, we illustrate the quality model, data source layer design and the quality aware algebraic query processing framework adopted in our implementation effort.

[1]  Jennifer L. Leopold,et al.  Webformulate: a web-based visual continual query system , 2002, WWW '02.

[2]  Richard Y. Wang,et al.  Modeling Information Manufacturing Systems to Determine Information Product Quality Management Scien , 1998 .

[3]  Jack E. Olson,et al.  Data Quality: The Accuracy Dimension , 2003 .

[4]  Robert Stevens,et al.  {TAMBIS}: {T}ransparent {A}ccess to {M}ultiple {B}ioinformatics {I}nformation {S}ources. An Overview , 1998, ISMB 1998.

[5]  Tiziana Catarci,et al.  The DaQuinCIS Broker: Querying Data and Their Quality in Cooperative Information Systems , 2003, J. Data Semant..

[6]  Arthur M. Lesk,et al.  Introduction to bioinformatics , 2002 .

[7]  David J. DeWitt,et al.  The Niagara Internet Query System , 2001, IEEE Data Eng. Bull..

[8]  Jim Smith,et al.  The Design, Implementation and Evaluation of an ODMG Compliant, Parallel Object Database Server , 2004, Distributed and Parallel Databases.

[9]  Carole A. Goble,et al.  TAMBIS: Transparent Access to Multiple Bioinformatics Information Sources , 1998, ISMB.

[10]  Stefano Spaccapietra,et al.  Journal on Data Semantics I , 2003, Lecture Notes in Computer Science.

[11]  Richard Y. Wang,et al.  Toward quality data: An attribute-based approach , 2014, Decis. Support Syst..

[12]  Norman W. Paton,et al.  Query processing in DOQL: A deductive database language for the ODMG model , 2000, Data Knowl. Eng..

[13]  Felix Naumann,et al.  Quality-driven Integration of Heterogenous Information Systems , 1999, VLDB.

[14]  Michael Gertz,et al.  Report on the Dagstuhl Seminar , 2004, SGMD.