The DaQuinCIS Architecture : a Platform for Exchanging and Improving Data Quality in Cooperative Information Systems ?

In cooperative information systems, the quality of data exchanged and provided by different data sources is extremely important. A lack of attention to data quality can imply data of low quality to spread all over the cooperative system. At the same time, improvement can be based on comparing data, correcting them and thus disseminating high quality data. In this paper, we present an architecture for managing data quality in cooperative information systems, by focusing on two specific modules, the Data Quality Broker and the Quality Notification Service. The Data Quality Broker allows for querying and improving data quality values. The Quality Notification Service is specifically targeted to the dissemination of changes on data quality values.

[1]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[2]  Patrick A. V. Hall,et al.  Approximate String Matching , 1994, Encyclopedia of Algorithms.

[3]  Nancy A. Lynch,et al.  Impossibility of distributed consensus with one faulty process , 1983, PODS '83.

[4]  R. Snodgrass Temporal Databases , 1986, Computer.

[5]  Kenneth P. Birman,et al.  The process group approach to reliable distributed computing , 1992, CACM.

[6]  Stuart E. Madnick,et al.  Data quality requirements analysis and modeling , 2011, Proceedings of IEEE 9th International Conference on Data Engineering.

[7]  Dale Skeen,et al.  The Information Bus: an architecture for extensible distributed systems , 1994, SOSP '93.

[8]  Thomas Redman,et al.  Data quality for the information age , 1996 .

[9]  Sam Toueg,et al.  Unreliable failure detectors for reliable distributed systems , 1996, JACM.

[10]  Richard Y. Wang,et al.  Anchoring data quality dimensions in ontological foundations , 1996, CACM.

[11]  Stéphane Bressan,et al.  The Context Interchange mediator prototype , 1997, SIGMOD '97.

[12]  Rachid Guerraoui,et al.  Software-Based Replication for Fault Tolerance , 1997, Computer.

[13]  Rico Piantoni,et al.  Implementing the Swiss Exchange trading system , 1997, Proceedings of IEEE 27th International Symposium on Fault Tolerant Computing.

[14]  Matthias Jarke,et al.  Cooperative Information Systems: A Manifesto * , 1997 .

[15]  Maria-Esther Vidal,et al.  Querying Quality of Data Metadata , 1998 .

[16]  Richard Y. Wang,et al.  A product perspective on total data quality management , 1998, CACM.

[17]  Jian-Bo Yang,et al.  Multiple Attribute Decision Making , 1998 .

[18]  Richard Y. Wang,et al.  Modeling Information Manufacturing Systems to Determine Information Product Quality Management Scien , 1998 .

[19]  David S. Rosenblum,et al.  Challenges for Distributed Event Services: Scalability vs. Expressiveness , 1999 .

[20]  Marcos K. Aguilera,et al.  Matching events in a content-based subscription system , 1999, PODC '99.

[21]  Kenneth P. Birman,et al.  A review of experiences with reliable multicast , 1999, Softw. Pract. Exp..

[22]  Guruduth Banavar,et al.  An efficient multicast protocol for content-based publish-subscribe systems , 1999, Proceedings. 19th IEEE International Conference on Distributed Computing Systems (Cat. No.99CB37003).

[23]  M. Tamer Özsu,et al.  Conflict tolerant queries in AURORA , 1999, Proceedings Fourth IFCIS International Conference on Cooperative Information Systems. CoopIS 99 (Cat. No.PR00384).

[24]  Robert E. Gruber,et al.  The architecture of the READY event notification service , 1999, Proceedings. 19th IEEE International Conference on Distributed Computing Systems. Workshops on Electronic Commerce and Web-based Applications. Middleware.

[25]  Felix Naumann,et al.  Quality-driven Integration of Heterogenous Information Systems , 1999, VLDB.

[26]  Dennis Shasha,et al.  Publish/Subscribe on the Web at Extreme Speed , 2000, VLDB.

[27]  Bill Segall,et al.  Content Based Routing with Elvin4 , 2000 .

[28]  Jeffrey D. Ullman,et al.  Information integration using logical views , 1997, Theor. Comput. Sci..

[29]  Dennis Shasha,et al.  An extensible Framework for Data Cleaning , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[30]  Monica Scannapieco,et al.  Introducing Data Quality in a Cooperative Context , 2001, IQ.

[31]  Dennis Shasha,et al.  Filtering algorithms and implementation for very fast publish/subscribe systems , 2001, SIGMOD '01.

[32]  Hongjun Lu,et al.  Discovering and reconciling value conflicts for numerical data integration , 2001, Inf. Syst..

[33]  Antonio Sassano,et al.  Errors Detection and Correction in Large Scale Data Collecting , 2001, IDA.

[34]  Carlo Batini,et al.  Enabling Italian E-Government through a Cooperative Architecture , 2001, Computer.

[35]  Miguel Castro,et al.  SCRIBE: The Design of a Large-Scale Event Notification Infrastructure , 2001, Networked Group Communication.

[36]  Helmut Veith,et al.  Efficient filtering in publish-subscribe systems using binary decision diagrams , 2001, Proceedings of the 23rd International Conference on Software Engineering. ICSE 2001.

[37]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.

[39]  Chiara Francalanci,et al.  Data Quality Assurance in Cooperative Information Systems: A Multi-Dimension Quality Certificate , 2003 .

[40]  M. Jarke,et al.  Fundamentals of Data Warehouses , 2003, Springer Berlin Heidelberg.

[41]  Gunter Saake,et al.  Interactive example-driven integration and reconciliation for accessing database federations , 2003, Inf. Syst..

[42]  Anne-Marie Kermarrec,et al.  Lightweight probabilistic broadcast , 2003, TOCS.

[43]  Anne-Marie Kermarrec,et al.  The many faces of publish/subscribe , 2003, CSUR.

[44]  M. Bohanec,et al.  The Analytic Hierarchy Process , 2004 .

[45]  Salvatore J. Stolfo,et al.  Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem , 1998, Data Mining and Knowledge Discovery.