Trusting Data Quality in Cooperative Information Systems

Current approaches to the development of cooperative information systems are based on services to be offered by cooperating organizations, and on the opportunity of building coordinators and brokers on top of such services. The quality of data exchanged and provided by different services hampers such approaches, as data of low quality can spread all over the cooperative system. At the same time, improvement can be based on comparing data, correcting them and disseminating high quality data. In this paper, a service-based framework for managing data quality in cooperative information systems is presented. An XML-based model for data and quality data is proposed, and the design of a broker, which selects the best available data from different services, is presented. Sucha broker also supports the improvement of data based on feedbacks to source services.

[1]  Richard Y. Wang,et al.  A product perspective on total data quality management , 1998, CACM.

[2]  Torben Bach Pedersen,et al.  Multidimensional Database Technology , 2001, Computer.

[3]  Mike P. Papazoglou,et al.  Cooperative Information Systems: Trends and Directions , 1997 .

[4]  Maria-Esther Vidal,et al.  Querying Quality of Data Metadata , 1998 .

[5]  Massimo Mecella,et al.  Data Quality in Cooperative Information Systems , 2005 .

[6]  Carlo Batini,et al.  Data Quality in e-Business Applications , 2002, WES.

[7]  Matthias Jarke,et al.  Cooperative Information Systems: A Manifesto * , 1997 .

[8]  Stuart E. Madnick,et al.  Data quality requirements analysis and modeling , 2011, Proceedings of IEEE 9th International Conference on Data Engineering.

[9]  Matthias Jarke,et al.  Fundamentals of Data Warehouses , 2000, Springer Berlin Heidelberg.

[10]  Antonino Virgillito Carlo Marchetti,et al.  The DaQuinCIS Architecture : a Platform for Exchanging and Improving Data Quality in Cooperative Information Systems ? , 2003 .

[11]  Maurizio Lenzerini,et al.  Interschema knowledge in cooperative information , 1993, [1993] Proceedings International Conference on Intelligent and Cooperative Information Systems.

[12]  P. Mouncey Improving Data Warehouse and Business Information Quality , 2001 .

[13]  Matthias Jarke,et al.  Design and Analysis of Quality Information for Data Warehouses , 1998, ER.

[14]  Karl Aberer,et al.  Managing trust in a peer-2-peer information system , 2001, CIKM '01.

[15]  Dennis Shasha,et al.  An extensible Framework for Data Cleaning , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[16]  Lik Mui,et al.  Notions of reputation in multi-agents systems: a review , 2002, AAMAS '02.

[17]  Tiziana Catarci,et al.  Managing Data Quality in Cooperative Information Systems , 2002, OTM.

[18]  Yolanda Gil,et al.  Trusting Information Sources One Citizen at a Time , 2002, SEMWEB.

[19]  Hye-Young Paik,et al.  Peer-to-Peer Traced Execution of Composite Services , 2001, TES.

[20]  Umeshwar Dayal,et al.  Business Process Coordination: State of the Art, Trends, and Open Issues , 2001, VLDB.

[21]  Alin Deutsch,et al.  XML-QL: A Query Language for XML , 1998 .

[22]  Timothy W. Finin,et al.  Agents, trust, and information access on the semantic web , 2002, SGMD.

[23]  Gabriel M. Kuper,et al.  Structural Properties of XPath Fragments , 2003, ICDT.

[24]  Jay L. Devore,et al.  Probability and statistics for engineering and the sciences , 1982 .

[25]  Nancy A. Lynch,et al.  Impossibility of distributed consensus with one faulty process , 1985, JACM.

[26]  Stephen Marsh,et al.  Formalising Trust as a Computational Concept , 1994 .

[27]  Alin Deutsch,et al.  A Query Language for XML , 1999, Comput. Networks.

[28]  Sam Toueg,et al.  Fault-tolerant broadcasts and related problems , 1993 .

[29]  Monica Scannapieco,et al.  Introducing Data Quality in a Cooperative Context , 2001, IQ.

[30]  Maurizio Lenzerini,et al.  Representing and Using Interschema Knowledge in Cooperative Information Systems , 1993, Int. J. Cooperative Inf. Syst..

[31]  Peter Szolovits,et al.  Ratings in Distributed Systems: A Bayesian Approach , 2002 .

[32]  Thomas Redman,et al.  Data quality for the information age , 1996 .

[33]  Felix Naumann,et al.  Quality-Driven Query Answering for Integrated Information Systems , 2002, Lecture Notes in Computer Science.

[34]  Ernesto Damiani,et al.  A reputation-based approach for choosing reliable resources in peer-to-peer networks , 2002, CCS '02.

[35]  Ernesto Damiani,et al.  Choosing reputable servents in a P2P network , 2002, WWW.

[36]  Tok Wang Ling,et al.  Conceptual Modeling – ER ’98 , 1998, Lecture Notes in Computer Science.

[37]  Massimo Mecella,et al.  Cooperative Processes and e-Services , 2002 .

[38]  David Jordan,et al.  The Object Database Standard: ODMG 2.0 , 1997 .

[39]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.

[40]  Daniela Florescuand An Extensible Framework for Data Cleaning , 2000, ICDE 2000.

[41]  Carlo Batini,et al.  Enabling Italian E-Government through a Cooperative Architecture , 2001, Computer.