For technical and architectural reasons data in information systems are often redundant in various databases. Data changes are propagated between the various databases through a synchronization mechanism, which ensures a certain degree of consistency. Depending on the time delay of propagating data changes, synchronization is classified in real time synchronization and lazy synchronization in case of respectively high or low synchronization frequency. In practice, lazy synchronization is very commonly applied but, because of the delay in data synchronization, it causes misalignments among data values resulting in a negative impact on data quality. Indeed, the raise of the time interval between two realignments increases the probability that data result incorrect or out-of-date. The paper analyses the correlation between data quality criteria and the synchronization frequency and reveals the presence of trade-offs between different criteria such as availability and timeliness. The results illustrate the problem of balancing various data quality requirements within the design of information systems. The problem is examined in selected types of information systems that are in general characterized by high degree of data redundancy.
[1]
Richard Y. Wang,et al.
Anchoring data quality dimensions in ontological foundations
,
1996,
CACM.
[2]
Esther Pacitti,et al.
Update propagation strategies to improve freshness in lazy master replicated databases
,
2000,
The VLDB Journal.
[3]
Daniel Barbará,et al.
The cost of data replication
,
1981,
SIGCOMM 1981.
[4]
Chiara Francalanci,et al.
Time-Related Factors of Data Quality in Multichannel Information Systems
,
2003,
J. Manag. Inf. Syst..
[5]
Ken Orr,et al.
Data quality and systems theory
,
1998,
CACM.
[6]
Matthias Jarke,et al.
Fundamentals of Data Warehouses
,
2000,
Springer Berlin Heidelberg.