Antecedents of big data quality: An empirical examination in financial service organizations

Big data has been acknowledged for its enormous potential. In contrast to the potential, in a recent survey more than half of financial service organizations reported that big data has not delivered the expected value. One of the main reasons for this is related to data quality. The objective of this research is to identify the antecedents of big data quality in financial institutions. This will help to understand how data quality from big data analysis can be improved. For this, a literature review was performed and data was collected using three case studies, followed by content analysis. The overall findings indicate that there are no fundamentally new data quality issues in big data projects. Nevertheless, the complexity of the issues is higher, which makes it harder to assess and attain data quality in big data projects compared to the traditional projects. Ten antecedents of big data quality were identified encompassing data, technology, people, process and procedure, organization, and external aspects.

[1]  Jie Li,et al.  Rethinking big data: A review on the data quality and usage issues , 2016 .

[2]  Andrea De Mauro,et al.  What is big data? A consensual definition and a review of key research topics , 2015, AIP Conference Proceedings.

[3]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[4]  J. Efrim Boritz,et al.  IS practitioners' views on core concepts of information integrity , 2005, Int. J. Account. Inf. Syst..

[5]  M. Janssen,et al.  Factors influencing big data decision-making quality , 2017 .

[6]  Debra Zahay,et al.  Building the foundation for customer data quality in CRM systems for financial services firms , 2012 .

[7]  Steve Kelling,et al.  Taking a ‘Big Data’ approach to data quality in a citizen science project , 2015, Ambio.

[8]  Jacek Maslankowski Data Quality Issues Concerning Statistical Data Gathering Supported by Big Data Technology , 2014, BDAS.

[9]  Diane M. Strong,et al.  Product and Service Performance Model for Information Quality: An Update , 1998, IQ.

[10]  Total Quality data Management (TQdM) - Methodology for Information Quality Improvement , 2002, Information and Database Quality.

[11]  Yangyong Zhu,et al.  The Challenges of Data Quality and Data Quality Assessment in the Big Data Era , 2015, Data Sci. J..

[12]  Barbara Wixom,et al.  Antecedents of Information and System Quality: An Empirical Examination Within the Context of Data Warehousing , 2005, J. Manag. Inf. Syst..

[13]  Richard Y. Wang,et al.  Data Quality Assessment , 2002 .

[14]  Mario Piattini,et al.  A Data Quality in Use model for Big Data , 2016, Future Gener. Comput. Syst..

[15]  Richard Y. Wang,et al.  Anchoring data quality dimensions in ontological foundations , 1996, CACM.

[16]  Mouzhi Ge,et al.  A Review of Information Quality Research - Develop a Research Agenda , 2007, ICIQ.

[17]  A. Fardani Haryadi Requirements on and Antecedents of Big Data Quality: An Empirical Examination to Improve Big Data Quality in Financial Service Organizations , 2016 .

[18]  Mario Piattini,et al.  Getting Better Information Quality By Assessing And Improving Information Quality Management , 2004, ICIQ.

[19]  Bill McMullen,et al.  Big data, big data quality problem , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[20]  Carlo Batini,et al.  A Comprehensive Data Quality Methodology for Web and Structured Data , 2007, 2006 1st International Conference on Digital Information Management.