Data quality for data science, predictive analytics, and big data in supply chain management: An introduction to the problem and suggestions for research and applications

Today׳s supply chain professionals are inundated with data, motivating new ways of thinking about how data are produced, organized, and analyzed. This has provided an impetus for organizations to adopt and perfect data analytic functions (e.g. data science, predictive analytics, and big data) in order to enhance supply chain processes and, ultimately, performance. However, management decisions informed by the use of these data analytic methods are only as good as the data on which they are based. In this paper, we introduce the data quality problem in the context of supply chain management (SCM) and propose methods for monitoring and controlling data quality. In addition to advocating for the importance of addressing data quality in supply chain research and practice, we also highlight interdisciplinary research topics based on complementary theory.

[1]  Thomas Rbement,et al.  Fundamentals of quality control and improvement , 1993 .

[2]  S. Rahman Quality management in logistics: an examination of industry practices , 2006 .

[3]  Thomas Redman,et al.  The impact of poor data quality on the typical enterprise , 1998, CACM.

[4]  Douglas C. Montgomery,et al.  Introduction to Statistical Quality Control , 1986 .

[5]  Matthew A. Waller,et al.  Making Sense Out of Chaos: Why Theory is Relevant to Supply Chain Research , 2011 .

[6]  Diane M. Strong,et al.  AIMQ: a methodology for information quality assessment , 2002, Inf. Manag..

[7]  Richard L. Daft,et al.  Organizational information requirements, media richness and structural design , 1986 .

[8]  Ludwig von Bertalanffy,et al.  General System Theory , 1969 .

[9]  Benjamin T. Hazen,et al.  Applying Control Chart Methods to Enhance Data Quality , 2014, Technometrics.

[10]  Michael L. Tushman,et al.  Information Processing as an Integrating Concept in Organizational Design. , 1978 .

[11]  Gerald J. Lieberman,et al.  Statistical Process Control and The Impact of Automatic Process Control , 1965 .

[12]  Roberto da Costa Quinino,et al.  An attribute control chart for monitoring the variability of a process , 2013 .

[13]  Richard Y. Wang,et al.  A product perspective on total data quality management , 1998, CACM.

[14]  J. M. Whipple,et al.  Strategic Alliance Success Factors , 2000 .

[15]  Carl E. Pierchala,et al.  Control charts as a tool for data quality control , 2009 .

[16]  Reuben E. Slone Leading a supply chain turnaround. , 2004, Harvard business review.

[17]  Diane M. Strong,et al.  Process-Embedded Data Integrity , 2004, J. Database Manag..

[18]  Thomas Redman,et al.  Data quality for the information age , 1996 .

[19]  Paul Mangiameli,et al.  The Effects and Interactions of Data Quality and Problem Complexity on Classification , 2011, JDIQ.

[20]  Richard Y. Wang,et al.  Modeling Information Manufacturing Systems to Determine Information Product Quality Management Scien , 1998 .

[21]  Ihab Hanna Sawalha,et al.  Quality control and supply chain management: a contextual perspective and a case study , 2013 .

[22]  Diane M. Strong,et al.  Information quality benchmarks: product and service performance , 2002, CACM.

[23]  E. S. Page Cumulative Sum Charts , 1961 .

[24]  M. Naim,et al.  Industrial Dynamics Simulation Models in the Design of Supply Chains , 1992 .

[25]  Zoe J. Radnor,et al.  Theoretical perspectives in purchasing and supply chain management: an analysis of the literature , 2010 .

[26]  Debabrata Dey,et al.  Reassessing Data Quality for Information Products , 2010, Manag. Sci..

[27]  Thomas H. Davenport,et al.  Book review:Working knowledge: How organizations manage what they know. Thomas H. Davenport and Laurence Prusak. Harvard Business School Press, 1998. $29.95US. ISBN 0‐87584‐655‐6 , 1998 .

[28]  Shouyang Wang,et al.  Information and decision-making delays in MRP, KANBAN, and CONWIP , 2014 .

[29]  Jay R. Galbraith Organization Design: An Information Processing View , 1974 .

[30]  R. Grant Toward a Knowledge-Based Theory of the Firm,” Strategic Management Journal (17), pp. , 1996 .

[31]  J. Brett,et al.  Managing multicultural teams. , 2006, Harvard business review.

[32]  Thomas C. Redman,et al.  Data Quality Management and Technology , 1992 .

[33]  T. Davenport Competing on analytics. , 2006, Harvard business review.

[34]  H. Hotelling,et al.  Multivariate Quality Control , 1947 .

[35]  Charles A. O'Reilly,et al.  Variations in Decision Makers' Use of Information Sources: The Impact of Quality and Accessibility of Information , 1982 .

[36]  José Farinha,et al.  A Data Quality Metamodel Extension to CWM , 2007, APCCM.

[37]  G. Hult,et al.  Bridging organization theory and supply chain management: The case of best value supply chains , 2007 .

[38]  R. Crosier Multivariate generalizations of cumulative sum quality-control schemes , 1988 .

[39]  Zachary G. Stoumbos,et al.  A CUSUM Chart for Monitoring a Proportion When Inspecting Continuously , 1999 .

[40]  Donald P. Ballou,et al.  Modeling Data and Process Quality in Multi-Input, Multi-Output Information Systems , 1985 .

[41]  G. Gaalman,et al.  A model of strategic product quality and process improvement incentives , 2014 .

[42]  Ray Y. Zhong,et al.  A big data approach for logistics trajectory discovery from RFID-enabled production data , 2015 .

[43]  Varghese S. Jacob,et al.  Assessing Data Quality for Information Products: Impact of Selection, Projection, and Cartesian Product , 2004, Manag. Sci..

[44]  Alan R. Hevner,et al.  Integrated decision support systems: A data warehousing perspective , 2007, Decis. Support Syst..

[45]  William H. Woodall,et al.  Controversies and Contradictions in Statistical Process Control , 2000 .

[46]  Ismail Sila,et al.  Quality in supply chains: an empirical analysis , 2006 .

[47]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[48]  Anders Haug,et al.  Barriers to master data quality , 2011, J. Enterp. Inf. Manag..

[49]  J. Barney Firm Resources and Sustained Competitive Advantage , 1991 .

[50]  Charles W. Champ,et al.  A multivariate exponentially weighted moving average control chart , 1992 .

[51]  Maurice Kügler,et al.  The impact of data quality and analytical capabilities on planning performance: insights from the automotive industry , 2011, Wirtschaftsinformatik.

[52]  S. W. Roberts Control chart tests based on geometric moving averages , 2000 .

[53]  Thomas Y. Choi,et al.  Comparison of Quality Management Practices: Across the Supply Chain and Industries , 1999 .

[54]  Ken Orr,et al.  Data quality and systems theory , 1998, CACM.

[55]  E. Hartmann,et al.  INTEGRATION IN THE GLOBAL SOURCING ORGANIZATION — AN INFORMATION PROCESSING PERSPECTIVE , 2009 .

[56]  Carlo Batini,et al.  Methodologies for data quality assessment and improvement , 2009, CSUR.

[57]  Robert G. Dyson,et al.  The relationship of participation and effectiveness in strategic planning , 1982 .

[58]  Thomas J. Steenburgh,et al.  Motivating Salespeople: What Really Works , 2012, Harvard business review.

[59]  Veda C. Storey,et al.  A Framework for Analysis of Data Quality Research , 1995, IEEE Trans. Knowl. Data Eng..

[60]  Adir Even,et al.  Data quality assessment in context: A cognitive perspective , 2009, Decis. Support Syst..

[61]  Dominic Barton,et al.  Making advanced analytics work for you. , 2012, Harvard business review.

[62]  Amir Parssian,et al.  Managerial decision support with knowledge of accuracy and completeness of the relational aggregate functions , 2006, Decis. Support Syst..

[63]  L. J. Porter,et al.  Quality costing for total quality management , 1992 .

[64]  Thomas C. Redman,et al.  Data Quality: The Field Guide , 2001 .

[65]  Benjamin T. Hazen,et al.  Cloud Computing in Support of Supply Chain Information System Infrastructure: Understanding When to go to the Cloud , 2013 .

[66]  F. R. Keller,et al.  Data Quality , 1990, Taxation and Labour Supply.

[67]  S. Fawcett,et al.  Data Science, Predictive Analytics, and Big Data: A Revolution that Will Transform Supply Chain Design and Management , 2013 .

[68]  Fugee Tsung,et al.  A comparison study of effectiveness and robustness of control charts for monitoring process mean , 2012 .

[69]  L. Fredendall,et al.  Mapping the critical links between organizational culture and TQM/Six Sigma practices , 2010 .

[70]  Nada R. Sanders,et al.  The Emerging Role of the Third‐Party Logistics Provider (3PL) as an Orchestrator , 2011 .

[71]  Stelios Psarakis,et al.  Review of multinomial and multiattribute quality control charts , 2009, Qual. Reliab. Eng. Int..

[72]  Kaitlin S. Dunn,et al.  An Empirically Derived Framework of Global Supply Resiliency , 2011 .

[73]  Jianxin Jiao,et al.  A single control chart for monitoring the frequency and magnitude of an event , 2009 .

[74]  A. R. Crathorne,et al.  Economic Control of Quality of Manufactured Product. , 1933 .

[75]  Kevin Laframboise,et al.  Gaining Competitive Advantage From Integrating Enterprise Resource Planning and Total Quality Management , 2005 .

[76]  D. Delen,et al.  RFID for Better Supply‐Chain Management through Enhanced Information Visibility , 2007 .

[77]  B. Chae,et al.  Insights from hashtag #supplychain and Twitter Analytics: Considering Twitter and Twitter data for supply chain practice and research , 2015 .

[78]  H. Jonas,et al.  General system theory; a new approach to unity of science. 4. Comment on general system theory. , 1951, Human biology.

[79]  D. Garvin Competing on the Eight Dimensions of Quality , 1987 .

[80]  William H. Woodall,et al.  Control Charts Based on Attribute Data: Bibliography and Review , 1997 .

[81]  Israel Spiegler,et al.  Information as inventory: A new conceptual view , 1991, Inf. Manag..

[82]  Dale S. Rogers,et al.  A Meta‐Analysis of Logistics Customer Service , 2013 .

[83]  W. Edwards Deming,et al.  Out of the Crisis , 1982 .

[84]  Stephen E. Arnold,et al.  Information manufacturing: the road to database quality , 1992 .

[85]  Brent D. Williams,et al.  An inventory of theory in logistics and SCM research , 2010 .

[86]  Richard Y. Wang,et al.  Data quality assessment , 2002, CACM.

[87]  David J. Ketchen,et al.  The effects of innovation–cost strategy, knowledge, and action in the supply chain on firm performance , 2009 .

[88]  Anders Haug,et al.  A classification model of ERP system data quality , 2009, Ind. Manag. Data Syst..

[89]  Richard Y. Wang,et al.  Anchoring data quality dimensions in ontological foundations , 1996, CACM.

[90]  Carlo Batini,et al.  Data Quality: Concepts, Methodologies and Techniques , 2006, Data-Centric Systems and Applications.

[91]  A. Parasuraman,et al.  Delivering quality service : balancing customer perceptions and expectations , 1990 .

[92]  Peter Rob,et al.  Database systems : design, implementation, and management , 2000 .