On the use of simulation as a Big Data semantic validator for supply chain management

Abstract Simulation stands out as an appropriate method for the Supply Chain Management (SCM) field. Nevertheless, to produce accurate simulations of Supply Chains (SCs), several business processes must be considered. Thus, when using real data in these simulation models, Big Data concepts and technologies become necessary, as the involved data sources generate data at increasing volume, velocity and variety, in what is known as a Big Data context. While developing such solution, several data issues were found, with simulation proving to be more efficient than traditional data profiling techniques in identifying them. Thus, this paper proposes the use of simulation as a semantic validator of the data, proposed a classification for such issues and quantified their impact in the volume of data used in the final achieved solution. This paper concluded that, while SC simulations using Big Data concepts and technologies are within the grasp of organizations, their data models still require considerable improvements, in order to produce perfect mimics of their SCs. In fact, it was also found that simulation can help in identifying and bypassing some of these issues.

[1]  Hans-Georg Kemper,et al.  Application-Pull and Technology-Push as Driving Forces for the Fourth Industrial Revolution , 2014 .

[2]  Markus Rabe,et al.  A Reinforcement Learning approach for a Decision Support System for logistics networks , 2015, 2015 Winter Simulation Conference (WSC).

[3]  Jorge Bernardino,et al.  A Survey on Data Quality: Classifying Poor Data , 2015, 2015 IEEE 21st Pacific Rim International Symposium on Dependable Computing (PRDC).

[4]  Maribel Yasmina Santos,et al.  Evaluating Several Design Patterns and Trends in Big Data Warehousing Systems , 2018, CAiSE.

[5]  David Simchi-Levi,et al.  Identifying Risks and Mitigating Disruptions in the Automotive Supply Chain , 2015, Interfaces.

[6]  Luís M. S. Dias,et al.  Discrete simulation software ranking — A top list of the worldwide most popular and used tools , 2016, 2016 Winter Simulation Conference (WSC).

[7]  Soumendra Mohanty,et al.  Big Data Imperatives: Enterprise Big Data Warehouse, BI Implementations and Analytics , 2013 .

[8]  Chetan Gupta,et al.  Fair, effective, efficient and differentiated scheduling in an enterprise data warehouse , 2009, EDBT '09.

[9]  Philip M. Kaminsky,et al.  Designing and managing the supply chain : concepts, strategies, and case studies , 2007 .

[10]  N. P. Kavya,et al.  Trend Analysis of E-Commerce Data using Hadoop Ecosystem , 2016 .

[11]  Jörn-Henrik Thun,et al.  An empirical analysis of supply chain risk management in the German automotive industry , 2011 .

[12]  Maribel Yasmina Santos,et al.  The SusCity Big Data Warehousing Approach for Smart Cities , 2017, IDEAS.

[13]  Luís M. S. Dias,et al.  Simulation model generation for warehouse management: case study to test different storage strategies , 2018 .

[14]  L. Monostori,et al.  Generic data structure and validation methodology for simulation of manufacturing systems , 2016, Int. J. Comput. Integr. Manuf..

[15]  Ozgur Koray Sahingoz,et al.  Using Agent Based Modeling and Simulation for Data Mining , 2012, ICONIP.

[16]  Pedro M. Domingos A few useful things to know about machine learning , 2012, Commun. ACM.

[17]  Sunil Tiwari,et al.  Big data analytics in supply chain management between 2010 and 2016: Insights to industries , 2018, Comput. Ind. Eng..

[18]  Luís M. S. Dias,et al.  Comparison of SIMIO and ARENA simulation tools , 2014 .

[19]  Zheng Shao,et al.  Hive - a petabyte scale data warehouse using Hadoop , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[20]  Maribel Yasmina Santos,et al.  Setting an Industry 4.0 Research and Development Agenda for Simulation – a Literature Review , 2018, International Journal of Simulation Modelling.

[21]  Scott J. Mason,et al.  Integrated cost optimization in a two-stage, automotive supply chain , 2016, Comput. Oper. Res..

[22]  J. Alberto Espinosa,et al.  Big Data: Issues and Challenges Moving Forward , 2013, 2013 46th Hawaii International Conference on System Sciences.

[23]  Paul Zikopoulos,et al.  Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data , 2011 .

[24]  Peter Nyhuis,et al.  Simulation based comparison of safety-stock calculation methods , 2012 .

[25]  Benoit Gaudou,et al.  To Calibrate & Validate an Agent-based Simulation Model - An Application of the Combination Framework of BI Solution & Multi-agent Platform , 2014, ICAART.

[26]  Zheng Shao,et al.  Data warehousing and analytics infrastructure at facebook , 2010, SIGMOD Conference.

[27]  Sean D Dessureault,et al.  Simulation-based decision support system for sustainable coalmining operations , 2012 .

[28]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[29]  Anders Skoogh,et al.  Data quality problems in discrete event simulation of manufacturing operations , 2018, Simul..

[30]  Jan Fabian Ehmke,et al.  Interactive analysis of discrete-event logistics systems with support of a data warehouse , 2011, Comput. Ind..

[31]  Raymond Gardiner Goss,et al.  Heading towards big data building a better data warehouse for more data, more speed, and more users , 2013, ASMC 2013 SEMI Advanced Semiconductor Manufacturing Conference.

[32]  Robert G. Sargent,et al.  Validation and verification of simulation models , 1999, Proceedings of the 2004 Winter Simulation Conference, 2004..

[33]  Arthur J. Koehler,et al.  A Bayesian simulation approach for supply chain synchronization , 2016, 2017 Winter Simulation Conference (WSC).

[34]  Arpan Kumar Kar,et al.  Big Data Analytics: A Review on Theoretical Contributions and Tools Used in Literature , 2017, Global Journal of Flexible Systems Management.

[35]  Benoit Gaudou,et al.  An implementation of framework of business intelligence for agent-based simulation , 2013, SoICT '13.

[36]  Benoit Gaudou,et al.  CFBM - A Framework for Data Driven Approach in Agent-Based Modeling and Simulation , 2016, ICTCC.

[37]  Eleonora Bottani,et al.  Reengineering, Simulation and Data Analysis of an RFID System , 2008, J. Theor. Appl. Electron. Commer. Res..

[38]  Tillal Eldabi,et al.  Simulation in manufacturing and business: A review , 2010, Eur. J. Oper. Res..

[39]  Ray Y. Zhong,et al.  Big Data for supply chain management in the service and manufacturing sectors: Challenges, opportunities, and future perspectives , 2016, Comput. Ind. Eng..

[40]  Maribel Yasmina Santos,et al.  A Big Data system supporting Bosch Braga Industry 4.0 strategy , 2017, Int. J. Inf. Manag..

[41]  Luís M. S. Dias,et al.  Data Requirements Elicitation in Big Data Warehousing , 2018, EMCIS.

[42]  Maribel Yasmina Santos,et al.  Efficient Big Data Modelling and Organization for Hadoop Hive-Based Data Warehouses , 2017, EMCIS.

[43]  Maribel Yasmina Santos,et al.  Evaluating partitioning and bucketing strategies for Hive-based Big Data Warehousing systems , 2019, Journal of Big Data.

[44]  K. D. Joshi,et al.  Data Cleansing Decisions: Insights from Discrete-Event Simulations of Firm Resources and Data Quality , 2012, J. Organ. Comput. Electron. Commer..

[45]  Maribel Yasmina Santos,et al.  Fast Online Analytical Processing for Big Data Warehousing , 2018, 2018 International Conference on Intelligent Systems (IS).

[46]  Samir Dani,et al.  Supply Chain Risk Management: Present and Future Scope , 2012 .

[47]  Samuel Madden,et al.  From Databases to Big Data , 2012, IEEE Internet Comput..