Data modeling versus simulation modeling in the big data era: case study of a greenhouse control system

Recently, big data has received greater attention in diverse research fields, including medicine, science, engineering, management, defense, politics, and others. Such research uses big data to predict target systems, thereby constructing a model of the system in two ways: data modeling and simulation modeling. Data modeling is a method in which a model represents correlation relationships between one set of data and the other set of data. On the other hand, physics-based simulation modeling (or simply simulation modeling) is a more classical, but more powerful, method in which a model represents causal relationships between a set of controlled inputs and corresponding outputs. This paper (i) clarifies the difference between the two modeling approaches, (ii) explains their advantages and limitations and compares each characteristic, and (iii) presents a complementary cooperation modeling approach. Then, we apply the proposed modeling to develop a greenhouse control system in the real world. Finally, we expect that this modeling approach will be an alternative modeling approach in the big data era.

[1]  Jussi Tolvi,et al.  Modeling Financial Time Series with S‐Plus , 2003 .

[2]  Fei Wang,et al.  Combining Knowledge and Data Driven Insights for Identifying Risk Factors using Electronic Health Records , 2012, AMIA.

[3]  Rolf Rannacher,et al.  Model Based Parameter Estimation , 2013 .

[4]  José Boaventura Cunha,et al.  GREENHOUSE CLIMATE MODELS: AN OVERVIEW , 2003 .

[5]  António E. Ruano,et al.  Real-Time Parameter Estimation of Dynamic Temperature Models for Greenhouse Environmental Control , 1997 .

[6]  Vijay Manikandan Janakiraman,et al.  Machine Learning for Identification and Optimal Control of Advanced Automotive Engines. , 2013 .

[7]  Ming C. Lin,et al.  Transforming GIS Data into Functional Road Models for Large-Scale Traffic Simulation , 2012, IEEE Transactions on Visualization and Computer Graphics.

[8]  N. B. Anuar,et al.  The rise of "big data" on cloud computing: Review and open research issues , 2015, Inf. Syst..

[9]  Rolf Rannacher,et al.  Model Based Parameter Estimation: Theory and Applications , 2015 .

[10]  J. Aldrich Correlations Genuine and Spurious in Pearson and Yule , 1995 .

[11]  Anthony J. Hayter,et al.  Probability and statistics for engineers and scientists , 1996 .

[12]  Huapu Lu,et al.  Big Data-Driven Based Real-Time Traffic Flow State Identification and Prediction , 2015 .

[13]  P. Menzies,et al.  The Oxford Handbook of Causation , 2009 .

[14]  Xiaolin Hu,et al.  DEVS-FIRE: Towards an Integrated Simulation Environment for Surface Wildfire Spread and Containment , 2008, Simul..

[15]  Kaan Ozbay,et al.  Using Big Data and Efficient Methods to Capture Stochasticity for Calibration of Macroscopic Traffic Simulation Models , 2015 .

[16]  Xiaolin Hu,et al.  A dynamic data driven application system for wildfire spread simulation , 2009, Proceedings of the 2009 Winter Simulation Conference (WSC).

[17]  Alon Lavie,et al.  Combining Rule-based and Data-driven Techniques for Grammatical Relation Extraction in Spoken Language , 2003, IWPT.

[18]  Su-Youn Hong,et al.  DEVSim++ Toolset for Defense Modeling and Simulation and Interoperation , 2011 .

[19]  Michael Johnston,et al.  Balancing data-driven and rule-based approaches in the context of a multimodal conversational system , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[20]  Tag Gon Kim,et al.  Multi-fidelity modeling & simulation methodology for simulation speed up , 2014, SIGSIM PADS '14.

[21]  Kathryn Graziano The innovator's dilemma: When new technologies cause great firms to fail , 1998 .

[22]  Jibonananda Sanyal,et al.  Simulation and big data challenges in tuning building energy models , 2013, 2013 Workshop on Modeling and Simulation of Cyber-Physical Energy Systems (MSCPES).

[23]  O. Nelles Nonlinear System Identification , 2001 .

[24]  Joshua Ryan New,et al.  Building Simulation Modelers are we big-data ready? , 2014 .

[25]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[26]  Frederica Darema,et al.  Dynamic Data Driven Applications Systems: A New Paradigm for Application Simulations and Measurements , 2004, International Conference on Computational Science.

[27]  N. Bennis,et al.  Greenhouse climate modelling and robust control , 2008 .

[28]  Saso Dzeroski,et al.  Integrating Knowledge-Driven and Data-Driven Approaches to Modeling , 2006, EnviroInfo.

[29]  Fei Liu,et al.  Robust adaptive control for greenhouse climate using neural networks , 2011 .

[30]  Charles Anderson,et al.  The end of theory: The data deluge makes the scientific method obsolete , 2008 .