Towards an operational database for real-time environmental monitoring and early warning systems

Abstract Real-time environmental monitoring, early warning and decision support systems (EMEWD) require advanced management of operational data, i.e. recent sensor data needed for the assessment of the current situation. In this paper we evaluate the suitability of four data models and corresponding database technologies – MongoDB document database, PostgreSQL relational database, Redis dictionary data server and InfluxDB time series database – to serve as an operational database for EMEWD systems. For each of the evaluated databases, we design alternative data models to represent time series data, and experimentally evaluate each of them. We also perform comparative performance evaluation of all databases, using the best model in each case. We have designed performance tests which reflect realistic conditions, using mixed workloads (simultaneous read and write operations) and queries typical for a smart levee monitoring and flood decision support system. Overall the results of the experiments allow us to answer interesting questions, such as: (1) how best to implement time series in a given data model? (2) What are the reasonable operational database volume limits? (3) What are the performance limits for different types of databases?

[1]  Betul Karakus,et al.  Architecture and Implementation of a Scalable Sensor Data Storage and Analysis System Using Cloud Computing and Big Data Technologies , 2015, J. Sensors.

[2]  Mario Di Francesco,et al.  Cloud Databases for Internet-of-Things Data , 2014, 2014 IEEE International Conference on Internet of Things(iThings), and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom).

[3]  Maneesha V. Ramesh,et al.  Real-Time Wireless Sensor Network for Landslide Detection , 2009, 2009 Third International Conference on Sensor Technologies and Applications.

[4]  Heiko Koziolek,et al.  Scalability and Robustness of Time-Series Databases for Cloud-Native Monitoring of Industrial Processes , 2014, 2014 IEEE 7th International Conference on Cloud Computing.

[5]  Dieter Kranzlmüller,et al.  Towards a General Definition of Urgent Computing , 2015, ICCS.

[6]  Robert J. Meijer,et al.  Sensor Data Storage Performance: SQL or NoSQL, Physical or Virtual , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[7]  Bartosz Balis,et al.  Execution Management and Efficient Resource Provisioning for Flood Decision Support , 2015, ICCS.

[8]  Robert Brzoza-Woch,et al.  Embedded systems in the application of fog computing — Levee monitoring use case , 2016, 2016 11th IEEE Symposium on Industrial Embedded Systems (SIES).

[9]  André Dietrich,et al.  ROS meets Cassandra : Data Management in Smart Environments with NoSQL , 2014 .

[10]  Keeley Crockett,et al.  Database systems: design, implementation & management - international edition , 2008 .

[11]  Bartosz Balis,et al.  Holistic approach to management of IT infrastructure for environmental monitoring and decision support systems with urgent computing capabilities , 2018, Future Gener. Comput. Syst..

[12]  Bartosz Balis,et al.  A Development and Execution Environment for Early Warning Systems for Natural Disasters , 2013, 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing.

[13]  Kirk Martinez,et al.  Environmental Sensor Networks: A revolution in the earth system science? , 2006 .