A Dependable Time Series Analytic Framework for Cyber-Physical Systems of IoT-based Smart Grid

With the emergence of cyber-physical systems (CPS), we are now at the brink of next computing revolution. The Smart Grid (SG) built on top of IoT (Internet of Things) is one of the foundations of this CPS revolution, which involves a large number of smart objects connected by networks. The volume of time series of SG equipment is tremendous and the raw time series are very likely to contain missing values because of undependable network transferring. The problem of storing a tremendous volume of raw time series thereby providing a solid support for precise time series analytics now becomes tricky. In this article, we propose a dependable time series analytics (DTSA) framework for IoT-based SG. Our proposed DTSA framework is capable of providing a dependable data transforming from CPS to the target database with an extraction engine to preliminary refining raw data and further cleansing the data with a correction engine built on top of a sensor-network-regularization-based matrix factorization method. The experimental results reveal that our proposed DTSA framework is capable of effectively increasing the dependability of raw time series transforming between CPS and the target database system through the online lightweight extraction engine and the offline correction engine. Our proposed DTSA framework would be useful for other industrial big data practices.

[1]  Yoshua Bengio,et al.  Algorithms for Hyper-Parameter Optimization , 2011, NIPS.

[2]  Abraham Silberschatz,et al.  HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads , 2009, Proc. VLDB Endow..

[3]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[4]  Wu Shu-wen Research on the key technologies of IOT applied on Smart Grid , 2011, 2011 International Conference on Electronics, Communications and Control (ICECC).

[5]  Hanghang Tong,et al.  Fast Mining of a Network of Coevolving Time Series , 2015, SDM.

[6]  Ruslan Salakhutdinov,et al.  Bayesian probabilistic matrix factorization using Markov chain Monte Carlo , 2008, ICML '08.

[7]  Paolo Frasconi,et al.  Short-Term Traffic Flow Forecasting: An Experimental Comparison of Time-Series Analysis and Supervised Learning , 2013, IEEE Transactions on Intelligent Transportation Systems.

[8]  Peter J. Haas,et al.  Eagle-eyed elephant: split-oriented indexing in Hadoop , 2013, EDBT '13.

[9]  Chao Liu,et al.  Recommender systems with social regularization , 2011, WSDM '11.

[10]  Meinard Müller,et al.  Dynamic Time Warping , 2008 .

[11]  Al-Sakib Khan Pathan,et al.  Smart Grid Internet of Things , 2017, Journal on spesial topics in mobile networks and applications.

[12]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[13]  Weidong Xiao,et al.  Spatio-temporal queries in HBase , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[14]  Insup Lee,et al.  Cyber-physical systems: The next computing revolution , 2010, Design Automation Conference.

[15]  Christophe Rigotti,et al.  Swap Randomization of Bases of Sequences for Mining Satellite Image Times Series , 2015, ECML/PKDD.

[16]  Meinard Müller,et al.  Information retrieval for music and motion , 2007 .

[17]  François Jammes,et al.  Service-oriented paradigms in industrial automation , 2005, IEEE Transactions on Industrial Informatics.

[18]  Enrico Zio,et al.  Reconstruction of missing data in multidimensional time series by fuzzy similarity , 2015, Appl. Soft Comput..

[19]  Bo Yang,et al.  Column Access-aware In-stream Data Cache with Stream Processing Framework , 2017, J. Signal Process. Syst..

[20]  Elad Hazan,et al.  Online Time Series Prediction with Missing Data , 2015, ICML.

[21]  Miao Yun,et al.  Research on the architecture and key technology of Internet of Things (IoT) applied on smart grid , 2010, 2010 International Conference on Advances in Energy Engineering.

[22]  Gehao Sheng,et al.  Improving Power Grid Monitoring Data Quality: An Efficient Machine Learning Framework for Missing Data Prediction , 2015, 2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems.

[23]  Shie-Jue Lee,et al.  Time series forecasting with missing values , 2015, 2015 1st International Conference on Industrial Networks and Intelligent Systems (INISCom).

[24]  Dorin Carstoiu,et al.  Hbase - non SQL Database, Performances Evaluation , 2010, Int. J. Adv. Comp. Techn..

[25]  Bin Liu,et al.  A Versatile Event-Driven Data Model in HBase Database for Multi-source Data of Power Grid , 2016, 2016 IEEE International Conference on Smart Cloud (SmartCloud).

[26]  Jianwen Xiang,et al.  Performability analysis of RAID10 versus RAID6 , 2013, 2013 43rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN).

[27]  Connie Phong,et al.  Missing Value Estimation for Time Series Microarray Data Using Linear Dynamical Systems Modeling , 2008, 22nd International Conference on Advanced Information Networking and Applications - Workshops (aina workshops 2008).

[28]  Lars Schmidt-Thieme,et al.  Classification of Sparse Time Series via Supervised Matrix Factorization , 2012, AAAI.

[29]  Sheetal Kalra,et al.  A Review on IOT Based Smart Grid , 2016 .

[30]  Panganamala Ramana Kumar,et al.  Cyber–Physical Systems: A Perspective at the Centennial , 2012, Proceedings of the IEEE.

[31]  Scott Shenker,et al.  Spark: Cluster Computing with Working Sets , 2010, HotCloud.

[32]  Lars George,et al.  HBase: The Definitive Guide , 2011 .

[33]  Shaojie Tang,et al.  Time series matrix factorization prediction of internet traffic matrices , 2012, 37th Annual IEEE Conference on Local Computer Networks.

[34]  Juan Li,et al.  An overview of energy efficiency techniques in cluster computing systems , 2013, Cluster Computing.