Multidimensional Sensor Data Analysis in Cyber-Physical System: An Atypical Cube Approach

Cyber-Physical System (CPS) is an integration of distributed sensor networks with computational devices. CPS claims many promising applications, such as traffic observation, battlefield surveillance, and sensor-network-based monitoring. One important topic in CPS research is about the atypical event analysis, that is, retrieving the events from massive sensor data and analyzing them with spatial, temporal, and other multidimensional information. Many traditional methods are not feasible for such analysis since they cannot describe the complex atypical events. In this paper, we propose a novel model of atypical cluster to effectively represent such events and efficiently retrieve them from massive data. The basic cluster is designed to summarize an individual event, and the macrocluster is used to integrate the information from multiple events. To facilitate scalable, flexible, and online analysis, the atypical cube is constructed, and a guided clustering algorithm is proposed to retrieve significant clusters in an efficient manner. We conduct experiments on real sensor datasets with the size of more than 50 GB; the results show that the proposed method can provide more accurate information with only 15% to 20% time cost of the baselines.

[1]  Sangkyum Kim,et al.  Tru-Alarm: Trustworthiness Analysis of Sensor Networks in Cyber-Physical Systems , 2010, 2010 IEEE International Conference on Data Mining.

[2]  Alexander Skabardonis,et al.  FREEWAY PERFORMANCE MEASUREMENT SYSTEM (PeMS): AN OPERATIONAL ANALYSIS TOOL , 2001 .

[3]  Joseph S. Fulda Data Mining and Privacy , 2000 .

[4]  Jiawei Han,et al.  Data Mining: Concepts and Techniques, Second Edition , 2006, The Morgan Kaufmann series in data management systems.

[5]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[6]  Shashi Shekhar,et al.  CubeView: a system for traffic data visualization , 2002, Proceedings. The IEEE 5th International Conference on Intelligent Transportation Systems.

[7]  Jiawei Han,et al.  Object-Based Selective Materialization for Efficient Implementation of Spatial Data Cubes , 2000, IEEE Trans. Knowl. Data Eng..

[8]  Thomas F. La Porta,et al.  IntruMine: Mining Intruders in Untrustworthy Data of Cyber-physical Systems , 2012, SDM.

[9]  Panos Kalnis,et al.  Efficient OLAP Operations in Spatial Data Warehouses , 2001, SSTD.

[10]  Yizhou Sun,et al.  Multidimensional Analysis of Atypical Events in Cyber-Physical Data , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[11]  Yvan Bédard,et al.  SOLAP technology: Merging business intelligence with geospatial technology for interactive spatio-temporal exploration and analysis of data , 2005 .

[12]  Jian Pei,et al.  Data Mining: Concepts and Techniques, 3rd edition , 2006 .

[13]  Yufei Tao,et al.  Range aggregate processing in spatial databases , 2004, IEEE Transactions on Knowledge and Data Engineering.

[14]  Hamid Pirahesh,et al.  Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals , 1996, Data Mining and Knowledge Discovery.

[15]  John F. Roddick,et al.  Geographic Data Mining and Knowledge Discovery , 2001 .

[16]  Jeffrey Considine,et al.  Spatio-temporal aggregation using sketches , 2004, Proceedings. 20th International Conference on Data Engineering.

[17]  Xing Xie,et al.  Retrieving k-Nearest Neighboring Trajectories by a Set of Point Locations , 2011, SSTD.

[18]  Jing Yuan,et al.  On Discovery of Traveling Companions from Streaming Trajectories , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[19]  Alexander Skabardonis,et al.  Freeway Performance Measurement System: Operational Analysis Tool , 2002 .

[20]  Arnold P. Boedihardjo,et al.  AITVS: Advanced Interactive Traffic Visualization System , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[21]  Chris Clifton,et al.  Mobility Data Mining and Privacy , 2012 .

[22]  Panos Kalnis,et al.  Indexing spatio-temporal data warehouses , 2002, Proceedings 18th International Conference on Data Engineering.

[23]  Jiawei Han,et al.  Filtering and Refinement: A Two-Stage Approach for Efficient and Effective Anomaly Detection , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[24]  Wang-Chien Lee,et al.  Using sensorranks for in-network detection of faulty readings in wireless sensor networks , 2007, MobiDE '07.

[25]  Wen-Chih Peng,et al.  CarWeb: A Traffic Data Collection Platform , 2008, The Ninth International Conference on Mobile Data Management (mdm 2008).

[26]  Jiawei Han,et al.  Geographic Data Mining and Knowledge Discovery , 2001 .

[27]  Yu Zheng,et al.  Computing with Spatial Trajectories , 2011, Computing with Spatial Trajectories.

[28]  Jing Dai,et al.  Spatial-Temporal Data Mining in Tra c Incident Detection , 2006 .

[29]  Chao Chen,et al.  An Empirical Assessment of Traffic Operations , 2005 .

[30]  Chao Chen,et al.  The PeMS algorithms for accurate, real-time estimates of g-factors and speeds from single-loop detectors , 2001, ITSC 2001. 2001 IEEE Intelligent Transportation Systems. Proceedings (Cat. No.01TH8585).

[31]  Yvan Bédard,et al.  Toward better support for spatial decision making: Defining the characteristics of spatial on-line analytical processing (SOLAP) , 2001 .

[32]  Hongyan Li,et al.  Effective variation management for pseudo periodical streams , 2007, SIGMOD '07.