Multidimensional Analysis of Atypical Events in Cyber-Physical Data

A Cyber-Physical System (CPS) integrates physical devices (e.g., sensors, cameras) with cyber (or informational) components to form a situation-integrated analytical system that may respond intelligently to dynamic changes of the real-world situations. CPS claims many promising applications, such as traffic observation, battlefield surveillance and sensor-network based monitoring. One important research topic in CPS is about the atypical event analysis, i.e., retrieving the events from large amount of data and analyzing them with spatial, temporal and other multi-dimensional information. Many traditional approaches are not feasible for such analysis since they use numeric measures and cannot describe the complex atypical events. In this study, we propose a new model of atypical cluster to effectively represent those events and efficiently retrieve them from massive data. The micro-cluster is designed to summarize individual events, and the macro-cluster is used to integrate the information from multiple event. To facilitate scalable, flexible and online analysis, the concept of significant cluster is defined and a guided clustering algorithm is proposed to retrieve significant clusters in an efficient manner. We conduct experiments on real datasets with the size of more than 50 GB, the results show that the proposed method can provide more accurate information with only 15% to 20% time cost of the baselines.

[1]  Panos Kalnis,et al.  Indexing spatio-temporal data warehouses , 2002, Proceedings 18th International Conference on Data Engineering.

[2]  Panos Kalnis,et al.  Efficient OLAP Operations in Spatial Data Warehouses , 2001, SSTD.

[3]  Yufei Tao,et al.  Range aggregate processing in spatial databases , 2004, IEEE Transactions on Knowledge and Data Engineering.

[4]  Shashi Shekhar,et al.  CubeView: a system for traffic data visualization , 2002, Proceedings. The IEEE 5th International Conference on Intelligent Transportation Systems.

[5]  Jiawei Han,et al.  Object-Based Selective Materialization for Efficient Implementation of Spatial Data Cubes , 2000, IEEE Trans. Knowl. Data Eng..

[6]  Jeffrey Considine,et al.  Spatio-temporal aggregation using sketches , 2004, Proceedings. 20th International Conference on Data Engineering.

[7]  Sangkyum Kim,et al.  Tru-Alarm: Trustworthiness Analysis of Sensor Networks in Cyber-Physical Systems , 2010, 2010 IEEE International Conference on Data Mining.

[8]  Chris Clifton,et al.  Mobility Data Mining and Privacy , 2012 .

[9]  Wen-Chih Peng,et al.  CarWeb: A Traffic Data Collection Platform , 2008, The Ninth International Conference on Mobile Data Management (mdm 2008).

[10]  Jing Dai,et al.  Spatial-Temporal Data Mining in Tra c Incident Detection , 2006 .

[11]  Alexander Skabardonis,et al.  Freeway Performance Measurement System: Operational Analysis Tool , 2002 .

[12]  Arnold P. Boedihardjo,et al.  AITVS: Advanced Interactive Traffic Visualization System , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[13]  Jiawei Han,et al.  Filtering and Refinement: A Two-Stage Approach for Efficient and Effective Anomaly Detection , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[14]  Chi-Yin Chow,et al.  Aggregate Location Monitoring for Wireless Sensor Networks: A Histogram-Based Approach , 2009, 2009 Tenth International Conference on Mobile Data Management: Systems, Services and Middleware.

[15]  Joseph S. Fulda Data Mining and Privacy , 2000 .

[16]  Walid G. Aref,et al.  Detection and Tracking of Discrete Phenomena in Sensor-Network Databases , 2005, SSDBM.

[17]  Hamid Pirahesh,et al.  Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals , 1996, Data Mining and Knowledge Discovery.

[18]  Alexander Skabardonis,et al.  FREEWAY PERFORMANCE MEASUREMENT SYSTEM (PeMS): AN OPERATIONAL ANALYSIS TOOL , 2001 .

[19]  Hongyan Li,et al.  Effective variation management for pseudo periodical streams , 2007, SIGMOD '07.