Analyzing Massive Machine Maintenance Data in a Computing Cloud

We present CloudView, a novel framework for the storage, processing, and analysis of massive machine maintenance data in a cloud computing environment. The data are collected from a large number of sensors embedded in industrial machines. This paper describes the architecture, design, and implementation of CloudView, and shows how the framework leverages the parallel computing capability of a computing cloud based on a large-scale distributed batch processing infrastructure built from commodity hardware. A case-based reasoning (CBR) approach is adopted for machine fault prediction: past cases of failure from a large number of machines are collected in the cloud, and a case-base is created from this global information. CloudView facilitates both the organization of the sensor data and the creation of the case-base, and the case-base creation jobs are formulated using the MapReduce parallel data processing model. The case-base is updated on the cloud every few hours to include new cases of failure, and these updates are pushed from CloudView to a number of local nodes. At each local node, real-time sensor data from a group of machines in the same facility or plant is continuously matched against the cases in the case-base to predict incipient faults; this local processing takes only a few seconds. Experimental measurements show that fault prediction can be done in real time (on a timescale of seconds) at the local nodes, and that massive machine data analysis for case-base creation and updating can be done on a timescale of minutes in the cloud.
Besides being the first reported use of a cloud architecture for maintenance data storage, processing, and analysis, our approach evaluates several possible cloud-based architectures. These architectures use the parallel computing capabilities of the cloud to make local decisions efficiently with global information, while avoiding the data bottlenecks that can occur when moving maintenance data into and out of the cloud.
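
The split between cloud-side case-base creation and local case matching can be illustrated with a minimal in-memory sketch. It is not the CloudView implementation: the record layout `(machine_id, readings, fault_type)`, the choice of averaging readings per fault type as the reduce step, the Euclidean similarity measure, and the `threshold` parameter are all assumptions made for illustration, since the abstract does not specify the case representation or the matching rule.

```python
from collections import defaultdict
from math import dist


def map_record(record):
    """Map step (assumed form): emit (fault_type, readings) for failure records only."""
    machine_id, readings, fault_type = record
    if fault_type is not None:
        yield fault_type, readings


def reduce_cases(fault_type, readings_list):
    """Reduce step (assumed form): average the readings seen for one fault type."""
    n = len(readings_list)
    centroid = tuple(sum(column) / n for column in zip(*readings_list))
    return fault_type, centroid


def build_case_base(records):
    """Cloud side: run map, group by key, then reduce, all in memory."""
    groups = defaultdict(list)
    for record in records:
        for key, value in map_record(record):
            groups[key].append(value)
    return dict(reduce_cases(key, values) for key, values in groups.items())


def match_case(case_base, readings, threshold):
    """Local node: return the nearest fault type, or None if nothing is close enough."""
    best = min(case_base.items(), key=lambda kv: dist(kv[1], readings), default=None)
    if best is None or dist(best[1], readings) > threshold:
        return None
    return best[0]


# Hypothetical records: two sensor channels per machine, None means no failure.
records = [
    ("m1", (1.0, 2.0), "bearing"),
    ("m2", (3.0, 4.0), "bearing"),
    ("m3", (10.0, 10.0), None),
    ("m4", (9.0, 1.0), "overheat"),
]
case_base = build_case_base(records)
prediction = match_case(case_base, (2.1, 3.0), threshold=1.0)
```

In a real deployment the map and reduce functions would run as distributed jobs over the stored sensor data, and only the resulting case-base, which is far smaller than the raw data, would be pushed to the local nodes.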
