A Generic and Scalable Pipeline for Large-Scale Analytics of Continuous Aircraft Engine Data

A major application of data analytics for aircraft engine manufacturers is engine health monitoring, which consists in improving availability and operation of engines by leveraging operational data and past events. Traditional tools can no longer handle the increasing volume and velocity of data collected on modern aircraft. We propose a generic and scalable pipeline for large-scale analytics of operational data from a recent type of aircraft engine, oriented towards health monitoring applications. Based on Hadoop and Spark, our approach enables domain experts to scale their algorithms and extract features from tens of thousands of flights stored on a cluster. All computations are performed using the Spark framework, however custom functions and algorithms can be integrated without knowledge of distributed programming. Unsupervised learning algorithms are integrated for clustering and dimensionality reduction of the flight features, in order to allow efficient visualization and interpretation through a dedicated web application. The use case guiding our work is a methodology for engine fleet monitoring with a self-organizing map. Finally, this pipeline is meant to be end-to-end, fully customizable and ready for use in an industrial setting.

[1]  S. Bliesner,et al.  Predictive analytics with aviation big data , 2013, 2013 Integrated Communications, Navigation and Surveillance Conference (ICNS).

[2]  Michel Verleysen,et al.  Aircraft Engine Health Monitoring Using Self-Organizing Maps , 2010, ICDM.

[3]  Teuvo Kohonen,et al.  The self-organizing map , 1990 .

[4]  Jun Jo,et al.  Application of deep neural network and generative adversarial network to industrial maintenance: A case study of induction motor fault detection , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[5]  Juha Vesanto,et al.  SOM-based data visualization methods , 1999, Intell. Data Anal..

[6]  Fouad Badran,et al.  Probabilistic self-organizing map and radial basis function networks , 1998, Neurocomputing.

[7]  Zhiwei Xu,et al.  RCFile: A fast and space-efficient data placement structure in MapReduce-based warehouse systems , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[8]  M. Svensson,et al.  Self-organizing maps for automatic fault detection in a vehicle cooling system , 2008, 2008 4th International IEEE Conference Intelligent Systems.

[9]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[10]  Mustapha Lebbah,et al.  SOM Clustering Using Spark-MapReduce , 2014, 2014 IEEE International Parallel & Distributed Processing Symposium Workshops.

[11]  Mustapha Lebbah,et al.  Mixed Topological Map , 2005, ESANN.

[12]  Musab Talha Akpinar Data mining applications in civil aviation sector : State-of-art review , 2017 .

[13]  Michel Verleysen,et al.  Aircraft Engine Fleet Monitoring Using Self-Organizing Maps and Edit Distance , 2011, WSOM.

[14]  Muthu Dayalan,et al.  MapReduce : Simplified Data Processing on Large Cluster , 2018 .

[15]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[16]  Cameron David Rose,et al.  The Hive Project , 2011 .

[17]  Rajendra Akerkar,et al.  Analytics on Big Aviation Data: Turning Data into Insights , 2014, Int. J. Comput. Sci. Appl..

[18]  S. Vinu Kiran,et al.  Airline Route Profitability Analysis and Optimization Using BIG DATA Analyticson Aviation Data Sets under Heuristic Techniques , 2016 .

[19]  Dinkar Mylaraswamy,et al.  Big Data Infrastructure for Aviation Data Analytics , 2014, 2014 IEEE International Conference on Cloud Computing in Emerging Markets (CCEM).

[20]  Yi Yang,et al.  Civil Aircraft Big Data Platform , 2017, 2017 IEEE 11th International Conference on Semantic Computing (ICSC).

[21]  Jerome Lacaille,et al.  Visual mining and statistics for a turbofan engine fleet , 2011, 2011 Aerospace Conference.

[22]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[23]  Michel Verleysen,et al.  Fault Prediction in Aircraft Engines Using Self-Organizing Maps , 2009, WSOM.

[24]  Christopher M. Bishop,et al.  GTM: The Generative Topographic Mapping , 1998, Neural Computation.

[25]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[26]  Chang-Guen Oh Application of Big Data Systems to Aviation and Aerospace Fields; Pertinent Human Factors Considerations , 2017 .

[27]  Jérôme Lacaille,et al.  Online Normalization Algorithm for Engine Turbofan Monitoring , 2014 .

[28]  Guillaume Bastard,et al.  Engine Health Management in Safran Aircraft Engines , 2016 .