Mining association rules from time series to explain failures in a hot-dip galvanizing steel line

This paper presents an experience based on the use of association rules from multiple time series captured from industrial processes. The main goal is to seek useful knowledge for explaining failures in these processes. An overall method is developed to obtain association rules that represent the repeated relationships between pre-defined episodes in multiple time series, using a time window and a time lag. First, the process involves working in an iterative and interactive manner with several pre-processing and segmentation algorithms for each kind of time series in order to obtain significant events. In the next step, a search is made for sequences of events called episodes that are repeated among the various time series according to a pre-set consequent, a pre-established time window and a time lag. Extraction is then made of the association rules for those episodes that appear many times and have a high rate of hits. Finally, a case study is described regarding the application of this methodology to a historical database of 150 variables from an industrial process for galvanizing steel coils.

[1]  Shankar Chakraborty,et al.  Feature-based recognition of control chart patterns , 2006, Comput. Ind. Eng..

[2]  Ada Wai-Chee Fu,et al.  Discovering Temporal Patterns for Interval-Based Events , 2000, DaWaK.

[3]  Eamonn J. Keogh,et al.  Segmenting Time Series: A Survey and Novel Approach , 2002 .

[4]  Anne M. Denton,et al.  Establishing relationships among patterns in stock market data , 2009, Data Knowl. Eng..

[5]  J. Madejski,et al.  Methodology of the mechanical properties prediction for the metallurgical products from the engineering steels using the Artificial Intelligence methods , 2005 .

[6]  Fabian Mörchen,et al.  Discovering Interpretable Muscle Activation Patterns with the Temporal Data Mining Method , 2004, PKDD.

[7]  Sushil Jajodia,et al.  Testing complex temporal relationships involving multiple granularities and its application to data mining (extended abstract) , 1996, PODS.

[8]  James Nga-Kwok Liu,et al.  Inter-transactional association rules for multi-dimensional contexts for prediction and their application to studying meteorological data , 2001, Data Knowl. Eng..

[9]  Tak-Chung Fu,et al.  A review on time series data mining , 2011, Eng. Appl. Artif. Intell..

[10]  A. Pernía,et al.  Reducción de problemas de adherencia en procesos de galvanizado mediante técnicas de minería de datos , 2007 .

[11]  Ana González-Marcos,et al.  Steel annealing furnace robust neural network model , 2005 .

[12]  Pradipta Kishore Dash,et al.  Mining for similarities in time series data using wavelet-based feature vectors and neural networks , 2007, Eng. Appl. Artif. Intell..

[13]  Ana González-Marcos,et al.  Comparison of models created for the prediction of the mechanical properties of galvanized steel coils , 2010, Journal of Intelligent Manufacturing.

[14]  Susana Ferreiro,et al.  Data mining for quality control: Burr detection in the drilling process , 2011, Comput. Ind. Eng..

[15]  Hong-fei Teng,et al.  Human-computer cooperative layout design method and its application , 2008, Comput. Ind. Eng..

[16]  Francisco Javier Martinez-de-Pison,et al.  Optimising annealing process on hot dip galvanising line based on robust predictive models adjusted with genetic algorithms , 2011 .

[17]  Susan M. Bridges,et al.  Mining fuzzy association rules and fuzzy frequency episodes for intrusion detection , 2000, Int. J. Intell. Syst..

[18]  A. Pernía,et al.  Optimum model for predicting temperature settings on hot dip galvanising line , 2010 .

[19]  So Young Sohn,et al.  Sequential association rules for forecasting failure patterns of aircrafts in Korean airforce , 2009, Expert Syst. Appl..

[20]  Anthony K. H. Tung,et al.  Breaking the barrier of transactions: mining inter-transaction association rules , 1999, KDD '99.

[21]  Francisco J. García-Peñalvo,et al.  An association rule mining method for estimating the impact of project management policies on software quality, development time and effort , 2008, Expert Syst. Appl..

[22]  Dong Ze An Efficient Algorithm for Mining Inter-transaction Association Rules in Multiple Time Series , 2004 .

[23]  Jitender S. Deogun,et al.  Discovering Sequential Association Rules with Constraints and Time Lags in Multiple Sequences , 2002, ISMIS.

[24]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[25]  Jirachai Buddhakulsomsiri,et al.  Sequential pattern mining algorithm for automotive warranty data , 2009, Comput. Ind. Eng..

[26]  Chia-Hui Chang,et al.  Efficient mining of frequent episodes from complex sequences , 2008, Inf. Syst..

[27]  Janjao Mongkolnavin,et al.  Marking the Close analysis in Thai Bond Market Surveillance using association rules , 2009, Expert Syst. Appl..

[28]  Yo-Ping Huang,et al.  Efficient mining of salinity and temperature association rules from ARGO data , 2008, Expert Syst. Appl..

[29]  Riccardo Bellazzi,et al.  Temporal data mining for the quality assessment of hemodialysis services , 2005, Artif. Intell. Medicine.

[30]  F. J. Martinez-de-Pison,et al.  Improvement and optimisation of hot dip galvanising line using neural networks and genetic algorithms , 2006 .

[31]  Chung-Chian Hsu,et al.  Pattern recognition in time series database: A case study on financial database , 2007, Expert Syst. Appl..

[32]  Theophano Mitsa,et al.  Temporal Data Mining , 2010 .

[33]  S. Sitharama Iyengar,et al.  A focused issue on data mining and knowledge discovery in industrial engineering , 2002 .

[34]  A. Pernía,et al.  Overall model of the dynamic behaviour of the steel strip in an annealing heating furnace on a hot-dip galvanizing line , 2010 .

[35]  C. S. Wallace,et al.  Minimum Message Length Segmentation , 1998, PAKDD.

[36]  Kien A. Hua,et al.  Knowledge Discovery from Series of Interval Events , 2000, Journal of Intelligent Information Systems.

[37]  P. S. Sastry,et al.  A survey of temporal data mining , 2006 .

[38]  Fabian Mörchen,et al.  Efficient mining of understandable patterns from multivariate interval time series , 2007, Data Mining and Knowledge Discovery.

[39]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[40]  Fei Wu,et al.  Knowledge discovery in time-series databases , 2001 .

[41]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[42]  Geert Wets,et al.  Simple association rules (SAR) and the SAR-based rule discovery , 2002 .

[43]  Bin Wang,et al.  Parameter optimization in complex industrial process control based on improved fuzzy-GA , 2003, Proceedings of the 2003 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.03EX693).

[44]  Susan M. Bridges,et al.  Mining fuzzy association rules and fuzzy frequency episodes for intrusion detection , 2000 .

[45]  Ming-Yang Su,et al.  Discovery and prevention of attack episodes by frequent episodes mining and finite state machines , 2010, J. Netw. Comput. Appl..

[46]  Henry C. W. Lau,et al.  Development of an intelligent quality management system using fuzzy association rules , 2009, Expert Syst. Appl..

[47]  Lars Schmidt-Thieme,et al.  Algorithmic Features of Eclat , 2004, FIMI.

[48]  Inci Batmaz,et al.  A review of data mining applications for quality improvement in manufacturing industry , 2011, Expert Syst. Appl..

[49]  Chai Tianyou Research and Application on Temperature Control of Industry Heating Furnace , 2010 .

[50]  Simon Parsons,et al.  Principles of Data Mining by David J. Hand, Heikki Mannila and Padhraic Smyth, MIT Press, 546 pp., £34.50, ISBN 0-262-08290-X , 2004, The Knowledge Engineering Review.

[51]  Boris Cule,et al.  Mining Association Rules in Long Sequences , 2010, PAKDD.

[52]  Alexandre Dolgui,et al.  A MIP approach for balancing transfer line with complex industrial constraints , 2010, Comput. Ind. Eng..

[53]  Manuel Castejón Limas,et al.  A Multi-agent Data Mining System for Defect Forecasting in a Decentralized Manufacturing Environment , 2010, CISIS.

[54]  Anthony K. H. Tung,et al.  Efficient Mining of , 2003 .

[55]  Sushil Jajodia,et al.  Mining Temporal Relationships with Multiple Granularities in Time Sequences , 1998, IEEE Data Eng. Bull..

[56]  Mohammed J. Zaki Scalable Algorithms for Association Mining , 2000, IEEE Trans. Knowl. Data Eng..

[57]  Sherri K. Harms,et al.  Drought Monitoring Using Data Mining Techniques: A Case Study for Nebraska, USA , 2004 .

[58]  John F. Roddick,et al.  ARMADA - An algorithm for discovering richer relative temporal association rules from interval-based data , 2007, Data Knowl. Eng..

[59]  Anthony K. H. Tung,et al.  Efficient Mining of Intertransaction Association Rules , 2003, IEEE Trans. Knowl. Data Eng..

[60]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[61]  A. Akhmetova Discovery of Frequent Episodes in Event Sequences , 2006 .

[62]  Abraham Kandel,et al.  Data Mining in Time Series Database , 2004 .

[63]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[64]  Shankar Chakraborty,et al.  Recognition of control chart patterns using improved selection of features , 2009, Comput. Ind. Eng..

[65]  Morteza Assadi,et al.  Fuzzy expert systems and challenge of new product pricing , 2009, Comput. Ind. Eng..

[66]  Dimitrios Gunopulos,et al.  Finding Similar Time Series , 1997, PKDD.

[67]  Evimaria Terzi,et al.  Efficient Algorithms for Sequence Segmentation , 2006, SDM.

[68]  Suh-Yin Lee,et al.  An efficient algorithm for mining time interval-based patterns in large database , 2010, CIKM.

[69]  Heikki Mannila,et al.  Rule Discovery from Time Series , 1998, KDD.

[70]  R. E. Abdel-Aal,et al.  Univariate modeling and forecasting of monthly energy demand time series using abductive and neural networks , 2008, Comput. Ind. Eng..

[71]  Jitender S. Deogun,et al.  Time-series data mining in a geospatial decision support system , 2003 .

[72]  Jitender S. Deogun,et al.  Discovering representative episodal association rules from event sequences using frequent closed episode sets and event constraints , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[73]  Eugene Fink,et al.  Indexing of Compressed Time Series , 2004 .