Sequential pattern mining algorithm for automotive warranty data

This paper presents a sequential pattern mining algorithm that allows product and quality engineers to extract hidden knowledge from a large automotive warranty database. The algorithm uses the elementary set concept and database manipulation techniques to search for patterns or relationships among occurrences of warranty claims over time. These patterns are represented as IF-THEN sequential rules, where the IF portion of the rule includes one or more occurrences of warranty problems at one time and the THEN portion includes warranty problem(s) that occur at a later time. Once sequential patterns are generated, the algorithm uses rule strength parameters to filter out insignificant patterns, so that only important (significant) rules are reported. Significant patterns provide knowledge of one or more product failures that leads to future product fault(s). The effectiveness of the algorithm is illustrated with the warranty data mining application from the automotive industry. A discussion on the sequential patterns generated by the algorithm and their interpretation for the automotive example are also provided.

[1]  Jianyong Wang,et al.  Mining sequential patterns by pattern-growth: the PrefixSpan approach , 2004, IEEE Transactions on Knowledge and Data Engineering.

[2]  R. J. Kuo,et al.  Integration of K-means algorithm and AprioriSome algorithm for fuzzy sequential pattern mining , 2009, Appl. Soft Comput..

[3]  Céline Fiot,et al.  From Crispness to Fuzziness: Three Algorithms for Soft Sequential Pattern Mining , 2007, IEEE Transactions on Fuzzy Systems.

[4]  Gholamreza Nakhaeizadeh,et al.  WAPS, a data mining support environment for the planning of warranty and goodwill costs in the automobile industry , 1999, KDD '99.

[5]  M. Teisseire,et al.  Efficient mining of sequential patterns with time constraints: Reducing the combinations , 2009, Expert Syst. Appl..

[6]  Jin Wang,et al.  An optimization model for concurrent selection of tolerances and suppliers , 2001 .

[7]  Jennifer Widom,et al.  Database Systems: The Complete Book , 2001 .

[8]  Karl D. Majeske,et al.  Evaluating product and process design changes with warranty data , 1997 .

[9]  James F. Cali,et al.  TQM for Purchasing Management , 1992 .

[10]  Yen-Liang Chen,et al.  Constraint-based sequential pattern mining: The consideration of recency and compactness , 2006, Decis. Support Syst..

[11]  Karl D. Majeske,et al.  A non-homogeneous Poisson process predictive model for automobile warranty claims , 2007, Reliab. Eng. Syst. Saf..

[12]  Vidyadhar G. Kulkarni,et al.  Warranty claims modeling , 2008 .

[13]  Jian Pei,et al.  CMAR: accurate and efficient classification based on multiple class-association rules , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[14]  Jirachai Buddhakulsomsiri,et al.  Association rule-generation algorithm for mining automotive warranty data , 2006 .

[15]  I. Sahin,et al.  Quality, warranty, and preventive maintenance , 1998 .

[16]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[17]  Related Topics,et al.  Survival analysis : state of the art , 1992 .

[18]  Yen-Liang Chen,et al.  A novel knowledge discovering model for mining fuzzy multi-level sequential patterns in sequence databases , 2008, Data Knowl. Eng..

[19]  Roberto J. Bayardo,et al.  Mining the most interesting rules , 1999, KDD '99.

[20]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[21]  D. N. P. Murthy,et al.  Product Warranty Handbook , 1995 .

[22]  Humberto Gutiérrez-Pulido,et al.  A Bayesian Approach for the Determination of Warranty Length , 2006 .

[23]  Jerald F. Lawless,et al.  Finite-Horizon Prediction of Recurrent Events, With Application to Forecasts of Warranty Claims , 2007, Technometrics.

[24]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[25]  Gunar E. Liepins,et al.  Data quality control theory and pragmatics , 1991 .

[26]  Kazuyuki Suzuki,et al.  Statistical Analysis of Marginal Count Failure Data , 2001, Lifetime data analysis.

[27]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[28]  Unil Yun,et al.  A new framework for detecting weighted sequential patterns in large sequence databases , 2008, Knowl. Based Syst..

[29]  Sunita Sarawagi,et al.  Mining Generalized Association Rules and Sequential Patterns Using SQL Queries , 1998, KDD.

[30]  Wei Wang,et al.  Benchmarking the effectiveness of sequential pattern mining methods , 2007, Data Knowl. Eng..

[31]  D. Murthy,et al.  New product warranty: A literature review , 2002 .

[32]  Armen Zakarian,et al.  Entropy-based associative classification algorithm for mining manufacturing data , 2008, Int. J. Comput. Integr. Manuf..

[33]  Jiawei Han,et al.  CPAR: Classification based on Predictive Association Rules , 2003, SDM.

[34]  Z. Pawlak Rough set approach to knowledge-based decision support , 1997 .

[35]  Enhong Chen,et al.  Efficient strategies for tough aggregate constraint-based sequential pattern mining , 2008, Inf. Sci..

[36]  Lianhua Wang,et al.  Data Analysis Based on Warranty Database , 2000 .

[37]  Hakan Polatoglu,et al.  Probability distributions of cost, revenue and profit over a warranty cycle , 1998, Eur. J. Oper. Res..

[38]  Kazuyuki Suzuki,et al.  Ch. 21. Statistical analysis of reliability warranty data , 2001 .

[39]  Gregory Piatetsky-Shapiro,et al.  Discovery, Analysis, and Presentation of Strong Rules , 1991, Knowledge Discovery in Databases.

[40]  J. Lawless,et al.  Estimation from truncated lifetime data with supplementary information on covariates and censoring times , 1996 .

[41]  Rakesh Agrawal,et al.  Parallel Mining of Association Rules , 1996, IEEE Trans. Knowl. Data Eng..

[42]  W. Blischke,et al.  Warranty Cost Analysis , 2008 .

[43]  Udo Grimmer,et al.  REVI-MINER, a KDD-environment for deviation detection and analysis of warranty and goodwill cost statements in automotive industry , 2001, KDD '01.

[44]  Min Jung,et al.  Analysis of field data under two-dimensional warranty , 2007, Reliab. Eng. Syst. Saf..

[45]  Jerald F. Lawless,et al.  Statistical Analysis of Product Warranty Data * , 1998 .

[46]  鈴木 和幸,et al.  Nonparametric Estimation of Lifetime Distribution from Warranty Data without Monthly Unit Sales Information , 2001 .

[47]  Simon P. Wilson,et al.  Failure models indexed by two scales , 1998, Advances in Applied Probability.

[48]  Vasant Dhar,et al.  Abstract-Driven Pattern Discovery in Databases , 1992, IEEE Trans. Knowl. Data Eng..

[49]  Wei Wang,et al.  Sequential Pattern Mining in Multi-Databases via Multiple Alignment , 2006, Data Mining and Knowledge Discovery.

[50]  Richard Nock,et al.  Statistical supports for mining sequential patterns and improving the incremental update process on data streams , 2007, Intell. Data Anal..

[51]  Wei-Hua Hao,et al.  Mining strong positive and negative sequential patterns , 2008 .

[52]  John D. Kalbfleisch,et al.  Methods for the analysis and predic tion of warranty claims , 1991 .

[53]  G. D. Herrin,et al.  Assessing mixture-model goodness-of-fit with an application to automobile warranty data , 1995, Annual Reliability and Maintainability Symposium 1995 Proceedings.

[54]  I. Djamaludin,et al.  A consumer incentive warranty policy and servicing strategy for products with uncertain quality , 1995 .

[55]  J. F. Lawless,et al.  Some Issues in the Collection and Analysis of Field Reliability Data , 1992 .

[56]  Ming-Wei Lu,et al.  Automotive reliability prediction based on early field failure warranty data , 1998 .