The user query based learning system for lifetime prediction of metallic components

Real-World Data Mining Applications generally do not end up with the creation of the models. The use of the model is the final purpose especially in prediction tasks. The problem arises when the model is built based on much more information than that the user can provide in using the model. As a result, the performance of model reduces drastically due to many missing attributes values. This paper develops a new learning system framework, called as User Query Based Learning System (UQBLS), for building data mining models best suitable for users use. We demonstrate its deployment in a real-world application of the lifetime prediction of metallic components in buildings

[1]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[2]  Basilio Sierra,et al.  K Nearest Neighbor Edition to Guide Classification Tree Learning: Motivation and Experimental Results , 2006, Selected Papers from AusDM.

[3]  David Heckerman,et al.  Bayesian Networks for Knowledge Discovery , 1996, Advances in Knowledge Discovery and Data Mining.

[4]  Huan Liu,et al.  Feature Selection for High-Dimensional Data: A Fast Correlation-Based Filter Solution , 2003, ICML.

[5]  Daphne Koller,et al.  Toward Optimal Feature Selection , 1996, ICML.

[6]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[7]  Osmar R. Zaïane,et al.  Introduction to the special issue on successful real-world data mining applications , 2006, SKDD.

[8]  Usama M. Fayyad,et al.  On the Handling of Continuous-Valued Attributes in Decision Tree Generation , 1992, Machine Learning.

[9]  Claire Cardie,et al.  Using Decision Trees to Improve Case-Based Learning , 1993, ICML.

[10]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.

[11]  Donald E. Brown,et al.  Data mining corrosion from Eddy current non-destructive tests , 2002 .

[12]  M. Kamrunnahar,et al.  Data Mining of Experimental Corrosion Data Using Neural Network , 2005 .

[13]  Padhraic Smyth,et al.  Knowledge Discovery and Data Mining: Towards a Unifying Framework , 1996, KDD.

[14]  Marco Tomassini,et al.  a Survey of Genetic Algorithms , 1995 .

[15]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[16]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[17]  Zhong-Xian Chi,et al.  Application of rough set theory and artificial neural network for load forecasting , 2002, Proceedings. International Conference on Machine Learning and Cybernetics.

[18]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[19]  Gregory Piatetsky-Shapiro,et al.  Knowledge Discovery in Real Databases: A Report on the IJCAI-89 Workshop , 1991, AI Mag..

[20]  Phillip Ein-Dor,et al.  Attributes of the performance of central processing units: a relative performance prediction model , 1987, CACM.

[21]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[22]  Huan Liu,et al.  Searching for Interacting Features , 2007, IJCAI.

[23]  W. Kessler,et al.  Improved prediction of the corrosion behaviour of car body steel using a Kohonen self organising map , 1994 .

[24]  Lukasz A. Kurgan,et al.  A survey of Knowledge Discovery and Data Mining process models , 2006, The Knowledge Engineering Review.

[25]  Jozef Zurada,et al.  Next Generation of Data-Mining Applications , 2005 .

[26]  S. S. Iyengar,et al.  An Evaluation of Filter and Wrapper Methods for Feature Selection in Categorical Clustering , 2005, IDA.

[27]  Hani G. Melhem,et al.  WRAPPER METHODS FOR INDUCTIVE LEARNING: EXAMPLE APPLICATION TO BRIDGE DECKS , 2003 .

[28]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[29]  Robert E. Schapire,et al.  The Boosting Approach to Machine Learning An Overview , 2003 .

[30]  F. Fleuret Fast Binary Feature Selection with Conditional Mutual Information , 2004, J. Mach. Learn. Res..

[31]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[32]  Zdzislaw Pawlak,et al.  Rough sets and intelligent data analysis , 2002, Inf. Sci..

[33]  Ethem Alpaydin,et al.  Introduction to machine learning , 2004, Adaptive computation and machine learning.

[34]  Marko Robnik-Sikonja,et al.  Theoretical and Empirical Analysis of ReliefF and RReliefF , 2003, Machine Learning.

[35]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[36]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[37]  David W. Aha,et al.  Instance-Based Learning Algorithms , 1991, Machine Learning.

[38]  J. Ross Quinlan,et al.  Combining Instance-Based and Model-Based Learning , 1993, ICML.

[39]  Marcin Szczuka,et al.  Rough Sets in KDD , 2005 .

[40]  Yoav Freund,et al.  A Short Introduction to Boosting , 1999 .

[41]  Michael J. A. Berry,et al.  Mastering Data Mining: The Art and Science of Customer Relationship Management , 1999 .

[42]  Richi Nayak,et al.  Data Mining For Lifetime Prediction of Metallic Components , 2006, AusDM.

[43]  S. Poyhonen Support vector machine based classification in condition monitoring of induction motors , 2004 .

[44]  Akira Mita,et al.  Damage diagnosis of a building structure using support vector machine and modal frequency patterns , 2003, SPIE Smart Structures and Materials + Nondestructive Evaluation and Health Monitoring.

[45]  Xiuju Fu,et al.  Extracting the knowledge embedded in support vector machines , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[46]  Masayuki Numao,et al.  Ordered Estimation of Missing Values , 1999, PAKDD.

[47]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[48]  Mark A. Hall,et al.  Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning , 1999, ICML.

[49]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[50]  Renpu Li,et al.  Mining classification rules using rough sets and neural networks , 2004, Eur. J. Oper. Res..

[51]  Robert E. Schapire,et al.  A Brief Introduction to Boosting , 1999, IJCAI.

[52]  H. Furuta,et al.  Neural network analysis of structural damage due to corrosion , 1995, Proceedings of 3rd International Symposium on Uncertainty Modeling and Analysis and Annual Conference of the North American Fuzzy Information Processing Society.

[53]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[54]  Mary Lou Maher,et al.  Case-Based Reasoning , 1997 .

[55]  Kezhi Mao,et al.  Feature Selection Algorithm for Data with Both Nominal and Continuous Features , 2005, PAKDD.

[56]  Ishwar K. Sethi,et al.  Data mining: an introducation , 2001 .

[57]  J. Ross Quinlan,et al.  Unknown Attribute Values in Induction , 1989, ML.

[58]  Hani G. Melhem,et al.  PREDICTION OF REMAINING SERVICE LIFE OF BRIDGE DECKS USING MACHINE LEARNING , 2003 .

[59]  Sou-Sen Leu,et al.  Data mining for tunnel support stability: neural network approach , 2001 .

[60]  George Morcous,et al.  Case-Based Reasoning System for Modeling Infrastructure Deterioration , 2002 .

[61]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[62]  Shian-Shyong Tseng,et al.  A two-phase feature selection method using both filter and wrapper , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[63]  Jerzy W. Grzymala-Busse,et al.  A Comparison of Several Approaches to Missing Attribute Values in Data Mining , 2000, Rough Sets and Current Trends in Computing.

[64]  Josef Kittler,et al.  Pattern recognition : a statistical approach , 1982 .

[65]  J. R. Quinlan Learning With Continuous Classes , 1992 .

[66]  Igor Kononenko,et al.  Estimating Attributes: Analysis and Extensions of RELIEF , 1994, ECML.

[67]  Max Bramer,et al.  Techniques for Dealing with Missing Values in Classification , 1997, IDA.

[68]  Lars Niklasson,et al.  Accuracy vs. comprehensibility in data mining models , 2004 .

[69]  Zhi-Hua Zhou Comprehensibility of Data Mining Algorithms , 2005 .

[70]  Alexander O. Skomorokhov A knowledge discovery method: APL implementation and application , 2000, APL '00.

[71]  Andrés Gómez de Silva Garza,et al.  Case-Based Reasoning in Design , 1995, IEEE Expert.

[72]  George Morcous,et al.  Modeling Bridge Deterioration Using Case-Based Reasoning , 2002 .

[73]  Xiaoming Xu,et al.  A Wrapper for Feature Selection Based on Mutual Information , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[74]  Andrzej Skowron,et al.  A rough set approach to knowledge discovery , 2002, International Journal of Intelligent Systems.

[75]  Thomas G. Dietterich,et al.  Learning Boolean Concepts in the Presence of Many Irrelevant Features , 1994, Artif. Intell..

[76]  Zijian Zheng,et al.  Classifying Unseen Cases with Many Missing Values , 1999, PAKDD.