The needs and benefits of applying textual data mining within the product development process

As a result of growing competition in recent years, new trends such as increased product complexity, changing customer requirements and shortening development time have emerged within the product development process (PDP). These trends have added to the already difficult task of predicting and improving quality and reliability, and have increased the number of unexpected events in the PDP. Traditional tools only partially cover these unexpected events, so new tools are being sought to complement them. This paper investigates one such tool, textual data mining, for the purpose of quality and reliability improvement. The motivation stems from the need to handle ‘loosely structured textual data’ within the product development process. Thus far, most studies on data mining within the PDP have focused on numerical databases. This paper establishes the need to study textual databases and highlights possible areas within a generic PDP for consumer and professional products where textual data mining could be employed. In addition, successful implementations of textual data mining within two large multinational companies are presented. Copyright © 2003 John Wiley & Sons, Ltd.
