3D spatial data mining on document sets for the discovery of failure causes in complex technical devices

The retrospective fault analysis of complex technical devices based on documents emerging in the advanced steps of the product life cycle can reveal error sources and problems, which have not been discovered by simulations or other test methods in the early stages of the product life cycle. This paper presents a novel approach to support the failure analysis through (i) a semi-automatic analysis of databases containing productrelated documents in natural language (e. g., problem and error descriptions, repair and maintenance protocols, service bills) using information retrieval and text mining techniques and (ii) an interactive exploration of the data mining results. Our system supports visual data mining by mapping the results of analyzing failure-related documents onto corresponding 3D models. Thus, visualization of statistics about failure sources can reveal problem sources resulting from problematic spatial configurations.

[1]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[2]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[3]  G Salton,et al.  Automatic Analysis, Theme Generation, and Summarization of Machine-Readable Texts , 1994, Science.

[4]  Ben Shneiderman,et al.  The eyes have it: a task by data type taxonomy for information visualizations , 1996, Proceedings 1996 IEEE Symposium on Visual Languages.

[5]  K. A. E. Totton,et al.  Computer Aided Decision Support in Telecommunications , 1996 .

[6]  M. L. Mitchell,et al.  Visual Explanations: Images and Quantities, Evidence and Narrative , 1997 .

[7]  Usama M. Fayyad,et al.  Knowledge Discovery in Databases: An Overview , 1997, ILP.

[8]  Srinivasan Parthasarathy,et al.  New Algorithms for Fast Discovery of Association Rules , 1997, KDD.

[9]  J M McDonald,et al.  Pathology information systems: data mining leads to knowledge discovery. , 1998, Archives of pathology & laboratory medicine.

[10]  Ian Witten,et al.  Data Mining , 2000 .

[11]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[12]  Jiawei Han,et al.  Geographic Data Mining and Knowledge Discovery , 2001 .

[13]  Bruce Gooch,et al.  Non-photorealistic rendering , 2001 .

[14]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[15]  Thomas Strothotte,et al.  Knowledge-supported graphical illustration of texts , 2002, AVI '02.

[16]  Daniel A. Keim,et al.  Information Visualization and Visual Data Mining , 2002, IEEE Trans. Vis. Comput. Graph..

[17]  Thomas Ertl,et al.  Transparency in Interactive Technical Illustrations , 2002, Comput. Graph. Forum.

[18]  Ulrich Güntzer,et al.  Data Mining of Association Rules and the Process of Knowledge Discovery in Databases , 2002, Industrial Conference on Data Mining.

[19]  Stefan Schlechtweg,et al.  Non-photorealistic computer graphics: modeling, rendering, and animation , 2002 .

[20]  K. Hartmann,et al.  A spreading activation approach to text illustration , 2002, SMARTGRAPH '02.

[21]  Christian Borgelt,et al.  EFFICIENT IMPLEMENTATIONS OF APRIORI AND ECLAT , 2003 .

[22]  Ivan Viola,et al.  Importance-driven volume rendering , 2004, IEEE Visualization 2004.

[23]  H. Kriegel,et al.  Spatial Data Mining: Database Primitives, Algorithms and Efficient DBMS Support , 2000, Data Mining and Knowledge Discovery.

[24]  David J. Hand,et al.  Intelligent Data Analysis: An Introduction , 2005 .

[25]  Andreas Hotho,et al.  A Brief Survey of Text Mining , 2005, LDV Forum.

[26]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.