Shape mining: A holistic data mining approach for engineering design

Abstract Although the integration of engineering data within the framework of product data management systems has been successful in the recent years, the holistic analysis (from a systems engineering perspective) of multi-disciplinary data or data based on different representations and tools is still not realized in practice. At the same time, the application of advanced data mining techniques to complete designs is very promising and bears a high potential for synergy between different teams in the development process. In this paper, we propose shape mining as a framework to combine and analyze data from engineering design across different tools and disciplines. In the first part of the paper, we introduce unstructured surface meshes as meta-design representations that enable us to apply sensitivity analysis, design concept retrieval and learning as well as methods for interaction analysis to heterogeneous engineering design data. We propose a new measure of relevance to evaluate the utility of a design concept. In the second part of the paper, we apply the formal methods to passenger car design. We combine data from different representations, design tools and methods for a holistic analysis of the resulting shapes. We visualize sensitivities and sensitive cluster centers (after feature reduction) on the car shape. Furthermore, we are able to identify conceptual design rules using tree induction and to create interaction graphs that illustrate the interrelation between spatially decoupled surface areas. Shape data mining in this paper is studied for a multi-criteria aerodynamic problem, i.e. drag force and rear lift, however, the extension to quality criteria from different disciplines is straightforward as long as the meta-design representation is still applicable.

[1]  Guoqiang Peter Zhang,et al.  Neural networks for classification: a survey , 2000, IEEE Trans. Syst. Man Cybern. Part C.

[2]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[3]  K E Willard,et al.  Probabilistic sensitivity analysis methods for general decision models. , 1986, Computers and biomedical research, an international journal.

[4]  Thomas Villmann,et al.  Rule Extraction from Self-Organizing Networks , 2002, ICANN.

[5]  Renée J. Miller,et al.  Association rules over interval data , 1997, SIGMOD '97.

[6]  Timothée Jost,et al.  Fast Geometric Matching for Shape Registration , 2002 .

[7]  Gianfranco La Rocca,et al.  Knowledge based engineering: Between AI and CAD. Review of a language based technology to support engineering design , 2012, Adv. Eng. Informatics.

[8]  Aleks Jakulin Machine Learning Based on Attribute Interactions , 2005 .

[9]  George J. Klir,et al.  Fuzzy Sets, Fuzzy Logic, and Fuzzy Systems - Selected Papers by Lotfi A Zadeh , 1996, Advances in Fuzzy Systems - Applications and Theory.

[10]  Howard J. Hamilton,et al.  Interestingness measures for data mining: A survey , 2006, CSUR.

[11]  Nikolaus Hansen,et al.  Completely Derandomized Self-Adaptation in Evolution Strategies , 2001, Evolutionary Computation.

[12]  Daisuke Sasaki,et al.  Visualization and Data Mining of Pareto Solutions Using Self-Organizing Map , 2003, EMO.

[13]  Bernhard Sendhoff,et al.  Knowledge Extraction from Aerodynamic Design Data and its Application to 3D Turbine Blade Geometries , 2008, J. Math. Model. Algorithms.

[14]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[15]  Andreas Griewank,et al.  Introduction to Automatic Differentiation , 2003 .

[16]  Stefan Wermter,et al.  Data mining using rule extraction from Kohonen self-organising maps , 2006, Neural Computing & Applications.

[17]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[18]  Klaus Krippendorff,et al.  Ross Ashby's information theory: a bit of history, some solutions to problems, and what we face today , 2009, Int. J. Gen. Syst..

[19]  Kalyanmoy Deb,et al.  Multi-objective optimization using evolutionary algorithms , 2001, Wiley-Interscience series in systems and optimization.

[20]  R. Ho Algebraic Topology , 2022 .

[21]  Dirk Schaefer,et al.  A semantic file system for integrated product data management , 2011, Adv. Eng. Informatics.

[22]  Alfred Ultsch,et al.  Knowledge Extraction from Self-Organizing Neural Networks , 1993 .

[23]  D. Devaraj,et al.  Fuzzy Sets & Fuzzy Logic , 2014 .

[24]  Alexander M. Bronstein,et al.  Numerical Geometry of Non-Rigid Shapes , 2009, Monographs in Computer Science.

[25]  Michael Rath,et al.  Modeling Design and Flow Feature Interactions for Automotive Synthesis , 2011, IDEAL.

[26]  Erkki Oja,et al.  Engineering applications of the self-organizing map , 1996, Proc. IEEE.

[27]  Niles A. Pierce,et al.  An Introduction to the Adjoint Approach to Design , 2000 .

[28]  Ivan Bratko,et al.  Testing the significance of attribute interactions , 2004, ICML.

[29]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[30]  Andy J. Keane,et al.  Engineering Design via Surrogate Modelling - A Practical Guide , 2008 .

[31]  John F. Hughes,et al.  Direct manipulation of free-form deformations , 1992, SIGGRAPH.

[32]  Jorma Laaksonen,et al.  Variants of self-organizing maps , 1990, International 1989 Joint Conference on Neural Networks.

[33]  Lida Xu,et al.  Enterprise Systems: State-of-the-Art and Future Trends , 2011, IEEE Transactions on Industrial Informatics.

[34]  A. Saltelli,et al.  A quantitative model-independent method for global sensitivity analysis of model output , 1999 .

[35]  W. J. McGill Multivariate information transmission , 1954, Trans. IRE Prof. Group Inf. Theory.

[36]  Ilya M. Sobol,et al.  Sensitivity Estimates for Nonlinear Mathematical Models , 1993 .

[37]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[38]  Nikolaus Hansen,et al.  Evaluating the CMA Evolution Strategy on Multimodal Test Functions , 2004, PPSN.

[39]  Bernhard Sendhoff,et al.  Representing the Change - Free Form Deformation for Evolutionary Design Optimization , 2008, Evolutionary Computation in Practice.

[40]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[41]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[42]  Lawrence H. Staib,et al.  Shape-based 3D surface correspondence using geodesics and local geometry , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[43]  Lior Rokach,et al.  Data Mining And Knowledge Discovery Handbook , 2005 .

[44]  Andrew W. Moore,et al.  X-means: Extending K-means with Efficient Estimation of the Number of Clusters , 2000, ICML.

[45]  Pang-Ning Tan,et al.  Interestingness Measures for Association Patterns : A Perspective , 2000, KDD 2000.

[46]  Shigeru Obayashi,et al.  Data mining for multidisciplinary design space of regional-jet wing , 2005, 2005 IEEE Congress on Evolutionary Computation.

[47]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[48]  F. Menter Improved two-equation k-omega turbulence models for aerodynamic flows , 1992 .

[49]  Petros Koumoutsakos,et al.  Learning probability distributions in continuous evolutionary algorithms – a comparative review , 2004, Natural Computing.

[50]  Bernhard Sendhoff,et al.  Application of Sensitivity Analysis for an Improved Representation in Evolutionary Design Optimization , 2012, 2012 Sixth International Conference on Genetic and Evolutionary Computing.

[51]  Marc Alexa,et al.  Recent Advances in Mesh Morphing , 2002, Comput. Graph. Forum.

[52]  Robert Tibshirani,et al.  Estimating the number of clusters in a data set via the gap statistic , 2000 .

[53]  T. J. Mitchell,et al.  Exploratory designs for computational experiments , 1995 .

[54]  Lucas Bradstreet,et al.  A Fast Way of Calculating Exact Hypervolumes , 2012, IEEE Transactions on Evolutionary Computation.

[55]  Erkki Jantunen,et al.  Challenges in Data Management in Product Life Cycle Engineering , 2011 .

[56]  Shigeru Obayashi,et al.  Multi-Objective Design Exploration and its Applications , 2010 .

[57]  Geoffrey I. Webb Discovering significant rules , 2006, KDD '06.

[58]  Thomas W. Sederberg,et al.  Free-form deformation of solid geometric models , 1986, SIGGRAPH.

[59]  Saltelli Andrea,et al.  Global Sensitivity Analysis: The Primer , 2008 .

[60]  Klaus Krippendorff,et al.  Information Theory: Structural Models for Qualitative Data. , 1988 .