Computation of failure probability via hierarchical clustering

Abstract Most of the recent reliability analysis methods are based on supervised learning approaches, e.g. neural network, support vector machine, importance sampling (IS), Markov Chain Monte Carlo (MCMC), subset simulation (SS), etc. Among these, SS approach has been shown to outperform in effectiveness and robustness, as it classifies failure samples through multiple levels instead of conventional use of a single level. Inherent to supervised learning approaches is the limitation that these only utilize the evaluated information, i.e. the information from the samples whose performance functions have been computed. To overcome this, a new scheme based on modified hierarchical clustering is proposed. This scheme, referred to here as ‘hierarchical failure clustering’ (HFC), adds a pre-processing step via an unsupervised learning approach, e.g. various clustering algorithms. The proposed method exploits the statistical structure embedded in the input samples prior to computing the performance functions of these samples. This statistical structure has a tree based on these input samples with nodes as multi-level clusters. Father clusters at a specific level are used to explore an intermediate failure region and the failure portions are sifted out. Child clusters that are attached to the father failure clusters continue to explore a smaller intermediate failure region. Accordingly, the HFC method proceeds recursively until the target failure region is detected. In contrast with the SS and IS, HFC benefits from additional information from unlabeled samples with the help of a pre-built tree. Through a relatively small added premium in terms of computations for pre-processing, the additional information helps to improve the quality of the intermediate failure probability estimates and thereby significantly reduces the overall computational effort. The efficacy of HFC is theoretically demonstrated here and supported by several examples. HFC holds the promise for complex and large-scale systems for which the performance evaluation is computationally intensive.

[1]  Bruce R. Ellingwood,et al.  A new look at the response surface approach for reliability analysis , 1993 .

[2]  L. Faravelli Response‐Surface Approach for Reliability Analysis , 1989 .

[3]  Junho Song,et al.  Cross-Entropy-Based Adaptive Importance Sampling Using Gaussian Mixture , 2013 .

[4]  James L. Beck,et al.  Reliability Estimation for Dynamical Systems Subject to Stochastic Excitation using Subset Simulation with Splitting , 2005 .

[5]  J. Beck,et al.  Estimation of Small Failure Probabilities in High Dimensions by Subset Simulation , 2001 .

[6]  Ahsan Kareem,et al.  Reliability analysis of wind-sensitive structures , 1990 .

[7]  Jinsuo Nie,et al.  A new directional simulation method for system reliability. Part I: application of deterministic point sets , 2004 .

[8]  C. Bucher Adaptive sampling — an iterative fast Monte Carlo procedure , 1988 .

[9]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[10]  J. Hammersley,et al.  A new Monte Carlo technique: antithetic variates , 1956, Mathematical Proceedings of the Cambridge Philosophical Society.

[11]  W. T. Williams,et al.  Dissimilarity Analysis: a new Technique of Hierarchical Sub-division , 1964, Nature.

[12]  Palle Thoft-Christensen,et al.  Structural Reliability Theory and Its Applications , 1982 .

[13]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[14]  Dirk P. Kroese,et al.  The Cross-Entropy Method: A Unified Approach to Combinatorial Optimization, Monte-Carlo Simulation and Machine Learning , 2004 .

[15]  C. Bucher,et al.  A fast and efficient response surface approach for structural reliability problems , 1990 .

[16]  Qiang Du,et al.  Constrained Centroidal Voronoi Tessellations for Surfaces , 2002, SIAM J. Sci. Comput..

[17]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[18]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[19]  Wilson H. Tang,et al.  Optimal Importance‐Sampling Density Estimator , 1992 .

[20]  Thomas Gerstner,et al.  Numerical integration using sparse grids , 2004, Numerical Algorithms.

[21]  Qiang Du,et al.  Centroidal Voronoi Tessellations: Applications and Algorithms , 1999, SIAM Rev..

[22]  A. Kiureghian,et al.  Multiple design points in first and second-order reliability , 1998 .

[23]  Jianbing Chen,et al.  The principle of preservation of probability and the generalized density evolution equation , 2008 .

[24]  M. Shinozuka Basic Analysis of Structural Safety , 1983 .