Performance Based Modifications of Random Forest to Perform Automated Defect Detection for Fluorescent Penetrant Inspection

The established machine learning algorithm Random Forest (RF) has previously been shown to be effective at automated defect detection for test pieces processed using fluorescent penetrant inspection (FPI). The work presented here investigates three methods (two previously proposed in other fields and one novel) of modifying the FPI RF based on the individual performance of the decision trees within it. Evaluated using the $F_{2}$ score, a weighted harmonic mean of precision and recall that places greater weight on recall, the RF can be reduced in size by up to 50%, improving speed and memory requirements, whilst still giving results equivalent to the full RF. Introducing a performance-based weighting, or retraining decision trees which fall below a certain performance level, however, offers no improvement in results to justify the increased computation time required.
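
To make the size-reduction idea concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes binary labels (1 = defect), that each tree's predictions on a held-out validation set are already available as lists, and that trees are simply ranked by their individual $F_{2}$ score, with ties in the final vote resolved as "no defect". The function names `f_beta`, `prune_by_f2` and `majority_vote` and the `keep_fraction` parameter are hypothetical, introduced only for illustration.

```python
from typing import List, Sequence


def f_beta(y_true: Sequence[int], y_pred: Sequence[int], beta: float = 2.0) -> float:
    """F-beta score for binary labels; beta=2 weights recall above precision."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    b2 = beta * beta
    denom = (1 + b2) * tp + b2 * fn + fp
    # Return 0.0 when there are no positives at all, to avoid dividing by zero.
    return (1 + b2) * tp / denom if denom else 0.0


def prune_by_f2(tree_preds: List[Sequence[int]], y_true: Sequence[int],
                keep_fraction: float = 0.5) -> List[int]:
    """Return indices of the best-scoring trees (assumed selection rule)."""
    scores = [f_beta(y_true, preds) for preds in tree_preds]
    n_keep = max(1, int(len(tree_preds) * keep_fraction))
    ranked = sorted(range(len(tree_preds)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:n_keep])


def majority_vote(tree_preds: List[Sequence[int]], kept: List[int]) -> List[int]:
    """Combine the kept trees by simple majority; ties are classed as 0."""
    n_samples = len(tree_preds[0])
    return [1 if 2 * sum(tree_preds[i][j] for i in kept) > len(kept) else 0
            for j in range(n_samples)]


if __name__ == "__main__":
    y_val = [1, 1, 0, 0]            # toy validation labels
    preds = [
        [1, 1, 0, 0],               # tree 0: perfect
        [1, 0, 1, 0],               # tree 1: one miss, one false alarm
        [1, 1, 1, 0],               # tree 2: one false alarm
        [0, 0, 0, 0],               # tree 3: detects nothing
    ]
    kept = prune_by_f2(preds, y_val, keep_fraction=0.5)
    print(kept, majority_vote(preds, kept))
```

In this toy case, the tree with one false alarm scores higher than the tree with one miss and one false alarm, reflecting the emphasis the $F_{2}$ score places on recall.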
