Predicting Failures of Vision Systems

Computer vision systems today fail frequently. They also fail abruptly without warning or explanation. Alleviating the former has been the primary focus of the community. In this work, we hope to draw the community's attention to the latter, which is arguably equally problematic for real applications. We promote two metrics to evaluate failure prediction. We show that a surprisingly straightforward and general approach, that we call ALERT, can predict the likely accuracy (or failure) of a variety of computer vision systems - semantic segmentation, vanishing point and camera parameter estimation, and image memorability prediction - on individual input images. We also explore attribute prediction, where classifiers are typically meant to generalize to new unseen categories. We show that ALERT can be useful in predicting failures of this transfer. Finally, we leverage ALERT to improve the performance of a downstream application of attribute prediction: zero-shot learning. We show that ALERT can outperform several strong baselines for zero-shot learning on four datasets.

[1]  Pierre Hansen,et al.  Roof duality, complementation and persistency in quadratic 0–1 optimization , 1984, Math. Program..

[2]  M. Braga,et al.  Exploratory Data Analysis , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[3]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[4]  Robert P. W. Duin,et al.  Classifier Conditional Posterior Probabilities , 1998, SSPR/SPR.

[5]  Robert L. Williams,et al.  Automatic target recognition of time critical moving targets using 1D high range resolution (HRR) radar , 1999, Proceedings of the 1999 IEEE Radar Conference. Radar into the Next Millennium (Cat. No.99CH36249).

[6]  Robert L. Williams,et al.  Automatic target recognition of time critical moving targets using 1D high range resolution (HRR) radar , 1999 .

[7]  Bir Bhanu,et al.  Predicting Performance of Object Recognition , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Matjaz Kukar Estimating Confidence Values of Individual Predictions by their Typicalness and Reliability , 2004, ECAI.

[9]  David D. Palmer,et al.  Context-based Speech Recognition Error Detection and Correction , 2004, NAACL.

[10]  Stephanie Seneff,et al.  Error Detection and Recovery in Spoken Dialogue Systems , 2004, HLT-NAACL 2004.

[11]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[12]  Hui Jiang,et al.  Confidence measures for speech recognition: A survey , 2005, Speech Commun..

[13]  Rong Wang,et al.  Learning models for predicting recognition performance , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[14]  Padraig Cunningham,et al.  Generating Estimates of Classification Confidence for a Case-Based Spam Filter , 2005, ICCBR.

[15]  Robi Polikar,et al.  Ensemble Confidence Estimates Posterior Probability , 2005, Multiple Classifier Systems.

[16]  Jianguo Zhang,et al.  The PASCAL Visual Object Classes Challenge , 2006 .

[17]  Bruno Mirbach,et al.  Confidence Estimation in Classification Decision: A Method for Detecting Unseen Patterns , 2006 .

[18]  Antonio Criminisi,et al.  TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context , 2007, International Journal of Computer Vision.

[19]  Qiang Ji,et al.  Modeling and Predicting Face Recognition System Performance Based on Analysis of Similarity Scores , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Elham Tabassi,et al.  Performance of Biometric Quality Measures , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Vladimir Kolmogorov,et al.  Optimizing Binary MRFs via Extended Roof Duality , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Thomas J. Walsh,et al.  Knows what it knows: a framework for self-aware learning , 2008, ICML '08.

[23]  Koby Crammer,et al.  Confidence-weighted linear classification , 2008, ICML '08.

[24]  Christoph H. Lampert,et al.  Learning to detect unseen object classes by between-class attribute transfer , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Lise Getoor,et al.  Reflect and correct: A misclassification prediction approach to active inference , 2009, TKDD.

[26]  T. Kanade,et al.  Geometric reasoning for single image structure recovery , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Derek Hoiem,et al.  Recovering the spatial layout of cluttered rooms , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[28]  Jitendra Malik,et al.  Poselets: Body part detectors trained using 3D human pose annotations , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[29]  P. Jonathon Phillips,et al.  An introduction to biometric-completeness: The equivalence of matching and quality , 2009, 2009 IEEE 3rd International Conference on Biometrics: Theory, Applications, and Systems.

[30]  Ali Farhadi,et al.  Describing objects by their attributes , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Shree K. Nayar,et al.  Attribute and simile classifiers for face verification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[32]  D. Sontag 1 Introduction to Dual Decomposition for Inference , 2010 .

[33]  Anderson Rocha,et al.  Robust Fusion: Extreme Value Theory for Recognition Score Normalization , 2010, ECCV.

[34]  Martial Hebert,et al.  Stacked Hierarchical Labeling , 2010, ECCV.

[35]  Cristian Sminchisescu,et al.  Constrained parametric min-cuts for automatic object segmentation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[36]  Krista A. Ehinger,et al.  SUN database: Large-scale scene recognition from abbey to zoo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[37]  Marc Pollefeys,et al.  Segmenting video into classes of algorithm-suitability , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[38]  Yong Jae Lee,et al.  Object-graphs for context-aware category discovery , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[39]  Alexei A. Efros,et al.  Unbiased look at dataset bias , 2011, CVPR 2011.

[40]  Anderson Rocha,et al.  Meta-Recognition: The Theory and Practice of Recognition Score Analysis , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Nathan J. Leap A Confidence Paradigm for Classification Systems , 2012 .

[42]  Daphne Koller,et al.  Discriminative learning of relaxed hierarchy for large-scale visual recognition , 2011, 2011 International Conference on Computer Vision.

[43]  Tommi S. Jaakkola,et al.  Introduction to dual composition for inference , 2011 .

[44]  Yaser Al-Onaizan,et al.  Goodness: A Method for Measuring Machine Translation Confidence , 2011, ACL.

[45]  Jianxiong Xiao,et al.  What makes an image memorable? , 2011, CVPR 2011.

[46]  Martial Hebert,et al.  Data-Driven Scene Understanding from 3D Models , 2012, BMVC.

[47]  Shang-Hua Teng,et al.  Power SVM: Generalization with exemplar classification uncertainty , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[48]  Terrance E. Boult,et al.  Multi-attribute spaces: Calibration for attribute fusion and similarity search , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[49]  Anderson Rocha,et al.  Learning for Meta-Recognition , 2012, IEEE Transactions on Information Forensics and Security.

[50]  Ali Farhadi,et al.  Building a dictionary of image fragments , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[51]  Alexei A. Efros,et al.  Undoing the Damage of Dataset Bias , 2012, ECCV.

[52]  Derek Hoiem,et al.  Diagnosing Error in Object Detectors , 2012, ECCV.

[53]  James Hays,et al.  SUN attribute database: Discovering, annotating, and recognizing scene attributes , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[54]  Cristian Sminchisescu,et al.  CPMC: Automatic Object Segmentation Using Constrained Parametric Min-Cuts , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[55]  C. V. Jawahar,et al.  Has My Algorithm Succeeded? An Evaluator for Human Pose Estimators , 2012, ECCV.

[56]  Jonathan Krause,et al.  Hedging your bets: Optimizing accuracy-specificity trade-offs in large scale visual recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[57]  Rahul Nair,et al.  Ensemble Learning for Confidence Measures in Stereo Vision , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[58]  P. Perona,et al.  UvA-DARE (Digital Academic Repository) A Lazy Man's Approach to Benchmarking: Semisupervised Classifier Evaluation and Recalibration A Lazy Man’s Approach to Benchmarking: Semisupervised Classifier Evaluation and Recalibration , 2013 .

[59]  Martial Hebert,et al.  An architecture for online semantic labeling on UGVs , 2013, Defense, Security, and Sensing.

[60]  Stefan Carlsson,et al.  Properties of Datasets Predict the Performance of Classifiers , 2013, BMVC.

[61]  Fingerprint Image Quality , 2015, Encyclopedia of Biometrics.