Cascade of Boolean detector combinations

This paper considers a scenario when we have multiple pre-trained detectors for detecting an event and a small dataset for training a combined detection system. We build the combined detector as a Boolean function of thresholded detector scores and implement it as a binary classification cascade. The cascade structure is computationally efficient by providing the possibility to early termination. For the proposed Boolean combination function, the computational load of classification is reduced whenever the function becomes determinate before all the component detectors have been utilized. We also propose an algorithm, which selects all the needed thresholds for the component detectors within the proposed Boolean combination. We present results on two audio-visual datasets, which prove the efficiency of the proposed combination framework. We achieve state-of-the-art accuracy with substantially reduced computation time in laughter detection task, and our algorithm finds better thresholds for the component detectors within the Boolean combination than the other algorithms found in the literature.

[1]  François Fleuret,et al.  Joint Cascade Optimization Using A Product Of Boosted Classifiers , 2010, NIPS.

[2]  Tuomas Virtanen,et al.  Acoustic event detection in real life recordings , 2010, 2010 18th European Signal Processing Conference.

[3]  Julian Ashbourn,et al.  Biometrics - advanced identity verification: the complete guide , 2000 .

[4]  Julian Ashbourn,et al.  Biometrics: Advanced Identity Verification , 2000, Springer London.

[5]  A. Pawley,et al.  The One-clause-at-a-time Hypothesis , 2000 .

[6]  B. V. K. Vijaya Kumar,et al.  Role of Statistical Dependence Between Classifier Scores in Determining the Best Decision Fusion Rule for Improved Biometric Verification , 2006, MRCS.

[7]  Changyu Shen On the Principles of Believe the Positive and Believe the Negative for Diagnosis Using Two Continuous Tests , 2021, Journal of Data Science.

[8]  Paul A. Viola,et al.  Multiple-Instance Pruning For Learning Efficient Cascade Detectors , 2007, NIPS.

[9]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[10]  Peter L. Hammer,et al.  Boolean Functions - Theory, Algorithms, and Applications , 2011, Encyclopedia of mathematics and its applications.

[11]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[12]  Maja Pantic,et al.  The MAHNOB Laughter database , 2013, Image Vis. Comput..

[13]  Franck Molina,et al.  Detection of pathological biomarkers in human clinical samples via amplifying genetic switches and logic gates , 2015, Science Translational Medicine.

[14]  Nuno Vasconcelos,et al.  Learning Optimal Embedded Cascades , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Alexander Kogan,et al.  Logical analysis of data – the vision of Peter L. Hammer , 2007, Annals of Mathematics and Artificial Intelligence.

[16]  Joni-Kristian Kämäräinen,et al.  Video Shot Boundary Detection using Visual Bag-of-Words , 2013, VISAPP.

[17]  Alvaro A. Cárdenas,et al.  Optimal ROC Curve for a Combination of Classifiers , 2007, NIPS.

[18]  Deva Ramanan,et al.  Face detection, pose estimation, and landmark localization in the wild , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[20]  Peter L. Hammer,et al.  Pareto-optimal patterns in logical analysis of data , 2004, Discret. Appl. Math..

[21]  Maja Pantic,et al.  Bimodal log-linear regression for fusion of audio and visual features , 2013, MM '13.

[22]  Peter L. Hammer,et al.  Accelerated algorithm for pattern detection in logical analysis of data , 2006, Discret. Appl. Math..

[23]  Shuo Yang,et al.  WIDER FACE: A Face Detection Benchmark , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Jiri Matas,et al.  WaldBoost - learning for time constrained sequential detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[25]  Maja Pantic,et al.  Comparison of single-model and multiple-model prediction-based audiovisual fusion , 2015, AVSP.

[26]  B. Schiele,et al.  How Far are We from Solving Pedestrian Detection? , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Andrzej Skowron,et al.  Logical Analysis of Data: Theory, Methodology and Applications , 2013 .

[28]  Steven N. Thorsen,et al.  A Boolean Algebra of receiver operating characteristic curves , 2007, 2007 10th International Conference on Information Fusion.

[29]  Song-Chun Zhu,et al.  Learning Near-Optimal Cost-Sensitive Decision Policy for Object Detection , 2015, 2013 IEEE International Conference on Computer Vision.

[30]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[31]  Raphaël Féraud,et al.  A Fast and Accurate Face Detector Based on Neural Networks , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  Jianhua Chen,et al.  An incremental learning algorithm for constructing Boolean functions from positive and negative examples , 2002, Comput. Oper. Res..

[33]  Nada Lavrac,et al.  The Multi-Purpose Incremental Learning System AQ15 and Its Testing Application to Three Medical Domains , 1986, AAAI.

[34]  Robert Sabourin,et al.  Iterative Boolean combination of classifiers in the ROC space: An application to anomaly detection with HMMs , 2010, Pattern Recognit..

[35]  Kilian Q. Weinberger,et al.  Classifier Cascade for Minimizing Feature Evaluation Cost , 2012, AISTATS.

[36]  Murat Dundar,et al.  Joint Optimization of Cascaded Classifiers for Computer Aided Detection , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[37]  James M. Rehg,et al.  Combining acoustic and visual features to detect laughter in adults' speech , 2015, AVSP.

[38]  John Daugman,et al.  Biometric decision landscapes , 2000 .

[39]  Raymond N. J. Veldhuis,et al.  Threshold-optimized decision-level fusion and its application to biometrics , 2009, Pattern Recognit..

[40]  Paul Hess Dedekind's problem: monotone Boolean functions on the lattice of divisors of an integer. , 1979 .

[41]  Mauricio G. C. Resende,et al.  A continuous approach to inductive inference , 1992, Math. Program..

[42]  Shipeng Yu,et al.  Designing efficient cascaded classifiers: tradeoff between accuracy and cost , 2010, KDD.

[43]  Peter L. Hammer,et al.  Maximum patterns in datasets , 2008, Discret. Appl. Math..

[44]  Joni-Kristian Kämäräinen,et al.  Lifelog Scene Change Detection Using Cascades of Audio and Video Detectors , 2014, ACCV Workshops.

[45]  Dmitry O. Gorodnichy,et al.  Fusion of biometric systems using Boolean combination: an application to iris-based authentication , 2012, Int. J. Biom..

[46]  Mathieu Lagrange,et al.  Proceedings of the Detection and Classification of Acoustic Scenes and Events 2016 Workshop (DCASE2016), Budapest, Hungary, 3 Sep 2016. , 2016 .

[47]  Anton van den Hengel,et al.  Training Effective Node Classifiers for Cascade Classification , 2013, International Journal of Computer Vision.

[48]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[49]  E. T RIANTAPHYLLOU,et al.  A Greedy Randomized Adaptive Search Procedure ( GRASP ) for Inferring Logical Clauses from Examples in Polynomial Time and some Extensions , 1998 .

[50]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[51]  Toshihide Ibaraki,et al.  Logical analysis of numerical data , 1997, Math. Program..

[52]  Gang Hua,et al.  A convolutional neural network cascade for face detection , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).