论文信息 - Place-specific Background Modeling Using Recursive Autoencoders

Place-specific Background Modeling Using Recursive Autoencoders

Image change detection (ICD) to detect changed objects in front of a vehicle with respect to a place-specific background model using an on-board monocular vision system is a fundamental problem in intelligent vehicle (IV). From the perspective of recent large-scale IV applications, it can be impractical in terms of space/time efficiency to train place-specific background models for every possible place. To address these issues, we introduce a new autoencoder (AE) based efficient ICD framework that combines the advantages of AE-based anomaly detection (AD) and AE-based image compression (IC). We propose a method that uses AE reconstruction errors as a single unified measure for training a minimal set of place-specific AEs and maintains detection accuracy. We introduce an efficient incremental recursive AE (rAE) training framework that recursively summarizes a large collection of background images into the AE set. The results of experiments on challenging cross-season ICD tasks validate the efficacy of the proposed approach.

Kanji Tanaka | Koji Takeda | Kousuke Yamaguchi | Takuma Sugimoto | Rino Ide

[1] Ryan M. Eustice,et al. Pairwise Consistent Measurement Set Maximization for Robust Multi-Robot Map Merging , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[2] Ali Farhadi,et al. YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3] Mahmood Fathy,et al. Video anomaly detection and localisation based on the sparsity and reconstruction error of auto-encoder , 2016 .

[4] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[5] David G. Lowe,et al. Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[6] Charu C. Aggarwal,et al. Outlier Ensembles - An Introduction , 2017 .

[7] Mahmood Fathy,et al. Deep-anomaly: Fully convolutional neural network for fast anomaly detection in crowded scenes , 2016, Comput. Vis. Image Underst..

[8] Emmanuel Müller,et al. Statistical selection of relevant subspace projections for outlier ranking , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[9] VARUN CHANDOLA,et al. Anomaly detection: A survey , 2009, CSUR.

[10] Valero Laparra,et al. End-to-end Optimized Image Compression , 2016, ICLR.

[11] Jana Kosecka,et al. Detecting Changes in Images of Street Scenes , 2012, ACCV.

[12] Luc Van Gool,et al. Towards Image Understanding from Deep Compression without Decoding , 2018, ICLR.

[13] Ryan M. Eustice,et al. University of Michigan North Campus long-term vision and lidar dataset , 2016, Int. J. Robotics Res..

[14] D. Opitz,et al. Popular Ensemble Methods: An Empirical Study , 1999, J. Artif. Intell. Res..