论文信息 - OpenLORIS-Object: A Dataset and Benchmark towards Lifelong Object Recognition

OpenLORIS-Object: A Dataset and Benchmark towards Lifelong Object Recognition

The recent breakthroughs in computer vision have benefited from the availability of large representative datasets (e.g. ImageNet and COCO) for training. Yet, robotic vision poses unique challenges for applying visual algorithms developed from these standard computer vision datasets due to their implicit assumption over non-varying distributions for a fixed set of tasks. Fully retraining models each time a new task becomes available is infeasible due to computational, storage and sometimes privacy issues, while naïve incremental strategies have been shown to suffer from catastrophic forgetting. It is crucial for the robots to operate continuously under open-set and detrimental conditions with adaptive visual perceptual systems, where lifelong learning is a fundamental capability. However, very few datasets and benchmarks are available to evaluate and compare emerging techniques. To fill this gap, we provide a new lifelong robotic vision dataset ("OpenLORIS-Object") collected via RGB-D cameras. The dataset embeds the challenges faced by a robot in the real-life application and provides new benchmarks for validating lifelong object recognition algorithms. Moreover, we have provided a testbed of 9 state-of-the-art lifelong learning algorithms. Each of them involves 48 tasks with 4 evaluation metrics over the OpenLORIS-Object dataset. The results demonstrate that the object recognition task in the ever-changing difficulty environments is far from being solved and the bottlenecks are at the forward/backward transfer designs. Our dataset and benchmark are publicly available at https://lifelong-robotic-vision.github.io/dataset/object.

[1] Markus Vincze,et al. Recognizing Objects in-the-Wild: Where do we Stand? , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[2] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[3] George K. Karagiannidis,et al. Efficient Machine Learning for Big Data: A Review , 2015, Big Data Res..

[4] Yan Liu,et al. Deep Generative Dual Memory Network for Continual Learning , 2017, ArXiv.

[5] Pietro Perona,et al. The Caltech-UCSD Birds-200-2011 Dataset , 2011 .

[6] Peng Cui,et al. NICO: A Dataset Towards Non-I.I.D. Image Classification , 2019, ArXiv.

[7] Y. LeCun,et al. Learning methods for generic object recognition with invariance to pose and lighting , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[8] Christoph H. Lampert,et al. iCaRL: Incremental Classifier and Representation Learning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9] Derek Hoiem,et al. Learning without Forgetting , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10] Stefan Wermter,et al. Continual Lifelong Learning with Neural Networks: A Review , 2019, Neural Networks.

[11] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[12] Nicolas Y. Masse,et al. Alleviating catastrophic forgetting using context-dependent gating and synaptic stabilization , 2018, Proceedings of the National Academy of Sciences.

[13] Peng Cui,et al. Towards Non-I.I.D. image classification: A dataset and baselines , 2019, Pattern Recognit..

[14] Marc'Aurelio Ranzato,et al. Gradient Episodic Memory for Continual Learning , 2017, NIPS.

[15] Davide Maltoni,et al. CORe50: a New Dataset and Benchmark for Continuous Object Recognition , 2017, CoRL.

[16] Yoshua Bengio,et al. An Empirical Investigation of Catastrophic Forgeting in Gradient-Based Neural Networks , 2013, ICLR.

[17] Alan F. Smeaton,et al. Synthetic-Neuroscore: Using a neuro-AI interface for evaluating generative adversarial networks , 2019, Neurocomputing.

[18] Payman Mohassel,et al. SecureML: A System for Scalable Privacy-Preserving Machine Learning , 2017, 2017 IEEE Symposium on Security and Privacy (SP).

[19] Pietro Perona,et al. Caltech-UCSD Birds 200 , 2010 .

[20] Rosa H. M. Chan,et al. Challenges in Task Incremental Learning for Assistive Robotics , 2020, IEEE Access.

[21] Tomas E. Ward,et al. Generative Adversarial Networks: A Survey and Taxonomy , 2019, ArXiv.

[22] Jiwon Kim,et al. Continual Learning with Deep Generative Replay , 2017, NIPS.

[23] Yandong Guo,et al. Incremental Classifier Learning with Generative Adversarial Networks , 2018, ArXiv.

[24] Andrew Zisserman,et al. Automated Flower Classification over a Large Number of Classes , 2008, 2008 Sixth Indian Conference on Computer Vision, Graphics & Image Processing.

[25] Wolfram Burgard,et al. The limits and potentials of deep learning for robotics , 2018, Int. J. Robotics Res..

[26] Kai Xu,et al. Reduced-Rank Linear Dynamical Systems , 2018, AAAI.

[27] Andreas S. Tolias,et al. Generative replay with feedback connections as a general strategy for continual learning , 2018, ArXiv.

[28] Dieter Fox,et al. A large-scale hierarchical multi-view RGB-D object dataset , 2011, 2011 IEEE International Conference on Robotics and Automation.

[29] Surya Ganguli,et al. Continual Learning Through Synaptic Intelligence , 2017, ICML.

[30] Wei Yang,et al. Are We Ready for Service Robots? The OpenLORIS-Scene Datasets for Lifelong SLAM , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[31] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[32] Sameer A. Nene,et al. Columbia Object Image Library (COIL100) , 1996 .

[33] David Filliat,et al. Don't forget, there is more than forgetting: new metrics for Continual Learning , 2018, ArXiv.

[34] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.

[35] Wei Liu,et al. NDDR-CNN: Layerwise Feature Fusing in Multi-Task CNNs by Neural Discriminative Dimensionality Reduction , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[36] Davide Maltoni,et al. Semi-supervised tuning from temporal coherence , 2015, 2016 23rd International Conference on Pattern Recognition (ICPR).

[37] Khaled Ghédira,et al. Discussion and review on evolving data streams and concept drift adapting , 2018, Evol. Syst..

[38] Razvan Pascanu,et al. Overcoming catastrophic forgetting in neural networks , 2016, Proceedings of the National Academy of Sciences.

[39] Anqi Wu,et al. Neural Dynamics Discovery via Gaussian Process Recurrent Neural Networks , 2019, UAI.

[40] Baoxin Li,et al. A Strategy for an Uncompromising Incremental Learner , 2017, ArXiv.

[41] Sung Ju Hwang,et al. Lifelong Learning with Dynamically Expandable Networks , 2017, ICLR.

[42] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .

[43] Yee Whye Teh,et al. Progress & Compress: A scalable framework for continual learning , 2018, ICML.

[44] Martial Mermillod,et al. The stability-plasticity dilemma: investigating the continuum from catastrophic forgetting to age-limited learning effects , 2013, Front. Psychol..

[45] Qiang Yang,et al. A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.