Faster R-CNN for marine organisms detection and recognition using data augmentation

Abstract Recently, Faster Region-based Convolutional Neural Network (Faster R-CNN) has achieved marvelous accomplishment in object detection and recognition. In this paper, Faster R-CNN is applied to marine organisms detection and recognition. However, the training of Faster R-CNN requires a mass of labeled samples which are difficult to obtain for marine organisms. Therefore, three data augmentation methods dedicated to underwater-imaging are proposed. Specifically, the inverse process of underwater image restoration is used to simulate different marine turbulence environments. Perspective transformation is proposed to simulate different views of camera shooting. Illumination synthesis is used to simulate different marine uneven illuminating environments. The performance of each data augmentation method, together with previous frequently used data augmentation methods are evaluated by Faster R-CNN on the real-world underwater dataset, which validate the effectiveness of the proposed methods for marine organisms detection and recognition.

[1]  Søren Hauberg,et al.  Transformations Based on Continuous Piecewise-Affine Velocity Fields , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Xiang Zhang,et al.  OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.

[3]  Shengen Yan,et al.  Deep Image: Scaling up Image Recognition , 2015, ArXiv.

[4]  Andreas Uhl,et al.  Evaluation of domain specific data augmentation techniques for the classification of celiac disease using endoscopic imagery , 2017, 2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP).

[5]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[6]  Xiaojun Qi,et al.  Face recognition under varying illuminations using logarithmic fractal dimension-based complete eight local directional patterns , 2016, Neurocomputing.

[7]  Hayato Mori,et al.  Development of a Small Size Underwater Robot for Observing Fisheries Resources - Underwater Robot for Assisting Abalone Fishing - , 2016, J. Robotics Mechatronics.

[8]  J J Fernández,et al.  Increasing autonomy within underwater intervention scenarios: The user interface approach , 2010, 2010 IEEE International Systems Conference.

[9]  Xi Zhou,et al.  Data augmentation for face recognition , 2017, Neurocomputing.

[10]  Joongkyu Kim,et al.  Retinex method based on adaptive smoothing for illumination invariant face recognition , 2008, Signal Process..

[11]  Tristan Perez,et al.  DeepFruits: A Fruit Detection System Using Deep Neural Networks , 2016, Sensors.

[12]  John W. Fisher,et al.  Dreaming More Data: Class-dependent Distributions over Diffeomorphisms for Learned Data Augmentation , 2015, AISTATS.

[13]  De Xu,et al.  Active visual tracking of free-swimming robotic fish based on automatic recognition , 2014, Proceeding of the 11th World Congress on Intelligent Control and Automation.

[14]  Anil A. Bharath,et al.  A data augmentation methodology for training machine/deep learning gait recognition algorithms , 2016, BMVC.

[15]  Rafael C. González,et al.  Digital image processing using MATLAB , 2006 .

[16]  Ge Zhong-feng A visibility improving algorithm based on underwater imaging model with non-uniform illumination , 2011 .

[17]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Xiaodong Cui,et al.  Data Augmentation for Deep Neural Network Acoustic Modeling , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[19]  Wen Gao,et al.  Efficient 3D reconstruction for face recognition , 2005, Pattern Recognit..

[20]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[21]  Hanumant Singh,et al.  Subsea Fauna Enumeration Using Vision-Based Marine Robots , 2016, 2016 13th Conference on Computer and Robot Vision (CRV).

[22]  Cai Hao-peng Processing Method for Underwater Degenerative Image , 2010 .

[23]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[24]  R. Hufnagel,et al.  Modulation Transfer Function Associated with Image Transmission through Turbulent Media , 1964 .

[25]  D. G. Goslett,et al.  Ocean robotics in support of fisheries research and management , 2016 .

[26]  Pan Zhou,et al.  A Convolutional Neural Network for Leaves Recognition Using Data Augmentation , 2015, 2015 IEEE International Conference on Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing.

[27]  Hui Li,et al.  A method of removing the uneven illumination phenomenon for optical remote sensing image , 2005, Proceedings. 2005 IEEE International Geoscience and Remote Sensing Symposium, 2005. IGARSS '05..

[28]  Nitish Srivastava,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.

[29]  Richard C. Staunton,et al.  A modified fuzzy C-means image segmentation algorithm for use with uneven illumination patterns , 2007, Pattern Recognit..

[30]  Cordelia Schmid,et al.  MoCap-guided Data Augmentation for 3D Pose Estimation in the Wild , 2016, NIPS.

[31]  Chen Sun,et al.  Revisiting Unreasonable Effectiveness of Data in Deep Learning Era , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[32]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[33]  Falk Scholer,et al.  User performance versus precision measures for simple search tasks , 2006, SIGIR.

[34]  Liang Lin,et al.  Is Faster R-CNN Doing Well for Pedestrian Detection? , 2016, ECCV.