Predicting Polarization Beyond Semantics for Wearable Robotics

Semantic perception is a key enabler in robotics, which supposes a very resourceful and efficient manner of applying vision information for upper-level navigation and manipulation tasks. Given the challenges on specular semantics such as water hazards, transparent glasses and metallic surfaces, polarization imaging has been explored to complement the RGB-based pixel-wise semantic segmentation because it reflects surface characteristics and provides additional attributes. However, polarimetric measurements generally entail prohibitively expensive cameras and highly accurate calibrations. Inspired by the representation power of Convolutional Neural Networks (CNNs), we propose to predict polarization information from monocular RGB images, precisely per-pixel polarization difference. The core of our approach is a cluster of efficient deep architectures building on factorized convolutions, hierarchical dilations and pyramid representations, aimed to produce both semantic and polarimetric estimations in real time. Comprehensive experiments demonstrate the qualified accuracy on a wearable exoskeleton humanoid robot.

[1]  Luis Miguel Bergasa,et al.  Unifying Terrain Awareness for the Visually Impaired through Real-Time Semantic Segmentation , 2018, Sensors.

[2]  Eugenio Culurciello,et al.  LinkNet: Exploiting encoder representations for efficient semantic segmentation , 2017, 2017 IEEE Visual Communications and Image Processing (VCIP).

[3]  Euntai Kim,et al.  Wet area and puddle detection for Advanced Driver Assistance Systems (ADAS) using a stereo camera , 2016, International Journal of Control, Automation and Systems.

[4]  Eduardo Romera,et al.  ERFNet: Efficient Residual Factorized ConvNet for Real-Time Semantic Segmentation , 2018, IEEE Transactions on Intelligent Transportation Systems.

[5]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Dmitry Vorobiev,et al.  On-sky performance evaluation and calibration of a polarization-sensitive focal plane array , 2016, Astronomical Telescopes + Instrumentation.

[7]  Atsuo Takanishi,et al.  Material recognition CNNs and hierarchical planning for biped robot locomotion on slippery terrain , 2016, 2016 IEEE-RAS 16th International Conference on Humanoid Robots (Humanoids).

[8]  Jian Bai,et al.  Expanding the Detection of Traversable Area with RealSense for the Visually Impaired , 2016, Sensors.

[9]  Ruxandra Tapu,et al.  When Ultrasonic Sensors and Computer Vision Join Forces for Efficient Obstacle Detection and Recognition , 2016, Sensors.

[10]  Sertac Karaman,et al.  Sparse-to-Dense: Depth Prediction from Sparse Depth Samples and a Single Image , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[11]  Jian Bai,et al.  Target enhanced 3D reconstruction based on polarization-coded structured light. , 2017, Optics express.

[12]  Joachim Hertzberg,et al.  Towards semantic maps for mobile robots , 2008, Robotics Auton. Syst..

[13]  Rongguang Liang,et al.  Simultaneous polarization Mirau interferometer based on pixelated polarization camera. , 2016, Optics letters.

[14]  Yu Liu,et al.  Multi-focus image fusion with a deep convolutional neural network , 2017, Inf. Fusion.

[15]  Björn Hartmann,et al.  HindSight: Enhancing Spatial Awareness by Sonifying Detected Objects in Real-Time 360-Degree Video , 2018, CHI.

[16]  Ping Tan,et al.  Polarimetric Dense Monocular SLAM , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[17]  Larry H. Matthies,et al.  Depth from stereo polarization in specular scenes for urban robotics , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[18]  Ji Qi,et al.  Real time complete Stokes polarimetric imager based on a linear polarizer array camera for tissue polarimetric imaging , 2017, Biomedical optics express.

[19]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Patrick Pérez,et al.  The Semantic Paintbrush: Interactive 3D Mapping and Recognition in Large Outdoor Spaces , 2015, CHI.

[21]  Jian Bai,et al.  Detecting Traversable Area and Water Hazards for the Visually Impaired with a pRGB-D Sensor , 2017, Sensors.

[22]  Fan Wang,et al.  Multimodality semantic segmentation based on polarization and color images , 2017, Neurocomputing.

[23]  Su-Lin Lee,et al.  Towards Automatic 3D Shape Instantiation for Deployed Stent Grafts: 2D Multiple-class and Class-imbalance Marker Segmentation with Equally-weighted Focal U-Net , 2017, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[24]  Xiaogang Wang,et al.  Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[26]  Michael Milford,et al.  3D tracking of water hazards with polarized stereo cameras , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[27]  Ji Qi,et al.  A real time complete Stokes polarimetric imager based on a linear polarizer array camera for tissue polarimetric imaging , 2017 .

[28]  Eugenio Culurciello,et al.  ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation , 2016, ArXiv.

[29]  Peter Kontschieder,et al.  The Mapillary Vistas Dataset for Semantic Understanding of Street Scenes , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[30]  N. Engheta,et al.  Polarization-difference imaging: a biologically inspired technique for observation through scattering media. , 1995, Optics letters.