Deep Joint Source-Channel Coding for Multi-Task Network

Multi-task learning (MTL) improves the performance of related tasks by sharing knowledge among them. However, most existing MTL networks run entirely on a single device and are not suited to collaborative intelligence (CI) scenarios, in which computation is divided between a mobile device and an edge server. In this work, we propose an MTL network with a deep joint source-channel coding (JSCC) framework that can operate in CI scenarios. We first propose a feature-fusion-based MTL network (FFMNet) for joint object detection and semantic segmentation. Compared with other MTL networks, FFMNet achieves higher performance with fewer parameters. FFMNet is then split into two parts, which run on a mobile device and an edge server, respectively. The intermediate feature generated on the mobile device is transmitted over a wireless channel to the edge server. To reduce the transmission overhead of this intermediate feature, we design a deep JSCC network. Combining the two networks, the whole model achieves 512× compression of the intermediate feature with a performance loss within 2% on both tasks. Finally, by training with channel noise, FFMNet with JSCC is robust to various channel conditions and outperforms a separate source and channel coding scheme.
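The transmission step described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not specify the channel model, the power constraint, or the feature dimensions, so the additive white Gaussian noise (AWGN) channel, the unit average power normalization, the function names, and the shapes below are all assumptions chosen for illustration. The shapes are picked only so that the ratio works out to 512.

```python
import numpy as np


def power_normalize(z, power=1.0):
    """Scale z so that its average per-symbol power equals `power`.

    Assumption: an average power constraint; the abstract does not state one.
    z must contain at least one non-zero element.
    """
    z = np.asarray(z, dtype=np.float64)
    scale = np.sqrt(power * z.size / np.sum(z ** 2))
    return z * scale


def awgn_channel(z, snr_db, rng):
    """Add white Gaussian noise so the signal-to-noise ratio is `snr_db` dB.

    Assumption: a real-valued AWGN channel stands in for the wireless link.
    """
    signal_power = np.mean(z ** 2)
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=z.shape)
    return z + noise


def compression_ratio(feature_shape, num_symbols):
    """Ratio of intermediate-feature elements to transmitted channel symbols."""
    return np.prod(feature_shape) / num_symbols


# Hypothetical sizes: a 512 x 32 x 32 feature encoded into 1024 channel symbols.
rng = np.random.default_rng(0)
symbols = power_normalize(rng.normal(size=1024))  # stand-in for the JSCC encoder output
received = awgn_channel(symbols, snr_db=10.0, rng=rng)  # what the edge server would decode
ratio = compression_ratio((512, 32, 32), symbols.size)
```

Training with noise, in this picture, would mean applying something like `awgn_channel` between the encoder and decoder at every training step, possibly with a randomly drawn `snr_db`, so that the decoder learns to tolerate a range of channel conditions.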
