Paper Information - Wireless Image Retrieval at the Edge

Wireless Image Retrieval at the Edge

We study image retrieval at the wireless edge, where an edge device captures an image that is then used to retrieve similar images from an edge server. These can be images of the same person or vehicle taken by other cameras at different times and locations. Our goal is to maximize the accuracy of the retrieval task under power and bandwidth constraints over the wireless link. Due to the stringent delay constraint of the underlying application, sending the whole image at sufficient quality is not possible. We propose two alternative schemes, based on digital and analog communications, respectively. In the digital approach, we first propose a deep neural network (DNN) aided retrieval-oriented image compression scheme, whose output bit sequence is transmitted over the channel using conventional channel codes. In the analog joint source and channel coding (JSCC) approach, the feature vectors are directly mapped into channel symbols. We evaluate both schemes on image-based re-identification (re-ID) tasks under different channel conditions, including both static and fading channels. We show that the JSCC scheme significantly increases the end-to-end accuracy, speeds up the encoding process, and provides graceful degradation with channel conditions. The proposed architecture is evaluated through extensive simulations on different datasets and channel conditions, as well as through ablation studies.
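As a rough illustration of the analog idea described above, the following minimal sketch sends a feature vector over a noisy channel as real-valued symbols and retrieves the closest gallery entry at the server. It is not the authors' architecture: in the paper the features and the mapping to channel symbols come from trained DNNs, whereas here the features are random vectors and the mapping is a plain power normalization. The helper names (`power_normalize`, `awgn_channel`, `retrieve`), the 64-dimensional features, the additive white Gaussian noise model, and the 10 dB SNR are all assumptions chosen for illustration.

```python
import math
import random


def power_normalize(features, power=1.0):
    """Scale a feature vector so its average per-symbol power equals `power`."""
    energy = sum(x * x for x in features)
    if energy == 0.0:
        return list(features)
    scale = math.sqrt(power * len(features) / energy)
    return [scale * x for x in features]


def awgn_channel(symbols, snr_db, power=1.0, rng=random):
    """Add white Gaussian noise whose variance follows from the SNR in dB."""
    noise_std = math.sqrt(power / (10.0 ** (snr_db / 10.0)))
    return [s + rng.gauss(0.0, noise_std) for s in symbols]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(received, gallery):
    """Rank gallery identities by similarity to the received noisy vector."""
    return sorted(
        gallery,
        key=lambda name: cosine_similarity(received, gallery[name]),
        reverse=True,
    )


# Hypothetical setup: random vectors stand in for DNN feature vectors.
rng = random.Random(0)
gallery = {f"id_{i}": [rng.gauss(0.0, 1.0) for _ in range(64)] for i in range(5)}

# The query is a slightly perturbed view of identity "id_3".
query = [x + 0.1 * rng.gauss(0.0, 1.0) for x in gallery["id_3"]]

symbols = power_normalize(query)                         # edge device: power constraint
received = awgn_channel(symbols, snr_db=10.0, rng=rng)   # wireless link
ranking = retrieve(received, gallery)                    # edge server: nearest match
print(ranking[0])
```

Because the channel noise perturbs the symbols directly rather than corrupting a bit stream, lowering `snr_db` in this sketch makes the similarity scores drift gradually instead of failing abruptly, which is one way to picture the graceful degradation the abstract mentions.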
