Toward Adaptive Semantic Communications: Efficient Data Transmission via Online Learned Nonlinear Transform Source-Channel Coding

The emerging field of semantic communication is driving research on end-to-end data transmission. By exploiting the powerful representation ability of deep learning models, learned data transmission schemes have outperformed established source and channel coding methods. So far, however, research efforts have mainly concentrated on architecture and model improvements toward a static target domain. Despite their successes, such learned models remain suboptimal because of limited model capacity and imperfect optimization and generalization, particularly when the test data distribution or channel response differs from that used for model training, as is likely in the real world. To tackle this, we propose a novel online learned joint source and channel coding approach that leverages the overfitting property of deep learning models. Specifically, we update the off-the-shelf pre-trained models after deployment in a lightweight online fashion to adapt to distribution shifts in the source data and the environment domain. We take the overfitting concept to the extreme, proposing a series of implementation-friendly methods to adapt the codec model or representations to an individual data or channel state instance, which leads to further substantial gains in bandwidth ratio-distortion performance. The proposed methods enable communication-efficient adaptation of all parameters in the network without sacrificing decoding speed. Our experiments, including a user study, on continually changing target source data and wireless channel environments demonstrate the effectiveness and efficiency of our approach, which outperforms the existing state-of-the-art engineered transmission scheme (VVC combined with 5G LDPC coded transmission).
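The idea of adapting a representation to one data and channel state instance can be illustrated with a deliberately small sketch. It is not the paper's method or code. It assumes a fixed linear "pre-trained" encoder/decoder pair (`ENC`, `DEC`), a channel reduced to a single known scalar `gain` with noise omitted, and a squared-norm penalty weighted by `lam` as a crude stand-in for bandwidth cost. All names and values are hypothetical.

```python
# Toy sketch of instance-level latent adaptation (illustrative assumptions only).
# The codec weights stay frozen; only the latent of one instance is refined online.

def matvec(m, v):
    """Multiply matrix m (list of rows) by vector v."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def transpose(m):
    return [list(col) for col in zip(*m)]

def objective(x, y, dec, gain, lam):
    """Squared-error distortion after the channel plus a latent-energy penalty."""
    x_hat = matvec(dec, [gain * yi for yi in y])
    distortion = sum((a - b) ** 2 for a, b in zip(x, x_hat))
    cost = sum(yi ** 2 for yi in y)
    return distortion + lam * cost

def adapt_latent(x, y0, dec, gain, lam, steps=200, lr=0.05):
    """Gradient descent on the latent for a single source/channel instance."""
    dec_t = transpose(dec)
    y = list(y0)
    for _ in range(steps):
        x_hat = matvec(dec, [gain * yi for yi in y])
        residual = [a - b for a, b in zip(x, x_hat)]
        back = matvec(dec_t, residual)
        # d/dy of ||x - DEC(gain*y)||^2 + lam*||y||^2
        grad = [-2.0 * gain * b + 2.0 * lam * yi for b, yi in zip(back, y)]
        y = [yi - lr * g for yi, g in zip(y, grad)]
    return y

# Hypothetical codec matched to a unit-gain channel.
ENC = [[0.5, 0.5, 0.0, 0.0], [0.0, 0.0, 0.5, 0.5]]
DEC = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]

x = [1.0, 0.6, -0.4, 0.2]   # one source instance
gain = 0.7                  # deployment channel differs from the training assumption
lam = 0.01

y0 = matvec(ENC, x)                       # latent from the frozen encoder
y_adapted = adapt_latent(x, y0, DEC, gain, lam)

loss_before = objective(x, y0, DEC, gain, lam)
loss_after = objective(x, y_adapted, DEC, gain, lam)
print(loss_before, loss_after)
```

Because the toy objective is convex in the latent, the refined latent cannot do worse than the encoder's initial output for this instance. In a learned nonlinear codec the same loop would typically be run with automatic differentiation and a learned rate model, which this sketch does not attempt to reproduce.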
