Learning Maximum Margin Channel Decoders

The problem of learning a channel decoder is considered for two channel models. The first model is an additive noise channel whose noise distribution is unknown and nonparametric. The learner is provided with a fixed codebook and a dataset comprised of independent samples of the noise, and is required to select a precision matrix for a nearest neighbor decoder in terms of the Mahalanobis distance. The second model is a non-linear channel with additive white Gaussian noise and unknown channel transformation. The learner is provided with a fixed codebook and a dataset comprised of independent input-output samples of the channel, and is required to select a matrix for a nearest neighbor decoder with a linear kernel. For both models, the objective of maximizing the margin of the decoder is addressed. Accordingly, for each channel model, a regularized loss minimization problem with a codebook-related regularization term and hinge-like loss function is developed, which is inspired by the support vector machine paradigm for classification problems. Expected generalization error bounds for the error probability loss function are provided for both models, under optimal choice of the regularization parameter. For the additive noise channel, a theoretical guidance for choosing the training signal-to-noise ratio is proposed based on this bound. In addition, for the non-linear channel, a high probability uniform generalization error bound is provided for the hypothesis class. For each channel, a stochastic sub-gradient descent algorithm for solving the regularized loss minimization problem is proposed, and an optimization error bound is stated. The performance of the proposed algorithms is demonstrated through several examples.

[1]  Nir Weinberger,et al.  Generalization Bounds and Algorithms for Learning to Communicate Over Additive Noise Channels , 2021, IEEE Transactions on Information Theory.

[2]  Takeo Ohseki,et al.  Deep Learning-Based Bit Reliability Based Decoding for Non-binary LDPC Codes , 2021, 2021 IEEE International Symposium on Information Theory (ISIT).

[3]  Julius Berner,et al.  The Modern Mathematics of Deep Learning , 2021, ArXiv.

[4]  Marco Di Renzo,et al.  Model-Driven Deep Learning Based Channel Estimation and Feedback for Millimeter-Wave Massive Hybrid MIMO Systems , 2021, IEEE Journal on Selected Areas in Communications.

[5]  Jinkang Zhu,et al.  Knowledge-Driven Machine Learning-based Channel Estimation in Massive MIMO System , 2021, 2021 IEEE Wireless Communications and Networking Conference Workshops (WCNCW).

[6]  Ling Qiu,et al.  Deep Learning-Based channel estimation with SRGAN in OFDM Systems , 2021, 2021 IEEE Wireless Communications and Networking Conference (WCNC).

[7]  Changshui Zhang,et al.  Recent Advances in Large Margin Learning , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Bo Yuan,et al.  Doubly Residual Neural Decoder: Towards Low-Complexity High-Performance Channel Decoding , 2021, AAAI.

[9]  Sami Akin,et al.  Joint Channel Estimation and Data Decoding using SVM-based Receivers , 2020, ArXiv.

[10]  O. Papaspiliopoulos High-Dimensional Probability: An Introduction with Applications in Data Science , 2020 .

[11]  Allan Grønlund Jørgensen,et al.  Near-Tight Margin-Based Generalization Bounds for Support Vector Machines , 2020, ICML.

[12]  Yonina C. Eldar,et al.  Data-Driven Factor Graphs for Deep Symbol Detection , 2020, 2020 IEEE International Symposium on Information Theory (ISIT).

[13]  Joonhyuk Kang,et al.  Meta-Learning to Communicate: Fast End-to-End Training for Fading Channels , 2019, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  A. Sahai,et al.  Blind Interactive Learning of Modulation Schemes: Multi-Agent Cooperation Without Co-Design , 2019, IEEE Access.

[15]  Anant Sahai,et al.  Learning to Communicate in a Noisy Environment , 2019, ArXiv.

[16]  Ghaya Rekaya-Ben Othman,et al.  DNN assisted Sphere Decoder , 2019, 2019 IEEE International Symposium on Information Theory (ISIT).

[17]  Yonina C. Eldar,et al.  ViterbiNet: Symbol Detection Using a Deep Learning Based Viterbi Algorithm , 2019, 2019 IEEE 20th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[18]  Riccardo Raheli,et al.  Reinforcement Learning for Channel Coding: Learned Bit-Flipping Decoding , 2019, 2019 57th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[19]  Yonina C. Eldar,et al.  ViterbiNet: A Deep Learning Based Viterbi Algorithm for Symbol Detection , 2019, IEEE Transactions on Wireless Communications.

[20]  David Burshtein,et al.  Unsupervised Linear and Nonlinear Channel Equalization and Decoding Using Variational Autoencoders , 2019, IEEE Transactions on Cognitive Communications and Networking.

[21]  Biing-Hwang Juang,et al.  Deep Learning-Based End-to-End Wireless Communication Systems With Conditional GANs as Unknown Channels , 2019, IEEE Transactions on Wireless Communications.

[22]  Geoffrey Ye Li,et al.  Deep Learning based End-to-End Wireless Communication Systems with Conditional GAN as Unknown Channel , 2019 .

[23]  Himanshu Asnani,et al.  MIND: Model Independent Neural Decoder , 2019, 2019 IEEE 20th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[24]  Henry D. Pfister,et al.  Learned Belief-Propagation Decoding with Simple Scaling and SNR Adaptation , 2019, 2019 IEEE International Symposium on Information Theory (ISIT).

[25]  Tadashi Wadayama,et al.  Deep Learning-Aided Trainable Projected Gradient Decoding for LDPC Codes , 2019, 2019 IEEE International Symposium on Information Theory (ISIT).

[26]  Jaeho Lee,et al.  Learning finite-dimensional coding schemes with nonlinear reconstruction maps , 2018, SIAM J. Math. Data Sci..

[27]  Sreeram Kannan,et al.  Communication Algorithms via Deep Learning , 2018, ICLR.

[28]  Tao Jiang,et al.  Deep learning for wireless physical layer: Opportunities and challenges , 2017, China Communications.

[29]  David Burshtein,et al.  Deep Learning Methods for Improved Decoding of Linear Codes , 2017, IEEE Journal of Selected Topics in Signal Processing.

[30]  Abdelaziz Samet,et al.  Analyzing performance of joint SVR interpolation for LTE system with 64-QAM modulation under 500 Km/h mobile velocity , 2017, 2017 Sixth International Conference on Communications and Networking (ComNet).

[31]  Jakob Hoydis,et al.  An Introduction to Deep Learning for the Physical Layer , 2017, IEEE Transactions on Cognitive Communications and Networking.

[32]  Stephan ten Brink,et al.  On deep learning-based channel decoding , 2017, 2017 51st Annual Conference on Information Sciences and Systems (CISS).

[33]  Krzysztof Wesolowski,et al.  Channel Coding for Ultra-Reliable Low-Latency Communication in 5G Systems , 2016, 2016 IEEE 84th Vehicular Technology Conference (VTC-Fall).

[34]  Joseph M. Kahn,et al.  Convex Channel Power Optimization in Nonlinear WDM Systems Using Gaussian Noise Model , 2016, Journal of Lightwave Technology.

[35]  Thomas L. Marzetta,et al.  Massive MIMO: An Introduction , 2015, Bell Labs Technical Journal.

[36]  Ronen Dar,et al.  On shaping gain in the nonlinear fiber-optic channel , 2014, 2014 IEEE International Symposium on Information Theory.

[37]  Shai Ben-David,et al.  Understanding Machine Learning: From Theory to Algorithms , 2014 .

[38]  C. Meinel,et al.  Digital Communication , 2014, X.media.publishing.

[39]  Nicholas Kalouptsidis,et al.  New achievable rates for nonlinear Volterra channels via martingale inequalities , 2012, 2012 IEEE International Symposium on Information Theory Proceedings.

[40]  R. Essiambre,et al.  Nonlinear Shannon Limit in Pseudolinear Coherent Systems , 2012, Journal of Lightwave Technology.

[41]  Clément Levrard Fast rates for empirical vector quantization , 2012, 1201.6052.

[42]  P. Winzer,et al.  Capacity Limits of Optical Fiber Networks , 2010, Journal of Lightwave Technology.

[43]  Ambuj Tewari,et al.  On the Generalization Ability of Online Strongly Convex Programming Algorithms , 2008, NIPS.

[44]  Nathan Srebro,et al.  SVM optimization: inverse dependence on training set size , 2008, ICML '08.

[45]  Ting Wang,et al.  Proposal to Achieve 1 Tb/s per Wavelength Transmission Using Three-Dimensional LDPC-Coded Modulation , 2008, IEEE Photonics Technology Letters.

[46]  Y. Singer,et al.  Pegasos: Primal Estimated sub-GrAdient SOlver for SVM , 2011, ICML.

[47]  László Györfi,et al.  Individual convergence rates in empirical vector quantizer design , 2005, IEEE Transactions on Information Theory.

[48]  András Antos,et al.  Improved minimax bounds on the test and training distortion of empirically designed vector quantizers , 2005, IEEE Transactions on Information Theory.

[49]  Fernando Pérez-Cruz,et al.  SVM multiregression for nonlinear channel estimation in multiple-input multiple-output systems , 2004, IEEE Transactions on Signal Processing.

[50]  Peter L. Bartlett,et al.  Rademacher and Gaussian Complexities: Risk Bounds and Structural Results , 2003, J. Mach. Learn. Res..

[51]  Koby Crammer,et al.  On the Learnability and Design of Output Codes for Multiclass Problems , 2002, Machine Learning.

[52]  P. Mitra,et al.  Nonlinear limits to the information capacity of optical fibre communications , 2000, Nature.

[53]  T. Linder On the training distortion of vector quantizers , 2000, 2000 IEEE International Symposium on Information Theory (Cat. No.00CH37060).

[54]  Ghassan Kawas Kaleh,et al.  Joint parameter estimation and symbol detection for linear or nonlinear unknown channels , 1994, IEEE Trans. Commun..

[55]  C.F.N. Cowan,et al.  Adaptive equalization of finite nonlinear channels using multilayer perceptron , 1990 .

[56]  Charles E. McCulloch,et al.  Symmetric Matrix Derivatives with Applications , 1982 .

[57]  V.W.S. Chan,et al.  Principles of Digital Communication and Coding , 1979 .

[58]  Jianqing Fan,et al.  High-Dimensional Statistics , 2014 .

[59]  J. Weston,et al.  Support Vector Machine Solvers , 2007 .