A Reinforcement Learning based approach for Multi-target Detection in Massive MIMO radar.

This paper considers the problem of multi-target detection for massive multiple input multiple output (MMIMO) cognitive radar (CR). The concept of CR is based on the perception-action cycle that senses and intelligently adapts to the dynamic environment in order to optimally satisfy a specific mission. However, this usually requires a priori knowledge of the environmental model, which is not available in most cases. We propose a reinforcement learning (RL) based algorithm for cognitive multi-target detection in the presence of unknown disturbance statistics. The radar acts as an agent that continuously senses the unknown environment (i.e., targets and disturbance) and consequently optimizes transmitted waveforms in order to maximize the probability of detection ($P_\mathsf{D}$) by focusing the energy in specific range-angle cells (i.e., beamforming). Furthermore, we propose a solution to the beamforming optimization problem with less complexity than the existing methods. Numerical simulations are performed to assess the performance of the proposed RL-based algorithm in both stationary and dynamic environments. The RL based beamforming is compared to the conventional omnidirectional approach with equal power allocation and to adaptive beamforming with no RL. As highlighted by the proposed numerical results, our RL-based beamformer outperforms both approaches in terms of target detection performance. The performance improvement is even particularly remarkable under environmentally harsh conditions such as low SNR, heavy-tailed disturbance and rapidly changing scenarios.

[1]  Zhi-Quan Luo,et al.  Semidefinite Relaxation of Quadratic Optimization Problems , 2010, IEEE Signal Processing Magazine.

[2]  Benjamin Friedlander,et al.  On Transmit Beamforming for MIMO Radar , 2012, IEEE Transactions on Aerospace and Electronic Systems.

[3]  L.J. Cimini,et al.  MIMO Radar with Widely Separated Antennas , 2008, IEEE Signal Processing Magazine.

[4]  D. Fuhrmann,et al.  Transmit beamforming for MIMO radar systems using signal cross-correlation , 2008, IEEE Transactions on Aerospace and Electronic Systems.

[5]  Zdenek Matousek,et al.  Drone detection by Ku-band battlefield radar , 2017, 2017 International Conference on Military Technologies (ICMT).

[6]  Petar M. Djuric,et al.  Reinforcement Learning for UAV Autonomous Navigation, Mapping and Target Detection , 2020, 2020 IEEE/ION Position, Location and Navigation Symposium (PLANS).

[7]  Tianyao Huang,et al.  Cognitive Radar Using Reinforcement Learning in Automotive Applications , 2019, 1904.10739.

[8]  Fulvio Gini,et al.  The Misspecified Cramer-Rao Bound and Its Application to Scatter Matrix Estimation in Complex Elliptically Symmetric Distributions , 2016, IEEE Transactions on Signal Processing.

[9]  Fulvio Gini,et al.  Cognitive Radars: On the Road to Reality: Progress Thus Far and Possibilities for the Future , 2018, IEEE Signal Processing Magazine.

[10]  Francisco Facchinei,et al.  Parallel and Distributed Methods for Constrained Nonconvex Optimization—Part I: Theory , 2016, IEEE Transactions on Signal Processing.

[11]  Simon Haykin,et al.  Cognitive Radar: Step Toward Bridging the Gap Between Neuroscience and Engineering , 2012, Proceedings of the IEEE.

[12]  H. White,et al.  Nonlinear Regression with Dependent Observations , 1984 .

[13]  Qingmin Liao,et al.  Robust waveform design for multi-target detection in cognitive MIMO radar , 2018, 2018 IEEE Radar Conference (RadarConf18).

[14]  Fulvio Gini,et al.  Performance Bounds for Parameter Estimation under Misspecified Models: Fundamental Findings and Applications , 2017, IEEE Signal Processing Magazine.

[15]  Wei Jiang,et al.  End-to-end Learning of Waveform Generation and Detection for Radar Systems , 2019, 2019 53rd Asilomar Conference on Signals, Systems, and Computers.

[16]  Joel T. Johnson,et al.  Cognitive Radar Framework for Target Detection and Tracking , 2015, IEEE Journal of Selected Topics in Signal Processing.

[17]  Shannon D. Blunt,et al.  A machine learning approach to cognitive radar detection , 2015, 2015 IEEE Radar Conference (RadarCon).

[18]  Gordon P. Wright,et al.  Technical Note - A General Inner Approximation Algorithm for Nonconvex Mathematical Programs , 1978, Oper. Res..

[19]  Mohamed-Slim Alouini,et al.  Fourier-Based Transmit Beampattern Design Using MIMO Radar , 2014, IEEE Transactions on Signal Processing.

[20]  Luca Sanguinetti,et al.  Massive MIMO Radar for Target Detection , 2019, IEEE Transactions on Signal Processing.

[21]  J. R. Guerci,et al.  Cognitive radar: A knowledge-aided fully adaptive approach , 2010, 2010 IEEE Radar Conference.

[22]  Emil Björnson,et al.  Massive MIMO is a Reality - What is Next? Five Promising Research Directions for Antenna Arrays , 2019, ArXiv.

[23]  Jian Li,et al.  MIMO Radar with Colocated Antennas , 2007, IEEE Signal Processing Magazine.

[24]  Ameet Talwalkar,et al.  Foundations of Machine Learning , 2012, Adaptive computation and machine learning.

[25]  Li Wang,et al.  Reinforcement learning-based waveform optimization for MIMO multi-target detection , 2018, 2018 52nd Asilomar Conference on Signals, Systems, and Computers.

[26]  Feng Zhou,et al.  Target Tracking in Interference Environments Reinforcement Learning and Design for Cognitive Radar Soft Processing , 2008, 2008 Congress on Image and Signal Processing.

[27]  S. Haykin,et al.  Cognitive radar: a way of the future , 2006, IEEE Signal Processing Magazine.

[28]  A. Nuttall Some Integrals Involving the (Q sub M)-Function , 1974 .