A Reinforcement Learning Based Approach for Multitarget Detection in Massive MIMO Radar

This article considers the problem of multitarget detection for massive multiple input multiple output cognitive radar (CR). The concept of CR is based on the perception-action cycle that senses and intelligently adapts to the dynamic environment in order to optimally satisfy a specific mission. However, this usually requires a priori knowledge of the environmental model, which is not available in most cases. We propose a reinforcement learning (RL) based algorithm for cognitive multitarget detection in the presence of unknown disturbance statistics. The radar acts as an agent that continuously senses the unknown environment (i.e., targets and disturbance) and consequently optimizes transmitted waveforms in order to maximize the probability of detection ($P_{\mathsf {D}}$) by focusing the energy in specific range-angle cells (i.e., beamforming). Furthermore, we propose a solution to the beamformin optimization problem with less complexity than the existing methods. Numerical simulations are performed to assess the performance of the proposed RL-based algorithm in both stationary and dynamic environments. The RL-based beamforming is compared to the conventional omnidirectional approach with equal power allocation and to adaptive beamforming with no RL. As highlighted by the proposed numerical results, our RL-based beamformer outperforms both approaches in terms of target detection performance. The performance improvement is even particularly remarkable under environmentally harsh conditions such as low SNR, heavy-tailed disturbance, and rapidly changing scenarios.

[1]  Fulvio Gini,et al.  Cognitive Radars: On the Road to Reality: Progress Thus Far and Possibilities for the Future , 2018, IEEE Signal Processing Magazine.

[2]  Emil Björnson,et al.  Massive MIMO is a Reality - What is Next? Five Promising Research Directions for Antenna Arrays , 2019, ArXiv.

[3]  Francisco Facchinei,et al.  Parallel and Distributed Methods for Constrained Nonconvex Optimization—Part I: Theory , 2016, IEEE Transactions on Signal Processing.

[4]  Fulvio Gini,et al.  The Misspecified Cramer-Rao Bound and Its Application to Scatter Matrix Estimation in Complex Elliptically Symmetric Distributions , 2016, IEEE Transactions on Signal Processing.

[5]  Li Wang,et al.  Reinforcement learning-based waveform optimization for MIMO multi-target detection , 2018, 2018 52nd Asilomar Conference on Signals, Systems, and Computers.

[6]  Gordon P. Wright,et al.  Technical Note - A General Inner Approximation Algorithm for Nonconvex Mathematical Programs , 1978, Oper. Res..

[7]  Fulvio Gini,et al.  Performance Bounds for Parameter Estimation under Misspecified Models: Fundamental Findings and Applications , 2017, IEEE Signal Processing Magazine.

[8]  Mohamed-Slim Alouini,et al.  Fourier-Based Transmit Beampattern Design Using MIMO Radar , 2014, IEEE Transactions on Signal Processing.

[9]  Zdenek Matousek,et al.  Drone detection by Ku-band battlefield radar , 2017, 2017 International Conference on Military Technologies (ICMT).

[10]  H. White,et al.  Nonlinear Regression with Dependent Observations , 1984 .

[11]  Luca Sanguinetti,et al.  Massive MIMO Radar for Target Detection , 2019, IEEE Transactions on Signal Processing.

[12]  Shannon D. Blunt,et al.  A machine learning approach to cognitive radar detection , 2015, 2015 IEEE Radar Conference (RadarCon).

[13]  Feng Zhou,et al.  Target Tracking in Interference Environments Reinforcement Learning and Design for Cognitive Radar Soft Processing , 2008, 2008 Congress on Image and Signal Processing.

[14]  Qingmin Liao,et al.  Robust waveform design for multi-target detection in cognitive MIMO radar , 2018, 2018 IEEE Radar Conference (RadarConf18).

[15]  Ameet Talwalkar,et al.  Foundations of Machine Learning , 2012, Adaptive computation and machine learning.

[16]  L.J. Cimini,et al.  MIMO Radar with Widely Separated Antennas , 2008, IEEE Signal Processing Magazine.

[17]  Jian Li,et al.  MIMO Radar with Colocated Antennas , 2007, IEEE Signal Processing Magazine.

[18]  A. Nuttall Some Integrals Involving the (Q sub M)-Function , 1974 .

[19]  S. Haykin,et al.  Cognitive radar: a way of the future , 2006, IEEE Signal Processing Magazine.

[20]  D. Fuhrmann,et al.  Transmit beamforming for MIMO radar systems using signal cross-correlation , 2008, IEEE Transactions on Aerospace and Electronic Systems.

[21]  Tianyao Huang,et al.  Cognitive Radar Using Reinforcement Learning in Automotive Applications , 2019, 1904.10739.

[22]  Zhi-Quan Luo,et al.  Semidefinite Relaxation of Quadratic Optimization Problems , 2010, IEEE Signal Processing Magazine.

[23]  Simon Haykin,et al.  Cognitive Radar: Step Toward Bridging the Gap Between Neuroscience and Engineering , 2012, Proceedings of the IEEE.

[24]  J. R. Guerci,et al.  Cognitive radar: A knowledge-aided fully adaptive approach , 2010, 2010 IEEE Radar Conference.

[25]  Wei Jiang,et al.  End-to-end Learning of Waveform Generation and Detection for Radar Systems , 2019, 2019 53rd Asilomar Conference on Signals, Systems, and Computers.

[26]  Benjamin Friedlander,et al.  On Transmit Beamforming for MIMO Radar , 2012, IEEE Transactions on Aerospace and Electronic Systems.

[27]  Joel T. Johnson,et al.  Cognitive Radar Framework for Target Detection and Tracking , 2015, IEEE Journal of Selected Topics in Signal Processing.