Learning Low-Correlation GPS Spreading Codes with a Policy Gradient Algorithm

With the birth of the next-generation GPS III constellation and the upcoming launch of the Navigation Technology Satellite3 (NTS-3) testing platform to explore future technologies for GPS, we are indeed entering a new era of satellite navigation. Correspondingly, it is time to revisit the design methods of the GPS spreading code families. In this work, we develop a Gaussian policy gradient-based reinforcement learning algorithmwhich constructs high-quality families of spreading code sequences. We demonstrate the ability of our algorithm to achieve better mean-squared autoand cross-correlation than well-chosen families of equal-length Gold codes and Weil codes. Furthermore, we compare our algorithm with an analogous genetic algorithm implementation assigned the same code evaluation metric. To the best of the authors’ knowledge, this is the first work to explore using a machine learning / reinforcement learning approach to design navigation spreading codes.

[1]  Stefan Wallner,et al.  How to Optimize GNSS Signals and Codes for Indoor Positioning , 2006 .

[2]  J·O·温克尔 Spreading codes for a satellite navigation system , 2004 .

[3]  J. Spall STOCHASTIC OPTIMIZATION , 2002 .

[4]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[5]  R. Gold,et al.  Optimal binary sequences for spread spectrum multiplexing (Corresp.) , 1967, IEEE Trans. Inf. Theory.

[6]  Grace Xingxin Gao,et al.  Devising High-Performing Random Spreading Code Sequences Using a Multi-Objective Genetic Algorithm , 2019 .

[7]  Rong Li,et al.  AI Coding: Learning to Construct Error Correction Codes , 2020, IEEE Transactions on Communications.

[8]  Rich Caruana,et al.  Removing the Genetics from the Standard Genetic Algorithm , 1995, ICML.

[9]  Ronald J. Williams,et al.  Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.

[10]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[11]  Song Y. Yan Number Theory for Computing , 2002, Springer Berlin Heidelberg.

[12]  Gilbert Syswerda,et al.  Uniform Crossover in Genetic Algorithms , 1989, ICGA.

[13]  Grace Xingxin Gao,et al.  Designing Low-Correlation GPS Spreading Codes via a Policy Gradient Reinforcement Learning Algorithm , 2020, Proceedings of the 33rd International Technical Meeting of the Satellite Division of The Institute of Navigation (ION GNSS+ 2020).

[14]  Zhan Guohua,et al.  Pseudonoise codes constructed by Legendre sequence , 2002 .

[15]  Marc Peter Deisenroth,et al.  Deep Reinforcement Learning: A Brief Survey , 2017, IEEE Signal Processing Magazine.

[16]  Joseph J. Rushanan,et al.  The Spreading and Overlay Codes for the L1C Signal , 2007 .

[17]  Hu Chuan-Gan,et al.  On The Shift Register Sequences , 2004 .

[18]  Jonathan P. How,et al.  Decision Making Under Uncertainty: Theory and Application , 2015 .

[19]  Lloyd R. Welch,et al.  Lower bounds on the maximum cross correlation of signals (Corresp.) , 1974, IEEE Trans. Inf. Theory.

[20]  Guenter W. Hein,et al.  Galileo E1 OS and GPS L1C Pseudo Random Noise Codes - Requirements, Generation, Optimization and Comparison - , 2007 .

[21]  Peter W. Glynn,et al.  Likelilood ratio gradient estimation: an overview , 1987, WSC '87.

[22]  Joseph J. Rushanan Weil Sequences: A Family of Binary Sequences with Good Correlation Properties , 2006, 2006 IEEE International Symposium on Information Theory.