A Particle Filter-Based Reinforcement Learning Approach for Reliable Wireless Indoor Positioning

Positioning is envisioned as an essential enabler of future fifth generation (5G) mobile networks due to the massive number of use cases that would benefit from knowing users’ positions. In this work, we propose a particle filter-based reinforcement learning (PFRL) approach for the robust wireless indoor positioning system. Our algorithm integrates information of indoor zone prediction, inertial measurement units, wireless radio-based ranging, and floor plan into an particle filter. The zone prediction method is designed with an ensemble learning algorithm by integrating individual discriminative learning methods and Hidden Markov Models. Further, we integrate the particle filter approach with a reinforcement learning-based resampling method to provide robustness against localization failure problems such as the kidnapping robot problem. The PFRL approach is validated on a two-tier architecture, in which distributed machine learning tasks are hosted at client and edge layer. Experiment results show that our system outperforms traditional terminal-based approaches in both stability and accuracy.

[1]  Lei Yu,et al.  Calibration-free fusion of step counter and wireless fingerprints for indoor localization , 2015, UbiComp.

[2]  S. Seidel,et al.  914 MHz path loss prediction models for indoor wireless communications in multifloored buildings , 1992 .

[3]  Zan Li,et al.  A real-time robust indoor tracking system in smartphones , 2018, Comput. Commun..

[4]  Otto Spaniol,et al.  Indoor Positioning UsingWireless Local Area Networks , 2006, IEEE John Vincent Atanasoff 2006 International Symposium on Modern Computing (JVA'06).

[5]  Paramvir Bahl,et al.  RADAR: an in-building RF-based user location and tracking system , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[6]  Jie Wang,et al.  Differential radio map-based robust indoor localization , 2011, EURASIP J. Wirel. Commun. Netw..

[7]  Pengfei Wang,et al.  Research on WiFi Indoor Location Algorithm Based on RSSI Ranging , 2017, 2017 4th International Conference on Information Science and Control Engineering (ICISCE).

[8]  Hamed Haddadi,et al.  Deep Learning in Mobile and Wireless Networking: A Survey , 2018, IEEE Communications Surveys & Tutorials.

[9]  Ruizhi Chen,et al.  A Hybrid Smartphone Indoor Positioning Solution for Mobile LBS , 2012, Sensors.

[10]  G. Kitagawa Monte Carlo Filter and Smoother for Non-Gaussian Nonlinear State Space Models , 1996 .

[11]  Haiyong Luo,et al.  Discriminative Learning-based Smartphone Indoor Localization , 2018, ArXiv.

[12]  Eckehard Steinbach,et al.  Graph-based data fusion of pedometer and WiFi measurements for mobile indoor positioning , 2014, UbiComp.

[13]  Simon J. Godsill,et al.  On sequential Monte Carlo sampling methods for Bayesian filtering , 2000, Stat. Comput..

[14]  Jun S. Liu,et al.  Sequential Monte Carlo methods for dynamic systems , 1997 .

[15]  Wolfram Burgard,et al.  Monte Carlo Localization: Efficient Position Estimation for Mobile Robots , 1999, AAAI/IAAI.

[16]  V JoséLuisCarrera,et al.  Room Recognition Using Discriminative Ensemble Learning with Hidden Markov Models for Smartphones , 2018 .

[17]  Takeshi Kurata,et al.  Indoor/Outdoor Pedestrian Navigation with an Embedded GPS/RFID/Self-contained Sensor System , 2006, ICAT.

[18]  Fredrik Gustafsson,et al.  On Resampling Algorithms for Particle Filters , 2006, 2006 IEEE Nonlinear Statistical Signal Processing Workshop.

[19]  Zhao Zhang,et al.  WaP: Indoor localization and tracking using WiFi-Assisted Particle filter , 2014, 39th Annual IEEE Conference on Local Computer Networks.

[20]  Sebastian Thrun,et al.  FastSLAM 2.0: An Improved Particle Filtering Algorithm for Simultaneous Localization and Mapping that Provably Converges , 2003, IJCAI.

[21]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[22]  W. Burgard,et al.  Markov Localization for Mobile Robots in Dynamic Environments , 1999, J. Artif. Intell. Res..

[23]  Moustafa Youssef,et al.  SemanticSLAM: Using Environment Landmarks for Unsupervised Indoor Localization , 2016, IEEE Transactions on Mobile Computing.

[24]  Shueng-Han Gary Chan,et al.  Wi-Fi Fingerprint-Based Indoor Positioning: Recent Advances and Comparisons , 2016, IEEE Communications Surveys & Tutorials.

[25]  Zan Li,et al.  A Real-time Indoor Tracking System in Smartphones , 2016, MSWiM.

[26]  Zan Li,et al.  A passive WiFi source localization system based on fine-grained power-based trilateration , 2015, 2015 IEEE 16th International Symposium on A World of Wireless, Mobile and Multimedia Networks (WoWMoM).

[27]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[28]  Srikanth Kandula,et al.  Resource Management with Deep Reinforcement Learning , 2016, HotNets.

[29]  Shunsuke Kamijo,et al.  Pedestrian dead reckoning for mobile phones through walking and running mode recognition , 2013, 16th International IEEE Conference on Intelligent Transportation Systems (ITSC 2013).

[30]  Shamim Nemati,et al.  Optimal medication dosing from suboptimal clinical examples: A deep reinforcement learning approach , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[31]  Zan Li,et al.  A time-based passive source localization system for narrow-band signal , 2015, 2015 IEEE International Conference on Communications (ICC).

[32]  Xiaohui Zhao,et al.  A Narrow-Band Indoor Positioning System by Fusing Time and Received Signal Strength via Ensemble Learning , 2018, IEEE Access.

[33]  Hiroyuki Yokoyama,et al.  Estimating Position Relation between Two Pedestrians Using Mobile Phones , 2012, Pervasive.

[34]  Jr. G. Forney,et al.  The viterbi algorithm , 1973 .

[35]  Alexey Melnikov,et al.  The WebSocket Protocol , 2011, RFC.

[36]  Gabriel Curio,et al.  MACHINE LEARNING TECHNIQUES FOR BRAIN-COMPUTER INTERFACES , 2004 .