Reinforcement Learning for Intelligent Healthcare Systems: A Comprehensive Survey

The rapid increase in the percentage of chronic disease patients along with the recent pandemic pose immediate threats on healthcare expenditure and elevate causes of death. This calls for transforming healthcare systems away from one-on-one patient treatment into intelligent health systems, to improve services, access and scalability, while reducing costs. Reinforcement Learning (RL) has witnessed an intrinsic breakthrough in solving a variety of complex problems for diverse applications and services. Thus, we conduct in this paper a comprehensive survey of the recent models and techniques of RL that have been developed/used for supporting Intelligent-healthcare (I-health) systems. This paper can guide the readers to deeply understand the state-of-the-art regarding the use of RL in the context of I-health. Specifically, we first present an overview for the I-health systems challenges, architecture, and how RL can benefit these systems. We then review the background and mathematical modeling of different RL, Deep RL (DRL), and multi-agent RL models. After that, we provide a deep literature review for the applications of RL in I-health systems. In particular, three main areas have been tackled, i.e., edge intelligence, smart core network, and dynamic treatment regimes. Finally, we highlight emerging challenges and outline future research directions in driving the future success of RL in I-health systems, which opens the door for exploring some interesting and unsolved problems.

[1]  Jiming Liu,et al.  Reinforcement Learning in Healthcare: A Survey , 2019, ACM Comput. Surv..

[2]  Jianfeng Ma,et al.  Privacy-Preserving Diverse Keyword Search and Online Pre-Diagnosis in Cloud Computing , 2022, IEEE Transactions on Services Computing.

[3]  Mohsen Guizani,et al.  Optimal User-Edge Assignment in Hierarchical Federated Learning Based on Statistical Properties and Network Topology Constraints , 2022, IEEE Transactions on Network Science and Engineering.

[4]  Mohsen Guizani,et al.  Deep Reinforcement Learning for Network Selection Over Heterogeneous Health Systems , 2022, IEEE Transactions on Network Science and Engineering.

[5]  Dae-Hyun Choi,et al.  Federated Reinforcement Learning for Energy Management of Multiple Smart Homes With Distributed Energy Resources , 2022, IEEE Transactions on Industrial Informatics.

[6]  Mohammad Reza Khayyambashi,et al.  HFDRL: An Intelligent Dynamic Cooperate Cashing Method Based on Hierarchical Federated Deep Reinforcement Learning in Edge-Enabled IoT , 2021, IEEE Internet of Things Journal.

[7]  Ahmed A. Abd El-Latif,et al.  Multiagent Federated Reinforcement Learning for Secure Incentive Mechanism in Intelligent Cyber–Physical Systems , 2021, IEEE Internet of Things Journal.

[8]  K. Letaief,et al.  Federated Multiagent Actor–Critic Learning for Age Sensitive Mobile-Edge Computing , 2020, IEEE Internet of Things Journal.

[9]  Qiong Wu,et al.  FedHome: Cloud-Edge Based Personalized Federated Learning for In-Home Health Monitoring , 2020, IEEE Transactions on Mobile Computing.

[10]  Victor Talpaert,et al.  Deep Reinforcement Learning for Autonomous Driving: A Survey , 2020, IEEE Transactions on Intelligent Transportation Systems.

[11]  Kruthi Doddabasappla,et al.  Statistical and Machine Learning-Based Recognition of Coughing Events Using Triaxial Accelerometer Sensor Data From Multiple Wearable Points , 2021, IEEE Sensors Letters.

[12]  Mounir Hamdi,et al.  The Frontiers of Deep Reinforcement Learning for Resource Management in Future Wireless HetNets: Techniques, Challenges, and Research Directions , 2021, IEEE Open Journal of the Communications Society.

[13]  Elias Yaacoub,et al.  I-SEE: Intelligent, Secure, and Energy-Efficient Techniques for Medical Data Transmission Using Deep Reinforcement Learning , 2021, IEEE Internet of Things Journal.

[14]  Zibin Zheng,et al.  Deep Reinforcement Learning for Internet of Things: A Comprehensive Survey , 2021, IEEE Communications Surveys & Tutorials.

[15]  Amr Mohamed,et al.  Active Learning With Noisy Labelers for Improving Classification Accuracy of Connected Vehicles , 2021, IEEE Transactions on Vehicular Technology.

[16]  Mugen Peng,et al.  Resource Allocation for Energy-Efficient MEC in NOMA-Enabled Massive IoT Networks , 2021, IEEE Journal on Selected Areas in Communications.

[17]  Madhusanka Liyanage,et al.  Survey on Network Slicing for Internet of Things Realization in 5G Networks , 2021, IEEE Communications Surveys & Tutorials.

[18]  Danda B. Rawat,et al.  Reinforcement Learning for IoT Security: A Comprehensive Survey , 2021, IEEE Internet of Things Journal.

[19]  Shiping Wen,et al.  A Resource-Constrained and Privacy-Preserving Edge-Computing-Enabled Clinical Decision System: A Federated Reinforcement Learning Approach , 2021, IEEE Internet of Things Journal.

[20]  Mohsen Guizani,et al.  MEdge-Chain: Leveraging Edge Computing and Blockchain for Efficient Medical Data Exchange , 2021, IEEE Internet of Things Journal.

[21]  Rodolfo da Silva Villaça,et al.  Securing E-Health Networks by applying Network Slicing and Blockchain Techniques , 2021, 2021 IEEE 18th Annual Consumer Communications & Networking Conference (CCNC).

[22]  Ekram Hossain,et al.  Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial , 2020, IEEE Communications Surveys & Tutorials.

[23]  Zhi Zhou,et al.  When Deep Reinforcement Learning Meets Federated Learning: Intelligent Multitimescale Resource Management for Multiaccess Edge Computing in 5G Ultradense Network , 2020, IEEE Internet of Things Journal.

[24]  P. Georgiou,et al.  Basal Glucose Control in Type 1 Diabetes Using Deep Reinforcement Learning: An In Silico Validation , 2020, IEEE Journal of Biomedical and Health Informatics.

[25]  Vijay Janapa Reddi,et al.  Deep Reinforcement Learning for Cyber Security , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[26]  Robert H. Deng,et al.  Privacy-Preserving Reinforcement Learning Design for Patient-Centric Dynamic Treatment Regimes , 2019, IEEE Transactions on Emerging Topics in Computing.

[27]  Nei Kato,et al.  Survey on Machine Learning for Intelligent End-to-End Communication Toward 6G: From Network Access, Routing to Traffic Control and Streaming Adaption , 2021, IEEE Communications Surveys & Tutorials.

[28]  Weihua Zhuang,et al.  Reinforcement Learning-Based Physical-Layer Authentication for Controller Area Networks , 2021, IEEE Transactions on Information Forensics and Security.

[29]  Paulo Valente Klaine,et al.  Efficient Handover Mechanism for Radio Access Network Slicing by Exploiting Distributed Learning , 2020, IEEE Transactions on Network and Service Management.

[30]  Wei Du,et al.  A survey on multi-agent deep reinforcement learning: from the perspective of challenges and applications , 2020, Artificial Intelligence Review.

[31]  Ying-Chang Liang,et al.  Device Association for RAN Slicing Based on Hybrid Federated Deep Reinforcement Learning , 2020, IEEE Transactions on Vehicular Technology.

[32]  Ye Wang,et al.  DeepSlicing: Deep Reinforcement Learning Assisted Resource Allocation for Network Slicing , 2020, GLOBECOM 2020 - 2020 IEEE Global Communications Conference.

[33]  Xiaorong Zhu,et al.  An End-to-End Network Slicing Algorithm Based on Deep Q-Learning for 5G Network , 2020, IEEE Access.

[34]  Yun Zhang,et al.  OIDPR: Optimized Insulin Dosage based on Privacy-Preserving Reinforcement Learning , 2020, 2020 IFIP Networking Conference (Networking).

[35]  Rong Liu,et al.  A review of medical artificial intelligence , 2020, Global Health Journal.

[36]  Kai Chen,et al.  Parallelizing Adam Optimizer with Blockwise Model-Update Filtering , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[37]  Amr Mohamed,et al.  ssHealth: Toward Secure, Blockchain-Enabled Healthcare Systems , 2020, IEEE Network.

[38]  Yue Gao,et al.  Multi-agent reinforcement learning for resource allocation in IoT networks with edge computing , 2020, China Communications.

[39]  Kevin I-Kai Wang,et al.  Deep-Learning-Enhanced Human Activity Recognition for Internet of Healthcare Things , 2020, IEEE Internet of Things Journal.

[40]  Marcello Ienca,et al.  On the responsible use of digital data to tackle the COVID-19 pandemic , 2020, Nature Medicine.

[41]  Alex Mathew,et al.  Network Slicing in 5G and the Security Concerns , 2020, 2020 Fourth International Conference on Computing Methodologies and Communication (ICCMC).

[42]  Francesco Malandrino,et al.  Active Learning-based Classification in Automated Connected Vehicles , 2020, IEEE INFOCOM 2020 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[43]  Athanasios V. Vasilakos,et al.  The Future of Healthcare Internet of Things: A Survey of Emerging Technologies , 2020, IEEE Communications Surveys & Tutorials.

[44]  Mohsen Guizani,et al.  Performance Evaluation of Hyperledger Fabric , 2020, 2020 IEEE International Conference on Informatics, IoT, and Enabling Technologies (ICIoT).

[45]  Zhen Xiao,et al.  Learning Agent Communication under Limited Bandwidth by Message Pruning , 2019, AAAI.

[46]  V. Cevher,et al.  Optimization for Reinforcement Learning: From a single agent to cooperative agents , 2019, IEEE Signal Processing Magazine.

[47]  Bo An,et al.  Learning Efficient Multi-agent Communication: An Information Bottleneck Approach , 2019, ICML.

[48]  Yue Tan,et al.  Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges , 2019, IEEE Communications Surveys & Tutorials.

[49]  Jonathan P. How,et al.  R-MADDPG for Partially Observable Environments and Limited Communication , 2019, ArXiv.

[50]  Lenan Wu,et al.  Power Allocation in Multi-User Cellular Networks: Deep Reinforcement Learning Approaches , 2019, IEEE Transactions on Wireless Communications.

[51]  Saeid Nahavandi,et al.  Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications , 2018, IEEE Transactions on Cybernetics.

[52]  Michael R. Kosorok,et al.  Estimating Dynamic Treatment Regimes in Mobile Health Using V-Learning , 2016, Journal of the American Statistical Association.

[53]  Mina Sartipi,et al.  Reinforcement Learning Interpretation Methods: A Survey , 2020, IEEE Access.

[54]  Dimitrios Makrakis,et al.  A Survey on Blockchain-Based Self-Sovereign Patient Identity in Healthcare , 2020, IEEE Access.

[55]  Mohsen Guizani,et al.  Edge computing for energy-efficient smart health systems , 2020 .

[56]  Nguyen H. Tran,et al.  Network Slicing: Recent Advances, Taxonomy, Requirements, and Open Research Challenges , 2020, IEEE Access.

[57]  Nei Kato,et al.  Machine Learning Meets Computation and Communication Control in Evolving Edge and Cloud: Challenges and Future Perspective , 2020, IEEE Communications Surveys & Tutorials.

[58]  Tao Han,et al.  When Network Slicing meets Deep Reinforcement Learning , 2019, CoNEXT Companion.

[59]  Vijay Kumar Chaurasiya,et al.  Reinforcement Learning Based Energy Management in Wireless Body Area Network: A Survey , 2019, 2019 IEEE Conference on Information and Communication Technology.

[60]  Dimosthenis Kyriazis,et al.  An Innovative eHealth System Powered By 5G Network Slicing , 2019, 2019 Sixth International Conference on Internet of Things: Systems, Management and Security (IOTSMS).

[61]  Matthew E. Taylor,et al.  A survey and critique of multiagent deep reinforcement learning , 2019, Autonomous Agents and Multi-Agent Systems.

[62]  Mazen O. Hasna,et al.  On the Performance of Tactical Communication Interception Using Military Full Duplex Radios , 2019, 2019 IEEE 30th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC).

[63]  Muntasir Raihan Rahman,et al.  Deep Reinforcement Learning for Network Slicing with Heterogeneous Resource Requirements and Time Varying Traffic Dynamics , 2019, 2019 15th International Conference on Network and Service Management (CNSM).

[64]  Jian Song,et al.  Deep Reinforcement Learning-Enabled Secure Visible Light Communication Against Eavesdropping , 2019, IEEE Transactions on Communications.

[65]  Ke Zhang,et al.  Computation Offloading and Resource Allocation For Cloud Assisted Mobile Edge Computing in Vehicular Networks , 2019, IEEE Transactions on Vehicular Technology.

[66]  Shimon Whiteson,et al.  A Survey of Reinforcement Learning Informed by Natural Language , 2019, IJCAI.

[67]  Hyuk Lim,et al.  Reinforcement Learning Based Resource Management for Network Slicing , 2019, Applied Sciences.

[68]  Chao Yu,et al.  Deep Inverse Reinforcement Learning for Sepsis Treatment , 2019, 2019 IEEE International Conference on Healthcare Informatics (ICHI).

[69]  Liang Xiao,et al.  Learning-Based Privacy-Aware Offloading for Healthcare IoT With Energy Harvesting , 2019, IEEE Internet of Things Journal.

[70]  Yung Yi,et al.  QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning , 2019, ICML.

[71]  Ruili Wang,et al.  A Survey on an Emerging Area: Deep Learning for Smart City Data , 2019, IEEE Transactions on Emerging Topics in Computational Intelligence.

[72]  Ekram Hossain,et al.  A Deep Q-Learning Method for Downlink Power Allocation in Multi-Cell Networks , 2019, ArXiv.

[73]  Ausif Mahmood,et al.  Review of Deep Learning Algorithms and Architectures , 2019, IEEE Access.

[74]  Yan Xu,et al.  Q-Learning Based Physical-Layer Secure Game Against Multiagent Attacks , 2019, IEEE Access.

[75]  Sotiris Karabetsos,et al.  A Review of Machine Learning and IoT in Smart Transportation , 2019, Future Internet.

[76]  Dimosthenis Kyriazis,et al.  Internet of Medical Things (IoMT): Acquiring and Transforming Data into HL7 FHIR through 5G Network Slicing , 2019, Emerging Science Journal.

[77]  Khaled A. Harras,et al.  EdgeHealth: An Energy-Efficient Edge-based Remote mHealth Monitoring System , 2019, 2019 IEEE Wireless Communications and Networking Conference (WCNC).

[78]  Amr Mohamed,et al.  Edge Computing for Smart Health: Context-Aware Approaches, Opportunities, and Challenges , 2019, IEEE Network.

[79]  Amr Mohamed,et al.  Edge-based compression and classification for smart healthcare systems: Concept, implementation and evaluation , 2019, Expert Syst. Appl..

[80]  Zhu Han,et al.  Intelligent User-Centric Network Selection: A Model-Driven Reinforcement Learning Framework , 2019, IEEE Access.

[81]  Félix J. García Clemente,et al.  Dynamic network slicing management of multimedia scenarios for future remote healthcare , 2019, Multimedia Tools and Applications.

[82]  Dorin Comaniciu,et al.  Multi-Scale Deep Reinforcement Learning for Real-Time 3D-Landmark Detection in CT Scans , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[83]  Kai-Kit Wong,et al.  UAV-Assisted Relaying and Edge Computing: Scheduling and Trajectory Optimization , 2018, IEEE Transactions on Wireless Communications.

[84]  Ying-Chang Liang,et al.  Applications of Deep Reinforcement Learning in Communications and Networking: A Survey , 2018, IEEE Communications Surveys & Tutorials.

[85]  Ekram Hossain,et al.  Deep Learning for Radio Resource Allocation in Multi-Cell Networks , 2018, IEEE Network.

[86]  Kobi Cohen,et al.  Deep Multi-User Reinforcement Learning for Distributed Dynamic Spectrum Access , 2017, IEEE Transactions on Wireless Communications.

[87]  J. Riekki,et al.  EdgeAI: A Vision for Distributed, Edge-native Artificial Intelligence in Future 6G Networks , 2019 .

[88]  Sebastian Canovas-Carrasco,et al.  A Reinforcement Learning-Based Framework for the Exploitation of Multiple Rats in the IoT , 2019, IEEE Access.

[89]  Andrew Slavin Ross,et al.  Improving Sepsis Treatment Strategies by Combining Deep and Kernel-Based Reinforcement Learning , 2018, AMIA.

[90]  Yiyang Pei,et al.  Deep Reinforcement Learning for User Association and Resource Allocation in Heterogeneous Networks , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[91]  Hiroaki Harai,et al.  Adaptive Virtual Network Slices for Diverse IoT Services , 2018, IEEE Communications Standards Magazine.

[92]  Tao Jiang,et al.  Deep Reinforcement Learning for Mobile Edge Caching: Review, New Features, and Open Issues , 2018, IEEE Network.

[93]  Yu Zhang,et al.  Intelligent Cloud Resource Management with Deep Reinforcement Learning , 2018, IEEE Cloud Computing.

[94]  Aldo A. Faisal,et al.  The Artificial Intelligence Clinician learns optimal treatment strategies for sepsis in intensive care , 2018, Nature Medicine.

[95]  Xiaoqing Han,et al.  Review on the research and practice of deep learning and reinforcement learning in smart grids , 2018, CSEE Journal of Power and Energy Systems.

[96]  Dongning Guo,et al.  Deep Reinforcement Learning for Distributed Dynamic Power Allocation in Wireless Networks , 2018, ArXiv.

[97]  Rashid Mehmood,et al.  UbeHealth: A Personalized Ubiquitous Cloud and Edge-Enabled Networked Healthcare System for Smart Cities , 2018, IEEE Access.

[98]  Matthieu Komorowski,et al.  The Actor Search Tree Critic (ASTC) for Off-Policy POMDP Learning in Medical Decision Making , 2018, ArXiv.

[99]  Mariette Awad,et al.  Decision Making in Multiagent Systems: A Survey , 2018, IEEE Transactions on Cognitive and Developmental Systems.

[100]  Amr Mohamed,et al.  EEG-Based Transceiver Design With Data Decomposition for Healthcare IoT Applications , 2018, IEEE Internet of Things Journal.

[101]  Jianye Hao,et al.  Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning , 2018, IEEE Transactions on Software Engineering.

[102]  J. Shaw,et al.  IDF Diabetes Atlas: Global estimates of diabetes prevalence for 2017 and projections for 2045. , 2018, Diabetes research and clinical practice.

[103]  Shimon Whiteson,et al.  QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning , 2018, ICML.

[104]  Xiaodong Lin,et al.  Efficient and Secure Service-Oriented Authentication Supporting Network Slicing for 5G-Enabled IoT , 2018, IEEE Journal on Selected Areas in Communications.

[105]  Qi Zhang,et al.  Towards 5G Enabled Tactile Robotic Telesurgery , 2018, ArXiv.

[106]  Wei Wang,et al.  Context-aware reinforcement learning-based mobile cloud computing for telemonitoring , 2018, 2018 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI).

[107]  Jiachen Yang,et al.  Precision medicine as a control problem: Using simulation and deep reinforcement learning to discover adaptive, personalized multi-cytokine therapy for sepsis , 2018, ArXiv.

[108]  Erdogan Dogdu,et al.  Context-Aware Computing, Learning, and Big Data in Internet of Things: A Survey , 2018, IEEE Internet of Things Journal.

[109]  Gang Cao,et al.  AIF: An Artificial Intelligence Framework for Smart Wireless Network Management , 2018, IEEE Communications Letters.

[110]  Yan Zhang,et al.  Mobile Edge Computing: A Survey , 2018, IEEE Internet of Things Journal.

[111]  Xiang Chen,et al.  Security in Mobile Edge Caching with Reinforcement Learning , 2018, IEEE Wireless Communications.

[112]  Jianhong Zhou,et al.  Smart Multi-RAT Access Based on Multiagent Reinforcement Learning , 2018, IEEE Transactions on Vehicular Technology.

[113]  Yaping Lin,et al.  Privacy-Preserving Search Over Encrypted Personal Health Record In Multi-Source Cloud , 2018, IEEE Access.

[114]  Amr Mohamed,et al.  User-Centric Networks Selection With Adaptive Data Compression for Smart Health , 2018, IEEE Systems Journal.

[115]  Mustafa Cenk Gursoy,et al.  A deep reinforcement learning-based framework for content caching , 2017, 2018 52nd Annual Conference on Information Sciences and Systems (CISS).

[116]  Zhi Chen,et al.  Intelligent Power Control for Spectrum Sharing in Cognitive Radios: A Deep Reinforcement Learning Approach , 2017, IEEE Access.

[117]  Amir Hussain,et al.  Applications of Deep Learning and Reinforcement Learning to Biological Data , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[118]  D. Bertsekas Reinforcement Learning and Optimal ControlA Selective Overview , 2018 .

[119]  Weitong Chen,et al.  Treatment Recommendation in Critical Care: A Scalable and Interpretable Approach in Partially Observable Health States , 2018, ICIS.

[120]  Jen-Tzung Chien,et al.  Deep reinforcement learning for automated radiation adaptation in lung cancer , 2017, Medical physics.

[121]  Peter Szolovits,et al.  Deep Reinforcement Learning for Sepsis Treatment , 2017, ArXiv.

[122]  M. Deisenroth,et al.  Deep Reinforcement Learning: A Brief Survey , 2017, IEEE Signal Processing Magazine.

[123]  Nader Meskin,et al.  Reinforcement learning-based control of drug dosing for cancer chemotherapy treatment. , 2017, Mathematical biosciences.

[124]  Andrew G. Lamperski,et al.  Seizure Control in a Computational Model Using a Reinforcement Learning Stimulation Paradigm , 2017, Int. J. Neural Syst..

[125]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[126]  Walid Saad,et al.  Machine Learning for Wireless Networks with Artificial Intelligence: A Tutorial on Neural Networks , 2017, ArXiv.

[127]  Victor C. M. Leung,et al.  Deep-Reinforcement-Learning-Based Optimization for Cache-Enabled Opportunistic Interference Alignment Wireless Networks , 2017, IEEE Transactions on Vehicular Technology.

[128]  Anil A. Bharath,et al.  Deep Reinforcement Learning: A Brief Survey , 2017, IEEE Signal Processing Magazine.

[129]  Alec Radford,et al.  Proximal Policy Optimization Algorithms , 2017, ArXiv.

[130]  Langford B. White,et al.  Reinforcement Learning With Network-Assisted Feedback for Heterogeneous RAT Selection , 2017, IEEE Transactions on Wireless Communications.

[131]  Yi Wu,et al.  Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments , 2017, NIPS.

[132]  Amr Mohamed,et al.  Concurrent association in heterogeneous networks with underlay D2D communication , 2017, 2017 13th International Wireless Communications and Mobile Computing Conference (IWCMC).

[133]  Peter Szolovits,et al.  Continuous State-Space Models for Optimal Sepsis Treatment: a Deep Reinforcement Learning Approach , 2017, MLHC.

[134]  Jing Wang,et al.  A deep reinforcement learning based framework for power-efficient resource allocation in cloud RANs , 2017, 2017 IEEE International Conference on Communications (ICC).

[135]  Trevor N. Mudge,et al.  Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge , 2017, ASPLOS.

[136]  Noe Casas,et al.  Deep Deterministic Policy Gradient for Urban Traffic Light Control , 2017, ArXiv.

[137]  Jonathan P. How,et al.  Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability , 2017, ICML.

[138]  Abbas Ahmadi,et al.  Simulation-based optimization of radiotherapy: Agent-based modeling and reinforcement learning , 2017, Math. Comput. Simul..

[139]  Amr Mohamed,et al.  Network Association with Dynamic Pricing over D2D-Enabled Heterogeneous Networks , 2017, 2017 IEEE Wireless Communications and Networking Conference (WCNC).

[140]  Xu Zhao,et al.  Context-Associative Hierarchical Memory Model for Human Activity Recognition and Prediction , 2017, IEEE Transactions on Multimedia.

[141]  Yuxi Li,et al.  Deep Reinforcement Learning: An Overview , 2017, ArXiv.

[142]  Maruf Pasha,et al.  Survey of Machine Learning Algorithms for Disease Diagnostic , 2017 .

[143]  Guang-Zhong Yang,et al.  Deep Learning for Health Informatics , 2017, IEEE Journal of Biomedical and Health Informatics.

[144]  Michael R. Kosorok,et al.  Robust Hybrid Learning for Estimating Personalized Dynamic Treatment Regimens , 2016, 1611.02314.

[145]  Rastin Pries,et al.  Network as a Service - A Demo on 5G Network Slicing , 2016, 2016 28th International Teletraffic Congress (ITC 28).

[146]  Elad Hazan,et al.  Introduction to Online Convex Optimization , 2016, Found. Trends Optim..

[147]  Shimon Whiteson,et al.  Learning to Communicate with Deep Multi-Agent Reinforcement Learning , 2016, NIPS.

[148]  Pieter Abbeel,et al.  Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.

[149]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[150]  Weihua Zhuang,et al.  PHY-Layer Spoofing Detection With Reinforcement Learning in Wireless Networks , 2016, IEEE Transactions on Vehicular Technology.

[151]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[152]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[153]  M. Shamim Hossain,et al.  Software defined healthcare networks , 2015, IEEE Wireless Communications.

[154]  Chao Xu,et al.  Automated OS-level Device Runtime Power Management , 2015, ASPLOS.

[155]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[156]  M R Kosorok,et al.  Penalized Q-Learning for Dynamic Treatment Regimens. , 2011, Statistica Sinica.

[157]  Russ Greiner,et al.  Budgeted Learning for Developing Personalized Treatment , 2014, 2014 13th International Conference on Machine Learning and Applications.

[158]  Eric B. Laber,et al.  Interactive model building for Q-learning. , 2014, Biometrika.

[159]  Wassim M. Haddad,et al.  Closed-loop control of anesthesia and mean arterial pressure using reinforcement learning , 2014, 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL).

[160]  Weihua Zhuang,et al.  Spoofing Detection with Reinforcement Learning in Wireless Networks , 2014, 2015 IEEE Global Communications Conference (GLOBECOM).

[161]  Tim Dallas,et al.  Feature Selection and Activity Recognition System Using a Single Triaxial Accelerometer , 2014, IEEE Transactions on Biomedical Engineering.

[162]  Anastasios A. Tsiatis,et al.  Q- and A-learning Methods for Estimating Optimal Dynamic Treatment Regimes , 2012, Statistical science : a review journal of the Institute of Mathematical Statistics.

[163]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[164]  W. Art Chaovalitwongse,et al.  Online Seizure Prediction Using an Adaptive Learning Approach , 2013, IEEE Transactions on Knowledge and Data Engineering.

[165]  A. Aldo Faisal,et al.  The use of reinforcement learning algorithms to meet the challenges of an artificial pancreas , 2013, Expert review of medical devices.

[166]  J. Vincent Critical care - where have we been and where are we going? , 2013, Critical Care.

[167]  W. Haddad,et al.  Clinical Decision Support and Closed‐Loop Control for Intensive Care Unit Sedation , 2013 .

[168]  Hideki Asoh,et al.  Modeling Medical Records of Diabetes using Markov Decision Processes , 2013 .

[169]  B. Chakraborty,et al.  Statistical Methods for Dynamic Treatment Regimes: Reinforcement Learning, Causal Inference, and Personalized Medicine , 2013 .

[170]  Houssam Abbas,et al.  Convergence proofs for Simulated Annealing falsification of safety properties , 2012, 2012 50th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[171]  Sriram Sankaranarayanan,et al.  Falsification of temporal properties of hybrid systems using the cross-entropy method , 2012, HSCC '12.

[172]  Tamer A. ElBatt,et al.  Distributed Cooperative Q-Learning for Power Allocation in Cognitive Femtocell Networks , 2012, 2012 IEEE Vehicular Technology Conference (VTC Fall).

[173]  Larry D. Pyeatt,et al.  An Adaptive Neural Network Filter for Improved Patient State Estimation in Closed-Loop Anesthesia Control , 2011, 2011 IEEE 23rd International Conference on Tools with Artificial Intelligence.

[174]  Jooyoung Park,et al.  Drug scheduling of cancer chemotherapy based on natural actor-critic approach , 2011, Biosyst..

[175]  Alaa Awad,et al.  Energy-aware routing for delay-sensitive applications over wireless multihop mesh networks , 2011, 2011 7th International Wireless Communications and Mobile Computing Conference.

[176]  Cosmin Paduraru,et al.  Adaptive Control of Epileptic Seizures using Reinforcement Learning , 2010 .

[177]  Dusit Niyato,et al.  Dynamics of Network Selection in Heterogeneous Wireless Networks: An Evolutionary Game Approach , 2009, IEEE Transactions on Vehicular Technology.

[178]  Ali Karimpour,et al.  Agent-based Simulation for Blood Glucose , 2009 .

[179]  M. Kosorok,et al.  Reinforcement learning design for cancer clinical trials , 2009, Statistics in medicine.

[180]  Joelle Pineau,et al.  Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning , 2008, AAAI.

[181]  Stefan Schaal,et al.  Natural Actor-Critic , 2003, Neurocomputing.

[182]  Nasser Sadati,et al.  Multivariable Anesthesia Control Using Reinforcement Learning , 2006, 2006 IEEE International Conference on Systems, Man and Cybernetics.

[183]  T. Macdonald Preventing Chronic Diseases: A Vital Investment , 2006 .

[184]  M.K. Muezzinoglu,et al.  Reinforcement learning approach to individualization of chronic pharmacotherapy , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[185]  Pierre Geurts,et al.  Tree-Based Batch Mode Reinforcement Learning , 2005, J. Mach. Learn. Res..

[186]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[187]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[188]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[189]  Alexander J. Smola,et al.  Support Vector Method for Function Approximation, Regression Estimation and Signal Processing , 1996, NIPS.

[190]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .