Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning

This paper highlights the crucial role that modern machine learning techniques can play in the optimization of treatment strategies for patients with chronic disorders. In particular, we focus on the task of optimizing a deep-brain stimulation strategy for the treatment of epilepsy. The challenge is to choose which stimulation action to apply, as a function of the observed EEG signal, so as to minimize the frequency and duration of seizures. We apply recent techniques from the reinforcement learning literature--namely fitted Q-iteration and extremely randomized trees--to learn an optimal stimulation policy using labeled training data from animal brain tissues. Our result, show that these methods are an effective means of reducing tile incidence of seizures, while also minimizing the amount ot stimulation applied. If these results Carry over to the human model of epilepsy, the impact for patients will be substantial.

[1]  W. Hauser,et al.  Epilepsy: Frequency Causes and Consequences , 1990 .

[2]  Gerald Tesauro,et al.  Temporal difference learning and TD-Gammon , 1995, CACM.

[3]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[4]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[5]  Geoffrey J. Gordon,et al.  Approximate solutions to markov decision processes , 1999 .

[6]  J. White,et al.  Epilepsy in Small-World Networks , 2004, The Journal of Neuroscience.

[7]  D. Spencer,et al.  Effect of an External Responsive Neurostimulator on Seizures and Electrographic Discharges during Subdural Electrode Monitoring , 2004, Epilepsia.

[8]  B. Uthman,et al.  Effectiveness of vagus nerve stimulation in epilepsy patients , 2004, Neurology.

[9]  Pierre Geurts,et al.  Tree-Based Batch Mode Reinforcement Learning , 2005, J. Mach. Learn. Res..

[10]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 2005, IEEE Transactions on Neural Networks.

[11]  M. Avoli,et al.  Repetitive low-frequency stimulation reduces epileptiform synchronization in limbic neuronal networks , 2005, Neurobiology of Disease.

[12]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[13]  Louis Wehenkel,et al.  Clinical data based optimal STI strategies for HIV: a reinforcement learning approach , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[14]  Liming Xiang,et al.  Kernel-Based Reinforcement Learning , 2006, ICIC.

[15]  S. Vilar,et al.  Probabilistic neural network model for the in silico evaluation of anti-HIV activity and mechanism of action. , 2006, Journal of medicinal chemistry.

[16]  Peter Stone,et al.  Batch reinforcement learning in a complex domain , 2007, AAMAS '07.

[17]  S. Murphy,et al.  Methodological Challenges in Constructing Effective Treatment Sequences for Chronic Psychiatric Disorders , 2007, Neuropsychopharmacology.