Paper information - Efficient Navigation of Active Particles in an Unseen Environment via Deep Reinforcement Learning

Efficient Navigation of Active Particles in an Unseen Environment via Deep Reinforcement Learning

Equipping active particles with intelligence such that they can efficiently navigate in an unknown complex environment is essential for emerging applications like precision surgery and targeted drug delivery. Here we develop a deep reinforcement learning algorithm that can train active particles to navigate in environments with random obstacles. Through numerical experiments, we show that the trained particle agent learns to make navigation decisions regarding both obstacle avoidance and travel time minimization, relying only on local pixel-level sensory inputs and not on prior knowledge of the entire environment. In unseen complex obstacle environments, the trained particle agent can navigate nearly optimally over arbitrarily long distances at a fixed computational cost. This study illustrates the potential of employing artificial intelligence to bridge the gap between active particle engineering and emerging real-world applications.

Bo Li | Yuguang Yang | Michael A. Bevan

[1] T. Mallouk, et al. Powering nanorobots, 2009, Scientific American.

[2] Joseph Wang, et al. Micro/nanorobots for biomedicine: Delivery, surgery, sensing, and detoxification, 2017, Science Robotics.

[3] Demis Hassabis, et al. Mastering the game of Go without human knowledge, 2017, Nature.

[4] Aldo A. Faisal, et al. The Artificial Intelligence Clinician learns optimal treatment strategies for sepsis in intensive care, 2018, Nature Medicine.

[5] Joseph Wang, et al. Rocket Science at the Nanoscale, 2016, ACS Nano.

[6] Samuel Sánchez, et al. Chemically powered micro- and nanomotors, 2015, Angewandte Chemie.

[7] Guigang Zhang, et al. Deep Learning, 2016, Int. J. Semantic Comput.

[8] Geoffrey E. Hinton, et al. Visualizing Data using t-SNE, 2008.

[9] Syn Schmitt, et al. External control strategies for self-propelled particles: Optimizing navigational efficiency in the presence of limited resources, 2016, Physical Review E.

[10] Michael A. Bevan, et al. Optimal Feedback Controlled Assembly of Perfect Crystals, 2016, ACS Nano.

[11] Michael A. Bevan, et al. Interfacial colloidal rod dynamics: Coefficients, simulations, and analysis, 2017, The Journal of Chemical Physics.