Multiple Landmark Detection using Multi-Agent Reinforcement Learning

The detection of anatomical landmarks is a vital step for medical image analysis and applications for diagnosis, interpretation and guidance. Manual annotation of landmarks is a tedious process that requires domain-specific expertise and introduces inter-observer variability. This paper proposes a new detection approach for multiple landmarks based on multi-agent reinforcement learning. Our hypothesis is that the position of all anatomical landmarks is interdependent and non-random within the human anatomy, thus finding one landmark can help to deduce the location of others. Using a Deep Q-Network (DQN) architecture we construct an environment and agent with implicit inter-communication such that we can accommodate K agents acting and learning simultaneously, while they attempt to detect K different landmarks. During training the agents collaborate by sharing their accumulated knowledge for a collective gain. We compare our approach with state-of-the-art architectures and achieve significantly better accuracy by reducing the detection error by 50%, while requiring fewer computational resources and time to train compared to the näıve approach of training K agents separately.

[1]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[2]  Shimon Whiteson,et al.  QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning , 2018, ICML.

[3]  Isabelle Bloch,et al.  Multi-organ localization with cascaded global-to-local regression and shape prior , 2015, Medical Image Anal..

[4]  D. Rueckert,et al.  Stratified Decision Forests for Accurate Anatomical Landmark Localization in Cardiac Images. , 2017, IEEE transactions on medical imaging.

[5]  Dorin Comaniciu,et al.  Multi-Scale Deep Reinforcement Learning for Real-Time 3D-Landmark Detection in CT Scans , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Loïc Le Folgoc,et al.  Evaluating reinforcement learning agents for anatomical landmark detection , 2019, Medical Image Anal..

[7]  Dorin Comaniciu,et al.  3D Deep Learning for Efficient and Robust Landmark Detection in Volumetric Data , 2015, MICCAI.

[8]  Michael I. Jordan,et al.  Reinforcement Learning Algorithm for Partially Observable Markov Decision Problems , 1994, NIPS.

[9]  Mykel J. Kochenderfer,et al.  Cooperative Multi-agent Control Using Deep Reinforcement Learning , 2017, AAMAS Workshops.

[10]  J. Alison Noble,et al.  Image Analysis Using Machine Learning: Anatomical Landmarks Detection in Fetal Ultrasound Images , 2012, 2012 IEEE 36th Annual Computer Software and Applications Conference.

[11]  Shimon Whiteson,et al.  Learning with Opponent-Learning Awareness , 2017, AAMAS.

[12]  Yann LeCun,et al.  Signature Verification Using A "Siamese" Time Delay Neural Network , 1993, Int. J. Pattern Recognit. Artif. Intell..

[13]  Bishesh Khanal,et al.  Fast Multiple Landmark Localisation Using a Patch-based Iterative Network , 2018, MICCAI.

[14]  Daniel Rueckert,et al.  Population-based studies of myocardial hypertrophy: high resolution cardiovascular magnetic resonance atlases improve statistical power , 2014, Journal of Cardiovascular Magnetic Resonance.

[15]  Shimon Whiteson,et al.  Learning to Communicate with Deep Multi-Agent Reinforcement Learning , 2016, NIPS.

[16]  Dorin Comaniciu,et al.  An Artificial Agent for Anatomical Landmark Detection in Medical Images , 2016, MICCAI.

[17]  Mohammad Reza Emami,et al.  Concurrent Markov decision processes for robot team learning , 2015, Eng. Appl. Artif. Intell..

[18]  Nick C Fox,et al.  The Alzheimer's disease neuroimaging initiative (ADNI): MRI methods , 2008, Journal of magnetic resonance imaging : JMRI.

[19]  Loïc Le Folgoc,et al.  Automatic View Planning with Multi-scale Deep Reinforcement Learning Agents , 2018, MICCAI.