论文信息 - Self-organizing Synchronicity and Desynchronicity using Reinforcement Learning

Self-organizing Synchronicity and Desynchronicity using Reinforcement Learning

We present a self-organizing reinforcement learning (RL) approach for coordinating the wake-up cycles of nodes in a wireless sensor network in a decentralized manner. To the best of our knowledge we are the first to demonstrate how global synchronicity and desynchronicity can emerge through local interactions alone without the need of central mediator or any form of explicit coordination. We apply this RL approach to wireless sensor nodes arranged in different topologies and study how agents, starting with a random policy, are able to self-adapt their behavior based only on their interaction with neighboring nodes. Each agent independently learns to which nodes it should synchronize to improve message throughput and at the same with whom to desynchronize in order to reduce communication interference. The obtained results show how simple and computationally bounded sensor nodes are able to coordinate their wake-up cycles in a distributed way in order to improve the global system performance through (de)synchronicity.

Ann Nowé | Karl Tuyls | Mihail Mihaylov | Yann-Aël Le Borgne

[1] Deborah Estrin,et al. Medium access control with coordinated adaptive sleeping for wireless sensor networks , 2004, IEEE/ACM Transactions on Networking.

[2] Radhika Nagpal,et al. DESYNC: Self-Organizing Desynchronization and TDMA on Wireless Sensor Networks , 2007, International Symposium on Information Processing in Sensor Networks.

[3] Kagan Tumer,et al. An Introduction to Collective Intelligence , 1999, ArXiv.

[4] S. Sitharama Iyengar,et al. Random asynchronous wakeup protocol for sensor networks , 2004, First International Conference on Broadband Networks.

[5] Reuven Cohen,et al. An Optimal Algorithm for Minimizing Energy Consumption while Limiting Maximum Delay in a Mesh Sensor Network , 2007, IEEE INFOCOM 2007 - 26th IEEE International Conference on Computer Communications.

[6] I-Jeng Wang,et al. Decentralized synchronization protocols with nearest neighbor communication , 2004, SenSys '04.

[7] Radhika Nagpal,et al. Firefly-inspired sensor network synchronicity with realistic radio effects , 2005, SenSys '05.

[8] Curt Schurgers,et al. Wakeup Strategies in Wireless Sensor Networks , 2008 .

[9] Chris Watkins,et al. Learning from delayed rewards , 1989 .

[10] Shan Liang,et al. Passive Wake-up Scheme for Wireless Sensor Networks , 2007, Second International Conference on Innovative Computing, Informatio and Control (ICICIC 2007).

[11] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[12] Zhenzhen Liu,et al. RL-MAC: a reinforcement learning based MAC protocol for wireless sensor networks , 2006, Int. J. Sens. Networks.

[13] Ivan Stojmenović,et al. Handbook of Sensor Networks: Algorithms and Architectures , 2005, Handbook of Sensor Networks.

[14] Rong Zheng,et al. Asynchronous wakeup for ad hoc networks , 2003, MobiHoc '03.

[15] Prasant Mohapatra,et al. Medium access control in wireless sensor networks , 2007, Comput. Networks.

[16] David B. Knoester,et al. Evolving Virtual Fireflies , 2009, ECAL.

[17] S. Strogatz,et al. Synchronization of pulse-coupled biological oscillators , 1990 .

[18] Radhika Nagpal,et al. Desynchronization: The Theory of Self-Organizing Algorithms for Round-Robin Scheduling , 2007, First International Conference on Self-Adaptive and Self-Organizing Systems (SASO 2007).