Partner selection in self-organised wireless sensor networks for opportunistic energy negotiation: A multi-armed bandit based approach

Abstract The proliferation of “Things” over a network creates the Internet of Things (IoT), where sensors integrate to collect data from the environment over long periods of time. The growth of IoT applications will inevitably involve co-locating multiple wireless sensor networks, each serving different applications with, possibly, different needs and constraints. Since energy is scarce in sensor nodes equipped with non-rechargeable batteries, energy harvesting technologies have been the focus of research in recent years. However, new problems arise as a result of their wide spatio-temporal variation. Such a shortcoming can be avoided if co-located networks cooperate with each other and share their available energy. Due to their unique characteristics and different owners, recently, we proposed a negotiation approach to deal with conflict of preferences. Unfortunately, negotiation can be impractical with a large number of participants, especially in an open environment. Given this, we introduce a new partner selection technique based on multi-armed bandits (MAB), that enables each node to learn the strategy that optimises its energy resources in the long term. Our results show that the proposed solution allows networks to repeatedly learn the current best energy partner in a dynamic environment. The performance of such a technique is evaluated through simulation and shows that a network can achieve an efficiency of 72% against the optimal strategy in the most challenging scenario studied in this work.

[1]  Jiming Chen,et al.  Data Gathering Optimization by Dynamic Sensing and Routing in Rechargeable Sensor Networks , 2016, IEEE/ACM Trans. Netw..

[2]  Peter Auer,et al.  The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..

[3]  Nicholas R. Jennings,et al.  Long-term information collection with energy harvesting wireless sensors: a multi-armed bandit based approach , 2012, Autonomous Agents and Multi-Agent Systems.

[4]  Sander M. Bohte,et al.  Automated Negotiation and Bundling of Information Goods , 2003, AMEC.

[5]  Michael Wagner,et al.  A Comparative Study on Vector Similarity Methods for Offer Generation in Multi-attribute Negotiation , 2015, Australasian Conference on Artificial Intelligence.

[6]  Setareh Maghsudi,et al.  Joint Channel Selection and Power Control in Infrastructureless Wireless Networks: A Multiplayer Multiarmed Bandit Framework , 2014, IEEE Transactions on Vehicular Technology.

[7]  Adina Magda Florea,et al.  A model of automated negotiation based on agents profiles , 2013, Scalable Comput. Pract. Exp..

[8]  Michael Luck,et al.  Motivation-Based Selection of Negotiation Opponents , 2004, ESAW.

[9]  Ting Wang,et al.  Adaptive Routing for Sensor Networks using Reinforcement Learning , 2006, The Sixth IEEE International Conference on Computer and Information Technology (CIT'06).

[10]  Nicholas R. Jennings,et al.  Playing Repeated Security Games with No Prior Knowledge , 2016, AAMAS.

[11]  Geoff V. Merrett,et al.  Opportunistic Direct Interconnection between Co-Located Wireless Sensor Networks , 2013, 2013 22nd International Conference on Computer Communication and Networks (ICCCN).

[12]  H. Robbins Some aspects of the sequential design of experiments , 1952 .

[13]  Minjie Zhang,et al.  Feasible Negotiation Procedures for Multiple Interdependent Negotiations , 2018, AAMAS.

[14]  Dong Kun Noh,et al.  Minimum Variance Energy Allocation for a Solar-Powered Sensor System , 2009, DCOSS.

[15]  Nicholas R. Jennings,et al.  The Influence of Information on Negotiation Equilibrium , 2002, AMEC.

[16]  Ainuddin Wahid Abdul Wahab,et al.  Energy harvesting and battery power based routing in wireless sensor networks , 2017, Wirel. Networks.

[17]  Nicholas R. Jennings,et al.  Negotiation decision functions for autonomous agents , 1998, Robotics Auton. Syst..

[18]  Hwee Pink Tan,et al.  Machine Learning in Wireless Sensor Networks: Algorithms, Strategies, and Applications , 2014, IEEE Communications Surveys & Tutorials.

[19]  Mohammad Hossein Anisi,et al.  A Review on energy management schemes in energy harvesting wireless sensor networks , 2017 .

[20]  David E. Culler,et al.  Perpetual environmentally powered sensor networks , 2005, IPSN 2005. Fourth International Symposium on Information Processing in Sensor Networks, 2005..

[21]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[22]  Guoming Lai,et al.  A Generic Framework for Automated Multi-attribute Negotiation , 2009 .

[23]  Fernand Meyer,et al.  A comparative study of LPWAN technologies for large-scale IoT deployment , 2019, ICT Express.

[24]  Leonid Sheremetov,et al.  Weiss, Gerhard. Multiagent Systems a Modern Approach to Distributed Artificial Intelligence , 2009 .

[25]  Jean-Pierre Hubaux,et al.  Cooperation in underwater sensor networks , 2009, 2009 International Conference on Game Theory for Networks.

[26]  Katia P. Sycara,et al.  Multiagent negotiation on multiple issues with incomplete information: extended abstract , 2013, AAMAS.

[27]  Jan M. Rabaey,et al.  Power Sources for Wireless Sensor Networks , 2004, EWSN.

[28]  Levente Buttyán,et al.  Spontaneous Cooperation in Multi-domain Sensor Networks , 2005, ESAS.

[29]  Raphaël Féraud,et al.  Node-based optimization of LoRa transmissions with Multi-Armed Bandit algorithms , 2018, 2018 25th International Conference on Telecommunications (ICT).

[30]  Mani B. Srivastava,et al.  Power management in energy harvesting sensor networks , 2007, TECS.

[31]  Lingling Li,et al.  On the application of LoRa LPWAN technology in Sailing Monitoring System , 2017, 2017 13th Annual Conference on Wireless On-demand Network Systems and Services (WONS).

[32]  Geoff V. Merrett,et al.  Experimental validation of opportunistic direct interconnection between different Wireless Sensor Networks , 2016, 2016 IEEE Sensors Applications Symposium (SAS).

[33]  Santosh S. Vempala,et al.  Efficient algorithms for online decision problems , 2005, Journal of computer and system sciences (Print).

[34]  Levente Buttyán,et al.  Cooperative packet forwarding in multi-domain sensor networks , 2005, Third IEEE International Conference on Pervasive Computing and Communications Workshops.

[35]  Bo Zhang,et al.  Maximum utility rate allocation for energy harvesting wireless sensor networks , 2011, MSWiM '11.

[36]  Sébastien Bubeck,et al.  Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems , 2012, Found. Trends Mach. Learn..

[37]  Jacques Palicot,et al.  Multi-Armed Bandit Learning in IoT Networks: Learning Helps Even in Non-stationary Settings , 2017, CrownCom.

[38]  Mathijs de Weerdt,et al.  Efficient Methods for Multi-agent Multi-issue Negotiation: Allocating Resources , 2009, PRIMA.

[39]  Ingrid Moerman,et al.  A reinforcement learning based solution for cognitive network cooperation between co-located, heterogeneous wireless sensor networks , 2014, Ad Hoc Networks.

[40]  Bo An,et al.  Automated negotiation with decommitment for dynamic resource allocation in cloud computing , 2010, AAMAS.

[41]  Ness B. Shroff,et al.  A Simple Asymptotically Optimal Joint Energy Allocation and Routing Scheme in Rechargeable Sensor Networks , 2014, IEEE/ACM Transactions on Networking.

[42]  Nicholas R. Jennings,et al.  Using similarity criteria to make issue trade-offs in automated negotiations , 2002, Artif. Intell..

[43]  Gergely Neu,et al.  An Efficient Algorithm for Learning with Semi-bandit Feedback , 2013, ALT.

[44]  Yonghui Song,et al.  Multi-Armed Bandit Channel Access Scheme With Cognitive Radio Technology in Wireless Sensor Networks for the Internet of Things , 2016, IEEE Access.

[45]  Katia P. Sycara,et al.  Automated Bilateral Multiple-issue Negotiation with No Information About Opponent , 2013, 2013 46th Hawaii International Conference on System Sciences.

[46]  J. Cid-Sueiro,et al.  Q-Probabilistic Routing in Wireless Sensor Networks , 2007, 2007 3rd International Conference on Intelligent Sensors, Sensor Networks and Information.

[47]  A. Rubinstein Perfect Equilibrium in a Bargaining Model , 1982 .

[48]  J. Nash THE BARGAINING PROBLEM , 1950, Classics in Game Theory.

[49]  Lida Xu,et al.  Cloud Service Negotiation in Internet of Things Environment: A Mixed Approach , 2014, IEEE Transactions on Industrial Informatics.

[50]  Sarvapali D. Ramchurn,et al.  Automated Negotiation for Opportunistic Energy Trading Between Neighbouring Wireless Sensor Networks , 2018, 2018 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm).

[51]  Ryszard Kowalczyk,et al.  On possibilistic case-based reasoning for selecting partners for multi-attribute agent negotiation , 2005, AAMAS '05.

[52]  N. R. Jennings,et al.  To appear in: Int Journal of Group Decision and Negotiation GDN2000 Keynote Paper Automated Negotiation: Prospects, Methods and Challenges , 2022 .