Periodic collection of spectrum occupancy data by energy constrained cognitive IoT devices

The Internet of things (IoT) sets new bandwidth requirements for devices' seamless connectivity and calls for new communication paradigms such as opportunistic spectrum access. However, realization of opportunistic spectrum access without causing interference to primary users poses significant challenges that are further exacerbated by the constrained nature of IoT devices both in terms of computational resources and battery capacity. In this work we consider the problem of energy efficient collection of primary users' spectrum occupancy data that may be used to facilitate the prediction of future primary users' states. We propose a process for a cognitive IoT device to decide the optimal spectrum sensing period for each state of a primary user. The decision process considers both energy consumption and accuracy in the tracking of a primary user's state within the period. We model the problem at hand as a semi-Markov decision process embedded within a base Markov decision process using the framework of options. Finally, we utilize two variants of the Q-learning algorithm that are known to converge to optimal policies and we propose enhancements that significantly speed up the learning process. Numerical results verify the effectiveness of the proposed enhanced learning strategies.

[1]  Anand Raghunathan,et al.  Battery discharge characteristics of wireless sensor nodes: an experimental analysis , 2005, 2005 Second Annual IEEE Communications Society Conference on Sensor and Ad Hoc Communications and Networks, 2005. IEEE SECON 2005..

[2]  Qian Zhang,et al.  Intelligent Dynamic Spectrum Access Assisted by Channel Usage Prediction , 2010, 2010 INFOCOM IEEE Conference on Computer Communications Workshops.

[3]  Nikolaos Limnios,et al.  Semi-Markov Chains and Hidden Semi-Markov Models toward Applications: Their Use in Reliability and DNA Analysis , 2008 .

[4]  Lei Yang,et al.  Proactive channel access in dynamic spectrum networks , 2008, Phys. Commun..

[5]  Xianwei Zhou,et al.  Spectrum Sensing Using Prior Probability Prediction , 2011, 2011 7th International Conference on Wireless Communications, Networking and Mobile Computing.

[6]  Eylem Ekici,et al.  A survey of MAC issues for TV white space access , 2015, Ad Hoc Networks.

[7]  Matt Welsh,et al.  Simulating the power consumption of large-scale sensor network applications , 2004, SenSys '04.

[8]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[9]  Yann Guédon,et al.  Semi‐Markov Chains and Hidden Semi‐Markov Models toward Applications: Their Use in Reliability and DNA Analysis by BARBU, V. S. and LIMNIOS, N. , 2009 .

[10]  W.H. Tranter,et al.  Dynamic spectrum allocation in cognitive radio using hidden Markov models: Poisson distributed case , 2007, Proceedings 2007 IEEE SoutheastCon.

[11]  Angela Sara Cacciapuoti,et al.  Database access strategy for TV White Space cognitive radio networks , 2014, 2014 Eleventh Annual IEEE International Conference on Sensing, Communication, and Networking Workshops (SECON Workshops).

[12]  Yang Li,et al.  Spectrum Usage Prediction Based on High-order Markov Model for Cognitive Radio Networks , 2010, 2010 10th IEEE International Conference on Computer and Information Technology.

[13]  Peter Friess,et al.  Internet of Things Strategic Research Roadmap , 2011 .

[14]  Jürgen Schönwälder,et al.  Management of resource constrained devices in the internet of things , 2012, IEEE Communications Magazine.

[15]  Álvaro Araujo,et al.  Security in cognitive wireless sensor networks. Challenges and open problems , 2012, EURASIP Journal on Wireless Communications and Networking.

[16]  Ian F. Akyildiz,et al.  Wireless sensor networks: a survey , 2002, Comput. Networks.

[17]  Doina Precup,et al.  Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..

[18]  Elias Z. Tragos,et al.  Cognitive radio inspired M2M communications , 2013, 2013 16th International Symposium on Wireless Personal Multimedia Communications (WPMC).

[19]  Warren B. Powell,et al.  Handbook of Learning and Approximate Dynamic Programming , 2006, IEEE Transactions on Automatic Control.

[20]  Danijela Cabric,et al.  Prediction of exponentially distributed primary user traffic for dynamic spectrum access , 2012, 2012 IEEE Global Communications Conference (GLOBECOM).

[21]  Sanjay Jha,et al.  Mitigating the effect of interference in Wireless Sensor Networks , 2010, IEEE Local Computer Network Conference.

[22]  R. Gallager Stochastic Processes , 2014 .