A cooperative learning framework for mobility-aware resource management in multi-inhabitant smart homes

The essence of pervasive (ubiquitous) computing lies in the creation of smart environments saturated with computing and communication capabilities, yet gracefully integrated with human users. 'Context Awareness' is perhaps the most important feature of such an intelligent computing paradigm. The mobility and activity of the inhabitants play significant roles in forming the context at any instance of time. In order to extract the best performance and efficacy of smart computing environments, one needs a technology-independent, context-aware platform spanning over multiple inhabitants. In this paper, we have developed a framework for mobility-aware resource (in particular, energy consumption) management in a multi-inhabitant smart home, based on a dynamic, cooperative reinforcement learning technique. The inhabitants' mobility creates uncertainty of his location and activity. Using the proposed cooperative game-theory based framework, all the inhabitants currently present in the house attempt to minimize this overall uncertainty in the form of utility functions associated with them. Joint optimization of the utility function corresponds to the convergence to Nash equilibrium and helps in accurate prediction of inhabitants' future locations and activities. This results in adaptive control of automated devices and temperature of the house, thus providing an amicable environment and sufficient comfort to the inhabitants. Simulation results point out that our framework can adaptively control the smart environment, while reducing the energy consumption and enhancing the comfort.

[1]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .

[2]  J. Krumm,et al.  Multi-camera multi-person tracking for EasyLiving , 2000, Proceedings Third IEEE International Workshop on Visual Surveillance.

[3]  Bryan Horling,et al.  The Intelligent Home Testbed , 2004 .

[4]  Michael C. Mozer,et al.  The Neural Network House: An Environment that Adapts to its Inhabitants , 1998 .

[5]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[6]  Gregory D. Abowd,et al.  The smart floor: a mechanism for natural user identification and tracking , 2000, CHI Extended Abstracts.

[7]  Christian R. Shelton,et al.  Balancing Multiple Sources of Reward in Reinforcement Learning , 2000, NIPS.

[8]  W. P. Jones,et al.  Air Conditioning Engineering , 1967 .

[10]  Hari Balakrishnan,et al.  6th ACM/IEEE International Conference on on Mobile Computing and Networking (ACM MOBICOM ’00) The Cricket Location-Support System , 2022 .

[11]  Diane J. Cook,et al.  Location aware resource management in smart homes , 2003, Proceedings of the First IEEE International Conference on Pervasive Computing and Communications, 2003. (PerCom 2003)..

[12]  Andy Hopper,et al.  The Anatomy of a Context-Aware Application , 1999, Wirel. Networks.

[13]  A. Harter,et al.  A distributed location system for the active office , 1994, IEEE Network.

[14]  Sajal K. Das,et al.  LeZi-Update: An Information-Theoretic Framework for Personal Mobility Tracking in PCS Networks , 2002, Wirel. Networks.

[15]  Tommi S. Jaakkola,et al.  Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms , 2000, Machine Learning.

[16]  Manfred Schneps-Schneppe,et al.  Service level agreement as an issue of teletraffic , 2003 .

[17]  Paramvir Bahl,et al.  RADAR: an in-building RF-based user location and tracking system , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[18]  Craig Boutilier,et al.  The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.

[19]  Diane J. Cook,et al.  The role of prediction algorithms in the MavHome smart home architecture , 2002, IEEE Wirel. Commun..

[20]  James H. Aylor,et al.  Computer for the 21st Century , 1999, Computer.

[21]  Manuela M. Veloso,et al.  Rational and Convergent Learning in Stochastic Games , 2001, IJCAI.