Distributed cross-layer resource allocation using Correlated Equilibrium based stochastic learning

In recent years, cross-layer Resource Management (RM) has been widely considered as an efficient approach for improving the network performance. However, due to the factors such as limited knowledge about the wireless environment and spontaneous characteristics of device behaviors, designing an efficient distributed RM scheme using the cross-layer paradigm becomes extremely difficult. In this paper, a distributed, cross-layer RM scheme for multi-users transmitting scalable video in an ad-hoc CDMA network is presented. For the cross-layer RM design, an end-to-end performance criterion at the APP layer is applied and the resource allocations in the PHY, LINK and APP layers are unified into a single decision process. Due to the dynamic property of the wireless channels and the uncoordinated user interactions, the distributed RM decision is modeled in the framework of stochastic game. In the game, each device gradually learns its policy using a distributed, Correlated Equilibrium (CE) based Q-Learning algorithm. In order to calculate the CE in a distributed way, the local devices apply the regret-matching based mechanism for policy and state-action value updating. Simulation experiments show that with the proposed resource management scheme, the system performance can be improved by about 15% compared to the layered RM scheme and 10% compared to the RM scheme purely based on the local-level learning.

[1]  S. Hart,et al.  A simple adaptive procedure leading to correlated equilibrium , 2000 .

[2]  K. J. Ray Liu,et al.  Toward a unified framework for modeling and analysis of diversity in joint source-channel coding , 2008, IEEE Transactions on Communications.

[3]  Andres Kwasinski,et al.  Cooperative learning for reduced complexity cross-layer Cognitive Radio , 2011, 2011 IEEE 22nd International Symposium on Personal, Indoor and Mobile Radio Communications.

[4]  Yusun Chang,et al.  Reinforcement Learning for Repeated Power Control Game in Cognitive Radio Networks , 2012, IEEE Journal on Selected Areas in Communications.

[5]  Rajarathnam Chandramouli,et al.  Adaptive downlink scheduling and rate selection: a cross-layer design , 2005, IEEE Journal on Selected Areas in Communications.

[6]  E. Altman Constrained Markov Decision Processes , 1999 .

[7]  Victor C. M. Leung,et al.  Cross-Layer Optimization for Multimedia Transport over Multicode CDMA Networks , 2011, IEEE Transactions on Mobile Computing.

[8]  Andres Kwasinski,et al.  Experience cooperative sharing in cross-layer cognitive radio for real-time multimedia communication , 2011, CogART '11.

[9]  Joseph Mitola,et al.  Cognitive radio : model-based competence for software radios , 1999 .

[10]  Mihaela van der Schaar,et al.  Learning to Compete for Resources in Wireless Stochastic Games , 2009, IEEE Transactions on Vehicular Technology.

[11]  Keith W. Ross,et al.  Unified framework for optimal video streaming , 2004, IEEE INFOCOM 2004.

[12]  Mihaela van der Schaar,et al.  Online Reinforcement Learning for Dynamic Multimedia Systems , 2010, IEEE Transactions on Image Processing.

[13]  Weiping Li,et al.  Overview of fine granularity scalability in MPEG-4 video standard , 2001, IEEE Trans. Circuits Syst. Video Technol..

[14]  Venugopal V. Veeravalli,et al.  Decentralized dynamic power control for cellular CDMA systems , 2003, IEEE Trans. Wirel. Commun..

[15]  Keith B. Hall,et al.  Correlated Q-Learning , 2003, ICML.

[16]  Mihaela van der Schaar,et al.  Decomposition Principles and Online Learning in Cross-Layer Optimization for Delay-Sensitive Applications , 2008, IEEE Transactions on Signal Processing.

[17]  Yun Q. Shi,et al.  Constant quality constrained rate allocation for FGS-coded video , 2003, IEEE Trans. Circuits Syst. Video Technol..