Q-Learning-based Hybrid ARQ for High Speed Downlink Packet Access in UMTS

In this paper, a Q-learning-based hybrid automatic repeat request (Q-HARQ) scheme is proposed to achieve efficient resource utilization for high speed downlink packet access (HSDPA) in universal mobile telecommunications system (UMTS). The hybrid ARQ procedure is modeled as a discrete-time Markov decision process (MDP), where the transmission cost is defined in terms of the signal-to-interference-and-noise (SINR) which is based on the desired (quality-of-service) QoS parameters of transport block error rate (BLER) for enhancing spectrum utilization subject to QoS constraint. The Q-learning reinforcement algorithm is employed to accurately estimate the transmission cost to perform the most suitable decision of modulation and coding scheme for the packet initial transmission while the requirement of transport block error rate is guaranteed. Simulation results show that the QoS requirement of block error rate for Q-HARQ is nearly met around a reasonable value indeed. In addition, the system throughput of the Q-HARQ can be improved under the specific QoS constraint of BLER. It is verified finally that the Q-HARQ scheme is feasible in the practical system because of the short processing and convergence time.

[1]  D. Signorini,et al.  Neural networks , 1995, The Lancet.

[2]  Ainslie,et al.  CORRELATION MODEL FOR SHADOW FADING IN MOBILE RADIO SYSTEMS , 2004 .

[3]  Stefan Parkvall,et al.  Adaptive incremental redundancy [WCDMA systems] , 2003, 2003 IEEE 58th Vehicular Technology Conference. VTC 2003-Fall (IEEE Cat. No.03CH37484).

[4]  R. Bellman Dynamic programming. , 1957, Science.

[5]  Peter Han Joo Chong,et al.  Analysis of the adaptive modulation and coding algorithm with the multicode transmission , 2002, Proceedings IEEE 56th Vehicular Technology Conference.

[6]  Stefan Parkvall,et al.  Adaptive Incremental Redundancy , 2003 .

[7]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[8]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[9]  Jon W. Mark,et al.  A combined link adaptation and incremental redundancy protocol for enhanced data transmission , 2001, GLOBECOM'01. IEEE Global Telecommunications Conference (Cat. No.01CH37270).

[10]  E. Zehavi,et al.  Soft handoff extends CDMA cell coverage and increases reverse link capacity , 1994, IEEE J. Sel. Areas Commun..