HiQ is a hierarchical, online learning algorithm that finds dynamic solutions to the reader collision problem in RFID systems. When the transmissions from one reader interfere with the operation of another reader, a reader collision occurs. The objective of the reader collision problem is to minimize the reader collisions experienced by RFID readers while using the minimum number of frequencies and using the minimum total time for all readers to communicate successfully. HiQ attempts to minimize reader collisions by learning the collision patterns of the readers and by effectively assigning frequencies over time to ensure neighboring readers do not experience collisions from one another. HiQ is arranged hierarchically with distributed, local control. The algorithm is based on a type of reinforcement learning called Q-learning, which is used to determine frequency and time assignments. Through repeated interaction with the system, Q-learning attempts to discover an optimum frequency assignment over time. We show that HiQ finds optimal or near optimal solutions to the reader collision problem.
[1]
Daniel W. Engels,et al.
The reader collision problem
,
2002,
IEEE International Conference on Systems, Man and Cybernetics.
[2]
Daniel W. Engels,et al.
Colorwave: a MAC for RFID reader networks
,
2003,
2003 IEEE Wireless Communications and Networking, 2003. WCNC 2003..
[3]
Andrew G. Barto,et al.
Learning to Act Using Real-Time Dynamic Programming
,
1995,
Artif. Intell..
[4]
Daniel W. Engels,et al.
Colorwave: an anticollision algorithm for the reader collision problem
,
2003,
IEEE International Conference on Communications, 2003. ICC '03..
[5]
Ben J. A. Kröse,et al.
Learning from delayed rewards
,
1995,
Robotics Auton. Syst..
[6]
Ehl Emile Aarts,et al.
Simulated annealing and Boltzmann machines
,
2003
.
[7]
Fred W. Howell,et al.
Using Java for Discrete Event Simulation
,
1996
.
[8]
Panos M. Pardalos,et al.
Frequency Assignment Problems
,
1999,
Handbook of Combinatorial Optimization.