Model-Free Q-Learning for the Tracking Problem of Linear Discrete-Time Systems