Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling