Acquiring Diverse Predictive Knowledge in Real Time by Temporal-difference Learning