Towards Better Evaluation of Multi-target Regression Models