Evaluating Models Based on Multiple Data Sets and Data Diagnosis Measures