An efficient density peak cluster algorithm for improving policy evaluation performance