Consistent evaluation of uncertain reasoning systems