Evaluating Reasoning Systems