Evaluating planning: what is successful planning and (how) can we measure it?