The Authenticity Gap in Human Evaluation