Three Years, Two Papers, One Course Off: Optimal Nonmonetary Reward Policies