Short-term plasticity as cause–effect hypothesis testing in distal reward learning