Short-term retention of response outcome as a determinant of serial reversal learning