Temporal Difference Learning and TD-Gammon

We provide an abstract, selectively u§ing the author's formulations: "The article presents a game-learning program called TD-GAMMON. TD-GAMMON is a neural network that trains itself to be an evaluation function for the game of backgammon by playing against itself and learning from the outcome. It was not developed to surpass all previous computer programs in backgammon; rather, its purpose was to explore some new ideas and approaches to traditional problems in reinforcement learning.