ThesaHelp: references t-z
ThesaHelp: ACM references m-z
Topic: limitations of artificial intelligence and cognitive science
Topic: artificial neuron nets
Group: artificial intelligence
Topic: heuristic-based systems
Topic: randomness
Topic: man-machine symbiosis
| |
Reference
Tesauro, G.,
"Temporal difference learning and TD-Gammon",
Communications of the ACM, 38, 3, March 1995, pp. 58-68.
Google
Quotations
58 ;;Quote: TD-Gammon is a self-training neural network for backgammon that outperforms other programs and, sometimes, human experts
| 59 ;;Quote: the goal of temporal difference methods is to match the learner's current prediction for a pattern with the next prediction at the next time step
| 59 ;;Quote: deep search can not be used for backgammon because there are several hundred possible combinations per ply
| 61 ;;Quote: TD-Gammon is a multilayer perception network with 40 hidden units and backgammon feature encoders
| 61+;;Quote: with just raw-encoding, TD-Gammon was a strong intermediate after 200,000 training games
| 65 ;;Quote: TD-Gammon is successful because of the randomness of backgammon and a fairly smooth outcome function
| 65+;;Quote: even with a random initial network, TD-Gammon would terminate in, at most, several thousand moves
| 67 ;;Quote: human experts use TD-Gammon to evaluate the best move for a position by playing the position to completion several thousand times
|
Related Topics
ThesaHelp: references t-z (309 items)
ThesaHelp: ACM references m-z (280 items)
Topic: limitations of artificial intelligence and cognitive science (64 items)
Topic: artificial neuron nets (29 items)
Group: artificial intelligence (14 topics, 500 quotes)
Topic: heuristic-based systems (35 items)
Topic: randomness (16 items)
Topic: man-machine symbiosis (46 items)
|