Map
Index
Random
Help
th

QuoteRef: tesaG3_1995

topics > all references > ThesaHelp: references t-z



ThesaHelp:
references t-z
ThesaHelp:
ACM references m-z
Topic:
limitations of artificial intelligence and cognitive science
Topic:
artificial neuron nets
Group:
artificial intelligence
Topic:
heuristic-based systems
Topic:
randomness
Topic:
man-machine symbiosis

Reference

Tesauro, G., "Temporal difference learning and TD-Gammon", Communications of the ACM, 38, 3, March 1995, pp. 58-68. Google

Quotations
58 ;;Quote: TD-Gammon is a self-training neural network for backgammon that outperforms other programs and, sometimes, human experts
59 ;;Quote: the goal of temporal difference methods is to match the learner's current prediction for a pattern with the next prediction at the next time step
59 ;;Quote: deep search can not be used for backgammon because there are several hundred possible combinations per ply
61 ;;Quote: TD-Gammon is a multilayer perception network with 40 hidden units and backgammon feature encoders
61+;;Quote: with just raw-encoding, TD-Gammon was a strong intermediate after 200,000 training games
65 ;;Quote: TD-Gammon is successful because of the randomness of backgammon and a fairly smooth outcome function
65+;;Quote: even with a random initial network, TD-Gammon would terminate in, at most, several thousand moves
67 ;;Quote: human experts use TD-Gammon to evaluate the best move for a position by playing the position to completion several thousand times


Related Topics up

ThesaHelp: references t-z (309 items)
ThesaHelp: ACM references m-z (280 items)
Topic: limitations of artificial intelligence and cognitive science (64 items)
Topic: artificial neuron nets (29 items)
Group: artificial intelligence   (14 topics, 500 quotes)
Topic: heuristic-based systems (35 items)
Topic: randomness (16 items)
Topic: man-machine symbiosis (46 items)

Collected barberCB 8/97
Copyright © 2002-2008 by C. Bradford Barber. All rights reserved.
Thesa is a trademark of C. Bradford Barber.