Th QuoteRef: tesaG3

QuoteRef: tesaG3_1995

topics > all references > ThesaHelp: references t-z

ThesaHelp:
references t-z
ThesaHelp:
ACM references m-z
Topic:
limitations of artificial intelligence and cognitive science
Topic:
artificial neuron nets
Group:
artificial intelligence
Topic:
heuristic-based systems
Topic:
randomness
Topic:
man-machine symbiosis

Reference

Tesauro, G., "Temporal difference learning and TD-Gammon", Communications of the ACM, 38, 3, March 1995, pp. 58-68. Google

Quotations

58 ;;Quote: TD-Gammon is a self-training neural network for backgammon that outperforms other programs and, sometimes, human experts

59 ;;Quote: the goal of temporal difference methods is to match the learner's current prediction for a pattern with the next prediction at the next time step

59 ;;Quote: deep search can not be used for backgammon because there are several hundred possible combinations per ply

61 ;;Quote: TD-Gammon is a multilayer perception network with 40 hidden units and backgammon feature encoders

61+;;Quote: with just raw-encoding, TD-Gammon was a strong intermediate after 200,000 training games

65 ;;Quote: TD-Gammon is successful because of the randomness of backgammon and a fairly smooth outcome function

65+;;Quote: even with a random initial network, TD-Gammon would terminate in, at most, several thousand moves

67 ;;Quote: human experts use TD-Gammon to evaluate the best move for a position by playing the position to completion several thousand times

Related Topics

ThesaHelp: references t-z (309 items)
ThesaHelp: ACM references m-z (280 items)
Topic: limitations of artificial intelligence and cognitive science (64 items)
Topic: artificial neuron nets (29 items)
Group: artificial intelligence (14 topics, 500 quotes)
Topic: heuristic-based systems (35 items)
Topic: randomness (16 items)
Topic: man-machine symbiosis (46 items)