ZIB

Hits per page

hit 1 - 1 | 1 hit

Sorting

Electronic Resource

Co-Evolution in the Successful Learning of Backgammon Strategy (1998)

Pollack, Jordan B. ; Blair, Alan D.

Springer

Machine learning 32 (1998), S. 225-240

add to mindlist on the mindlist

Details

ISSN: 0885-6125

Keywords: coevolution ; backgammon ; reinforcement ; temporal difference learning ; self-learning

Source: Springer Online Journal Archives 1860-2000

Topics: Computer Science

Notes: Abstract Following Tesauro's work on TD-Gammon, we used a 4,000 parameter feedforward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of the dice, application of the network to all legal moves, and selection of the position with the highest evaluation. However, no backpropagation, reinforcement or temporal difference learning methods were employed. Instead we apply simple hillclimbing in a relative fitness environment. We start with an initial champion of all zero weights and proceed simply by playing the current champion network against a slightly mutated challenger and changing weights if the challenger wins. Surprisingly, this worked rather well. We investigate how the peculiar dynamics of this domain enabled a previously discarded weak method to succeed, by preventing suboptimal equilibria in a “meta-game” of self-learning.

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1023/A:1007417214905

Permalink

Library	Location	Call Number	Volume/Issue/Year	Availability

Others were also interested in ...

Paper (German National Licenses)

Fulltext

hit 1 - 1 | 1 hit