ZIB

Hits per page

hits 1 - 4 | 4 hits

Sorting

Electronic Resource

Genetic reinforcement learning for neurocontrol problems (1993)

Whitley, Darrell ; Dominic, Stephen ; Das, Rajarshi ; [et al.]

Springer

Machine learning 13 (1993), S. 259-284

add to mindlist on the mindlist

Details

ISSN: 0885-6125

Keywords: Genetic algorithms ; reinforcement learning ; neural networks ; adaptive control

Source: Springer Online Journal Archives 1860-2000

Topics: Computer Science

Notes: Abstract Empirical tests indicate that at least one class of genetic algorithms yields good performance for neural network weight optimization in terms of learning rates and scalability. The successful application of these genetic algorithms to supervised learning problems sets the stage for the use of genetic algorithms in reinforcement learning problems. On a simulated inverted-pendulum control problem, “genetic reinforcement learning” produces competitive results with AHC, another well-known reinforcement learning paradigm for neural networks that employs the temporal difference method. These algorithms are compared in terms of learning rates, performance-based generalization, and control behavior over time.

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1007/BF00993045

Permalink

Library	Location	Call Number	Volume/Issue/Year	Availability

Others were also interested in ...

Paper (German National Licenses)

Fulltext

Electronic Resource

Genetic Reinforcement Learning for Neurocontrol Problems (1993)

Whitley, Darrell ; Dominic, Stephen ; Das, Rajarshi ; [et al.]

Springer

Machine learning 13 (1993), S. 259-284

add to mindlist on the mindlist

Details

ISSN: 0885-6125

Keywords: Genetic algorithms ; reinforcement learning ; neural networks ; adaptive control

Source: Springer Online Journal Archives 1860-2000

Topics: Computer Science

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1023/A:1022674030396

Permalink

Library	Location	Call Number	Volume/Issue/Year	Availability

Others were also interested in ...

Paper (German National Licenses)

Fulltext

Electronic Resource

Model Selection in an Information Economy: Choosing What to Learn (2002)

Brooks, Christopher H. ; Gazzale, Robert S. ; Das, Rajarshi ; [et al.]

Boston, USA and Oxford, UK : Blackwell Publishers Inc.

Computational intelligence 18 (2002), S. 0

add to mindlist on the mindlist

Details

ISSN: 1467-8640

Source: Blackwell Publishing Journal Backfiles 1879-2005

Topics: Computer Science

Notes: As online markets for the exchange of goods and services become more common, the study of markets composed, at least in part, of autonomous agents has taken on increasing importance. In contrast to traditional complete–information economic scenarios, agents that are operating in an electronic marketplace often do so under considerable uncertainty. In order to reduce their uncertainty, these agents must learn about the world around them. When an agent producer is engaged in a learning task in which data collection is costly, such as learning the preferences of a consumer population, it is faced with a classic decision problem: when to explore and when to exploit. If the agent has a limited number of chances to experiment, it must explicitly consider the cost of learning (in terms of foregone profit) against the value of the information acquired. Information goods add an additional dimension to this problem; due to their flexibility, they can be bundled and priced according to a number of different price schedules. An optimizing producer should consider the profit each price schedule can extract, as well as the difficulty of learning of this schedule.In this paper, we demonstrate the tradeoff between complexity and profitability for a number of common price schedules. We begin with a one–shot decision as to which schedule to learn. Schedules with moderate complexity are preferred in the short and medium term, as they are learned quickly, yet extract a significant fraction of the available profit. We then turn to the repeated version of this one–shot decision and show that moderate complexity schedules, in particular two–part tariff, perform well when the producer must adapt to nonstationarity in the consumer population. When a producer can dynamically change schedules as it learns, it can use an explicit decision–theoretic formulation to greedily select the schedule which appears to yield the greatest profit in the next period. By explicitly considering both the learnability and the profit extracted by different price schedules, a producer can extract more profit as it learns than if it naively chose models that are accurate once learned.

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1111/1467-8640.t01-2-00204

Permalink

Library	Location	Call Number	Volume/Issue/Year	Availability

Others were also interested in ...

Paper (German National Licenses)

Fulltext

Electronic Resource

Tracking primary hyperplane competitors during genetic search (1992)

Whitley, Darrell ; Das, Rajarshi ; Crabb, Curtis

Springer

Annals of mathematics and artificial intelligence 6 (1992), S. 367-388

add to mindlist on the mindlist

Details

ISSN: 1573-7470

Keywords: Genetic algorithm ; schema theorem ; hyperplane competitions

Source: Springer Online Journal Archives 1860-2000

Topics: Computer Science , Mathematics

Notes: Abstract Holland's fundamental theorem of genetic algorithms (the “schema theorem”) provides a lower bound on the sampling rate of a single hyperplane during genetic search. However, the theorem tracks the change in representation for a single hyperplaneas if its representation is independent of other hyperplanes. Hyperplane samples are clearly interdependent and interactions in the hyperplane samples means that Holland's notion of “implicit parallelism” does not universally hold when conflicting hyperplanes interact. A set of equations are defined which allows one to model the interaction of strings, or of schemata representing hyperplanes at order-N and hyperplanes less than order-N. These equations do not account for the effects of higher order hyperplanes or co-lateral competitions. Nevertheless, these equations can serve to better describe the interaction of primary hyperplane competitors.

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1007/BF01535526

Permalink

Library	Location	Call Number	Volume/Issue/Year	Availability

Others were also interested in ...

Paper (German National Licenses)

Fulltext

hits 1 - 4 | 4 hits