Nacionalni portal odprte znanosti

Iskalni niz:

išči po

Vrsta gradiva:

Jezik:

Prikaži samo zadetke s polnim besedilom

Št. zadetkov: 2

Preconditioned Temporal Difference Learning

Hengshuai Yao

Video in druga učna gradiva

Oznake:

This paper extends many of the recent popular reinforcement learning (RL) algorithms to a generalized framework that includes least-squares temporal difference (LSTD) learning, least-squares policy evaluation (LSPE) and a variant of incremental LSTD (iLSTD). The basis of this extension is a precondi ...

Leto: 2008 Vir: videolectures.net

Dyna(k): A Multi-Step Dyna Planning

Hengshuai Yao

Video in druga učna gradiva

Oznake: computer science;artificial intelligence;planning and scheduling

Dyna planning is an efficient way of learning from real and imaginary experience. Existing tabular and linear Dyna algorithms are single-step, because an "imaginary" feature is predicted only one step into the future. In this paper, we introduce a multi-step Dyna planning that predicts more st ...

Leto: 2009 Vir: videolectures.net

Št. zadetkov: 2

Ključne besede:

artificial intelligence (1)

computer science (1)

planning and scheduling (1)

Leto izdaje:

2008 (1)

2009 (1)

Repozitorij:

videolectures.net (2)

Tipologija:

Video in druga učna gradiva (2)

Jezik:

Angleški jezik (2)

Nacionalni portal odprte znanosti

Dostop do znanja slovenskih raziskovalnih organizacij