Epoch-incremental reinforcement learning algorithms

Roman Zajdel

Open Access

Published Online: Sep 30, 2013

International Journal of Applied Mathematics and Computer Science

Volume 23 (2013): Issue 3 (September 2013)

Page range: 623 - 635

DOI: https://doi.org/10.2478/amcs-2013-0047

This content is open access.

In this article, a new class of epoch-incremental reinforcement learning algorithms is proposed. In the incremental mode, the fundamental TD(0) or TD(λ) algorithm is performed and an environment model is built. In the epoch mode, the distances from past-active states to the terminal state are computed on the basis of the environment model. These distances and the terminal-state reinforcement signal are then used to improve the agent policy.
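
To make the two modes concrete, the following is a minimal sketch and not the article's algorithm. It assumes tabular Q-learning as the TD(0) step, a deterministic five-state chain as the environment, a model that simply records observed transitions, and an illustrative epoch rule that overwrites the value of each recorded action moving one step closer to the terminal state with the discounted terminal reward. All names (`td0_update`, `distances_to_terminal`, `epoch_update`, `step`, `run_episode`) and the parameters `alpha` and `gamma` are hypothetical.

```python
from collections import defaultdict, deque


def td0_update(q, s, a, r, s_next, n_actions, alpha=0.1, gamma=0.95, terminal=False):
    """Incremental mode: one tabular Q-learning (TD(0)) step."""
    best_next = 0.0 if terminal else max(q[(s_next, b)] for b in range(n_actions))
    q[(s, a)] += alpha * (r + gamma * best_next - q[(s, a)])


def distances_to_terminal(model, terminal_state):
    """Breadth-first search backwards over the recorded transitions.

    model maps (state, action) -> next_state for every transition seen so far.
    Returns {state: steps to the terminal state} for the visited states.
    """
    predecessors = defaultdict(set)
    for (s, _a), s_next in model.items():
        predecessors[s_next].add(s)
    dist = {terminal_state: 0}
    queue = deque([terminal_state])
    while queue:
        current = queue.popleft()
        for prev in predecessors[current]:
            if prev not in dist:
                dist[prev] = dist[current] + 1
                queue.append(prev)
    return dist


def epoch_update(q, model, dist, terminal_reward, gamma=0.95):
    """Epoch mode (assumed rule, not taken from the article): every recorded
    action that moves one step closer to the terminal state receives the
    terminal reward discounted by the remaining distance."""
    for (s, a), s_next in model.items():
        if s in dist and s_next in dist and dist[s_next] == dist[s] - 1:
            q[(s, a)] = gamma ** dist[s_next] * terminal_reward


def step(state, action, n_states=5):
    """Hypothetical chain environment: action 0 moves left, action 1 moves right."""
    next_state = max(0, state - 1) if action == 0 else min(n_states - 1, state + 1)
    reward = 1.0 if next_state == n_states - 1 else 0.0
    return next_state, reward


def run_episode(actions, n_states=5):
    """Run one episode in incremental mode, then apply the epoch mode once."""
    q = defaultdict(float)
    model = {}
    state = 0
    terminal_state = n_states - 1
    for a in actions:
        s_next, r = step(state, a, n_states)
        done = s_next == terminal_state
        td0_update(q, state, a, r, s_next, n_actions=2, terminal=done)
        model[(state, a)] = s_next  # the environment model is built as the agent acts
        state = s_next
        if done:
            break
    dist = distances_to_terminal(model, terminal_state)
    epoch_update(q, model, dist, terminal_reward=1.0)
    return q, dist
```

Calling `run_episode([1, 1, 0, 1, 1, 1])` walks the chain with one detour to the left. In this sketch the epoch pass then assigns values along the shortest recorded path, while the detour action keeps whatever value the incremental updates gave it.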

eISSN: 2083-8492
ISSN: 1641-876X
Language: English

Publication timeframe: 4 times per year
Journal Subjects: Mathematics, Applied Mathematics
