Towards Direct Policy Search Reinforcement Learning for Robot Control
dc.contributor.author
dc.date.accessioned
2010-05-06T10:39:59Z
dc.date.available
2010-05-03T15:07:26Z
2010-05-06T10:39:59Z
dc.date.issued
2006
dc.identifier.citation
El-Fakdi, A., Carreras, M., i Ridao, P. (2006). Towards Direct Policy Search Reinforcement Learning for Robot Control. IEEE/RSJ International Conference on Intelligent Robots and Systems, 3178 - 3183. Recuperat 06 maig 2010, a http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4058885
dc.identifier.isbn
1-4244-0258-1
dc.identifier.uri
dc.description.abstract
This paper proposes a high-level reinforcement learning (RL) control system for solving the action selection problem of an autonomous robot. Although the dominant approach, when using RL, has been to apply value function based algorithms, the system here detailed is characterized by the use of direct policy search methods. Rather than approximating a value function, these methodologies approximate a policy using an independent function approximator with its own parameters, trying to maximize the future expected reward. The policy based algorithm presented in this paper is used for learning the internal state/action mapping of a behavior. In this preliminary work, we demonstrate its feasibility with simulated experiments using the underwater robot GARBI in a target reaching task
dc.format.mimetype
application/pdf
dc.language.iso
eng
dc.publisher
IEEE
dc.relation.isformatof
Reproducció digital del document publicat a: http://dx.doi.org/10.1109/IROS.2006.282342
dc.relation.ispartof
© IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006, p. 3178-3183
dc.relation.ispartofseries
Articles publicats (D-ATC)
dc.rights
Tots els drets reservats
dc.subject
dc.title
Towards Direct Policy Search Reinforcement Learning for Robot Control
dc.type
info:eu-repo/semantics/article
dc.rights.accessRights
info:eu-repo/semantics/openAccess
dc.identifier.doi