Towards Direct Policy Search Reinforcement Learning for Robot Control

El-Fakdi Sencianes, Andrés; Carreras Pérez, Marc; Ridao Rodríguez, Pere

Towards Direct Policy Search Reinforcement Learning for Robot Control

dc.contributor.author

El-Fakdi Sencianes, Andrés

Carreras Pérez, Marc

Ridao Rodríguez, Pere

dc.date.accessioned

2010-05-06T10:39:59Z

dc.date.available

2010-05-03T15:07:26Z

2010-05-06T10:39:59Z

dc.date.issued

2006

dc.identifier.citation

El-Fakdi, A., Carreras, M., i Ridao, P. (2006). Towards Direct Policy Search Reinforcement Learning for Robot Control. IEEE/RSJ International Conference on Intelligent Robots and Systems, 3178 - 3183. Recuperat 06 maig 2010, a http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4058885

dc.identifier.isbn

1-4244-0258-1

dc.identifier.uri

http://hdl.handle.net/10256/2221

dc.description.abstract

This paper proposes a high-level reinforcement learning (RL) control system for solving the action selection problem of an autonomous robot. Although the dominant approach, when using RL, has been to apply value function based algorithms, the system here detailed is characterized by the use of direct policy search methods. Rather than approximating a value function, these methodologies approximate a policy using an independent function approximator with its own parameters, trying to maximize the future expected reward. The policy based algorithm presented in this paper is used for learning the internal state/action mapping of a behavior. In this preliminary work, we demonstrate its feasibility with simulated experiments using the underwater robot GARBI in a target reaching task

dc.format.mimetype

application/pdf

dc.language.iso

eng

dc.publisher

IEEE

dc.relation.isformatof

Reproducció digital del document publicat a: http://dx.doi.org/10.1109/IROS.2006.282342

dc.relation.ispartof

dc.relation.ispartofseries

Articles publicats (D-ATC)

dc.rights