Policy gradient based Reinforcement Learning for real autonomous underwater cable tracking
dc.contributor.author
dc.date.accessioned
2010-05-06T09:40:36Z
dc.date.available
2010-05-03T15:07:15Z
2010-05-06T09:40:36Z
dc.date.issued
2008
dc.identifier.citation
El-Fakdi, A., i Carreras, M. (2008). IEEE/RSJ International Conference on Intelligent Robots and Systems : 2008 : IROS 2008, 3635 - 3640. Recuperat 06 maig 2010, a http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4650873
dc.identifier.isbn
978-1-4244-2057-5
dc.identifier.uri
dc.description.abstract
This paper proposes a field application of a high-level reinforcement learning (RL) control system for solving the action selection problem of an autonomous robot in cable tracking task. The learning system is characterized by using a direct policy search method for learning the internal state/action mapping. Policy only algorithms may suffer from long convergence times when dealing with real robotics. In order to speed up the process, the learning phase has been carried out in a simulated environment and, in a second step, the policy has been transferred and tested successfully on a real robot. Future steps plan to continue the learning process on-line while on the real robot while performing the mentioned task. We demonstrate its feasibility with real experiments on the underwater robot ICTINEU AUV
dc.format.mimetype
application/pdf
dc.language.iso
eng
dc.publisher
IEEE
dc.relation.isformatof
Reproducció digital del document publicat a: http://dx.doi.org/10.1109/IROS.2008.4650873
dc.relation.ispartof
© IEEE/RSJ International Conference on Intelligent Robots and Systems : 2008 : IROS 2008, 2008, p. 3635-3640
dc.relation.ispartofseries
Articles publicats (D-ATC)
dc.rights
Tots els drets reservats
dc.title
Policy gradient based Reinforcement Learning for real autonomous underwater cable tracking
dc.type
info:eu-repo/semantics/article
dc.rights.accessRights
info:eu-repo/semantics/openAccess
dc.identifier.doi