Exploiting symmetries in reinforcement learning of bimanual robotic tasks

© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to s...

Descripción completa

Detalles Bibliográficos
Autores: Amadio, Fabio, Colomé Figueras, Adrià, Torras, Carme|||0000-0002-2933-398X
Tipo de recurso: artículo
Fecha de publicación:2019
País:España
Institución:Universitat Politècnica de Catalunya (UPC)
Repositorio:UPCommons. Portal del coneixement obert de la UPC
Idioma:inglés
OAI Identifier:oai:upcommons.upc.edu:2117/178327
Acceso en línea:https://hdl.handle.net/2117/178327
https://dx.doi.org/10.1109/LRA.2019.2898330
Access Level:acceso abierto
Palabra clave:Humanoid robots
Learning
Artificial intelligence
Manipulators
Classificació INSPEC::Cybernetics::Artificial intelligence
Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial
Àrees temàtiques de la UPC::Informàtica::Robòtica
id ES_f8b8abe36e63e348e104f877384c9fd6
oai_identifier_str oai:upcommons.upc.edu:2117/178327
network_acronym_str ES
network_name_str España
repository_id_str
spelling Exploiting symmetries in reinforcement learning of bimanual robotic tasksAmadio, FabioColomé Figueras, AdriàTorras, Carme|||0000-0002-2933-398XHumanoid robotsLearningArtificial intelligenceManipulatorsClassificació INSPEC::Cybernetics::Artificial intelligenceÀrees temàtiques de la UPC::Informàtica::Intel·ligència artificialÀrees temàtiques de la UPC::Informàtica::Robòtica© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.Movement Primitives (MPs) have been widely adopted for representing and learning robotic movements using Reinforcement Learning Policy Search. Probabilistic Movement Primitives (ProMPs) are a kind of MP based on a stochastic representation over sets of trajectories, able of capturing the variability allowed while executing a movement. This approach has proved effective in learning a wide range of robotic movements, but it comes with the need of dealing with a high-dimensional space of parameters. This may be a critical problem when learning tasks with two robotic manipulators, and this work proposes an approach to reduce the dimension of the parameter space based on the exploitation of symmetry. A symmetrization method for ProMPs is presented and used to represent two movements, employing a single ProMP for the first arm and a symmetry surface that maps that ProMP to the second arm. This symmetric representation is then adopted in reinforcement learning of bimanual tasks (from user-provided demonstrations), using Relative Entropy Policy Search (REPS) algorithm. The symmetry-based approach developed has been tested in an experiment of cloth manipulation, showing a speed increment in learning the task.Peer ReviewedInstitute of Electrical and Electronics Engineers (IEEE)20192019-01-0120202020-02-21journal articlehttp://purl.org/coar/resource_type/c_6501AMhttp://purl.org/coar/version/c_ab4af688f83e57aainfo:eu-repo/semantics/articleapplication/pdfhttps://hdl.handle.net/2117/178327https://dx.doi.org/10.1109/LRA.2019.2898330reponame:UPCommons. Portal del coneixement obert de la UPCinstname:Universitat Politècnica de Catalunya (UPC)InglésengEuropean Commission http://doi.org/10.13039/100010661 Horizon 2020 Framework Programme 741930 CLOTH manIpulation Learning from DEmonstrationsopen accesshttp://purl.org/coar/access_right/c_abf2Attribution-NonCommercial-NoDerivs 3.0 Spainhttp://creativecommons.org/licenses/by-nc-nd/3.0/es/info:eu-repo/semantics/openAccessoai:upcommons.upc.edu:2117/1783272026-05-27T15:37:01Z
dc.title.none.fl_str_mv Exploiting symmetries in reinforcement learning of bimanual robotic tasks
title Exploiting symmetries in reinforcement learning of bimanual robotic tasks
spellingShingle Exploiting symmetries in reinforcement learning of bimanual robotic tasks
Amadio, Fabio
Humanoid robots
Learning
Artificial intelligence
Manipulators
Classificació INSPEC::Cybernetics::Artificial intelligence
Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial
Àrees temàtiques de la UPC::Informàtica::Robòtica
title_short Exploiting symmetries in reinforcement learning of bimanual robotic tasks
title_full Exploiting symmetries in reinforcement learning of bimanual robotic tasks
title_fullStr Exploiting symmetries in reinforcement learning of bimanual robotic tasks
title_full_unstemmed Exploiting symmetries in reinforcement learning of bimanual robotic tasks
title_sort Exploiting symmetries in reinforcement learning of bimanual robotic tasks
dc.creator.none.fl_str_mv Amadio, Fabio
Colomé Figueras, Adrià
Torras, Carme|||0000-0002-2933-398X
author Amadio, Fabio
author_facet Amadio, Fabio
Colomé Figueras, Adrià
Torras, Carme|||0000-0002-2933-398X
author_role author
author2 Colomé Figueras, Adrià
Torras, Carme|||0000-0002-2933-398X
author2_role author
author
dc.subject.none.fl_str_mv Humanoid robots
Learning
Artificial intelligence
Manipulators
Classificació INSPEC::Cybernetics::Artificial intelligence
Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial
Àrees temàtiques de la UPC::Informàtica::Robòtica
topic Humanoid robots
Learning
Artificial intelligence
Manipulators
Classificació INSPEC::Cybernetics::Artificial intelligence
Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial
Àrees temàtiques de la UPC::Informàtica::Robòtica
description © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
publishDate 2019
dc.date.none.fl_str_mv 2019
2019-01-01
2020
2020-02-21
dc.type.none.fl_str_mv journal article
http://purl.org/coar/resource_type/c_6501
AM
http://purl.org/coar/version/c_ab4af688f83e57aa
dc.type.openaire.fl_str_mv info:eu-repo/semantics/article
format article
dc.identifier.none.fl_str_mv https://hdl.handle.net/2117/178327
https://dx.doi.org/10.1109/LRA.2019.2898330
url https://hdl.handle.net/2117/178327
https://dx.doi.org/10.1109/LRA.2019.2898330
dc.language.none.fl_str_mv Inglés
eng
language_invalid_str_mv Inglés
language eng
dc.relation.none.fl_str_mv European Commission http://doi.org/10.13039/100010661 Horizon 2020 Framework Programme 741930 CLOTH manIpulation Learning from DEmonstrations
dc.rights.none.fl_str_mv open access
http://purl.org/coar/access_right/c_abf2
Attribution-NonCommercial-NoDerivs 3.0 Spain
http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.rights.openaire.fl_str_mv info:eu-repo/semantics/openAccess
rights_invalid_str_mv open access
http://purl.org/coar/access_right/c_abf2
Attribution-NonCommercial-NoDerivs 3.0 Spain
http://creativecommons.org/licenses/by-nc-nd/3.0/es/
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Institute of Electrical and Electronics Engineers (IEEE)
publisher.none.fl_str_mv Institute of Electrical and Electronics Engineers (IEEE)
dc.source.none.fl_str_mv reponame:UPCommons. Portal del coneixement obert de la UPC
instname:Universitat Politècnica de Catalunya (UPC)
instname_str Universitat Politècnica de Catalunya (UPC)
reponame_str UPCommons. Portal del coneixement obert de la UPC
collection UPCommons. Portal del coneixement obert de la UPC
repository.name.fl_str_mv
repository.mail.fl_str_mv
_version_ 1869425028028694528
score 15.300724