Dynamic robotic tracking of underwater targets using reinforcement learning

To realize the potential of autonomous underwater robots that scale up our observational capacity in the ocean, new techniques are needed. Fleets of autonomous robots could be used to study complex marine systems and animals with either new imaging configurations or by tracking tagged animals to stu...

Full description

Bibliographic Details
Authors: Masmitjà Rusiñol, Ivan|||0000-0001-6355-7955, Martín Muñoz, Mario|||0000-0002-4125-6630, O’Reilly, Tom, Kieft, Brian, Palomeras Rovira, Narcís, Navarro Bernabé, Joan, Katija, Kakani
Format: article
Publication Date:2023
Country:España
Institution:Universitat Politècnica de Catalunya (UPC)
Repository:UPCommons. Portal del coneixement obert de la UPC
Language:English
OAI Identifier:oai:upcommons.upc.edu:2117/393215
Online Access:https://hdl.handle.net/2117/393215
https://dx.doi.org/10.1126/scirobotics.ade7811
Access Level:Open access
Keyword:Autonomous underwater vehicles
Reinforcement learning
Vehicles submergibles autònoms
Aprenentatge per reforç
Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Aprenentatge automàtic
Àrees temàtiques de la UPC::Informàtica::Robòtica
id ES_c36082178faf92bbc95deb02f7cfcef4
oai_identifier_str oai:upcommons.upc.edu:2117/393215
network_acronym_str ES
network_name_str España
repository_id_str
spelling Dynamic robotic tracking of underwater targets using reinforcement learningMasmitjà Rusiñol, Ivan|||0000-0001-6355-7955Martín Muñoz, Mario|||0000-0002-4125-6630O’Reilly, TomKieft, BrianPalomeras Rovira, NarcísNavarro Bernabé, JoanKatija, KakaniAutonomous underwater vehiclesReinforcement learningVehicles submergibles autònomsAprenentatge per reforçÀrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Aprenentatge automàticÀrees temàtiques de la UPC::Informàtica::RobòticaTo realize the potential of autonomous underwater robots that scale up our observational capacity in the ocean, new techniques are needed. Fleets of autonomous robots could be used to study complex marine systems and animals with either new imaging configurations or by tracking tagged animals to study their behavior. These activities can then inform and create new policies for community conservation. The role of animal connectivity via active movement of animals represents a major knowledge gap related to the distribution of deep ocean populations. Tracking underwater targets represents a major challenge for observing biological processes in situ, and methods to robustly respond to a changing environment during monitoring missions are needed. Analytical techniques for optimal sensor placement and path planning to locate underwater targets are not straightforward in such cases. The aim of this study was to investigate the use of reinforcement learning as a tool for range-only underwater target-tracking optimization, whose promising capabilities have been demonstrated in terrestrial scenarios. To evaluate its usefulness, a reinforcement learning method was implemented as a path planning system for an autonomous surface vehicle while tracking an underwater mobile target. A complete description of an open-source model, performance metrics in simulated environments, and evaluated algorithms based on more than 15 hours of at-sea field experiments are presented. These efforts demonstrate that deep reinforcement learning is a powerful approach that enhances the abilities of autonomous robots in the ocean and encourages the deployment of algorithms like these for monitoring marine biological systems in the future.Funding: The European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie, grant agreement No 893089 (IM). The Spanish Ministerio de Economia y Competitividad under the project SASES, grant agreement No RTI2018-095112-B-I00 (IM). The Spanish Ministerio de Economia y Competitividad under the project BITER-ECO, grant agreement No PID2020-114732RB-C31 (IM, JN). The Spanish Ministerio de Economia y Competitividad under the project BITER-AUV, grant agreement No PID2020-114732RB-C33 (NP).Peer ReviewedAmerican Association for the Advancement of Science (AAAS)20232023-07-2620232023-09-07journal articlehttp://purl.org/coar/resource_type/c_6501AMhttp://purl.org/coar/version/c_ab4af688f83e57aainfo:eu-repo/semantics/articleapplication/pdfhttps://hdl.handle.net/2117/393215https://dx.doi.org/10.1126/scirobotics.ade7811reponame:UPCommons. Portal del coneixement obert de la UPCinstname:Universitat Politècnica de Catalunya (UPC)InglésengAgencia Estatal de Investigación http://doi.org/10.13039/501100011033 Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020 RTI2018-095112-B-I00 SISTEMAS ACUSTICOS SUBMARINOS PARA LA MONITORIZACION DEL COMPORTAMIENTO ESPACIAL DE ESPECIESopen accesshttp://purl.org/coar/access_right/c_abf2info:eu-repo/semantics/openAccessoai:upcommons.upc.edu:2117/3932152026-05-27T15:37:01Z
dc.title.none.fl_str_mv Dynamic robotic tracking of underwater targets using reinforcement learning
title Dynamic robotic tracking of underwater targets using reinforcement learning
spellingShingle Dynamic robotic tracking of underwater targets using reinforcement learning
Masmitjà Rusiñol, Ivan|||0000-0001-6355-7955
Autonomous underwater vehicles
Reinforcement learning
Vehicles submergibles autònoms
Aprenentatge per reforç
Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Aprenentatge automàtic
Àrees temàtiques de la UPC::Informàtica::Robòtica
title_short Dynamic robotic tracking of underwater targets using reinforcement learning
title_full Dynamic robotic tracking of underwater targets using reinforcement learning
title_fullStr Dynamic robotic tracking of underwater targets using reinforcement learning
title_full_unstemmed Dynamic robotic tracking of underwater targets using reinforcement learning
title_sort Dynamic robotic tracking of underwater targets using reinforcement learning
dc.creator.none.fl_str_mv Masmitjà Rusiñol, Ivan|||0000-0001-6355-7955
Martín Muñoz, Mario|||0000-0002-4125-6630
O’Reilly, Tom
Kieft, Brian
Palomeras Rovira, Narcís
Navarro Bernabé, Joan
Katija, Kakani
author Masmitjà Rusiñol, Ivan|||0000-0001-6355-7955
author_facet Masmitjà Rusiñol, Ivan|||0000-0001-6355-7955
Martín Muñoz, Mario|||0000-0002-4125-6630
O’Reilly, Tom
Kieft, Brian
Palomeras Rovira, Narcís
Navarro Bernabé, Joan
Katija, Kakani
author_role author
author2 Martín Muñoz, Mario|||0000-0002-4125-6630
O’Reilly, Tom
Kieft, Brian
Palomeras Rovira, Narcís
Navarro Bernabé, Joan
Katija, Kakani
author2_role author
author
author
author
author
author
dc.subject.none.fl_str_mv Autonomous underwater vehicles
Reinforcement learning
Vehicles submergibles autònoms
Aprenentatge per reforç
Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Aprenentatge automàtic
Àrees temàtiques de la UPC::Informàtica::Robòtica
topic Autonomous underwater vehicles
Reinforcement learning
Vehicles submergibles autònoms
Aprenentatge per reforç
Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Aprenentatge automàtic
Àrees temàtiques de la UPC::Informàtica::Robòtica
description To realize the potential of autonomous underwater robots that scale up our observational capacity in the ocean, new techniques are needed. Fleets of autonomous robots could be used to study complex marine systems and animals with either new imaging configurations or by tracking tagged animals to study their behavior. These activities can then inform and create new policies for community conservation. The role of animal connectivity via active movement of animals represents a major knowledge gap related to the distribution of deep ocean populations. Tracking underwater targets represents a major challenge for observing biological processes in situ, and methods to robustly respond to a changing environment during monitoring missions are needed. Analytical techniques for optimal sensor placement and path planning to locate underwater targets are not straightforward in such cases. The aim of this study was to investigate the use of reinforcement learning as a tool for range-only underwater target-tracking optimization, whose promising capabilities have been demonstrated in terrestrial scenarios. To evaluate its usefulness, a reinforcement learning method was implemented as a path planning system for an autonomous surface vehicle while tracking an underwater mobile target. A complete description of an open-source model, performance metrics in simulated environments, and evaluated algorithms based on more than 15 hours of at-sea field experiments are presented. These efforts demonstrate that deep reinforcement learning is a powerful approach that enhances the abilities of autonomous robots in the ocean and encourages the deployment of algorithms like these for monitoring marine biological systems in the future.
publishDate 2023
dc.date.none.fl_str_mv 2023
2023-07-26
2023
2023-09-07
dc.type.none.fl_str_mv journal article
http://purl.org/coar/resource_type/c_6501
AM
http://purl.org/coar/version/c_ab4af688f83e57aa
dc.type.openaire.fl_str_mv info:eu-repo/semantics/article
format article
dc.identifier.none.fl_str_mv https://hdl.handle.net/2117/393215
https://dx.doi.org/10.1126/scirobotics.ade7811
url https://hdl.handle.net/2117/393215
https://dx.doi.org/10.1126/scirobotics.ade7811
dc.language.none.fl_str_mv Inglés
eng
language_invalid_str_mv Inglés
language eng
dc.relation.none.fl_str_mv Agencia Estatal de Investigación http://doi.org/10.13039/501100011033 Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020 RTI2018-095112-B-I00 SISTEMAS ACUSTICOS SUBMARINOS PARA LA MONITORIZACION DEL COMPORTAMIENTO ESPACIAL DE ESPECIES
dc.rights.none.fl_str_mv open access
http://purl.org/coar/access_right/c_abf2
dc.rights.openaire.fl_str_mv info:eu-repo/semantics/openAccess
rights_invalid_str_mv open access
http://purl.org/coar/access_right/c_abf2
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv American Association for the Advancement of Science (AAAS)
publisher.none.fl_str_mv American Association for the Advancement of Science (AAAS)
dc.source.none.fl_str_mv reponame:UPCommons. Portal del coneixement obert de la UPC
instname:Universitat Politècnica de Catalunya (UPC)
instname_str Universitat Politècnica de Catalunya (UPC)
reponame_str UPCommons. Portal del coneixement obert de la UPC
collection UPCommons. Portal del coneixement obert de la UPC
repository.name.fl_str_mv
repository.mail.fl_str_mv
_version_ 1869418790418120704
score 15,300724