Efficient hand gesture recognition for human-robot interaction

In this paper, we present an efficient and reliable deep-learning approach that allows users to communicate with robots via hand gesture recognition. Contrary to other works which use external devices such as gloves [1] or joysticks [2] to tele-operate robots, the proposed approach uses only visual...

Descripción completa

Detalles Bibliográficos
Autores: Peral Sánchez, Marc|||0000-0001-6521-6476, Sanfeliu Cortés, Alberto|||0000-0003-3868-9678, Garrell Zulueta, Anais|||0000-0002-4629-0723
Tipo de recurso: artículo
Fecha de publicación:2022
País:España
Institución:Universitat Politècnica de Catalunya (UPC)
Repositorio:UPCommons. Portal del coneixement obert de la UPC
Idioma:inglés
OAI Identifier:oai:upcommons.upc.edu:2117/378658
Acceso en línea:https://hdl.handle.net/2117/378658
https://dx.doi.org/10.1109/LRA.2022.3193251
Access Level:acceso abierto
Palabra clave:Human-robot interaction
Human robot interaction
Hand gesture recognition
Interacció persona-robot
Àrees temàtiques de la UPC::Informàtica::Robòtica
Descripción
Sumario:In this paper, we present an efficient and reliable deep-learning approach that allows users to communicate with robots via hand gesture recognition. Contrary to other works which use external devices such as gloves [1] or joysticks [2] to tele-operate robots, the proposed approach uses only visual information to recognize user's instructions that are encoded in a set of pre-defined hand gestures. Particularly, the method consists of two modules which work sequentially to extract 2D landmarks of hands –ie. joints positions– and to predict the hand gesture based on a temporal representation of them. The approach has been validated in a recent state-of-the-art dataset where it outperformed other methods that use multiple pre-processing steps such as optical flow and semantic segmentation. Our method achieves an accuracy of 87.5% and runs at 10 frames per second. Finally, we conducted real-life experiments with our IVO robot to validate the framework during the interaction process.