Efficient hand gesture recognition for human-robot interaction

In this paper, we present an efficient and reliable deep-learning approach that allows users to communicate with robots via hand gesture recognition. Contrary to other works which use external devices such as gloves [1] or joysticks [2] to tele-operate robots, the proposed approach uses only visual...

Full description

Bibliographic Details
Authors: Peral Sánchez, Marc|||0000-0001-6521-6476, Sanfeliu Cortés, Alberto|||0000-0003-3868-9678, Garrell Zulueta, Anais|||0000-0002-4629-0723
Format: article
Publication Date:2022
Country:España
Institution:Universitat Politècnica de Catalunya (UPC)
Repository:UPCommons. Portal del coneixement obert de la UPC
Language:English
OAI Identifier:oai:upcommons.upc.edu:2117/378658
Online Access:https://hdl.handle.net/2117/378658
https://dx.doi.org/10.1109/LRA.2022.3193251
Access Level:Open access
Keyword:Human-robot interaction
Human robot interaction
Hand gesture recognition
Interacció persona-robot
Àrees temàtiques de la UPC::Informàtica::Robòtica
Description
Summary:In this paper, we present an efficient and reliable deep-learning approach that allows users to communicate with robots via hand gesture recognition. Contrary to other works which use external devices such as gloves [1] or joysticks [2] to tele-operate robots, the proposed approach uses only visual information to recognize user's instructions that are encoded in a set of pre-defined hand gestures. Particularly, the method consists of two modules which work sequentially to extract 2D landmarks of hands –ie. joints positions– and to predict the hand gesture based on a temporal representation of them. The approach has been validated in a recent state-of-the-art dataset where it outperformed other methods that use multiple pre-processing steps such as optical flow and semantic segmentation. Our method achieves an accuracy of 87.5% and runs at 10 frames per second. Finally, we conducted real-life experiments with our IVO robot to validate the framework during the interaction process.