Article,

HandSense: smart multimodal hand gesture recognition based on deep neural networks

, , and .
Journal of Ambient Intelligence and Humanized Computing, (Aug 23, 2018)
DOI: 10.1007/s12652-018-0989-7

Abstract

Hand gesture recognition (HGR) is a promising enabler for human--computer interaction (HCI). Hand gestures are normally classified into multi-modal actions, including static gestures, fine-grained dynamic gestures, and coarse-grained dynamic gestures. Among them, the fine-grained action detection is limited under the small-scale image region condition. To solve this problem, we propose the HandSense, a new system for the multi-modal HGR based on a combined RGB and depth cameras to improve the fine-grained action descriptors as well as preserve the ability to perform general action recognition. First of all, two interconnected 3D convolutional neural network (3D-CNN) are employed to extract the spatial--temporal features from the RGB and depth images. Second, these spatial--temporal features are integrated into a fusion feature. Finally, the Support Vector Machine (SVM) is used to recognize different gestures based on the fusion feature. To validate the effectiveness of the HandSense, the extensive experiments are conducted on the public gesture dataset, namely the SKIG hand gesture dataset. In addition, the feasibility of the proposed system is also demonstrated by using a challenging multi-modal RGB-Depth hand gesture dataset.

Tags

Users

  • @jpmor

Comments and Reviews