A multi-modal or multi-view dataset is captured from various sources (e.g., RGB and depth) of the same subject at the same time. Combining different cues still faces many challenges, such as heterogeneous data and complementary information. In addition, existing methods for multi-modal recognition typically consist of discrete blocks: extracting features from separate data streams, combining the features, and classifying gestures. To address these challenges, we propose two novel end-to-end hand posture recognition frameworks that integrate all steps, from capturing various types of cues (RGB and depth images) to classifying hand gesture labels, into a single convolutional neural network (CNN) system. Both frameworks use a ResNet50 backbone pretrained on the ImageNet dataset, and are named the attention convolution module (ACM) and the gated concatenation module (GCM). Both are deployed, evaluated, and compared on various multi-modal hand posture datasets. Experimental results show that our proposed methods outperform state-of-the-art (SOTA) techniques.
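The abstract does not detail the internals of the gated concatenation module; as a minimal NumPy sketch of one plausible form of gated concatenation, the per-modality backbone features are concatenated and then scaled by a learned sigmoid gate (the gate parameters `W_g`, `b_g` and feature dimension `d` here are hypothetical, not taken from the paper):

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gated_concat(rgb_feat, depth_feat, W_g, b_g):
    """Fuse two modalities by gating their concatenated features.

    rgb_feat, depth_feat: (batch, d) backbone feature vectors.
    W_g: (2d, 2d) gate weights, b_g: (2d,) gate bias -- hypothetical
    learned parameters standing in for the paper's GCM.
    """
    fused = np.concatenate([rgb_feat, depth_feat], axis=1)  # (batch, 2d)
    gate = sigmoid(fused @ W_g + b_g)                       # per-feature weights in (0, 1)
    return gate * fused                                     # gated concatenation

# Toy features standing in for ResNet50 outputs on RGB and depth inputs.
d = 4
rgb = rng.standard_normal((2, d))
depth = rng.standard_normal((2, d))
W_g = rng.standard_normal((2 * d, 2 * d)) * 0.1
b_g = np.zeros(2 * d)
out = gated_concat(rgb, depth, W_g, b_g)
print(out.shape)  # (2, 8): both modalities fused into one vector per sample
```

Because the gate lies in (0, 1), each fused feature is attenuated relative to the raw concatenation, letting the network learn how much each modality contributes before classification.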
Copyright © 2022