• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Li, Jinghua (Li, Jinghua.) | Huai, Huarui (Huai, Huarui.) | Gao, Junbin (Gao, Junbin.) | Kong, Dehui (Kong, Dehui.) (Scholars:孔德慧) | Wang, Lichun (Wang, Lichun.) (Scholars:王立春)

Indexed by:

EI Scopus SCIE

Abstract:

Hand gesture is a kind of natural interaction way and hand gesture recognition has recently become more and more popular in human-computer interaction. However, the complexity and variations of hand gesture like various illuminations, views, and self-structural characteristics make the hand gesture recognition still challengeable. How to design an appropriate feature representation and classifier are the core problems. To this end, this paper develops an expressive deep hybrid hand gesture recognition architecture called CNN-MVRBM-NN. The framework consists of three submodels. The CNN submodel automatically extracts frame-level spatial features, and the MVRBM submodel fuses spatial information over time for training higher level semantics inherent in hand gesture, while the NN submodel classifies hand gesture, which is initialized by MVRBM for second order data representation, and then such NN pre-trained by MVRBM can be fine-tuned by back propagation so as to be more discriminative. The experimental results on Cambridge Hand Gesture Data set show the proposed hybrid CNN-MVRBM-NN has obtained the state-of-the-art recognition performance.

Keyword:

Neural network (NN) Matrix variate restricted Boltzmann machine (MVRBM) Hand gesture recognition Convolutional neural networks (CNN)

Author Community:

  • [ 1 ] [Li, Jinghua]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
  • [ 2 ] [Huai, Huarui]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
  • [ 3 ] [Kong, Dehui]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
  • [ 4 ] [Wang, Lichun]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
  • [ 5 ] [Gao, Junbin]Univ Sydney, Sch Business, Discipline Business Analyt, Sydney, NSW 2006, Australia

Reprint Author's Address:

  • [Li, Jinghua]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China

Show more details

Related Keywords:

Related Article:

Source :

JOURNAL ON MULTIMODAL USER INTERFACES

ISSN: 1783-7677

Year: 2019

Issue: 4

Volume: 13

Page: 363-371

2 . 9 0 0

JCR@2022

ESI Discipline: COMPUTER SCIENCE;

ESI HC Threshold:147

JCR Journal Grade:3

Cited Count:

WoS CC Cited Count: 9

SCOPUS Cited Count: 12

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 0

Online/Total:688/10592102
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.