Gesture Recognition with Focuses Using Hierarchical Body Part Combination - Details

Author：

Zhang, Cheng (Zhang, Cheng.) | Hou, Yibin (Hou, Yibin.) | He, Jian (He, Jian.) | Xie, Xiaoyang (Xie, Xiaoyang.)

Indexed by：

SCIE

Abstract：

Human　gesture　recognition　is　an　important　research　field　of　human-computer　interaction　due　to　its　potential　applications　in　various　fields,　but　existing　methods　still　face　challenges　in　achieving　high　levels　of　accuracy.　To　address　this　issue,　some　existing　researches　propose　to　fuse　the　global　features　with　the　cropped　features　called　focuses　on　vital　body　parts　like　hands.　However,　most　methods　rely　on　experience　when　choosing　the　focus,　the　scheme　of　focus　selection　is　not　discussed　in　detail.　In　this　paper,　a　hierarchical　body　part　combination　method　is　proposed　to　take　into　account　the　number,　combinations,　and　logical　relationships　between　body　parts.　The　proposed　method　generates　multiple　focuses　using　this　method　and　employs　chart-based　surface　modality　alongside　red-green-blue　and　optical　flow　modalities　to　enhance　each　focus.　A　feature-level　fusion　scheme　based　on　the　residual　connection　structure　is　proposed　to　fuse　different　modalities　at　convolution　stages,　and　a　focus　fusion　scheme　is　proposed　to　learn　the　relevancy　of　focus　channels　for　each　gesture　class　individually.　Experiments　conducted　on　ChaLearn　isolated　gesture　dataset　show　that　the　use　of　multiple　focuses　in　conjunction　with　multi-modal　features　and　fusion　strategies　leads　to　better　gesture　recognition　accuracy.

Keyword：

computer vision multi-modal gesture recognition human-computer interaction

Author Community：

[ 1 ] [Zhang, Cheng]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 2 ] [He, Jian]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 3 ] [Xie, Xiaoyang]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 4 ] [Hou, Yibin]Beijing Univ Technol, Beijing Engn Res Ctr IoT Software & Syst, Beijing 100124, Peoples R China
[ 5 ] [He, Jian]Beijing Univ Technol, Beijing Engn Res Ctr IoT Software & Syst, Beijing 100124, Peoples R China

Reprint Author's Address：

[He, Jian]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

Email：

zc5209@outlook.com |
ybhou@bjut.edu.cn |
jianhee@bjut.edu.cn |
xiexiaoyang@bjut.edu.cn

Show more details

Related Keywords：

MMF-Net: A novel multi-feature and multi-level fusion network for 3D human pose estimation
2025，IET COMPUTER VISION
A Survey of Visual Affordance Recognition Based on Deep Learning
2023，IEEE Transactions on Big Data
Fingertips-based gesture recognition for interaction
2012，11th ACM SIGGRAPH International Conference on Virtual-Reality Continuum and Its Applications in Industry, VRCAI 2012
A Robust Hand Gesture Recognition Method via Convolutional Neural Network
2016，6th International Conference on Digital Home, ICDH 2016

Source ：

TSINGHUA SCIENCE AND TECHNOLOGY

ISSN： 1007-0214

Year： 2025

Issue： 4

Volume： 30

Page： 1583-1599

6 . 6 0 0

JCR@2022

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 14

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to