3D Residual Networks with Channel-Spatial Attention Module for Action Recognition

Author：

Yi, Ziwen | Sun, Zhonghua | Feng, Jinchao (Scholars：冯金超) | Jia, Kebin (Scholars：贾克斌)

Indexed by：

CPCI-S EI Scopus

Abstract：

Effectively modeling spatio-temporal information in the videos is the key to improving the performance of action recognition. In this work, we propose 3D residual networks with channel and spatial attention modules for action recognition. The proposed network architecture can directly extract spatiotemporal features. Channel attention module and spatial attention module can effectively assist the network to learn what and where to emphasize or suppress, at virtually negligible increase in computation cost. Specifically, we sequentially add channel attention module and spatial attention module to each slice tensor of the intermediate feature map to form channel and spatial attention maps. Then the attention maps are multiplied to the input feature map to reweight important features. We validate our network through extensive experiments and visualization method on the datasets of HMDB-51 and UCF-101.
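
The sequential channel-then-spatial reweighting described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The abstract does not give the internals of either module, so the sketch assumes a CBAM-style design (average and max pooling, a shared two-layer MLP for channel attention, sigmoid gating), assumes "slice tensor" means a temporal slice of a (C, T, H, W) feature map, and replaces the spatial convolution with a simple two-weight mix of the pooled maps. All function names, parameter names (`w1`, `w2`, `k`), and shapes are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(x, w1, w2):
    """x: (C, H, W) slice. Returns per-channel weights of shape (C, 1, 1).

    Assumed CBAM-style: average- and max-pool over space, pass both
    through a shared two-layer MLP (w1, w2), sum, then apply a sigmoid.
    """
    avg = x.mean(axis=(1, 2))  # (C,)
    mx = x.max(axis=(1, 2))    # (C,)

    def mlp(v):
        return w2 @ np.maximum(w1 @ v, 0.0)  # ReLU hidden layer

    return sigmoid(mlp(avg) + mlp(mx))[:, None, None]

def spatial_attention(x, k):
    """x: (C, H, W) slice. Returns per-location weights of shape (1, H, W).

    Assumption: k holds two mixing weights standing in for the
    convolution a real module would apply to the pooled maps.
    """
    avg = x.mean(axis=0)  # (H, W)
    mx = x.max(axis=0)    # (H, W)
    return sigmoid(k[0] * avg + k[1] * mx)[None, :, :]

def attend(feat, w1, w2, k):
    """feat: (C, T, H, W). Apply channel, then spatial attention per temporal slice."""
    out = np.empty_like(feat)
    for t in range(feat.shape[1]):
        s = feat[:, t]                        # (C, H, W)
        s = s * channel_attention(s, w1, w2)  # reweight channels ("what")
        s = s * spatial_attention(s, k)       # reweight locations ("where")
        out[:, t] = s
    return out

# Toy usage with random data; sizes are illustrative only.
rng = np.random.default_rng(0)
feat = rng.standard_normal((8, 4, 6, 6))  # C=8, T=4, H=W=6
w1 = rng.standard_normal((2, 8))          # channel reduction 8 -> 2
w2 = rng.standard_normal((8, 2))          # expansion 2 -> 8
k = np.array([0.5, 0.5])
out = attend(feat, w1, w2, k)
```

Because both attention maps pass through a sigmoid, every weight lies strictly between 0 and 1, so the output has the same shape as the input and each value is scaled down in magnitude rather than amplified. In a 3D residual network the reweighted map would typically be added back to the block input, but the abstract does not say where the modules sit within the block.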

Keyword：

spatio-temporal features 3D residual networks attention module action recognition

Author Community：

[ 1 ] [Yi, Ziwen]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing, Peoples R China
[ 2 ] [Sun, Zhonghua]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing, Peoples R China
[ 3 ] [Feng, Jinchao]Beijing Univ Technol, Beijing Lab Adv Informat Networks, Beijing, Peoples R China
[ 4 ] [Jia, Kebin]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China

Reprint Author's Address：

[Sun, Zhonghua]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing, Peoples R China

Email：

yiziwen@emails.bjut.edu.cn |
sunzh@bjut.edu.cn |
fengjc@bjut.edu.cn |
kebinj@bjut.edu.cn

Related Keywords：

An Efficient Lightweight Spatio-temporal Attention Module for Action Recognition
2022，11th International Conference on Computing and Pattern Recognition, ICCPR 2022
Improved SlowFast Network with Spatial–Temporal Attention Module for Action Recognition
2023，
Combining channel-wise joint attention and temporal attention in graph convolutional networks for skeleton-based action recognition
2022，SIGNAL IMAGE AND VIDEO PROCESSING
Temporal Enhanced Multi-Stream Graph Convolutional Neural Networks for Skeleton-Based Action Recognition
2021，2021 China Automation Congress, CAC 2021

Source ：

2020 CHINESE AUTOMATION CONGRESS (CAC 2020)

ISSN： 2688-092X

Year： 2020

Page： 5171-5174

Language： English

Cited Count：

WoS CC Cited Count： 2

SCOPUS Cited Count： 4

ESI Highly Cited Papers on the List： 0

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 16

Affiliated Colleges：

School of Information Science and Technology: data not explicitly attributed to a unit within this school/faculty
