Abstract:
Event cameras asynchronously capture pixel-level intensity changes in a scene and output a stream of events. Compared with traditional frame-based cameras, they offer attractive imaging characteristics: low latency, high dynamic range, and low power consumption. This makes event cameras ideal for vision tasks in dynamic scenarios, such as human action recognition. The best-performing event-based algorithms convert events into frame-based representations and feed them into existing learning models. However, generating informative frames from long-duration event streams remains challenging because event cameras work asynchronously, without a fixed frame rate. In this work, we propose a novel frame-based representation named Compact Event Image (CEI) for action recognition. This representation is generated in a learnable way by a self-attention-based module named Event Tubelet Compressor (EVTC). The EVTC module adaptively summarizes the long-term dynamics and temporal patterns of events into a set of CEI frames. EVTC can be combined with conventional video backbones for end-to-end event-based action recognition. We evaluate our approach on three benchmark datasets, and experimental results show that it outperforms state-of-the-art methods by a large margin.
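The abstract describes EVTC as a self-attention module that compresses each tubelet of event frames into a single Compact Event Image. Below is a minimal, hypothetical PyTorch sketch of that idea; the names EVTC and CEI come from the abstract, but every shape, layer choice, and hyperparameter is an assumption, not the authors' implementation.

```python
# Hypothetical sketch of an "Event Tubelet Compressor" (EVTC).
# Only the module names come from the abstract; all details are assumed.
import torch
import torch.nn as nn


class EVTC(nn.Module):
    """Compress each tubelet of L event frames into one Compact Event Image.

    Input:  (B, T, L, C, H, W)  -- T tubelets of L event frames each
    Output: (B, T, C, H, W)     -- one CEI per tubelet
    """

    def __init__(self, channels: int, embed_dim: int = 128, heads: int = 4):
        super().__init__()
        # Per-frame token from spatially pooled features.
        self.embed = nn.Linear(channels, embed_dim)
        # Self-attention over the L frames inside a tubelet.
        self.attn = nn.MultiheadAttention(embed_dim, heads, batch_first=True)
        # Score head: one scalar weight per frame.
        self.score = nn.Linear(embed_dim, 1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, t, l, c, h, w = x.shape
        frames = x.reshape(b * t, l, c, h, w)
        tokens = self.embed(frames.mean(dim=(-2, -1)))   # (B*T, L, D)
        attended, _ = self.attn(tokens, tokens, tokens)  # (B*T, L, D)
        weights = self.score(attended).softmax(dim=1)    # (B*T, L, 1)
        # Attention-weighted sum of frames -> one compact image per tubelet.
        cei = (frames * weights.unsqueeze(-1).unsqueeze(-1)).sum(dim=1)
        return cei.reshape(b, t, c, h, w)


if __name__ == "__main__":
    evtc = EVTC(channels=2)  # e.g. two polarity channels
    events = torch.randn(1, 8, 16, 2, 64, 64)  # 8 tubelets x 16 frames
    print(evtc(events).shape)  # torch.Size([1, 8, 2, 64, 64])
```

The resulting (B, T, C, H, W) frame set could then be fed to any conventional video backbone, as the abstract suggests.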
Source: 2022 7th International Conference on Control, Robotics and Cybernetics (CRC)
Year: 2022
Page: 12-16
Cited Count:
WoS CC Cited Count: 2
SCOPUS Cited Count: 3
ESI Highly Cited Papers on the List: 0