Indexed by:
Abstract:
The objective of group activity recognition is to identify behaviors performed by multiple individuals within a given scene. However, current weakly supervised approaches often rely on object detectors or use self-attention mechanisms. The former approach is susceptible to background clutter and entails high computational costs, while the latter method learns weights from the input video and assigns them to key targets which is not reliable enough to find the key person. To address these limitations, we present a novel weakly supervised framework. Our proposed framework eliminates the need for ground-truth bounding boxes or object detectors. Meanwhile, it incorporates the semantics of individual action labels to replace self-attention to guide the learning process, enabling the extraction of more sophisticated semantic features relevant to activity. This approach also explores the interactions to promote group activity classification. Experimental results demonstrate that our method achieves state-of-the-art performances on both volleyball and collective datasets. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
Keyword:
Reprint Author's Address:
Email:
Source :
ISSN: 1865-0929
Year: 2023
Volume: 1910 CCIS
Page: 220-235
Language: English
Cited Count:
WoS CC Cited Count: 0
SCOPUS Cited Count:
ESI Highly Cited Papers on the List: 0 Unfold All
WanFang Cited Count:
Chinese Cited Count:
30 Days PV: 18
Affiliated Colleges: