CCST: crowd counting with swin transformer

Author：

Li, Bo | Zhang, Yong (张勇) | Xu, Haihui | Yin, Baocai

Indexed by：

EI Scopus SCIE

Abstract：

The purpose of crowd counting is to accurately estimate the number of individuals in an image. The task has always faced two major difficulties: the uneven distribution of crowd density and the large span of head sizes. To address the former, most CNN-based methods divide the image into multiple patches for processing, ignoring the connections between patches. For the latter, multi-scale feature fusion methods that use a feature pyramid ignore the matching relationship between head size and the hierarchical features. In response to these issues, we propose a crowd counting network named CCST, based on the swin transformer, and tailor a feature adaptive fusion regression head called FAFHead. The swin transformer can fully exchange information within and between patches, and effectively alleviates the problem of uneven crowd density distribution. FAFHead can adaptively fuse multi-level features, improve the matching relationship between head size and feature pyramid hierarchy, and relieve the problem of the large span of head sizes. Experimental results on common datasets show that CCST achieves better counting performance than all weakly supervised counting works and the great majority of popular density map-based fully supervised works.

Keyword：

Large span of head size; Crowd counting; Uneven distribution of crowd density; Feature adaptive fusion; Transformer

Author Community：

[ 1 ] [Li, Bo]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Dept Informat Sci, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
[ 2 ] [Zhang, Yong]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Dept Informat Sci, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
[ 3 ] [Yin, Baocai]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Dept Informat Sci, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
[ 4 ] [Xu, Haihui]Beijing Municipal Transportat Operat Coordinat Ct, Beijing 100161, Peoples R China

Reprint Author's Address：

Email：

zhangyong2010@bjut.edu.cn

Related Keywords：

Hypergraph Association Weakly Supervised Crowd Counting
2023，ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS
Multi-Level Dynamic Graph Convolutional Networks for Weakly Supervised Crowd Counting
2023，IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS
DTCC: Multi-level dilated convolution with transformer for weakly-supervised crowd counting
2023，COMPUTATIONAL VISUAL MEDIA
Double Recursive Sparse Self-attention Based Crowd Counting in the Cluttered Background
2022，

Source ：

VISUAL COMPUTER

ISSN： 0178-2789

Year： 2022

Issue： 7

Volume： 39

Page： 2671-2682

3.5

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：46

JCR Journal Grade：2

CAS Journal Grade：3

Cited Count：

WoS CC Cited Count： 24

SCOPUS Cited Count： 25

ESI Highly Cited Papers on the List： 0

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 9

Affiliated Colleges：
