Abstract:
Few-shot semantic segmentation aims to segment novel objects in an image given only a few annotated examples. In metric-based few-shot learning, a category is typically represented by averaging the global support-object features. However, a single prototype cannot accurately describe a category, and simple foreground averaging also ignores the dependencies between objects and their surroundings. In this paper, we propose a novel Transformer-based Prototype Search Network (TPSN) for few-shot segmentation. A transformer encoder integrates information across different image regions, and a decoder then expresses each category as multiple prototypes. This multi-prototype representation effectively alleviates the feature fluctuation caused by limited annotated data. Moreover, unlike previous few-shot prototype frameworks that rely on plain averaging, we perform an adaptive prototype search during multi-prototype extraction, which helps the network integrate information from different image regions and fuse object features with their dependent background context, yielding more reasonable prototype representations. In addition, to encourage a category's prototypes to focus on different object parts while remaining consistent in high-level semantics, we constrain multi-prototype training with diversity and consistency losses. Experiments show that our algorithm achieves state-of-the-art few-shot segmentation performance on two datasets: PASCAL-5i and COCO-20i. © 2022 Elsevier Ltd
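The multi-prototype idea in the abstract can be illustrated with a minimal sketch. This is not the paper's transformer-based prototype search; as a stand-in it clusters masked support features into several prototypes (plain k-means here), segments a query by best-prototype cosine similarity, and computes a diversity term over the prototypes. All function names, the threshold, and the clustering choice are illustrative assumptions.

```python
import numpy as np

def extract_prototypes(feats, mask, k=3, iters=10, seed=0):
    """Cluster foreground support features into k prototypes.

    Simplified stand-in for adaptive prototype search: plain k-means
    over masked features. feats is (H*W, C) float, mask is (H*W,) bool.
    """
    fg = feats[mask]
    rng = np.random.default_rng(seed)
    protos = fg[rng.choice(len(fg), size=k, replace=False)].copy()
    for _ in range(iters):
        # assign each foreground feature to its nearest prototype
        dists = np.linalg.norm(fg[:, None] - protos[None], axis=-1)
        assign = dists.argmin(axis=1)
        for j in range(k):
            if (assign == j).any():
                protos[j] = fg[assign == j].mean(axis=0)
    return protos

def cosine(a, b):
    """Pairwise cosine similarity between row vectors of a and b."""
    a = a / (np.linalg.norm(a, axis=-1, keepdims=True) + 1e-8)
    b = b / (np.linalg.norm(b, axis=-1, keepdims=True) + 1e-8)
    return a @ b.T

def segment_query(qfeats, protos, thresh=0.5):
    """Label a query feature foreground if its best prototype match exceeds thresh."""
    return cosine(qfeats, protos).max(axis=1) > thresh

def diversity_penalty(protos):
    """Mean pairwise cosine similarity among prototypes; lower means
    the prototypes cover more distinct parts (spirit of the diversity loss)."""
    s = cosine(protos, protos)
    k = len(protos)
    return (s.sum() - k) / (k * (k - 1))
```

With two well-separated foreground feature clusters and `k=2`, the recovered prototypes match the cluster centers, foreground query features score near 1 against their prototype, and the diversity penalty is near 0.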
Source:
Computers and Electrical Engineering
ISSN: 0045-7906
Year: 2022
Volume: 103
Impact Factor: 4.300 (JCR@2022)
ESI Discipline: COMPUTER SCIENCE;
ESI HC Threshold: 46
JCR Journal Grade: 2
CAS Journal Grade: 3
Cited Count:
WoS CC Cited Count: 0
SCOPUS Cited Count: 13
ESI Highly Cited Papers on the List: 0