• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Wang, W. (Wang, W..) | Duan, L. (Duan, L..) | En, Q. (En, Q..) | Zhang, B. (Zhang, B..) | Liang, F. (Liang, F..)

Indexed by:

EI Scopus SCIE

Abstract:

Few-shot semantic segmentation aims to segment new objects in the image with limited annotations. Typically, in metric-based few-shot learning, the expression of categories is obtained by averaging global support object information. However, a single prototype cannot accurately describe a category. Meanwhile, simple foreground averaging operations also ignore the dependencies between objects and their surroundings. In this paper, we propose a novel Transformer-based Prototype Search Network (TPSN) for few-shot segmentation. We use the transformer encoder to integrate information between different image regions and then use the decoder to express a category in terms of multiple prototypes. The multi-prototype approach can effectively alleviate the feature fluctuation caused by limited annotation data. Moreover, we use adaptive prototype search during multi-prototype extraction instead of the ordinary averaging operation compared with the previous few-shot prototype framework. This helps the network integrate the different image regions’ information and fuse object features with their dependent background information, obtaining more reasonable prototype expressions. In addition, to encourage the category's prototypes to focus on different parts and maintain consistency in high-level semantics, we use the diversity and consistency loss to constrain the multi-prototype training. Experiments show that our algorithm achieves state-of-the-art performance in few-shot segmentation on two datasets: PASCAL-5i and COCO-20i. © 2022 Elsevier Ltd

Keyword:

Few-shot learning Semantic segmentation Vision transformer Multiple prototypes

Author Community:

  • [ 1 ] [Wang, W.]Faculty of Information Technology, Beijing University of Technology, Beijing, China
  • [ 2 ] [Wang, W.]Beijing Key Laboratory of Trusted Computing, Beijing, China
  • [ 3 ] [Wang, W.]National Engineering Laboratory for Key Technologies of Information Security Level Protection, Beijing, China
  • [ 4 ] [Duan, L.]Faculty of Information Technology, Beijing University of Technology, Beijing, China
  • [ 5 ] [Duan, L.]Beijing Key Laboratory of Trusted Computing, Beijing, China
  • [ 6 ] [Duan, L.]National Engineering Laboratory for Key Technologies of Information Security Level Protection, Beijing, China
  • [ 7 ] [En, Q.]Artificial Intelligence and Machine Learning (AIML) Lab, School of Computer Science, Carleton University, Ottawa, K1S 5B6, Canada
  • [ 8 ] [Zhang, B.]Institute of Artificial Intelligence, Beihang University, Beijing, China
  • [ 9 ] [Liang, F.]Hebei Agricultural University, Hebei Key Laboratory of Agricultural Big Data, Baoding, China

Reprint Author's Address:

  • [Duan, L.]Faculty of Information Technology, China

Show more details

Related Keywords:

Source :

Computers and Electrical Engineering

ISSN: 0045-7906

Year: 2022

Volume: 103

4 . 3

JCR@2022

4 . 3 0 0

JCR@2022

ESI Discipline: COMPUTER SCIENCE;

ESI HC Threshold:46

JCR Journal Grade:2

CAS Journal Grade:3

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count: 13

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 14

Affiliated Colleges:

Online/Total:1261/10563746
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.