• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

En, Qing (En, Qing.) | Duan, Lijuan (Duan, Lijuan.) (Scholars:段立娟) | Zhang, Zhaoxiang (Zhang, Zhaoxiang.)

Indexed by:

EI Scopus SCIE

Abstract:

Weakly supervised video object segmentation (WSVOS) is a vital yet challenging task in which the aim is to segment pixel-level masks with only category labels. Existing methods still have certain limitations, e.g., difficulty in comprehending appropriate spatiotemporal knowledge and an inability to explore common semantic information with category labels. To overcome these challenges, we formulate a novel framework by integrating multisource saliency and incorporating an exemplar mechanism for WSVOS. Specifically, we propose a multisource saliency module to comprehend spatiotemporal knowledge by integrating spatial and temporal saliency as bottom-up cues, which can effectively eliminate disruptions due to confusing regions and identify attractive regions. Moreover, to our knowledge, we make the first attempt to incorporate an exemplar mechanism into WSVOS by proposing an adaptive exemplar module to process top-down cues, which can provide reliable guidance for co-occurring objects in intraclass videos and identify attentive regions. Our framework, which comprises the two aforementioned modules, offers a new perspective on directly constructing the correspondence between bottom-up cues and top-down cues when ground-truth information for the reference frames is lacking. Comprehensive experiments demonstrate that the proposed framework achieves state-of-the-art performance.

Keyword:

Task analysis video object segmentation Motion segmentation Object segmentation Annotations spatiotemporal saliency Feature extraction exemplar mechanism Spatiotemporal phenomena Weakly supervised learning Training

Author Community:

  • [ 1 ] [En, Qing]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Trusted Comp, Beijing 100124, Peoples R China
  • [ 2 ] [Duan, Lijuan]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Trusted Comp, Beijing 100124, Peoples R China
  • [ 3 ] [En, Qing]Natl Engn Lab Crit Technol Informat Secur Classif, Beijing 100124, Peoples R China
  • [ 4 ] [Duan, Lijuan]Natl Engn Lab Crit Technol Informat Secur Classif, Beijing 100124, Peoples R China
  • [ 5 ] [Zhang, Zhaoxiang]Chinese Acad Sci, Ctr Res Intelligent Percept & Comp, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
  • [ 6 ] [Zhang, Zhaoxiang]Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
  • [ 7 ] [Zhang, Zhaoxiang]Chinese Acad Sci, Ctr Artificial Intelligence & Robot, Hong Kong Inst Sci & Innovat, Hong Kong, Peoples R China

Reprint Author's Address:

  • [Zhang, Zhaoxiang]Chinese Acad Sci, Ctr Res Intelligent Percept & Comp, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China

Show more details

Related Keywords:

Source :

IEEE TRANSACTIONS ON IMAGE PROCESSING

ISSN: 1057-7149

Year: 2021

Volume: 30

Page: 8155-8169

1 0 . 6 0 0

JCR@2022

ESI Discipline: ENGINEERING;

ESI HC Threshold:87

JCR Journal Grade:1

Cited Count:

WoS CC Cited Count: 7

SCOPUS Cited Count: 8

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 9

Online/Total:1148/10544252
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.