Semi-supervised Video Object Segmentation Via an Edge Attention Gated Graph Convolutional Network - Details

Author：

Zhang, Y. (Zhang, Y..) | Wang, S. (Wang, S..) | Liang, Y. (Liang, Y..) | Yin, B. (Yin, B..)

Indexed by：

EI Scopus SCIE

Abstract：

Video　object　segmentation　(VOS)　exhibits　heavy　occlusions,　large　deformation,　and　severe　motion　blur.　While　many　remarkable　convolutional　neural　networks　are　devoted　to　the　VOS　task,　they　often　mis-identify　background　noise　as　the　target　or　output　coarse　object　boundaries,　due　to　the　failure　of　mining　detail　information　and　high-order　correlations　of　pixels　within　the　whole　video.　In　this　work,　we　propose　an　edge　attention　gated　graph　convolutional　network　(GCN)　for　VOS.　The　seed　point　initialization　and　graph　construction　stages　construct　a　spatio-temporal　graph　of　the　video　by　exploring　the　spatial　intra-frame　correlation　and　the　temporal　inter-frame　correlation　of　superpixels.　The　node　classification　stage　identifies　foreground　superpixels　by　using　an　edge　attention　gated　GCN　which　mines　higher-order　correlations　between　superpixels　and　propagates　features　among　different　nodes.　The　segmentation　optimization　stage　optimizes　the　classification　of　foreground　superpixels　and　reduces　segmentation　errors　by　using　a　global　appearance　model　which　captures　the　long-term　stable　feature　of　objects.　In　summary,　the　key　contribution　of　our　framework　is　twofold:　(a)　the　spatio-temporal　graph　representation　can　propagate　the　seed　points　of　the　first　frame　to　subsequent　frames　and　facilitate　our　framework　for　the　semi-supervised　VOS　task;　and　(b)　the　edge　attention　gated　GCN　can　learn　the　importance　of　each　node　with　respect　to　both　the　neighboring　nodes　and　the　whole　task　with　a　small　number　of　layers.　Experiments　on　Davis　2016　and　Davis　2017　datasets　show　that　our　framework　achieves　the　excellent　performance　with　only　small　training　samples　(45　video　sequences).　©　2023　Copyright　held　by　the　owner/author(s).　Publication　rights　licensed　to　ACM.

Keyword：

superpixel graph convolutional network spatio-temporal graph model semi-supervised video object segmentation

Author Community：

[ 1 ] [Zhang Y.]Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, No. 100 Pingleyuan, Chaoyang District, Beijing, 100124, China
[ 2 ] [Zhang Y.]Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, No. 100 Pingleyuan, Chaoyang District, Beijing, 100124, China
[ 3 ] [Wang S.]Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, No. 100 Pingleyuan, Chaoyang District, Beijing, 100124, China
[ 4 ] [Liang Y.]Guangzhou Key Laboratory of Intelligent Agriculture, College of Mathematics and Informatics, South China Agricultural University, Guangzhou, China
[ 5 ] [Yin B.]Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, No. 100 Pingleyuan, Chaoyang District, Beijing, 100124, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Oil Spill Detection in Quad-Polarimetric SAR Images Using an Advanced Convolutional Neural Network Based on SuperPixel Model
2020，REMOTE SENSING
Semi-Supervised Video Object Segmentation Based on Foreground Perception Visual Attention
2022，Acta Electronica Sinica
BIT-WOW at NLPCC-2022 Task5 Track1: Hierarchical Multi-label Classification via Label-Aware Graph Convolutional Network
2022，NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II
RWGCN: Random walk graph convolutional network for group activity recognition
2025，APPLIED INTELLIGENCE

Source ：

ACM Transactions on Multimedia Computing, Communications and Applications

ISSN： 1551-6857

Year： 2023

Issue： 1

Volume： 20

5 . 1 0 0

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：19

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 1

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 5

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search SCOPUS

Type
Departments

All Years Choose Year From to