MPLA-Net: Multiple Pseudo Label Aggregation Network for Weakly Supervised Video Salient Object Detection - Details

Author：

Ma, C. (Ma, C..) | Du, L. (Du, L..) | Zhuo, L. (Zhuo, L..) | Li, J. (Li, J..)

Indexed by：

EI Scopus SCIE

Abstract：

Weakly　Supervised　Video　Salient　Object　Detection　(WSVSOD)　only　requires　coarse-grained　manual　annotations,　which　can　achieve　a　good　trade-off　between　labeling　efficiency　and　detection　performance.　In　this　paper,　a　Multiple　Pseudo　Label　Aggregation　Network　(MPLA-Net)　is　proposed　for　WSVSOD.　Firstly,　the　video　frames　that　can　obtain　high-quality　pseudo　labels　are　selected　to　generate　multiple　pseudo　labels,　so　as　to　avoid　the　prejudice　of　the　single　label.　Moreover,　the　pseudo　label　with　fine　edge　information　is　used　to　generate　the　Edge　Information　Map　(EIM).　Secondly,　MPLA-Net　is　designed　to　adequately　excavate　and　utilize　the　comprehensive　saliency　cues　in　multiple　pseudo　labels　to　improve　the　detection　accuracy,　in　which　ResNet-50　is　adopted　as　the　backbone　network.　Edge　loss,　pseudo　label　loss,　self-supervised　loss　and　fusion　loss　are　exploited　to　jointly　supervise　and　optimize　the　network　training　to　obtain　a　robust　detection　model.　Experimental　results　on　five　benchmark　datasets　demonstrate　that,　compared　with　existing　weakly　supervised　methods,　the　proposed　method　can　achieve　state-of-the-art　detection　accuracy　with　less　model　parameters　and　higher　detection　speed.　And　the　detected　salient　objects　have　fine　boundaries.　IEEE

Keyword：

Training multiple pseudo label aggregation Feature extraction video frame quality evaluation Weakly supervised video salient object detection Image edge detection Task analysis Object detection pseudo label consistency evaluation Annotations Optical flow

Author Community：

[ 1 ] [Ma C.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, China
[ 2 ] [Du L.]School of Computer Science and Technology, Shandong Jianzhu University, Jinan, China
[ 3 ] [Zhuo L.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, China
[ 4 ] [Li J.]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Arbitrarily Oriented Object Detection in Remote Sensing Images Based on Improved YOLOv4-CSP
2022，IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING
Joint Multisource Saliency and Exemplar Mechanism for Weakly Supervised Video Object Segmentation
2021，IEEE TRANSACTIONS ON IMAGE PROCESSING
Enhancing Infrared Small Target Detection: A Saliency-Guided Multi-Task Learning Approach
2025，IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS
CRE-TSCAE: A Novel Classification Model Based on Stacked Convolutional Autoencoder for Dual-target RSVP-BCI Tasks
2024，IEEE Transactions on Biomedical Engineering

Source ：

IEEE Transactions on Circuits and Systems for Video Technology

ISSN： 1051-8215

Year： 2023

Issue： 5

Volume： 34

Page： 1-1

8 . 4 0 0

JCR@2022

ESI Discipline： ENGINEERING;

ESI HC Threshold：19

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 6

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 9

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search SCOPUS

Type
Departments

All Years Choose Year From to