ADOSMNet: a novel visual affordance detection network with object shape mask guided feature encoders

Authors:

Chen, D. | Kong, D. | Li, J. | Wang, S. | Yin, B.

Indexed by：

EI Scopus SCIE

Abstract：

Visual affordance detection aims to understand the functional attributes of objects, which is crucial for robots performing interactive tasks. Most existing affordance detection methods rely mainly on global image features and do not fully exploit the features of the locally relevant objects in the image, which often leads to suboptimal detection accuracy when cluttered backgrounds and neighbouring objects interfere. Numerous studies have shown that the accuracy of affordance detection depends largely on the quality of the extracted image features. In this paper, we propose a novel affordance detection network with object shape mask guided feature encoders. The masks act as an attention mechanism that forces the network to focus on the shape regions of the target objects in the image, which helps it obtain high-quality features. Specifically, we first propose a shape mask guided encoder, which uses masks to locate all target objects effectively and so extract more expressive features. Based on this encoder, we then propose a dual enhance feature aggregation module, which consists of two branches. The first branch encodes the global features of the original image, while the second branch locates each locally relevant object and encodes its precise features. Aggregating these features enhances the feature representation of each object, further improving feature quality and suppressing interference. Quantitative and qualitative comparisons with state-of-the-art methods demonstrate that the proposed method achieves superior performance on two commonly used affordance detection datasets. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
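
The two-branch idea in the abstract can be illustrated with a toy sketch. This record does not say how the masks are applied or how the branches are combined, so the element-wise gating in `apply_shape_mask` and the summation in `aggregate` are assumptions made purely for illustration, and both function names are hypothetical. It is not the authors' implementation.

```python
from typing import List

Grid = List[List[float]]  # a tiny 2-D "feature map"


def apply_shape_mask(features: Grid, mask: Grid) -> Grid:
    """Gate features element-wise with a binary object shape mask.

    Assumption: the mask works as a multiplicative attention map, keeping
    values inside the object region and zeroing everything else.
    """
    return [
        [f * m for f, m in zip(f_row, m_row)]
        for f_row, m_row in zip(features, mask)
    ]


def aggregate(global_feat: Grid, local_feats: List[Grid]) -> Grid:
    """Combine the global branch with every masked local branch.

    Assumption: plain summation stands in for the aggregation step.
    """
    out = [row[:] for row in global_feat]  # copy so the input is not mutated
    for local in local_feats:
        for i, row in enumerate(local):
            for j, value in enumerate(row):
                out[i][j] += value
    return out


# Global branch: features of the whole image.
features = [[1.0, 2.0], [3.0, 4.0]]

# Local branch: one shape mask per object (two objects here).
masks = [
    [[1.0, 0.0], [0.0, 0.0]],
    [[0.0, 0.0], [0.0, 1.0]],
]

local = [apply_shape_mask(features, m) for m in masks]
fused = aggregate(features, local)
print(fused)  # [[2.0, 2.0], [3.0, 8.0]]
```

In this toy example the positions covered by an object mask are reinforced relative to the background, which is the intuition behind suppressing interference from clutter and neighbouring objects.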

Keyword：

Feature enhancement | Feature representation | Image segmentation | Visual affordance detection | Object shape mask

Author Affiliations:

[ 1 ] [Chen D.]Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Artificial Intelligence Institute, Faculty of Information Technology, Beijing University of Technology, 100 Pingleyuan, Chaoyang District, Beijing, 100124, China
[ 2 ] [Kong D.]Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Artificial Intelligence Institute, Faculty of Information Technology, Beijing University of Technology, 100 Pingleyuan, Chaoyang District, Beijing, 100124, China
[ 3 ] [Li J.]Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Artificial Intelligence Institute, Faculty of Information Technology, Beijing University of Technology, 100 Pingleyuan, Chaoyang District, Beijing, 100124, China
[ 4 ] [Wang S.]Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Artificial Intelligence Institute, Faculty of Information Technology, Beijing University of Technology, 100 Pingleyuan, Chaoyang District, Beijing, 100124, China
[ 5 ] [Yin B.]Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Artificial Intelligence Institute, Faculty of Information Technology, Beijing University of Technology, 100 Pingleyuan, Chaoyang District, Beijing, 100124, China

Source ：

Multimedia Tools and Applications

ISSN： 1380-7501

Year： 2023

Issue： 11

Volume： 83

Page： 31629-31653

JCR@2022: 3.600

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：19

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 3

ESI Highly Cited Papers on the List: 0
