OASNet: Object Affordance State Recognition Network with Joint Visual Features and Relational Semantic Embeddings - Details

Author：

Indexed by：

EI Scopus SCIE

Abstract：

Traditional　affordance　learning　tasks　aim　to　understand　object’s　interactive　functions　in　an　image,　such　as　affordance　recognition　and　affordance　detection.　However,　these　tasks　cannot　determine　whether　the　object　is　currently　interacting,　which　is　crucial　for　many　follow-up　tasks,　including　robotic　manipulation　and　planning　task.　To　fill　this　gap,　this　paper　proposes　a　novel　object　affrodance　state　(OAS)　recognition　task,　i.e.,　simultaneously　recognizing　an　object’s　affordances　and　the　partner　objects　that　are　interacting　with　it.　Accordingly,　to　facilitate　the　application　of　deep　learning　technology,　an　OAS　recognition　task　related　dataset　OAS10k　is　constructed　by　collecting　and　labeling　over　10k　images.　In　the　dataset,　a　sample　is　defined　as　a　set　of　an　image　and　its　OAS　labels,　each　label　is　represented　as　〈subject,　subject’s　affrodance,　interacted　object〉.　These　triplet　labels　have　rich　relational　semantic　information,　which　can　improve　OAS　recognition　performance.　We　hence　construct　a　directed　OAS　knowledge　graph　of　affordance　states,　and　extract　an　OAS　matrix　from　it　for　modelling　the　semantic　relationships　of　the　triplets.　Based　on　the　matrix,　we　propose　an　OAS　recognition　network　(OASNet),　which　utilizes　GCN　to　capture　the　relational　semantic　embeddings,　and　uses　a　transformer　to　fuse　them　with　the　visual　features　from　an　image　to　recognize　the　affordance　states　of　objects　in　the　image.　Experimental　results　on　OAS10k　dataset　and　other　triplet　label　recognition　datasets　demonstrate　that　the　proposed　OASNet　achieves　the　best　performance　compared　to　the　state-of-the-art　methods.　The　dataset　and　codes　will　be　released　on　https://github.com/mxmdpc/OAS.　IEEE

Keyword：

Semantics Affordances relational semantic embeddings Task analysis multi-label image classification Image recognition transformer Robots Object affordance state recognition Visualization Transformers graph convolution network

Author Community：

[ 1 ] [Chen D.]Beijing Artificial Intelligence Institute, Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing University of Technology, Beijing, China
[ 2 ] [Kong D.]Beijing Artificial Intelligence Institute, Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing University of Technology, Beijing, China
[ 3 ] [Li J.]Beijing Artificial Intelligence Institute, Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing University of Technology, Beijing, China
[ 4 ] [Wang L.]Beijing Artificial Intelligence Institute, Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing University of Technology, Beijing, China
[ 5 ] [Gao J.]Beijing Artificial Intelligence Institute, Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing University of Technology, Beijing, China
[ 6 ] [Yin B.]Beijing Artificial Intelligence Institute, Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing University of Technology, Beijing, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Domain-aware Prototype Network for Generalized Zero-Shot Learning
2023，IEEE Transactions on Circuits and Systems for Video Technology
TransIFC: Invariant Cues-aware Feature Concentration Learning for Efficient Fine-grained Bird Image Classification
2023，IEEE Transactions on Multimedia
DHHG-TAC: Fusion of Dynamic Heterogeneous Hypergraphs and Transformer Attention Mechanism for Visual Question Answering Tasks
2024，IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS
RPLNet: Object-Object Affordance Recognition via Relational Phrase Learning
2023，5th International Conference on Industrial Artificial Intelligence, IAI 2023

Source ：

IEEE Transactions on Circuits and Systems for Video Technology

ISSN： 1051-8215

Year： 2023

Issue： 5

Volume： 34

Page： 1-1

8 . 4 0 0

JCR@2022

ESI Discipline： ENGINEERING;

ESI HC Threshold：19

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 1

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 11

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search SCOPUS

Type
Departments

All Years Choose Year From to