Abstract:
Semantic segmentation that exploits the complementary information in RGB and depth images has recently gained great popularity, but because RGB images and depth maps differ in nature, how to use RGB-D information effectively remains an open problem. In this paper, we propose a novel RGB-D semantic segmentation network named RAFNet, which selectively gathers features from the RGB and depth information. Specifically, we construct an architecture with three parallel branches and propose several complementary attention modules. This structure enables a fusion branch, to which we add a Bi-directional Multi-step Propagation (BMP) strategy that not only retains the feature streams of the original RGB and depth branches but also fully utilizes the feature flow of the fusion branch. We construct three kinds of complementary attention modules: the RGB-D fusion module extracts important features from the RGB and depth branch streams, the refinement module reduces the loss of semantic information, and the context aggregation module helps propagate and integrate information. We train and evaluate our model on the NYUDv2 and SUN-RGBD datasets and show that it achieves state-of-the-art performance.
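The abstract gives no implementation details, so the following PyTorch sketch only illustrates the general idea it describes: three parallel branches (RGB, depth, fusion), with an attention-based RGB-D fusion module feeding the fusion branch and the previous fusion stage propagated forward. All class names, channel sizes, and the gating layout are assumptions for illustration, not the authors' code.

```python
import torch
import torch.nn as nn


class RGBDFusionBlock(nn.Module):
    """Hypothetical channel-attention fusion of RGB and depth features."""
    def __init__(self, channels):
        super().__init__()
        # Squeeze-and-excitation style gate over the concatenated streams.
        self.gate = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(2 * channels, channels, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, 2 * channels, kernel_size=1),
            nn.Sigmoid(),
        )
        self.reduce = nn.Conv2d(2 * channels, channels, kernel_size=1)

    def forward(self, rgb_feat, depth_feat):
        cat = torch.cat([rgb_feat, depth_feat], dim=1)
        weighted = cat * self.gate(cat)   # selectively weight both streams
        return self.reduce(weighted)      # fused feature for the third branch


class ThreeBranchStage(nn.Module):
    """One stage with parallel RGB, depth, and fusion branches (assumed layout)."""
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.rgb_conv = nn.Conv2d(in_ch, out_ch, 3, padding=1)
        self.depth_conv = nn.Conv2d(in_ch, out_ch, 3, padding=1)
        self.fuse = RGBDFusionBlock(out_ch)

    def forward(self, rgb, depth, fused_prev=None):
        rgb = torch.relu(self.rgb_conv(rgb))
        depth = torch.relu(self.depth_conv(depth))
        fused = self.fuse(rgb, depth)
        if fused_prev is not None:
            # Propagate the fusion-branch stream from the previous stage.
            fused = fused + nn.functional.interpolate(
                fused_prev, size=fused.shape[-2:],
                mode="bilinear", align_corners=False)
        return rgb, depth, fused


if __name__ == "__main__":
    rgb = torch.randn(1, 3, 64, 64)
    depth = torch.randn(1, 3, 64, 64)  # depth as 3 channels (e.g. HHA), an assumption
    stage = ThreeBranchStage(3, 32)
    r, d, f = stage(rgb, depth)
    print(r.shape, d.shape, f.shape)   # each: torch.Size([1, 32, 64, 64])
```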
Source: DISPLAYS
ISSN: 0141-9382
Year: 2021
Volume: 70
Impact Factor: 4.300 (JCR@2022)
ESI Discipline: COMPUTER SCIENCE
ESI HC Threshold: 87
JCR Journal Grade:2
Cited Count:
WoS CC Cited Count: 17
SCOPUS Cited Count: 21
ESI Highly Cited Papers on the List: 0