Abstract:
Complementarily fusing RGB and depth images while effectively suppressing task-irrelevant noise is crucial for accurate indoor RGB-D semantic segmentation. In this paper, we propose a novel deep model that leverages dual-modal non-local context to guide the aggregation of complementary features and the suppression of noise at multiple stages. Specifically, we introduce a dual-modal non-local context encoding (DNCE) module to learn global representations for each modality at each stage, which are then utilized to facilitate cross-modal complementary clue aggregation (CCA). Subsequently, the enhanced features from both modalities are merged. Additionally, we propose a semantic-guided feature rectification (SGFR) module that exploits the rich semantic clues in the top-level merged features to suppress noise in the lower-stage merged features. Both the DNCE-CCA and SGFR modules provide dual-modal global views that are essential for effective RGB-D fusion. Experimental results on two public indoor datasets, NYU Depth V2 and SUN RGB-D, demonstrate that our method outperforms state-of-the-art models of similar complexity.
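The record does not include implementation details, but the DNCE-CCA idea described above can be illustrated with a minimal sketch: each modality's features are summarized into a global context vector, which then gates the other modality's features before the two enhanced streams are merged. All names here (DNCE_CCA, ctx_rgb, ctx_depth, merge) are hypothetical, and global average pooling stands in for the paper's non-local encoding; this is not the authors' implementation.

```python
import torch
import torch.nn as nn

class DNCE_CCA(nn.Module):
    """Hypothetical sketch of dual-modal global context + cross-modal aggregation."""

    def __init__(self, channels):
        super().__init__()
        # Per-modality global context encoders: pooling + 1x1 conv + sigmoid
        # (a simplification standing in for the non-local encoding).
        self.ctx_rgb = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels, kernel_size=1),
            nn.Sigmoid(),
        )
        self.ctx_depth = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels, kernel_size=1),
            nn.Sigmoid(),
        )
        self.merge = nn.Conv2d(2 * channels, channels, kernel_size=1)

    def forward(self, f_rgb, f_depth):
        # Cross-modal aggregation: each stream is re-weighted by the
        # other modality's global context, with a residual connection,
        # before the two enhanced streams are merged.
        g_rgb = self.ctx_rgb(f_rgb)        # global view of the RGB stream
        g_depth = self.ctx_depth(f_depth)  # global view of the depth stream
        f_rgb_enh = f_rgb * g_depth + f_rgb
        f_depth_enh = f_depth * g_rgb + f_depth
        return self.merge(torch.cat([f_rgb_enh, f_depth_enh], dim=1))

# Usage on one stage's feature maps (shapes are illustrative).
fused = DNCE_CCA(256)(torch.randn(1, 256, 30, 40), torch.randn(1, 256, 30, 40))
```

A similar gating scheme, driven by the top-level merged features instead of the opposite modality, would correspond to the SGFR module's rectification of lower-stage features.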
Source:
EXPERT SYSTEMS WITH APPLICATIONS
ISSN: 0957-4174
Year: 2024
Volume: 255
Impact Factor: 8.500 (JCR 2022)
Cited Count:
WoS CC Cited Count: 5
SCOPUS Cited Count: 5
ESI Highly Cited Papers on the List: 0