Abstract:
Geometric and semantic contexts are essential to solving the ill-posed problem of monocular depth estimation (MDE). In this paper, we propose a deep MDE framework that aggregates dual-modal structural contexts for monocular depth estimation (DSC-MDE). First, a cross-shaped context (CSC) aggregation module is developed to globally encode the geometric structures in depth maps observed within the fields of view of robots and autonomous vehicles. Next, the CSC-encoded geometric features are further modulated with semantic context in an object-regional context (ORC) aggregation module. Finally, to train the proposed network, we present a focal ordinal loss (FOL) that pays more attention to distant samples, avoiding the over-relaxed constraints that the ordinal regression loss (ORL) places on such samples. We compare the proposed model to recent methods with geometric and multi-modal contexts, and show that it achieves state-of-the-art performance on both indoor and outdoor datasets, including NYU-Depth-V2, Cityscapes, and KITTI. (c) 2023 Elsevier B.V. All rights reserved.
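For intuition on the focal ordinal loss described in the abstract, below is a minimal PyTorch sketch of a focal weighting applied to a DORN-style ordinal regression loss. It assumes depth is discretized into K ordinal bins and the network predicts, per pixel, P(depth > threshold_k); the function name focal_ordinal_loss, the (1 - p)^gamma modulation, and the default gamma are illustrative assumptions, not the paper's exact formulation.

```python
# Sketch of a focal ordinal loss (FOL) for monocular depth estimation.
# Assumption: DORN-style ordinal targets; the focal term down-weights
# easy thresholds so hard (typically distant) samples get more gradient.
import torch


def focal_ordinal_loss(probs: torch.Tensor,
                       gt_bins: torch.Tensor,
                       gamma: float = 2.0,
                       eps: float = 1e-6) -> torch.Tensor:
    """probs:   (B, K, H, W) per-pixel probabilities P(depth > threshold_k).
    gt_bins: (B, H, W) ground-truth ordinal bin index in [0, K)."""
    B, K, H, W = probs.shape
    k_range = torch.arange(K, device=probs.device).view(1, K, 1, 1)
    # Ordinal targets: 1 where the true depth exceeds threshold_k, else 0.
    targets = (gt_bins.unsqueeze(1) > k_range).float()
    probs = probs.clamp(eps, 1.0 - eps)
    # Standard ordinal regression cross-entropy, per threshold and pixel.
    ce = -(targets * probs.log() + (1.0 - targets) * (1.0 - probs).log())
    # Focal modulation: probability assigned to the correct side of each
    # threshold; well-classified thresholds are down-weighted.
    p_correct = targets * probs + (1.0 - targets) * (1.0 - probs)
    loss = ((1.0 - p_correct) ** gamma * ce).sum(dim=1)  # sum over K bins
    return loss.mean()


if __name__ == "__main__":
    probs = torch.rand(2, 64, 8, 8)        # dummy per-bin predictions
    gt = torch.randint(0, 64, (2, 8, 8))   # dummy ground-truth bin indices
    print(focal_ordinal_loss(probs, gt))
```

Setting gamma = 0 recovers the plain ordinal regression loss; increasing gamma shifts emphasis toward thresholds the network still gets wrong, which in practice concentrates training signal on distant, sparsely supervised depth samples.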
Source: KNOWLEDGE-BASED SYSTEMS
ISSN: 0950-7051
Year: 2023
Volume: 263
Impact Factor: 8.800 (JCR@2022)
ESI Discipline: COMPUTER SCIENCE
ESI HC Threshold: 19
Cited Count:
WoS CC Cited Count: 3
SCOPUS Cited Count: 4
ESI Highly Cited Papers on the List: 0