Road Scene Segmentation Based on Multi-scale Attention Mechanism - Details

Author：

Qiu, Y. (Qiu, Y..) | Wang, Z. (Wang, Z..) | Zhu, Q. (Zhu, Q..) | Ji, M. (Ji, M..)

Indexed by：

EI Scopus

Abstract：

Road　scene　segmentation　is　mainly　used　in　the　field　of　autonomous　driving　today.　However,　the　complexity　of　road　scenes　brings　great　challenges　and　difficulties　to　the　perception　and　understanding　of　the　vehicle　environment,　so　the　semantic　segmentation　of　complex　traffic　scenes　is　a　challenging　research　topic　in　the　field　of　computer　vision.　When　facing　complex　background　and　variable　scale　real　scenes,　the　accuracy　of　segmentation　method　based　on　full　convolution　neural　network　needs　to　be　further　improved.　In　response　to　this　problem,　this　research　proposed　a　model　based　on　a　multi-scale　attention　to　improve　the　performance　of　semantic　segmentation　algorithms　from　multiple　perspectives.　First,　we　design　a　multi-scale　attention　module,　which　can　combine　multi-scale　information　and　attention　to　obtain　semantic　correlation　between　spatial　dimensions　and　channel　dimensions　at　different　scales　in　encoder.　Secondly,　the　number　of　fusions　of　low-level　features　and　high-level　features　is　increased　to　alleviate　the　problem　of　low-level　information　loss　caused　by　multiple　downsampling　in　the　decoder.　Finally,　a　better　feature　fusion　is　achieved　by　adding　an　adaptive　feature　fusion　module　after　concatenating　the　decoder　feature　maps.　The　experimental　results　show　that　on　the　Cityscapes,　compared　with　the　baseline　DeepLabV3+,　the　MIoU　of　the　model　is　improved　by　1.3%　on　the　validation　set　and　1.2%　on　the　test　set.　　©　2022　IEEE.

Keyword：

Road Scene Segmentation Multi-scale Information Attention Mechanism

Author Community：

[ 1 ] [Qiu Y.]Beijing University of Technology, Faculty of Information Technology, Beijing, China
[ 2 ] [Wang Z.]Beijing University of Technology, Faculty of Information Technology, Beijing, China
[ 3 ] [Zhu Q.]Beijing University of Technology, Faculty of Information Technology, Beijing, China
[ 4 ] [Ji M.]Beijing University of Technology, Faculty of Information Technology, Beijing, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Attention-Bridged Modal Interaction for Text-to-Image Generation
2024，IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY
Transformer-Based Explainable Model for Breast Cancer Lesion Segmentation
2025，APPLIED SCIENCES-BASEL
SQI-DOANet: electroencephalogram-based deep neural network for estimating signal quality index and depth of anaesthesia
2024，JOURNAL OF NEURAL ENGINEERING
Multi-Scale Information Collaborative Management Method in Hierarchical Construction Projects Based on Building Information Modeling
2024，APPLIED SCIENCES-BASEL

Source ：

Year： 2022

Page： 527-532

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 1

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 3

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search SCOPUS

Type
Departments

All Years Choose Year From to