Author:

Xu, Keyu | Song, Chengtian | Xie, Yue | Pan, Lizhi | Gan, Xiaozheng | Huang, Gao

Indexed by:

EI Scopus SCIE

Abstract:

Unmanned aerial vehicles (UAVs) and infrared imaging technology have numerous applications in civilian fields. Target detection in UAV remote sensing infrared images suffers from low accuracy because of complex ground backgrounds, small target sizes, and limited target features. To address this, we combine the YOLOv9s model with the recent retentive networks meet vision transformers (RMT) model and propose the RMT-YOLOv9s model for infrared small target detection. First, a convolutional neural network (CNN)-RMT-based backbone is proposed by incorporating the RMT model into the backbone network of YOLOv9s, so that it extracts both local and global features for small target detection. Then, an improved multiscale feature-fusion neck network, RMTELAN-PANet, is designed using the novel convolutional RMTELAN module proposed in this letter, which can better capture and use semantic information from feature maps. Finally, an efficient multiscale attention (EMA) module and a Dysample upsampling module are integrated into RMTELAN-PANet to further enhance the feature information of small targets. Experiments on the HIT-UAV dataset show that RMT-YOLOv9s outperforms other popular methods in infrared small target detection.

Keyword:

Semantics; unmanned aerial vehicle (UAV) infrared target detection; Head; Dysample; Feature extraction; Object detection; Accuracy; Computer vision; retentive networks meet vision transformer (RMT); transformer; YOLOv9; Neck; efficient multiscale attention (EMA); Remote sensing; Vehicle dynamics; Transformers

Author Affiliations:

  • [ 1 ] [Xu, Keyu]Beijing Inst Technol, Sch Elect & Mech, Beijing 100081, Peoples R China
  • [ 2 ] [Song, Chengtian]Beijing Inst Technol, Sch Elect & Mech, Beijing 100081, Peoples R China
  • [ 3 ] [Pan, Lizhi]Beijing Inst Technol, Sch Elect & Mech, Beijing 100081, Peoples R China
  • [ 4 ] [Gan, Xiaozheng]Beijing Inst Technol, Sch Elect & Mech, Beijing 100081, Peoples R China
  • [ 5 ] [Song, Chengtian]Sci & Technol Electromech Dynam Control Lab, Xian 710065, Peoples R China
  • [ 6 ] [Xie, Yue]Sci & Technol Electromech Dynam Control Lab, Xian 710065, Peoples R China
  • [ 7 ] [Huang, Gao]Sci & Technol Electromech Dynam Control Lab, Xian 710065, Peoples R China
  • [ 8 ] [Huang, Gao]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China

Reprint Author's Address:

  • [Song, Chengtian]Beijing Inst Technol, Sch Elect & Mech, Beijing 100081, Peoples R China
  • [Huang, Gao]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China

Source:

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS

ISSN: 1545-598X

Year: 2024

Volume: 21

Impact Factor: 4.800 (JCR@2022)

Cited Count:

WoS CC Cited Count: 3

SCOPUS Cited Count: 5

ESI Highly Cited Papers on the List: 0
