• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Liang, Xi (Liang, Xi.) | Zhang, Jing (Zhang, Jing.) (Scholars:张菁) | Zhuo, Li (Zhuo, Li.) | Li, Yuzhao (Li, Yuzhao.) | Tian, Qi (Tian, Qi.)

Indexed by:

EI Scopus SCIE

Abstract:

Objects in unmanned aerial vehicle (UAV) images are generally small due to the high-photography altitude. Although many efforts have been made in object detection, how to accurately and quickly detect small objects is still one of the remaining open challenges. In this paper, we propose a feature fusion and scaling-based single shot detector (FS-SSD) for small object detection in the UAV images. The FS-SSD is an enhancement based on FSSD, a variety of the original single shot multibox detector (SSD). We add an extra scaling branch of the deconvolution module with an average pooling operation to form a feature pyramid. The original feature fusion branch is adjusted to be better suited to the small object detection task. The two feature pyramids generated by the deconvolution module and feature fusion module are utilized to make predictions together. In addition to the deep features learned by the FS-SSD, to further improve the detection accuracy, spatial context analysis is proposed to incorporate the object spatial relationships into object redetection. The interclass and intraclass distances between different object instances are computed as a spatial context, which proves effective for multiclass small object detection. Six experiments are conducted on the PASCAL VOC dataset and the two UAV image datasets. The experimental results demonstrate that the proposed method can achieve a comparable detection speed but an accuracy superior to those of the six state-of-the-art methods.

Keyword:

Unmanned aerial vehicle (UAV) image feature scaling Feature extraction Object detection Deconvolution single shot detector small object detection Detectors Unmanned aerial vehicles spatial context analysis Photography Remote sensing feature fusion

Author Community:

  • [ 1 ] [Liang, Xi]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
  • [ 2 ] [Zhang, Jing]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
  • [ 3 ] [Zhuo, Li]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
  • [ 4 ] [Li, Yuzhao]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
  • [ 5 ] [Zhuo, Li]Collaborat Innovat Ctr Elect Vehicles Beijing, Beijing 100124, Peoples R China
  • [ 6 ] [Tian, Qi]Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
  • [ 7 ] [Tian, Qi]Huawei Technol, Noahs Ark Lab, Shenzhen 518129, Peoples R China

Reprint Author's Address:

  • 张菁

    [Zhang, Jing]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China

Show more details

Related Keywords:

Source :

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

ISSN: 1051-8215

Year: 2020

Issue: 6

Volume: 30

Page: 1758-1770

8 . 4 0 0

JCR@2022

ESI Discipline: ENGINEERING;

ESI HC Threshold:115

Cited Count:

WoS CC Cited Count: 182

SCOPUS Cited Count: 234

ESI Highly Cited Papers on the List: 2 Unfold All

  • 2025-5
  • 2025-3

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 13

Online/Total:685/10537605
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.