Author:

Wang, Guangbiao | Zhao, Hongbo | Chang, Qing | Lyu, Shuchang | Cheng, Guangliang | Chen, Huojin

Indexed by:

EI Scopus SCIE

Abstract:

Remote sensing scene zero-shot object detection (ZSD) aims to detect and recognize both seen and unseen categories of landscape elements with the guidance of the word embeddings. In this task, two primary challenges are identified. First, there exists considerable variability within categories of landscape elements, causing a misalignment between visual features and word embeddings, particularly noticeable for unseen categories. Second, the existing detection models struggle to provide accurate localization predictions, greatly impacting overall performance. To address these two issues, we propose word embedding alignment-DINO (WEA-DINO). Based on the original DINO structure, our WEA-DINO-Head is specifically designed to align the hidden features of "matching queries" with word embedding features, effectively addressing the misalignment issue between visual features and word embeddings. Furthermore, aligning the hidden features of "denoising queries" with word embedding features enables the translation of localization capabilities from known categories to previously unseen ones. Through extensive experimentation on the DIOR benchmark dataset, our method demonstrates state-of-the-art (SOTA) performance. The code is available at https://github.com/cv516Buaa/WEA-DINO.
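
As an illustration of the general idea of matching visual features against word embeddings, here is a minimal sketch. It is not the authors' WEA-DINO implementation (see the repository linked above for that). It assumes a query's hidden feature is mapped by a linear projection into the word-embedding space and then scored by cosine similarity against each class embedding. The class names, the seen/unseen split, the vectors, and the projection matrix are all hypothetical toy values.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def project(feature, weight):
    """Apply a linear map (one row per output dimension) to a feature vector."""
    return [sum(w * x for w, x in zip(row, feature)) for row in weight]


def classify(feature, weight, word_embeddings):
    """Score a projected feature against every class embedding; return best label and all scores."""
    projected = project(feature, weight)
    scores = {name: cosine(projected, emb) for name, emb in word_embeddings.items()}
    return max(scores, key=scores.get), scores


# Hypothetical 3-dimensional "word embeddings"; real ones would come from a language model.
WORD_EMBEDDINGS = {
    "airplane": [1.0, 0.0, 0.0],  # assumed seen during training
    "ship": [0.0, 1.0, 0.0],      # assumed seen during training
    "bridge": [0.0, 0.0, 1.0],    # assumed unseen: only its embedding is available
}

# Hypothetical 3x4 projection from a 4-dimensional hidden feature to embedding space.
PROJECTION = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
]

query_feature = [0.1, 0.2, 0.9, 0.5]  # toy hidden feature of one query
label, scores = classify(query_feature, PROJECTION, WORD_EMBEDDINGS)
print(label, scores)
```

Because the score depends only on the class embedding, an unseen category can be predicted without any training boxes for it, provided the projected features are well aligned with the embedding space. That alignment is the issue the abstract describes.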

Keyword:

remote sensing | Feature alignment | zero-shot object detection (ZSD) | word embedding guidance

Author Community:

  • [ 1 ] [Wang, Guangbiao]Beihang Univ, Dept Elect & Informat Engn, Beijing 100191, Peoples R China
  • [ 2 ] [Zhao, Hongbo]Beihang Univ, Dept Elect & Informat Engn, Beijing 100191, Peoples R China
  • [ 3 ] [Chang, Qing]Beihang Univ, Dept Elect & Informat Engn, Beijing 100191, Peoples R China
  • [ 4 ] [Lyu, Shuchang]Beihang Univ, Dept Elect & Informat Engn, Beijing 100191, Peoples R China
  • [ 5 ] [Cheng, Guangliang]Univ Liverpool, Dept Comp Sci, Liverpool L69 3BX, England
  • [ 6 ] [Chen, Huojin]Beijing Univ Technol, Coll Art & Design, Beijing 100124, Peoples R China

Reprint Author's Address:

  • [Zhao, Hongbo]Beihang Univ, Dept Elect & Informat Engn, Beijing 100191, Peoples R China
  • [Chen, Huojin]Beijing Univ Technol, Coll Art & Design, Beijing 100124, Peoples R China

Source:

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS

ISSN: 1545-598X

Year: 2024

Volume: 21

Impact Factor (JCR@2022): 4.800

Cited Count:

WoS CC Cited Count: 1

SCOPUS Cited Count: 1

ESI Highly Cited Papers on the List: 0
