Author:

Ren, Keyan | Yan, Tong | Hu, Zhaoxin | Han, Honggui (韩红桂) | Zhang, Yunlu

Indexed by:

EI Scopus SCIE

Abstract:

Point clouds and RGB images are both critical data for 3D object detection. Recent multi-modal methods combine them directly and perform remarkably well, but they ignore the distinct forms of these two types of data. To mitigate the influence of this intrinsic difference on performance, we propose a novel and effective fusion model, the LI-Attention model, which takes both RGB features and point cloud features into consideration and assigns a weight to each RGB feature through an attention mechanism. Building on the LI-Attention model, we further propose a 3D object detection method called the image attention transformer network (IAT-Net), specialized for indoor RGB-D scenes. Compared with previous work on multi-modal detection, IAT-Net fuses elaborate RGB features from 2D detection results with point cloud features through an attention mechanism, and generates and refines 3D detection results with a transformer model. Extensive experiments demonstrate that our approach outperforms state-of-the-art methods on two widely used benchmarks for indoor 3D object detection, SUN RGB-D and NYU Depth V2, and ablation studies analyze the effect of each module. The source code for the proposed IAT-Net is publicly available at https://github.com/wisper181/IAT-Net.
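
The weighting idea in the abstract, where each RGB feature receives a weight from an attention mechanism that also looks at the point cloud features, can be illustrated with generic scaled dot-product attention. The sketch below is only an illustration under stated assumptions and is not the authors' LI-Attention model: the function names `softmax` and `attention_fuse`, the feature shapes, the absence of learned projections, and the concatenation step are all hypothetical choices made for this example. The authors' actual implementation is in the repository linked above.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row maximum for numerical stability before exponentiating.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention_fuse(point_feats, rgb_feats):
    """Hypothetical fusion: weight RGB features per point, then concatenate.

    point_feats: (N, d) array, one feature vector per point (used as queries).
    rgb_feats:   (M, d) array, one feature vector per RGB region (keys/values).
    Returns (fused, weights) with shapes (N, 2d) and (N, M).
    """
    d = point_feats.shape[1]
    # Similarity between every point feature and every RGB feature.
    scores = point_feats @ rgb_feats.T / np.sqrt(d)
    # Each row holds the weights assigned to the M RGB features; rows sum to 1.
    weights = softmax(scores, axis=1)
    # Weighted sum of RGB features for each point.
    attended_rgb = weights @ rgb_feats
    # Assumed fusion step: keep the point feature and append the attended RGB feature.
    fused = np.concatenate([point_feats, attended_rgb], axis=1)
    return fused, weights

# Toy data (random, for shape checking only; not from any benchmark).
rng = np.random.default_rng(0)
point_feats = rng.normal(size=(4, 8))
rgb_feats = rng.normal(size=(3, 8))
fused, weights = attention_fuse(point_feats, rgb_feats)
print(fused.shape, weights.shape)
```

In this sketch the point features act as queries, so the weight on each RGB feature depends on both modalities rather than on the image alone. A trained model would normally add learned query, key, and value projections, which are omitted here.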

Keyword:

attention mechanism; transformer; 3D object detection

Author Community:

  • [ 1 ] [Ren, Keyan]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Yan, Tong]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Hu, Zhaoxin]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 4 ] [Han, Honggui]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 5 ] [Zhang, Yunlu]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

Reprint Author's Address:

  • [Yan, Tong]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [Han, Honggui]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

Source:

SCIENCE CHINA-TECHNOLOGICAL SCIENCES

ISSN: 1674-7321

Year: 2024

Issue: 7

Volume: 67

Page: 2176-2190

4.600 (JCR@2022)

ESI Highly Cited Papers on the List: 0
