Author:

Wang, L. | Fu, F. | Xu, K. | Xu, H. | Yin, B.

Indexed by:

Scopus

Abstract:

To address the relatively coarse granularity of predicate features extracted from relation bounding boxes, a region-sensitive scene graph generation (RS-SGG) method is proposed. The predicate feature extraction module divides the relationship bounding box into four regions and uses a self-attention mechanism to suppress background regions that are irrelevant to relationship classification. The relationship feature decoder combines the visual, semantic, and position features of object pairs to predict predicate relationships. RS-SGG was compared with several mainstream scene graph generation methods on the publicly available Visual Genome (VG) dataset. Graph-constraint recall and no-graph-constraint recall were computed on three subtasks (scene graph detection, scene graph classification, and predicate classification) to evaluate the performance of the SGG models. The results show that RS-SGG achieves better graph-constraint and no-graph-constraint recall than the mainstream methods. Visualization experiments further demonstrate the effectiveness of the proposed method. © 2025 Beijing University of Technology. All rights reserved.
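The abstract's two evaluation metrics differ only in how many predicates may be predicted per object pair. The sketch below illustrates the commonly used definitions, not the authors' evaluation code: with the graph constraint, only the top-scoring predicate for each subject-object pair is kept before taking the top K triplets; without it, every scored predicate competes. The function name `recall_at_k`, the triplet layout, and the toy data are assumptions made for illustration, and entities are matched by name here rather than by bounding-box overlap, as a real benchmark would require.

```python
def recall_at_k(gt_triplets, scored_triplets, k, graph_constraint=True):
    """Fraction of ground-truth triplets found among the top-k predictions.

    gt_triplets:     set of (subject, predicate, object) tuples.
    scored_triplets: list of (score, subject, predicate, object) tuples.
    Hypothetical helper: entities are matched by identifier only (assumption).
    """
    # Rank all predicted triplets from highest to lowest score.
    ranked = sorted(scored_triplets, key=lambda t: t[0], reverse=True)

    if graph_constraint:
        # Keep only the best-scoring predicate for each (subject, object) pair.
        seen_pairs = set()
        kept = []
        for score, subj, pred, obj in ranked:
            if (subj, obj) not in seen_pairs:
                seen_pairs.add((subj, obj))
                kept.append((score, subj, pred, obj))
        ranked = kept

    top_k = {(subj, pred, obj) for _, subj, pred, obj in ranked[:k]}
    if not gt_triplets:
        return 0.0
    return len(top_k & set(gt_triplets)) / len(gt_triplets)


# Toy example (made-up data): the pair (man, horse) has two valid predicates.
gt = {("man", "riding", "horse"), ("man", "on", "horse"), ("horse", "on", "grass")}
preds = [
    (0.9, "man", "on", "horse"),
    (0.8, "man", "riding", "horse"),
    (0.7, "horse", "on", "grass"),
    (0.1, "man", "near", "grass"),
]

with_constraint = recall_at_k(gt, preds, k=3, graph_constraint=True)
without_constraint = recall_at_k(gt, preds, k=3, graph_constraint=False)
print(with_constraint, without_constraint)
```

In this toy case the constrained score recovers two of the three ground-truth triplets, because "riding" is discarded in favour of "on" for the same pair, while the unconstrained score recovers all three. This is why no-graph-constraint recall is typically at least as high as graph-constraint recall at the same K.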

Keyword:

self-attention mechanism; region awareness; object classification; scene graph generation; relationship classification; image understanding

Author Community:

  • [ 1 ] [Wang L.] Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 2 ] [Wang L.] Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 3 ] [Fu F.] Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 4 ] [Fu F.] Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 5 ] [Xu K.] Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 6 ] [Xu K.] Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 7 ] [Xu H.] Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 8 ] [Xu H.] Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 9 ] [Yin B.] Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 10 ] [Yin B.] Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing University of Technology, Beijing, 100124, China

Source:

Journal of Beijing University of Technology

ISSN: 0254-0037

Year: 2025

Issue: 1

Volume: 51

Page: 51-58
