
Author:

Xu, Kai | Wang, Lichun (王立春) | Li, Shuang | Gao, Tong | Yin, Baocai

Indexed by:

SCIE

Abstract:

Scene graph generation (SGG) aims to perceive the objects in an image and the relations between them, bridging the gap between upstream detection tasks and downstream high-level visual understanding tasks. It is widely accepted that SGG models that over-fit head predicates produce biased scene graphs, and a series of debiasing methods have been proposed to address the problem. However, some existing debiasing SGG methods tend to over-fit tail predicates instead, which is another type of bias. To eliminate one-way over-fitting of either head or tail predicates, this article proposes a balanced relation prediction (BRP) module that is model-agnostic and compatible with existing re-balancing methods. Moreover, because relation prediction depends on the object feature representation, this article proposes a scene adaptive context fusion (SACF) module to refine that representation. Specifically, SACF models context with a chain structure in which the order of objects is arranged adaptively according to the scene content, so that visual information fusion adapts to the scene in which the objects are located. Experiments on the VG and GQA datasets show that the proposed method achieves competitive results when R@K and mR@K are considered together.
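
The two metrics named in the abstract can pull in opposite directions, which is why head and tail over-fitting both count as bias. The following is a minimal sketch of that relationship, not the article's evaluation protocol. It assumes the common definitions: R@K is the share of ground-truth triplets recovered among the top K predictions, and mR@K averages that recall over predicate classes. The function names, the toy triplets, and the pooled counting across images are illustrative assumptions.

```python
from collections import defaultdict


def recall_at_k(gt, pred, k):
    """Pooled R@K.

    gt:   list (one entry per image) of sets of (subject, predicate, object).
    pred: list (one entry per image) of ranked triplet lists, best first.
    """
    hit = total = 0
    for g, p in zip(gt, pred):
        top = set(p[:k])
        hit += len(g & top)
        total += len(g)
    return hit / total if total else 0.0


def mean_recall_at_k(gt, pred, k):
    """mR@K: recall computed per predicate class, then averaged over classes."""
    hit = defaultdict(int)
    total = defaultdict(int)
    for g, p in zip(gt, pred):
        top = set(p[:k])
        for triplet in g:
            predicate = triplet[1]
            total[predicate] += 1
            hit[predicate] += triplet in top
    if not total:
        return 0.0
    return sum(hit[c] / total[c] for c in total) / len(total)


# Hypothetical toy image: "on" plays the head predicate, "riding" the tail.
gt = [{
    ("man", "on", "horse"),
    ("horse", "on", "grass"),
    ("man", "wearing", "hat"),
    ("man", "riding", "horse"),
}]
# A head-biased ranking that never proposes "riding".
pred = [[
    ("man", "on", "horse"),
    ("horse", "on", "grass"),
    ("man", "wearing", "hat"),
    ("man", "on", "hat"),
]]

r = recall_at_k(gt, pred, k=3)
mr = mean_recall_at_k(gt, pred, k=3)
```

In this toy case the head-biased ranking recovers three of the four triplets, so R@3 is 0.75, while mR@3 drops to 2/3 because the "riding" class is missed entirely. A model that over-fits tail predicates would show the reverse pattern, so a balanced method has to be judged on both numbers.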

Keyword:

Deep Network; Scene Graph Generation; Balanced Relation Prediction; Scene Adaptive Context Modeling

Author Community:

  • [ 1 ] [Xu, Kai]Beijing Univ Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
  • [ 2 ] [Wang, Lichun]Beijing Univ Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
  • [ 3 ] [Gao, Tong]Beijing Univ Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
  • [ 4 ] [Yin, Baocai]Beijing Univ Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
  • [ 5 ] [Li, Shuang]Beijing Informat Sci & Technol Univ, Sch Automat, Beijing, Peoples R China

Reprint Author's Address:

  • 王立春

    [Wang, Lichun]Beijing Univ Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China

Source:

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS

ISSN: 1551-6857

Year: 2025

Issue: 3

Volume: 21

JCR@2022: 5.100
