Scene Adaptive Context Modeling and Balanced Relation Prediction for Scene Graph Generation - Details

Author：

Xu, Kai (Xu, Kai.) | Wang, Lichun (Wang, Lichun.) (Scholars：王立春) | Li, Shuang (Li, Shuang.) | Gao, Tong (Gao, Tong.) | Yin, Baocai (Yin, Baocai.)

Indexed by：

SCIE

Abstract：

Scene　graph　generation　(SGG)　aims　to　perceive　objects　and　their　relations　in　images,　which　can　bridge　the　gap　between　upstream　detection　tasks　and　downstream　high-level　visual　understanding　tasks.　For　SGG　models,　over-fitting　head　predicates　can　lead　to　bias　in　the　generated　scene　graph,　which　has　become　a　consensus.　A　series　of　debiasing　methods　have　been　proposed　to　solve　the　problem.　However,　some　existing　debiasing　SGG　methods　have　a　tendency　to　over-fit　tail　predicates,　which　is　another　type　of　bias.　In　order　to　eliminate　the　one-way　over-fitting　of　head　or　tail　predicates,　this　article　proposes　a　balanced　relation　prediction　(BRP)　module　which　is　model-agnostic　and　compatible　with　existing　re-balancing　methods.　Moreover,　because　the　relation　prediction　is　based　on　object　feature　representation,　this　article　proposes　a　scene　adaptive　context　fusion　(SACF)　module　to　refine　the　object　feature　representation.　Specifically,　SACF　models　the　context　based　on　a　chain　structure,　where　the　order　of　objects　in　the　chain　structure　is　adaptively　arranged　according　to　the　scene　content,　achieving　visual　information　fusion　that　adapts　to　the　scene　where　the　objects　are　located.　Experiments　on　VG　and　GQA　datasets　show　that　the　proposed　method　achieves　competitive　results　on　the　comprehensive　metric　of　R@K　and　mR@K.

Keyword：

Deep Network Scene Graph Generation Balanced Relation Prediction Scene Adaptive Context Modeling

Author Community：

[ 1 ] [Xu, Kai]Beijing Univ Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
[ 2 ] [Wang, Lichun]Beijing Univ Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
[ 3 ] [Gao, Tong]Beijing Univ Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
[ 4 ] [Yin, Baocai]Beijing Univ Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
[ 5 ] [Li, Shuang]Beijing Informat Sci & Technol Univ, Sch Automat, Beijing, Peoples R China

Reprint Author's Address：

王立春
[Wang, Lichun]Beijing Univ Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China

Email：

xukai@emails.bjut.edu.cn |
wanglc@bjut.edu.cn |
shuangli@bistu.edu.cn |
gaotong@emails.bjut.edu.cn |
ybc@bjut.edu.cn

Show more details

Related Keywords：

A Balanced Relation Prediction Framework for Scene Graph Generation
2023，32nd International Conference on Artificial Neural Networks, ICANN 2023
Augmented Spatial Context Fusion Network for Scene Graph Generation
2023，
Scene Graph Generation Method Based on Dual-stream Multi-head Attention; [基于双分支多头注意力的场景图生成方法]
2024，Journal of Beijing University of Technology
Region-sensitive Scene Graph Generation Method; [区域敏感的场景图生成方法]
2025，Journal of Beijing University of Technology

Source ：

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS

ISSN： 1551-6857

Year： 2025

Issue： 3

Volume： 21

5 . 1 0 0

JCR@2022

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 4

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to