
Author:

Xu, Hongbo | Wang, Lichun | Xu, Kai | Fu, Fangyu | Yin, Baocai | Huang, Qingming

Indexed by:

EI Scopus SCIE

Abstract:

Current mainstream studies on Scene Graph Generation (SGG) focus on the long-tailed predicate distribution problem in order to generate unbiased scene graphs. The long-tailed predicate distribution exists in the VG dataset and becomes more severe during SGG network training. Most existing de-biasing methods address the problem by applying re-sampling or re-weighting within a mini-batch, the main idea being to pay unbiased attention to different predicate categories based on prior predicate distributions. During the training of SGG models, the existing training mode samples several images into a mini-batch to obtain training data, and thus provides sparse and scattered predicate instances for training. However, sampling predicate instances from a set of predicate samples that is limited in both quantity and category makes it difficult to train unbiased SGG models. To provide a wider range for sampling predicate instances, this paper reorganizes the images in the VG training set into a new form, i.e., object-pairs, and constructs the VG-OP (VG Object-Pair) training set to store these object-pairs. Meanwhile, this paper introduces a new SGG network training mode that achieves unbiased SGG without re-sampling or re-weighting. In particular, a Predicate-balanced Sampling Network (PS-Net) is designed to validate the new training mode. Extensive experiments on the VG test set demonstrate that our method achieves competitive or state-of-the-art unbiased SGG performance.
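The core idea in the abstract — drawing training instances uniformly over predicate categories from an object-pair pool, instead of taking whatever predicates happen to appear in a mini-batch of images — can be sketched as follows. This is a hypothetical illustration, not the authors' PS-Net code; the record schema (`subj`, `obj`, `predicate`) and function names are assumptions for the sketch.

```python
import random
from collections import defaultdict

def build_predicate_index(object_pairs):
    """Group object-pair instances by their predicate label,
    mimicking a VG-OP-style reorganization of the training set."""
    index = defaultdict(list)
    for pair in object_pairs:
        index[pair["predicate"]].append(pair)
    return index

def sample_balanced_batch(index, batch_size, rng=random):
    """Draw a batch with uniform attention across predicate categories,
    so tail predicates are sampled as often as head ones."""
    predicates = list(index.keys())
    batch = []
    for _ in range(batch_size):
        p = rng.choice(predicates)          # uniform over categories
        batch.append(rng.choice(index[p]))  # then uniform within the category
    return batch

# Toy long-tailed pool: the head predicate "on" dominates the tail "riding"
pairs = (
    [{"subj": "man", "obj": "horse", "predicate": "on"}] * 90
    + [{"subj": "man", "obj": "horse", "predicate": "riding"}] * 10
)
index = build_predicate_index(pairs)
batch = sample_balanced_batch(index, batch_size=1000, rng=random.Random(0))
```

Despite the 9:1 imbalance in the pool, each predicate category is drawn with probability 1/2 per slot, so the batch is roughly balanced — the effect that re-sampling/re-weighting methods otherwise have to impose within an image-level mini-batch.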

Keyword:

New training data organization form; Tail; Balanced predicate instances; Proposals; Unbiased scene graph generation; Head; Training; New training mode; Training data; Semantics; Predictive models

Author Community:

  • [ 1 ] [Xu, Hongbo]Beijing Univ Technol, Beijing Artificial Intelligence Inst, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
  • [ 2 ] [Wang, Lichun]Beijing Univ Technol, Beijing Artificial Intelligence Inst, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
  • [ 3 ] [Xu, Kai]Beijing Univ Technol, Beijing Artificial Intelligence Inst, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
  • [ 4 ] [Fu, Fangyu]Beijing Univ Technol, Beijing Artificial Intelligence Inst, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
  • [ 5 ] [Yin, Baocai]Beijing Univ Technol, Beijing Artificial Intelligence Inst, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
  • [ 6 ] [Huang, Qingming]Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China

Reprint Author's Address:

  • [Wang, Lichun]Beijing Univ Technol, Beijing Artificial Intelligence Inst, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

ISSN: 1051-8215

Year: 2024

Issue: 7

Volume: 34

Page: 5295-5305

Impact Factor: 8.400 (JCR@2022)

