• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Tan, Hongchen (Tan, Hongchen.) | Yin, Baocai (Yin, Baocai.) | Wei, Kun (Wei, Kun.) | Liu, Xiuping (Liu, Xiuping.) | Li, Xin (Li, Xin.)

Indexed by:

EI Scopus SCIE

Abstract:

We propose a novel Text-to-Image Generation Network, Adaptive Layout Refinement Generative Adversarial Network (ALR-GAN), to adaptively refine the layout of synthesized images without any auxiliary information. The ALR-GAN includes an Adaptive Layout Refinement (ALR) module and a Layout Visual Refinement (LVR) loss. The ALR module aligns the layout structure (which refers to locations of objects and background) of a synthesized image with that of its corresponding real image. In ALR module, we proposed an Adaptive Layout Refinement (ALR) loss to balance the matching of hard and easy features, for more efficient layout structure matching. Based on the refined layout structure, the LVR loss further refines the visual representation within the layout area. Experimental results on two widely-used datasets show that ALR-GAN performs competitively at the Text-to-Image generation task.

Keyword:

text-to-image synthesis information consistency constraint Task analysis Adaptation models Training Visualization Semantics object layout refinement Layout Generators Generative adversarial network

Author Community:

  • [ 1 ] [Tan, Hongchen]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 2 ] [Yin, Baocai]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 3 ] [Wei, Kun]Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
  • [ 4 ] [Liu, Xiuping]Dalian Univ Technol, Sch Math Sci, Dalian 116024, Peoples R China
  • [ 5 ] [Li, Xin]Texas A&M Univ, Sch Performance Visualizat & Fine Arts, Sect Visual Comp & Creat Media, College Stn, TX 77843 USA

Reprint Author's Address:

Show more details

Related Keywords:

Related Article:

Source :

IEEE TRANSACTIONS ON MULTIMEDIA

ISSN: 1520-9210

Year: 2023

Volume: 25

Page: 8620-8631

7 . 3 0 0

JCR@2022

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 1

Affiliated Colleges:

Online/Total:2473/10895412
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.