• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Jia, Xibin (Jia, Xibin.) | Mi, Qing (Mi, Qing.) | Dai, Qi (Dai, Qi.)

Indexed by:

CPCI-S EI Scopus

Abstract:

Although generative adversarial networks are commonly used in textto-image generation tasks and have made great progress, there are still some problems. The convolution operation used in these GANs-based methods works on local regions, but not disjoint regions of the image, leading to structural anomalies in the generated image. Moreover, the semantic consistency of generated images and corresponding text descriptions still needs to be improved. In this paper, we propose a multi-attention generative adversarial networks (MAGAN) for text-toimage generation. We use self-attention mechanism to improve the overall quality of images, so that the target image with a certain structure can also be generated well. We use multi-head attentionmechanism to improve the semantic consistency of generated images and text descriptions. We conducted extensive experiments on three datasets: Oxford-102 Flowers dataset, Caltech-UCSD Birds dataset and COCO dataset. Our MAGAN has better results than representative methods such as AttnGAN, MirrorGAN and ControlGAN.

Keyword:

Text-to-image generation Attention mechanism Generative adversarial networks

Author Community:

  • [ 1 ] [Jia, Xibin]Beijing Univ Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Mi, Qing]Beijing Univ Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Dai, Qi]Beijing Univ Technol, Beijing 100124, Peoples R China

Reprint Author's Address:

Show more details

Related Keywords:

Related Article:

Source :

PATTERN RECOGNITION AND COMPUTER VISION, PT IV

ISSN: 0302-9743

Year: 2021

Volume: 13022

Page: 312-322

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 10

Affiliated Colleges:

Online/Total:427/10701327
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.