• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Cao, Y. (Cao, Y..) | Qin, J. (Qin, J..) | Gao, T. (Gao, T..) | Ma, Q. (Ma, Q..) | Ren, J. (Ren, J..)

Indexed by:

EI Scopus

Abstract:

A generative adversarial network with deep fusion attention (DFA-GAN) was proposed, using multiple loss functions as constraints, to address the issues of poor image quality and inconsistency between text descriptions and generated images in traditional text-to-image generation methods. A two-stage image generation process was employed with a single-level generative adversarial network (GAN) as the backbone. An initial blurry image which was generated in the first stage was fed into the second stage, and high-quality image regeneration was achieved to enhance the overall image generation quality. During the first stage, a visual-text fusion module was designed to deeply integrate text features and image features, and text information was adequately fused during the image sampling process at different scales. In the second stage, an image generator with an improved Vision Transformer as the encoder was proposed to fully fuse image features with text description word features. Quantitative and qualitative experimental results showed that the proposed method outperformed other mainstream models in terms of image quality improvement and alignment with text descriptions. © 2024 Zhejiang University. All rights reserved.

Keyword:

semantics consistency deep fusion text-to-image multi-scale feature fusion generative adversarial network(GAN)

Author Community:

  • [ 1 ] [Cao Y.]College of Data Science and Applications, Inner Mongolia University of Technology, Hohhot, 010051, China
  • [ 2 ] [Cao Y.]Inner Mongolia Autonomous Region Engineering Technology Research Center of Big Data Based Software Service, Hohhot, 010000, China
  • [ 3 ] [Qin J.]College of Data Science and Applications, Inner Mongolia University of Technology, Hohhot, 010051, China
  • [ 4 ] [Qin J.]Inner Mongolia Autonomous Region Engineering Technology Research Center of Big Data Based Software Service, Hohhot, 010000, China
  • [ 5 ] [Gao T.]Inner Mongolia Autonomous Region Engineering Technology Research Center of Big Data Based Software Service, Hohhot, 010000, China
  • [ 6 ] [Gao T.]Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 7 ] [Ma Q.]College of Data Science and Applications, Inner Mongolia University of Technology, Hohhot, 010051, China
  • [ 8 ] [Ma Q.]Inner Mongolia Autonomous Region Engineering Technology Research Center of Big Data Based Software Service, Hohhot, 010000, China
  • [ 9 ] [Ren J.]College of Data Science and Applications, Inner Mongolia University of Technology, Hohhot, 010051, China
  • [ 10 ] [Ren J.]Inner Mongolia Autonomous Region Engineering Technology Research Center of Big Data Based Software Service, Hohhot, 010000, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Source :

Journal of Zhejiang University (Engineering Science)

ISSN: 1008-973X

Year: 2024

Issue: 4

Volume: 58

Page: 674-683

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 5

Affiliated Colleges:

Online/Total:551/10728489
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.