Author:

Cui, Zheng | Hu, Yongli | Sun, Yanfeng | Gao, Junbin | Yin, Baocai

Indexed by:

EI Scopus

Abstract:

Image-text retrieval is a challenging task because images and text are heterogeneous cross-modal data separated by a semantic gap. The key issue in image-text retrieval is how to learn a common feature space that preserves the semantic correspondence between image and text. Some existing works extract region features from images and word features from text to align local elements across modalities; others integrate relation-aware information into local elements to compute cross-modal similarity. However, these methods do not utilize the semantic information available at different semantic levels. To address this issue, we propose a Bottom-up Progressive Semantic Alignment (BPSA) network, in which precise fine-grained alignment is carried out progressively across diverse semantic levels. Specifically, the features of the cross-modal data are extracted from bottom elements to local groups and then to global representations by graph convolution and an attention mechanism. We conduct extensive experiments on the Flickr30K and MS-COCO datasets and compare with related state-of-the-art methods. The results show that our network achieves competitive performance. © 2021, Springer Nature Switzerland AG.

Keyword:

Information retrieval; Modal analysis; Semantics; Alignment

Author Community:

  • [ 1 ] [Cui, Zheng]Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing, China
  • [ 2 ] [Hu, Yongli]Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing, China
  • [ 3 ] [Sun, Yanfeng]Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing, China
  • [ 4 ] [Gao, Junbin]Discipline of Business Analytics, The University of Sydney Business School, The University of Sydney, Sydney, NSW, Australia
  • [ 5 ] [Yin, Baocai]Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing, China

Source:

ISSN: 1865-0929

Year: 2021

Volume: 1517 CCIS

Page: 417-424

Language: English
