• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Bi, Yandong (Bi, Yandong.) | Jiang, Huajie (Jiang, Huajie.) | Zhang, Hanfu (Zhang, Hanfu.) | Hu, Yongli (Hu, Yongli.) | Yin, Baocai (Yin, Baocai.)

Indexed by:

EI Scopus SCIE

Abstract:

As a popular cross-modal reasoning task, Visual Question Answering (VQA) has achieved great progress in recent years. However, the issue of language bias has always affected the reliability of VQA models. To address this problem, counterfactual learning methods are proposed to learn more robust features to mitigate the bias problem. However, current counterfactual learning approaches mainly focus on generating synthesized samples and assigning answers to them, neglecting the relationship between factual and original data, which hinders robust feature learning for effective reasoning. To overcome this limitation, we propose a Self-supervised Knowledge Distillation approach in Counterfactual Learning for VQA, dubbed as VQA-SkdCL, which utilizes a self-supervised constraint to make good use of the hidden knowledge in the factual samples, enhancing the robustness of VQA models. We demonstrate the effectiveness of the proposed approach on VQA v2, VQA-CP v1, and VQA-CP v2 datasets and our approach achieves excellent performance.

Keyword:

Counterfactual learning Language bias Visual question answering Self-supervised learning

Author Community:

  • [ 1 ] [Bi, Yandong]Beijing Univ Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing Inst Artificial Intelligence, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Jiang, Huajie]Beijing Univ Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing Inst Artificial Intelligence, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Zhang, Hanfu]Beijing Univ Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing Inst Artificial Intelligence, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 4 ] [Hu, Yongli]Beijing Univ Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing Inst Artificial Intelligence, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 5 ] [Yin, Baocai]Beijing Univ Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing Inst Artificial Intelligence, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 6 ] [Jiang, Huajie]Beijing Univ Technol, Beijing 100124, Peoples R China

Reprint Author's Address:

  • [Jiang, Huajie]Beijing Univ Technol, Beijing 100124, Peoples R China;;

Show more details

Related Keywords:

Related Article:

Source :

PATTERN RECOGNITION LETTERS

ISSN: 0167-8655

Year: 2023

Volume: 177

Page: 33-39

5 . 1 0 0

JCR@2022

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 0

Affiliated Colleges:

Online/Total:2528/10973359
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.