• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Liu, Tengfei (Liu, Tengfei.) | Hu, Yongli (Hu, Yongli.) | Gao, Junbin (Gao, Junbin.) | Sun, Yanfeng (Sun, Yanfeng.) | Yin, Baocai (Yin, Baocai.)

Indexed by:

EI Scopus SCIE

Abstract:

Long Document Classification (LDC) has attracted great attention in Natural Language Processing and achieved considerable progress owing to the large-scale pre-trained language models. In spite of this, as a different problem from the traditional text classification, LDC is far from being settled. Long documents, such as news and articles, generally have more than thousands of words with complex structures. Moreover, compared with flat text, long documents usually contain multi-modal content of images, which provide rich information but not yet being utilized for classification. In this article, we propose a novel cross-modal method for long document classification, in which multiple granularity feature shifting networks are proposed to integrate the multi-scale text and visual features of long documents adaptively. Additionally, a multi-modal collaborative pooling block is proposed to eliminate redundant fine-grained text features and simultaneously reduce the computational complexity. To verify the effectiveness of the proposed model, we conduct experiments on the Food101 dataset and two constructed multi-modal long document datasets. The experimental results show that the proposed cross-modal method outperforms the single-modal text methods and defeats the state-of-the-art related multi-modal baselines. Copyright © 2024 held by the owner/author(s)

Keyword:

Information retrieval systems Classification (of information) Natural language processing systems Complex networks Text processing

Author Community:

  • [ 1 ] [Liu, Tengfei]Beijing University of Technology, No. 100, Pingleyuan, Beijing, China
  • [ 2 ] [Hu, Yongli]Beijing University of Technology, No. 100, Pingleyuan, Beijing, China
  • [ 3 ] [Gao, Junbin]The University of Sydney, Camperdown, Sydney, Australia
  • [ 4 ] [Sun, Yanfeng]Beijing University of Technology, No. 100, Pingleyuan, Beijing, China
  • [ 5 ] [Yin, Baocai]Beijing University of Technology, No. 100, Pingleyuan, Beijing, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

ACM Transactions on Knowledge Discovery from Data

ISSN: 1556-4681

Year: 2024

Issue: 4

Volume: 18

3 . 6 0 0

JCR@2022

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 4

Affiliated Colleges:

Online/Total:500/10554416
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.