Cross-modal Multiple Granularity Interactive Fusion Network for Long Document Classification

Author：

Liu, Tengfei | Hu, Yongli | Gao, Junbin | Sun, Yanfeng | Yin, Baocai

Indexed by：

EI Scopus SCIE

Abstract：

Long Document Classification (LDC) has attracted great attention in Natural Language Processing and has made considerable progress owing to large-scale pre-trained language models. Nevertheless, as a problem distinct from traditional text classification, LDC is far from settled. Long documents, such as news and articles, generally contain thousands of words and have complex structures. Moreover, compared with flat text, long documents usually contain multi-modal content such as images, which provide rich information but have not yet been utilized for classification. In this article, we propose a novel cross-modal method for long document classification, in which multiple granularity feature shifting networks adaptively integrate the multi-scale text and visual features of long documents. Additionally, a multi-modal collaborative pooling block is proposed to eliminate redundant fine-grained text features and simultaneously reduce the computational complexity. To verify the effectiveness of the proposed model, we conduct experiments on the Food101 dataset and two constructed multi-modal long document datasets. The experimental results show that the proposed cross-modal method outperforms single-modal text methods and surpasses related state-of-the-art multi-modal baselines. Copyright © 2024 held by the owner/author(s)

Keyword：

Information retrieval systems | Classification (of information) | Natural language processing systems | Complex networks | Text processing

Author Community：

[ 1 ] [Liu, Tengfei]Beijing University of Technology, No. 100, Pingleyuan, Beijing, China
[ 2 ] [Hu, Yongli]Beijing University of Technology, No. 100, Pingleyuan, Beijing, China
[ 3 ] [Gao, Junbin]The University of Sydney, Camperdown, Sydney, Australia
[ 4 ] [Sun, Yanfeng]Beijing University of Technology, No. 100, Pingleyuan, Beijing, China
[ 5 ] [Yin, Baocai]Beijing University of Technology, No. 100, Pingleyuan, Beijing, China

Related Keywords：

Research on Keyword Extraction Algorithm Using PMI and TextRank
2019，2nd IEEE International Conference on Information and Computer Technologies, ICICT 2019
Pyramid text recognition based on a new text representation model
2019，16th IEEE International Conference on Networking, Sensing and Control, ICNSC 2019
A novel term weighting scheme with distributional coefficient for text categorization with support vector machine
2010，
A new question answering system for chinese restricted domain
2006，IEICE Transactions on Information and Systems

Source ：

ACM Transactions on Knowledge Discovery from Data

ISSN： 1556-4681

Year： 2024

Issue： 4

Volume： 18

Impact Factor： 3.600 (JCR@2022)

Cited Count：

WoS CC Cited Count： 0

ESI Highly Cited Papers on the List： 0
