Abstract:
Benefiting from pre-trained language representation models such as BERT, recently proposed document classification methods have achieved considerable improvements. However, most of these methods model the document as a plain sequence of text and ignore its structural information, which is particularly evident in long documents composed of several sections with defined relations. To address this, we propose a novel Hierarchical Attention Transformer Network (HATN) for long document classification, which captures the structure of the long document with intra- and inter-section attention transformers and further strengthens feature interaction through two fusion gates: the Residual Fusion Gate (RFG) and the Feature Fusion Gate (FFG). The proposed method is evaluated on three long document datasets, and the experimental results show that our approach outperforms related state-of-the-art methods. The code will be available at https://github.com/TengfeiLiu966/HATN
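The paper and its repository give the exact formulation of these modules; purely as an illustration, the sketch below shows a generic gated feature-fusion step in the spirit of the fusion gates mentioned in the abstract. The module name, dimensions, and gating formula are assumptions made for this example, not the paper's actual RFG/FFG definitions.

```python
import torch
import torch.nn as nn

class FusionGate(nn.Module):
    """Illustrative gated fusion of two feature vectors.
    (Hypothetical sketch; the paper's RFG/FFG may be defined differently.)"""
    def __init__(self, dim: int):
        super().__init__()
        # The gate is computed from the concatenation of the two inputs.
        self.gate = nn.Linear(2 * dim, dim)

    def forward(self, a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
        # g in (0, 1) decides, per feature dimension, how much of each input to keep.
        g = torch.sigmoid(self.gate(torch.cat([a, b], dim=-1)))
        return g * a + (1 - g) * b

# Example usage: fuse section-level features produced by intra- and
# inter-section attention transformers (shapes are assumptions).
fuse = FusionGate(dim=768)
intra_feat = torch.randn(4, 768)
inter_feat = torch.randn(4, 768)
fused = fuse(intra_feat, inter_feat)  # shape: (4, 768)
```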
Source: 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)
ISSN: 2161-4393
Year: 2021
Cited Count:
WoS CC Cited Count: 1
SCOPUS Cited Count: 5
ESI Highly Cited Papers on the List: 0