• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Fan, Q. (Fan, Q..) | Song, X. (Song, X..)

Indexed by:

EI Scopus

Abstract:

The long-tailed distribution of data poses a significant challenge in text classification tasks. The imbalanced distribution of samples among categories often hinders effective classification of categories with a limited number of samples (referred to as tail class), leading to suboptimal overall classification performance. To address this issue and accurately classify the texts of complaints and reports, this paper proposes a text classification method tailored for long-tailed distribution, leveraging a rebalanced loss function to adjust the weights of samples from different categories. In the proposed approach, we first improve the classic loss function by incorporating the Gumbel activation function to replace the conventional activation function. This modification imparts varying gradients to both head class and tail class, thereby mitigating classification bias. Subsequently, to counteract overfitting in the tail class, regularization constraints is introduced within the loss function, enhancing its generalization capability. Experimental results demonstrate that, when compared to alternative loss functions, the method presented in this paper yields superior classification results in addressing multi-classification problems characterized by long-tailed distribution of text data. © 2024 IEEE.

Keyword:

text classification rebalanced loss function complaints and reports long-tailed distribution

Author Community:

  • [ 1 ] [Fan Q.]Beijing University of Technology, Information Department, Beijing, 100124, China
  • [ 2 ] [Fan Q.]Engineering Research Center of Digital Community, Ministry of Education, Beijing, 100124, China
  • [ 3 ] [Fan Q.]Beijing University of Technology, Beijing Key Laboratory of Urban Rail Transit, Beijing, 100124, China
  • [ 4 ] [Song X.]Beijing University of Technology, Information Department, Beijing, 100124, China
  • [ 5 ] [Song X.]Engineering Research Center of Digital Community, Ministry of Education, Beijing, 100124, China
  • [ 6 ] [Song X.]Beijing University of Technology, Beijing Key Laboratory of Urban Rail Transit, Beijing, 100124, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Source :

ISSN: 2689-6621

Year: 2024

Page: 1901-1906

Language: English

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count: 1

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 8

Affiliated Colleges:

Online/Total:450/10510498
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.