• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Zhang, Tao (Zhang, Tao.) (Scholars:张涛) | Wang, LuYao (Wang, LuYao.)

Indexed by:

EI Scopus

Abstract:

TF-IDF is widely used as the most common feature weight calculation method. The traditional TF-IDF feature extraction method lacks the representation of the distribution difference between classes in the text classification task and the feature matrix generated by the TF-IDF is huge and sparse. Based on this situation, this paper proposes a method of using the feature extraction algorithm of chi-square statistics to compensate for the distribution difference between classes and generating a fixed-dimensional real matrix through word2vec. The experimental results show that the new method is significantly better than the traditional feature extraction methods in the evaluation results such as precision, recall, F1 and ROC_AUC. © 2020, Springer Nature Switzerland AG.

Keyword:

Intelligent systems Feature extraction Text processing Extraction Classification (of information)

Author Community:

  • [ 1 ] [Zhang, Tao]School of Software, Beijing University of Technology, Beijing, China
  • [ 2 ] [Wang, LuYao]School of Software, Beijing University of Technology, Beijing, China

Reprint Author's Address:

  • [wang, luyao]school of software, beijing university of technology, beijing, china

Show more details

Related Keywords:

Related Article:

Source :

ISSN: 2194-5357

Year: 2020

Volume: 1084 AISC

Page: 199-205

Language: English

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count: 4

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 11

Online/Total:548/10519474
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.