• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Li, Wen-Bin (Li, Wen-Bin.) | Liu, Chun-Nian (Liu, Chun-Nian.) | Chen, Yi-Ying (Chen, Yi-Ying.)

Indexed by:

EI Scopus PKU CSCD

Abstract:

In order to improve the training speed of classifiers without losing their accuracy, three classifying algorithms based on information gain of features are provided in this work. They are IG-C1, IG-C2 and IG-C, which classifies unlabeled text according to features weight generated in feature selection phase. All these approaches have two characteristics: lower time complexity and simpler implementation. The performance comparison between these algorithms and Naive Bayes, Vector Space Model using retuers 21578 and 20 newsgroup data sets, shows that IG-C algorithm is best one.

Keyword:

Classification (of information) Entropy Feature extraction Text processing

Author Community:

  • [ 1 ] [Li, Wen-Bin]Beijing Municipal Key Laboratory of Multimedia and Intelligent Software Technology, College of Computer Science and Technology, Beijing University of Technology, Beijing 100022, China
  • [ 2 ] [Li, Wen-Bin]School of Information Engineer, Shijiazhuang University of Economics, Shijizahuang 050031, China
  • [ 3 ] [Liu, Chun-Nian]Beijing Municipal Key Laboratory of Multimedia and Intelligent Software Technology, College of Computer Science and Technology, Beijing University of Technology, Beijing 100022, China
  • [ 4 ] [Chen, Yi-Ying]School of Information Engineer, Shijiazhuang University of Economics, Shijizahuang 050031, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

Journal of Beijing University of Technology

ISSN: 0254-0037

Year: 2006

Issue: 5

Volume: 32

Page: 456-460

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 8

Online/Total:597/10518125
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.