• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Jiang, Zong-Li (Jiang, Zong-Li.) (Scholars:蒋宗礼) | Xu, Xue-Ke (Xu, Xue-Ke.) | Li, Shuai (Li, Shuai.)

Indexed by:

EI Scopus PKU CSCD

Abstract:

Feature extraction is essential for text classification. In this paper we discussed the basic ideas behind word-clustering-based feature extraction. Then a text classification method for feature extraction by the means of words clustering was presented. It employed an improved tree-structured growing self-organization map (TGSOM) to carry out word clustering. Also a new formula for calculating weights was developed by taking account of the distinction between clustered word features and plain word features. Finally, the SPRINT decision tree was applied to complete the text classification. Experiments showed that the precision of text classification using the proposed method is improved by 4.32%.

Keyword:

Decision trees Classification (of information) Text processing Feature extraction Extraction

Author Community:

  • [ 1 ] [Jiang, Zong-Li]College of Computer Science, Beijing University of Technology, Beijing 100022, China
  • [ 2 ] [Xu, Xue-Ke]College of Computer Science, Beijing University of Technology, Beijing 100022, China
  • [ 3 ] [Li, Shuai]Department of Electric Engineering, Tsinghua University, Beijing 100084, China

Reprint Author's Address:

Show more details

Related Keywords:

Related Article:

Source :

Journal of Harbin Engineering University

ISSN: 1006-7043

Year: 2008

Issue: 11

Volume: 29

Page: 1205-1209

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 9

Online/Total:426/10520161
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.