• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Wu, Xiaojun (Wu, Xiaojun.) | Fang, Liying (Fang, Liying.) | Wang, Pu (Wang, Pu.) (Scholars:王普) | Yu, Nan (Yu, Nan.)

Indexed by:

CPCI-S

Abstract:

Chinese text classification is always challenging, especially when data are high dimensional and sparse. In this paper, we are interested in the way of text representation and dimension reduction in Chinese text classification. First, we introduces a topic model ------ Latent Dirichlet Allocation(LDA), which is uses LDA model as a dimension reduction method. Second, we choose Support Vector Machine(SVM) as the classification algorithm. Next, a method of text classification based on LDA and SVM is described. Finally, we choose documents with large number of Chinese text for experiment. Compared with LDA method and the traditional TF*IDF method, the experimental results show that LDA method runs a better results both on the classification accuracy and running time.

Keyword:

dimension reduction LDA text classification

Author Community:

  • [ 1 ] [Wu, Xiaojun]Beijing Univ Technol, Dept Elect Informat & Control Engn, Beijing, Peoples R China
  • [ 2 ] [Fang, Liying]Beijing Univ Technol, Dept Elect Informat & Control Engn, Beijing, Peoples R China
  • [ 3 ] [Wang, Pu]Beijing Univ Technol, Dept Elect Informat & Control Engn, Beijing, Peoples R China
  • [ 4 ] [Yu, Nan]Beijing Univ Technol, Dept Elect Informat & Control Engn, Beijing, Peoples R China

Reprint Author's Address:

  • [Fang, Liying]Beijing Univ Technol, Dept Elect Informat & Control Engn, Beijing, Peoples R China

Show more details

Related Keywords:

Source :

2015 IEEE 28TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE)

ISSN: 0840-7789

Year: 2015

Page: 1260-1264

Language: English

Cited Count:

WoS CC Cited Count: 14

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 5

Online/Total:380/10629645
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.