• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Yang, Suhua (Yang, Suhua.) | Yan, Jianzhuo (Yan, Jianzhuo.) | Gao, Chen (Gao, Chen.) | Tan, Guohua (Tan, Guohua.)

Indexed by:

EI Scopus

Abstract:

In this paper, a new blogger's interest mining module is proposed, which is based on Chinese text classification. In fact, the problem of the interest mining is transformed into the problem of Chinese text categorization. Before the Chinese text categorization, the text is pre-processed for the text representation. The Chinese text is represented in vector space model and classified by support vector machine classification, while filter algorithm which filters the unrelated interest text is proposed. After the filtering, the text can get it's interest category. Finally the new module has been made use of to carry out an interest mining experiment, and the other experiment which has not filter algorithm is also carried in order to compare with the new module. The two experimental results show that the support vector machine is a effective algorithm, and the comparing data of the two experiments shows that new module make the interest mining more effective. © 2011 Springer-Verlag Berlin Heidelberg.

Keyword:

Vector spaces Text processing Classification (of information) Blogs Support vector machines Vectors Filtration

Author Community:

  • [ 1 ] [Yang, Suhua]College of Electronic Information and Control Engineering, Beijing University of Technology, Beijing, China
  • [ 2 ] [Yan, Jianzhuo]College of Electronic Information and Control Engineering, Beijing University of Technology, Beijing, China
  • [ 3 ] [Gao, Chen]College of Electronic Information and Control Engineering, Beijing University of Technology, Beijing, China
  • [ 4 ] [Tan, Guohua]College of Electronic Information and Control Engineering, Beijing University of Technology, Beijing, China

Reprint Author's Address:

Show more details

Related Keywords:

Related Article:

Source :

ISSN: 1867-5662

Year: 2011

Volume: 100

Page: 611-618

Language: English

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count: 2

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 4

Online/Total:601/10600358
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.