• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Yang, Zhen (Yang, Zhen.) (Scholars:杨震) | Duan, Li-Juan (Duan, Li-Juan.) (Scholars:段立娟) | Lai, Ying-Xu (Lai, Ying-Xu.) (Scholars:赖英旭)

Indexed by:

EI Scopus PKU CSCD

Abstract:

The unique language characteristic of short texts has made the performance of traditional natural language processing methods degradation, or even unavailable. Exact representation and calculation of the similarity between short texts are great helpful to content based clustering. That this paper treated each short text as a composition of characters, numbers and punctuation, and a similarity measure based on string similarity was proposed. Then a public opinion hotspot detection and analysis system based on short text hierarchical clustering was built. This method calculated the similarity directly which skipped the feature extraction and representation processing of short text, to a certain extent, and avoided using the sparse feature vectors. Experimental results show the effectiveness of the proposed method.

Keyword:

Text processing Processing Natural language processing systems Social aspects

Author Community:

  • [ 1 ] [Yang, Zhen]College of Computer Science, Beijing University of Technology, Beijing 100124, China
  • [ 2 ] [Duan, Li-Juan]College of Computer Science, Beijing University of Technology, Beijing 100124, China
  • [ 3 ] [Lai, Ying-Xu]College of Computer Science, Beijing University of Technology, Beijing 100124, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

Journal of Beijing University of Technology

ISSN: 0254-0037

Year: 2010

Issue: 5

Volume: 36

Page: 669-673

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 5

Online/Total:444/10596367
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.