• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

He, Ming (He, Ming.) | Wang, Zhen-zhen (Wang, Zhen-zhen.) | Du, Yong-ping (Du, Yong-ping.) (Scholars:杜永萍)

Indexed by:

CPCI-S EI Scopus

Abstract:

Document similarity computation is an exciting research topic in information retrieval (IR) and it is a key issue for automatic document categorization, clustering analysis, fuzzy query and question answering. Topic model is an emerging field in natural language processing ( NLP), IR and machine learning (ML). In this paper, we apply a latent Dirichlet allocation (LDA) topic modelbased method to compute similarity between documents. By mapping a document with term space representation into a topic space, a distribution over topics derived for computing document similarity. An empirical study using real data set demonstrates the efficiency of our method.

Keyword:

document similarity computation topic model latent Dirichlet allocation

Author Community:

  • [ 1 ] [He, Ming]Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China
  • [ 2 ] [Wang, Zhen-zhen]Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China
  • [ 3 ] [Du, Yong-ping]Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China

Reprint Author's Address:

  • [He, Ming]Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China

Show more details

Related Keywords:

Source :

APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY

ISSN: 1660-9336

Year: 2014

Volume: 513-517

Page: 1280-1284

Language: English

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 3

Online/Total:656/10645160
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.