Indexed by:
Abstract:
Document similarity computation is an exciting research topic in Information Retrieval (IR) and it is a key issue for automatic document categorization, clustering analysis, fuzzy query, and question answering. Topic model is an emerging field in Natural Language Processing (NLP), IR, and Machine Learning (ML). In this paper, we apply a Latent Dirichlet Allocation (LDA) topic model-based method to compute similarity between documents. By mapping a document with term space representation into a topic space, a distribution over topics is derived for computing document similarity. An empirical study using real data set demonstrates the efficiency of our method. © 2015 Taylor & Francis Group, London.
Keyword:
Reprint Author's Address:
Email:
Source :
Year: 2015
Page: 303-311
Language: English
Cited Count:
WoS CC Cited Count: 0
SCOPUS Cited Count: 2
ESI Highly Cited Papers on the List: 0 Unfold All
WanFang Cited Count:
Chinese Cited Count:
30 Days PV: 8
Affiliated Colleges: