Using probabilistic topic models for document similarity computation

Author：

He, Ming | Zheng, Wei

Indexed by：

EI Scopus

Abstract：

Document similarity computation is an exciting research topic in Information Retrieval (IR) and it is a key issue for automatic document categorization, clustering analysis, fuzzy query, and question answering. Topic model is an emerging field in Natural Language Processing (NLP), IR, and Machine Learning (ML). In this paper, we apply a Latent Dirichlet Allocation (LDA) topic model-based method to compute similarity between documents. By mapping a document with term space representation into a topic space, a distribution over topics is derived for computing document similarity. An empirical study using real data set demonstrates the efficiency of our method. © 2015 Taylor & Francis Group, London.
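
The comparison step described in the abstract can be illustrated with a minimal sketch. It assumes each document has already been mapped to a distribution over topics by some LDA implementation. This record does not say which similarity measure the paper uses, so the Jensen-Shannon divergence below is an assumption chosen for illustration, and the names `kl_divergence`, `js_similarity`, and the sample vectors are hypothetical.

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence KL(p || q) in bits.

    Terms with p_i == 0 contribute nothing, by the usual convention.
    """
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def js_similarity(p, q):
    """Similarity of two topic distributions as 1 - Jensen-Shannon divergence.

    With base-2 logarithms the divergence lies in [0, 1], so the result
    is 1.0 for identical distributions and 0.0 for disjoint ones.
    NOTE: the choice of Jensen-Shannon is an assumption, not taken from the paper.
    """
    if len(p) != len(q):
        raise ValueError("distributions must cover the same number of topics")
    # Midpoint distribution; it is positive wherever p or q is positive.
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    jsd = 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)
    return 1.0 - jsd


# Hypothetical topic distributions over three topics (illustrative values only).
doc_a = [0.7, 0.2, 0.1]
doc_b = [0.6, 0.3, 0.1]
doc_c = [0.05, 0.05, 0.9]

print(js_similarity(doc_a, doc_b))  # topic mixtures close together
print(js_similarity(doc_a, doc_c))  # topic mixtures far apart
```

Because the measure is symmetric and bounded, the resulting scores can be compared directly across document pairs; other measures over topic distributions, such as cosine similarity, would slot into the same place.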

Keyword：

Statistics | Natural language processing systems

Author Community：

[ 1 ] [He, Ming] College of Computer Science, Beijing University of Technology, Beijing, China
[ 2 ] [Zheng, Wei] College of Computer Science, Beijing University of Technology, Beijing, China

Source ：

Year： 2015

Page： 303-311

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 2

ESI Highly Cited Papers on the List： 0

Affiliated Colleges：

School of Information Science and Technology (data not explicitly attributed to a unit within this school/department)
