• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Zhao, Hong (Zhao, Hong.) | Zhu, Zhong-Zhi (Zhu, Zhong-Zhi.) | Xu, Pei-Zhi (Xu, Pei-Zhi.) | Wang, Wei-Dong (Wang, Wei-Dong.)

Indexed by:

EI Scopus

Abstract:

Keyword extraction is a critical technique for document retrieval and text mining, Web page retrieval and document clustering. The traditional keyword extraction method is overly dependent on word frequency, which may lead to the limitations of the keyword extraction in short sentences. In order to solve this problem, we propose a novel word embedding generation method for keyword extraction, which trains a special domain word embedding to extract keywords automatically from user-generated query words. To ensure that the experimental results are not biased by the above test sample, we train the word embedding with the Chinese version of Wikipedia for contrast experiment. Compared with other methods, the recall rate of the proposed method reaches 92.55%, higher than the other current methods.

Keyword:

Data mining Extraction Information retrieval

Author Community:

  • [ 1 ] [Zhao, Hong]Department of Software Engineering, Beijing JiaoTong University, Beijing, Beijing; 100044, China
  • [ 2 ] [Zhu, Zhong-Zhi]Department of Software Engineering, Beijing JiaoTong University, Beijing, Beijing; 100044, China
  • [ 3 ] [Xu, Pei-Zhi]Baidu Online Network Technology (Beijing) Co., Ltd, Beijing, Beijing; 100094, China
  • [ 4 ] [Wang, Wei-Dong]Faculty of Information Technology, Beijing University of Technology, Beijing, Beijing; 1000124, China

Reprint Author's Address:

  • [zhao, hong]department of software engineering, beijing jiaotong university, beijing, beijing; 100044, china

Show more details

Related Keywords:

Related Article:

Source :

Technical Bulletin

ISSN: 0376-723X

Year: 2017

Issue: 3

Volume: 55

Page: 41-47

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 10

Online/Total:1190/10543497
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.