Author:

Jia, Yalu (Jia, Yalu.) | Liu, Lei (Liu, Lei.) | Chen, Hao (Chen, Hao.)

Indexed by:

EI Scopus

Abstract:

Unknown word recognition is an important research topic in natural language processing. However, identifying unknown words in micro-blog short texts still suffers from sparse data, corpus noise, and varied forms of expression. This paper proposes POS-FP (Frequent Pattern growth with part-of-speech), an unknown word recognition method for micro-blog short text. First, candidate unknown words are obtained by combining the N-gram model with frequent item sets. Then the candidates are filtered and verified using improved mutual information, information entropy, and context dependence. Finally, an open verification method is used to obtain the final unknown words. Experiments show that the algorithm improves unknown word recognition for micro-blog short texts. © 2018 IEEE.
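
The candidate-then-filter pipeline summarized in the abstract can be illustrated with a minimal sketch. This is not the authors' POS-FP method: the record does not specify the "improved" mutual information, the part-of-speech constraints, the context-dependence measure, or the open verification step. The sketch assumes plain pointwise mutual information and left/right branching entropy as stand-ins, and every function name and threshold (`candidate_words`, `min_freq`, `min_pmi`, `min_entropy`) is hypothetical.

```python
import math
from collections import Counter, defaultdict


def ngram_counts(tokens, max_n):
    """Count every n-gram (as a tuple) for n = 1..max_n."""
    counts = Counter()
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts


def pmi(gram, counts, total):
    """Lowest pointwise mutual information over all two-part splits of gram.

    Probabilities are approximated as count / total tokens (an assumption).
    """
    p_gram = counts[gram] / total
    scores = []
    for k in range(1, len(gram)):
        p_left = counts[gram[:k]] / total
        p_right = counts[gram[k:]] / total
        scores.append(math.log2(p_gram / (p_left * p_right)))
    return min(scores)


def entropy(neighbor_counts):
    """Shannon entropy (bits) of a neighbor distribution; 0 if it is empty."""
    total = sum(neighbor_counts.values())
    return -sum(c / total * math.log2(c / total)
                for c in neighbor_counts.values())


def candidate_words(tokens, max_n=3, min_freq=2, min_pmi=1.0, min_entropy=0.5):
    """Return n-grams (n >= 2) that pass frequency, PMI and entropy filters."""
    counts = ngram_counts(tokens, max_n)
    total = len(tokens)

    # Collect the tokens seen immediately left and right of each n-gram.
    left, right = defaultdict(Counter), defaultdict(Counter)
    for n in range(2, max_n + 1):
        for i in range(len(tokens) - n + 1):
            gram = tuple(tokens[i:i + n])
            if i > 0:
                left[gram][tokens[i - 1]] += 1
            if i + n < len(tokens):
                right[gram][tokens[i + n]] += 1

    results = []
    for gram, freq in counts.items():
        if len(gram) < 2 or freq < min_freq:
            continue  # frequent-pattern style support threshold
        if pmi(gram, counts, total) < min_pmi:
            continue  # parts do not cohere strongly enough
        if min(entropy(left[gram]), entropy(right[gram])) < min_entropy:
            continue  # contexts too predictable: likely a fragment
        results.append(gram)
    return results


if __name__ == "__main__":
    # Toy character sequence in which "AB" recurs in varied contexts.
    print(candidate_words(list("xAByABzABwABv")))
```

On the toy sequence, only the bigram `("A", "B")` meets the frequency threshold, and it also passes the cohesion and context-diversity filters. A real system would tokenize micro-blog text first and tune the thresholds on held-out data.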

Keyword:

Vocabulary control; Blogs; Fuzzy systems; Speech recognition; Character recognition; Information filtering; Natural language processing systems

Author Community:

  • [ 1 ] [Jia, Yalu] Beijing Institute for Scientific and Engineering Computing, College of Applied Sciences, Beijing University of Technology, Beijing, China
  • [ 2 ] [Liu, Lei] Beijing Institute for Scientific and Engineering Computing, College of Applied Sciences, Beijing University of Technology, Beijing, China
  • [ 3 ] [Chen, Hao] Beijing Institute for Scientific and Engineering Computing, College of Applied Sciences, Beijing University of Technology, Beijing, China

Reprint Author's Address:

  • [Liu, Lei] Beijing Institute for Scientific and Engineering Computing, College of Applied Sciences, Beijing University of Technology, Beijing, China

Year: 2018

Page: 1-7

Language: English

Cited Count:

SCOPUS Cited Count: 1

ESI Highly Cited Papers on the List: 0
