• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Zhang, Tao (Zhang, Tao.) (Scholars:张涛) | Chen, Cai (Chen, Cai.)

Indexed by:

EI Scopus

Abstract:

In order to quickly obtain the main information contained in news documents, reduce redundant information and improve the efficiency of finding news with specific content. A Chinese text summarization method based on TF-IDF is proposed. This method uses TF-IDF to calculate the importance of each word in the article, and calculates the TF-IDF of each sentence based on the TF-IDF value of the word. In order to avoid the effect of sentence length on the calculation of sentence TF-IDF value. The sliding window is used to calculate the mean of all words TF-IDF in each sliding window using the given sliding window size. Use the value of the sliding window with the largest mean value in each sentence as the TF-IDF value of the sentence. Combined with other feature of the sentence, the importance of the sentence is calculated. The sentences with the specified length or number of words are intercepted and arranged according to the order of the articles to form a summarization of the article. After comparison experiments, the method is superior to the text summarization scheme based on TextRank method in terms of efficiency and effect. © 2020, Springer Nature Switzerland AG.

Keyword:

Text processing Efficiency Intelligent systems Abstracting

Author Community:

  • [ 1 ] [Zhang, Tao]School of Software, Beijing University of Technology, Beijing, China
  • [ 2 ] [Chen, Cai]School of Software, Beijing University of Technology, Beijing, China

Reprint Author's Address:

  • [chen, cai]school of software, beijing university of technology, beijing, china

Show more details

Related Keywords:

Related Article:

Source :

ISSN: 2194-5357

Year: 2020

Volume: 1084 AISC

Page: 206-212

Language: English

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count: 1

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 7

Online/Total:477/10521254
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.