• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Shi, YuLiang (Shi, YuLiang.) | Zhang, Ti (Zhang, Ti.)

Indexed by:

CPCI-S

Abstract:

In this article, an efficient and scalable distributed web crawler system based on Hadoop will be design and implement. In the paper, firstly the application of cloud computing in reptile field is introduced briefly, and then according to the current status of the crawler system, the specific use of Hadoop distributed and cloud computing features detailed design of a highly scalable crawler system, and finally the system Data statistics, under the same conditions, compared with the existing mature system, it is clear that the superiority of distributed web crawler. This advantage in the context of large data era of massive data is particularly important to climb.

Keyword:

hadoop big data distributed crawler cloud computing

Author Community:

  • [ 1 ] [Shi, YuLiang]BJUT, Sch Beijing Univ Technol, Beijing, Peoples R China
  • [ 2 ] [Zhang, Ti]BJUT, Sch Beijing Univ Technol, Beijing, Peoples R China

Reprint Author's Address:

  • [Shi, YuLiang]BJUT, Sch Beijing Univ Technol, Beijing, Peoples R China

Show more details

Related Keywords:

Related Article:

Source :

2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA)

Year: 2017

Page: 537-541

Language: English

Cited Count:

WoS CC Cited Count: 2

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 9

Affiliated Colleges:

Online/Total:569/10582984
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.