Abstract:
In this article, an efficient and scalable distributed web crawler system based on Hadoop is designed and implemented. The paper first briefly introduces the application of cloud computing in the web crawler field. Then, based on the current status of crawler systems, it presents the detailed design of a highly scalable crawler that makes specific use of Hadoop's distributed and cloud computing features. Finally, statistics gathered from the system are compared, under the same conditions, with those of an existing mature system, demonstrating the superiority of the distributed web crawler. This advantage is particularly important for crawling massive data in the era of big data. © 2017 IEEE.
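Note: the abstract describes distributing crawl work across a Hadoop cluster but does not include code. The sketch below is an illustrative assumption, not the authors' implementation: it expresses one fetch round as a map-only Hadoop MapReduce job in which each mapper reads seed URLs from HDFS, downloads the pages, and writes (URL, HTML) pairs back to HDFS. The class names FetchJob and FetchMapper, the timeouts, and the input/output paths are all hypothetical.

import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.URL;
import java.util.Scanner;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class FetchJob {

    // Mapper: input value is one seed URL per line; output is (url, html).
    public static class FetchMapper
            extends Mapper<LongWritable, Text, Text, Text> {

        @Override
        protected void map(LongWritable key, Text value, Context context)
                throws IOException, InterruptedException {
            String url = value.toString().trim();
            if (url.isEmpty()) {
                return;
            }
            try {
                HttpURLConnection conn =
                        (HttpURLConnection) new URL(url).openConnection();
                conn.setConnectTimeout(5000);
                conn.setReadTimeout(5000);
                // Read the whole response body as a single string.
                try (Scanner scanner =
                             new Scanner(conn.getInputStream(), "UTF-8")) {
                    String html = scanner.useDelimiter("\\A").hasNext()
                            ? scanner.next() : "";
                    context.write(new Text(url), new Text(html));
                }
            } catch (IOException e) {
                // Failed fetches are skipped here; a real system would record
                // them for retry in a later crawl round.
            }
        }
    }

    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        Job job = Job.getInstance(conf, "crawler-fetch");
        job.setJarByClass(FetchJob.class);
        job.setMapperClass(FetchMapper.class);
        job.setNumReduceTasks(0);          // fetch-only phase, no reducer
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(Text.class);
        FileInputFormat.addInputPath(job, new Path(args[0]));   // seed URL list
        FileOutputFormat.setOutputPath(job, new Path(args[1])); // fetched pages
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}

Running the round map-only keeps fetching embarrassingly parallel; in a fuller crawler, link extraction, deduplication, and URL scheduling would typically follow as separate MapReduce jobs over the fetched output.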
Year: 2017
Page: 537-541
Language: English
SCOPUS Cited Count: 3
ESI Highly Cited Papers on the List: 0