Abstract:
In this article, an efficient and scalable distributed web crawler system based on Hadoop is designed and implemented. The paper first gives a brief introduction to the application of cloud computing in the web crawler field; it then, in view of the current state of crawler systems, presents a detailed design of a highly scalable crawler system that makes specific use of Hadoop's distributed and cloud computing features; finally, it evaluates the system with crawl statistics. Under identical conditions, comparison with an existing mature system clearly demonstrates the superiority of the distributed web crawler, an advantage that is especially important for crawling massive data in the big data era.
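The record does not include implementation details, but the Hadoop-based architecture described in the abstract can be illustrated with a minimal MapReduce crawl round. The sketch below is a hypothetical illustration, not the authors' code: the class names (CrawlRound, FetchMapper, DedupReducer), the link-extraction regex, and the seed/output paths are all assumptions. It shows only the general pattern of mappers fetching a partition of seed URLs in parallel and a reducer de-duplicating the discovered links for the next round.

```java
// Hypothetical sketch of one crawl round on Hadoop MapReduce (not the paper's implementation).
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.net.URL;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class CrawlRound {

    /** Each mapper fetches its slice of the seed URL list and emits outgoing links. */
    public static class FetchMapper extends Mapper<Object, Text, Text, Text> {
        private static final Pattern HREF = Pattern.compile("href=\"(http[^\"]+)\"");

        @Override
        protected void map(Object key, Text value, Context context)
                throws IOException, InterruptedException {
            String url = value.toString().trim();
            if (url.isEmpty()) return;
            try {
                // Download the page body.
                StringBuilder html = new StringBuilder();
                try (BufferedReader in = new BufferedReader(
                        new InputStreamReader(new URL(url).openStream()))) {
                    String line;
                    while ((line = in.readLine()) != null) html.append(line);
                }
                // Emit every discovered link, keyed by the link so the reducer can de-duplicate.
                Matcher m = HREF.matcher(html);
                while (m.find()) context.write(new Text(m.group(1)), new Text(url));
            } catch (Exception e) {
                // Unreachable or malformed URLs are simply skipped in this sketch.
            }
        }
    }

    /** Writes each discovered link once; the output file becomes the next round's seed list. */
    public static class DedupReducer extends Reducer<Text, Text, Text, Text> {
        @Override
        protected void reduce(Text link, Iterable<Text> sources, Context context)
                throws IOException, InterruptedException {
            context.write(link, new Text(""));
        }
    }

    public static void main(String[] args) throws Exception {
        Job job = Job.getInstance(new Configuration(), "crawl-round");
        job.setJarByClass(CrawlRound.class);
        job.setMapperClass(FetchMapper.class);
        job.setReducerClass(DedupReducer.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(Text.class);
        FileInputFormat.addInputPath(job, new Path(args[0]));   // seed URL list (one URL per line)
        FileOutputFormat.setOutputPath(job, new Path(args[1])); // discovered links for the next round
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```

Scalability in this pattern comes from HDFS splitting the seed list across mappers, so adding nodes increases fetch throughput; a production crawler would also need politeness delays, URL normalization, and persistent de-duplication, none of which are detailed in this record.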
Source:
2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA)
Year: 2017
Page: 537-541
Language: English
Cited Count:
WoS CC Cited Count: 2
ESI Highly Cited Papers on the List: 0