A method of classification-based spark job performance modeling - Details

Author：

Ding, Zhiyong (Ding, Zhiyong.) | Zhang, Chaohui (Zhang, Chaohui.)

Indexed by：

CPCI-S EI Scopus

Abstract：

Prediction　of　Apache　Spark　job　execution　time　is　a　key　technology　to　guide　Spark　cluster　resource　allocation　and　parameter　tuning.　In　the　existing　research,　a　unified　modeling　method　is　used　for　different　jobs,　and　the　prediction　model　considers　less　factors,　resulting　in　poor　prediction　effect.　In　view　of　the　above　problems,　this　paper　proposes　a　classification-based　Spark　job　performance　modeling　method.　The　method　first　selects　features　that　are　strongly　correlated　with　job　execution　time,　then　classifies　jobs　according　to　the　selected　features,　and　finally　uses　GBDT　algorithm　to　build　an　execution　time　prediction　model　for　each　class　of　jobs　classified.　The　experimental　results　show　that,　compared　with　the　method　using　unified　modeling,　the　method　proposed　in　this　paper　can　reduce　the　RMSE　and　MAPE　of　the　prediction　results　by　an　average　of　42.5%　and　51.1%.　©　2022　SPIE

Keyword：

Image processing Machine learning Forecasting

Author Community：

[ 1 ] [Ding, Zhiyong]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 2 ] [Zhang, Chaohui]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Webpage saliency prediction with multi-features fusion
2016，23rd IEEE International Conference on Image Processing, ICIP 2016
Software defect prediction via deep belief network
2019，Chinese Journal of Electronics
Research on smart home assistance control model based on machine learning
2020，2020 Asia-Pacific Conference on Image Processing, Electronics and Computers, IPEC 2020
A DDoS Attack Detection Method Based on Machine Learning
2019，2019 4th International Conference on Intelligent Computing and Signal Processing, ICSP 2019

Source ：

ISSN： 0277-786X

Year： 2022

Volume： 12259

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 6

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to