On the optimization of Hadoop MapReduce default job scheduling through dynamic job prioritization
Author(s):
Article Type:
Research/Original Article (دارای رتبه معتبر)
Abstract:
One of the most popular frameworks for big data processing is Apache Hadoop MapReduce. The default Hadoop scheduler uses queue system. However, it does not consider any specific priority for the jobs required for MapReduce programming model. In this paper, a new dynamic score is developed to improve the performance of the default Hadoop MapReduce scheduler. This dynamic priority score is computed based on effective factors such as job runtime estimation, input data size, waiting time, and length or bustle of the waiting queue. The implementation of the proposed scheduling method, based on this dynamic score, not only improves CPU and memory performance, but also reduced waiting time and average turnaround time by approximately $45%$ and $40%$ respectively, compared to the default Hadoop scheduler.
Keywords:
Language:
English
Published:
Journal of Algorithms and Computation, Volume:52 Issue: 2, Dec 2020
Pages:
109 to 126
https://www.magiran.com/p2230906