A learning-based MapReduce scheduler in heterogeneous environments

dc.contributor.author Naik, Nenavath Srinivas
dc.contributor.author Negi, Atul
dc.date.accessioned 2022-03-27T05:52:50Z
dc.date.available 2022-03-27T05:52:50Z
dc.date.issued 2017-11-30
dc.description.abstract MapReduce is an essential framework, proposed in recent times, for distributed storage and parallel processing of large-scale, data-intensive jobs. The default Hadoop scheduler assumes a homogeneous environment. This assumption does not always hold in practice and limits the performance of MapReduce. In heterogeneous environments, job completion times do not synchronize. Data locality essentially means moving computation closer to the input data for faster access. Fundamentally, MapReduce does not always consider heterogeneity from a data locality perspective. Improving data locality for the MapReduce framework is an important issue for enhancing the performance of heterogeneous Hadoop clusters. Learning-based scheduling decisions can potentially help to significantly reduce the overall job execution time. In this paper, we provide an overview of the taxonomy for MapReduce schedulers. This paper proposes a novel hybrid scheduler using a reinforcement-learning-based approach. The proposed scheduler identifies the true straggler tasks and schedules them on fast processing nodes in a heterogeneous Hadoop cluster while taking data locality into account.
dc.identifier.citation 2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017. v.2017-January
dc.identifier.uri 10.1109/ICACCI.2017.8126142
dc.identifier.uri http://ieeexplore.ieee.org/document/8126142/
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/8568
dc.subject Data Locality
dc.subject Heterogeneous environment
dc.subject MapReduce
dc.subject Reinforcement learning
dc.subject Stragglers
dc.title A learning-based MapReduce scheduler in heterogeneous environments
dc.type Conference Proceeding. Conference Paper