NEWT - A resilient BSP framework for Iterative algorithms on hadoop YARN

dc.contributor.author Kromonov, Ilja
dc.contributor.author Jakovits, Pelle
dc.contributor.author Srirama, Satish Narayana
dc.date.accessioned 2022-03-27T00:16:29Z
dc.date.available 2022-03-27T00:16:29Z
dc.date.issued 2014-09-18
dc.description.abstract The importance of fault tolerance for parallel computing is ever increasing. The mean time between failures (MTBF) is predicted to decrease significantly for future highly parallel systems. At the same time, the current trend to use commodity hardware to reduce the cost of clusters puts pressure on users to ensure fault tolerance of their applications. Cloud-based resources are one of the environments where the latter holds true. When it comes to embarrassingly parallel data-intensive algorithms, MapReduce has gone a long way in ensuring users can easily utilize these resources without the fear of losing work. However, this does not apply to iterative communication-intensive algorithms common in the scientific computing domain. In this work we propose a new programming model inspired by Bulk Synchronous Parallel (BSP), for creating a new fault tolerant distributed computing framework. We strive to retain the advantages that MapReduce provides, yet efficiently support a larger assortment of algorithms, such as the aforementioned iterative ones. The model adopts an approach similar to continuation passing for implementing parallel algorithms and facilitates fault tolerance inherent in the BSP program structure. Based on the model we created a distributed computing framework - NEWT, which we describe and use to validate the approach.
dc.identifier.citation Proceedings of the 2014 International Conference on High Performance Computing and Simulation, HPCS 2014
dc.identifier.uri 10.1109/HPCSim.2014.6903693
dc.identifier.uri https://ieeexplore.ieee.org/document/6903693
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/3147
dc.subject Bulk Synchronous Parallel
dc.subject cloud computing
dc.subject fault tolerance
dc.subject Hadoop YARN
dc.subject iterative algorithms
dc.title NEWT - A resilient BSP framework for Iterative algorithms on hadoop YARN
dc.type Conference Proceeding. Conference Paper
dspace.entity.type
Files
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: