Adapting scientific computing problems to clouds using MapReduce

dc.contributor.author Srirama, Satish Narayana
dc.contributor.author Jakovits, Pelle
dc.contributor.author Vainikko, Eero
dc.date.accessioned 2022-03-27T06:06:16Z
dc.date.available 2022-03-27T06:06:16Z
dc.date.issued 2012-01-01
dc.description.abstract Cloud computing, with its promise of virtually infinite resources, seems to suit well in solving resource greedy scientific computing problems. To study this, we established a scientific computing cloud (SciCloud) project and environment on our internal clusters. The main goal of the project is to study the scope of establishing private clouds at the universities. With these clouds, students and researchers can efficiently use the already existing resources of university computer networks, in solving computationally intensive scientific, mathematical, and academic problems. However, to be able to run the scientific computing applications on the cloud infrastructure, the applications must be reduced to frameworks that can successfully exploit the cloud resources, like the MapReduce framework. This paper summarizes the challenges associated with reducing iterative algorithms to the MapReduce model. Algorithms used by scientific computing are divided into different classes by how they can be adapted to the MapReduce model; examples from each such class are reduced to the MapReduce model and their performance is measured and analyzed. The study mainly focuses on the Hadoop MapReduce framework but also compares it to an alternative MapReduce framework called Twister, which is specifically designed for iterative algorithms. The analysis shows that Hadoop MapReduce has significant trouble with iterative problems while it suits well for embarrassingly parallel problems, and that Twister can handle iterative problems much more efficiently. This work shows how to adapt algorithms from each class into the MapReduce model, what affects the efficiency and scalability of algorithms in each class and allows us to judge which framework is more efficient for each of them, by mapping the advantages and disadvantages of the two frameworks. This study is of significant importance for scientific computing as it often uses complex iterative methods to solve critical problems and adapting such methods to cloud computing frameworks is not a trivial task. © 2010 Elsevier B.V. All rights reserved.
dc.identifier.citation Future Generation Computer Systems. v.28(1)
dc.identifier.issn 0167739X
dc.identifier.uri 10.1016/j.future.2011.05.025
dc.identifier.uri https://www.sciencedirect.com/science/article/abs/pii/S0167739X11001075
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/9338
dc.subject Cloud computing
dc.subject Hadoop
dc.subject Iterative algorithm
dc.subject MapReduce
dc.subject Scientific computing
dc.subject Twister
dc.title Adapting scientific computing problems to clouds using MapReduce
dc.type Journal. Conference Paper
dspace.entity.type
Files
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: