Adapting scientific computing problems to clouds using MapReduce

Srirama, Satish Narayana; Jakovits, Pelle; Vainikko, Eero

Adapting scientific computing problems to clouds using MapReduce

dc.contributor.author	Srirama, Satish Narayana
dc.contributor.author	Jakovits, Pelle
dc.contributor.author	Vainikko, Eero
dc.date.accessioned	2022-03-27T06:06:16Z
dc.date.available	2022-03-27T06:06:16Z
dc.date.issued	2012-01-01
dc.description.abstract	Cloud computing, with its promise of virtually infinite resources, seems to suit well in solving resource greedy scientific computing problems. To study this, we established a scientific computing cloud (SciCloud) project and environment on our internal clusters. The main goal of the project is to study the scope of establishing private clouds at the universities. With these clouds, students and researchers can efficiently use the already existing resources of university computer networks, in solving computationally intensive scientific, mathematical, and academic problems. However, to be able to run the scientific computing applications on the cloud infrastructure, the applications must be reduced to frameworks that can successfully exploit the cloud resources, like the MapReduce framework. This paper summarizes the challenges associated with reducing iterative algorithms to the MapReduce model. Algorithms used by scientific computing are divided into different classes by how they can be adapted to the MapReduce model; examples from each such class are reduced to the MapReduce model and their performance is measured and analyzed. The study mainly focuses on the Hadoop MapReduce framework but also compares it to an alternative MapReduce framework called Twister, which is specifically designed for iterative algorithms. The analysis shows that Hadoop MapReduce has significant trouble with iterative problems while it suits well for embarrassingly parallel problems, and that Twister can handle iterative problems much more efficiently. This work shows how to adapt algorithms from each class into the MapReduce model, what affects the efficiency and scalability of algorithms in each class and allows us to judge which framework is more efficient for each of them, by mapping the advantages and disadvantages of the two frameworks. This study is of significant importance for scientific computing as it often uses complex iterative methods to solve critical problems and adapting such methods to cloud computing frameworks is not a trivial task. © 2010 Elsevier B.V. All rights reserved.
dc.identifier.citation	Future Generation Computer Systems. v.28(1)
dc.identifier.issn	0167739X
dc.identifier.uri	10.1016/j.future.2011.05.025
dc.identifier.uri	https://www.sciencedirect.com/science/article/abs/pii/S0167739X11001075
dc.identifier.uri	https://dspace.uohyd.ac.in/handle/1/9338
dc.subject	Cloud computing
dc.subject	Hadoop
dc.subject	Iterative algorithm
dc.subject	MapReduce
dc.subject	Scientific computing
dc.subject	Twister
dc.title	Adapting scientific computing problems to clouds using MapReduce
dc.type	Journal. Conference Paper
dspace.entity.type

Files

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Plain Text
Description:

Download

Collections

Computer and Information Sciences - Publications