Open Access Repository

Cost efficient scheduling of MapReduce applications on public clouds


Downloads per month over past year

Zeng, X, Garg, SK ORCID: 0000-0003-3510-2464, Wen, Z, Strazdins, P, Zomaya, AY and Ranjan, R 2017 , 'Cost efficient scheduling of MapReduce applications on public clouds' , Journal of Computational Science, vol. 26 , pp. 375-388 , doi: 10.1016/j.jocs.2017.07.017.

PDF (Accepted manuscript)
1-s2...pdf | Download (1MB)

| Preview


MapReduce framework has been one of the most prominent ways for efficient processing large amount of data requiring huge computational capacity. On-demand computing resources of Public Clouds have become a natural host for these MapReduce applications. However, the decision of what type and in what amount computing and storage resources should be rented is still a user’s responsibility. This is not a trivial task particularly when users may have performance constraints such as deadline and have several Cloud product types to choose with the intention of not spending much money. Even though there are several existing scheduling systems, however, most of them are not developed to manage the scheduling of MapReduce applications. That is, they do not consider things such as number of map and reduce tasks that are needed to be scheduled and heterogeneity of Virtual Machines (VMs) available. This paper proposes a novel greedy-based MapReduce application scheduling algorithm (MASA) that considers the user’s constraints in order to minimize cost of renting Cloud resources while considering Service Level Agreements (SLA) in terms of the user given budget and deadline constraints. The simulation results show that MASA can achieve 25-50% cost reduction in comparison to current SLA agnostic methods and there is only 10% performance disparity between MASA and an exhaustive search algorithm.

Item Type: Article
Authors/Creators:Zeng, X and Garg, SK and Wen, Z and Strazdins, P and Zomaya, AY and Ranjan, R
Keywords: cloud computing, big data, map reduce, service level agreement, scheduling, cross layer
Journal or Publication Title: Journal of Computational Science
Publisher: Elsevier Sci Ltd
ISSN: 1877-7503
DOI / ID Number: 10.1016/j.jocs.2017.07.017
Copyright Information:

2017 Elsevier B.V.

Related URLs:
Item Statistics: View statistics for this item

Actions (login required)

Item Control Page Item Control Page