Data-Oriented Scheduling with Dynamic-Clustering Fault-Tolerant Technique for Scientific Workflows in Clouds



Abstract

Cloud computing is one of the most prominent parallel and distributed computing paradigms. It provides solutions for a large number of scientific and business applications. Large-scale scientific applications structured as scientific workflows are executed through cloud computing. Scientific workflows are data-intensive applications, as a single scientific workflow may consist of hundreds of thousands of tasks. Task failures, deadline constraints, budget constraints, and improper management of tasks can disrupt their execution. Therefore, combining fault-tolerant techniques with data-oriented scheduling is an important approach for executing scientific workflows in cloud computing. Accordingly, we present enhanced data-oriented scheduling with a Dynamic-clustering fault-tolerant technique (EDS-DC) for executing scientific workflows in cloud computing. EDS-DC uses data-oriented scheduling as its scheduling technique and Dynamic-clustering as its fault-tolerant technique. To evaluate the effectiveness of EDS-DC, we compared its results with three well-known enhanced heuristic scheduling policies: (a) MCT-DC, (b) Max-min-DC, and (c) Min-min-DC. We used the CyberShake scientific workflow as a case study because it exhibits most of the characteristics of scientific workflows, such as integration, disintegration, parallelism, and pipelining. The results show that EDS-DC reduced make-span by 10.9% compared to MCT-DC, 13.7% compared to Max-min-DC, and 6.4% compared to Min-min-DC. Similarly, EDS-DC reduced cost by 4% compared to MCT-DC, 5.6% compared to Max-min-DC, and 1.5% compared to Min-min-DC. These make-span and cost results are highly significant for EDS-DC relative to the three scheduling policies above. The SLA is not violated for EDS-DC with respect to time and cost constraints, while it is violated a number of times for MCT-DC, Max-min-DC, and Min-min-DC.
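
For readers unfamiliar with the baseline policies named above, the following is a minimal sketch of the textbook MCT, Min-min, and Max-min heuristics and of how make-span is measured. It is not the authors' EDS-DC and it leaves out the Dynamic-clustering fault-tolerant technique. It assumes independent tasks with no workflow dependencies or data-transfer cost, and the function name `schedule` and the `etc` matrix of expected run times are hypothetical, introduced only for illustration.

```python
from typing import Dict, List, Tuple


def schedule(etc: List[List[float]], policy: str) -> Tuple[Dict[int, int], float]:
    """Map independent tasks to VMs and return (assignment, make-span).

    etc[t][v] is the assumed run time of task t on VM v.
    policy is one of "mct", "min-min", "max-min".
    """
    if policy not in ("mct", "min-min", "max-min"):
        raise ValueError(f"unknown policy: {policy}")

    n_vms = len(etc[0])
    ready = [0.0] * n_vms                 # time at which each VM becomes free
    unscheduled = list(range(len(etc)))
    assignment: Dict[int, int] = {}

    def best_vm(task: int) -> int:
        # VM that gives this task its earliest completion time
        return min(range(n_vms), key=lambda v: ready[v] + etc[task][v])

    while unscheduled:
        if policy == "mct":
            # MCT: take tasks in arrival order
            task = unscheduled[0]
        else:
            # Earliest completion time of every remaining task
            ct = {t: ready[best_vm(t)] + etc[t][best_vm(t)] for t in unscheduled}
            # Min-min picks the smallest of these, Max-min the largest
            pick = min if policy == "min-min" else max
            task = pick(unscheduled, key=lambda t: ct[t])
        vm = best_vm(task)
        ready[vm] += etc[task][vm]
        assignment[task] = vm
        unscheduled.remove(task)

    # Make-span is the time at which the last VM finishes
    return assignment, max(ready)


# Toy input: three tasks of length 2, 3, and 7 on two identical VMs
toy_etc = [[2, 2], [3, 3], [7, 7]]
for name in ("mct", "min-min", "max-min"):
    print(name, schedule(toy_etc, name)[1])
```

On this toy input Max-min places the longest task first and finishes at time 7, while MCT and Min-min finish at time 9. The example only illustrates how the ordering rule affects make-span and says nothing about the relative results reported in the abstract.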

About the authors

Z. Ahmad

Department of Information Technology, Hazara University

Author for correspondence.
Email: zulfiqarahmad@hu.edu.pk
Pakistan, Front of Multipurpose Hall, Dhodial, Mansehra, Khyber Pakhtunkhwa, 21120

A. I. Jehangiri

Department of Information Technology, Hazara University

Author for correspondence.
Email: ali_imran@hu.edu.pk
Pakistan, Front of Multipurpose Hall, Dhodial, Mansehra, Khyber Pakhtunkhwa, 21120

M. Iftikhar

Department of Information Technology, Hazara University

Author for correspondence.
Email: mehreeniftikhar@hu.edu.pk
Pakistan, Front of Multipurpose Hall, Dhodial, Mansehra, Khyber Pakhtunkhwa, 21120

A. I. Umer

Department of Information Technology, Hazara University

Author for correspondence.
Email: arifiqbalumar@yahoo.com
Pakistan, Front of Multipurpose Hall, Dhodial, Mansehra, Khyber Pakhtunkhwa, 21120

I. Afzal

Department of Information Technology, Hazara University

Author for correspondence.
Email: ibrar@hu.edu.pk
Pakistan, Front of Multipurpose Hall, Dhodial, Mansehra, Khyber Pakhtunkhwa, 21120


Copyright (c) 2019 Pleiades Publishing, Ltd.