Re-provisioning of cloud-based execution infrastructure using the cloud-aware provenance to facilitate scientific workflow execution reproducibility

Khawar, Hasham; Munir, Kamran; McClatchey, Richard; Shamdasani, Jetendr

Authors

Hasham Khawar

Jetendr Shamdasani



Contributors

M Helfert
Editor

V. M. Munoz
Editor

D Ferguson
Editor

Abstract

Provenance has been considered as a means to achieve scientific workflow reproducibility to verify the workflow processes and results. Cloud computing provides a new computing paradigm for the workflow execution by offering a dynamic and scalable environment with on-demand resource provisioning. In the absence of Cloud infrastructure information, achieving workflow reproducibility on the Cloud becomes a challenge. This paper presents a framework, named ReCAP, to capture the Cloud infrastructure information and to interlink it with the workflow provenance to establish the Cloud-Aware Provenance (CAP). This paper identifies different scenarios of using the Cloud for workflow execution and presents different mapping approaches. The reproducibility of the workflow execution is performed by re-provisioning the similar Cloud resources using CAP and re-executing the workflow; and by comparing the outputs of workflows. Finally, this paper also presents the evaluation of ReCAP in terms of captured provenance, workflow execution time and workflow output comparison.

Citation

Khawar, H., Munir, K., McClatchey, R., & Shamdasani, J. (2016). Re-provisioning of cloud-based execution infrastructure using the cloud-aware provenance to facilitate scientific workflow execution reproducibility. In M. Helfert, V. M. Munoz, & D. Ferguson (Eds.), Cloud Computing and Services Science, 74-94. Springer.

Publication Date: Jan 1, 2016
Peer Reviewed: Peer Reviewed
Volume: 581
Pages: 74-94
Series Title: Communications in Computer and Information Science
Book Title: Cloud Computing and Services Science
ISBN: 9783319295817
Keywords: cloud computing, scientific workflows, cloud infrastructure, provenance, reproducibility, repeatability
Publisher URL: http://dx.doi.org/10.1007/978-3-319-29582-4_5
Additional Information: The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-29582-4_5
