Genome Analysis in a Dynamically Scaled Hybrid Cloud

Smowton, C., Copil, G., Truong, H.-L., Miller, C. and Xing, W. (2015) Genome Analysis in a Dynamically Scaled Hybrid Cloud. In: 2015 IEEE 11th International Conference on e-Science, Munich, Germany, 31 Aug - 4 Sept 2015, pp. 391-400. ISBN 9781467393256 (doi: 10.1109/eScience.2015.17)

Full text not currently available from Enlighten.

Abstract

In this paper, we explore the benefits of automatically determining the degree of parallelism used to perform genetic mutation calling in a hybrid cloud environment. We propose algorithms to automatically control both the hiring of hybrid cloud resources and the selection of the degree of parallelism employed in analysis tasks executed against that cloud. Using the Broad Institute's Genome Analysis Toolkit as a case study, we then conduct profile-driven simulation studies to characterise the circumstances in which our algorithms are beneficial or deleterious compared to simple, conventional baseline algorithms. We find that there are a wide range of cloud workload scenarios where our algorithms outperform the baselines, and thereby argue that automatic control of cloud scaling and task parallelism, using techniques like those proposed, are likely to be beneficially applicable to real-world biocomputing.

Item Type:Conference Proceedings
Status:Published
Refereed:Yes
Glasgow Author(s) Enlighten ID:Miller, Professor Crispin
Authors: Smowton, C., Copil, G., Truong, H.-L., Miller, C., and Xing, W.
College/School:College of Medical Veterinary and Life Sciences > School of Cancer Sciences
ISBN:9781467393256

University Staff: Request a correction | Enlighten Editors: Update this record