CoBell: Runtime Prediction for Distributed Dataflow Jobs in Shared Clusters

Verbitskiy, I., Thamsen, L., Renner, T. and Kao, O. (2018) CoBell: Runtime Prediction for Distributed Dataflow Jobs in Shared Clusters. In: 2018 IEEE International Conference on Cloud Computing Technology and Science (CloudCom), Nicosia, Cyprus, 10-13 Dec 2018, pp. 81-88. ISBN 9781538678992 (doi: 10.1109/CloudCom2018.2018.00029)

[img] Text
268141.pdf - Accepted Version
Restricted to Repository staff only

336kB

Abstract

Distributed dataflow systems have been developed to help users analyze and process large datasets. While they make it easier for users to develop massively-parallel programs, users still have to choose the amount of resources for the execution of their jobs. Yet, users do not necessarily understand workload and system dynamics, while they often have constraints like runtime targets and budgets. Addressing this problem, systems have been developed that automatically select the required amount of resources to fulfill the users' constraints. However, interference with co-located workloads can introduce a significant variance into the runtimes of jobs and make accurate runtime prediction harder. This paper presents CoBell, a resource allocation system that incorporates information about co-located workloads to improve the runtime prediction for jobs in shared clusters. CoBell receives jobs from users with runtime and scale-out constraints and then reserves resources based on predicted runtimes. We implemented CoBell as a job submission tool for YARN. As such, it works with existing YARN cluster setups. The paper evaluates CoBell using five different distributed dataflow jobs, showing that using CoBell results in runtimes that do not violate the runtime constraints by more than 7.2%.

Item Type:Conference Proceedings
Status:Published
Refereed:Yes
Glasgow Author(s) Enlighten ID:Thamsen, Dr Lauritz
Authors: Verbitskiy, I., Thamsen, L., Renner, T., and Kao, O.
College/School:College of Science and Engineering > School of Computing Science
Publisher:IEEE
ISBN:9781538678992

University Staff: Request a correction | Enlighten Editors: Update this record