Continuously Improving the Resource Utilization of Iterative Parallel Dataflows

Thamsen, L., Renner, T. and Kao, O. (2016) Continuously Improving the Resource Utilization of Iterative Parallel Dataflows. In: 2016 IEEE 36th International Conference on Distributed Computing Systems Workshops (ICDCSW), Nara, Japan, 27-30 Jun 2016, pp. 1-6. ISBN 9781509036868 (doi: 10.1109/ICDCSW.2016.20)

[img] Text
268131.pdf - Accepted Version
Restricted to Repository staff only

250kB

Abstract

Parallel dataflow systems like Apache Flink allow analysis of large datasets with iterative programs. However, allocating a cost-effective set of resources for such jobs is a difficult task as the resource utilization depends on many factors such as dataset size, key value distributions, computational complexity of programs, and the underlying hardware. What's more, some of these factors are not well known before the execution. There are, for example, often no data statistics such as key value distributions available beforehand. For this reason, we propose to improve the resource utilization at runtime using the repetitive nature of iterative dataflow programs. Based on runtime statistics gathered in previous iterations, the resource allocation is adapted dynamically at the synchronization barriers between iterations. This approach has two advantages: First, at barriers detailed statistics can be available, even for parallelly executed task pipelines. Second, at barriers dataflows can be adapted without complex handling of intermediate task state. This paper presents a prototype integrated with Apache Flink and an evaluation on a cluster with 480 cores. One experiment shows a 57% reduction of the job runtime by allocating more resources for a shorter time, another experiment a release of up to 40% surplus resources without significantly extending the job runtime.

Item Type:Conference Proceedings
Additional Information:Funding: This work has been supported through grants by the German Science Foundation (DFG) as FOR 1306 Stratosphere and by the German Ministry for Education and Research (BMBF) as Berlin Big Data Center BBDC (funding mark 01IS14013A).
Status:Published
Refereed:Yes
Glasgow Author(s) Enlighten ID:Thamsen, Dr Lauritz
Authors: Thamsen, L., Renner, T., and Kao, O.
College/School:College of Science and Engineering > School of Computing Science
Publisher:IEEE
ISBN:9781509036868

University Staff: Request a correction | Enlighten Editors: Update this record