Predicting performance of non-contiguous I/O with machine learning

Download

[thumbnail of pponiwmlkb15-predicting_performance_of_non_contiguous_i_o_with_machine_learning.pdf]

Preview

Text
- Accepted Version

Advice

Please see our End User Agreement.

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

Tools

Lists

Kunkel, J., Zimmer, M. and Betke, E. (2015) Predicting performance of non-contiguous I/O with machine learning. In: Kunkel, J. M. and Ludwig, T. (eds.) High Performance Computing. Lecture Notes in Computer Science (9137). Springer, pp. 257-273. ISBN 9783319201184 doi: 10.1007/978-3-319-20119-1_19

Abstract/Summary

Data sieving in ROMIO promises to optimize individual non-contiguous I/O. However, making the right choice and parameterizing its buffer size accordingly are non-trivial tasks, since predicting the resulting performance is difficult. Since many performance factors are not taken into account by data sieving, extracting the optimal performance for a given access pattern and system is often not possible. Additionally, in Lustre, settings such as the stripe size and number of servers are tunable, yet again, identifying rules for the data-centre proves challenging indeed. In this paper, we (1) discuss limitations of data sieving, (2) apply machine learning techniques to build a performance predictor, and (3) learn and extract best practices for the settings from the data. We used decision trees as these models can capture non-linear behavior, are easy to understand and allow for extraction of the rules used. Even though this initial research is based on decision trees, with sparse training data, the algorithm can predict many cases sufficiently. Compared to a standard setting, the decision trees created are able to improve performance significantly and we can derive expert knowledge by extracting rules from the learned tree. Applying the scheme to a set of experimental data improved the average throughput by 25–50 % of the best parametrization’s gain. Additionally, we demonstrate the versatility of this approach by applying it to the porting system of DKRZ’s next generation supercomputer and discuss achievable performance gains.

Altmetric Badge

Item Type	Book or Report Section
URI	https://reading-clone.eprints-hosting.org/id/eprint/77678
Identification Number/DOI	10.1007/978-3-319-20119-1_19
Refereed	Yes
Divisions	Science > School of Mathematical, Physical and Computational Sciences > Department of Computer Science
Publisher	Springer
Download/View statistics	View download statistics for this item

Download Statistics

Downloads

Downloads per month over past year

Deposit Details

Date Deposited:	20 Jun 2018 10:07	Date item deposited into CentAUR
Last Modified:	30 Jun 2024 01:25	Date item last modified

University Staff: Request a correction | Centaur Editors: Update this record

Search Google Scholar