Alexandru Calotoiu and Markus Geisenhofer and Florian Kummer and Marcus Ritter and Jens Weber and Torsten Hoefler and Martin Oberlack and Felix Wolf:
Empirical Modeling of Spatially Diverging Performance
(In 2020 IEEE/ACM International Workshop on HPC User Support Tools (HUST) and Workshop on Programming and Performance Visualization, Nov. 2020)
Abstract
A common simplification made when modeling the
performance of a parallel program is the assumption that the
performance behavior of all processes or threads is largely
uniform. Empirical performance-modeling tools such as Extra-P
exploit this common pattern to make their modeling process more
noise resilient, mitigating the effect of outliers by summarizing
performance measurements of individual functions across all
processes. While the underlying assumption does not equally
hold for all applications, knowing the qualitative differences in
how the performance of individual processes changes as execution parameters are varied can reveal important performance
bottlenecks such as malicious patterns of load imbalance. A
challenge for empirical modeling tools, however, arises from
the fact that the behavioral class of a process may depend on
the process configuration, letting process ranks migrate between
classes as the number of processes grows. In this paper, we
introduce a novel approach to the problem of modeling
spatially diverging performance based on a certain type of process
clustering. We apply our technique to identify a previously
unknown performance bottleneck in the BoSSS fluid-dynamics
code. Removing it made the code regions in question run up
to 20 times and the application as a whole up to 4.5 times
faster.
BibTeX
@inproceedings{,
  author={Alexandru Calotoiu and Markus Geisenhofer and Florian Kummer and Marcus Ritter and Jens Weber and Torsten Hoefler and Martin Oberlack and Felix Wolf},
  title={{Empirical Modeling of Spatially Diverging Performance}},
  year={2020},
  month={Nov.},
  booktitle={2020 IEEE/ACM International Workshop on HPC User Support Tools (HUST) and Workshop on Programming and Performance Visualization},
  source={http://www.unixer.de/~htor/publications/},
}