Close

Presentation

Leveraging the Compute Power of Two HPC Systems for Higher-Dimensional Grid-Based Simulations with the Widely-Distributed Sparse Grid Combination Technique
DescriptionThis paper presents the core concepts of the widely-distributed combination technique, which allows us to use the compute power and memory of more than one HPC system for the same simulation. We apply the sparse-grid combination technique to a six-dimensional advection problem serving as a proxy for plasma simulations. The full-grid solution approximated by the combination technique would contain ≈5ZB if computed with conventional grid-based methods. The combination-technique simulation operates on ≈988GB plus the supporting sparse grid data structures. We propose a new approach to divide the compute load, requiring only 76GB to be exchanged. Based on this, we have realized the first synchronous grid-based simulation using two HPC systems, the Tier-0 supercomputers Hawk and SuperMUC-NG. The simulation is computed at an average overhead of ≈35% (108s per combination step) for file-I/O and transfer. The presented concepts apply to any pair of HPC systems if high-speed data transfer is possible.
Event Type
Paper
TimeThursday, 16 November 20232:30pm - 3pm MST
Location403-404
Tags
Algorithms
Cloud Computing
Distributed Computing
Heterogeneous Computing
Large Scale Systems
State of the Practice
Registration Categories
TP
Reproducibility Badges