Detail výsledku

Accelerating Hybrid Local Domain Decomposition for the k-Wave Toolbox on Multi-GPU Systems

KUNÍK, O.; JAROŠ, J. Accelerating Hybrid Local Domain Decomposition for the k-Wave Toolbox on Multi-GPU Systems. Ostrava: 2024. 1 p.
Typ
prezentace, poster
Jazyk
angličtina
Autoři
Abstrakt

The k-Wave toolbox is designed for high-fidelity acoustic wave simulations using Fourier collocation for spatial derivatives, but its performance is constrained by communication overhead on multi-CPU and multi-GPU systems. We present a hybrid local domain decomposition approach that partitions the simulation domain into subdomains, each assigned to a GPU with configurable resolution. Using CUDA and cuFFT for Fourier transforms and NVLink for halo exchanges, our method minimizes inter-subdomain communication and accelerates multi-GPU performance. Testing on the Karolina supercomputer shows strong scalability and accuracy, especially with uniform-resolution subdomains, and proves effective even for large-scale simulations beyond single-GPU memory limits.

Klíčová slova

k-Wave, HPC, Hybrid decomposition, Local decomposition, CUDA, Multi-GPU

Rok
2024
Strany
1
Konference
8th Users' Conference of IT4Innovations
Místo
Ostrava
BibTeX
@misc{BUT193367,
  author="Oliver {Kuník} and Jiří {Jaroš}",
  title="Accelerating Hybrid Local Domain Decomposition for the k-Wave Toolbox on Multi-GPU Systems",
  year="2024",
  pages="1",
  address="Ostrava",
  url="https://www.fit.vut.cz/research/publication/13293/"
}
Soubory
Projekty
Application-specific HW/SW architectures and their applications, VUT, Vnitřní projekty VUT, FIT-S-23-8141, zahájení: 2023-03-01, ukončení: 2026-02-28, řešení
Personalizovaná transkraniální ultrazvuková stimule řízená MRI snímkováním, EU, HORIZON EUROPE, 101071008, zahájení: 2022-08-01, ukončení: 2026-07-31, řešení
Výzkumné skupiny
Pracoviště
Nahoru