EUDML

Performance of parallel QR factorization methods on the NVIDIA Grace CPU Superchip

Břichňáč, Vít; Šístek, Jakub — 2025

Programs and Algorithms of Numerical Mathematics

This article studies several algorithms for QR factorization based on hierarchical Householder reflectors organized into elimination trees, which are particularly suited to tall-and-skinny matrices and allow parallelization. We examine the effect of various parameters on the performance of the tree-based algorithms. The work is accompanied by a custom implementation that uses a task-based runtime system (OpenMP or StarPU). The same algorithm is implemented in the PLASMA library. The performance...
