Lists (10)
Sort Name ascending (A-Z)
Stars
A library of parallel sparse preconditionersfor PSBLAS
A library of parallel sparse linear algebra on high performance computer.
PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core heterogeneous architectures. PaRSEC assigns computation thre…
Parrot is an array fusion GPU library built on NVIDIA's CCCL libaries (Thrust/CUB).
Reproducible floating-point summations
This is a set of simple programs that can be used to explore the features of a parallel platform.
SYCL Academy, a set of learning materials for SYCL heterogeneous programming
Generic SYCL kernels for oneMath library
NVIDIA curated collection of educational resources related to general purpose GPU programming.
Vim plugin that shows the context of the currently visible buffer contents
Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous progra…
The entrance repository of Markdown presentation ecosystem
Templight is a Clang-based tool to profile the time and memory consumption of template instantiations and to perform interactive debugging sessions to gain introspection into the template instantia…
Simple Python script that simplifies C++ compiler errors. Useful when using heavily-templated libraries.
aider is AI pair programming in your terminal
Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner
CUDA/HIP header-only library for low-precision (16 bit, 8 bit) and vectorized GPU kernel development
C/C++ pre-commit hooks powered by clang-format and clang-tidy
A nearly complete collection of prefix sum algorithms implemented in CUDA, D3D12, Unity and WGPU. Theoretically portable to all wave/warp/subgroup sizes.