Skip to content
View pauleonix's full-sized avatar

Block or report pauleonix

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PCG — C++ Implementation

C++ 839 105 Updated May 17, 2024

A library of parallel sparse preconditionersfor PSBLAS

Fortran 14 5 Updated May 5, 2026

A library of parallel sparse linear algebra on high performance computer.

Fortran 65 18 Updated May 5, 2026
Cuda 57 4 Updated Feb 24, 2026

PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core heterogeneous architectures. PaRSEC assigns computation thre…

C 76 21 Updated Feb 9, 2026

Parrot is an array fusion GPU library built on NVIDIA's CCCL libaries (Thrust/CUB).

Cuda 273 18 Updated Apr 23, 2026

Reproducible floating-point summations

C++ 4 2 Updated Dec 10, 2022

This is a set of simple programs that can be used to explore the features of a parallel platform.

C 473 119 Updated Jan 27, 2026

SYCL Academy, a set of learning materials for SYCL heterogeneous programming

HTML 528 116 Updated Feb 13, 2026

C++ HPC Tutorial materials

C++ 54 17 Updated Oct 23, 2025

Generic SYCL kernels for oneMath library

C++ 9 5 Updated Nov 1, 2025

pocl - Portable Computing Language

C 1,062 289 Updated May 4, 2026

Super-parallel Python port of the C-Reduce

Rust 330 36 Updated Apr 18, 2026

NVIDIA curated collection of educational resources related to general purpose GPU programming.

Jupyter Notebook 1,575 270 Updated Apr 30, 2026

ASEArch BLAS (DTRSV for GPUs)

Cuda 2 Updated Sep 1, 2016

Sparse Parallel Robust Algorithms Library

Fortran 140 30 Updated Mar 10, 2026

Vim plugin that shows the context of the currently visible buffer contents

Vim Script 1,372 30 Updated Feb 20, 2026

Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous progra…

C++ 1,841 217 Updated May 3, 2026

The entrance repository of Markdown presentation ecosystem

TypeScript 11,635 268 Updated May 1, 2026

BLAS-like Library Instantiation Software Framework

C 2,635 417 Updated Nov 11, 2025

Templight is a Clang-based tool to profile the time and memory consumption of template instantiations and to perform interactive debugging sessions to gain introspection into the template instantia…

C++ 792 43 Updated Dec 7, 2024

Simple Python script that simplifies C++ compiler errors. Useful when using heavily-templated libraries.

Python 215 11 Updated Jan 30, 2020

aider is AI pair programming in your terminal

Python 44,416 4,362 Updated Apr 25, 2026
Fortran 112 44 Updated May 1, 2026

Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner

C++ 21 2 Updated Sep 12, 2025

CUDA/HIP header-only library for low-precision (16 bit, 8 bit) and vectorized GPU kernel development

C++ 23 3 Updated May 5, 2026

High-level C++ for Accelerator Clusters

C++ 155 18 Updated May 5, 2026

C/C++ pre-commit hooks powered by clang-format and clang-tidy

Python 40 5 Updated May 4, 2026
Python 4 Updated May 4, 2026

A nearly complete collection of prefix sum algorithms implemented in CUDA, D3D12, Unity and WGPU. Theoretically portable to all wave/warp/subgroup sizes.

C++ 292 11 Updated Jan 29, 2025
Next