A General-purpose Task-parallel Programming System using Modern C++
- 
 Updated
 Oct 23, 2025 
- C++
A General-purpose Task-parallel Programming System using Modern C++
Build, Manage and Deploy AI/ML Systems
High-performance TensorFlow library for quantitative finance.
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
高性能并行编程与优化 - 课件
Training and serving large-scale neural networks with auto parallelization.
BS::thread_pool: a fast, lightweight, modern, and easy-to-use C++17 / C++20 / C++23 thread pool library
A list of awesome compiler projects and papers for tensor computation and deep learning.
BLAS-like Library Instantiation Software Framework
Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction
Open-source software for volunteer computing and grid computing.
Lightweight, general, scalable C++ library for finite element methods
a Productive Parallel Programming Language
Hermit for Rust.
Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
Acceleration package for neural networks on multi-core CPUs
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
A Rust-based, lightweight unikernel.
An R-focused pipeline toolkit for reproducibility and high-performance computing
Primary repository for the Trilinos Project
Add a description, image, and links to the high-performance-computing topic page so that developers can more easily learn about it.
To associate your repository with the high-performance-computing topic, visit your repo's landing page and select "manage topics."