CUDA C++ Core Libraries
Updated: May 20, 2024 (C++)
A General-purpose Parallel and Heterogeneous Task Programming System
Towards Scalable and Efficient K-Means Data Clustering.
An Upstream Clang/LLVM-based toolchain for contemporary C++ and heterogeneous programming
Computations and statistics on manifolds with geometric structures.
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
A GPU algorithm for enumerating weak pseudomanifolds
Learnings and experimentation with GPU programming
A high-performance, zero-overhead, extensible Python compiler using LLVM
Productive, portable, and performant GPU programming in Python.
🐉 Making Rust a first-class language and ecosystem for GPU shaders 🚧
Programming Gemm Kernels on NVIDIA GPUs with Tensor Cores in Julia
Compute shader interface for WGPU from Julia
OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.
Rust frontend to LuisaCompute and more!
CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.
🌟 Vertex-centric approach for building GNNs/TGNNs