Deep Tech R&D Research
Updated May 20, 2024 - C++
My research, playground, techniques with Parallel Programming
CUDA C++ Core Libraries
🎉 CUDA notes / hand-written CUDA kernels for large models / C++ notes, updated as time permits: flash_attn, sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.
A General-purpose Parallel and Heterogeneous Task Programming System
A performance-oriented prototyping harness for state-of-the-art Molecular Dynamics algorithms
AI, IoT and Robotics Hardware + ROS
TinyChatEngine: On-Device LLM Inference Library
Yet Another Scattering Framework, Python implementation
This repo contains CUDA programming projects in C++, written to learn CUDA from scratch.
μ-Cuda: cover the last mile of CUDA. Features: IntelliSense-friendly, structured launch, automatic CUDA graph generation and updating.
A place where I learn about CUDA
Eikonal CUDA implementation for the Advanced Methods for Scientific Computing (AMSC) Course @POLIMI
A simple ray-tracing program implemented with CUDA.
Safe Rust wrapper around the CUDA toolkit
Thin, unified, C++-flavored wrappers for the CUDA APIs
C++ framework for deep neural networks