NVIDIA / CUDALibrarySamples
CUDA Library Samples
WholeGraph - large-scale Graph Neural Networks
cuGraph - RAPIDS Graph Analytics Library
RAFT contains fundamental, widely used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and serve as building blocks that make it easier to write high-performance applications.
A massively parallel, optimal functional runtime in Rust
FlashInfer: Kernel Library for LLM Serving
LLM training in simple, raw C/CUDA
CUDA Kernel Benchmarking Library
Sample code for my CUDA programming book
NCCL Tests
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl