rmsnorm

Here are 6 public repositories matching this topic...

DefTruth / CUDA-Learn-Notes

🎉 CUDA Learn Notes with PyTorch: fp32、fp16/bf16、fp8/int8、flash_attn、sgemm、sgemv、warp/block reduce、dot prod、elementwise、softmax、layernorm、rmsnorm、hist etc.

cuda pytorch triton gemm softmax cuda-programming layernorm gemv elementwise rmsnorm flash-attention flash-attention-2 warp-reduce block-reduce flash-attention-3

Updated Sep 21, 2024
Cuda

bzhangGo / rmsnorm

Star

Root Mean Square Layer Normalization

layernorm rmsnorm

Updated Mar 28, 2023
Python

knotgrass / Griffin

Star

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

h3 linear attention language-model griffin mamba gelu conv1d rmsnorm rg-lru shift-ssm

Updated May 26, 2024
Python

dtunai / Tri-RMSNorm

Star

Efficient kernel for RMS normalization with fused operations, includes both forward and backward passes, compatibility with PyTorch.

machine-learning ai triton rmsnorm

Updated Jun 5, 2024
Python

sushantkumar23 / nano-gpt

Star

Simple character level Transformer

transformers pytorch attention attention-mechanism rope self-attention multi-head-attention shakespeare-dataset transformer-architecture llm rmsnorm

Updated May 27, 2024
Jupyter Notebook

rmgogogo / nano-aigc

Star

Generative models nano version for fun. No STOA here, nano first.

Updated May 20, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the rmsnorm topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the rmsnorm topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rmsnorm

Here are 6 public repositories matching this topic...

DefTruth / CUDA-Learn-Notes

bzhangGo / rmsnorm

knotgrass / Griffin

dtunai / Tri-RMSNorm

sushantkumar23 / nano-gpt

rmgogogo / nano-aigc

Improve this page

Add this topic to your repo