GHGmc2
🎯 Focusing


Starred repositories

On-device AI across mobile, embedded and edge for PyTorch

C++ 1,724 294 Updated Sep 24, 2024

collection of benchmarks to measure basic GPU capabilities

Jupyter Notebook 244 38 Updated Jun 21, 2024

seqax = sequence modeling + JAX

Python 130 10 Updated Jul 17, 2024

Debugging torch distributed program

Python 1 Updated Aug 30, 2024

Tutel MoE: An Optimized Mixture-of-Experts Implementation

Python 712 88 Updated Sep 13, 2024
Jupyter Notebook 14 2 Updated Jul 21, 2024

Examples demonstrating available options to program multiple GPUs in a single node or a cluster

Cuda 528 106 Updated Aug 14, 2024

nanoGPT style version of Llama 3.1

Python 1,178 56 Updated Aug 8, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,241 501 Updated Jul 31, 2024

Dynamic Memory Management for Serving LLMs without PagedAttention

C 189 13 Updated Sep 23, 2024

The Hardware Sampling (hws) library can be used to track hardware performance like clock frequency, memory usage, temperatures, or power draw.

C++ 5 1 Updated Sep 23, 2024
Python 229 31 Updated Aug 20, 2024

[Information Fusion 2024] A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective

187 12 Updated Sep 22, 2024
Jupyter Notebook 61 6 Updated Jul 23, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

1,052 22 Updated Jul 31, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,689 112 Updated Sep 19, 2024

The Abstraction and Reasoning Corpus

JavaScript 3,340 552 Updated Aug 4, 2024
Python 433 26 Updated Jul 29, 2024

SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation

Python 309 34 Updated Jun 24, 2024
Python 249 25 Updated Sep 24, 2024

📰 Must-read papers and blogs on Speculative Decoding ⚡️

366 14 Updated Sep 20, 2024

Use PyTorch Models with CasADi for data-driven optimization or learning-based optimal control. Supports Acados.

Python 340 22 Updated Sep 6, 2024

This is originally a collection of papers on neural network accelerators. Now it's more like my selection of research on deep learning and computer architecture.

1,843 381 Updated Aug 11, 2024

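Provides a practical interactive interface for GPT/GLM and other large language models (LLMs), specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…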

Python 64,217 7,940 Updated Sep 23, 2024

Odysseus: Playground of LLM Sequence Parallelism

Python 50 1 Updated Jun 17, 2024

A nanoGPT pipeline packed in a spreadsheet

2,035 120 Updated Jun 17, 2024

A fast communication-overlapping library for tensor parallelism on GPUs.

C++ 193 13 Updated Sep 18, 2024