Skip to content
View zyeric's full-sized avatar
  • MSRA
  • Beijing

Block or report zyeric

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 35 3 Updated Sep 28, 2024

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Jupyter Notebook 3,842 1,028 Updated Jun 6, 2024

LLM training code for Databricks foundation models

Python 3,983 525 Updated Sep 30, 2024

A native PyTorch Library for large model training

Python 2,255 165 Updated Sep 29, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,414 233 Updated Sep 14, 2024

A PyTorch Native LLM Training Framework

Python 587 28 Updated Aug 25, 2024

NCCL Tests

Cuda 840 232 Updated Jul 30, 2024

LLM inference in C/C++

C++ 65,657 9,421 Updated Sep 30, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,065 536 Updated May 31, 2024

A series of large language models developed by Baichuan Intelligent Technology

Python 4,079 293 Updated Jun 22, 2024

Inference code for Llama models

Python 55,757 9,502 Updated Aug 18, 2024

A python library that provides common I/O interface across different storage backends.

Python 132 23 Updated Sep 19, 2024
Jupyter Notebook 565 51 Updated Sep 17, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 7,759 939 Updated Sep 30, 2024

maximal update parametrization (µP)

Jupyter Notebook 1,360 93 Updated Jul 17, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,418 596 Updated Sep 27, 2024

Fast and memory-efficient exact attention

Python 13,600 1,245 Updated Sep 30, 2024

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,330 295 Updated Jul 14, 2024

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 14,742 2,581 Updated Aug 20, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 31,655 4,711 Updated Sep 22, 2024

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 33,245 5,621 Updated Sep 30, 2024

An open source implementation of CLIP.

Python 9,908 957 Updated Aug 19, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 24,964 3,230 Updated Jul 23, 2024

ModelScope: bring the notion of Model-as-a-Service to life.

Python 6,854 710 Updated Sep 30, 2024

C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!

C++ 468 109 Updated Aug 22, 2024

Scenic: A Jax Library for Computer Vision Research and Beyond

Python 3,264 429 Updated Sep 30, 2024

Lingvo

Python 2,811 443 Updated Sep 28, 2024

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 5,874 667 Updated Sep 6, 2024

A fast MoE impl for PyTorch

Python 1,535 186 Updated Jul 5, 2024
Next