Stars
📖 A curated list of awesome LLM inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
Solve puzzles. Improve your PyTorch.
A curated list of awesome projects and papers for distributed training or inference
A novel temporal fusion framework for propelling autoregressive model inference
Distributed tracing without code changes. 🚀 Instantly monitor any application using OpenTelemetry and eBPF
The most common question patterns for any coding interview
A full-stack simulation of a ridesharing app
Expert resume guide for experienced software engineers
Being laid off can be overwhelming, and it's easy to miss important tasks. This runbook will help make sure you stay on track.
C++ Implementation of PyTorch Tutorials for Everyone
List of awesome semiconductor startups
Master the command line, in one page
A playbook for systematically maximizing the performance of deep learning models.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Open-source benchmark suite for cloud microservices
The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++
Comprehensive language-agnostic guidelines on variable naming. Home of the A/HC/LC pattern.
Code for PaperRobot: Incremental Draft Generation of Scientific Ideas
📦 CMake's missing package manager. A small CMake script for setup-free, cross-platform, reproducible dependency management.
Run TensorFlow models in C++ without installation and without Bazel