Lists (1)
Sort Name ascending (A-Z)
Starred repositories
An open-source C++ library developed and used at Facebook.
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
Differentiable signal processing on the sphere for PyTorch
Efficient Triton Kernels for LLM Training
SRIOV network device plugin for Kubernetes
Core c99 package for AWS SDK for C. Includes cross-platform primitives, configuration, data structures, and error handling.
Repository for open inference protocol specification
Identify the blast radius and risks for Terraform changes in real time
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
A vector search SQLite extension that runs anywhere!
An interactive HTML pretty-printer for machine learning research in IPython notebooks.
A SQLite extension for generate text embeddings from GGUF models using llama.cpp
NVIDIA Linux open GPU kernel module source
FlashInfer: Kernel Library for LLM Serving
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Deploy a Flux MiniCluster to Kubernetes with the operator
In this repo, we share code samples from "Learn Kubernetes with Google" video series. This repo may expand with series on other projects in the future!
A networking protocol for agent-environment communication
Recommended C code style and coding rules for standard C99 or later
InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing