Skip to content
View VariableExp0rt's full-sized avatar

Block or report VariableExp0rt

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

An open-source C++ library developed and used at Facebook.

C++ 28,202 5,543 Updated Sep 23, 2024

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

Python 2,914 222 Updated Aug 10, 2024

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

TypeScript 3,466 353 Updated Sep 20, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 4,025 313 Updated Sep 19, 2024

Optimizing inference proxy for LLMs

Python 798 76 Updated Sep 21, 2024

Generate builders for everything!

Rust 1,060 16 Updated Sep 22, 2024

PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT

Python 2,521 349 Updated Sep 22, 2024

Differentiable signal processing on the sphere for PyTorch

Jupyter Notebook 365 28 Updated Sep 20, 2024

Efficient Triton Kernels for LLM Training

Python 3,002 153 Updated Sep 22, 2024

SRIOV network device plugin for Kubernetes

Go 396 175 Updated Sep 18, 2024

Core c99 package for AWS SDK for C. Includes cross-platform primitives, configuration, data structures, and error handling.

C 256 156 Updated Sep 6, 2024

Repository for open inference protocol specification

41 10 Updated Jul 21, 2024

Identify the blast radius and risks for Terraform changes in real time

Go 162 3 Updated Sep 23, 2024

Evidently is ​​an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

Jupyter Notebook 5,164 575 Updated Sep 18, 2024

A vector search SQLite extension that runs anywhere!

C 3,817 130 Updated Sep 16, 2024

An interactive HTML pretty-printer for machine learning research in IPython notebooks.

Python 260 14 Updated Sep 13, 2024

A SQLite extension for generate text embeddings from GGUF models using llama.cpp

C 93 1 Updated Aug 24, 2024

NVIDIA Linux open GPU kernel module source

C 15,050 1,248 Updated Sep 20, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 1,153 104 Updated Sep 19, 2024

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 10,059 1,446 Updated Aug 8, 2024

Provisioning system for CM4 products

PHP 97 14 Updated Jun 26, 2024

Deploy a Flux MiniCluster to Kubernetes with the operator

Python 31 8 Updated Aug 16, 2024

In this repo, we share code samples from "Learn Kubernetes with Google" video series. This repo may expand with series on other projects in the future!

Python 37 9 Updated Jul 1, 2024

A networking protocol for agent-environment communication

Python 90 10 Updated Jun 17, 2024

Common code for TFX

Python 64 53 Updated Sep 19, 2024

Recommended C code style and coding rules for standard C99 or later

Python 1,018 231 Updated Jun 13, 2024

Frida Rust bindings

Rust 177 46 Updated Sep 22, 2024

A Rust crate for BBC micro:bit development

Rust 268 61 Updated Aug 5, 2024

InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing

Go 19 1 Updated Aug 2, 2024
Next