ilumiere
💭
I may be slow to respond.

Pinned

  1. lmdeploy Public

    Forked from InternLM/lmdeploy

    LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

    Python

  2. ollama Public

    Forked from ollama/ollama

    Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

    Go

  3. sglang Public

    Forked from sgl-project/sglang

    SGLang is yet another fast serving framework for large language models and vision language models.

    Python

  4. text-generation-inference Public

    Forked from huggingface/text-generation-inference

    Large Language Model Text Generation Inference

    Python

  5. vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python

  6. llama.cpp Public

    Forked from ggerganov/llama.cpp

    LLM inference in C/C++

    C++