Skip to content
@gpustack

GPUStack

Open-source GPU cluster manager for running large language models(LLMs)

Pinned Loading

  1. gpustack gpustack Public

    Manage GPU clusters for running LLMs

    Python 277 19

  2. gguf-parser-go gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    Go 16 2

Repositories

Showing 8 of 8 repositories
  • llama-box Public

    LLM inference server implementation based on llama.cpp.

    gpustack/llama-box’s past year of commit activity
    C++ 8 MIT 1 1 0 Updated Sep 23, 2024
  • gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    gpustack/gguf-parser-go’s past year of commit activity
    Go 16 MIT 2 0 0 Updated Sep 23, 2024
  • gguf-packer-go Public

    Deliver LLMs of GGUF format via Dockerfile.

    gpustack/gguf-packer-go’s past year of commit activity
    Go 2 MIT 0 0 0 Updated Sep 23, 2024
  • gpustack/gpustack-ui’s past year of commit activity
    TypeScript 1 5 0 0 Updated Sep 22, 2024
  • gpustack Public

    Manage GPU clusters for running LLMs

    gpustack/gpustack’s past year of commit activity
    Python 277 Apache-2.0 19 46 2 Updated Sep 22, 2024
  • gpustack/gpustack.github.io’s past year of commit activity
    HTML 0 0 0 0 Updated Sep 15, 2024
  • fastfetch Public Forked from fastfetch-cli/fastfetch

    Like neofetch, but much faster because written mostly in C.

    gpustack/fastfetch’s past year of commit activity
    C 0 MIT 396 0 1 Updated Aug 2, 2024
  • .github Public
    gpustack/.github’s past year of commit activity
    0 0 0 0 Updated Jul 23, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…