Skip to content
@DeepAuto-AI

DeepAuto.ai

Deep Automation for Everyone

Popular repositories Loading

  1. hip-attention hip-attention Public

    Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.

    Python 14 3

  2. vllm-legacy vllm-legacy Public

    Forked vLLM Framework, for DeepAuto Chat Platform. Supports HiP Attention

    Python 1

  3. sglang sglang Public

    Forked from sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    Python 1

  4. vllm vllm Public

    Forked from vllm-project/vllm

    Up to 4x faster decoding than vLLM using HiP Attention: https://github.com/DeepAuto-AI/hip-attention

    Python

Repositories

Showing 4 of 4 repositories
  • sglang Public Forked from sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    DeepAuto-AI/sglang’s past year of commit activity
    Python 1 Apache-2.0 371 0 0 Updated Sep 20, 2024
  • vllm Public Forked from vllm-project/vllm

    Up to 4x faster decoding than vLLM using HiP Attention: https://github.com/DeepAuto-AI/hip-attention

    DeepAuto-AI/vllm’s past year of commit activity
    Python 0 Apache-2.0 4,053 0 0 Updated Sep 20, 2024
  • hip-attention Public

    Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.

    DeepAuto-AI/hip-attention’s past year of commit activity
    Python 14 3 0 0 Updated Sep 15, 2024
  • vllm-legacy Public

    Forked vLLM Framework, for DeepAuto Chat Platform. Supports HiP Attention

    DeepAuto-AI/vllm-legacy’s past year of commit activity
    Python 1 Apache-2.0 0 0 0 Updated Jul 8, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…