Starred repositories
Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.
Code for the paper: Detecting Photoshopped Faces by Scripting Photoshop
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
TinyChatEngine: On-Device LLM Inference Library
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
🆓 List of free ChatGPT mirror sites, continuously updated.
[CVPR 2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions
SGLang is a fast serving framework for large language models and vision language models.
✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
EventHorizonV / psub
Forked from bulianglin/psub. A reverse-proxy subscription conversion tool built with a CF (Cloudflare) Worker.
Enhanced front end for Subconverter subscription conversion, adding nearly a hundred remote configurations and more customization features.
A rule-based tunnel for Android.
Multi-platform auto-proxy client, supporting Sing-box, X-ray, TUIC, Hysteria, Reality, Trojan, SSH, etc. It is open-source, secure, and ad-free.
A simple Python Pydantic model for Honkai: Star Rail parsed data from the Mihomo API.
Chinese-localized version of Clash for Windows. Provides the localized build, the localization patch, and a localized installer for Clash for Windows.
Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, macOS, Linux)
OpenGFW is a flexible, easy-to-use, open source implementation of GFW (Great Firewall of China) on Linux
Accelerating the development of large multimodal models (LMMs) with lmms-eval
GPT4V-level open-source multi-modal model based on Llama3-8B