XieDake

Stars

kepengxu / PGTFormer

[IJCAI'24] Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent Transformer

Python 170 21 Updated Sep 10, 2024

Yuliang-Liu / Monkey

[CVPR 2024 Highlight] Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Python 1,776 123 Updated Sep 5, 2024

akfamily / akshare

AKShare is an elegant and simple open-source financial data interface library for Python, built for human beings!

Python 9,007 1,849 Updated Sep 21, 2024

opendatalab / MinerU

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.

Python 11,462 859 Updated Sep 20, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 30,870 3,357 Updated Sep 21, 2024

fishaudio / fish-speech

Brand new TTS solution

Python 12,206 926 Updated Sep 20, 2024

AiuniAI / Unique3D

Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Python 2,880 223 Updated Sep 18, 2024

advimman / lama

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Jupyter Notebook 7,854 832 Updated Jul 26, 2024

david419kr / video-watermark-removal-script-for-lama-cleaner

A simple script for removing video watermarks using Lama Cleaner. Only tested in an NVIDIA Windows environment.

Python 41 25 Updated Jul 16, 2024

TXH-mercury / VAST

Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

Jupyter Notebook 231 15 Updated Mar 14, 2024

Sanster / IOPaint

Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.

Python 18,845 1,926 Updated Sep 5, 2024

facebookresearch / ImageBind

ImageBind: One Embedding Space to Bind Them All

Python 8,233 754 Updated Jul 31, 2024

McGill-NLP / llm2vec

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,147 86 Updated Sep 16, 2024

PKU-YuanGroup / Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 2,861 207 Updated Jul 27, 2024

Alpha-VLLM / LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Python 2,685 169 Updated May 24, 2024

DAMO-NLP-SG / Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,721 242 Updated Jun 4, 2024

FlagOpen / FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Python 6,887 498 Updated Sep 19, 2024

modelscope / ms-swift

Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (Qwen2.5, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Python 3,498 299 Updated Sep 21, 2024