Stars
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
CV-VAE: A Compatible Video VAE for Latent Generative Video Models
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Official Implementation of "Learning Inclusion Matching for Animation Paint Bucket Colorization"
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
DUSt3R + Gaussian Splatting
📚 A collection of Deep Learning based Image Colorization and Video Colorization papers.
[ECCV 2024] 3DPE: Real-time 3D-aware Portrait Editing from a Single Image
Understand Human Behavior to Align True Needs
A free, licensed, and industrial animation dataset
Official implementations for paper: LivePhoto: Real Image Animation with Text-guided Motion Control
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
Implementation of our paper 'Bilateral Propagation Network for Depth Completion'
Official implementations for paper: InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
Official code for the paper "LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes".
Official Implementation of paper "A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence"