Stars
🚀 Cross attention map tools for huggingface/diffusers
ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Official implementation of "Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance"
Person Image Synthesis via Denoising Diffusion Model (CVPR 2023)
Diffusion Model-Based Image Editing: A Survey (arXiv)
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
[ICCV 2023] Consistent Image Synthesis and Editing
The official implementation for PNT-Edge, accepted by ACM-MM 2023.
Generative Models by Stability AI
Official PyTorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.
Diffusion attentive attribution maps for interpreting Stable Diffusion.
Official implementation of Shape-aware ControlNet, accepted by ACM MM 2024.
[arXiv preprint] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
Official PyTorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
Transparent Image Layer Diffusion using Latent Transparency
Official Repository of the paper "Trajectory Consistency Distillation"
Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"
Official implementation of the paper "DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models"