Lists (1)
Sort Name ascending (A-Z)
Stars
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Official code release for the paper "SkillMimic: Learning Reusable Basketball Skills from Demonstrations"
Official PyTorch implementation of "Expressive Whole-Body 3D Gaussian Avatar", ECCV 2024.
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
OmniTokenizer: one model and one weight for image-video joint tokenization.
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Official implementation of Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle
ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference"
Official implementation of Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion
[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model
A Modular Framework for 3D Gaussian Splatting and Beyond
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
[NeurIPS'23] Emergent Correspondence from Image Diffusion
Envision3D: One Image to 3D with Anchor Views Interpolation
[ACM MM-2021] WePerson: learning a generalized re-identification model from all-weather virtual data
[CVPR 2024 Oral, Best Paper Runner-Up] Code for "pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction" by David Charatan, Sizhe Lester Li, Andrea Tagliasacch…
[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
This is the repository for the Photorealistic Unreal Graphics (PUG) datasets for representation learning.