-
Alibaba TongYi
- Beijing, China
Stars
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
(SIGGRAPH Asia 2024) This is the official PyTorch implementation of SIGGRAPH Asia 2024 paper: DrawingSpinUp: 3D Animation from Single Character Drawings
High-resolution models for human tasks.
[CVPR 2024] code release for "DiffusionLight: Light Probes for Free by Painting a Chrome Ball"
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Simple Online Realtime Tracking with a Deep Association Metric
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
[3DV 2024] Official repo of "TeCH: Text-guided Reconstruction of Lifelike Clothed Humans"
[CVPR'22] ICON: Implicit Clothed humans Obtained from Normals
[Arxiv 2024] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
[ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
Mask Transfiner for High-Quality Instance Segmentation, CVPR 2022
A batched offline inference oriented version of segment-anything
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Fast and memory-efficient exact attention
Karras et al. (2022) diffusion models for PyTorch
Official Code for MotionCtrl [SIGGRAPH 2024]
[CVPR 2024] TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation