Stars
[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
LayoutDM: Discrete Diffusion Model for Controllable Layout Generation [Inoue+, CVPR2023]
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
[ICCV 2023] Spatially-Adaptive Feature Modulation for Efficient Image Super-Resolution; runner-up method for the model complexity track in NTIRE2023 Efficient SR challenge
Official repository of the Fried Rice Lab, including code resources of the following our works: ESWT [arXiv], etc. This repository also implements many useful features and out-of-the-box image rest…
NeRF-Art: Text-Driven Neural Radiance Fields Stylization
CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet
PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE)
LAVIS - A One-stop Library for Language-Vision Intelligence
Batch Face Processing for Modern Research, including face detection, face alignment, face reconstruction, head pose estimation
[NeurIPS 2022] Official Code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
ECCV2022,Bootstrapped Masked Autoencoders for Vision BERT Pretraining
Paper 'Transformer based Pluralistic Image Completion with Reduced Information Loss' in TPAMI 2024 and 'Reduce Information Loss in Transformers for Pluralistic Image Inpainting' in CVPR2022
Bringing Old Films Back to Life (CVPR 2022)
Large-Scale Pre-training for Person Re-identification with Noisy Labels (LUPerson-NL)
MichiGAN: Multi-Input-Conditioned Hair Image Generation for Portrait Editing (SIGGRAPH 2020)
PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
Implementation of Diverse Semantic Image Synthesis via Probability Distribution Modeling (CVPR 2021)
CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields
[CVPR 2022] HairCLIP: Design Your Hair by Text and Reference Image
[NeurIPS 2021] “Stronger NAS with Weaker Predictors“, Junru Wu, Xiyang Dai, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Ye Yu, Zhangyang Wang, Zicheng Liu, Mei Chen and Lu Yuan
Learning with Noisy Labels for Robust Point Cloud Segmentation (ICCV2021 Oral)
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022
Unsupervised Pre-training for Person Re-identification (LUPerson)
Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"
High-Fidelity Pluralistic Image Completion with Transformers (ICCV 2021)