- University of Surrey
- Guildford, UK
- https://sites.google.com/site/2adutta/
Stars
Official implementation of "Transitivity Recovering Decompositions: Interpretable and Robust Fine-Grained Relationships", NeurIPS 2023.
Official implementation of Data-Free Sketch-Based Image Retrieval, CVPR 2023.
[T-PAMI-2024] Transformer-Based Visual Segmentation: A Survey
A collection of papers on transformers for detection and segmentation: Awesome Detection Transformer for Computer Vision (CV).
ECCV 2022: Abstracting Sketches through Simple Primitives
📚 A collection of papers about Referring Image Segmentation.
A curated publication list on weakly-supervised temporal action localization
Official source code for "Continual 3D Convolutional Neural Networks for Real-time Processing of Videos" [ECCV2022]
Official implementation of "Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval", BMVC 2022.
Official implementation of "Relational Proxies: Emergent Relationships as Fine-Grained Discriminators", NeurIPS 2022.
PyTorch package for the discrete VAE used for DALL·E.
A curated list of prompt-based papers in computer vision and vision-language learning.
Identifying concept libraries from language about object structure
PyTorch implementation of the paper "Semantically Tied Paired Cycle Consistency for Zero-Shot Sketch-based Image Retrieval", CVPR 2019.
[CVPR2022] Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023
Baseline Triplet-Loss-Based Model for Fine-Grained Sketch-Based Image Retrieval.
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
PyTorch implementation of U-Net for semantic segmentation of high-quality images
Action Recognition on the KTH Dataset
A system for assigning and grading notebooks
A collection of extensions and data-loaders for few-shot learning & meta-learning in PyTorch
Multi-Headed Self-Attention via Vision Transformer for Zero-Shot Learning (ViT-ZSL)
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations