Highlights
- Pro
Stars
A script that automatically backups all Overleaf projects to a local folder. It works.
Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explanations?"
Official Implementation of Paella https://arxiv.org/abs/2211.07292v2
Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)
A collection of resources and papers on Diffusion Models
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multilβ¦
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
shawwn / CLIP
Forked from openai/CLIPContrastive Language-Image Pretraining
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Multi Task Vision and Language
π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Object detection, 3D detection, and pose estimation using center point detection:
A collection of various deep learning architectures, models, and tips
Code for Neural Motifs: Scene Graph Parsing with Global Context (CVPR 2018)
A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.
Factorizable Net (Multi-GPU version): An Efficient Subgraph-based Framework for Scene Graph Generation
[ECCV 2018] Official code for "Graph R-CNN for Scene Graph Generation"