Skip to content
View ShanZard's full-sized avatar

Block or report ShanZard

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
46 stars written in Python
Clear filter

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,217 6,381 Updated Sep 27, 2024

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 19,802 2,977 Updated Aug 28, 2024

Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习

Python 13,303 3,805 Updated Sep 25, 2024

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Python 11,694 878 Updated Sep 27, 2024

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

Python 11,327 1,923 Updated Aug 29, 2024

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 10,284 1,543 Updated Aug 29, 2024

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python 9,461 1,652 Updated Sep 28, 2024

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 8,019 1,003 Updated Sep 5, 2024

Segment Anything in High Quality [NeurIPS 2023]

Python 3,657 220 Updated Jul 7, 2024

Summary of related papers on visual attention. Related code will be released based on Jittor gradually.

Python 2,746 410 Updated Dec 2, 2022

Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

Python 2,605 428 Updated Oct 2, 2022

Unsupervised Learning for Image Registration

Python 2,262 577 Updated Sep 20, 2024

Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).

Python 2,120 135 Updated Jun 7, 2023

Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)

Python 1,993 205 Updated Sep 27, 2024

Medical image registration using deep learning

Python 564 76 Updated Dec 15, 2022

A Change Detection Repo Standing on the Shoulders of Giants

Python 506 74 Updated Jul 23, 2024

Transformer-based image captioning extension for pytorch/fairseq

Python 313 56 Updated Dec 18, 2020

The official repo for [NeurIPS'23] "SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model"

Python 276 14 Updated Aug 5, 2024

Implementation of Deformable Attention in Pytorch from the paper "Vision Transformer with Deformable Attention"

Python 273 29 Updated Apr 23, 2024

Code for "Aligning Linguistic Words and Visual Semantic Units for Image Captioning", ACM MM 2019

Python 263 24 Updated Oct 18, 2019

The Most Faithful Implementation of Segment Anything (SAM) in 3D

Python 261 12 Updated Sep 11, 2024

Includes: Learning data augmentation strategies for object detection | GridMask data augmentation | Augmentation for small object detection in Numpy. Use RetinaNet with ResNet-18 to test these meth…

Python 239 48 Updated Aug 21, 2020

Global Reasoning module for visual recognition

Python 206 52 Updated Oct 12, 2021

"SOLQ: Segmenting Objects by Learning Queries", SOLQ is an end-to-end instance segmentation framework with Transformer.

Python 198 21 Updated Apr 17, 2022
Python 192 21 Updated Sep 24, 2024

SSL4EO-S12: a large-scale dataset for self-supervised learning in Earth observation

Python 180 18 Updated Apr 20, 2024

Implementation of the Object Relation Transformer for Image Captioning

Python 176 44 Updated Sep 17, 2024

Official repo for "SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing"

Python 113 2 Updated Dec 31, 2023

Official LEVIR-CC dataset and Pytorch implementation for Remote Sensing Image Change Captioning With Dual-Branch Transformers: A New Method and a Large Scale Dataset

Python 104 6 Updated May 11, 2024

[AAAI 2024] EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering

Python 66 3 Updated Sep 26, 2024
Next