Skip to content
View Junhojuno's full-sized avatar
🤿
Dive in
🤿
Dive in

Block or report Junhojuno

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 543 57 Updated Jun 7, 2024

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 1,768 126 Updated Jul 2, 2024

High-resolution models for human tasks.

Python 3,957 205 Updated Sep 20, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,402 2,134 Updated Aug 12, 2024

Official Implementation for "SCAPE: A Simple and Strong Category-Agnostic Pose Estimator", ECCV 2024.

4 Updated Jul 14, 2024

Hiera: A fast, powerful, and simple hierarchical vision transformer.

Python 862 39 Updated Mar 2, 2024

[ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking

Python 21 Updated Sep 12, 2024

This is the code of our paper "Video-Based Human Pose Regression via Decoupled Space-Time Aggregation".

Python 119 13 Updated Aug 2, 2024

A Visual Studio Code extension with support for the Ruff linter.

TypeScript 1,050 52 Updated Sep 20, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,921 907 Updated Aug 21, 2024

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 782 40 Updated Sep 17, 2024

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,221 148 Updated Aug 23, 2024

MMPD Dataset from ECCV'2024 "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset"

10 1 Updated Jul 15, 2024

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 13,083 2,569 Updated Aug 30, 2024

[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of the Open World"

Python 444 14 Updated Aug 9, 2024

autoupdate paper list

Python 46 8 Updated Sep 23, 2024

In CVPR'2024. Meta-Point Learning and Refining for Category-Agnostic Pose Estimation

Python 9 Updated Jul 30, 2024

Code for "LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model", CVPR 2024 Highlight

Python 26 2 Updated Jun 11, 2024

CAPE using text-graphs

Python 10 1 Updated Jun 8, 2024

A novel few-shot keypoint detector with uncertainty learning for unseen species (CVPR2022).

Python 30 2 Updated Sep 22, 2023

MLX: An array framework for Apple silicon

C++ 16,495 943 Updated Sep 20, 2024

[ECCV2024] Official implementation of paper, "DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs".

Python 96 3 Updated Aug 8, 2024

The project is an official implementation of our paper "PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation".

Python 245 29 Updated Jun 17, 2024

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Python 10,369 1,063 Updated Jun 21, 2024

[ICLR 2024] MogaNet: Efficient Multi-order Gated Aggregation Network

Jupyter Notebook 169 13 Updated Mar 5, 2024

Transparent Image Layer Diffusion using Latent Transparency

1,991 26 Updated Jun 16, 2024

[CVPR 2023] Code for PConv and FasterNet

Python 675 55 Updated May 16, 2023

Unofficial implementation of "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" (https://arxiv.org/abs/2103.14030)

Python 63 13 Updated Jun 28, 2021

This is the official code release for our work, Denoising Vision Transformers.

Python 282 8 Updated Jul 22, 2024
Next