Junhojuno

🤿

Dive in

Kim Junho Junhojuno

Research Engineer, Human Pose Estimation

16 followers · 92 following

EverEx
Seoul
@J_u_n_o_Kim

Achievements

Stars

FoundationVision / Groma

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 543 57 Updated Jun 7, 2024

dvlab-research / LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 1,768 126 Updated Jul 2, 2024

facebookresearch / sapiens

High-resolution models for human tasks.

Python 3,957 205 Updated Sep 20, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,402 2,134 Updated Aug 12, 2024

mistralai / mistral-finetune

Python 2,665 213 Updated Sep 13, 2024

tiny-smart / SCAPE

Official Implementation for "SCAPE: A Simple and Strong Category-Agnostic Pose Estimator", ECCV 2024.

4 Updated Jul 14, 2024

facebookresearch / hiera

Hiera: A fast, powerful, and simple hierarchical vision transformer.

Python 862 39 Updated Mar 2, 2024

HengLan / SMOT

[ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking

Python 21 Updated Sep 12, 2024

zgspose / DSTA

This is the code of our paper "Video-Based Human Pose Regression via Decoupled Space-Time Aggregation".

Python 119 13 Updated Aug 2, 2024

astral-sh / ruff-vscode

A Visual Studio Code extension with support for the Ruff linter.

TypeScript 1,050 52 Updated Sep 20, 2024

facebookresearch / segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,921 907 Updated Aug 21, 2024

LTH14 / mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 782 40 Updated Sep 17, 2024

google-research / big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,221 148 Updated Aug 23, 2024

jin-s13 / MMPD-Dataset

MMPD Dataset from ECCV'2024 "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset"

10 1 Updated Jul 15, 2024

google-deepmind / deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 13,083 2,569 Updated Aug 30, 2024

OpenGVLab / all-seeing

[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of the Open World

Python 444 14 Updated Aug 9, 2024

isLinXu / paper-list

autoupdate paper list

Python 46 8 Updated Sep 23, 2024

chenbys / MetaPoint

In CVPR'2024. Meta-Point Learning and Refining for Category-Agnostic Pose Estimation

Python 9 Updated Jul 30, 2024

kennethwdk / LocLLM

Code for "LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model", CVPR 2024 Highlight

Python 26 2 Updated Jun 11, 2024

matanr / capex

CAPE using text-graphs

Python 10 1 Updated Jun 8, 2024

AlanLuSun / Few-shot-keypoint-detection

A novel few-shot keypoint detector with uncertainty learning for unseen species (CVPR2022).

Python 30 2 Updated Sep 22, 2023

ml-explore / mlx

MLX: An array framework for Apple silicon

C++ 16,495 943 Updated Sep 20, 2024

naver-ai / rdnet

[ECCV2024] Official implementation of paper, "DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs".

Python 96 3 Updated Aug 8, 2024

QitaoZhao / PoseFormerV2

The project is an official implementation of our paper "PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation".

Python 245 29 Updated Jun 17, 2024

magic-research / magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Python 10,369 1,063 Updated Jun 21, 2024

Westlake-AI / MogaNet

[ICLR 2024] MogaNet: Efficient Multi-order Gated Aggregation Network

Jupyter Notebook 169 13 Updated Mar 5, 2024

lllyasviel / LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

1,991 26 Updated Jun 16, 2024

JierunChen / FasterNet

[CVPR 2023] Code for PConv and FasterNet

Python 675 55 Updated May 16, 2023

VcampSoldiers / Swin-Transformer-Tensorflow

Unofficial implementation of "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" (https://arxiv.org/abs/2103.14030)

Python 63 13 Updated Jun 28, 2021

Jiawei-Yang / Denoising-ViT

This is the official code release for our work, Denoising Vision Transformers.

Python 282 8 Updated Jul 22, 2024