Skip to content
View JosephKJ's full-sized avatar
🙇‍♂️
Work hard!
🙇‍♂️
Work hard!

Block or report JosephKJ

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 15,738 1,063 Updated Sep 18, 2024

This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"

Python 138 2 Updated Apr 17, 2024

✨✨Latest Advances on Multimodal Large Language Models

11,780 759 Updated Sep 19, 2024

Python Library to evaluate VLM models' robustness across diverse benchmarks

Jupyter Notebook 162 8 Updated Sep 3, 2024

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,250 439 Updated Sep 6, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,841 889 Updated Aug 21, 2024

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 697 55 Updated Jul 29, 2024

next gen smart vlm reasoner

Python 5 Updated Jun 22, 2024
Python 208 15 Updated Apr 10, 2024

Continual Few-Shot Learning of New Actions With Prompt Tuning

1 Updated Jul 5, 2024

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Python 844 52 Updated Mar 19, 2024

The open-source tool for building high-quality datasets and computer vision models

Python 8,115 540 Updated Sep 20, 2024

🔥 [CVPR 2024] The official repo for Zero-Painter!

Python 54 3 Updated Jun 8, 2024

[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"

Python 473 26 Updated Jul 16, 2024

OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)]

Python 41 4 Updated Sep 9, 2024

Recent LLM-based CV and related works. Welcome to comment/contribute!

826 35 Updated Jun 5, 2024

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 14,697 2,576 Updated Aug 20, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,181 279 Updated May 4, 2024

[CVPR24 Oral] Official repository for RALF: Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation

Python 91 1 Updated Jul 6, 2024
Python 432 26 Updated Jul 29, 2024

Go ahead and axolotl questions

Python 7,578 822 Updated Sep 18, 2024

Create Magic Story!

Jupyter Notebook 5,800 577 Updated Jul 24, 2024

Multimodal language model benchmark, featuring challenging examples

Python 144 6 Updated Aug 13, 2024

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 797 57 Updated Jul 10, 2024

Official Repo of Graphist

92 2 Updated Apr 23, 2024

A Framework of Continual Learning

Python 73 4 Updated Sep 8, 2024

[CVPR 2024] AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion

Python 78 4 Updated Aug 28, 2024

Control Color: Multimodal Diffusion-based Interactive Image Colorization

102 2 Updated Feb 22, 2024
Python 8,309 486 Updated Jan 27, 2024
Next