Skip to content
View passing2961's full-sized avatar
πŸ˜ƒ
πŸ˜ƒ

Block or report passing2961

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"

Python 142 2 Updated Sep 26, 2024

[Under Review] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enlarged hidden dimension to build super frontier vision languag…

Python 31 Updated Sep 24, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,367 132 Updated Sep 24, 2024

Official implementation of the Law of Vision Representation in MLLMs

Python 115 7 Updated Sep 8, 2024

Critique-out-Loud Reward Models

Python 22 1 Updated Sep 5, 2024

A Survey on Benchmarks of Multimodal Large Language Models

33 1 Updated Sep 29, 2024

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 422 192 Updated Jul 4, 2024

🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Paper.

Python 85 5 Updated Sep 28, 2024

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 661 39 Updated Aug 22, 2024

[ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically designed for fine-grained evaluation on a customized score ru…

Python 287 18 Updated Nov 11, 2023

[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333

Python 1,026 61 Updated Jan 11, 2024

😸 πŸ’¬ A module to compute textual lexical richness (aka lexical diversity).

Python 90 19 Updated Aug 27, 2023

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Python 2,220 113 Updated Mar 13, 2024

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

Python 1,367 208 Updated Apr 3, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 10,090 1,002 Updated Sep 27, 2024

This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"

117 1 Updated Jun 13, 2024

A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.

Python 16 Updated Aug 23, 2024

πŸŒ‹πŸ‘΅πŸ» Yo'LLaVA: Your Personalized Language and Vision Assistant

Python 52 2 Updated Sep 11, 2024

Python Library to evaluate VLM models' robustness across diverse benchmarks

Jupyter Notebook 163 8 Updated Sep 26, 2024

cuML - RAPIDS Machine Learning Library

C++ 4,170 528 Updated Sep 27, 2024

Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023

Python 238 15 Updated Jun 7, 2023

A Home Assistant integration & Model to control your smart home using a Local LLM

Python 618 64 Updated Sep 14, 2024

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

Python 2,951 227 Updated Aug 10, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,088 942 Updated Sep 29, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,928 810 Updated Aug 15, 2024

πŸ₯ Code and Dataset for our EMNLP 2022 paper - "ProsocialDialog: A Prosocial Backbone for Conversational Agents"

Python 60 Updated Aug 2, 2023

[ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision

Python 69 7 Updated Sep 12, 2024

πŸ‘» Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"

Python 49 3 Updated May 31, 2024

Official repository for "Intriguing Properties of Vision Transformers" (NeurIPS 2021--Spotlight)

Python 176 19 Updated Aug 9, 2022

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 12,518 1,456 Updated Sep 29, 2024
Next