Skip to content
View Xu-Jianjun's full-sized avatar
  • university of science and technology of China
  • Hefei China

Highlights

  • Pro

Block or report Xu-Jianjun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources

17 Updated Sep 14, 2024

Paper collections of multi-modal LLM for Math/STEM/Code.

13 1 Updated Sep 21, 2024

[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

Python 256 22 Updated Aug 24, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,100 121 Updated Sep 21, 2024

Accompanying repo for 'Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs' project

24 Updated Sep 5, 2024
Python 14 Updated Mar 21, 2024
4 Updated Jun 18, 2024

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Python 248 5 Updated Aug 29, 2024

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Python 5,018 417 Updated Aug 29, 2024

100+套大数据可视化炫酷大屏Html5模板;包含行业:社区、物业、政务、交通、金融银行等,全网最新、最多,最全、最酷、最炫大数据可视化模板。陆续更新中

JavaScript 2,116 691 Updated Jul 24, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 11,979 842 Updated Sep 13, 2024

A generative speech model for daily dialogue.

Python 30,851 3,350 Updated Sep 4, 2024

[Under Review] Official PyTorch implementation code for realizing the technical part of Mamba-based traversal of rationale (Meteor) to improve performance of numerous vision language performances f…

Python 99 4 Updated May 30, 2024

Converting Mixtral-8x7B to Mixtral-[1~7]x7B

Python 20 1 Updated Mar 4, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,558 2,421 Updated Sep 21, 2024

天涯神贴合集 pdf版 无水印版 免费分享 方便阅读

349 126 Updated Sep 14, 2024

A native PyTorch Library for large model training

Python 2,069 156 Updated Sep 19, 2024

The official Meta Llama 3 GitHub site

Python 26,205 2,951 Updated Aug 12, 2024

[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥

Python 2,275 251 Updated Aug 27, 2024

Ongoing research training transformer models at scale

Python 10,040 2,261 Updated Sep 21, 2024

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,455 94 Updated Jun 1, 2023

Compose multimodal datasets 🎹

Python 171 8 Updated Mar 17, 2024

Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs.

209 18 Updated Mar 19, 2024

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,354 67 Updated Mar 8, 2024

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 56,236 6,914 Updated Sep 20, 2024

An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework.

Python 1,459 131 Updated Sep 6, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,273 1,007 Updated Sep 20, 2024

A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks

Jupyter Notebook 241 11 Updated Jul 30, 2024

VideoSys: An easy and efficient system for video generation

Python 1,646 111 Updated Sep 14, 2024

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,310 115 Updated Apr 17, 2024
Next