Xu-Jianjun

kaiclife Xu-Jianjun

university of science and technology of China
Hefei China

Highlights

Stars

NishilBalar / Awesome-LVLM-Hallucination

up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources

17 Updated Sep 14, 2024

InfiMM / Awesome-Multimodal-LLM-for-Math-STEM

Paper collections of multi-modal LLM for Math/STEM/Code.

13 1 Updated Sep 21, 2024

shikiw / OPERA

[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

Python 256 22 Updated Aug 24, 2024

QwenLM / Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,100 121 Updated Sep 21, 2024

jonathan-roberts1 / charting-new-territories

Accompanying repo for 'Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs' project

24 Updated Sep 5, 2024

tal-tech / chinese-k12-evaluation

Python 14 Updated Mar 21, 2024

Xu-Jianjun / OTE

4 Updated Jun 18, 2024

OpenGVLab / OmniCorpus

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Python 248 5 Updated Aug 29, 2024

adithya-s-k / omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Python 5,018 417 Updated Aug 29, 2024

iGaoWei / BigDataView

100+套大数据可视化炫酷大屏Html5模板；包含行业：社区、物业、政务、交通、金融银行等，全网最新、最多，最全、最酷、最炫大数据可视化模板。陆续更新中

JavaScript 2,116 691 Updated Jul 24, 2024

OpenBMB / MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 11,979 842 Updated Sep 13, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 30,851 3,350 Updated Sep 4, 2024

ByungKwanLee / Meteor

[Under Review] Official PyTorch implementation code for realizing the technical part of Mamba-based traversal of rationale (Meteor) to improve performance of numerous vision language performances f…

Python 99 4 Updated May 30, 2024

fxmeng / mixtral_spliter

Converting Mixtral-8x7B to Mixtral-[1~7]x7B

Python 20 1 Updated Mar 4, 2024

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,558 2,421 Updated Sep 21, 2024