Stars
A multimodal agent framework for solving complex tasks
✨✨Latest Advances on Multimodal Large Language Models
agentUniverse is a LLM multi-agent framework that allows developers to easily build multi-agent applications.
Research Code for Multimodal-Cognition Team in Ant Group
A package for parsing PDFs and analyzing their content using LLMs.
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合中国宝宝的部署教程
李白 👤 作为唐代杰出诗人,其诗歌作品在中国文学史上具有重要地位。近年来,随着数字技术和人工智能的快速发展,传统文化普及推广的形式也面临着创新与变革。国内外对于李白诗歌的研究虽已相当深入,但在数字化、智能化普及方面仍存在不足。因此,本项目旨在通过构建李白知识图谱,结合大模型训练出专业的AI智能体,以生成式对话应用的形式,推动李白文化的普及与推广。
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
A generative speech model for daily dialogue.
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
#1 Locally hosted web application that allows you to perform various operations on PDF files
[NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence
A generalized information-seeking agent system with Large Language Models (LLMs).
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
FinQwen: 致力于构建一个开放、稳定、高质量的金融大模型项目,基于大模型搭建金融场景智能问答系统,利用开源开放来促进「AI+金融」。
【LLMs九层妖塔】分享 LLMs在自然语言处理(ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等)、信息检索(langchain)、语言合成、语言识别、多模态等领域(Stable Diffusion、MiniGPT-4、VisualGLM-6B、Ziya-Visual等)等 实战与经验。
An Efficient "Factory" to Build Multiple LoRA Adapters