Stars
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
This is a repository for sharing papers in the field of empathetic conversational AI. The related source code for each paper is linked if available.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
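A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, domain-specific fine-tunes and applications, datasets, and tutorials.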
Example models using DeepSpeed
Awesome-LLM: a curated list of Large Language Models
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Evaluating Cross-lingual Sentence Representations
Specialize word embeddings for word semantic similarity or relatedness tasks.
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP
State-of-the-Art Text Embeddings
MASS: Masked Sequence to Sequence Pre-training for Language Generation
Enhancing Multilingual Sentence Embeddings with Semantic Specialization (AAAI '20)
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Paddle Graph Learning (PGL) is an efficient and flexible graph learning framework based on PaddlePaddle