mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python 1,331 84 Updated Sep 20, 2024

This is a repository for sharing papers in the field of empathetic conversational AI. The related source code for each paper is linked if available.

237 25 Updated Apr 17, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 13,466 1,097 Updated Sep 2, 2024

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 2,719 250 Updated Sep 23, 2024

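A collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, domain-specific fine-tuning and applications, datasets, and tutorials.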

15,059 1,400 Updated Sep 19, 2024

Public repo for HF blog posts

Jupyter Notebook 2,279 708 Updated Sep 23, 2024

A Chinese Open-Domain Dialogue System

Python 310 27 Updated Aug 16, 2023

Example models using DeepSpeed

Python 6,000 1,019 Updated Sep 17, 2024

Awesome-LLM: a curated list of Large Language Models

17,528 1,418 Updated Sep 23, 2024

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)

HTML 7,831 753 Updated Mar 15, 2024

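Industry write-ups and introductions covering the applications, architectures, and algorithms of intelligent customer service and chatbots.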

1,089 228 Updated Aug 29, 2024

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 10,946 1,825 Updated Sep 17, 2024

Jieba Chinese word segmentation

Python 33,129 6,723 Updated Aug 21, 2024

EVA: Large-scale Pre-trained Chit-Chat Models

Python 304 51 Updated Mar 11, 2023

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python 22,550 3,614 Updated Jul 28, 2024

End-to-End Speech Processing Toolkit

Python 8,319 2,157 Updated Sep 23, 2024
Jupyter Notebook 14 2 Updated Jun 4, 2022
C 89 24 Updated Apr 22, 2024

Evaluating Cross-lingual Sentence Representations

440 44 Updated Aug 30, 2021

Specializes word embeddings for word semantic similarity or relatedness tasks.

JavaScript 8 Updated Mar 25, 2018

The Cantonese Wordnet

14 3 Updated Dec 4, 2023

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Python 19,036 2,632 Updated Sep 23, 2024

A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese

84 4 Updated Oct 17, 2021

Cantonese Linguistics and NLP

Python 356 39 Updated May 23, 2024

State-of-the-Art Text Embeddings

Python 14,894 2,438 Updated Sep 19, 2024

MASS: Masked Sequence to Sequence Pre-training for Language Generation

Python 1,116 206 Updated Nov 28, 2022

Enhancing Multilingual Sentence Embeddings with Semantic Specialization (AAAI '20)

4 Updated Nov 20, 2019

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,202 6,376 Updated Sep 9, 2024

Paddle Graph Learning (PGL) is an efficient and flexible graph learning framework based on PaddlePaddle

Python 1,570 311 Updated Dec 11, 2023
Python 21 5 Updated Dec 20, 2019