Skip to content
View Fage2016's full-sized avatar

Block or report Fage2016

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,069 838 Updated Jul 1, 2024

Rasa UI is a frontend for the Rasa Framework

JavaScript 957 330 Updated Dec 30, 2022

BERT-based intent and slots detector for chatbots.

Python 124 17 Updated May 10, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 36,552 4,509 Updated Sep 25, 2024

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,326 2,910 Updated Sep 2, 2024

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 69,640 7,620 Updated Sep 29, 2024

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 68,008 14,428 Updated May 10, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 19,923 2,465 Updated Aug 15, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 36,437 5,722 Updated Aug 19, 2024

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C++ 26,138 8,706 Updated Sep 28, 2024

https://github.com/dmlc/xgboost

C++ 571 260 Updated Jul 4, 2018

坚持分享 GitHub 上高质量、有趣实用的开源技术教程、开发者工具、编程网站、技术资讯。A list cool, interesting projects of GitHub.

32,055 3,528 Updated May 29, 2024

LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解,记录自己的leetcode解题之路。)

JavaScript 54,525 9,451 Updated Aug 12, 2024

Jupyter notebooks for the code samples of the book "Deep Learning with Python"

Jupyter Notebook 18,542 8,623 Updated Jul 9, 2024

Implementation of BERT that could load official pre-trained models for feature extraction and prediction

Python 2,429 513 Updated Jan 22, 2022

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing

Java 96,013 12,177 Updated Sep 28, 2024

自然语言处理(NLP)教程,包括:词向量,词法分析,预训练语言模型,文本分类,文本语义匹配,信息抽取,翻译,对话。

Jupyter Notebook 385 62 Updated May 7, 2022

TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLOOM,GPT2,Seq2Seq,BART,T5,UDA等模型的训练和预测,开箱即用。

Python 926 107 Updated Sep 14, 2024

pytextclassifier is a toolkit for text classification. 文本分类,LR,Xgboost,TextCNN,FastText,TextRNN,BERT等分类模型实现,开箱即用。

Python 482 74 Updated Sep 25, 2024

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

Python 4,411 392 Updated Sep 8, 2024

pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,LLaMA等模型应用在纠错场景,开箱即用。

Python 5,512 1,089 Updated Sep 24, 2024

Easy-to-use,Modular and Extendible package of deep-learning based CTR models .

Python 7,535 2,206 Updated Aug 9, 2024

DSSM and Multi-View DSSM

Python 658 230 Updated Dec 15, 2020

A deep matching model library for recommendations & advertising. It's easy to train models and to export representation vectors which can be used for ANN search.

Python 2,213 530 Updated May 14, 2024

A collection of algorithms and data structures

Java 17,144 4,337 Updated Aug 15, 2024

基于tensorflow 实现的用textcnn方法做情感分析的项目,有数据,可以直接跑。

Python 340 132 Updated Dec 19, 2019

The code of CIKM'19 paper《Hierarchical Multi-label Text Classification: An Attention-based Recurrent Network Approach》

Python 315 73 Updated May 27, 2024

Word2vec, Fasttext, Glove, Elmo, Bert, Flair pre-train Word Embedding

Python 644 186 Updated Nov 16, 2020

torch-optimizer -- collection of optimizers for Pytorch

Python 3,021 297 Updated Mar 22, 2024
Next