Python scripts for training/testing paragraph vectors

Python 644 192 Updated Aug 7, 2023

A class containing common algorithms for computing text similarity, such as longest common subsequence, shortest token distance, and TF-IDF.

Python 63 37 Updated Mar 28, 2017

DeepDive

Shell 1,955 539 Updated Jun 9, 2022

NLTK Source

Python 13,440 2,864 Updated Sep 4, 2024

Easily generate document/paragraph/sentence vectors and calculate similarity.

Python 136 31 Updated Oct 5, 2021

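:octocat: The most complete collection of front-end resources on GitHub (including front-end learning, development resources, job hunting and interviews, and more)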

PHP 9,378 1,876 Updated Mar 16, 2024

Jieba Chinese word segmentation

Python 33,118 6,724 Updated Aug 21, 2024

pyltp: the python extension for LTP

C++ 1,530 352 Updated Jul 24, 2022

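Built on a self-hosted LTP server, this implements word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, and semantic role labeling; named entity extraction (person, place, and organization names); and triple extraction (subject-verb-object, verb-object, and preposition-object relations, in the form (entity 1, relation, entity 2)).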

Python 142 53 Updated Aug 19, 2017

Python wrapper for Stanford CoreNLP.

Python 919 200 Updated Dec 7, 2021

An optimizer that trains as fast as Adam and as good as SGD.

Python 2,904 330 Updated Jul 23, 2023

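A Python module that extracts provinces, cities, and districts from Simplified Chinese strings, with support for mapping, validation, and simple plotting.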

Python 1,661 396 Updated Mar 19, 2024

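Jiagu deep learning natural language processing toolkit: knowledge graph relation extraction, Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, new word discovery, keyword extraction, text summarization, text clustering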

Python 3,297 611 Updated May 7, 2022

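Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyphrase extraction, automatic summarization, text classification and clustering, pinyin and Simplified/Traditional Chinese conversion, natural language processing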

Python 33,546 10,005 Updated Sep 8, 2024

The AI Agent Framework in .NET

C# 2,165 449 Updated Sep 20, 2024

Python library for information extraction of quantities from unstructured text

Python 120 23 Updated Apr 21, 2023

Chinese word segmentation

Python 3,115 804 Updated Apr 19, 2024

Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT (Chinese entity recognition and relation extraction)

Python 2,220 814 Updated Feb 1, 2024

A topic-centric list of HQ open datasets.

60,188 9,850 Updated Sep 6, 2024

A collection of commonly used tools

2 Updated Feb 21, 2019

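Free download of the largest knowledge graph dataset to date, with 140 million records: a general-purpose knowledge graph that merges more than 25 million entities and contains entity attribute relations on the order of hundreds of millions.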

Python 994 155 Updated Oct 21, 2020

Performance benchmarks of the major Chinese word segmentation tools

Python 151 29 Updated Feb 10, 2019

all kinds of text classification models and more with deep learning

Python 2 Updated Nov 25, 2018

Doc2Vec algorithm for solving movie review sentiment analysis

Python 25 12 Updated Nov 26, 2015

A natural language modeling framework based on PyTorch

Python 6,340 800 Updated Oct 17, 2022

StyleGAN - Official TensorFlow Implementation

Python 14,090 3,171 Updated Apr 10, 2024

Text classification using the pre-trained Chinese BERT model; the dataset is the Chinese sentiment analysis corpus chnsenticorp

Python 302 74 Updated Aug 26, 2019

Chinese sentence similarity computation based on siamese-lstm

Python 130 36 Updated Jul 1, 2018

Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings

C 6,825 1,511 Updated Sep 19, 2023

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 22,269 5,489 Updated Aug 14, 2024