Skip to content
View Erwin-X's full-sized avatar

Block or report Erwin-X

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Download the latest stable Synergy binaries.

Python 1,194 112 Updated Sep 19, 2023

计算机自学指南

HTML 55,848 6,745 Updated Sep 13, 2024

The math library of Lean 4

Lean 1,388 309 Updated Sep 21, 2024

Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".

Jupyter Notebook 291 13 Updated Jun 11, 2024

Conversion between Traditional and Simplified Chinese

C++ 8,369 974 Updated Sep 5, 2024

The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。

JavaScript 47,895 9,612 Updated Aug 10, 2024

MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024]

Python 59 7 Updated Sep 20, 2024

A tool for extracting plain text from Wikipedia dumps

Python 3,735 963 Updated May 23, 2024

中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。

Python 4,240 783 Updated Nov 21, 2023

Meta-Transformer for Unified Multimodal Learning

Python 1,499 115 Updated Dec 5, 2023

Similarities: a toolkit for similarity calculation and semantic search. 相似度计算、匹配搜索工具包,支持亿级数据文搜文、文搜图、图搜图,python3开发,开箱即用。

Python 740 72 Updated Sep 8, 2024

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

Python 270 40 Updated May 19, 2024

Scalable training for dense retrieval models.

Python 268 24 Updated May 27, 2023

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Go 90,069 7,073 Updated Sep 21, 2024

Official completion of “Training on the Benchmark Is Not All You Need”.

Python 20 3 Updated Sep 18, 2024

TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLOOM,GPT2,Seq2Seq,BART,T5,UDA等模型的训练和预测,开箱即用。

Python 926 107 Updated Sep 14, 2024

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

Python 4,401 392 Updated Sep 8, 2024

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

Python 4,001 374 Updated Aug 13, 2024

[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA

JavaScript 176 17 Updated Aug 31, 2024

Cool Papers - Immersive Paper Discovery

HTML 355 5 Updated Sep 11, 2024

Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"

Python 106 4 Updated Jun 5, 2024

LLM hallucination paper list

268 21 Updated Mar 11, 2024

Token-level Reference-free Hallucination Detection

Python 92 8 Updated Jul 25, 2023

Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI

Python 90 10 Updated Feb 22, 2023
Python 53 1 Updated Feb 16, 2024

Teaching Models to Express Their Uncertainty in Words

Python 36 5 Updated May 26, 2022

Do Large Language Models Know What They Don’t Know?

Python 84 5 Updated Dec 5, 2023
Python 78 1 Updated Nov 11, 2022

EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers https://arxiv.org/abs/2109.08535

Python 137 11 Updated Feb 21, 2022

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

Python 1,170 45 Updated Sep 17, 2024
Next