Stars
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?
Anthropic's educational courses
Understanding Deep Learning - Simon J.D. Prince
This project converts the API of Anthropic's Claude model to the OpenAI Chat API format.
📚 Freely available programming books
A massively parallel, high-level programming language
The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"
Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [Findings of ACL 2024]
Label Studio is a multi-type data labeling and annotation tool with standardized output format
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
A comprehensive guide to machine learning (ML) and natural language processing (NLP). This project is intended for students at technical universities who are studying ML, as well as for those who aspire to…
A modular graph-based Retrieval-Augmented Generation (RAG) system
SimPO: Simple Preference Optimization with a Reference-Free Reward
SGLang is a fast serving framework for large language models and vision language models.
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
This project covers my participation in the RuNNE competition: https://github.com/dialogue-evaluation/RuNNE
Compare texts within a pandas DataFrame, highlighting changes and computing similarity ratios
Vector (and Scalar) Quantization, in PyTorch
A Python library for hierarchical classification compatible with scikit-learn