Stars
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported)
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
800,000 step-level correctness labels on LLM solutions to MATH problems
Production grade design for llm deployment (in progress)
This project demonstrates a basic chain-of-thought interaction with any LLM (Large Language Model)
Solving the Traveling Salesman Problem using Self-Organizing Maps
Robust recipes to align language models with human and AI preferences
Advanced Quantization Algorithm for LLMs. This is official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs"
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
Robot Utility Models are trained on a diverse set of environments and objects, and then can be deployed in novel environments with novel objects without any further data or training.
200+ detailed flashcards useful for reviewing topics in machine learning, computer vision, and computer science.
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
An open and scalable video surveillance system for anyone making this world a better and more peaceful place.
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.
Beating the GAIA benchmark with Transformers Agents. 🚀
Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
A guide to try out examples shown in YourTechBud Codes YouTube Channel