Advanced Quantization Algorithm for LLMs. This is the official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs".

Python 206 19 Updated Sep 20, 2024

NirDiamant / GenAI_Agents

This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…

Jupyter Notebook 640 67 Updated Sep 19, 2024

haritheja-e / robot-utility-models

Robot Utility Models are trained on a diverse set of environments and objects, and then can be deployed in novel environments with novel objects without any further data or training.

Python 138 4 Updated Sep 12, 2024

b7leung / MLE-Flashcards

200+ detailed flashcards useful for reviewing topics in machine learning, computer vision, and computer science.

1,960 172 Updated Jun 12, 2024

Sakil786 / Crawl4aiforWebscrapping

Crawl4aiforWebscrapping

Jupyter Notebook 1 Updated Sep 8, 2024

datamllab / LongLM

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Python 596 58 Updated Jun 1, 2024

NirDiamant / RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 6,387 609 Updated Sep 19, 2024

JohnSnowLabs / langtest

Deliver safe & effective language models

Python 491 38 Updated Sep 20, 2024

Lightning-AI / LitServe

Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.

Python 2,111 129 Updated Sep 20, 2024

eosphoros-ai / DB-GPT

AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents

Python 13,296 1,766 Updated Sep 19, 2024

kerberos-io / agent

An open and scalable video surveillance system for anyone making this world a better and more peaceful place.

Go 689 85 Updated Sep 14, 2024

princeton-nlp / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Python 13,323 1,305 Updated Sep 16, 2024

aymeric-roucher / GAIA

Beating the GAIA benchmark with Transformers Agents. 🚀

Jupyter Notebook 56 8 Updated Sep 18, 2024

ezelikman / STaR

Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)

Python 115 16 Updated Feb 21, 2023

JohnZolton / snorkle

100% Local Document deep search with LLMs

TypeScript 24 3 Updated Sep 5, 2024

ggerganov / llama.cpp

LLM inference in C/C++

C++ 65,188 9,344 Updated Sep 21, 2024

RWKV / rwkv.cpp

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

C++ 1,404 93 Updated Aug 7, 2024

YourTechBud / ytb-practical-guide

A guide to try out examples shown in YourTechBud Codes YouTube Channel

Jupyter Notebook 47 21 Updated Sep 16, 2024

Shaun shaunck96

Stars

Mintplex-Labs / anything-llm

Kwai-Kolors / Kolors

pseudotensor / open-strawberry

exo-explore / exo

codelion / optillm

hijkzzz / Awesome-LLM-Strawberry

openai / prm800k

shaunck96 / SEC-Filings-Scraper

shaunck96 / LLM-Production-System-Design-Llama-Phi-InternLM

antibitcoin / ReflectionAnyLLM

diego-vicente / som-tsp

huggingface / alignment-handbook

intel / auto-round