Stars
Litmus is a comprehensive LLM testing and evaluation tool designed for GenAI Application Development. It provides a robust platform with a user-friendly UI for streamlining the process of building …
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.
Official implementation of paper "Meta Prompting for AI Systems" (https://arxiv.org/abs/2311.11482)
A list of articles, books, videos related to entrepreneurship
A collection of learning resources for curious software engineers
DataLoader is a generic utility to be used as part of your application's data fetching layer to provide a consistent API over various backends and reduce requests to those backends via batching and…
This dbt package contains macros to support unit testing that can be (re)used across dbt projects.
A Gradio web UI for Large Language Models.
Audio dictionary is a dictionary built with two AI (chatGPT and DeepGram). that can be used to answer any questions you have
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Demo Project for Open Source MDS
Material for my React Fundamentals Workshop
Complete framework to build API applications in Golang using grpc
A collection of public resources about how software companies test their software
The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.
A Random Quote Machine using "The Matrix" Theme.
The "Navigating the World as a Context-Driven Tester" book