Jags jags14385

building stuff

Trying to be a better human and a developer everyday

24 followers · 131 following

Achievements

x3 x3

Achievements

x3 x3

Stars

google / litmus

Litmus is a comprehensive LLM testing and evaluation tool designed for GenAI Application Development. It provides a robust platform with a user-friendly UI for streamlining the process of building …

Vue 2 2 Updated Sep 19, 2024

google / har-sanitizer

Python 70 12 Updated May 20, 2024

ml-explore / mlx-swift-examples

Examples using MLX Swift

Swift 628 85 Updated Sep 12, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

28,576 1,561 Updated Aug 1, 2024

mzbac / IntelliVend32

Python 1 Updated Jul 2, 2024

openai / evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 14,700 2,575 Updated Aug 20, 2024

princeton-nlp / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Python 13,322 1,305 Updated Sep 16, 2024

meta-prompting / meta-prompting

Official implementation of paper "Meta Prompting for AI Systems" (https://arxiv.org/abs/2311.11482)

Python 76 11 Updated Sep 17, 2024

apple / ml-ferret

Python 8,313 486 Updated Jan 27, 2024

All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More

Python 31,425 3,626 Updated Sep 20, 2024

charlax / entrepreneurship-resources

A list of articles, books, videos related to entrepreneurship

JavaScript 391 53 Updated Sep 16, 2024

charlax / professional-programming

A collection of learning resources for curious software engineers

Python 46,245 3,707 Updated Sep 16, 2024

kolodny / safetest

TypeScript 1,360 35 Updated Jul 15, 2024

graphql / dataloader

DataLoader is a generic utility to be used as part of your application's data fetching layer to provide a consistent API over various backends and reduce requests to those backends via batching and…

JavaScript 12,835 510 Updated Aug 27, 2024