Stars
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
OpenAPI Generator allows generation of API client libraries (SDK generation), server stubs, documentation and configuration automatically given an OpenAPI Spec (v2, v3)
Backstage is an open framework for building developer portals
Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.
Configure and deploy complete EKS clusters.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
Retrieval and Retrieval-augmented LLMs
The toolkit to pack, ship, store, and deliver container content
Open-source observability for your LLM application, based on OpenTelemetry
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Accessible large language models via k-bit quantization for PyTorch.
Code and documentation to train Stanford's Alpaca models, and generate the data.
QLoRA: Efficient Finetuning of Quantized LLMs
Transformer related optimization, including BERT, GPT
Style transfer, deep learning, feature transform
Ongoing research training transformer models at scale
Platform for building access networks and modular network services
GraphiQL & the GraphQL LSP Reference Ecosystem for building browser & IDE tools.
A cloud-native vector database, storage for next generation AI applications
LlamaIndex is a data framework for your LLM applications
🦜🔗 Build context-aware reasoning applications
Karpenter is a Kubernetes Node Autoscaler built for flexibility, performance, and simplicity.
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Rules engine for cloud security, cost optimization, and governance, DSL in yaml for policies to query, filter, and take actions on resources