Lists (6)
Sort Name ascending (A-Z)
Stars
Manage a Postgres cluster's roles, role memberships, schema ownership, and privileges
A library that creates fully populated objects for your unit tests.
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
Open source platform for the machine learning lifecycle
VPN client in a thin Docker container for multiple VPN providers, written in Go, and using OpenVPN or Wireguard, DNS over TLS, with a few proxy servers built-in.
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
Easily turn your Click CLI into a powerful terminal application
Upserts, Deletes And Incremental Processing on Big Data.
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Pelican plugin to improve search engine optimization (SEO)
DuckDB is an analytical in-process SQL database management system
IDE style command line auto complete
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
🦉 ML Experiments and Data Management with Git
🛠 Python project template generator with batteries included
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
CTF framework and exploit development library
Python composable command line interface toolkit
Backend for the self-hosted gaming platform for drm-free games
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
CLI tool to easily migrate Kubernetes persistent volumes
Another repository with lightweight Helm Charts.