Future/Later
Your friendliest open source all-in-one automation tool ✨ Workflow automation tool 200+ integration / Enterprise automation tool / Zapier Alternative
10x easier, 140x lower storage cost, high performance, petabyte scale - Elasticsearch/Splunk/Datadog alternative for logs, metrics, traces, RUM, error tracking, and session replay.
Web based real-time log viewer. Stream ANY content to a web UI with autogenerated filters. Parse any format with TypeScript.
[ICML2024] Unified Training of Universal Time Series Forecasting Transformers
A Python framework for defining and querying BI models in your data warehouse
Create web-based user interfaces with Python. The nice way.
pyinfra turns Python code into shell commands and runs them on your servers. Execute ad-hoc commands and write declarative operations. Target SSH servers, local machine and Docker containers. Fast …
Auto-generated Diagrams from Airflow DAGs.
Example Repo to have full end to end pyspark testing via docker-compose
The Open-Source Enterprise Data Platform in a single Portal
A dbt-core plugin to weave together multi-project dbt-core deployments
Interact with your SQL database, Natural Language to SQL using LLMs
A little Python library for making simple Electron-like HTML/JS GUI apps
Performant Redshift data source for Apache Spark
List of EDI (Mostly) GitHub Resources
Open source distributed Platform as a Service (PaaS). A self-hosted Vercel / Netlify / Cloudflare alternative.
chDB is an in-process OLAP SQL Engine powered by ClickHouse
Code for "Efficient Data Processing in Spark" Course
Database Markup Language (DBML), designed to define and document database structures
Conditionally run actions based on files modified by PR, feature branch or pushed commits
DuckDB-powered Postgres for high performance apps & analytics.
The open-source analytics development platform
A modern cookiecutter template for Python projects that use uv for dependency management