Data
lakeFS - Data version control for your data lake | Git for data
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
DuckDB is an analytical in-process SQL database management system
An open-source time-series SQL database optimized for fast ingest and complex queries. Packaged as a PostgreSQL extension.
Voilร turns Jupyter notebooks into standalone web applications
A data visualization and analytics component, especially well-suited for large and/or streaming datasets.
๐ฆ๐ Build context-aware reasoning applications
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
๐๐ฎ๐๐ฎ, ๐๐ป๐ฎ๐น๐๐๐ถ๐ฐ๐ & ๐๐. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com
๐ฅ๐ฅ๐ฅAI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.
Conduit streams data between data stores. Kafka Connect replacement. No JVM required.
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team coโฆ
EventStoreDB, the event-native database. Designed for Event Sourcing, Event-Driven, and Microservices architectures
Concurrent and multi-stage data ingestion and data processing with Elixir
A library that provides useful extensions to Apache Spark and PySpark.
A generative AI extension for JupyterLab
A code-first agent framework for seamlessly planning and executing data analytics tasks.
๐ฆ PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host.