Stars

Data Science

Visualization Platform
27 repositories

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript 62,017 13,594 Updated Sep 30, 2024

Panel: The powerful data exploration & web app framework for Python

Python 4,677 508 Updated Sep 29, 2024

SymmetricDS is database replication and file synchronization software that is platform independent, web enabled, and database agnostic. It is designed to make bi-directional data replication fast, …

Java 737 224 Updated Sep 30, 2024

The Data Engineering Cookbook

13,600 2,491 Updated Aug 1, 2024

Dolt – Git for Data

Go 17,793 505 Updated Sep 28, 2024

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 15,684 4,024 Updated Sep 30, 2024

Database Subsetting and Relational Data Browsing Tool.

Java 2,838 118 Updated Sep 24, 2024

Testing Framework for PL/SQL

PLSQL 559 185 Updated Sep 18, 2024

The platform for building AI from enterprise data

Python 26,333 4,789 Updated Sep 27, 2024

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 36,506 14,135 Updated Sep 30, 2024

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

Python 17,745 2,391 Updated Sep 24, 2024

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Python 11,615 387 Updated Sep 20, 2024

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

Go 5,474 584 Updated Sep 29, 2024

🛠 Python project template generator with batteries included

Python 2,084 181 Updated Sep 23, 2024

🦉 ML Experiments and Data Management with Git

Python 13,659 1,173 Updated Sep 29, 2024

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Python 9,679 1,607 Updated Sep 29, 2024

Simple Python version management

Roff 38,801 3,024 Updated Sep 28, 2024

Dependency injection container made for Python

Python 402 25 Updated Aug 15, 2024

DuckDB is an analytical in-process SQL database management system

C++ 23,082 1,837 Updated Sep 28, 2024

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 33,235 5,620 Updated Sep 30, 2024

Upserts, Deletes And Incremental Processing on Big Data.

Java 5,344 2,419 Updated Sep 30, 2024

Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code

Python 607 153 Updated Sep 29, 2024

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 8,188 1,032 Updated Apr 24, 2024

Open source platform for the machine learning lifecycle

Python 18,407 4,161 Updated Sep 30, 2024

Nessie: Transactional Catalog for Data Lakes with Git-like semantics

Java 987 129 Updated Sep 30, 2024

Automatic data change tracking for PostgreSQL

TypeScript 279 7 Updated Sep 26, 2024

Manage a Postgres cluster's roles, role memberships, schema ownership, and privileges

Python 315 35 Updated Jan 11, 2024