Starred repositories (bartoszgajda55)

A Python library for running data quality rules while a Spark job is running ⚡

Python 1 Updated Mar 4, 2024

Pythonic Programming Framework to orchestrate jobs in Databricks Workflow

Python 187 41 Updated Sep 17, 2024

Maestro: Netflix’s Workflow Orchestrator

Java 3,256 199 Updated Aug 9, 2024

Open, Multi-modal Catalog for Data & AI

Java 2,264 353 Updated Sep 28, 2024

The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data P…

Python 217 39 Updated Sep 23, 2024

Apache DataFusion Comet Spark Accelerator

Rust 759 150 Updated Sep 28, 2024

Monitoring Azure Databricks jobs

Scala 211 177 Updated Jul 30, 2024

OpenTofu lets you declaratively manage your cloud infrastructure.

Go 22,667 871 Updated Sep 28, 2024

PySpark methods to enhance developer productivity 📣 👯 🎉

Python 625 97 Updated Sep 7, 2024

Databricks SDK for Python (Beta)

Python 349 116 Updated Sep 26, 2024

Work with your web service, database, and streaming schemas in a single format.

Python 324 25 Updated Mar 28, 2024

Apache DolphinScheduler is a modern data orchestration platform for agile, low-code creation of high-performance workflows

Java 12,724 4,584 Updated Sep 27, 2024

Demo of using Nutter to test Databricks notebooks in a CI/CD pipeline

Python 149 129 Updated Aug 14, 2024

Pattern Matching

Python 1,022 64 Updated Jun 2, 2022

A comprehensive self-management System

Haskell 288 47 Updated Sep 24, 2024

🧱 Databricks CLI eXtensions (aka dbx) is a CLI tool for development and advanced Databricks workflow management.

Python 440 120 Updated Sep 16, 2024

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 10,241 2,949 Updated Sep 28, 2024

Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks

359 80 Updated Jun 6, 2017

A curated list of useful resources for gRPC

7,549 578 Updated Aug 20, 2024

Pre-trained models and language resources for Natural Language Processing in Polish

315 27 Updated Jun 5, 2024

ERP beyond your fridge - Grocy is a web-based self-hosted groceries & household management solution for your home

Blade 6,708 560 Updated Sep 10, 2024

A better notebook for Scala (and more)

Jupyter Notebook 4,514 393 Updated Aug 1, 2024

Simple and Distributed Machine Learning

Scala 5,052 830 Updated Sep 16, 2024

Command-line program to download videos from YouTube.com and other video sites

Python 131,571 9,968 Updated Aug 17, 2024

Roadmap (mind map) and keywords for students interested in learning NLP

3,218 512 Updated Sep 29, 2019

A personal knowledge management and sharing system for VSCode

TypeScript 15,258 649 Updated Sep 27, 2024

🌊 Online machine learning in Python

Python 5,016 540 Updated Sep 11, 2024

Clean Code concepts adapted for Java. Based on @ryanmcdermott repository.

450 119 Updated Jun 7, 2024

Deploy über-JARs. Restart processes. (port of codahale/assembly-sbt)

Scala 1,949 224 Updated Sep 7, 2024