Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to content
View jboarman's full-sized avatar
Sparkfish
Sparkfish

Sponsoring

@RandomFractals
@aloneguid
@sparkfish

Highlights

  • Pro

Organizations

@sparkfish

Block or report jboarman

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

😎 A curated list of awesome DataOps tools

Python 155 20 Updated Oct 14, 2024

⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io

Python 1,907 208 Updated Nov 4, 2024

A light-weight, flexible, and expressive statistical data testing library

Python 3,381 310 Updated Nov 13, 2024

Semantic Functions for Semantic Link

Python 12 6 Updated Nov 12, 2024

Maestro: Netflix’s Workflow Orchestrator

Java 3,317 201 Updated Aug 9, 2024

Qubole Sparklens tool for performance tuning Apache Spark

Scala 568 138 Updated Jun 26, 2024

Samples on how to use Azure SQL database with Azure OpenAI

TSQL 73 31 Updated Nov 8, 2024

A Python framework for defining and querying BI models in your data warehouse

Python 160 6 Updated Apr 12, 2024

Display paginated content in the browser and generate print books using web technology

HTML 841 87 Updated Oct 4, 2024

Integration tests for dbt

Makefile 12 2 Updated Aug 26, 2023

Bloat-free, no BS cloud storage SDK.

C# 167 13 Updated Nov 8, 2024

Exposes the Windows Process creation Win32 functions in PowerShell

PowerShell 44 5 Updated Nov 8, 2024

Invoke Command As System/Interactive/GMSA/User on Local/Remote machine & returns PSObjects.

PowerShell 456 71 Updated May 5, 2023

No-code in the front, Python in the back. An open-source framework for creating data apps.

Python 1,326 75 Updated Nov 7, 2024

Yet another googlesearch - A Python library for executing intelligent, realistic-looking, and tunable Google searches.

Python 252 43 Updated Apr 7, 2024

🔥 Blazing fast bulk data transfers between any cloud 🔥

Python 1,082 62 Updated May 11, 2024

Free and open source schema versioning and database migration made natively with .NET/6. NEW THIS MAY 2022! v1.3.15 released!

C# 419 64 Updated Jul 25, 2024

All image quality metrics you need in one package.

Python 591 72 Updated Oct 4, 2023

Augraphy: Creating Realistic Document Image Datasets with Data Augmentation

Jupyter Notebook 6 4 Updated Aug 21, 2023
Python 3 1 Updated Apr 11, 2023

An SVG rendering library.

Rust 2,821 228 Updated Nov 12, 2024

Fully managed Apache Parquet implementation

C# 632 152 Updated Nov 12, 2024

2D/3D renderer - makes it simple to draw stuff across platforms (including web)

Rust 1,322 110 Updated Nov 11, 2024

Make writing easier!

Python 79 5 Updated Aug 15, 2021

A high-performance SVG renderer and toolkit, powered by Rust based resvg and napi-rs.

TypeScript 1,565 55 Updated Nov 1, 2024

ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for use in training models to reverse distortions and recover to o…

Jupyter Notebook 50 6 Updated Nov 12, 2024

Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets

Python 109 3 Updated May 20, 2024

Introducing the most comprehensive and up-to-date open source dataset on US car models on Github. With over 15,000 entries covering car models manufactured between 1992 and 2023, this repository of…

432 160 Updated May 5, 2024

The state-of-the-art image restoration model without nonlinear activation functions.

Python 2,238 276 Updated Jul 3, 2024
Python 5 Updated Jan 30, 2021
Next