Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
-
Updated
Dec 25, 2024 - Python
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Flink CDC is a streaming data integration tool
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
The open source high performance ELT framework powered by Apache Arrow
Privacy and Security focused Segment-alternative, in Golang and React
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Efficient data transformation and modeling framework that is backwards compatible with dbt.
Dataform is a framework for managing SQL based data operations in BigQuery
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
ReplicaDB is open source tool for database replication, designed for efficiently transferring bulk data between relational and non-relational databases
dbt + Metabase integration
Collection of Sample Databricks Spark Notebooks ( mostly for Azure Databricks )
Add a description, image, and links to the elt topic page so that developers can more easily learn about it.
To associate your repository with the elt topic, visit your repo's landing page and select "manage topics."