Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
-
Updated
Dec 3, 2024 - Java
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
A powerful open source data warehouse system
🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥
An efficient storage and compute engine for both on-prem and cloud-native data analytics.
ElasticFlow(伊塔)是一个开源弹性流数据交换系统,支持在任意类型数据端之间通过简单配置就可以建立可计算的弹性流管道,并进行定时、定量、高并发、多类型的交换数据服务。系统可应用于数据交换、通用搜索引擎、数据发布服务、数据仓库等项目。
BioDWH2 is an easy-to-use, automated, graph-based data warehouse and mapping tool for bioinformatics and medical informatics.
LogUnify is a schema-centric service that provides structured application event logging and seamless integration with data warehouses such as BigQuery for easy storage and analysis of event data.
All assignments and the final project are completed in class CSCI 5408 (Data Management, Warehousing and Analytics) of MACS at Dalhousie University. CSCI 5408 DMWA Dalhousie University.
Universidade do Minho - 4º ano
This warehouse is made for storing and studying insurance claims data for vehicles serviced at designated branches.
🏥 Public Health Data Warehouse using FHIR and Kibana
Purpose-built data connectors for Google CDAP data pipelines
A POC on the data warehousing solution provided by Google Cloud
A powerful open source data warehouse system
The METRO DW prototype uses Mesh Join & Star Schema for sales, customer & inventory data analysis. Implemented in SQL & Java for fast, accurate, & consistent data retrieval. Offers valuable insights & can be queried with standard BI tools.
This repository comprises the design, implementation, and analysis of a near real-time data warehouse prototype for an electronics business chain, utilising a multi-threaded Extract, Transform, Load (ETL) pipeline leveraging the efficient HYBRIDJOIN algorithm implemented with Java and MySQL on customer sales data.
distributed system built in Java that will run on two Google Cloud Platform Linux virtual
A complete end-to-end project for building a Data Warehouse using IMDb data with Talend for ETL and Power BI for insightful visualizations. Includes a star schema, optimized database, and interactive dashboards.
Add a description, image, and links to the data-warehouse topic page so that developers can more easily learn about it.
To associate your repository with the data-warehouse topic, visit your repo's landing page and select "manage topics."