DW 101: Introduction To Data Warehouse
DW 101: Introduction To Data Warehouse
DW 101: Introduction To Data Warehouse
Data Warehouse
Objective
To provide basic understanding about data
warehouse concepts
In a way that everyone involved in data
warehouse project have common
understanding about data warehouse
concepts
So that the data warehouse project team can
effectively communicate under the same
understanding
Acknowledgement
This presentation is summarized from the first
chapter of The data warehouse lifecycle
toolkit : expert methods for designing,
developing, and deploying data warehouses
by Ralph Kimball and others.
Agenda
The
Extract
Storage:
Populate,
replicate,
recover
flat files;
RDBMS;
other
Processing:
Extract
Extract
clean;
prune;
combine;
remove duplicates;
household;
standardize;
conform dimensions;
store awaiting replication;
archive;
export to data marts
Presentation Servers
Populate,
replicate,
recover
Feed
Feed
Report Writers
Feed
DW
BUS
Data Mart #2
Populate,
replicate,
recover
DW
BUS
Conformed dimensions
Conformed facts
Feed
Conformed dimensions
Conformed facts
Models
forecasting;
scoring;
allocating;
data mining;
other downstream systems;
other parameters;
special UI
Data Mart #3
upload cleaned
dimensions
Upload model
results
Source System
An
Presentation Server
The
Dimensional Model
A specific
Data Mart
A logical subset of the complete data
warehouse
A data warehouse is made up of the union of
all its data marts
Without conformed dimensions and
conformed facts, a data mart is a stovepipe
Data mart can contains not only the summary
data but also atomic data
Ad
Application
Meta Data
All
and indexing
Quality assurance checking
Release/publishing
feedback
Auditing
Securing
Backing up and recovering
Extracting
First
Transforming
Release/Publishing
When
Updating
Modern data marts may be updated,
sometimes frequently
Managed load updates, not transactional
updates
Triggers of the update includes
Data correction
Changes in labels
Changes in hierarchies
Changes in status
Changes in corporate ownership
Querying
Means
Data Feedback
May
Auditing
Critically
Securing
Data