Data Warehousing and OLAP Technology for Data Mining: Chapter 3
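[Figure: lattice of cuboids over the dimensions time, item, location, supplier.
0-D (apex) cuboid: all.
3-D cuboids: (time, item, location), (time, item, supplier), (time, location, supplier), (item, location, supplier).
4-D (base) cuboid: (time, item, location, supplier).]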
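As a minimal sketch of the lattice structure, the cuboids can be enumerated as the subsets of the dimension set. The function name `cuboid_lattice` is hypothetical, and the sketch assumes no concept hierarchies on the dimensions, so each dimension is either present in a cuboid or fully aggregated away.

```python
from itertools import combinations


def cuboid_lattice(dimensions):
    """Map each level k to the list of k-dimensional cuboids.

    A cuboid is represented as a tuple of dimension names; the empty
    tuple stands for the 0-D (apex) cuboid, i.e. "all".
    """
    return {
        k: list(combinations(dimensions, k))
        for k in range(len(dimensions) + 1)
    }


dims = ["time", "item", "location", "supplier"]
lattice = cuboid_lattice(dims)

for k, cuboids in lattice.items():
    print(f"{k}-D: {len(cuboids)} cuboid(s)")

# Total number of cuboids is 2**n for n dimensions without hierarchies.
total = sum(len(cuboids) for cuboids in lattice.values())
print("total:", total)
```

With four dimensions this gives 1, 4, 6, 4, and 1 cuboids at levels 0-D through 4-D, 16 in total.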
October 27, 2020 Data Mining: Concepts and Techniques 14
Conceptual Modeling of Data Warehouses
Modeling data warehouses: dimensions and measures
◦ Star schema: a fact table in the middle connected to a set of
dimension tables
◦ Snowflake schema: a refinement of the star schema in which some
dimensional hierarchies are normalized into a set of smaller
dimension tables, forming a shape similar to a snowflake
◦ Fact constellation: multiple fact tables share dimension
tables; viewed as a collection of stars, it is therefore called a
galaxy schema or fact constellation
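The star schema described above can be sketched with an in-memory SQLite database. This is an illustration under assumptions, not the schema from the slides: the table names (`sales_fact`, `branch`, `location`), the exact column choices, and all inserted rows are hypothetical, and only two dimensions are shown.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# One fact table in the middle, referencing two dimension tables.
conn.executescript("""
CREATE TABLE branch (
    branch_key  INTEGER PRIMARY KEY,
    branch_name TEXT,
    branch_type TEXT
);
CREATE TABLE location (
    location_key INTEGER PRIMARY KEY,
    street       TEXT,
    city         TEXT,
    country      TEXT
);
CREATE TABLE sales_fact (
    branch_key   INTEGER REFERENCES branch(branch_key),
    location_key INTEGER REFERENCES location(location_key),
    units_sold   INTEGER,   -- measure
    dollars_sold REAL       -- measure
);
""")

# Hypothetical rows, for illustration only.
conn.executemany("INSERT INTO branch VALUES (?, ?, ?)",
                 [(1, "B1", "retail"), (2, "B2", "outlet")])
conn.executemany("INSERT INTO location VALUES (?, ?, ?, ?)",
                 [(10, "1 Main St", "Vancouver", "Canada"),
                  (20, "2 Oak Ave", "Chicago", "U.S.A")])
conn.executemany("INSERT INTO sales_fact VALUES (?, ?, ?, ?)",
                 [(1, 10, 5, 500.0), (1, 20, 3, 300.0), (2, 20, 2, 250.0)])

# Aggregate the measures along one dimension by joining fact to dimension.
rows = conn.execute("""
    SELECT l.country, SUM(f.units_sold), SUM(f.dollars_sold)
    FROM sales_fact AS f
    JOIN location AS l ON f.location_key = l.location_key
    GROUP BY l.country
    ORDER BY l.country
""").fetchall()
print(rows)
```

A snowflake variant would move `city` and `country` out of `location` into a separate city table referenced by a `city_key`, at the cost of one more join per query.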
[Figure: example sales schema (partially recoverable).
Fact table: branch_key, location_key, units_sold, dollars_sold, avg_sales (measures).
branch: branch_key, branch_name, branch_type.
location: location_key, street, city_key.
city: city_key, city, province_or_street, country.]
[Figure: dimension hierarchies; recoverable level labels: all, Office, Day, Month.]
A Sample Data Cube
[Figure: sample data cube. Product axis: TV, PC, VCR, sum. Country axis: U.S.A, Canada, Mexico, sum.]
[Figure: cuboids corresponding to the cube.
0-D (apex) cuboid: all.
1-D cuboids: product, date, country.
3-D (base) cuboid: (product, date, country).]
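To make the relation between the cube and its cuboids concrete, here is a small sketch that computes the sum aggregate for every cuboid over product, date, and country. The fact rows, the quarter labels used as dates, and the helper name `aggregate` are all invented for illustration; only the dimension names and the product and country values come from the figure.

```python
from collections import defaultdict
from itertools import combinations

DIMENSIONS = ("product", "date", "country")

# Hypothetical base-cuboid cells: (product, date, country, units_sold).
facts = [
    ("TV", "Q1", "U.S.A", 10),
    ("TV", "Q1", "Canada", 4),
    ("PC", "Q1", "U.S.A", 7),
    ("PC", "Q2", "Mexico", 3),
    ("VCR", "Q2", "Canada", 6),
]


def aggregate(rows, group_by):
    """Sum the measure (last column) grouped by the given dimensions."""
    positions = [DIMENSIONS.index(d) for d in group_by]
    totals = defaultdict(int)
    for row in rows:
        key = tuple(row[i] for i in positions)
        totals[key] += row[-1]
    return dict(totals)


# One group-by per cuboid: every subset of the dimensions.
cube = {
    dims: aggregate(facts, dims)
    for k in range(len(DIMENSIONS) + 1)
    for dims in combinations(DIMENSIONS, k)
}

print(cube[()])            # apex cuboid: grand total over all dimensions
print(cube[("product",)])  # 1-D cuboid: totals per product
```

The empty group-by corresponds to the apex cuboid ("all"), and grouping by all three dimensions reproduces the base cuboid. The "sum" entries on the cube's faces are the cells of the lower-dimensional cuboids.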
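Multi-Tier Data Warehouse
[Figure: multi-tier data warehouse architecture.
Data sources: operational DBs and other sources.
Data storage: Extract, Transform, Load, and Refresh feed the data warehouse and data marts, supported by a monitor and integrator and by metadata.
OLAP server: serves the front-end tools.
Front-end tools: analysis, query, reports, data mining.]
Distributed Data Marts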