What Is Dimensional Model
What Is Dimensional Model
Warehouse?
What is Dimensional Model?
A dimensional model is a data structure technique optimized for Data warehousing
tools. The concept of Dimensional Modelling was developed by Ralph Kimball and
is comprised of "fact" and "dimension" tables.
These dimensional and relational models have their unique way of data storage
that has specific advantages.
Hence, Dimensional models are used in data warehouse systems and not a good fit
for relational systems.
Dimension
Dimension provides the context surrounding a business process event. In simple
terms, they give who, what, where of a fact. In the Sales business process, for the
fact quarterly sales number, dimensions would be
• Who – Customer Names
• Where – Location
• What – Product Name
Attributes
The Attributes are the various characteristics of the dimension.
• State
• Country
• Zipcode etc.
Attributes are used to search, filter, or classify facts. Dimension Tables contain
Attributes
Fact Table
A fact table is a primary table in a dimensional model.
1. Measurements/facts
2. Foreign key to dimension table
Dimension table
• A dimension table contains dimensions of a fact.
• They are joined to fact table via a foreign key.
• Dimension tables are de-normalized tables.
• The Dimension Attributes are the various columns in a dimension table
• Dimensions offers descriptive characteristics of the facts with the help of
their attributes
• No set limit set for given for number of dimensions
• The dimension can also contain one or more hierarchical relationships
Steps of Dimensional Modelling
The accuracy in creating your Dimensional modeling determines the success of
your data warehouse implementation. Here are the steps to create Dimension
Model
The model should describe the Why, How much, When/Where/Who and What of
your business process
Step 1) Identify the business process
Identifying the actual business process a data rehouse should cover. This could be
Marketing, Sales, HR, etc. as per the data analysis needs of the organization. The
selection of the Business process also depends on the quality of data available for
that process. It is the most important step of the Data Modelling process, and a
failure here would have cascading and irreparable defects.
To describe the business process, you can use plain text or use basic Business
Process Modelling Notation (BPMN) or Unified Modelling Language (UML).
Example of Grain:
The CEO at an MNC wants to find the sales for specific products in different
locations on a daily basis.
Example of Dimensions:
The CEO at an MNC wants to find the sales for specific products in different
locations on a daily basis.
Attributes: For Product: Product key (Foreign Key), Name, Type, Specifications
Example of Facts:
The CEO at an MNC wants to find the sales for specific products in different
locations on a daily basis.
1. Star Schema
The star schema architecture is easy to design. It is called a star schema because
diagram resembles a star, with points radiating from a center. The center of the
star consists of the fact table, and the points of the star is dimension tables.
The fact tables in a star schema which is third normal form whereas dimensional
tables are de-normalized.
2. Snowflake Schema
Summary:
• A dimensional model is a data structure technique optimized for Data
warehousing tools.
• Facts are the measurements/metrics or facts from your business process.
• Dimension provides the context surrounding a business process event.
• The Attributes are the various characteristics of the dimension.
• A fact table is a primary table in a dimensional model.
• A dimension table contains dimensions of a fact.
• There are three types of facts 1. Additive 2. Non-additive 3. Semi- additive .
• Types of Dimensions are Conformed, Outrigger, Shrunken, Role-playing,
Dimension to Dimension Table, Junk, Degenerate, Swappable and Step
Dimensions.
• Five steps of Dimensional modeling are 1. Identify Business Process 2.
Identify Grain (level of detail) 3. Identify Dimensions 4. Identify Facts 5. Build
Star
• In Dimensional modeling, there is need to ensure that every fact table has an
associated date dimension table.