Database Lecture06
Database Lecture06
Database Design
1
Chapter Objectives
2. Describe the five phases in the Systems Development Life Cycle (SDLC)
3. Design databases using the six phases in the Database Life Cycle (DBLC) framework
4. Conduct evaluation and revision within the SDLC and DBLC frameworks
2
The Information System (1 of 2)
• The database is part of a larger whole known as an information system (IS), which
provides for data collection, storage, and retrieval
− A complete information system is composed of people, hardware, and software,
the database(s), application programs, and procedures
• Systems analysis is the process that establishes the need for an information system
and its extent
3
The Information System (2 of 2)
• The term database development describes the process of database design and its
implementation
4
The Systems Development Life Cycle
• The Systems Development Life Cycle (SDLC) traces the history of an information
system
− The SDLC provides the big picture within which database design and application
development can be mapped out and evaluated
5
The Systems Development Life Cycle
6
The Systems Development Life Cycle
• The SDLC planning phase yields a general overview of the company and its objectives
• During the SDLC analysis phase problems that were defined during the planning
phase are examined in greater detail
• In the detailed systems design phase, the designer completes the design of the
system’s processes
• During the implementation phase, the hardware, DBMS software, and application
programs are installed, and the database design is implemented
• Maintenance can be grouped into three types: corrective, adaptive, and perfective
7
The Database Life Cycle
• The database life cycle (DBLC) is a cycle that races the history of a database within
an information system
8
The Database Life Cycle
9
The Database Initial Study (1 of 2)
10
The Database Initial Study (2 of 2)
11
The Database Design Process
12
Implementation and Loading
• The output of the database design phase is a series of instruction detailing the
creation of tables, attributes, domains, views, indexes, security constraints, and
storage and performance guidelines
13
Testing and Evaluation (1 of 2)
14
Testing and Evaluation (2 of 2)
15
Operation
• Once the database has passed the evaluation and testing stage, it is considered
operational
• The beginning of the operational phase starts the process of system evolution
• Problems that could not have been foreseen during the testing phase being to surface
during the operational phase
16
Maintenance
17
Knowledge Check Activity 9-1
• What does the acronym DBLC mean, and what does a DBLC portray?
18
Knowledge Check Activity 9-1: Answer
• What does the acronym DBLC mean, and what does a DBLC portray?
Answer: DBLC is the acronym that is used to label the Database Life
Cycle. The DBLC traces the history of a database system from its
inception to its obsolescence. The DBLC is composed of six phases:
initial study, design, implementation and loading, testing and evaluation,
operation, and maintenance and evolution.
19
Conceptual Design
• The goal at this stage is to design a database independent of database software and
physical details
• The output of this process is a conceptual data model that describes main data
entities, attributes, relationships, and constrains
20
Data Analysis and Requirements
• The first step in conceptual design is to discover the characteristics of the data
elements
• The designer obtains information from a variety of sources including the following:
− Developing and gathering end-user data views
− Directly observing current system – existing and desired output
− Interfacing with the systems design group
21
Entity Relationship Modeling and
Normalization (1 of 3)
• All objects (entities, attributes,
relations, views, and so on) are
defined in a data dictionary,
which is used in tandem with the
normalization process
22
Entity Relationship Modeling and
Normalization (2 of 3)
• During the ER modeling process, the designer much do the following:
− Define entities, attributes, primary keys, and foreign keys
− Make decisions about adding new primary key attributes
− Make decisions about the treatment of composite and multivalued
− Make decisions about adding derived attributes to satisfy processing requirements
− Make decisions about the placement of foreign keys in 1:1 relationships
− Avoid unnecessary ternary or higher degree relationships
− Draw the corresponding ER diagram
23
Entity Relationship Modeling and
Normalization (3 of 3)
• During the ER modeling process, the designer much do the following (continued):
− Normalize the entities
− Include all data element definitions in the data dictionary
− Make decisions about standard naming conventions
24
Data Model Verification
25
Distributed Database Design
• If the database data and processes will be distributed across the system, portions of
a database, known as database fragments, may reside in several physical locations
• Distributed database design defines the optimum allocation strategy for database
fragments to ensure database integrity, security, and performance
26
Knowledge Check Activity 9-2
27
Knowledge Check Activity 9-2: Answer
28
DBMS Software Selection
• The following are common factors that affect the purchasing decision:
− Cost
− DBMS features and tools
− Underlying model
− Portability
− DBMS hardware requirements
29
Logical Design
30
Map the Conceptual Model to the Logical
Model Components
• The process of translating the conceptual model into a set of relations is performed
in the following steps:
− Step 1 – Map strong entities
− Step 2 – Map supertype/subtype relationships
− Step 3 – Map weak entities
− Step 4 – Map binary relationships
− Step 5 – Map higher-degree relationships
31
Validate the Logical Model Using
Normalization
• The logical design should contain only properly normalized tables
• Activities such as normalization take place at different stages in the design process
• Each time you reiterate a step, the model is further refined and better documented
32
Validate the Logical Model Integrity
Constraints
• All defined constraints must be supported by the logical data model
• Special attention is needed at this stage to ensure that all views can be resolved and
that security is enforced to ensure the privacy of data
33
Validate the Logical Model Against User
Requirements
• The final step in the logical design process is to validate all logical model definitions
against all end-user data, transaction, and security requirements
• Once the steps of the logical design are complete, attention can now be placed on
defining the physical requirements that allow the system to function within the
selected DBMS/hardware environment
34
Physical Design
• Physical design is the process of determining the data storage organization and data
access characteristics of the database to ensure its integrity, security, and
performance
− This is the last stage of the database design process
35
Define Data Storage Organization
• Designers must determine the volume of data to be managed and the data usage
patterns
36
Define Integrity and Security Measures
• In this step of physical design, the following two tasks must be addressed:
− Define user and security groups and roles
− Assign security controls
37
Determine Performance Measurements
• Physical design performance measurement deals with fine-tuning the DBMS and
queries to ensure that they will meet end-user performance requirements as
identified in the previous stages
38
Database Design Strategies (1 of 2)
39
Database Design Strategies (2 of 2)
40
Centralized versus Decentralized Design (1
of 5)
• Centralized design is a process by which all database design decisions are carried out
centrally by a small group of people
− This process is typical of relatively simple, small databases and can be
successfully done by a single database administrator
41
Centralized versus Decentralized Design (2
of 5)
42
Centralized versus Decentralized Design (3
of 5)
• Decentralized design might be used when the system’s data component has a
considerable number of entities and complex relations on which very complex
operations are performed
− It is also often used when the problem itself is spread across several operational
sites and each element is a subset of the entire data set
43
Centralized versus Decentralized Design (4
of 5)
44
Centralized versus Decentralized Design (5
of 5)
45
Knowledge Check Activity 9-3
46
Knowledge Check Activity 9-3: Answer
47
Summary
Now that the lesson has ended, you should be able to:
2. Describe the five phases in the Systems Development Life Cycle (SDLC)
3. Design databases using the six phases in the Database Life Cycle (DBLC) framework
4. Conduct evaluation and revision within the SDLC and DBLC frameworks
48