Chapter - 7 Distributed Database System
Chapter - 7 Distributed Database System
1
Outline
1 Distributed Database Concepts
4 Query Processing
– Cost
– Security
– Lack of standards
– Lack of experience
Site 1
Distributed database
• There are two types of distributed database architecture:
• Homogeneous Distributed Database.
• Heterogeneous Distributed Database.
Types of Distributed Database Systems
Autonomous Database
An autonomous database is a cloud database that uses machine learning to
automate database tuning, security, backups, updates, and other routine
management tasks traditionally performed by DBAs.
Homogeneous
• All sites of the database system have identical setup, i.e., same database
system software.
• The underlying operating systems can be a mixture of Linux, Window,
Unix, etc.
• For example, all sites run Oracle or DB2, or Sybase or some other database
system.
Window
Advantages Site 5 Unix
Oracle Site 1
Easy to use Oracle
Window
Easy to mange Site 4 Communications
neteork
Easy to Design
Oracle
Disadvantages Site 3 Site 2
Linux Oracle Linux Oracle
Difficult for most organizations to
force a homogeneous environment
Heterogeneous
Different data center may run different DBMS products, with possibly different underlying data models.
Network
Object DBMS
Oriented Site 3 Site 2 Relational
Linux Linux
Heterogeneous
Advantages
Huge data can be stored in one Global center from different data center
Remote access is done using the global schema.
Different DBMSs may be used at each node
Disadvantages
Difficult to mange
Difficult to design.
.
Federated Database Management Systems
• Differences in constraints
− Client/Server architecture.
– Fully distributed DBMS with support for multiple DPs and
TPs at multiple sites.
– Homogeneous DDMS
Integrate only one type of centralized DBMS over the network.
– Heterogeneous DDBMS
Integrate different types of centralized DBMSs over a network.
Distributed DB Transparency
– Distribution transparency
– Transaction transparency
– Failure transparency
– Performance transparency
– Heterogeneity transparency
Distribution Transparency
• Distribution transparency allows us to manage a physically
dispersed database as though it were a centralized database.
– Fragmentation transparency
– Location transparency
– Remote Transactions
– Distributed Transactions
– Distributed Requests
A Remote Request
Allows us to access data to be processed by a single remote database
processor.
A Remote Transaction
Composed of several requests, may access data at only a single
site.
Allows a transaction to reference several different (local or remote) DP
sites.
A Distributed Request
Reference data from several remote DP sites.
Allows a single request to reference a physically partitioned table.
Example2:
Distributed Request
Transaction Transparency
Two-Phase Commit Protocol
DO performs the operation and records the “before” and “after”
values in the transaction log.
UNDO reverses an operation, using the log entries written by the
DO portion of the sequence.
REDO redoes an operation, using the log entries written by DO
portion of the sequence.
Horizontal fragmentation
Vertical fragmentation
Mixed fragmentation
Data Fragmentation
Horizontal Fragmentation - Consists of a subset of the tuples
of a relation.
Fragment represents the equivalent of a SELECT statement, with
the WHERE clause on a single attribute.
Data Fragmentation
Vertical fragment Consists of a subset of the attributes of a
relation.
Equivalent to the PROJECT statement.
Data Fragmentation