Distributed Multimedia & Database System
Distributed Multimedia & Database System
10,000,000,000,000 bytes!
INTERQUERY PARALLELISM
• Improves Throughput.
INTRAQUERY PARALLELISM
• Speed-Up.
• Scale-up.
– a: Shared memory.
– b: Shared disk.
– c: Shared nothing.
PARALLEL vs. DISTRIBUTED TECHNOLOGY
Parallel system architectures:
CPU MEMORY
CPU
CPU
CPU
CPU
CPU
Shared Disk – Parallel Database Architecture
M CPU
M CPU
M CPU
M CPU
M CPU
M CPU
Shared Nothing – Parallel Database Architecture
M CPU
CPU M
M CPU
CPU M
M CPU
MAINFRAME DATABASE SYSTEM
TERMINALS
DUMB
Optimises queries
SERVER
CLIENT
#2
D/BASE
CLIENT
#3
DATA LOGIC
PRESENTATION LOGIC
BUSINESS LOGIC Data Request
(FAT CLIENT) Data Response
CLIENT CLIENT/SERVER
#1
DBMS ARCHITECTURE
SERVER
CLIENT
#2
D/BASE
PL/SQL
CLIENT
#3
BUSINESS LOGIC
DATA LOGIC
PRESENTATION LOGIC
(THIN CLIENT) Data Request
Data Response
DISTRIBUTED PROCESSING ARCHITECTURE
CLIENT CLIENT
CLIENT CLIENT
Stratford Leyton
CLIENT CLIENT
CLIENT CLIENT
DBMS
LAN LAN
CLIENT CLIENT
CLIENT CLIENT
Barking Leytonstone
DISTRIBUTED DATABASE SYSTEM
DISTRIBUTED DATABASE
DBMS
DBMS
LAN
Stratford Leyton
CLIENT
CLIENT CLIENT CLIENT CLIENT
DBMS
DBMS
LAN
Barking Leytonstone
M:N CLIENT/SERVER DBMS ARCHITECTURE
SERVER #1
CLIENT
#1
D/BASE
CLIENT
#2
SERVER #2
D/BASE
CLIENT
#3
NOT TRANSPARENT!
COMPONENTS OF A DDBMS
Site 1
DDBMS
DC LDBMS
GSC
Computer DB
Network
GSC
DDBMS
LDBMS = Local DBMS
DC DC = Data Communications
GSC = Global Systems Catalog
Site 2 DDBMS = Distributed DBMS
ADVANTAGE OF DISTRIBUTED DATABASES
Management of distributed data with different levels of transparency
(This refers to the physical placement of data (files, relations, etc.)
which is not known to the user (distribution transparency).
Distribution or network transparency- Users do not have to worry
about operational details of the network.
Location transparency (refers to freedom of issuing command
from any location without affecting its working).
Naming transparency (allows access to any names object
(files, relations, etc.) from any location).
Replication transparency- allows to store copies of a data at
multiple sites. This is done to minimize access time to the
required data.
User is unaware of the existence of multiple copies
Fragmentation transparency-Allows to fragment a relation
horizontally (create a subset of tuples of a relation) or vertically
(create a subset of columns of a relation).
Horizontal fragmentation
Vertical fragmentation
ADVANTAGE OF DISTRIBUTED DATABASES
Increased Reliability and Availability
Reliability – Probability that a system is running at a given time.
Availability – Probability that a system is continuously available
during a time interval .When the data and the DBMS software are
distributed over several sites ,one site may fail other sites continue
to operate. Only the data and the software that exist at the failed
site cannot be accessed. This improves both reliability and
availability.
Improved Performance
Data Localization – A Distributed database management system
fragments the database by keeping the data closer to where it is
needed. Data Localization reduces the contention for CPU and I/O
services and simultaneously reduces access delays involved in
wide area networks.
1. Architectural complexity.
2. Cost.
3. Security.
5. Lack of standards.
6. Lack of experience.
Security
Proper management of security of the data
Proper authorization/access privileges of users
• Fragmentation
– Relation may be divided into a number of sub-relations,
which are then distributed.
• Allocation
– Each fragment is stored at site with "optimal"
distribution.
• Replication
– Copy of fragment may be maintained at several sites.
WHY FRAGMENT DATA?
Þ Usage
Applications are usually interested in ‘views’ not whole relations.
Þ Efficiency
It’s more efficient if data is close to where it is frequently used.
Þ Parallelism
It is possible to run several ‘sub-queries’ in random.
Þ Security
If data not required by local applications, is not stored at the local
site.
DATA FRAGMENTATION
Consider the Employee relation with selection condition (DNO = 5). All
tuples satisfy this condition will create a subset which will be a horizontal
fragment of Employee relation.
NETWORK ADMINISTRATION
S# LOGIN-ID PASSWORD
200 JON200T XXYY22
324 GRA324S ZZEE56
456 KHA456T KJTR78
MIXED FRAGMENTATION
Allocation Schema
Describes the allocation of fragments to sites of the DDBs
DATA REPLICATION
Process of storing data in more than one site
Replication Schema
Description of the replication of fragments
Fully replicated distributed database
Replicating the whole database at every site
Improves availability
Improves performance of retrieval
Can slow down update operations drastically
Expensive concurrency control and recovery techniques
No replication distributed database
Each fragment is stored exactly at one site
All fragments must be disjoint except primary keys
Also called Non-redundant allocation
Partial Replication
Some fragments may be replicated while others may not
Number of copies range from one to total number of sites in a
distributed system
TYPES OF DISTRIBUTED DATABASE
SYSTEM
Homogeneous: All sites of the Window
database system have identical Site 5 Unix
Oracle Site 1
setup, i.e., same database system Oracle
software. The underlying Window
Site 4 Communications
operating system may be neteork
different. For example, all sites
run Oracle or DB2, or Sybase or Oracle
some other database system. The Site 3 Site 2
Linux Oracle Linux Oracle
underlying operating systems can
be a mixture of Linux, Window,
Unix, etc. The clients thus have
to use identical client software.
TYPES OF DISTRIBUTED DATABASE SYSTEM