0% found this document useful (0 votes)

67 views

Database Systems Model

This document presents an overview of distributed database systems. It discusses why databases are distributed across multiple sites on a computer network, what objects are distributed, and how distribution is implemented. Specifically, it addresses limitations of centralized databases, strategies for distributing data like replication and fragmentation, and the role of distributed database management systems in maintaining the distributed schema and coordinating transactions across sites. It proposes developing a mathematical model for distributed database design to standardize the integration of global and local schemas and aid in interpretation and implementation of distribution approaches.

Uploaded by

Mani Ammal

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

67 views

Database Systems Model

Uploaded by

Mani Ammal

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

International Journal of Computer Science and Mathematical Theory E-ISSN 2545-5699 P-ISSN 2695-1924,

Vol 7. No. 1 2021 www.iiardpub.org

Database Systems Model: Distributed

Woko, Ovunda
Research Scholar, School of Post Graduate Studies,
Department of Computer Science,
Faculty of Natural and Applied Sciences,
Ignatius Ajuru University of Education.
ovundawoko@gmail.com

Asagba, Prince Oghenekaro

Professor, Department of Computer Science,
Faculty of Science, University of Port Harcourt,
Choba, Port Harcourt
asagba.prince@uniport.edu.ng

Abstract
A Database is a collection of structured data describing the activities of one or more related
organizations with a specific defined purpose. Databases are controlled by Database
Management System by maintaining and utilizing large collections of data. A Distributed
Database is a collection of multiple, logically organized databases distributed over a Computer
Network. It is also a collection of databases that can be stored at different computer network
sites. This work presents an overview of Distributed Database System: explaining why we
distribute, what we distribute, and how we distribute. This paper classified distributed database
objects, stated various strategies for distribution and proposed a standard mathematical model
for understanding, interpreting and implementing the distributed database design method.

Keywords: Database, Distributed Database, Database Management System, Mathematical

Model

1.0 ITRODUCTION
Some couple of years ago, numerous organizations migrated from a paradigm of data
processing in which each application defined and maintained its own data (Traditional File
Processing) as shown in Figure1 to one in which the data are defined and administered centrally
(Database Processing) as shown in Figure 2 ( Özsu, M. T. & Valduriez, P., 2011).

Figure 1: Traditional File Processing

IIARD – International Institute of Academic Research and Development Page 41

International Journal of Computer Science and Mathematical Theory E-ISSN 2545-5699 P-ISSN 2695-1924,
Vol 7. No. 1 2021 www.iiardpub.org

DBMS

Figure 2: Database Processing

Centralized Databases Systems (CDBS) were used for daily transactions in diverse domains of
activities: booking, library, banking, commerce, manufacturing, etc. Even nowadays, a handful
of organizations still adopt CDBS approach. However, there are performance, maintenance,
cost of data communication, scalability, and other limitations associated with centralized
database system during query processing as end-users from different sites query a single host.
Hence, these issues and advancement in computer networks motivated the design and
implementation of efficient Distributed Database Systems (DDBS) otherwise known as
Decentralized Database Systems (DDS).
Distributed Database System is a model derived from the combination of two entirely opposed
approaches to data processing: Databases and their Networking. This approach implements
different strategies like Data replication, Data fragmentation and Data allocation (Özsu, M. T.
& Valduriez, P., 2011; Shareef M. I. & Rawi A.W., 2011). A Distributed Database is a set of
more than one database interconnected and propagated physically across various locations
(sites) which communicate, via a computer network (Kaur K. & Singh H., 2016 ; Tomar, P.,
2014 ). Furthermore, Singh, I. and Singh, S. (2015) proposed a practical and explanatory
definition that “A Distributed Database is a collection of multiple, logically interrelated
databases distributed over a Computer Network”. He added that “sometimes Distributed
Database System is used to refer jointly to the Distributed Database and the Distributed
Database Management System”. In this approach, processing logic or elements, functions, data,
and control are distributed in a multiply location of a computer network (Tomar, P., 2014) as
shown in figure 3. However, the object of the distribution remains the data in the database.

IIARD – International Institute of Academic Research and Development Page 42

International Journal of Computer Science and Mathematical Theory E-ISSN 2545-5699 P-ISSN 2695-1924,
Vol 7. No. 1 2021 www.iiardpub.org

Site 2 DB

Computer Network
Site1 no DB
Site3 no DB

Site 5 DB
Site 4 DB

Figure 3: Distributed Database System Architecture

When designing a Distributed Database, it is required that it be fully accommodated or

fragmented on various sites in a computer network. Going by this approach in the computer
network, there must be at least two sites hosting the database and not certainly each site. The
main aim of a Distributed Database System is to appear as a centralized system to end-users
(Tomar, P. 2014). All the
Administrative activities of Distributed Database are piloted by the Distributed Database
Management Systems (DDBMS). The DDBM is a software that manages the distribution of
the Distributed Database to each site on the network, maintaining its database schema globally
and locally. The DDBM provides the capability for fragmentation, replication and allocation
of data on several sites as shown in Figure 1, which is different from the Centralized Database
System (CDBS), where only a replica of the Database is stored as shown in Figure 2 Singh, I.
and Singh, S. (2015). In a Centralized Database System (CDBS), the Database is managed by
one computer system on a central site (Site 2) and all query transactions from other sites are
directed to the central site as shown in Figure 4.

IIARD – International Institute of Academic Research and Development Page 43

International Journal of Computer Science and Mathematical Theory E-ISSN 2545-5699 P-ISSN 2695-1924,
Vol 7. No. 1 2021 www.iiardpub.org

Site 2 DB

Computer Network
Site1 no DB
Site3 no DB

Site 5 no DB

Figure 4. Centralized Database System Architecture

It is important to know that the most crucial objective of the database technology now is
integration, and not centralization. It is also important to know the concept of integration and
centralization are distinct (Özsu, M. T. & Valduriez, P., 2011) because integration can be
achieved without centralization. Therefore, Distributed Database System is an integrated
Database Technology distributed by Computer Networks as shown in Figure 5

Figure 5: Integration ≠ Centralization (Özsu, M. T. & Valduriez, P., 2011)

The modelling, designing, and implementation of the Distributed Database System is a

daunting task (Singh, I. & Singh, S., 2015; Bhuyar P. R., Gawande, A. D., & Deshmukh A.
B., 2012). The design of the Distributed Database System involves the global conceptual
schema, which is added to local schemas, based on the three-level architecture (Physical,
Conceptual, and External) of the DBMS in all sites. The establishment of a computer network
across sites of a distributed system is an additional complex problem of design (Katembo K.
E., Shri K., & Ruchi A., 2019). This daunting task for distributed database can be
mathematically model to establish a bases for its design approaches.
Models describe our beliefs about how the world functions. So, mathematical models formulate
and express real world activities and abstract ideas using mathematical well-defined rules and
IIARD – International Institute of Academic Research and Development Page 44
International Journal of Computer Science and Mathematical Theory E-ISSN 2545-5699 P-ISSN 2695-1924,
Vol 7. No. 1 2021 www.iiardpub.org

symbols. In research works, more often than not, mathematical framework or models form the
bases for communicating real world ideals. Mathematical models play vital roles for design of
many concepts. Therefore, a standard working model has to express the working concept of
distributed database.

In this work, we focus on why do we distribute, what is distributed, and how do we distribute
(model). We propose a mathematical model for distributed database design for easy expression
and interpretation of the integration of global conceptual schema and the local schemas since
distributed database system can only be achieved by integration especially with heterogeneous
databases.

2.0 REVIEW OF RELATED LITERATURE

Özsu, M. T. and Valduriez, P. (2011) designed the distributed database approach. The design
process phases include the company situation analysis, the problems definition and constraints,
the objectives definition, and the scope design and boundaries. The paper classified distributed
database design into Top-down and bottom-up design or approach. But these approaches lack
standard mathematical frame or model for easy interpretation and implementation.
Singh, I. and Singh, S. (2015) stated that the Bottom-up design process requires the following
steps: the selection of a mutual prototype to describe the global schema of the database; the
conversion of all local schemas into a mutual data model; and the unification of local patterns
to arrive at a mutual global schema. However, there is not standard mathematical frame or
model for unification of local mutually exclusive database to arrive at a mutual global schema
that can interpret all these.

Shareef M. I., and Rawi A. W. (2011) expressed distributed database model is a model that its
goal is to break the relation, to allocate and to replicate the fragment in different sites of the
distributed system with local optimization on each site. This model is shallow and only focused
on distributed database management system but not on distributed database.
Tomar, P. and Megha (2014) presented an overview of Distributed Database System along with
their advantages and disadvantages. This paper also provides various aspects like replication,
fragmentation and various problems that can be faced in distributed database systems.
Kumar, N., Bilgaiyan, S., & Sagnika, S. (2013) explained how the cost of implementing
multiple transparencies interact and how to reduce operating system and communication
stack.
Hiremath, D.S. & Kishore, S.B. (2016) emphasized on distributed database problem areas and
approaches.

3.0 DISCUSSION
This section discusses why we distribute, what we distribute and how distribution of database
is achieved, proposing a mathematical model for unification of local mutually exclusive
databases to arrive at a mutual global schema for easy interpretation and implementation of
distributed database design approach.

3.1 WHY DO WE DISTRIBUTE?

This is a fundamental question because many of the current applications of computer
technology are inherently distributed. Applications such as web-based applications, e-
commerce business over the Internet, multimedia applications, and manufacturing control
systems are all examples of distributed applications. From a more global view, however, it can
be identified that the fundamental reason behind distributed processing is to be better able to
cope with the challenges of huge data management problems that we face today, by using a
IIARD – International Institute of Academic Research and Development Page 45
International Journal of Computer Science and Mathematical Theory E-ISSN 2545-5699 P-ISSN 2695-1924,
Vol 7. No. 1 2021 www.iiardpub.org

variation of the well-known divide-and-conquer rule (Michel A. et al., 2016). Almeida F. and
Calistru, C. (2012) explained that data warehouse operational processes normally compose a
labour intensive workflow and constitute an integral part of the back-stage of data warehouse
architectures, where the collection, extraction, cleaning, transformation, and transport of data
takes place, in order to populate the warehouse. Tanenbaum, S. A. and Steen V. M. (2016)
stated goals distributed database: resource sharing, making distribution transparent, being
open, and being scalable are the four important goals for distributed database.
i. Support for resource sharing: The significant goal of a distributed system is to
make it easy for systems, applications and users (people) to access and share remote
resources. Resources can be virtually anything: processing logic or elements,
functions, data, and control. Common examples are peripherals, storage facilities,
data, files, services, and networks, etc. The reason for sharing of resource is because
of limited resources. It is cost effective to share a single high – end storage facility
on a network than to deploy the storage facility in each of the systems in the
network.

ii. Making distribution transparent: Hiding the processes and resources physically
distributed across multiple computers, perhaps separated by large distances is
crucial. Distributed systems tries to make the distribution of processes and
resources invisible to end users and applications. This is called transparency. There
are different transparencies require (Tanenbaum, S. A. & Steen V. M., 2016):
 Access - differences in data representation and how an object is accessed must be
hidden
 Migration – How objects move to another location must be hidden to end user
 Replication – How an object is replicated must be hidden to end user
 Location - Where an object is located must be hidden to end user
 Failure - Failure and recovery of an object must be hidden to end user
 Relocation - Hide that an object may be moved to another location while in use
 Concurrency- Hide that an object may be shared by several independent users

iii. Being open: Distributed systems involve a lot of integrated components in the
networks. An important fact is that the components must be flexibly used. If
components are not user friendly, then distribution is not opened for use. At user
end, the users must be able to use the system with little or no supervision.
Distributed systems must support Interoperability, composability, and extensibility.

iv. Being scalable: To distribute resources wide world, scalable design must a serious
goal, that is being able to accommodate more processes and resources in the future.
Scalability can be measured along at least three different dimensions (Neuman, B.,
1994):
 Size scalability - A system can be scalable with respect to its size, meaning that we can
easily add more users and resources to the system without any noticeable loss of
performance.
 Geographical scalability - A geographically scalable system is one in which the users
and resources may lie far apart, but the fact that communication delays may be
significant is hardly noticed.
 Administrative scalability - An administratively scalable system is one that can still be
easily managed even if it spans many independent administrative organizations.

IIARD – International Institute of Academic Research and Development Page 46

International Journal of Computer Science and Mathematical Theory E-ISSN 2545-5699 P-ISSN 2695-1924,
Vol 7. No. 1 2021 www.iiardpub.org

3.2 WHAT IS DISTRIBUTED?

In distributed database system, the following objects are distributed: Processing logic or
elements, Functions, Data, and Controls are distributed (Özsu, M.T. & Valduriez, P., 2001).
a. Processing logic or elements: These are numbers of autonomous processing elements (not
necessarily homogeneous) that are interconnected by a computer network, that cooperate in
performing their assigned tasks. The “processing element” are computing devices that can
execute a program on its own (Özsu, M.T. & Valduriez, P., 2011). Interconnection of these
processing logic or processing elements computing devices and fundamental across in
distributed database system,
b. Function: Functions that ensure that authorized users perform correct operations on the
database, contributing to the maintenance of database integrity. These functions are distributed
across to enable perform correct operation.
c. Data: The major reason for distribution is data in a base. Data stored must be made available
for use through distribution technology. Replication of logical data item and physical data items
become necessary (Özsu, M.T. & Valduriez, P., 2011). There are five categories of distributed
data namely: replicated data, horizontally fragmented data, vertically fragmented data,
reorganized data, and separate-schema data (Borysowich C., 2007).
d. Control: The definition of the rules for controlling data manipulation is part of the
administration of the database, a function generally is performed by a database administrator.
An important requirement of a centralized or a distributed DBMS is the ability to support
semantic data control - data and access control using high-level semantics (Özsu, M.T. &
Valduriez, P., 2011). According to Özsu, M.T. and Valduriez, P. (2011), semantic data control
typically includes view management, security control, and semantic integrity control. Rules for
semantic data control must be stored in a catalog, the management of a distributed directory
(also called a catalog) and be distributed.

3.2.1 Classification of Distributed Database System Objects

Therefore mentioned and explained distributed database system objects are classified into two:
Object of the distribution and Distributed Objects for Distribution. Object of the distribution is
the database object (relations) distributed and managed by Distributed Database Management
System, while the Distributed Objects for Distribution guarantee effective distribution of
database even if there are also distributed in a way. Distributed Database System Objects as
shown in Figure 6

Distributed Database System Objects

Object of the Distribution Distributed Objects for Distribution

Database (data) Controls Functions Processing Logic

Figure 6: Distributed Database System Object

IIARD – International Institute of Academic Research and Development Page 47

International Journal of Computer Science and Mathematical Theory E-ISSN 2545-5699 P-ISSN 2695-1924,
Vol 7. No. 1 2021 www.iiardpub.org

3.3 HOW DO WE DISTRIBUTED?

3.3.1 Classification of Database Systems
Database management systems can be classified based on several measures, such as the data
model, user numbers and database distribution. However, DBMS is classified into Centralized
database systems and Distributed Database System.
a. Centralized database systems
In a centralized database system, the Database Management System (DBMS) and database are
stored at a single site that is used by several other systems too. This is illustrated in Figure 2.
All the sites or workstations or nodes or terminals have access to the database at the site (central
computer).
b. Distributed Database System
In a distributed database system, the Database Management System (DBMS) and database are
stored at more than one site that is used by several other sites.
Distributed Database management system is a system that allocate a set of fragments F = {F1,
F2, …, Fm} resources across computer network nodes of a distributed environment comprising
sites S = {S1, S2, …, Sn} on which a set of query Q = {Q1, Q2, …, Qp} is running. Distributed
database model is a model that its goal is to break the relation, to allocate and to replicate the
fragment in different sites of the distributed system with local optimization on each site
(Shareef M. I., Rawi A. W., 2011). In a distributed database system, the actual database and
the DBMS software are distributed from various sites that are connected by a computer
network, as shown in Figure 3.

3.3.2 CLASSIFICATION OF DISTRIBUTED DATABASE SYSTEM

Distributed databases can be generally classified into homogeneous and heterogeneous
distributed database. Homogeneous is further classified into Autonomous and Non-
Autonomous, while Heterogeneous is classified into Federated and Multi-database as shown
in the Figure 6

Distributed Database Environment

Heterogeneous Homogenous

Autonomous Non - Autonomous Federated Multidatabase

Figure 7: Classification of Distributed Database Environment (source:

https://www.tutorialspoint.com)

3.3.2.1 Homogeneous Distributed Database System

Homogeneous distributed database systems use the same DBMS software from multiple site.
Data exchange between these various sites can be handled easily. The environment of a
homogeneous is typically defined by the following features:
i. Data are distributed across all the nodes.
ii. The same DBMS is used at each location.
iii. All data are managed by the distributed DBMS (so there
IIARD – International Institute of Academic Research and Development Page 48
International Journal of Computer Science and Mathematical Theory E-ISSN 2545-5699 P-ISSN 2695-1924,
Vol 7. No. 1 2021 www.iiardpub.org

are no exclusively Local data).

iv. Operating system may vary

Figure 8: Homogeneous distributed database system (source: google)

Data exchange or access policy between these various sites gives rise to two types of
homogeneous distributed database: Autonomous and Non-autonomous
i. Autonomous
Each database is independent and functions on its own at each of the sites. All the sites are
integrated by a controlling application and use message passing to share data updates. That is,
access to databases is done by a controlling application and a message passing to share data
updates.
ii. Non-autonomous
Data is distributed across the homogeneous nodes and a central or master DBMS co-ordinates
data updates across the sites.

3.3.2.2 Heterogeneous Distributed Database Systems

In a heterogeneous distributed database system, different sites may use different database
model, DBMS software and operating systems, but access to data policy can be different - a
single conceptual schema (global) access policy or multi conceptual schema access policy. The
Figure 10 depicts heterogeneous distributed database systems.

IIARD – International Institute of Academic Research and Development Page 49

International Journal of Computer Science and Mathematical Theory E-ISSN 2545-5699 P-ISSN 2695-1924,
Vol 7. No. 1 2021 www.iiardpub.org

Figure 9: Depicts Heterogeneous Distributed Database Systems (Source:google)

The data access policy gives rise to two types of heterogeneous distributed database
i. Federated: Here each site may run different database system but the data access is managed
through a single conceptual schema. This implies that the degree of local autonomy is
minimum. Each site must adhere to a centralized access policy. There may be a global schema.
Federated distributed Database Management Systems has the following Issues:
- Differences in data models: Relational, Objected oriented, hierarchical, network, etc.
- Differences in constraints: Each site may have their own data accessing and processing
constraints.
- Differences in query language: Some site may use SQL, some may use SQL-89, some
may use SQL-92, and so on.
ii. Multidatabase: There is no one conceptual global schema. For data access, a schema is
constructed dynamically as needed by the application software.

3.3.3 Distributed Database Management System (DDMS) Features

Since a distributed database management system (DDBMS) is a centralized software system
that manages a distributed database in a manner as if it were all stored in a single location, it
has following features:
i. It is used to create, retrieve, update and delete distributed databases.
ii. It synchronizes the database periodically and provides access mechanisms by the virtue
of which the distribution becomes transparent to the users.
iii. It ensures that the data modified at any site is universally updated.
iv. It is used in application areas where large volumes of data are processed and accessed
by numerous users simultaneously.
v. It is designed for heterogeneous database platforms.
vi. It maintains confidentiality and data integrity of the databases.

3.3.4 Factors for Organization’s Choice for DDBMS

The following factors encourage organizations to migrate to DDBMS
ii. Physical distributed nature of organizational units: Most organizations in the
nowadays are subdivided into multiple units that are physically distributed over the
globe. So, the overall database of the organization becomes distributed as each unit
requires its own set of local data.
iii. Need for sharing of data: Various organizational units regularly need to communicate
with each other and share their data and resources. This demands common databases
or replicated databases that should be used in a synchronized approach.
IIARD – International Institute of Academic Research and Development Page 50
International Journal of Computer Science and Mathematical Theory E-ISSN 2545-5699 P-ISSN 2695-1924,
Vol 7. No. 1 2021 www.iiardpub.org

iv. Support for both OLTP and OLAP: Online Transaction Processing (OLTP) and Online
Analytical Processing (OLAP) work upon diversified systems which may have
common data. Distributed database systems aid both these processing by providing
synchronized data.
v. Database recovery: One of the common techniques used in DDBMS is replication of
data across different sites. Replication of data repeatedly helps in data recovery if
database in any site is damaged. Users can access data from other sites while the
damaged site is being restored.
vi. Support for multiple application software: Quite number of organizations use a variety
of application software each with its specific database support. DDBMS provides a
uniform functionality for using the same data among dissimilar platforms.

3.3.5 Advantages of Distributed Database System (DDBS)

i. It increases reliability and availability. The problems experienced in one branch
or site of the organization can not affect other branches in the same manner
ii. It supports smooth transactions due to replication of the database.
iii. It supports hardware, operating system, network, fragmentation, DBMS,
replication and location independence.
iv. Its distributed query processing improves performance.
v. It supports distributed transaction management
vi. Performance of system cannot be affected by single-site
failure.

3.3.6 Disadvantages of Distributed Database System (DDBS)

According to Tomar, P. and Megha (2014) the following are the various disadvantages of
distributed databases.
i. Complexity-A distributed database is more complicated to setup and maintain as
compared to central database system.
ii. Security–There are many remote entry points to the system compared to central system
leading to security threats.
iii. Data Integrity–In distributed system it is very difficult to make sure that data and
indexes are not corrupted.
iv. In distributed database systems, data need to be carefully placed to make the system as
efficient as possible.
v. Distributed databases are not so efficient if there is heavy interaction between sites.
vi. Failures: Several types of failures may occur in distributed database systems like,
transaction failure, site failure, media failure, and communication failure.
vii. Economics: Increased complexity and a more extensive infrastructure means extra
labour costs.

3.3.6 Distributed Databases Design Methods

Designing Distributed Databases involve two main ingenuities: the top-down method and the
bottom-up method (Özsu, M. T. & Valduriez, P., 2011; Singh, I. & Singh, S., 2015; Hiremath
D., S. & Kishor S., B., 2016). These methods deliver very different techniques in the design
process. The top down method is much more suitable when designing homogeneous strongly
cohesive Distributed Databases, while the bottom-up method is more suitable for
heterogeneous or multidatabases (Katembo K. E., Shri K., & Ruchi A. , 2019).

a. Top-down method

IIARD – International Institute of Academic Research and Development Page 51

International Journal of Computer Science and Mathematical Theory E-ISSN 2545-5699 P-ISSN 2695-1924,
Vol 7. No. 1 2021 www.iiardpub.org

More often than not, top-down method is used when the Distributed database is implemented
from start as shown in the Figure 3. The design process starts from the analysis of requirements.
The design process phases include the company situation analysis, the problems definition and
constraints, the objectives definition, and the scope design and boundaries (Özsu, M. T. &
Valduriez, P; Katembo K. E., Shri K. & Ruchi A., 2019; Gadicha, A. B. et, al., 2012).

Figure 11: Top-Down Design Method (Özsu, M. T. & Valduriez, P. 2011)

- The conceptual modelling and view design are two next level tasks concerned. The conceptual
modelling formalizes and standardizes the entity relationships while focusing on data
requirements. Its modelling process controls the types of entities and the relationships among
them and then the entity analysis advances to determine the entities and their attributes.
- The View design provides interface for the end users. The functional analysis connected
determines the fundamental functions involved in associating the modelling.
- The View integration is the activity that defines the conceptual model which supports existing
applications as well as future applications (Özsu, M. T. & Valduriez, P. 2011).

b. Bottom-up method
This method is used when Distributed Database already exists and requires scalability to other
features or another Database have to be integrated into the existing environment (Hiremath D.,
S. & Kishor S. B., 2016). This method provides integration capability of several existing local
schemas into a global conceptual schema in already developing distributed system. The
bottom-up method is adopted when combining several existing databases to develop a
distributed system because it is based on the integration of several existing local schemas into
a single global schema. This capability integrates more than one existing heterogeneous
databases to build a distributed database system. This is also called ascending order method.
IIARD – International Institute of Academic Research and Development Page 52
International Journal of Computer Science and Mathematical Theory E-ISSN 2545-5699 P-ISSN 2695-1924,
Vol 7. No. 1 2021 www.iiardpub.org

Therefore, (Özsu, M. T. & Valduriez, P. , 2011; Singh, I. & Singh, S., 2015 ; Gadicha, A. B.
et, al. , 2012) states that the Bottom-up design process requires the following steps:
 The selection of a mutual prototype to describe the global schema of the database;
 The conversion of all local schemas into a mutual data model;
 The unification of local patterns to arrive at a mutual global schema

3.3.7 Proposed Model for Bottom-up Design Method for Distributed Database
We propose a mathematical model for distributed database design (Özsu, M. T. & Valduriez,
P. , 2011; Singh, I. & Singh, S., 2015 ; Gadicha, A. B. et, al. , 2012) for easy expression and
interpretation of the integration of global conceptual schema and the local schemas since
distributed database system can only be achieved by integration especially with heterogeneous
databases.
Let’s consider three heterogeneous local database schema to be integrated into a single global
distributed database schema:
Let O represent an Oracle db, M = Microsoft SQLServr db,
S = MySQL db.
Let db = Database
L = db (local database)
Let f(G) = function of a global distributed database schema
f(L) = function of a local database schema function
Thus: The Integrated global database of heterogeneous databases of Oracle, Microsoft
SQLServer and My SQL can be represented as follow:

N
f (G) =∑f(Lijk), where L = db and {x: ∀ O, M, S}
i =1,j= 1,k = 1
i∈O, j∈M, k∈S
Nothing that i,j,k are mutually exclusive.
3.3.8 Distribution strategies
The design of the Distributed Database System integrating global conceptual schema and local
schemas base on the three-level architecture of the DBMS in all sites; the complex design for
establishment of a computer network across sites of a distributed system; and the modelling
and implementation of all these make Distributed Database System a daunting task (Singh, I.
& Singh, S., 2015; Bhuyar P. R., Gawande A. D., & Deshmukh A. B., 2012). The Distributed
Database system provides the capability for fragmentation, replication and allocation of data
on several sites with help of efficient query join operators and optimization. Data fragmented,
replicated and allocated are strategies used to distribute to different sites. For the sake of space,
details of distribution strategies shall be the focus of our next work.

CONCLUSION
In a distributed database system, from a more global view, however, it can be identified that
the fundamental reason behind distributed processing is to be better able to cope with the
challenges of huge data management problems that we face today, by using a variation of the
well-known divide-and-conquer. Processing logic or processing elements, controls, and
functions are the distributed objects for distribution of relation objects, while the relations
object remain the main object of distribution via the computer networks. Data fragmented,
replicated and allocated are strategies used to distribute to different sites. Our proposed model

IIARD – International Institute of Academic Research and Development Page 53

International Journal of Computer Science and Mathematical Theory E-ISSN 2545-5699 P-ISSN 2695-1924,
Vol 7. No. 1 2021 www.iiardpub.org

becomes a mathematical model for understanding, interpreting and implementing the complex
distributed database design method for integration of mutually exclusive local schemas and
mutual global conceptual schema in the heterogeneous model.

REFERENCES
Almeida F. and Calistru, C. (2012). The main challenges and issues of big data management.
International Journal of Research Studies in Computing , 2(1). 11-20
Bhuyar P. R., Gawande, A. D., and Deshmukh A. B. (2012). Horizontal fragmentation
technique in distributed database. International Journal of Scientific and Research
Publications. 2(5).1-7.
Gadicha A.B, Alvi AS, Gadicha VB, Zaki SM (2012). Top-Down Approach Process Built on
Conceptual Design to Physical Design Using LIS, GCS Schema. In,ternational Journal
of Engineering Sciences & Emerging Technologies.
Hiremath D., S. and Kishor S., B. (2016). Distributed Database Problem areas and Approaches.
Journal of Computer Engineering: National Conference on Recent Trends in Computer
Science and Information Technology, 2278-8727.
Katembo K. E., Shri K. and Ruchi A. (2019). A Systematic Review on Distributed Databases
Kaur K., Singh H., “Distributed database system on web server: A Review”. International
Journal of Computer Techniques, 3, pp. 12-16,
Michel A. et al. (2016 ). Big Data Management Challenges, Approaches, Tools and their
limitations.
Retrieved 2021 from https://www.researchgate.net/publication/295134268_Big_Data_
Management_Challenges_Approaches_Tools_and_their_limitations.
Özsu M. T, Valduriez P. (2011). Principles of distributed database systems. Springer Science
& Business Media .
Özsu, M. T., & Valduriez, P. (2011). Introduction Principles of Distributed Database Systems.
Third Edition, 1–40. doi:10.1007/978-1-4419-8834-8_1
Özsu, M.T. & Valduriez, P. (2001). Principles of Distributed Database Systems. fourth Edition.
Shareef M. I. , Rawi A.W.(2011). The Customized Database Fragmentation Technique in
Distributed Database Systems.
Singh, I. and Singh, S. (2015). Distributed Database Systems: Principles, Algorithms and
Systems, New-Delhi, India: Khanna Book Publishing, Co.(P) Ltd.
Tomar, P.( 2014 ). An overview of distributed databases. International Journal of Information
and Computation Technology.

IIARD – International Institute of Academic Research and Development Page 54

The Organization of Information 4th Edition (2017, Libraries Unlimited)
97% (34)
The Organization of Information 4th Edition (2017, Libraries Unlimited)
483 pages
Mysql 3rd Edition
100% (10)
Mysql 3rd Edition
646 pages
Google Hacking Database
83% (18)
Google Hacking Database
91 pages
Dangerous Google - Searching For Secrets PDF
88% (26)
Dangerous Google - Searching For Secrets PDF
12 pages
Voyager 7S Data Dictionary - Through Update DB 5854 - 060619
67% (3)
Voyager 7S Data Dictionary - Through Update DB 5854 - 060619
3,877 pages
Data Structures Cheat Sheet
71% (14)
Data Structures Cheat Sheet
2 pages
Google Hacking Database
No ratings yet
Google Hacking Database
91 pages
Understanding Database Types - by Alex Xu
No ratings yet
Understanding Database Types - by Alex Xu
13 pages
How To Use Google Hack
100% (1)
How To Use Google Hack
4 pages
Policy Document Ucc Redemption Understanding The Process Further
80% (20)
Policy Document Ucc Redemption Understanding The Process Further
37 pages
Hackers Black Book (2011-Edition)
No ratings yet
Hackers Black Book (2011-Edition)
6 pages
Top 40 Data Structure Interview Questions and Answers (2021) - InterviewBit
100% (2)
Top 40 Data Structure Interview Questions and Answers (2021) - InterviewBit
31 pages
Dark Web Market Price Index Hacking Tools July 2018 Top10VPN2
91% (11)
Dark Web Market Price Index Hacking Tools July 2018 Top10VPN2
7 pages
Google Hacking
100% (7)
Google Hacking
66 pages
Color-Coded Genealogy Research Filing System
No ratings yet
Color-Coded Genealogy Research Filing System
15 pages
Kali Linux Tools Descriptions
100% (2)
Kali Linux Tools Descriptions
26 pages
Thesis Leadership of Apple
100% (1)
Thesis Leadership of Apple
78 pages
Anti-Cyber Crime Law RA 10175
No ratings yet
Anti-Cyber Crime Law RA 10175
25 pages
DCSA P3 Asset List Risk Assessment Framework Examples
No ratings yet
DCSA P3 Asset List Risk Assessment Framework Examples
5 pages
Case 1 Jaguar
No ratings yet
Case 1 Jaguar
4 pages
Measuring Value in The Public Sector
No ratings yet
Measuring Value in The Public Sector
6 pages
Distributed Database Design Methodologies: Stefan0 Ceri, Barbara Pernici, Wiederhold
No ratings yet
Distributed Database Design Methodologies: Stefan0 Ceri, Barbara Pernici, Wiederhold
14 pages
009 - Lungu, Velicanu, Botha PDF
No ratings yet
009 - Lungu, Velicanu, Botha PDF
16 pages
The Role of Database Systems in The Development of Digital Learning
No ratings yet
The Role of Database Systems in The Development of Digital Learning
10 pages
CSC401 Database Management ECU Final Part 1
No ratings yet
CSC401 Database Management ECU Final Part 1
26 pages
12 Follosco EAPP 2manynames
No ratings yet
12 Follosco EAPP 2manynames
10 pages
1 Slide
No ratings yet
1 Slide
35 pages
Six Layers Architecture Model For Object Oriented
No ratings yet
Six Layers Architecture Model For Object Oriented
4 pages
Assignment
100% (1)
Assignment
35 pages
Fundamentals of Database System - Module - CHapter - One
100% (1)
Fundamentals of Database System - Module - CHapter - One
19 pages
7
No ratings yet
7
66 pages
Introduction Merged
No ratings yet
Introduction Merged
13 pages
02 Handout 144-Unlocked
No ratings yet
02 Handout 144-Unlocked
3 pages
Distributed Database System
No ratings yet
Distributed Database System
6 pages
Course Pack - Introduction To Databases
No ratings yet
Course Pack - Introduction To Databases
41 pages
Using Ontologies To Overcoming Drawbacks of Databases and Vice Versa: A Survey
No ratings yet
Using Ontologies To Overcoming Drawbacks of Databases and Vice Versa: A Survey
21 pages
Module 1 Data Science
No ratings yet
Module 1 Data Science
8 pages
Chap 1 - Database and Database Users
No ratings yet
Chap 1 - Database and Database Users
27 pages
Introduction To DBMS
No ratings yet
Introduction To DBMS
41 pages
Is Worktext 1 Laboratory
No ratings yet
Is Worktext 1 Laboratory
20 pages
Short Notes Distributed Obect Multimedia Mobile Databases
No ratings yet
Short Notes Distributed Obect Multimedia Mobile Databases
9 pages
Baze de Date Prezent Si Viitor
No ratings yet
Baze de Date Prezent Si Viitor
16 pages
ICT503 - Database Management Systems - Presentation
No ratings yet
ICT503 - Database Management Systems - Presentation
11 pages
BITWeek1 - L2 - ITE2422 V1
No ratings yet
BITWeek1 - L2 - ITE2422 V1
13 pages
Unit 1 - DBMS-II BSC
No ratings yet
Unit 1 - DBMS-II BSC
97 pages
Advanced DataBases W
No ratings yet
Advanced DataBases W
5 pages
Database Lexicography: Gary Coen
No ratings yet
Database Lexicography: Gary Coen
22 pages
VTU Exam Question Paper With Solution of 18CS72 Big Data and Analytics Feb-2022-Dr. v. Vijayalakshmi
No ratings yet
VTU Exam Question Paper With Solution of 18CS72 Big Data and Analytics Feb-2022-Dr. v. Vijayalakshmi
25 pages
Bigdata Unit5
No ratings yet
Bigdata Unit5
20 pages
Design of A Spatial Data Warehouse Based On An Int
No ratings yet
Design of A Spatial Data Warehouse Based On An Int
9 pages
Adminjti,+8 +JURNAL+TRI+AMRI
No ratings yet
Adminjti,+8 +JURNAL+TRI+AMRI
10 pages
Block Introduction: Unit 1 Introduction To Object Oriented Database Management System Relational Model
No ratings yet
Block Introduction: Unit 1 Introduction To Object Oriented Database Management System Relational Model
50 pages
Unit 1 - DBMS-II BSC
No ratings yet
Unit 1 - DBMS-II BSC
74 pages
What Is Database.1
No ratings yet
What Is Database.1
10 pages
BSCIT 201 Database Management System
No ratings yet
BSCIT 201 Database Management System
171 pages
Document Clustering: Alankrit Bhardwaj 18BIT0142 Priyanshu Gupta 18BIT0146 Aditya Raj 18BIT0412
No ratings yet
Document Clustering: Alankrit Bhardwaj 18BIT0142 Priyanshu Gupta 18BIT0146 Aditya Raj 18BIT0412
33 pages
Distributed Database Management Systems
No ratings yet
Distributed Database Management Systems
6 pages
Rdbms III Sem
100% (1)
Rdbms III Sem
80 pages
Unit-III
No ratings yet
Unit-III
33 pages
IJETT-V71I7P218
No ratings yet
IJETT-V71I7P218
12 pages
Research Publish Journal
No ratings yet
Research Publish Journal
7 pages
Hierarchical Model Leads To The Evolution of Relational Model
No ratings yet
Hierarchical Model Leads To The Evolution of Relational Model
4 pages
DDBS Lec1
No ratings yet
DDBS Lec1
20 pages
Data Science Assesment 1-2
No ratings yet
Data Science Assesment 1-2
6 pages
Distributed Query Processing +
No ratings yet
Distributed Query Processing +
19 pages
Top Down Database Design
No ratings yet
Top Down Database Design
4 pages
DB Module Final
No ratings yet
DB Module Final
43 pages
Glossing The Information From Distributed Databases
No ratings yet
Glossing The Information From Distributed Databases
4 pages
Fundamentals of Database Systems
No ratings yet
Fundamentals of Database Systems
105 pages
Term Paper On Distributed Database
100% (1)
Term Paper On Distributed Database
5 pages
279 - DBMS Complete1
No ratings yet
279 - DBMS Complete1
121 pages
CPP106-MODULE - 10 - 2ndSEM - Database - Intro (2) (20230504181134)
No ratings yet
CPP106-MODULE - 10 - 2ndSEM - Database - Intro (2) (20230504181134)
11 pages
Libro Estruc Datos Amplios fnt23 Athanassoulis
No ratings yet
Libro Estruc Datos Amplios fnt23 Athanassoulis
168 pages
Sensing Cloud Computing in Internet of Things: A Novel Data Scheduling Optimization Algorithm
No ratings yet
Sensing Cloud Computing in Internet of Things: A Novel Data Scheduling Optimization Algorithm
13 pages
ddb unit 1-5
No ratings yet
ddb unit 1-5
190 pages
7 DBMS
No ratings yet
7 DBMS
27 pages
File Organization Terms and Concepts
No ratings yet
File Organization Terms and Concepts
3 pages
Gustavo Arcay Fase3
No ratings yet
Gustavo Arcay Fase3
5 pages
Query Processing and Interlinking of Fuzzy Object-Oriented Database
No ratings yet
Query Processing and Interlinking of Fuzzy Object-Oriented Database
6 pages
Databases: System Concepts, Designs, Management, and Implementation
From Everand
Databases: System Concepts, Designs, Management, and Implementation
Jonathan Rigdon
No ratings yet
Useful Google Hacks
100% (4)
Useful Google Hacks
7 pages
SQL Crash Course
No ratings yet
SQL Crash Course
17 pages
Microsoft Access For Beginners PDF
100% (2)
Microsoft Access For Beginners PDF
196 pages
TITLE 28 United States Code Sec. 3002
91% (11)
TITLE 28 United States Code Sec. 3002
77 pages
Google Hacking Database PDF
0% (1)
Google Hacking Database PDF
100 pages
Database Management Systems
No ratings yet
Database Management Systems
19 pages
24 Essential SQL Interview Questions
No ratings yet
24 Essential SQL Interview Questions
13 pages
Mythic Magazine #015
100% (3)
Mythic Magazine #015
34 pages
Open Source Intelligence
No ratings yet
Open Source Intelligence
4 pages
Open Source Intelligence (Osint) Reference Sheet
0% (1)
Open Source Intelligence (Osint) Reference Sheet
23 pages
Other Link Classified - How To Find The Book I Want
No ratings yet
Other Link Classified - How To Find The Book I Want
453 pages
Master Cyber Digital Forensics
50% (2)
Master Cyber Digital Forensics
114 pages
SQL Cheat Sheet
91% (11)
SQL Cheat Sheet
11 pages
Anatomy of A Hack
No ratings yet
Anatomy of A Hack
43 pages
Network Automation Cookbook
No ratings yet
Network Automation Cookbook
44 pages
Ibrahim M Al Amshan: Ref: GC955-500
No ratings yet
Ibrahim M Al Amshan: Ref: GC955-500
8 pages
Competency Model: Hay'S Method
100% (1)
Competency Model: Hay'S Method
15 pages
Urbanization in Bangladesh
No ratings yet
Urbanization in Bangladesh
22 pages
Quotation of Gait Training Electric Wheelchair
No ratings yet
Quotation of Gait Training Electric Wheelchair
2 pages
Graphic Designer Neville Brody Facts
No ratings yet
Graphic Designer Neville Brody Facts
3 pages
Case Digest Tax
0% (1)
Case Digest Tax
36 pages
Sand Filter Next Gen
100% (1)
Sand Filter Next Gen
18 pages
Sidambaram V Lok Bee Yeong
No ratings yet
Sidambaram V Lok Bee Yeong
30 pages
BS Tata
No ratings yet
BS Tata
18 pages
Instant download (Ebook) Structural Concrete: Strut-and-Tie Models for Unified Design by Chen, Wai-Fah; El-Metwally, Salah El-Din E ISBN 9781498783842, 1498783848 pdf all chapter
100% (10)
Instant download (Ebook) Structural Concrete: Strut-and-Tie Models for Unified Design by Chen, Wai-Fah; El-Metwally, Salah El-Din E ISBN 9781498783842, 1498783848 pdf all chapter
65 pages
Week 3 Lab
No ratings yet
Week 3 Lab
2 pages
Practice Questions & Answers: Made With by Sawzeeyy
No ratings yet
Practice Questions & Answers: Made With by Sawzeeyy
141 pages
Ies Oradea
No ratings yet
Ies Oradea
50 pages
Manila Standard Today - Tuesday (September 25, 2012) Issue
No ratings yet
Manila Standard Today - Tuesday (September 25, 2012) Issue
14 pages
Wins For The Week 5 February 2016
No ratings yet
Wins For The Week 5 February 2016
3 pages
TD1763-02 Kaft Mercedes OM906La
No ratings yet
TD1763-02 Kaft Mercedes OM906La
2 pages
Federal Reserve Bank of Kansas City, Kansas City, MO 64198, USA
No ratings yet
Federal Reserve Bank of Kansas City, Kansas City, MO 64198, USA
14 pages
Straight Egyptian Selection PDF
No ratings yet
Straight Egyptian Selection PDF
23 pages
Clinical Teaching On Cardiac Rehabilitation
No ratings yet
Clinical Teaching On Cardiac Rehabilitation
14 pages
Earnings Per Share (Eps)
No ratings yet
Earnings Per Share (Eps)
2 pages
Digital Textbooks Listing - 17K Pivot Subjects
No ratings yet
Digital Textbooks Listing - 17K Pivot Subjects
1,182 pages
Estimate S
No ratings yet
Estimate S
28 pages
Read Me - LHB Duronto Express
No ratings yet
Read Me - LHB Duronto Express
2 pages
CSE3009 - PARALLEL-AND-DISTRIBUTED-COMPUTING - LTP - 1.0 - 8 - Parallel and Distributed Computing
No ratings yet
CSE3009 - PARALLEL-AND-DISTRIBUTED-COMPUTING - LTP - 1.0 - 8 - Parallel and Distributed Computing
2 pages
Catalogue Wonil - KOREA (English)
No ratings yet
Catalogue Wonil - KOREA (English)
68 pages