Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Distributed Information Systems

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 21

Distributed Information Systems

Rationale
• Many computer applications are confronted
with ever increasing volumes of data which
must be managed, accessed, and processed
efficiently. Typical cases arise in the fields of e-
Health, banking and insurance, e-Science,
digital libraries, etc. From the data
management point of view, this has led to a
radical shift from centralized database systems
towards distributed information systems
What is information

PRODUCTS SOLD IN DECEMBER


What is information
• Information is data that is (1) accurate  and timely, (2)
specific and organized for a purpose, (3) presented
within a context  that gives it meaning and relevance,
and (4) can lead  to an increase in understanding and
decrease in uncertainty

In summary, information is organized data that has


meaning to the receiver

• Information = Data that is interpreted


Data?
What is a system
What is a system
• A system is a set of interacting or
interdependent component parts forming a
complex whole for a common purpose
What is an Information System
What is an Information System
• An information system is any
organized system for the
collection, organization,
storage and communication
of information.

• More specifically, it is a
complementary network that
people and organizations use
to collect, filter, process, crea
te and distribute data.
Distributed Information Systems (IS)?
Distributed Information Systems (IS)?

• A distributed
information system is
one that runs services
that have distinct
components, at many
different sites, that
collaborate to ensure
the quality of service
Why Distributed Information Systems
(IS)?
• A computer service that runs at a single central
location is more likely to become unavailable
than a service distributed to many sites.
• There are two ways in which a service can be
made to run at many sites: replication of the
service, and distribution of the service.
• We’ll be investigating ways in which to build
distributed services
About Distributed Information Systems (IS)

• Distributed IS implement Distributed processing.


• Distributed processing is a technique for implementing a
single logical set of processing functions across a number
of physical devices, so that each performs some part of the
total processing required.
• Distributed processing is often accompanied by the
formation of a distributed database.
• A distributed database exists when the data elements
stored at multiple locations are interrelated, or if a process
(program execution) at one location requires access to data
stored at another location.
A centralized IS, i.e. running on one node
under a single authority
Upside of Distributed IS
Upside of Distributed IS
• There exist many reasons not to keep all data in a single node
on the network. These include:
• Improved use of existing resources.
• Moving data close to the node where it is processed
• Taking advantage of parallel processing of the data hence
avoiding bottlenecks
• Allows for easy and improved scalability of the systems.

• NOTE: Nevertheless distributed systems don’t give up the


abstraction that one is accessing a single information system
that is running under a single authority.
Down Side of Distributed IS
Business conditions that justify the use of distributed
databases:
• Data communications costs and reliability: If the data is
geographically distributed and the applications are related to
this data, it may be much more economical, in terms of
communication costs, to partition the application and do the
processing at each site.

• On the other hand, the cost of having smaller computing


powers at each site is much less than the cost of having an
equivalent power of a single mainframe

• Database recovery: Replicating data on separate computers is


one strategy for ensuring that a damaged database can be
quickly recovered and users can have access to data while the
primary site is being restored. Replicating data across multiple
computer sites is one natural form of a distributed database.
Business conditions that justify the use of
distributed databases:
• Distribution and autonomy of business units: Divisions,
departments, and facilities in modern organizations are
often geographically distributed, often across national
boundaries. Often each unit has the authority to create its
own information systems, and often these units want local
data over which they can have control

• Improved Performance: Data retrieved by a transaction


may be stored at a number of sites, making it possible to
execute the transaction in parallel. Besides, using several
resources in parallel can significantly improve performance.
Business conditions that justify the use of
distributed databases:
Depending on the way data is distributed, most requests
for data by users at a particular site can be satisfied by data
stored at that site. This speeds up query processing since
communication and central computer delays are minimized. It
may also be possible to split complex queries into sub-queries
that can be processed in parallel at several sites, providing even
faster response

• Increased reliability and availability: When a centralized


system fails, the database is unavailable to all users. In
contrast to centralized systems, distributed system will
continue to function at some reduced level, however, even
when a component fails.

You might also like