0% found this document useful (0 votes)

2 views

Distributed system module 1

The document provides an introduction to distributed systems, highlighting their importance in career opportunities and real-world applications such as cloud computing and big data. It covers key components, characteristics, challenges, and protocols related to distributed systems, including remote procedure calls (RPC) and remote method invocation (RMI). Additionally, it discusses design issues, call semantics, and the role of middleware in facilitating communication between distributed components.

Uploaded by

World Inside The Pc MirrorBot

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

Distributed system module 1

Uploaded by

World Inside The Pc MirrorBot

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 50

Introduction

to
Distributed Systems

Department of
Computer Science & Engineering

www.cambridge.edu.in
Why Distributed Systems?

• Career Opportunities: Companies like Google, Amazon,

Microsoft, and numerous startups build their infrastructure on
distributed systems. Understanding these concepts can open
doors to careers in software engineering, cloud computing,
and big data.
• Real-World Applications
• Cutting-Edge Technologies
• Research and Innovation

CiTech, BANGALORE
What is a Distributed System?

A Distributed System is one in which hardware and software

components located at networked computers communicate
and coordinate their actions only by passing messages.

CiTech, BANGALORE
Components of a Distributed System

• Nodes: Individual computers or processes in the distributed

system.
• Communication Network: Infrastructure that allows nodes to
exchange messages (e.g., internet, local network).
• Middleware: Software that provides common services and
capabilities to applications beyond what's offered by the
operating system.

CiTech, BANGALORE
Characteristics of a Distributed
System
• Concurrency: Multiple processes execute simultaneously across
different machines.
• Scalability: Ability to grow and handle increased loads by adding
more resources.
• Fault Tolerance: Ability to continue functioning despite failures
of some components.
• Transparency: The system hides the complexity of distribution
from users and applications, making it seem like a single system.

CiTech, BANGALORE
Challenges of a Distributed System
• Heterogeneity: The Internet enables users to access services and run
applications over a heterogeneous collection of computers and
networks.
• Openness: The openness of a computer system is the characteristic
that determines whether the system can be extended and
reimplemented in various ways.
• Security: Many of the information resources that are made available
and maintained in distributed systems have a high intrinsic value to
their users. Their security is therefore of considerable importance.
• Concurrency: Multiple processes execute simultaneously across
different machines.
Challenges of a Distributed System
• Transparency: Transparency is defined as the concealment from
the user and the application programmer of the separation of
components in a distributed system, so that the system is
perceived as a whole rather than as a collection of independent
components.
• Failure handling: Designing the system to handle node failures
and network issues gracefully.
• Scalability: Ensuring that the system can handle increased load
without significant performance degradation.
• Quality of service

CiTech, BANGALORE
Applications of Distributed System

• Cloud Computing: Services like AWS, Azure, and Google Cloud

leverage distributed systems to provide scalable and reliable
services.
• Big Data: Frameworks like Hadoop and Apache Spark use
distributed systems to process and analyze large volumes of
data.
• Blockchain: Distributed ledger technologies like Bitcoin and
Ethereum use distributed systems to maintain a secure and
decentralized record of transactions.

CiTech, BANGALORE
Examples

• File Systems: Distributed file systems like Google File System

(GFS) and Hadoop Distributed File System (HDFS) store and
manage data across multiple machines.
• Databases: Distributed databases such as Cassandra and
MongoDB manage data across multiple nodes, ensuring high
availability and scalability.

CiTech, BANGALORE
Chapter 5 – Remote Invocation

• This chapter is concerned with how processes (or entities at a

higher level of abstraction such as objects or services)
communicate in a distributed system

CiTech, BANGALORE
Request-Reply Protocols
A protocol built over datagrams avoids unnecessary overheads
associated with the TCP stream protocol. In particular:
• Acknowledgements are redundant, since requests are followed by
replies.
• Establishing a connection involves two extra pairs of messages in
addition to the pair required for a request and a reply.
• Flow control is redundant for the majority of invocations, which
pass only small arguments and results.

CiTech, BANGALORE
Request-Reply Protocols
• The protocol we describe here is based on a trio of communication
primitives, doOperation, getRequest and sendReply, as shown in
Figure.

CiTech, BANGALORE
Request-Reply Protocols
• The doOperation method is used by clients to invoke remote
operations. Its arguments specify the remote server and which
operation to invoke, together with additional information
(arguments) required by the operation. Its result is a byte array
containing the reply.
• getRequest is used by a server process to acquire service requests.
• sendReply is used to send the reply message to the client.

CiTech, BANGALORE
Message identifiers
A message identifier consists of two parts:
1. a requestId, which is taken from an increasing sequence of integers
by the sending process;
2. an identifier for the sender process, for example, its port and
Internet address.

CiTech, BANGALORE
Failure model of the request-reply protocol

That is:
• They suffer from omission failures.
• Messages are not guaranteed to be delivered in sender order.

CiTech, BANGALORE
Timeouts & Discarding duplicate request
messages
• The timeout may have been due to the request or reply message
getting lost.
• In cases when the request message is retransmitted, the server
may receive it more than once. This can lead to the server
executing an operation more than once for the same request.

CiTech, BANGALORE
Request-reply message structure
The information to be transmitted in a request message or a reply
message is shown in Figure

CiTech, BANGALORE
Failure model of the request-reply protocol

If the three primitives doOperation, getRequest and sendReply are

implemented over UDP datagrams, then they suffer from the same
communication failures.
That is:
• They suffer from omission failures.
• Messages are not guaranteed to be delivered in sender order.

CiTech, BANGALORE
Timeouts

• There are various options as to what doOperation can do after a

timeout.
• The simplest option is to return immediately from doOperation
with an indication to the client that the doOperation has failed.
• To compensate for the possibility of lost messages, doOperation
sends the request message repeatedly until either it gets a reply or
it is reasonably sure that the delay is due to lack of response from
the server rather than to lost messages.

CiTech, BANGALORE
Discarding duplicate request messages

• In cases when the request message is retransmitted, the server

may receive it more than once.
• the protocol is designed to recognize successive messages (from
the same client) with the same request identifier and to filter out
duplicates.

CiTech, BANGALORE
Styles of exchange protocols

Three protocols, that produce differing behaviors in the presence of

communication failures are used for implementing various types of
request behavior. They were originally identified by Spector [1982]:
• the request (R) protocol;
• the request-reply (RR) protocol;
• the request-reply-acknowledge reply (RRA) protocol.

CiTech, BANGALORE
Styles of exchange protocols

CiTech, BANGALORE
Remote Procedure Call (RPC)
• In RPC, procedures on remote machines can be called as if they are
procedures in the local address space.
• The underlying RPC system then hides important aspects of
distribution, including the encoding and decoding of parameters and
results, the passing of messages and the preserving of the required
semantics for the procedure call.
• This concept was first introduced by Birrell and Nelson [1984] and
paved the way for many of the developments in distributed systems
programming.

CiTech, BANGALORE
Design issues for RPC
Before looking at the implementation of RPC systems, we look at three
issues that are important in understanding this concept:
• the style of programming promoted by RPC – programming with
interfaces;
• the call semantics associated with RPC;
• the key issue of transparency and how it relates to remote procedure
calls.

CiTech, BANGALORE
Interfaces in distributed systems
• In a distributed program, the modules can run in separate processes.
• In the client-server model, in particular, each server provides a set of
procedures that are available for use by clients.
• The term service interface is used to refer to the specification of the
procedures offered by a server, defining the types of the arguments of
each of the procedures.

CiTech, BANGALORE
CORBA IDL example

CiTech, BANGALORE
CORBA(Common Object Request Broker
Architecture)
• It is a standard defined by the Object Management Group (OMG)
that allows pieces of programs, known as objects, to communicate
with one another regardless of where they are located (locally or
across a network) and regardless of the programming language
used to write them.
• CORBA achieves this through its middleware framework, enabling
interoperability between distributed systems.

CiTech, BANGALORE
Idempotency

• In Distributed Systems: Idempotency is crucial in systems where

operations might be retried due to network failures. If an
operation is idempotent, it can safely be retried without concern
for unintended side effects.

CiTech, BANGALORE
RPC call semantics
The main choices are:
Retry request message: Controls whether to retransmit the request
message until either a reply is received or the server is assumed to
have failed.
Duplicate filtering: Controls when retransmissions are used and
whether to filter out duplicate requests at the server.
Retransmission of results: Controls whether to keep a history of
result messages to enable lost results to be retransmitted without
re-executing the operations at the server.

CiTech, BANGALORE
RPC call semantics

The choices of RPC invocation semantics are defined as follows:

• Maybe semantics
• At-least-once semantics
• At-most-once semantics

CiTech, BANGALORE
Maybe semantics

With maybe semantics, the remote procedure call may be executed

once or not at all. Maybe semantics arises when no fault-tolerance
measures are applied and can suffer from the following types of
failure:
• omission failures if the request or result message is lost;
• crash failures when the server containing the remote operation fails.

CiTech, BANGALORE
At-least-once semantics

With at-least-once semantics, the invoker receives either a

result, in which case the invoker knows that the procedure was
executed at least once, or an exception informing it that no result was
received.
At-least-once semantics can be achieved by the retransmission of
request messages, which masks the omission failures of the request or
result message.

CiTech, BANGALORE
At-least-once semantics

At-least-once semantics can suffer from the following types of failure:

• crash failures when the server containing the remote procedure
fails;
• arbitrary failures – in cases when the request message is
retransmitted, the remote server may receive it and execute the
procedure more than once, possibly causing wrong values to be
stored or returned.

CiTech, BANGALORE
Idempotent operation

An Idempotent operation is one that can be performed repeatedly

with the same effect as if it had been performed exactly once.
Non-idempotent operations can have the wrong effect if they are
performed more than once.

CiTech, BANGALORE
At-most-once semantics

With at-most-once semantics, the caller receives either a result, in

which case the caller knows that the procedure was executed exactly
once, or an exception informing it that no result was received, in
which case the procedure will have been executed either once or not
at all.

Sun RPC provides at-least-once call semantics.

CiTech, BANGALORE
Implementation of RPC

CiTech, BANGALORE
Remote Method Invocation (RMI)

• Remote method invocation (RMI) is closely related to RPC but

extended into the world of distributed objects.
• In RMI, a calling object can invoke a method in a potentially
remote object.
• As with RPC, the underlying details are generally hidden from the
user.

CiTech, BANGALORE
Remote Method Invocation (RMI)
The commonalities between RMI and RPC are as follows:
• They both support programming with interfaces, with the resultant
benefits that stem from this approach.
• They are both typically constructed on top of request-reply
protocols and can offer a range of call semantics such as
at-least-once and at-most-once.
• They both offer a similar level of transparency – that is, local and
remote calls employ the same syntax but remote interfaces
typically expose the distributed nature of the underlying call, for
example by supporting remote exceptions.

CiTech, BANGALORE
Remote Method Invocation (RMI)
The advantages of RMI are as follows:
• The programmer is able to use the full expressive power of
object-oriented programming in the development of distributed systems
software, including the use of objects, classes and inheritance, and can
also employ related object-oriented design methodologies and associated
tools.
• Building on the concept of object identity in object-oriented systems, all
objects in an RMI-based system have unique object references (whether
they are local or remote), such object references can also be passed as
parameters, thus offering significantly richer parameter-passing semantics
than in RPC.

CiTech, BANGALORE
Design issues for RMI

RMI shares the same design issues as RPC in terms of programming

with interfaces, call semantics and level of transparency.

The key added design issue relates to the object model and, in
particular, achieving the transition from objects to distributed objects.

CiTech, BANGALORE
The object model

Object references
Interfaces
Actions
Exceptions
Garbage collection

CiTech, BANGALORE
The distributed object model

CiTech, BANGALORE
Implementation of RMI

CiTech, BANGALORE
Implementation of RMI
Communication module : The two cooperating communication modules
carry out the request-reply protocol, which transmits request and reply
messages between the client and server.

Remote reference module: A remote reference module is responsible for

translating between local and remote object references and for creating
remote object references.(remote object table)

Servants: A servant is an instance of a class that provides the body of a

remote object.

CiTech, BANGALORE
Remote reference module
To support its responsibilities, the remote reference module in each process
has a remote object table that records the correspondence between local
object references in that process and remote object references (which are
system-wide).

The table includes:

• An entry for all the remote objects held by the process.
• An entry for each local proxy.

CiTech, BANGALORE
The RMI software
This consists of a layer of software between the application-level objects
and the communication and remote reference modules.

The roles of the middleware objects are as follows:

• Proxy : The role of a proxy is to make remote method invocation
transparent to clients by behaving like a local object to the invoker
• Dispatcher : The dispatcher receives request messages from the
communication module.
• Skeleton : A skeleton method unmarshals the arguments in the request
message and invokes the corresponding method in the servant.

CiTech, BANGALORE
Distributed garbage collection
The aim of a distributed garbage collector is to ensure that if a local or
remote reference to an object is still held anywhere in a set of distributed
objects, the object itself will continue to exist, but as soon as no object any
longer holds a reference to it, the object will be collected and the memory it
uses recovered.

CiTech, BANGALORE

SASE Secondary
No ratings yet
SASE Secondary
29 pages
Embedded Ethernet and Internet Complete
From Everand
Embedded Ethernet and Internet Complete
Jan Axelson
4/5 (1)
PHP Microservices
From Everand
PHP Microservices
Carlos Pérez Sánchez
3/5 (1)
Genki - An Integrated Course in Elementary Japanese Workbook II (Second Edition) (2011), WITH PDF BOOKMARKS!
85% (27)
Genki - An Integrated Course in Elementary Japanese Workbook II (Second Edition) (2011), WITH PDF BOOKMARKS!
130 pages
FYP Proposal
No ratings yet
FYP Proposal
5 pages
Distributed_Systems_Ch5
No ratings yet
Distributed_Systems_Ch5
37 pages
lecture8-DistributedSystem
No ratings yet
lecture8-DistributedSystem
27 pages
Unit-4(kd)
No ratings yet
Unit-4(kd)
61 pages
Chapter 3 Communication in Distributed Systems
No ratings yet
Chapter 3 Communication in Distributed Systems
14 pages
APznzabcpA6aPab9_jQWOwqlj6gUO5oA8citg8PhUT7Otg5g8ah72QiT3DjGunoaJJ98Ubua2QVHruCWdrFPgoh-B8EB4hz23Mt5CTTniCrI67gsmbQSaCTlszd4A1HhirLCpdMBB77K6f7Tt6MMbcv_cR4-ttjz-BU58zwGqbKI77CjZax4tF-LR7x28rNMw9WgcxDfDvIA5CuD6Cu0q9Z
No ratings yet
APznzabcpA6aPab9_jQWOwqlj6gUO5oA8citg8PhUT7Otg5g8ah72QiT3DjGunoaJJ98Ubua2QVHruCWdrFPgoh-B8EB4hz23Mt5CTTniCrI67gsmbQSaCTlszd4A1HhirLCpdMBB77K6f7Tt6MMbcv_cR4-ttjz-BU58zwGqbKI77CjZax4tF-LR7x28rNMw9WgcxDfDvIA5CuD6Cu0q9Z
49 pages
Chapter 4 Communication
No ratings yet
Chapter 4 Communication
10 pages
Distributed Systems
No ratings yet
Distributed Systems
6 pages
Mc4203 Cloud Computing Technologies (1) 2
No ratings yet
Mc4203 Cloud Computing Technologies (1) 2
64 pages
Distributed Systems 2 Mark Question & Answers
No ratings yet
Distributed Systems 2 Mark Question & Answers
16 pages
Internal 2 Question
No ratings yet
Internal 2 Question
11 pages
Module 1 Ppt
No ratings yet
Module 1 Ppt
47 pages
Unit 2 - Communication in Dis
No ratings yet
Unit 2 - Communication in Dis
73 pages
Jimma University: Jimma Institute of Technology
No ratings yet
Jimma University: Jimma Institute of Technology
8 pages
DS ModelQP Solution
No ratings yet
DS ModelQP Solution
44 pages
unit-3
No ratings yet
unit-3
22 pages
AOS-UNIT 2
No ratings yet
AOS-UNIT 2
23 pages
Chap 5
No ratings yet
Chap 5
34 pages
Communication
No ratings yet
Communication
52 pages
A) What Is RPC? Explain Different Types of RPC?
No ratings yet
A) What Is RPC? Explain Different Types of RPC?
6 pages
DC Chap 4
No ratings yet
DC Chap 4
58 pages
chap2dc
No ratings yet
chap2dc
10 pages
Advanced Distributed Systems
100% (1)
Advanced Distributed Systems
15 pages
Chapter Five Remote Method Invocation 1
No ratings yet
Chapter Five Remote Method Invocation 1
79 pages
Chapter 4-Communication
No ratings yet
Chapter 4-Communication
41 pages
Distributed Computing practice questions Chapter 4 pt2
No ratings yet
Distributed Computing practice questions Chapter 4 pt2
6 pages
DC Module 2a
No ratings yet
DC Module 2a
137 pages
Chapter 4 - Distributed System
No ratings yet
Chapter 4 - Distributed System
24 pages
Unit-2 (A)
No ratings yet
Unit-2 (A)
40 pages
Chapter 4-Communication
No ratings yet
Chapter 4-Communication
41 pages
MODULE 1
No ratings yet
MODULE 1
76 pages
Fault System One
No ratings yet
Fault System One
19 pages
Unit I MC4203 CC_pdf
No ratings yet
Unit I MC4203 CC_pdf
48 pages
1
No ratings yet
1
31 pages
Unit 1 Part 2
No ratings yet
Unit 1 Part 2
37 pages
DC IAT1
No ratings yet
DC IAT1
20 pages
DOS_Answers
No ratings yet
DOS_Answers
18 pages
Networking Programming with C++: Build Efficient Communication Systems
From Everand
Networking Programming with C++: Build Efficient Communication Systems
Robert Johnson
No ratings yet
Lecture23 FaultTolerance
No ratings yet
Lecture23 FaultTolerance
56 pages
Chapter 4-Communication
No ratings yet
Chapter 4-Communication
36 pages
Distributed Systems Research
No ratings yet
Distributed Systems Research
6 pages
Slides
No ratings yet
Slides
516 pages
IT6505: Middleware Architecture: University of Colombo, Sri Lanka
No ratings yet
IT6505: Middleware Architecture: University of Colombo, Sri Lanka
10 pages
Chapter Four: Communication in Distributed Systems
No ratings yet
Chapter Four: Communication in Distributed Systems
26 pages
Ch4-Communicaton
No ratings yet
Ch4-Communicaton
37 pages
Unit IV
No ratings yet
Unit IV
45 pages
Important Q A
No ratings yet
Important Q A
51 pages
Chapter 2
No ratings yet
Chapter 2
46 pages
DS UNIT-1 Saqs Laqs (Complete)
No ratings yet
DS UNIT-1 Saqs Laqs (Complete)
14 pages
Chapter 4 Communication
No ratings yet
Chapter 4 Communication
75 pages
Dist Sys Slides
No ratings yet
Dist Sys Slides
516 pages
Chapter 4 - Communication
No ratings yet
Chapter 4 - Communication
53 pages
dscc QB solution copy
No ratings yet
dscc QB solution copy
15 pages
Chapter - 4
No ratings yet
Chapter - 4
53 pages
Lect3 Communication
No ratings yet
Lect3 Communication
7 pages
Mod 2
No ratings yet
Mod 2
66 pages
Lect 3
No ratings yet
Lect 3
37 pages
CS542: Topics in Distributed Systems
No ratings yet
CS542: Topics in Distributed Systems
39 pages
Group 2
No ratings yet
Group 2
24 pages
Module 4 Distributed System
No ratings yet
Module 4 Distributed System
29 pages
brmk557modelquestionpaper1solution-250124043647-c8aeecde
No ratings yet
brmk557modelquestionpaper1solution-250124043647-c8aeecde
50 pages
Distributed system - Distrubuted file system
No ratings yet
Distributed system - Distrubuted file system
30 pages
Distributed system module 3
No ratings yet
Distributed system module 3
31 pages
English Class Test For 11&12
No ratings yet
English Class Test For 11&12
1 page
Fi Salary Account - Step-By-Step Onboarding
No ratings yet
Fi Salary Account - Step-By-Step Onboarding
7 pages
DD and Co
No ratings yet
DD and Co
9 pages
Student Attendance Template by Sheetgo
No ratings yet
Student Attendance Template by Sheetgo
72 pages
Akshay LLR
No ratings yet
Akshay LLR
1 page
16 SVPMSG
No ratings yet
16 SVPMSG
258 pages
The 2014 Cio Agenda
100% (1)
The 2014 Cio Agenda
70 pages
Sample PPT Bidder's Point
No ratings yet
Sample PPT Bidder's Point
108 pages
new_Syllabus_Of_Azure_Suite
No ratings yet
new_Syllabus_Of_Azure_Suite
13 pages
SharePoint 2010 End-User Training Manual
No ratings yet
SharePoint 2010 End-User Training Manual
28 pages
Adobe Photoshop Level 1 - EnG
No ratings yet
Adobe Photoshop Level 1 - EnG
56 pages
Region of Interest Pooling Explained
No ratings yet
Region of Interest Pooling Explained
12 pages
Omsdk Session Client
No ratings yet
Omsdk Session Client
18 pages
Decision-Makerr MPAC 1500 Decision-Makerr MPAC 1500 Controller Standard Features
No ratings yet
Decision-Makerr MPAC 1500 Decision-Makerr MPAC 1500 Controller Standard Features
6 pages
CP Functions
No ratings yet
CP Functions
39 pages
Troubleshooting Log Files in Domino
No ratings yet
Troubleshooting Log Files in Domino
68 pages
PASCO UV VIS Spectrometer Brochure
No ratings yet
PASCO UV VIS Spectrometer Brochure
12 pages
123-20 IRIG 106 Chapter10 Programmers Handbook
No ratings yet
123-20 IRIG 106 Chapter10 Programmers Handbook
281 pages
PAC Productivity Suite: Integrated PLC and SCADA Solution
No ratings yet
PAC Productivity Suite: Integrated PLC and SCADA Solution
6 pages
Creating A Variable Font - Glyphs
No ratings yet
Creating A Variable Font - Glyphs
37 pages
System Analysis and Design
No ratings yet
System Analysis and Design
90 pages
Ejemplos de Declaraciones Generales para Ensayos
100% (1)
Ejemplos de Declaraciones Generales para Ensayos
7 pages
Computer Graphics & Multimedia (DCO-511)
No ratings yet
Computer Graphics & Multimedia (DCO-511)
19 pages
An Overview of Block-Chain Technology and Related Security Attacks: Systematic Literature Review
No ratings yet
An Overview of Block-Chain Technology and Related Security Attacks: Systematic Literature Review
15 pages
C Prog Interview Q PTR
No ratings yet
C Prog Interview Q PTR
36 pages
Rapid Application Development
No ratings yet
Rapid Application Development
13 pages
Tcs2p125 Manual en
No ratings yet
Tcs2p125 Manual en
11 pages
Secnav M-5239.1 PDF
No ratings yet
Secnav M-5239.1 PDF
43 pages
PLC Programming With RSLogix 500 - Shared
100% (6)
PLC Programming With RSLogix 500 - Shared
132 pages
Manisha - Smart Phones Vs DSLR - Final - 2020
No ratings yet
Manisha - Smart Phones Vs DSLR - Final - 2020
34 pages
Art Gallery Dbms Project
No ratings yet
Art Gallery Dbms Project
5 pages
Dev Ops Engineer - Dariel Software Agency
No ratings yet
Dev Ops Engineer - Dariel Software Agency
3 pages