Chapter 6 Parallel and Concurrent Computing

Chapter 5 of ICS 2410 discusses parallel and concurrent systems, defining key concepts such as concurrency and parallelism, and introducing Flynn's Taxonomy which classifies parallel computers into SISD, SIMD, MISD, and MIMD categories. It elaborates on the characteristics and programming methods of these systems, including multiprocessors and multicomputers, as well as various forms of parallelism like data and task parallelism. The chapter also highlights the advantages and disadvantages of SIMD and MIMD architectures.


ICS 2410 Advanced Topics in Computer Science
Chapter 5: Parallel and Concurrent Systems

Some Definitions

- Concurrent: events or processes which seem to occur or progress at the same time.
- Parallel: events or processes which occur or progress at the same time.
- Parallel programming (also, unfortunately, sometimes called concurrent programming) is a computer programming technique that provides for the parallel execution of operations, either
  - within a single parallel computer, or
  - across a number of systems.
- In the latter case, the term distributed computing is often used.


Flynn's Taxonomy

- Best-known classification scheme for parallel computers.
- Depends on the parallelism a computer exhibits with its
  - instruction stream
  - data stream
- A sequence of instructions (the instruction stream) manipulates a sequence of operands (the data stream).
- The instruction stream (I) and the data stream (D) can each be either single (S) or multiple (M).
- Four combinations: SISD, SIMD, MISD, MIMD


SISD

- Single Instruction, Single Data
- Single-CPU systems, i.e., uniprocessors
  - Note: co-processors don't count as additional processors.
- Concurrent processing allowed
  - Instruction prefetching
  - Pipelined execution of instructions
- Concurrent execution allowed
  - That is, independent concurrent tasks can execute different sequences of operations.
- Most important example: a PC

SIMD

- Single Instruction, Multiple Data
- One instruction stream is broadcast to all processors.
- Each processor, also called a processing element (PE), is usually simple and logically is essentially an ALU.
  - PEs do not store a copy of the program nor have a program control unit.
- Individual processors can remain idle during execution of segments of the program (based on a data test).

SIMD (cont.)

- All active processors execute the same instruction synchronously, but on different data.
- Technically, on a memory access, all active processors must access the same location in their local memory.
- The data items form an array (or vector), and an instruction can act on the complete array in one cycle (illustrated below).

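As an illustration only (NumPy is not part of the slides), the sketch below mimics the SIMD style: a single whole-array operation is applied to every element at once, rather than looping over items one at a time.

    # Illustrative sketch of SIMD-style array processing (assumes NumPy is
    # available; it is used here only as an analogy for PEs acting in lockstep).
    import numpy as np

    b = np.arange(8, dtype=np.float64)   # one operand per processing element
    c = np.ones(8, dtype=np.float64)     # second operand array

    # A single "instruction" (element-wise add) acts on the complete array.
    a = b + c
    print(a)                             # [1. 2. 3. 4. 5. 6. 7. 8.]
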
How to View a SIMD Machine

- Think of soldiers all in a unit.
- The commander selects certain soldiers as active, for example, the first row.
- The commander barks out an order to all the active soldiers, who execute the order synchronously.
- The remaining soldiers do not execute orders until they are re-activated.

MIMD

- Multiple Instruction, Multiple Data
- Processors are asynchronous, since they can independently execute different programs on different data sets.
- Communications are handled either
  - through shared memory (multiprocessors), or
  - by use of message passing (multicomputers).
- MIMDs have been considered by most researchers to include the most powerful and least restricted computers.


MIMD (cont. 2/4)

- MIMDs have very major communication costs when compared to SIMDs.
  - Internal "housekeeping" activities are often overlooked:
    - maintaining distributed memory and distributed databases
    - synchronization or scheduling of tasks
    - load balancing between processors
- One method for programming MIMDs is for all processors to execute the same program.
  - Execution of tasks by processors is still asynchronous.
  - Called the SPMD method (single program, multiple data); a sketch follows below.
  - The usual method when the number of processors is large.
  - Considered to be a "data parallel programming" style for MIMDs.

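A minimal SPMD sketch, assuming the mpi4py package and an MPI runtime (the slides do not name a particular MPI binding): every process runs the same program, and the rank selects which part of the data each process works on.

    # SPMD: the same program runs on every process; rank picks the data slice.
    from mpi4py import MPI

    comm = MPI.COMM_WORLD
    rank = comm.Get_rank()        # this process's id
    size = comm.Get_size()        # total number of processes

    data = list(range(100))
    chunk = data[rank::size]      # each rank takes a different strided slice

    local_sum = sum(x * x for x in chunk)

    # Combine the asynchronous partial results into one value on rank 0.
    total = comm.reduce(local_sum, op=MPI.SUM, root=0)
    if rank == 0:
        print("sum of squares:", total)
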
MIMD (cont. 3/4)

- A more common technique for programming MIMDs is to use multi-tasking (see the sketch after this list):
  - The problem solution is broken up into various tasks.
  - Tasks are distributed among processors initially.
  - If new tasks are produced during execution, these may be handled by the parent processor or redistributed.
  - Each processor can execute its collection of tasks concurrently.
    - If some of its tasks must wait for results from other tasks or for new data, the processor focuses on its remaining tasks.
- Larger programs usually run a load-balancing algorithm in the background that redistributes the tasks assigned to the processors during execution.
  - Either dynamic load balancing, or invoked at specific times.
- Dynamic scheduling algorithms may be needed to assign a higher execution priority to time-critical tasks.
  - E.g., tasks on the critical path, more important tasks, or tasks with earlier deadlines.

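A rough multi-tasking sketch using only the Python standard library (the task body and worker count are made up for illustration): tasks are handed to a pool of workers, which execute them concurrently and pick up new work as they finish, giving a simple form of dynamic load balancing.

    # Tasks are distributed to a pool; idle workers pull the next task.
    from concurrent.futures import ThreadPoolExecutor, as_completed

    def work(task_id):
        # Placeholder task body; a real task would do useful computation.
        return task_id * task_id

    with ThreadPoolExecutor(max_workers=4) as pool:
        futures = [pool.submit(work, t) for t in range(10)]   # initial tasks
        for fut in as_completed(futures):                     # completion is asynchronous
            print(fut.result())
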
Multiprocessors (Shared-Memory MIMDs)

- All processors have access to all memory locations.
- Two types: UMA and NUMA
- UMA (uniform memory access)
  - Frequently called symmetric multiprocessors, or SMPs.
  - Similar to a uniprocessor, except that additional, identical CPUs are added to the bus.
  - Each processor has equal access to memory and can do anything that any other processor can do (a small sketch of this shared-memory style follows below).
  - SMPs have been and remain very popular.
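A small sketch of the shared-memory style, using Python threads purely to illustrate a single shared address space (the threads and the lock are assumptions, not part of the slides): every worker reads and writes the same array directly.

    # All workers touch the same memory; a lock guards simultaneous updates.
    import threading

    shared = [0] * 8
    lock = threading.Lock()

    def worker(idx):
        with lock:
            shared[idx] += 1          # every worker sees the same array

    threads = [threading.Thread(target=worker, args=(i,)) for i in range(8)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    print(shared)                     # [1, 1, 1, 1, 1, 1, 1, 1]
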


Multiprocessors (cont.)

- NUMA (non-uniform memory access)
  - Has a distributed memory system.
  - Each memory location has the same address for all processors.
  - Access time to a given memory location varies considerably for different CPUs.
  - Normally, fast cache is used with NUMA systems to reduce the problem of different memory access times for PEs.
    - This creates the problem of ensuring that all copies of the same data item in different memory locations are identical.

Multicomputers (Message-Passing MIMDs)

- Processors are connected by a network.
  - An interconnection network is one possibility.
  - They may also be connected by Ethernet links or a bus.
- Each processor has a local memory and can only access its own local memory.
- Data is passed between processors using messages, when specified by the program.
- Message passing between processors is controlled by a message-passing library (typically MPI); a sketch follows below.
- The problem is divided into processes or tasks that can be executed concurrently on individual processors. Each processor is normally assigned multiple processes.

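A minimal message-passing sketch, again assuming mpi4py (the exact calls are illustrative; the slides only name MPI): data is explicitly sent from one process and received by another, since neither can read the other's local memory.

    # Explicit message passing between two processes.
    from mpi4py import MPI

    comm = MPI.COMM_WORLD
    rank = comm.Get_rank()

    if rank == 0:
        msg = {"values": [1, 2, 3]}
        comm.send(msg, dest=1, tag=0)        # the data is copied to process 1
    elif rank == 1:
        msg = comm.recv(source=0, tag=0)     # process 1 only sees its own memory
        print("received:", msg)
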
Multiprocessors vs Multicomputers

- Programming disadvantages of message passing:
  - Programmers must make explicit message-passing calls in the code.
    - This is low-level programming and is error prone.
  - Data is not shared between processors but copied, which increases the total data size.
  - Data integrity problem: it is difficult to maintain the correctness of multiple copies of a data item.

Multiprocessors vs Multicomputers (cont.)

- Programming advantages of message passing:
  - No problem with simultaneous access to data.
  - Allows different PCs to operate on the same data independently.
  - Allows PCs on a network to be easily upgraded when faster processors become available.
- Mixed "distributed shared memory" systems exist.
  - There is a lot of current interest in clusters of SMPs.
  - These make it easier to build systems with a very large number of processors.
Seeking Concurrency

Several different ways exist:
- Data parallelism
- Task parallelism
  - Sometimes called control parallelism or functional parallelism.
- Pipelining

Data Parallelism

- All tasks (or processors) apply the same set of operations to different data.
- Example:

    for i ← 0 to 99 do
        a[i] ← b[i] + c[i]
    endfor

- The iterations may be executed concurrently, as in the sketch below.

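A rough data-parallel version of the loop above, using only the Python standard library (the chunk size and worker count are illustrative): the index range is split into chunks and every worker applies the same operation, a[i] ← b[i] + c[i], to its own chunk.

    # Same operation applied to different chunks of the data, concurrently.
    from concurrent.futures import ProcessPoolExecutor

    b = list(range(100))
    c = list(range(100, 200))

    def add_chunk(bounds):
        lo, hi = bounds
        return [b[i] + c[i] for i in range(lo, hi)]

    if __name__ == "__main__":
        chunks = [(lo, lo + 25) for lo in range(0, 100, 25)]   # four equal chunks
        with ProcessPoolExecutor(max_workers=4) as pool:
            a = [x for part in pool.map(add_chunk, chunks) for x in part]
        print(a[:5])                                           # [100, 102, 104, 106, 108]
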
Data Parallelism Features

- Each processor performs the same computation on different data sets.
- Computations can be performed either synchronously or asynchronously.
- Definition: grain size is the average number of computations performed between communication or synchronization steps.

Task/Functional/Control/Job Parallelism

- Independent tasks apply different operations to different data elements.
- Example:

    a ← 2
    b ← 3
    m ← (a + b) / 2
    s ← (a² + b²) / 2
    v ← s - m²

- The first and second statements may execute concurrently.
- The third and fourth statements may execute concurrently (see the sketch below).
- Normally, this type of parallelism deals with concurrent execution of tasks, not statements.

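A sketch of the task-parallel version of the statements above (standard library only): m and s do not depend on each other, so they run as separate tasks; v needs both, so it runs afterwards.

    # Independent statements become independent tasks.
    from concurrent.futures import ThreadPoolExecutor

    a, b = 2, 3

    with ThreadPoolExecutor(max_workers=2) as pool:
        f_m = pool.submit(lambda: (a + b) / 2)           # m ← (a + b) / 2
        f_s = pool.submit(lambda: (a**2 + b**2) / 2)     # s ← (a² + b²) / 2
        m, s = f_m.result(), f_s.result()

    v = s - m**2                                         # v ← s - m²
    print(m, s, v)                                       # 2.5 6.5 0.25
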
Control Parallelism Features

- The problem is divided into different, non-identical tasks.
- Tasks are divided between the processors so that their workload is roughly balanced.
- Parallelism at the task level is considered to be coarse-grained parallelism.

Pipelining

- Divide a process into stages.
- Produce several items simultaneously.

Compute Partial Sums

- Consider the for loop:

    p[0] ← a[0]
    for i ← 1 to 3 do
        p[i] ← p[i-1] + a[i]
    endfor

- This computes the partial sums:

    p[0] ← a[0]
    p[1] ← a[0] + a[1]
    p[2] ← a[0] + a[1] + a[2]
    p[3] ← a[0] + a[1] + a[2] + a[3]

- The loop is not data parallel, as there are dependencies.
- However, we can stage the calculations in order to achieve some parallelism, as in the sketch below.

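One way to stage the calculation is a pipeline, sketched below with Python threads and queues (an illustration only, not a method prescribed by the slides): stage i adds a[i] to the running sum and forwards it, so when several input vectors stream through, different stages work on different vectors at the same time.

    # Each stage computes p[i] = p[i-1] + a[i] and passes the result along.
    import queue
    import threading

    NUM_STAGES = 4

    def stage(i, q_in, q_out):
        while True:
            item = q_in.get()
            if item is None:              # shutdown signal, forwarded downstream
                q_out.put(None)
                return
            a, partial = item
            partial = partial + a[i]      # p[i] = p[i-1] + a[i]
            print(f"stage {i}: p[{i}] = {partial}")
            q_out.put((a, partial))

    queues = [queue.Queue() for _ in range(NUM_STAGES + 1)]
    threads = [threading.Thread(target=stage, args=(i, queues[i], queues[i + 1]))
               for i in range(NUM_STAGES)]
    for t in threads:
        t.start()

    for a in ([1, 2, 3, 4], [10, 20, 30, 40]):   # two input vectors stream through
        queues[0].put((a, 0))
    queues[0].put(None)

    for t in threads:
        t.join()
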


SIMD Machines

- An early SIMD computer designed for vector and matrix processing was the Illiac IV.
  - Initial development at the University of Illinois, 1965-70.
  - Moved to NASA Ames; completed in 1972 but not fully functional until 1976.
- The MPP, DAP, the Connection Machines CM-1 and CM-2, and MasPar's MP-1 and MP-2 are examples of SIMD computers.
- The CRAY-1 and the Cyber-205 use pipelined arithmetic units to support vector operations and are sometimes called pipelined SIMDs.

Today's SIMDs

- SIMD functionality is sometimes embedded in sequential machines.
- Others are being built as part of hybrid architectures.
- Some SIMD and SIMD-like features are included in some multi-core and many-core processing units.
- Some SIMD-like architectures have been built as special-purpose machines, although some of these could qualify as general purpose.

Advantages of SIMDs

- Less hardware than MIMDs, as they have only one control unit.
  - Control units are complex.
- Less memory needed than MIMD.
  - Only one copy of the instructions needs to be stored.
  - This allows more data to be stored in memory.
- Much less time required for communication between PEs and data movement.

Advantages of SIMDs (cont.)

- The single instruction stream and synchronization of PEs make SIMD applications easier to program, understand, and debug.
  - Similar to sequential programming.
- Control-flow operations and scalar operations can be executed on the control unit while the PEs are executing other instructions.
- Less complex hardware in a SIMD, since no message decoder is needed in the PEs.
  - MIMDs need a message decoder in each PE.


SIMD Shortcoming Claims

- Claim 1: SIMDs have a data-parallel orientation, but not all problems are data-parallel.
- Claim 2: Speed drops for conditionally executed branches.
- Claim 3: SIMDs do not adapt to multiple users well.
- Claim 4: SIMDs do not scale down well to affordable "starter" systems.
- Claim 5: SIMDs require customized VLSI for processors, and the expense of control units in …