Lecture Week - 2 General Parallelism Terms
Lecture 3:
General Parallelism Terms
Farhad M. Riaz
Farhad.Muhammad@numl.edu.pk
Shared vs Distributed Memory
• Shared Memory
• Distributed Memory
Some General Parallel Terminology
Supercomputing / High Performance Computing (HPC)
– Using the world's fastest and largest computers to solve large
problems
Node
– A standalone "computer in a box". Usually comprised of multiple
CPUs/processors/cores, memory, network interfaces, etc. Nodes
are networked together to comprise a supercomputer.
CPU / Socket / Processor / Core
– In the past, a CPU (Central Processing Unit) was a singular
execution component
– Then, multiple CPUs were incorporated into a node
– Then, individual CPUs were subdivided into multiple "cores", each
being a unique execution unit
– CPUs with multiple cores are sometimes called "sockets"
– The result is a node with multiple CPUs, each containing multiple
cores
Some General Parallel Terminology
Task
– A logically discrete section of computational work. A task is typically a
program or program-like set of instructions that is executed by a
processor. A parallel program consists of multiple tasks running on
multiple processors.
Pipelining
– Breaking a task into steps performed by different processor units, with
inputs streaming through, much like an assembly line; a type of
parallel computing.
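The slides give no code, but a rough two-stage pipeline can be sketched in C with POSIX threads and semaphores (the API choice and all names here are mine, not from the lecture): stage 1 streams items through a one-slot buffer into stage 2, so both stages work concurrently on different items, like stations on an assembly line.

#include <stdio.h>
#include <pthread.h>
#include <semaphore.h>

#define N 8

static int handoff;        /* one-slot buffer connecting the two stages */
static sem_t slot_empty;   /* posted when stage 2 has consumed the slot */
static sem_t slot_full;    /* posted when stage 1 has filled the slot   */

/* Stage 1: produce/transform each input item and pass it downstream. */
static void *stage1(void *arg) {
    (void)arg;
    for (int i = 0; i < N; i++) {
        int item = i * i;             /* stand-in for real stage-1 work */
        sem_wait(&slot_empty);        /* wait until the slot is free    */
        handoff = item;
        sem_post(&slot_full);         /* hand the item to stage 2       */
    }
    return NULL;
}

/* Stage 2: consume items as they stream out of stage 1. */
static void *stage2(void *arg) {
    (void)arg;
    long sum = 0;
    for (int i = 0; i < N; i++) {
        sem_wait(&slot_full);         /* wait for the next item         */
        sum += handoff;               /* stand-in for real stage-2 work */
        sem_post(&slot_empty);        /* free the slot for stage 1      */
    }
    printf("pipeline result: %ld\n", sum);
    return NULL;
}

int main(void) {
    pthread_t t1, t2;
    sem_init(&slot_empty, 0, 1);      /* the slot starts empty */
    sem_init(&slot_full, 0, 0);
    pthread_create(&t1, NULL, stage1, NULL);
    pthread_create(&t2, NULL, stage2, NULL);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    return 0;
}

(Compile with gcc -pthread; this is only an illustration of the streaming idea, not a tuned pipeline.)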
Shared Memory
– From a strictly hardware point of view, describes a computer
architecture where all processors have direct (usually bus based)
access to common physical memory. In a programming sense, it
describes a model where parallel tasks all have the same "picture" of
memory and can directly address and access the same logical
memory locations regardless of where the physical memory actually
exists.
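A minimal sketch of the shared-memory programming model, assuming OpenMP in C (the API choice is mine, not from the slides): every thread reads and writes the same data array in a single address space, and a reduction combines the shared results.

#include <stdio.h>
#include <omp.h>

#define N 1000

int main(void) {
    double data[N];                 /* one array, visible to every thread */
    double sum = 0.0;

    /* All threads share 'data': each thread fills a disjoint part of it,
       then the reduction combines the partial sums into 'sum'. */
    #pragma omp parallel for reduction(+:sum)
    for (int i = 0; i < N; i++) {
        data[i] = i * 0.5;          /* same logical memory for all threads */
        sum += data[i];
    }

    printf("sum = %f (up to %d threads)\n", sum, omp_get_max_threads());
    return 0;
}

(Compile with gcc -fopenmp.)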
Some General Parallel Terminology
Symmetric Multi-Processor (SMP)
– Shared memory hardware architecture where multiple processors
share a single address space and have equal access to all resources.
Distributed Memory
– In hardware, refers to network based memory access for physical
memory that is not common. As a programming model, tasks can only
logically "see" local machine memory and must use communications
to access memory on other machines where other tasks are
executing.
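A minimal sketch of the distributed-memory model, assuming MPI in C (the library choice is mine, not from the slides): each rank owns a private copy of x in its own address space and cannot read another rank's copy without explicit communication.

#include <stdio.h>
#include <mpi.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);

    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    /* 'x' lives in this process's private memory: every rank has its own
       copy, and no rank can see another rank's copy directly. */
    int x = rank * 100;

    printf("rank %d sees only its local x = %d\n", rank, x);

    MPI_Finalize();
    return 0;
}

(Compile with mpicc and launch with mpirun -np 4, for example.)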
Communications
– Parallel tasks typically need to exchange data. This can be
accomplished in several ways, such as through a shared memory bus or
over a network; however, the actual event of data exchange is
commonly referred to as communications regardless of the method
employed.
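Continuing the MPI assumption above, a minimal point-to-point communication sketch: rank 0 explicitly ships a value to rank 1, which is the "event of data exchange" described here (run with at least two ranks).

#include <stdio.h>
#include <mpi.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);

    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {
        int payload = 42;
        /* Explicit communication: rank 0 sends its data to rank 1. */
        MPI_Send(&payload, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
    } else if (rank == 1) {
        int received;
        MPI_Recv(&received, 1, MPI_INT, 0, 0, MPI_COMM_WORLD,
                 MPI_STATUS_IGNORE);
        printf("rank 1 received %d from rank 0\n", received);
    }

    MPI_Finalize();
    return 0;
}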
Some General Parallel Terminology
Synchronization
– The coordination of parallel tasks in real time, very often associated with communications.
– Often implemented by establishing a synchronization point within an application where a
task may not proceed further until another task(s) reaches the same or logically equivalent
point.
– Synchronization usually involves waiting by at least one task and can therefore cause a
parallel application's wall clock execution time to increase.
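A minimal sketch of a synchronization point, again assuming OpenMP in C (my choice of API): no thread continues past the barrier until every thread in the team has reached it, so faster threads wait there, which is exactly where the extra wall-clock time comes from.

#include <stdio.h>
#include <omp.h>

int main(void) {
    #pragma omp parallel
    {
        int tid = omp_get_thread_num();

        printf("thread %d: finished phase 1\n", tid);

        /* Synchronization point: every thread must arrive here before
           any thread is allowed to start phase 2. */
        #pragma omp barrier

        printf("thread %d: starting phase 2\n", tid);
    }
    return 0;
}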
Granularity
– Granularity is a qualitative measure of the ratio of computation to communication
– Coarse: relatively large amounts of computational work are done between communication
events
– Fine: relatively small amounts of computational work are done between communication
events
Observed Speedup
– Observed speedup of a code which has been parallelized, defined as:
wall-clock time of serial execution
-----------------------------------
wall-clock time of parallel execution
– One of the simplest and most widely used indicators for a parallel program's performance.
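A small sketch of how observed speedup might be measured in practice, assuming OpenMP in C (the timing calls and workload are illustrative, not from the slides): run the same work serially and in parallel, then divide the two wall-clock times.

#include <stdio.h>
#include <omp.h>

#define N 100000000

/* Sum N terms, either serially or with an OpenMP parallel loop. */
static double work(int use_threads) {
    double sum = 0.0;
    #pragma omp parallel for reduction(+:sum) if(use_threads)
    for (long i = 0; i < N; i++)
        sum += 1.0 / (i + 1.0);
    return sum;
}

int main(void) {
    double t0 = omp_get_wtime();
    work(0);                                 /* serial run   */
    double t_serial = omp_get_wtime() - t0;

    t0 = omp_get_wtime();
    work(1);                                 /* parallel run */
    double t_parallel = omp_get_wtime() - t0;

    /* Observed speedup = serial wall-clock time / parallel wall-clock time. */
    printf("serial %.3fs, parallel %.3fs, speedup %.2fx\n",
           t_serial, t_parallel, t_serial / t_parallel);
    return 0;
}

For instance, if the serial run takes 100 s and the parallel run takes 20 s, the observed speedup is 5x.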
Some General Parallel Terminology
Parallel Overhead
– The amount of time required to coordinate parallel tasks, as opposed to
doing useful work. Parallel overhead can include factors such as:
Task start-up time
Synchronizations
Data communications
Software overhead imposed by parallel languages, libraries, operating
system, etc.
Task termination time
Massively Parallel
– Refers to the hardware that comprises a given parallel system - having
many processing elements. The meaning of "many" keeps increasing,
but currently, the largest parallel computers are comprised of
processing elements numbering in the hundreds of thousands
Embarrassingly Parallel
– Solving many similar, but independent tasks simultaneously; little to no
need for coordination between the tasks
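A minimal sketch of an embarrassingly parallel loop, again assuming OpenMP in C: every iteration is independent of the others, so the work splits across threads with no communication or coordination beyond the final join.

#include <stdio.h>
#include <math.h>
#include <omp.h>

#define N 16

int main(void) {
    double result[N];

    /* Each iteration touches only its own element: no shared updates,
       no ordering constraints, no communication between iterations. */
    #pragma omp parallel for
    for (int i = 0; i < N; i++)
        result[i] = sqrt((double)i) * 3.0;   /* stand-in for an independent task */

    for (int i = 0; i < N; i++)
        printf("result[%d] = %f\n", i, result[i]);
    return 0;
}

(Compile with gcc -fopenmp -lm.)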
Some General Parallel Terminology
Scalability
– Refers to a parallel system's (hardware and/or software) ability to
demonstrate a proportionate increase in parallel speedup with the
addition of more resources. Factors that contribute to scalability include:
Hardware - particularly memory-CPU bandwidths and network
communication properties
Application algorithm
Parallel overhead
Characteristics of your specific application
Why is Parallel Computing Necessary?
• Rise of multi-core computing machines
• Under-utilization of resources
• Hyperthreading, introduced by Intel
Example