Lecture 1 Introduction
• Dr. Samera Batool
• Samera.Batool@comsats.edu.pk
• Ground Floor, Room G41
Course Overview
• This course covers the following main concepts:
• Concepts of parallel and distributed computing
• Analysis and profiling of applications
• Shared memory concepts
• Distributed memory concepts
• Parallel and distributed programming (OpenMP, MPI)
• GPU based computing and programming (CUDA)
• Virtualization
• Cloud Computing, MapReduce
• Grid Computing
• Peer-to-Peer Computing
• Future trends in computing
Course Pre-requisites
• Programming Experience (preferably Python/C++/Java)
• Understanding of Computer Organization and Architecture
• Understanding of Operating Systems
Recommended Material
• Distributed Systems, Maarten van Steen & Andrew S. Tanenbaum, 3rd Edition (2020),
Pearson.
• Parallel Programming: Concepts and Practice, Bertil Schmidt, Jorge Gonzalez-
Dominguez, Christian Hundt, Moritz Schlarb, 1st Edition (2018), Elsevier.
• Parallel and High-Performance Computing, Robert Robey and Yuliana Zamora, 1st
Edition (2021), Manning.
• Distributed and Cloud Computing: From Parallel Processing to the Internet of Things,
Kai Hwang, Jack Dongarra, Geoffrey Fox, 1st Edition (2012), Elsevier.
• Multicore and GPU Programming: An Integrated Approach, Gerassimos Barlas, 2nd
Edition (2015), Elsevier.
• Parallel Programming: For Multicore and Cluster Systems, Thomas Rauber & Gudula
Rünger (2013), Springer.
Single Processor Architecture
Memory Hierarchy
Application Partitioning
High-Performance Computing (HPC)
• HPC is the use of parallel processing to run advanced application
programs efficiently, reliably, and quickly.
• It applies especially to systems that operate above one teraFLOPS
(10^12 floating-point operations per second).
• The term HPC is occasionally used as a synonym for supercomputing,
although technically a supercomputer is a system that performs at or
near the highest operational rate currently achieved by computers.
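As a rough illustration of the FLOPS metric above, the sketch below times a loop of floating-point operations in plain Python. This is a toy measurement, not a benchmark: interpreter overhead dominates, so real hardware peak rates are many orders of magnitude higher than what it reports.

```python
import time

# Toy FLOP/s estimate on one interpreter thread. Each loop
# iteration performs one multiply and one add, i.e. 2 floating-
# point operations.
n = 1_000_000
start = time.perf_counter()
acc = 0.0
for _ in range(n):
    acc += 1.0 * 1.0
elapsed = time.perf_counter() - start

flops = 2 * n / elapsed
print(f"~{flops:.2e} FLOP/s (pure-Python loop, one core)")
```

A teraFLOPS machine sustains 10^12 such operations per second; this loop typically achieves well under 10^8, which is why HPC codes rely on compiled kernels and parallel hardware rather than interpreted loops.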
GPU-accelerated Computing
• GPU-accelerated computing is the use of a graphics processing unit
(GPU) together with a CPU to accelerate deep learning, analytics, and
engineering applications.
• Pioneered in 2007 by NVIDIA, GPU accelerators now power energy-
efficient data centers in government labs, universities, enterprises,
and small- and medium-sized businesses around the world.
• They play a huge role in accelerating applications in platforms ranging
from artificial intelligence to cars, drones, and robots.
What is a GPU?
• It is a processor optimized for 2D/3D graphics, video, visual
computing, and display.
• It is a highly parallel, highly multithreaded multiprocessor optimized
for visual computing.
• It provides real-time visual interaction with computed objects via
graphics, images, and video.
• It serves as both a programmable graphics processor and a scalable
parallel computing platform.
• Heterogeneous Systems: combine a GPU with a CPU
HPC System Composition
Parallel Computers
• Virtually all stand-alone computers
today are parallel from a hardware
perspective:
• Multiple functional units (L1 cache, L2
cache, branch, pre-fetch, decode,
floating-point, graphics processing
(GPU), integer, etc.)
• Multiple execution units/cores
• Multiple hardware threads
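The hardware parallelism listed above can be inspected directly. For example, Python's standard library reports how many logical cores (hardware threads) the operating system exposes:

```python
import os

# Logical cores (hardware threads) visible to the OS. On a chip
# with SMT/hyper-threading this is typically twice the number of
# physical cores.
logical = os.cpu_count()
print(f"Logical cores visible to the OS: {logical}")
```

Even a laptop reports several logical cores here, which is the point of the slide: every modern stand-alone machine is already a small parallel computer.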
IBM BG/Q Compute Chip with 18 cores (PU) and 16 L2 Cache units (L2)
Parallel Computers
• Networks connect multiple stand-
alone computers (nodes) to make
larger parallel computer clusters.
• Parallel computer cluster
• Each compute node is a multi-
processor parallel computer in itself
• Multiple compute nodes are
networked together with a high-speed
interconnect such as InfiniBand
• Special-purpose nodes, also
multiprocessors, are used for other
roles
Types of Parallel and Distributed Computing
• Parallel Computing
• Shared Memory
• Distributed Memory
• Distributed Computing
• Cluster Computing
• Grid Computing
• Cloud Computing
• Distributed Pervasive Systems
Parallel Computing
Distributed (Cluster) Computing
• Essentially a group of high-end
systems connected through a LAN
• Homogeneous: same OS, near-
identical hardware
• Single managing node
Distributed (Grid) Computing
• Lots of nodes from everywhere
• Heterogeneous
• Dispersed across several organizations
• Can easily span a wide-area network