BCS3413 Principle & Applications of Parallel Programming Quiz 2: GPGPU CUDA
1. If we want to allocate an array of v integer elements in CUDA device global memory, what would
be an appropriate expression for the second argument of the cudaMalloc() call?
(A) n
(B) v
(C) n * sizeof(int)
(D) v * sizeof(int)
Answer: (D)
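For illustration, a minimal host-side sketch of the allocation; the pointer name d_A and the example value of v are assumptions for this sketch:

    int v = 1000;      // example element count
    int *d_A = NULL;   // device pointer (name assumed)
    // The second argument is the allocation size in bytes: v elements of sizeof(int) each.
    cudaMalloc((void **)&d_A, v * sizeof(int));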
3. If we want to copy 3000 bytes of data from host array h_A (h_A is a pointer to element 0 of the
source array) to device array d_A (d_A is a pointer to element 0 of the destination array), what
would be an appropriate API call for this in CUDA?
(A) cudaMemcpy(3000, h_A, d_A, cudaMemcpyHostToDevice);
(B) cudaMemcpy(h_A, d_A, 3000, cudaMemcpyDeviceToHost);
(C) cudaMemcpy(d_A, h_A, 3000, cudaMemcpyHostToDevice);
(D) cudaMemcpy(3000, d_A, h_A, cudaMemcpyHostToDevice);
Answer: (C)
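A minimal sketch of the copy, assuming h_A and d_A have already been allocated with at least 3000 bytes each:

    // Argument order: destination, source, byte count, direction.
    cudaMemcpy(d_A, h_A, 3000, cudaMemcpyHostToDevice);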
4. How would one declare a variable err that can appropriately receive the returned value of a CUDA API call?
(A) int err;
(B) cudaError err;
(C) cudaError_t err;
(D) cudaSuccess_t err;
Answer: (C)
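A minimal error-checking sketch; the cudaMalloc() call is just one example of a CUDA API call whose return value can be captured (fprintf assumes <stdio.h> is included):

    cudaError_t err = cudaMalloc((void **)&d_A, v * sizeof(int));
    if (err != cudaSuccess) {
        // cudaGetErrorString() converts the error code into a readable message.
        fprintf(stderr, "CUDA error: %s\n", cudaGetErrorString(err));
    }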
5. If we need to use each thread to calculate one output element of a vector addition, what would
be the expression for mapping the thread/block indices to the data index?
(A) i=threadIdx.x + threadIdx.y;
(B) i=blockIdx.x + threadIdx.x;
(C) i=blockIdx.x*blockDim.x + threadIdx.x;
(D) i=blockIdx.x * threadIdx.x;
Answer: (C)
Explanation: This is the case we covered in Lecture 2.3.
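A minimal vector-addition kernel using this mapping; the names vecAdd, A, B, C, and n are assumptions for the sketch:

    __global__ void vecAdd(const float *A, const float *B, float *C, int n) {
        // Global thread index: all previous blocks contribute blockIdx.x*blockDim.x threads.
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i < n)                  // boundary check for the last block
            C[i] = A[i] + B[i];
    }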
6. We want to use each thread to calculate two (adjacent) output elements of a vector addition.
Assume that variable i should be the index for the first element to be processed by a thread. What
would be the expression for mapping the thread/block indices to the data index of the first element?
(A) i=blockIdx.x*blockDim.x + threadIdx.x + 2;
(B) i=blockIdx.x*threadIdx.x*2;
(C) i=(blockIdx.x*blockDim.x + threadIdx.x)*2;
(D) i=blockIdx.x*blockDim.x*2 + threadIdx.x;
Answer: (C)
Explanation: Every thread covers two adjacent output elements, so the starting data index is
simply twice the global thread index. Another way to look at it: all previous blocks together
cover (blockIdx.x*blockDim.x)*2 elements, and within the block each thread covers 2 elements, so
the beginning position for a thread within the block is 2*threadIdx.x. Adding the two gives
(blockIdx.x*blockDim.x + threadIdx.x)*2.
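A minimal kernel sketch for this adjacent-elements scheme (names assumed as in the previous sketch):

    __global__ void vecAddAdjacent(const float *A, const float *B, float *C, int n) {
        // First of the two adjacent elements handled by this thread.
        int i = (blockIdx.x * blockDim.x + threadIdx.x) * 2;
        if (i < n)     C[i]     = A[i]     + B[i];
        if (i + 1 < n) C[i + 1] = A[i + 1] + B[i + 1];
    }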
7. We want to use each thread to calculate two output elements of a vector addition. Each thread
block processes 2*blockDim.x consecutive elements that form two sections. All threads in each
block will first process a section, each processing one element. They will then all move to the next
section, again each processing one element. Assume that variable i should be the index for the
first element to be processed by a thread. What would be the expression for mapping the
thread/block indices to the data index of the first element?
(A) i=blockIdx.x*blockDim.x + threadIdx.x + 2;
(B) i=blockIdx.x*threadIdx.x*2;
(C) i=(blockIdx.x*blockDim.x + threadIdx.x)*2;
(D) i=blockIdx.x*blockDim.x*2 + threadIdx.x;
Answer: (D)
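Here all previous blocks together cover blockIdx.x*blockDim.x*2 elements, and within the first section each thread starts at offset threadIdx.x; its second element sits one full section (blockDim.x) later. A minimal kernel sketch (names assumed as above):

    __global__ void vecAddSections(const float *A, const float *B, float *C, int n) {
        // Element in the first section of this block's 2*blockDim.x-element range.
        int i = blockIdx.x * blockDim.x * 2 + threadIdx.x;
        if (i < n)
            C[i] = A[i] + B[i];
        // The same thread's element in the second section is blockDim.x positions away.
        if (i + blockDim.x < n)
            C[i + blockDim.x] = A[i + blockDim.x] + B[i + blockDim.x];
    }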
8. For a vector addition, assume that the vector length is 8000, each thread calculates one output
element, and the thread block size is 1024 threads. The programmer configures the kernel launch
to have a minimal number of thread blocks to cover all output elements. How many threads will
be in the grid?
(A) 8000
(B) 8196
(C) 8192
(D) 8200
Answer: (C)
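The minimal block count comes from a ceiling division, as in this launch sketch (vecAdd and the device arrays d_A, d_B, d_C are the assumed names from the earlier sketches):

    int n = 8000, blockSize = 1024;
    int numBlocks = (n + blockSize - 1) / blockSize;  // ceil(8000/1024) = 8 blocks
    // 8 blocks * 1024 threads/block = 8192 threads; 192 of them fail the i < n check.
    vecAdd<<<numBlocks, blockSize>>>(d_A, d_B, d_C, n);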
9. The following table shows CUDA function declarations. State where each function is executed and where it is callable from.

Declaration                  Executed on the:    Only callable from the:
__host__ float FuncC()       Host                Host
__global__ void FuncB()      Device              Host
__device__ float FuncA()     Device              Device
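A minimal sketch illustrating the three qualifiers; the function bodies are placeholders:

    __device__ float FuncA() { return 1.0f; }   // executes on the device, callable only from device code
    __global__ void FuncB() { FuncA(); }        // executes on the device, launched from the host
    __host__ float FuncC() { return 2.0f; }     // executes on the host, callable only from host code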