Part 4 - Easy Data Parallelism

This document discusses data parallelism in Java streams. It begins by explaining why parallelism is important due to multicore CPUs. It then defines data parallelism as distributing data over different processes to be processed simultaneously. The document provides examples of using parallel streams in Java to parallelize processing of data. It notes some pitfalls to avoid, such as interfering with data sources, misusing reduce, holding locks, or using mutable shared state. Overall it presents best practices for effectively parallelizing stream processing of data in Java.


Easy Data Parallelism

Richard Warburton
Raoul-Gabriel Urma
Overview
● Why is parallelism important?

● What is data parallelism?

● Parallelising your Streams

● Performance and Internals


Why is Parallelism important?
source: http://www.gotw.ca/images/CPU.png
Multicore
What is Data Parallelism?
Concurrency is not Parallelism!
● Concurrency
○ At least two threads are making progress
○ May not run at the same time
○ Eg: chrome and eclipse both running

● Parallelism
○ At least two threads are executing simultaneously
○ A specific case of concurrency
○ Eg: servlet container dealing with two users at
once on a multicore machine
Parallelism
● Task
○ Distribute execution over different processes
○ Threads and Executors in Java
○ Eg: each thread services a user in JEE App

● Data
○ Distribute data over different processes
○ Support built on top of Streams
○ Eg: process a payroll and give each core 100
employee’s salary
What are good data parallel problems?
● Big Batch Jobs

○ Transaction Processing

○ Analytics/Reporting

● Web crawlers / parsers

● Maths

○ Monte Carlo Simulations

○ Linear Algebra
What’s a good data parallel problem from your workplace?
Parallelising your Streams
Data Parallelism
● Useful
○ a lot of data
○ want to process in a similar way

● API aims to be explicit, but unobtrusive


○ .parallelStream()
○ .parallel()

● Can flip between sequential and parallel

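The flip is literal: `parallel()` and `sequential()` set a flag on the whole pipeline, and the last call before the terminal operation wins. A minimal sketch (the class name and data are ours):

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

public class FlipDemo {
    // The last parallel()/sequential() call before the terminal
    // operation wins: it sets the mode for the WHOLE pipeline.
    public static List<Integer> doubled(List<Integer> values) {
        return values.stream()
                .parallel()    // request parallel execution...
                .map(i -> i * 2)
                .sequential()  // ...but this call wins: runs sequentially
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        System.out.println(doubled(Arrays.asList(1, 2, 3))); // [2, 4, 6]
    }
}
```

Either way the result is the same; only the execution mode changes.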

Data Parallelism

// Replace stream() with parallelStream()


Set<String> origins = musicians
.parallelStream()
.filter(artist -> artist.getName().startsWith("The"))
.map(artist -> artist.getNationality())
.collect(toSet());
Not all serial code works in parallel.
DON’T interfere with data sources

// add the double of each value into the same list.

List<Integer> numbers = getNumbers();

numbers.parallelStream()
    .forEach(i -> numbers.add(i * 2)); // mutates the source while streaming it
Interfering with data sources: fixed

// keep each value, and add its double, into a new list.

List<Integer> numbers = getNumbers();

numbers = numbers.parallelStream()
    .flatMap(i -> Stream.of(i, i * 2))
    .collect(toList());
DON’T misuse reduce

int totalCost(List<Purchase> items) {
    // BUG: DELIVERY_FEE is not an identity value; in parallel it
    // is folded into every chunk, so it gets added more than once
    return items.parallelStream()
        .reduce(DELIVERY_FEE,
                (tally, item) -> tally + item.cost());
}
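Why this is a bug: in a parallel reduce the seed value is handed to every chunk of work, so anything other than a true identity is counted once per chunk, not once overall. A sketch with a made-up DELIVERY_FEE of 10 (all names here are ours, not the deck's Purchase example):

```java
import java.util.stream.IntStream;

public class ReduceSeedDemo {
    static final int DELIVERY_FEE = 10;

    // WRONG: the seed is combined into every parallel chunk, so the
    // fee can be added many times; the result depends on the split.
    public static int buggyTotal(int... costs) {
        return IntStream.of(costs).parallel()
                .reduce(DELIVERY_FEE, (tally, cost) -> tally + cost);
    }

    // RIGHT: reduce with the true identity (0), add the fee once.
    public static int correctTotal(int... costs) {
        return DELIVERY_FEE
                + IntStream.of(costs).parallel().reduce(0, Integer::sum);
    }
}
```

For costs 1..4 the correct total is 10 + 10 = 20; the buggy version is at least 20 and grows with every extra chunk the framework creates.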
Associativity

“you can regroup the operations and things still work”

(4 + 2) + 1 = 4 + (2 + 1) = 7
(4 * 2) * 1 = 4 * (2 * 1) = 8
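A parallel reduce regroups operands freely, so a non-associative operator like subtraction gives split-dependent answers. A small checker (class and method names are ours):

```java
import java.util.function.IntBinaryOperator;

public class AssocDemo {
    // Checks (a op b) op c == a op (b op c) for one triple of values.
    public static boolean isAssociative(IntBinaryOperator op, int a, int b, int c) {
        return op.applyAsInt(op.applyAsInt(a, b), c)
            == op.applyAsInt(a, op.applyAsInt(b, c));
    }

    public static void main(String[] args) {
        System.out.println(isAssociative((x, y) -> x + y, 4, 2, 1)); // addition: true
        System.out.println(isAssociative((x, y) -> x - y, 4, 2, 1)); // subtraction: false, (4-2)-1 = 1 but 4-(2-1) = 3
    }
}
```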
Identity

“the do nothing value”

0 + 5 = 5
1 * 5 = 5
How to fix reduce

int totalCost(List<Purchase> items) {


return DELIVERY_FEE
+ items.parallelStream()
.reduce(0,
(tally, item) -> tally + item.cost());
}
How to fix reduce (2)

int totalCost(List<Purchase> items) {


return DELIVERY_FEE
+ items.parallelStream()
.mapToInt(Purchase::cost)
.sum();
}
DON’T hold locks

List<Integer> values = getValues();


CountDownLatch latch = new CountDownLatch(values.size());

values.parallelStream()
    .forEach(i -> {
        try {
            doSomething(i);
            // Potential deadlock
            latch.countDown();
        } catch (Exception e) {
            e.printStackTrace();
        }
    });
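The latch is not only risky, it is redundant: a terminal operation like `forEach` does not return until every element has been processed, so there is nothing to wait for. A sketch of the lock-free version (names ours; a counter stands in for `doSomething`):

```java
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.atomic.AtomicInteger;

public class NoLatchDemo {
    // forEach on a parallel stream only returns once every element
    // has been processed, so no CountDownLatch is needed at all.
    public static int processAll(List<Integer> values) {
        AtomicInteger seen = new AtomicInteger();
        values.parallelStream().forEach(i -> {
            // doSomething(i) would go here
            seen.incrementAndGet();
        });
        return seen.get(); // all elements are done by the time we get here
    }
}
```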
No mutable state!

public static long sideEffectParallelSum(long n) {
    Accumulator accumulator = new Accumulator();
    LongStream.rangeClosed(1, n).parallel()
        .forEach(accumulator::add); // data race: total += value is not atomic
    return accumulator.total;
}

public static class Accumulator {
    private long total = 0;

    public void add(long value) {
        total += value;
    }
}
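The race-free fix is to let the stream keep per-thread partial results and combine them, instead of every thread hammering one shared field:

```java
import java.util.stream.LongStream;

public class SafeSum {
    // Race-free version of sideEffectParallelSum: sum() combines
    // per-chunk partial sums, so no shared mutable state is needed.
    public static long parallelSum(long n) {
        return LongStream.rangeClosed(1, n).parallel().sum();
    }
}
```

For n = 100 this reliably returns 5050; the side-effecting version may or may not, depending on how the threads interleave.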
Parallel Code Summary
● Very easy to make your code parallel,

but …

● Sometimes you can get away with things


sequentially that you can’t in parallel
○ sources
○ reduce
○ locks
○ unprotected mutable data
Performance and Internals
Under the hood

● Work distributed using Fork/Join framework

● Distributed by data

● New abstraction: Spliterator


Parallel Integer Sums

int sum =
values.parallelStream()
.mapToInt(i -> i)
.sum();
Spliterator
public interface Spliterator<T> {
    /** Carve off a portion of the data
        into a separate Spliterator */
    Spliterator<T> trySplit();

    /** Iterate the data described by this Spliterator */
    void forEachRemaining(Consumer<? super T> action);

    /** The size of the data described
        by this Spliterator, if known */
    long getExactSizeIfKnown();
}
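trySplit in action: on an ArrayList the size is known exactly, so one split carves the data cleanly in half (class name is ours):

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Spliterator;

public class SplitDemo {
    // Splits a list of n elements once and returns the two halves' sizes.
    public static long[] splitSizes(int n) {
        List<Integer> data = new ArrayList<>();
        for (int i = 0; i < n; i++) data.add(i);

        Spliterator<Integer> right = data.spliterator();
        Spliterator<Integer> left = right.trySplit(); // carves off the first half
        return new long[] {
            left.getExactSizeIfKnown(),  // 5 for n = 10
            right.getExactSizeIfKnown()  // 5 for n = 10
        };
    }
}
```

The framework applies trySplit recursively until the chunks are small enough to hand to worker threads.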
Always a tradeoff ...
● Parallelism eats more CPU time
○ Thread communication
○ Distributing & Decomposing work
○ Potentially increased memory pressure
○ Competing for the CPU with other processes

● It can reduce wall time


○ Time from beginning to end of the processes’
execution
○ Ideally only need to wait for 1/N of the execution
time
Decomposition Performance
● Data Size

● Source Data Structure

● Packing

● Number of Cores

● Cost per Element


Data Structures
● Good
○ ArrayList / IntStream.range / Stream.of
○ Random access + easy to balance
● Meh
○ HashSet / TreeSet
○ Usually good balance
● Bad
○ LinkedList / BufferedReader.lines() / Stream.iterate()
○ Unknown length
○ Bad random access performance
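The difference is visible in the Spliterator characteristics: a list-backed source reports SIZED, while an iterate-based one cannot know its length, which makes it hard to hand each core an even share of the work. A quick check (class name ours):

```java
import java.util.Arrays;
import java.util.Spliterator;
import java.util.stream.Stream;

public class SourceShapeDemo {
    // A list-backed source knows its exact size up front...
    public static boolean listIsSized() {
        return Arrays.asList(1, 2, 3).spliterator()
                .hasCharacteristics(Spliterator.SIZED);
    }

    // ...while an iterate-based source does not, so the framework
    // cannot balance the split in advance.
    public static boolean iterateIsSized() {
        return Stream.iterate(0, i -> i + 1).spliterator()
                .hasCharacteristics(Spliterator.SIZED);
    }
}
```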
Stateful Operations
● Stateless
○ no need to keep state when evaluated
○ eg: map, reduce
○ superior parallel decomposition
○ bounded amounts of data

● Stateful
○ accumulate state during evaluation
○ eg: sorted
○ unbounded caching of data
Benchmarking and Testing
● Don’t assume parallel = faster, measure it
● Use jmh:
http://openjdk.java.net/projects/code-tools/jmh/

● Best Practices
○ Warmup
○ Repeatability
○ Evade the JIT
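JMH is the right tool; the hand-rolled sketch below (all names ours) only illustrates the two practices above: run the task some warmup iterations first so the JIT compiles the hot path before timing starts, and keep a result alive so the compiler cannot elide the work.

```java
import java.util.function.LongSupplier;
import java.util.stream.LongStream;

public class TinyBench {
    // Illustration only -- use JMH for real measurements.
    public static long measure(LongSupplier task, int warmups, int runs) {
        long sink = 0;
        for (int i = 0; i < warmups; i++) sink += task.getAsLong(); // warmup: timings discarded
        long start = System.nanoTime();
        for (int i = 0; i < runs; i++) sink += task.getAsLong();    // measured runs
        long elapsed = System.nanoTime() - start;
        if (sink == 42) System.out.println(sink); // consume the result: evade dead-code elimination
        return elapsed / runs; // average nanoseconds per run
    }

    public static void main(String[] args) {
        long avg = measure(() -> LongStream.rangeClosed(1, 1_000).parallel().sum(), 20, 20);
        System.out.println(avg + " ns/run");
    }
}
```

JMH handles forking, repeatability, and statistical reporting as well, which this sketch deliberately does not.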
Summary
Lesson Summary

● Easy to obtain Data Parallelism

● Pick your situation well

● A lot of performance influencers

● Benchmark your parallel code


The End
Exercise
In: com.java_8_training.problems.data_parallelism

1. Look at OptimisationExample
2. Try to improve the performance of this code
3. Measure performance using the benchmark harness
4. Don’t make the code uglier!
Exercise
In: com.java_8_training.problems.data_parallelism

1. Parallelise the sum of squares method


Question1Test

2. Fix the bug in the "multiplyThrough" method


Question2Test

3. Remove the locks and keep the code safe


Question3Test
Amdahl’s Law
● Defines upper bound for parallel speedup

● Time(n) = Time(1) * (s + 1/n * (1 - s))


○ n = number of cores
○ s = proportion of code that is strictly serial

● Speedup(n) = 1 / (s + 1/n * (1 - s))

● Example
○ 1024 cores, 50% serial
○ 1 / (0.5 + 1/1024 * (1 - 0.5)) ~= 2x speedup
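The slide's formula, directly as code (class name ours), reproduces the example: with half the code strictly serial, even 1024 cores buy you barely a 2x speedup.

```java
public class Amdahl {
    // Speedup(n) = 1 / (s + (1 - s) / n)
    //   n = number of cores, s = proportion of strictly serial code
    public static double speedup(int cores, double serialFraction) {
        return 1.0 / (serialFraction + (1.0 - serialFraction) / cores);
    }

    public static void main(String[] args) {
        System.out.println(speedup(1024, 0.5)); // ~1.998: the 2x ceiling from the slide
        System.out.println(speedup(4, 0.0));    // 4.0: perfect scaling only when s = 0
    }
}
```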
