Programming Assignment: EE382C, Spring 2020

This programming assignment involves using the Booksim network simulator to conduct simulations on a Dragonfly topology. Students are asked to modify various topology and traffic parameters, collect simulation results, and analyze the impact on performance metrics. They must then improve on a given metric by choosing an alternative topology supported by Booksim. The second part of the assignment requires rewriting the fat tree model in Booksim to support additional parameters and routing functions. Students submit a report describing their steps, results, and conclusions.

Uploaded by

raj

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

66 views

Programming Assignment: EE382C, Spring 2020

Uploaded by

raj

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Programming Assignment

EE382C, Spring 2020

Submit: Generate a report that contains all necessary information to answer the stated questions.
Questions are given in a section labelled as such in each of the two parts of this assignment description.
Briefly describe your steps along with your results and conclusions. You may encounter concepts you
have not yet seen in class in detail but you are not required to have an understanding of those yet. Email
both the instructor and the TA with your report and files with the new code you wrote for the second
part of this assignment. This assignment allows teams of two. Clearly state the names and submit one
report per team. Your partner cannot be the same as the research paper assignment. This assignment is
worth 12% of your final grade.

Summary

The focus of this assignment is to familiarize yourself with Booksim, which is a cycle-accurate network
simulator we will use in this class. Booksim was created to accompany the book we use in this class,
hence the name.

This assignment has two parts. For the first part, this assignment has you conducting simulations on a
Dragonfly topology, modify various parameters, collect results, and reach conclusions. At the end you
will be asked to improve on a metric, such as performance, by choosing another topology from the ones
supported. The goal is to familiarize yourself with using the simulator and efficiently extracting results,
in order to prepare you for further assignments for which you will modify code.

In the second part, you are asked to re-write the fat tree (folded Clos) model inside Booksim. The
current model Booksim has is restrictive and has limited support for some parameters. For this new fat
tree, you will write an oblivious load-balancing routing function.

Other Simulators

If you are familiar with another simulator and would rather use that instead, you are welcome to. In that
case, make sure you answer the questions this assignment states, but you can ignore all the step-by-step
instructions. Make sure that simulator has the capability to satisfy this assignment. However, for the
sake of being able to choose the most suitable simulator for the rest of the class, it is worth giving
Booksim a try. The instructor is intimately familiar with Booksim but few others, so you may be on your
own if you try another simulator.

Booksim URL

https://github.com/booksim/booksim2
Contains source code and brief documentation.
Part 1: Dragonfly Topology

A Dragonfly topology is shown below.

In Booksim, parameter “n” is the dimension of the intra-group network (restricted to 1) and “k” is the
radix of each switch. From those parameters the rest of the configuration is derived. Please refer to
“dragonfly.cpp” in directory “networks” for more details. You’ll have to get into the habit of reading
source code since documentation in academic simulators is lacking.

Installing Booksim

Luckily, Booksim has very few dependencies and in most environments simply compiles by running the
included makefile in the “src” directory. The “src” directory contains the source code for all of Booksim.
That directory also has subdirectories for classes of a specific type, such as network topologies. Refer to
the class slides for an overview of the internal hierarchy. Note that many classes have child classes. For
instance, the class “trafficmanager” generates, injects, and ejects traffic. Some simulation types use
synthetic traffic such as uniform random, while others use trace files. Each of these is a different child
“trafficmanager” class. Here is a broad overview of each important source file:

• Batchtrafficmanager: a type of traffic manager where simulation ends after a predefined

number of packets, not injection rate.
• Booksim_config: Here you can find all the options the configuration file accepts as well as
default values. The default configuration scripts do not specify all values so you need to refer to
this class to figure out what is the default value when a configuration option is not specified.
• Buffer: A buffer class to store flits and packets at every input or output depending on the router
architecture.
• Buffer_state: Keeps track of where in the router pipeline the front flight of each buffer and VC is
(routing, allocation, etc).
• Config_utils: This defines the class that the rest of the code uses to access values for
configuration parameters.
• Credit: Defines a single credit that is transmitted and used to implement credit-based flow
control.
• Flit: A flit (flow control digit). The lowest unit that flow control sees.
• Flitchannel: A channel between routers that is used to transport flits.
• Injection: Implements an injection process that given an injection rate decides when to generate
and inject packets.
• Module: High level parent class.
• Outputset: Used to hold a range of options, such as multiple output ports and VCs, that a
routing function can return.
• Packetreplyinto: Auxiliary to record what packets are waiting for replies.
• Routefunc: Defines a set of standard routing functions. A routing function receives information
about the flit and the current router and returns a range of outputs and VCs that are valid
options. Each network class can define more routing functions so this class is not the only place
to find them.
• Stats: Keeps statistics of given values and reports median, histogram, etc.
• Traffic: Defines a collection of synthetic traffic patterns. A traffic pattern function takes as input
the source and returns a destination.
• Trafficmanager: This is the class that generates traffic (usually based on injection rate), injects to
the network (network class), ejects from the network, keeps statistics, and makes sure the
simulation runs correctly.
• Vc: Defines a VC class that is used in buffers to isolate traffic from different VCs.

A list of directories:

• Allocators: Defines a range of allocator classes that can be used in routers.

• Arbiters: Similarly but for arbiters. Allocators typically use arbiters.
• Examples: Example configuration files.
• Networks: Defines a collection of network topologies. Each topology has a “BuildNet” function
that instantiates routers and channels and connects them appropriately. Also, each network can
define extra routing functions specific to that network.
• Power: This is where power models reside. In booksim every network component such as a
router keeps track of its activity. At the end the power models attach an energy cost using
equation to each event in order to report power. These models also report area.
• Routers: A collection of routers. You are advised to only use the input queued router (iq_router)
because the others have not been tested recently.
Other directories of interest in Booksim are “runfiles”, which contains some example configuration files
to run the simulator with different topologies. “util” contains a bash script that invokes Booksim
multiple times and generates a latency versus throughput graph. You will likely have to modify or re-
write this script in another language during this class. Finally, “doc” contains a short manual for booksim
with more information.

Running Booksim

For this we will use “dragonflyconfig” in directory “examples”. Before you run booksim, add “stats_out =
<filename of your choice>.m. This will generate a matlab file with helpful statistics after each simulation
such as latency historgrams, packet latency histograms (plat), and others. To run booksim simply:

➢ ./booksim dragonflyconfig

The simulator will then generate an output to stdout (it is a good idea to redirect stdout to a file). That
will contain a printout of the configuration, statistics report at regular intervals (parameter
“sample_period”), and a final statistics report. Booksim can report statistics separately for traffic classes,
but by default there is only one class which is why you see a “class 0:” printout. These statistics are not
detailed, which is why “stats_out” is important. In that file, “sent_packets” is the rate at which each
source generates packets while “accepted_packets” is the rate at which those packets are admitted into
the network. “plat” is a histogram of packet latencies where if bin I equals A it means that A many
packets had a latency of I. Similarly there are two more histograms: flat for flit latencies and nlat for
packet network latencies (latencies without the time spent waiting in injection queues).

At the beginning of a simulation rate is the warmup phase where the network is filled with non-recorded
packets (will not change statistics) for the purpose of creating a realistic state for the recorded packets
that will follow in the main phase. Also, when a simulation is about to terminate because the pre-
defined number of cycles was reached, booksim continues to generate non-recorded packets. The
reason is that If booksim simply stopped generating packets, the last recorded packets would experience
an unrealistically empty network. This only occurs for simulations that use injection rate. If traffic is read
from a trace file booksim does not guess what packets could come before or after the ones in the trace
file.

It is important to understand when a simulation is considered stable. If the average latency keeps
increasing and does not stabilize after the warmup period (the duration of which is configurable),
booksim declares the simulation unstable and exists. This is meant to detect when the network is
saturated because it cannot satisfy the load it is receiving. If booksim does not report the simulation as
unstable, the network can handle the load and simulation begins.

In your debugging you may wish to track individual packets of flits. The easiest way to do that is to add
“watch_file = <filename>” in the configuration file and then create a file. That file has flit IDs, one per
line, that will be watched. You can also specify a packet ID by prefacing the ID with a p, e.g., “p34” is
packet ID 34. When a flit or packet is watched, booksim will report any action that is relevant to that flit
or packet.
Some configuration options that may be interest are “num_vcs” which specifies the number of VCs.
“vc_buf_size” specifies the buffer depth in flits per VC and per input.

(10 points) Part 1: Questions and Deliverables

The goal of part 1 is to have you use booksim, read parts of the source code to get acquainted, analyze
results, and change network configurations.

1. Read the Dragonfly source code and report what is the topology connectivity within each group
and across groups. This means figure out and describe how routers are connected to each other,
not just their radices. Also, how many routing functions does the Dragonfly have in the source
code and how do they work?
2. Now it’s time to run your first simulation! You can use the example configuration file but modify
the injection rate to 2% packet injection rate. You’ll have to be careful how to properly define
this so your injection rate is actually what this question asks for. For the requested injection
rate, is the network saturated? What is the average packet and flit latency? What is the median,
and standard deviation? What is the 99th percentile flit and packet latency? How many
measured packets were sent?
3. Now lets sweep the injection rate starting from 1% flit injection rate (not packet), and increase it
by 5% at a time (1%, 5%, 10%, etc) until you find the point where the network saturates. What is
that injection rate? What is the average and maximum latency at an injection rate right before
the network saturates? Plot the injection rate – average latency curve. Compare the offered
traffic versus ejected traffic at a point before network saturation and after. Are they equal? This
question asks you to remember the injection rate that saturated the network and the one
before it, and compare "sent_packets" and "accepted_packets" for each of those injection rates
from the matlab output file.
4. Now we will start modifying the network to figure out how its performance changes. For the
injection rate that you identified above, lets use adaptive routing. Does the network still
saturate? What is the average hop count with and without adaptive routing? If the network
saturates, reduce the injection rate until you find the new point of saturation. If it does not
saturate, increment until you find the same point. Does the new saturation point make sense in
relation to the old one?
5. Finally, repeat question 3 but now for a 2D mesh of the same size (same number of terminals). Is
the mesh better than the Dragonfly for this configuration? Why do you think? Since the mesh
has to be square it may not have the exact same number of terminals. In that case use a mesh
with the closest possible number of terminals.
Part 2: Fat Tree Topology (Folded Clos)

A picture of an example fat tree is shown above. There are two parameters of interest here: the number
of levels (only three are shown), and the connectivity radix in each level (lets call it “k”). As shown, k = 2
because each router connects to two other routers going up and two more going down. Note that this
topology has a hierarchy. Also, sources and destinations of traffic are only connected at the leaf routers
(i.e., this is an indirect topology). The number of sources and destinations that are connected to each
leaf router is k.

You are asked to implement a fat tree topology of any number of levels and any value of “k”. Some
combinations of parameters will be invalid and you should check for that. For instance, for convenience
you can check and return an error if a value of k and number of levels would create a network where not
all routers have the same number of inputs and outputs. Hint: calculate the number of inputs and
outputs for each router as a function of k and the number of levels.

For the topology you create, you will also create a routing function that is oblivious and load balancing.
That is, for packets traversing up, the routing function will choose at random and with equal probability
among all channels going up. Thankfully, you don’t need to check if the channels you choose among
provide a final path to your destination because if you construct your topology right, packets can go to
any destination once they start traversing in the down direction. Your routing function should not take
unnecessary hops. That is, if your source and destination share a router that is not at the top level,
packets should only go as high as that common router level (i.e., not go any higher than necessary). Note
that once packets start moving in the down direction, there is no path diversity anymore and packets
have only one choice. Once packets start moving in the down direction, they cannot switch to the up
direction.

Following how booksim is internally organized, almost all your code edits will be constrained into a
topology file. It is ok if you want to refer to or overwrite booksim’s existing fat tree model. As you will
see, there are really two functions that you need to edit. One is “BuildNet” which instantiates routers,
channels, and connects them as well as to sources and destinations appropriately. Also, the routing
function you will write will be its own standalone function in the same .cpp file. If you write a new
routing function, you will need to register it so that booksim recognizes its name if it’s given in the
configuration file. We strongly advise you to read and understand existing an existing topology file and
ask the instructor or TA questions to help you understand what is going on before you start writing
code.
(25 points) Part 2: Questions and Deliverables

Submit a report answering the questions below. Also submit the code that you wrote for this
assignment (the topology file .h and .cpp). The primary metric for this part 2 is correctness.

1. (10 points) Your first task is to make sure that your code works correctly. Show simulation
results for three, five, and seven levels with a k = 4 and k = 8. Use different injection rates. Does
your simulation return any errors? Do your results make sense based on your knowledge from
class?
2. (10 points) Sadly, bugs do not always trigger assertions. Therefore, prove that your load-
balancing routing algorithm works correctly. Remember that one expected result is that
channels in the up and down directions have comparable loads. You may have to insert statistics
and printouts.
3. (5 points) As we mentioned in class, a fully-sized fat tree should be able to provide 100% (full)
throughput for any traffic pattern. Following this expectation, run uniform random, transpose,
bitcomp and find the saturation rate of the topology like you did in part 1. Is it 100%? If not, why
do you think it is not? Hint: it may be a bug, but also consider differences between theory (what
we talked about in class) and reality (imperfections of an actual network).

System Verilog Interview Questions With Answers
100% (1)
System Verilog Interview Questions With Answers
10 pages
SCM - 11 01 2005
No ratings yet
SCM - 11 01 2005
18 pages
JNTU B.tech Computer Networks Lab Manual All Programs
100% (3)
JNTU B.tech Computer Networks Lab Manual All Programs
53 pages
Drill
100% (1)
Drill
2 pages
CSE 5311: Design and Analysis of Algorithms Programming Project Topics
No ratings yet
CSE 5311: Design and Analysis of Algorithms Programming Project Topics
3 pages
Lab 10 - Subprograms (Answers) PDF
No ratings yet
Lab 10 - Subprograms (Answers) PDF
6 pages
Cs336 Spring2024 Assignment2 Systems
No ratings yet
Cs336 Spring2024 Assignment2 Systems
30 pages
Data Parallel Patterns
No ratings yet
Data Parallel Patterns
9 pages
Assign 1-Statistical Summaries Using Pthreads
No ratings yet
Assign 1-Statistical Summaries Using Pthreads
4 pages
Systemverilog Interview Questions
100% (2)
Systemverilog Interview Questions
31 pages
Software Description
No ratings yet
Software Description
8 pages
XCS224N Assignment 3 Dependency Parsing
No ratings yet
XCS224N Assignment 3 Dependency Parsing
8 pages
CSE 4-589 - PA2 Handout
No ratings yet
CSE 4-589 - PA2 Handout
11 pages
High Performance Computing (HPC) Lec4
No ratings yet
High Performance Computing (HPC) Lec4
32 pages
Titanic: Mohit Kothari Roger Tanuatmadja Gautam Akiwate
No ratings yet
Titanic: Mohit Kothari Roger Tanuatmadja Gautam Akiwate
18 pages
Simulation and Modeling I: Assignment 3
No ratings yet
Simulation and Modeling I: Assignment 3
4 pages
Prep
No ratings yet
Prep
41 pages
Assignment: MBA - SEM IV Subject Code: MI0032 Java and Web Design Set II
No ratings yet
Assignment: MBA - SEM IV Subject Code: MI0032 Java and Web Design Set II
23 pages
What Is Callback?: Systemverilog&Uvm Interview Questions
100% (1)
What Is Callback?: Systemverilog&Uvm Interview Questions
53 pages
MCS 041 2011
No ratings yet
MCS 041 2011
7 pages
Destinationsof Benims Projems
No ratings yet
Destinationsof Benims Projems
6 pages
SystemC Questa Tutorial
No ratings yet
SystemC Questa Tutorial
11 pages
Howto
No ratings yet
Howto
74 pages
System Verilog Interview Questions
83% (6)
System Verilog Interview Questions
22 pages
Systemverilog Interview Questions
100% (1)
Systemverilog Interview Questions
39 pages
CN Lab Manual
75% (4)
CN Lab Manual
34 pages
CS+354+Spring+2024+Lab+3 v1
No ratings yet
CS+354+Spring+2024+Lab+3 v1
7 pages
Cs4/Msc Parallel Architectures Practical 2 - Cache Coherence Protocols
No ratings yet
Cs4/Msc Parallel Architectures Practical 2 - Cache Coherence Protocols
5 pages
ceng204_w8_systems_programming2024_spring
No ratings yet
ceng204_w8_systems_programming2024_spring
53 pages
Project - Cache Organization and Performance Evaluation
No ratings yet
Project - Cache Organization and Performance Evaluation
9 pages
ML Hota Assign5
No ratings yet
ML Hota Assign5
2 pages
Q.1 What Are Various Parameters of An Applet Tag. Answer:: Java and Web Design
No ratings yet
Q.1 What Are Various Parameters of An Applet Tag. Answer:: Java and Web Design
22 pages
314 Pthread Lab Assignment
No ratings yet
314 Pthread Lab Assignment
6 pages
Awr Report Analysis
No ratings yet
Awr Report Analysis
13 pages
File of SIMULATION OF NETWORK PROTOCOLS
No ratings yet
File of SIMULATION OF NETWORK PROTOCOLS
86 pages
Python unit - 2 answer bank
No ratings yet
Python unit - 2 answer bank
18 pages
Cache Lab
No ratings yet
Cache Lab
10 pages
Temple T
No ratings yet
Temple T
3 pages
Ten Simple Rules For Taking Advantage of Git and Github: Supplementary File S1
No ratings yet
Ten Simple Rules For Taking Advantage of Git and Github: Supplementary File S1
4 pages
Stack and Dynamic Memory
No ratings yet
Stack and Dynamic Memory
7 pages
1-Program Design and Analysis
No ratings yet
1-Program Design and Analysis
6 pages
Context Likelihood of Relatedness
No ratings yet
Context Likelihood of Relatedness
10 pages
TD Contiki PDF
No ratings yet
TD Contiki PDF
10 pages
Final
No ratings yet
Final
4 pages
GloMoSim Manual
No ratings yet
GloMoSim Manual
5 pages
Next Pathway Hack Backpackers Problem Statement
No ratings yet
Next Pathway Hack Backpackers Problem Statement
11 pages
Os Nguyenvanvietquang 20213583
No ratings yet
Os Nguyenvanvietquang 20213583
18 pages
ass2
No ratings yet
ass2
2 pages
Cachelab
No ratings yet
Cachelab
10 pages
Sheet3 Cms
No ratings yet
Sheet3 Cms
4 pages
Advanced OS Assignment 3 Thread Management-1
No ratings yet
Advanced OS Assignment 3 Thread Management-1
2 pages
Ee382M - Vlsi I: Spring 2009 (Prof. David Pan) Final Project
No ratings yet
Ee382M - Vlsi I: Spring 2009 (Prof. David Pan) Final Project
13 pages
ES Module-3
No ratings yet
ES Module-3
19 pages
SCS16L
No ratings yet
SCS16L
3 pages
INF_3201_h24_Assignment_1
No ratings yet
INF_3201_h24_Assignment_1
4 pages
ACM, Classic of The Month: On The Criteria To Be Used in Decomposing Systems Into Modules
No ratings yet
ACM, Classic of The Month: On The Criteria To Be Used in Decomposing Systems Into Modules
8 pages
Lab 5
No ratings yet
Lab 5
8 pages
C# Package Mastery: 100 Essentials in 1 Hour - 2024 Edition
From Everand
C# Package Mastery: 100 Essentials in 1 Hour - 2024 Edition
Tenko
No ratings yet
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
From Everand
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
Marcus Richards
No ratings yet
Ian Talks Python A-Z
From Everand
Ian Talks Python A-Z
Ian Eress
No ratings yet
Concurrent, Real-Time and Distributed Programming in Java: Threads, RTSJ and RMI
From Everand
Concurrent, Real-Time and Distributed Programming in Java: Threads, RTSJ and RMI
Badr Benmammar
No ratings yet
FireandSecurity Catalog
No ratings yet
FireandSecurity Catalog
56 pages
ABM Quiz 3
No ratings yet
ABM Quiz 3
3 pages
VOS3000 Details Pricing
No ratings yet
VOS3000 Details Pricing
13 pages
Indifference Curve Analysis
No ratings yet
Indifference Curve Analysis
36 pages
RESEARCH-FINAL
No ratings yet
RESEARCH-FINAL
35 pages
Technical Analysis: DR - Manish Dadhich Mba, Net, Set
No ratings yet
Technical Analysis: DR - Manish Dadhich Mba, Net, Set
51 pages
Low Salicylates
No ratings yet
Low Salicylates
12 pages
Engine Misfire
No ratings yet
Engine Misfire
6 pages
Shimadzu Spesification UV-1900
No ratings yet
Shimadzu Spesification UV-1900
1 page
Practical Obstetrics and Gynaecology Handbook for O G Clinicians and General Practitioners 2nd Edition Thiam Chye Tan - Read the ebook online or download it to own the full content
No ratings yet
Practical Obstetrics and Gynaecology Handbook for O G Clinicians and General Practitioners 2nd Edition Thiam Chye Tan - Read the ebook online or download it to own the full content
76 pages
AJAY Chhattisgarh - Company Final
No ratings yet
AJAY Chhattisgarh - Company Final
333 pages
(6C) Aircraft Structures
No ratings yet
(6C) Aircraft Structures
71 pages
Marine Biologist
No ratings yet
Marine Biologist
12 pages
Lecture 5 - Costs and Profit
No ratings yet
Lecture 5 - Costs and Profit
7 pages
How To Calculate Notice Timedkading Master For Stopping The Cargo... Here Is The Answer - MySeaTime
No ratings yet
How To Calculate Notice Timedkading Master For Stopping The Cargo... Here Is The Answer - MySeaTime
4 pages
Circular Economy in Spanish SMEs Challenges and Opportunities
100% (1)
Circular Economy in Spanish SMEs Challenges and Opportunities
11 pages
Adult Male Shirt Decals - Google Search
No ratings yet
Adult Male Shirt Decals - Google Search
1 page
IAC 16 - Regulatory Framework For Business Transactions
No ratings yet
IAC 16 - Regulatory Framework For Business Transactions
13 pages
Mystery School Code Review
No ratings yet
Mystery School Code Review
4 pages
Business Ethics and Corporate Social Responsibility: A Holistic Approach
100% (1)
Business Ethics and Corporate Social Responsibility: A Holistic Approach
6 pages
Report autoDNA WBAVU71040KG92706 PDF
No ratings yet
Report autoDNA WBAVU71040KG92706 PDF
6 pages
Catalogue Wonil - KOREA (English)
No ratings yet
Catalogue Wonil - KOREA (English)
68 pages
Perkinsrestaurant Menu PDF
No ratings yet
Perkinsrestaurant Menu PDF
12 pages
Er 82
No ratings yet
Er 82
2 pages
Poultry Industry in Moldova
No ratings yet
Poultry Industry in Moldova
5 pages
Deepak Parekh
No ratings yet
Deepak Parekh
6 pages
Ies Oradea
No ratings yet
Ies Oradea
50 pages
Laplana Rem Abagatnan v. Clarito
No ratings yet
Laplana Rem Abagatnan v. Clarito
3 pages
Headquarters: Microsoft Redmond Campus
No ratings yet
Headquarters: Microsoft Redmond Campus
2 pages