Lecture 1 Notes
In this module we are going to look at designing algorithms. We will see how
they depend on the design of suitable data structures, and how some structures
and algorithms are more efficient than others for the same task. We’ll concentrate
on a few basic tasks, such as storing, sorting and searching data, that underlie much
of computer science, but the techniques discussed will be applicable much more
generally. We will start by studying some key data structures, such as arrays, lists,
queues, stacks and trees, and then move on to explore their use in a range of
different searching and sorting algorithms. This will lead us on to consider
approaches for the efficient storage of data in hash tables. Finally, we’ll look at
graph based representations and cover the kinds of algorithms needed to work
efficiently with them. Throughout, we’ll investigate the computational efficiency
of the algorithms we develop, and gain intuitions about the pros and cons of the
various potential approaches for each task. We will not restrict ourselves to
implementing the various data structures and algorithms in particular computer
programming languages (e.g., Java, C, OCaml), but specify them in simple
pseudocode that can easily be implemented in any appropriate language.
In this module we shall ignore such programming details, and concentrate on the
design of algorithms rather than programs. The task of implementing the discussed
algorithms as computer programs is left to the Software Workshop module, and
you will frequently see the same topics covered in both modules from different
perspectives. Having said that, you will often find it useful to write down segments
of actual programs in order to clarify and test certain algorithmic aspects. It is also
worth bearing in mind the distinction between different programming paradigms:
Imperative Programming describes computation in terms of instructions that
change the program/data state, whereas Declarative Programming specifies what
the program should accomplish without describing how to do it.
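As a small illustration (a sketch in C-like code, with n assumed to be declared elsewhere), consider summing the numbers 1 to n described in each style:

/* Imperative: say HOW, as a sequence of steps that change state */
int sum = 0;
for( int i = 1 ; i <= n ; i++ )
{
    sum += i;
}

/* Declarative: say WHAT, leaving the "how" unspecified,
   e.g. "sum is the value 1 + 2 + ... + n" */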
This module is primarily concerned with developing algorithms that map easily
onto the imperative programming approach. Algorithms can obviously be
described in plain English, and we will sometimes do that. However, for computer
scientists it is usually easier and clearer to use something that comes somewhere in
between formatted English and computer program code, but is not runnable
because certain details are omitted. This is called pseudocode. Often we will use
segments of pseudocode that are very similar to the languages we are interested in,
e.g. the overlap of C and Java, with the advantage that they can easily be inserted
into runnable programs.
For each algorithm we develop, three aspects will concern us: its specification, its verification, and its performance. The details of these three aspects will usually be rather problem dependent.
The specification should formalize the crucial details of the problem that the
algorithm is trying to solve. Sometimes that will be based on a particular
representation of the associated data, sometimes it will be presented more
abstractly. Typically, it will have to specify how the inputs and outputs of the
algorithm are related, though there is no general requirement that the specification
is complete or unambiguous.
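For example, a specification for the minimum-finding problem we shall meet at the end of this chapter might read: given an integer n ≥ 1 and an array a containing n numbers, return a value min such that min ≤ a[i] for every index i from 0 to n − 1, and min = a[j] for at least one such index j. Note that this says what the output must satisfy, not how it is to be computed.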
For simple problems, it is often easy to see that a particular algorithm will always
work, i.e. that it satisfies its specification. However, for more complicated
specifications and/or algorithms, the fact that an algorithm satisfies its specification
may not be obvious at all. In this case, we need to spend some effort verifying
whether the algorithm is indeed correct. In general, testing on a few particular
inputs can be enough to show that the algorithm is incorrect. However, since the
number of different potential inputs for most algorithms is infinite in theory, and
huge in practice, more than just testing on particular cases is needed to be sure that
the algorithm satisfies its specification. We need correctness proofs. Although we
will discuss proofs in this module, and useful relevant ideas like invariants, we will
usually only do so in a rather informal manner (though, of course, we will attempt
to be rigorous). The reason is that we want to concentrate on the data structures
and algorithms. Formal verification techniques are complex and will be taught in
later modules.
Finally, the efficiency or performance of an algorithm relates to the resources
required by it, such as how quickly it will run, or how much computer memory it
will use. This will usually depend on the problem instance size, the choice of data
representation, and the details of the algorithm. Indeed, this is what normally
drives the development of new data structures and algorithms. We shall study the
general ideas concerning efficiency in Chapter 5, and then apply them throughout
the remainder of the module.
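As a small foretaste (a sketch in C-like code, with n assumed to be declared elsewhere): the sum 1 + 2 + ... + n can be computed with a loop whose running time grows in proportion to n, or in a fixed number of operations using a well-known closed formula:

/* Version 1: one addition per iteration, so time proportional to n */
int sum1 = 0;
for( int i = 1 ; i <= n ; i++ )
{
    sum1 += i;
}

/* Version 2: the same result in constant time, since
   1 + 2 + ... + n = n(n+1)/2 */
int sum2 = n * (n + 1) / 2;

Both versions satisfy the same specification, but their efficiency differs considerably for large n.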
Data structures, abstract data types, design patterns
A data structure is a particular way of organizing a collection of data items, and an abstract data type specifies the operations that may be performed on it, independently of how they are implemented. At an even higher level of abstraction are design patterns, which describe the design of algorithms rather than the design of data structures. These embody and
generalize important design concepts that appear repeatedly in many problem
contexts. They provide a general structure for algorithms, leaving the details to be
added as required for particular problems. These can speed up the development of
algorithms by providing familiar proven algorithm structures that can be applied
straightforwardly to new problems. We shall see a number of familiar design
patterns throughout this module.
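One such pattern (named here just as an illustration) is divide and conquer, whose general structure can be sketched in pseudocode as:

solve( P ) {
    if ( P is small enough ) return the solution of P directly
    split P into subproblems P1, ..., Pk
    solve each Pi recursively
    combine the sub-solutions into a solution for P
}

The details of how to split, solve directly, and combine are then filled in for each particular problem.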
Overview
This module will cover the principal fundamental data structures and algorithms
used in computer science, and bring together a broad range of topics covered in
other modules into a coherent framework. Data structures will be formulated to
represent various types of information in such a way that it can be conveniently
and efficiently manipulated by the algorithms we develop. Throughout, the
recurring practical issues of algorithm specification, verification and performance
analysis will be discussed.
We shall begin by looking at some widely used basic data structures (namely
arrays, linked lists, stacks and queues), and the advantages and disadvantages of
the associated abstract data types. Then we consider the ubiquitous problem of
searching, and how that leads on to the general ideas of computational efficiency
and complexity. That will leave us with the necessary tools to study three
particularly important data structures: trees (in particular, binary search trees and
heap trees), hash tables, and graphs. We shall learn how to develop and analyse
increasingly efficient algorithms for manipulating and performing useful
operations on those structures, and look in detail at developing efficient processes
for data storing, sorting, searching and analysis. The idea is that once the basic
ideas and examples covered in this module are understood, dealing with more
complex problems in the future should be straightforward.
Arrays, Iteration, Invariants
Data is ultimately stored in computers as patterns of bits, though these days most
programming languages deal with higher level objects, such as characters, integers,
and floating point numbers. Generally, we need to build algorithms that manipulate
collections of such objects, so we need procedures for storing and sequentially
processing them.
Arrays
In computer science, the obvious way to store an ordered collection of items is as
an array. Array items are typically stored in a sequence of computer memory
locations, but to discuss them, we need a convenient way to write them down on
paper. We can just write the items in order, separated by commas and enclosed by
square brackets. Thus,
[1, 4, 17, 3, 90, 79, 4, 6, 81]
is an example of an array of integers. If we call this array a, we can write it as:
a = [1, 4, 17, 3, 90, 79, 4, 6, 81]
This array a has 9 items, and hence we say that its size is 9. In everyday life, we
usually start counting from 1. When we work with arrays in computer science,
however, we more often (though not always) start from 0. Thus, for our array a, its
positions are 0, 1, 2, ..., 7, 8. The element in position 8 is 81, and we use the
notation a[8] to denote this element. More generally, for any integer i denoting a
position, we write a[i] to denote the element in the i th position. This position i is
called an index (and the plural is indices). Then, in the above example, a[0] = 1,
a[1] = 4, a[2] = 17, and so on.
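In C or Java-like code (a sketch; the exact declaration syntax varies between languages), this array and its element access could be written as:

int a[] = { 1, 4, 17, 3, 90, 79, 4, 6, 81 };
// a[0] is 1, a[1] is 4, a[8] is 81
// a[9] would be an error: the valid indices run from 0 to 8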
It is worth noting at this point that the symbol = is quite overloaded. In
mathematics, it stands for equality. In most modern programming languages, =
denotes assignment, while equality is expressed by ==. We will typically use = in
its mathematical meaning, unless it is written as part of code or pseudocode.
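For example, in C or Java:

x = 5;            // assignment: the variable x now holds the value 5
if ( x == 5 )     // equality test: evaluates to true at this point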
We say that the individual items a[i] in the array a are accessed using their index i,
and one can move sequentially through the array by incrementing or decrementing
that index, or jump straight to a particular item given its index value. Algorithms
that process data stored as arrays will typically need to visit systematically all the
items in the array, and apply appropriate operations on them. This is often expressed in pseudocode as:
For i = 1,...,N,
   do something
In programming languages like C and Java this would be written as the for-loop
for( i = 0 ; i < N ; i++ )
{
// do something
}
in which a counter i keeps track of doing “the something” N times. For example,
we could compute the sum of all 20 items in an array a using
for( i = 0, sum = 0 ; i < 20 ; i++ ) {
sum += a[i];
}
We say that there is iteration over the index i. The general for-loop structure is
for( INITIALIZATION ; CONDITION ; UPDATE )
{
REPEATED PROCESS
}
in which any of the four parts are optional. One way to write this out explicitly is
INITIALIZATION
if ( not CONDITION ) go to LOOP FINISHED
LOOP START
REPEATED PROCESS
UPDATE
if ( CONDITION ) go to LOOP START
LOOP FINISHED
In this module, we will regularly make use of this basic loop structure when
operating on data stored in arrays, but it is important to remember that different
programming languages use different syntax, and there are numerous variations
that check the condition to terminate the repetition at different points.
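For example, C and Java also provide a do-while loop, which checks the condition after the repeated process rather than before, so the body is always executed at least once:

INITIALIZATION
do {
    REPEATED PROCESS
    UPDATE
} while ( CONDITION );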
Invariants
An invariant, as the name suggests, is a condition that does not change during
execution of a given program or algorithm. It may be a simple inequality, such as
“i < 20”, or something more abstract, such as “the items in the array are sorted”.
Invariants are important for data structures and algorithms because they enable
correctness proofs and verification.
In particular, a loop-invariant is a condition that is true at the beginning and end of
every iteration of the given loop. Consider the standard simple example of a
procedure that finds the minimum of n numbers stored in an array a:
float minimum(int n, float a[n]) {
   float min = a[0];
   // min equals the minimum item in a[0],...,a[0]
   for(int i = 1 ; i != n ; i++) {
      // min equals the minimum item in a[0],...,a[i-1]
      if (a[i] < min) min = a[i];
   }
   // min equals the minimum item in a[0],...,a[i-1], and i==n
   return min;
}
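For example, applied to the array from earlier (assuming the C-style signature above):

float a[] = { 1, 4, 17, 3, 90, 79, 4, 6, 81 };
float min = minimum(9, a);   // min is now 1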
At the beginning of each iteration (and hence at the end of the one before it), the invariant
“min equals the minimum item in a[0], ..., a[i − 1]” is true: it starts off true, and
the repeated process and update clearly maintain its truth. Hence, when the loop
terminates with “i == n”, we know that “min equals the minimum item in a[0], ...,
a[n − 1]” and hence we can be sure that min can be returned as the required
minimum value. This is a kind of proof by induction: the invariant is true at the
start of the loop, and is preserved by each iteration of the loop, therefore it must be
true at the end of the loop.
As we noted earlier, formal proofs of correctness are beyond the scope of this
module, but identifying suitable loop invariants and their implications for
algorithm correctness as we go through this module will certainly be a useful
exercise. We will also see how invariants (sometimes called inductive assertions)
can be used to formulate similar correctness proofs concerning properties of data
structures that are defined inductively.