Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

DS CH 1 PDF

Download as pdf or txt
Download as pdf or txt
You are on page 1of 15

1. Accept n eles from user and display it. mXn eles from user and display it.

2. Disp max ele from array. N eles e.g how many eles? 5 enter els -> 20 10 100 50
5 => 100 max
3. Addn of two matrixes [2][2] [3][3] num rows and cols => 3 [3][3]
4. Sort eles of array (1D) => 5 10 20….
5. Accept n eles from user and display it. mXn eles from user and display it.
Using function.

Chapter 1. Introduction to Data Structures and Algorithm Analysis

Need of Data Structures

As applications are getting complexed and amount of data is increasing day by day,
there may arise the following problems:

Processor speed: To handle very large amount of data, high speed processing is
required, but as the data is growing day by day to the billions of files per entity,
processor may fail to deal with that much amount of data.

Data Search: Consider an inventory size of 106 items in a store, If our application
needs to search for a particular item, it needs to traverse 106 items every time,
results in slowing down the search process.

Multiple requests: If thousands of users are searching the data simultaneously on a


web server, then there are the chances that a very large server can be failed during
that process

in order to solve the above problems, data structures are used. Data is organized to
form a data structure in such a way that all items are not required to be searched
and required data can be searched instantly.

Data: Data can be defined as an elementary value or the collection of values, for
example, student's name and its id are the data about the student.

An ADT (abstract data type) is a mathematical model of a data structure that


specifies the type of data stored, the operations supported on them, and the types
of parameters of the operations. An ADT specifies what each operation does, but
not how it does it. Typically, an ADT can be implemented using one of many
different data structures. A useful first step in deciding what data structure to use in
a program is to specify an ADT for the program.
Formally, an ADT may be defined as a "class of objects whose logical behaviour is
defined by a set of values and a set of operations";

In general, the steps of building ADT to data structures are:

 Understand and clarify the nature of the target information unit.


 Identify and determine which data objects and operations to include in the
models.
 Express this property somewhat formally so that it can be understood and
communicate well.
 Translate this formal specification into proper language. In C++, this becomes
a .h file. In Java, this is called "user interface".
 Upon finalized specification, write necessary implementation. This includes
storage scheme and operational detail. Operational detail is expressed as
separate functions (methods).

Data Structure :

a data structure is a data organization, management, and storage format that


enables efficient access and modification.

More precisely, a data structure is a collection of data values, the relationships


among them, and the functions or operations that can be applied to the data,i.e., it
is an algebraic structure about data.
Linear Data Structures: A data structure is called linear if all of its elements are
arranged in the linear order. In linear data structures, the elements are stored in
non-hierarchical way where each element has the successors and predecessors
except the first and last element.

Array ll, stack, que

10 20 40 50 100 30

Non Linear Data Structures: This data structure does not form a sequence i.e. each
item or element is connected with two or more other items in a non-linear
arrangement. The data elements are not arranged in sequential structure.

An algorithm is a set of steps of operations to solve a problem performing


calculation, data processing, and automated reasoning tasks. An algorithm is an
efficient method that can be expressed within finite amount of time and space.

Characteristics of Algorithms

o Input: It should externally supply zero or more quantities.


o Output: It results in at least one quantity.
o Definiteness: Each instruction should be clear and ambiguous.
o Finiteness: An algorithm should terminate after executing a finite number of
steps.
o Effectiveness: Every instruction should be fundamental to be carried out, in
principle, by a person using only pen and paper.
o Feasible: It must be feasible enough to produce each instruction.
o Flexibility: It must be flexible enough to carry out desired changes with no
efforts.
o Efficient: The term efficiency is measured in terms of time and space required
by an algorithm to implement. Thus, an algorithm must ensure that it takes
little time and less memory space meeting the acceptable limit of
development time.
o Independent: An algorithm must be language independent, which means that
it should mainly focus on the input and the procedure required to derive the
output instead of depending upon the language.
In theoretical analysis of algorithms, it is common to estimate their complexity in
the asymptotic sense, i.e., to estimate the complexity function for arbitrarily large
input. The term "analysis of algorithms" was coined by Donald Knuth.
Algorithm analysis is an important part of computational complexity theory,
which provides theoretical estimation for the required resources of an algorithm
to solve a specific computational problem. Most algorithms are designed to work
with inputs of arbitrary length. Analysis of algorithms is the determination of the
amount of time and space resources required to execute it.

Time complexity : The amount of time is required by the algorithm is called time
complexity.
Space Complexity : The amount of space / memory required by the algorithm is
called Space complexity.

Analysis of algorithm is the process of analysing the problem-solving capability of


the algorithm in terms of the time and size required (the size of memory for storage
while implementation). However, the main concern of analysis of algorithms is the
required time or performance. Generally, we perform the following types of
analysis −
 Worst-case − For 'n' input size, the worst-case time complexity can be
defined as the maximum amount of time needed by an algorithm to complete
its execution. The maximum number of steps taken on any instance of size n.
 Best-case − For 'n' input size, the best-case time complexity can be defined as
the minimum amount of time needed by an algorithm to complete its
execution. The minimum number of steps taken on any instance of size n.
 Average case − For 'n' input size, the average-case time complexity can be
defined as the average amount of time needed by an algorithm to complete
its execution. An average number of steps taken on any instance of size n.

Asymptotic Analysis of algorithms (Growth of function)

Resources for an algorithm are usually expressed as a function regarding input.


Often this function is messy and complicated to work.

Let f (n) = an2+bn+c

In this function, the n2 term dominates the function that is when n gets sufficiently
large.
N F(2^n) F(n!)
1 2 1
2 4 2
3 8 6
4 16 24
5 32 120

n! > 2n for all n>=4


n! has higher growth rate than 2n.

Asymptotic notation:

The word Asymptotic means approaching a value or curve arbitrarily closely (i.e., as
some sort of limit is taken).

Asymptotic analysis

It is a technique of representing limiting behaviour. The methodology has the


applications across science. It can be used to analyse the performance of an
algorithm for some large data set.

Classification of Algorithms
o Constant Complexity:
It imposes a complexity of O(1). It undergoes an execution of a constant
number of steps like 1, 5, 10, etc. for solving a given problem. The count of
operations is independent of the input data size.
o Logarithmic Complexity:
It imposes a complexity of O(log(N)). It undergoes the execution of the order
of log(N) steps. To perform operations on N elements, it often takes the
logarithmic base as 2.
For N = 1,000,000, an algorithm that has a complexity of O(log(N)) would
undergo 20 steps (with a constant precision). Here, the logarithmic base does
not hold a necessary consequence for the operation count order, so it is
usually omitted.
o Linear Complexity:

o It imposes a complexity of O(N). It encompasses the same number of


steps as that of the total number of elements to implement an operation
on N elements.
For example, if there exist 500 elements, then it will take about 500
steps. Basically, in linear complexity, the number of elements linearly
depends on the number of steps. For example, the number of steps for
N elements can be N/2 or 3*N.
o It also imposes a run time of O(n*log(n)). It undergoes the execution of
the order N*log(N) on N number of elements to solve the given problem.
For a given 1000 elements, the linear complexity will execute 10,000
steps for solving a given problem.
o Quadratic Complexity: It imposes a complexity of O(n2). For N input data size,
it undergoes the order of N2 count of operations on N number of elements for
solving a given problem.
If N = 100, it will endure 10,000 steps. In other words, whenever the order of
operation tends to have a quadratic relation with the input data size, it results
in quadratic complexity. For example, for N number of elements, the steps are
found to be in the order of 3*N2/2.
o Cubic Complexity: It imposes a complexity of O(n3). For N input data size, it
executes the order of N3 steps on N elements to solve a given problem.
For example, if there exist 100 elements, it is going to execute 1,000,000 steps.
o Exponential Complexity: It imposes a complexity of O(2n), O(N!), O(nk), …. For
N elements, it will execute the order of count of operations that is
exponentially dependable on the input data size.
For example, if N = 10, then the exponential function 2N will result in 1024.
Similarly, if N = 20, it will result in 1048 576, and if N = 100, it will result in a
number having 30 digits. The exponential function N! grows even faster; for
example, if N = 5 will result in 120. Likewise, if N = 10, it will result in 3,628,800
and so on.

Increasing order of growth rate

o O(1) < O(log n) < O (n) < O(n log n) < O(n^2) < O (n^3)< O(2^n) < O(n!)
Constant < = Logarithmic <= Polynomial <= exponential <= Factorial

There are three notation of Time complexity :

1. Big-oh notation: Big-oh is the formal method of expressing the upper bound of an
algorithm's running time. It is the measure of the longest amount of time. The
function f (n) = O (g (n)) [read as "f of n is big-oh of g of n"] if and only if exist
positive constant c and such that

1. f (n) ⩽ k.g (n)f(n)⩽k.g(n) for n>n0n>n0 in all case

Hence, function g (n) is an upper bound for function f (n), as g (n) grows faster than f
(n)
For Example:
1. 1. 3n+2=O(n) as 3n+2≤4n for all n≥2
2. 2. 3n+3=O(n) as 3n+3≤4n for all n≥3

Hence, the complexity of f(n) can be represented as O (g (n))

2. Omega () Notation: The function f (n) = Ω (g (n)) *read as "f of n is omega of g of


n"] if and only if there exists positive constant c and n0 such that

F (n) ≥ k* g (n) for all n, n≥ n0

For Example:
f (n) =8n2+2n-3≥8n2-3
=7n2+(n2-3)≥7n2 (g(n))
Thus, k1=7

Hence, the complexity of f (n) can be represented as Ω (g (n))


3. Theta (θ): The function f (n) = θ (g (n)) *read as "f is the theta of g of n"+ if and only
if there exists positive constant k1, k2 and k0 such that

k1 * g (n) ≤ f(n)≤ k2 g(n)for all n, n≥ n0

For Example:
3n+2= θ (n) as 3n+2≥3n and 3n+2≤ 4n, for n
k1=3,k2=4, and n0=2

Hence, the complexity of f (n) can be represented as θ (g(n)).

Example1:
In the first example, we have an integer i and a for loop running from i equals 1 to n.
Now the question arises, how many times does the name get printed?
A()
{
int i,j;
for (i=1 to n)
printf("Edward");
}
Since i equals 1 to n, so the above program will print Edward, n number of times.
Thus, the complexity will be O(n).

Example2:
A()
{
int i, j:
for (i=1 to n) i=1 i<=5
for (j=1 to n) j=1 j<=5
printf("hello");
}
In this case, firstly, the outer loop will run n times, such that for each time, the inner
loop will also run n times. Thus, the time complexity will be O(n2).

Example3:
A()
{
i = 1; S = 1;
while (S<=n) 1 to n=5 6
{
i++;
S = S + i;
printf("Edward");
} 5
}

As we can see from the above example, we have two variables; i, S and then we
have while S<=n, which means S will start at 1, and the entire loop will stop
whenever S value reaches a point where S becomes greater than n.

Here i is incrementing in steps of one, and S will increment by the value of i, i.e., the
increment in i is linear. However, the increment in S depends on the i.

Initially;

i=1, S=1

After 1st iteration;

i=2, S=3

After 2nd iteration;

i=3, S=6 num3 add= 1+2+3 =6

After 3rd iteration;

i=4, S=10 … and so on. 4+3+2+1

Since we don't know the value of n, so let's suppose it to be k. Now, if we notice the
value of S in the above case is increasing; for i=1, S=1; i=2, S=3; i=3, S=6; i=4, S=10; …
Thus, it is nothing but a series of the sum of first n natural numbers, i.e., by the
time i reaches k, the value of S will be k(k+1)/2.

To stop the loop, has to be greater than n, and when we solve this equation,
we will get > n. Hence, it can be concluded that we get a complexity of O(√n) in
this case.

counting primitive operations

Primitive operations are basic computations performed by an algorithm. Examples


are evaluating an expression, assigning a value to a variable, indexing into an array,
calling a method, returning from a method, etc. They are easily identifiable in
pseudocode and largely independent from the programming language.

By inspecting the pseudocode, we can determine the maximum number of primitive


operations executed by an algorithm as a function of the input size. Think about the
worst case, best case, and average case.

Algorithm arrayMax(A, n):


Input: An array A of n
Number of operations:
integers
Output: the max element

2+
1 max = A[0] n+
2 for i = 1 to n-1 do (n-1)+(n-1)= 2(n-1)+ (including increment
3 if (A[i] > max) then max = A[i] counter)
4 return max 1
Total: 2+n+2(n-1)+1= 3+n+2(n-1)=3+n+2n-2= 3n-1
-1(const) ::::: 3n

The algorithm arrayMax executes about 8n - 3 primitive operations in the worst


case. Define:

 a = Time taken by the fastest primitive operation


 b = Time taken by the slowest primitive operation
 Let T(n) be the worst case time of arrayMax. Then a(8n - 3) <= T(n) <= b(8n - 3)
 The running time T(n) is bounded by two linear functions
 Changing the hardware/software environment will not affect the growth rate
of T(n)

For Recursive Program


An algorithm that calls itself is called recursive algorithm.

Consider the following recursive programs.

Example1:

1. A(n)
2. {
3. if (n>1)
4. return (A(n-1))
5. }

Solution;

Here we will see the simple Back Substitution method to solve the above problem.

T(n) = 1 + T(n-1) …Eqn. (1)


Step1: Substitute n-1 at the place of n in Eqn. (1)

T(n-1) = 1 + T(n-2) ...Eqn. (2)

Step2: Substitute n-2 at the place of n in Eqn. (1)

T(n-2) = 1 + T(n-3) …Eqn. (3)

Step3: Substitute Eqn. (2) in Eqn. (1)

T(n)= 1 + 1+ T(n-2) = 2 + T(n-2) …Eqn. (4)

Step4: Substitute eqn. (3) in Eqn. (4)

T(n) = 2 + 1 + T(n-3) = 3 + T(n-3) = …... = k + T(n-k) …Eqn. (5)

Now, according to Eqn. (1), i.e. T(n) = 1 + T(n-1), the algorithm will run until n>1.
Basically, n will start from a very large number, and it will decrease gradually. So,
when T(n) = 1, the algorithm eventually stops, and such a terminating condition is
called anchor condition, base condition or stopping condition.

Thus, for k = n-1, the T(n) will become.

Step5: Substitute k = n-1 in eqn. (5)

T(n) = (n-1) + T(n-(n-1)) = (n-1) + T(1) = n-1+1

Hence, T(n) = n or O(n).

Time Complexity Analysis | Tower Of Hanoi (Recursion)


Tower of Hanoi is a mathematical puzzle where we have three rods and n disks.
The objective of the puzzle is to move the entire stack to another rod, obeying the
following simple rules:
1) Only one disk can be moved at a time.
2) Each move consists of taking the upper disk from one of the stacks and placing it
on top of another stack i.e. a disk can only be moved if it is the uppermost disk on a
stack.
3) No disk may be placed on top of a smaller disk.
Pseudo Code

TOH(n, x, y, z)
{ Toh(3) t(1,x…)

T(1,z,y,x)
if (n >= 1)
Toh(2,x,z,y) toh(1,x,y,z)
toh(1,x,z,y) toh(2,z,y,x)
{
// put (n-1) disk to z by using y
TOH((n-1), x, z, y)

// move larger disk to right place


// move:x-->y
TOH(1, x, y, z)

// put (n-1) disk to right place


TOH((n-1), z, y, x)
}
}
Analysis of Recursion

Recursive Equation : ——-equation-1


Solving it by Backsubstitution :
———–equation-2
———–equation-3

Put the value of T(n-2) in the equation–2 with help of equation-3


——equation-4
Put the value of T(n-1) in equation-1 with help of equation-4

After Generalization :

Base condition T(1) =1


n–k=1
k = n-1
put, k = n-1
It is a GP series, and the sum is , or you can say which is exponential.
for 5 disks i.e. n=5 It will take 2^5-1=31 moves.
Time complexity is O(2^n)

https://www.baeldung.com/cs/towers-of-hanoi-complexity

Space Complexity:
The space complexity of a program is the amount of memory it needs to run to
completion. The space need by a program has the following components:
Instruction space: Instruction space is the space needed to store the compiled
version of the program instructions.
Data space: Data space is the space needed to store all constant and variable
values. Data space has two components:

instances.
Environment stack space: The environment stack is used to save information
needed to resume execution of partially completed functions.
Instruction Space: The amount of instructions space that is needed depends on
factors such as:
used to complete the program into machine code.

Algorithm Design Goals


The three basic design goals that one should strive for in a program are:
1. Try to save Time
2. Try to save Space
3. Try to save Face
A program that runs faster is a better program, so saving time is an obvious
goal. Like wise, a program that saves space over a competing program is considered
desirable. We want to “save face” by preventing the program from locking up or
generating reams of garbled data.

• Space Complexity Example:

• Algorithm abc(a,b,c) ,
return a+b++*c+(a+b-c)/(a+b) +4.0;
}
The Space needed by each of these algorithms is seen to be the sum of the
following component.
1.A fixed part that is independent of the characteristics (eg:number,size)of the
inputs and outputs.
The part typically includes the instruction space (ie. Space for the code), space for
simple variable and fixed-size component variables (also called aggregate) space
for constants, and so on.

2. A variable part that consists of the space needed by component variables whose
size is dependent on the particular problem instance being solved, the space
needed by referenced variables (to the extent that is depends on instance
characteristics), and the recursion stack space.
The space requirement s(p) of any algorithm p may therefore be written as,
S(P) = c+ Sp(Instance characteristics) Where ‘c’ is a constant.

Example 2:
Algorithm sum(a,n)
{
s=0.0;
for I=1 to n do
s= s+a[I];
return s;
}

elements to be summed. The space needed d by ‘n’ is one word, since it is of


type integer.
tyepe array of
floating point numbers.

elements to be summed.

• * n for a*+,one each for n,I a& s+

You might also like