F17XA Main
F17XA Main
F17XA Main
7XA
Mathematics for Engineers and Scientists 1
Course Notes
Name:
For use of students of the Heriot-Watt University enrolled in the subject F17XA/XE.
• Version 1: Compiled by Jack Carr and Des Johnston, July 2013.
• Version 2: Major updates by Thomas Wong, April, 2020.
F17XA Contents
Contents
1 Formulae 4
1.1 Working with Formulae . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.2 Units . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.3 Typesetting Mathematics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.4 Algebraic Fractions and Partial Fractions . . . . . . . . . . . . . . . . . . . . . 8
2 Logarithms and Exponentials 13
2.1 Indices and Exponents . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
2.2 Exponential Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
2.3 Logarithms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
2.4 Hyperbolic Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
3 Application of Linear and Log Functions 27
3.1 Equation of a Straight Line . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
3.2 Experimental Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
3.3 Log-Log and Log-Linear Laws . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
3.4 Interpolation and Extrapolation . . . . . . . . . . . . . . . . . . . . . . . . . . 33
4 Introduction to Differentiation 37
4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
4.2 Derivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
4.3 Product Rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
4.4 Quotient Rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
4.5 Chain Rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
4.6 Higher Order Derivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
4.7 Implicit Differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
4.8 Parametric Differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
4.9 Curve Sketching . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
4.10 Related Rates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
5 Introduction to Integration 72
5.1 Antidifferentiation (Indefinite Integration) . . . . . . . . . . . . . . . . . . . . . 72
5.2 Area Under Curves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
5.3 Improper Integrals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
6 Statistics and Probability 83
6.1 Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
6.2 Measures of Average . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
6.3 Standard Deviation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
6.4 Frequency Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
6.5 Probability Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
6.6 Probability Rules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
6.7 Tree Diagrams . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
6.8 Binomial Distribution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
6.9 Continuous Distribution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
6.10 Normal Distribution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
7 Vectors 103
2
F17XA Contents
3
F17XA 1 Formulae
1 Formulae
1.1 Working with Formulae
When we want to describe or calculate various phenomena in science and engineering, we often
employ mathematical formulae. We need to be able to extract the information we want from
a given formula by re-arranging it if necessary. Let us look at some examples of formulae from
various subjects:
• In physics, the period T of a pendulum of length L:
s
L
T = 2π
g
• In physics and engineering, the distance s travelled in a time t with a constant acceleration
a and initial velocity u:
1
s = ut + at2
2
• To convey our lovely Scottish weather to our North American friends, we have to relate
temperature in Centigrade (C) to Fahrenheit (F ) by:
9C
F = 32 +
5
We take the formulae above as examples and attempt to change the subject of each formula.
Exercise 1.1
q
Given that T = 2π L
g
. Express L in terms of T, g.
Solution: Eliminate the square root by squaring both sides of the equation.
s !2
2 L
T = 2π
g
L
= 4π 2
g
Multiply both sides by g
4π 2
g g L
2
× T 2 = 2 × 4π 2
4π 4π g
and cancel out to get
gT 2
L=
4π 2
4
F17XA 1 Formulae
Exercise 1.2
A particle with initial velocity u and constant acceleration a travels a distance s in time t
where s = ut + 12 at2 . Express u in terms of s, a and t.
Solution: Starting with the original equation, we rearrange (with steps in brackets)
1
s = ut + at2
2
1 1
s − at2 = ut − at2
2 2
s − 21 at2
=u (÷t)
t
s at
− =u Simplify
t 2
So we get u = s
t
− at
2
.
Exercise 1.3
The formula for converting Centigrade (C) to Fahrenheit (F ) is
9C
F = 32 +
5
Express C in terms of F .
Remark 1.4
When manipulating equations, we must always do the same thing to both sides of the
equation
5
F17XA 1 Formulae
1.2 Units
We won’t emphasise it heavily in this subject, but most applications of mathematics in science
and engineering involves units. We also have to contend with prefixes such as milli, micro, and
nano. In addition, there is often a mixture of measurement systems such as feet/metres or
pounds/kilograms. Errors in converting units have resulted in serious consequences:
• In 2001, the Los Angeles Zoo lent a tortoise, to a Exotic Animal Training Centre. The L.A.
Zoo said that the tortoise was big with weighing 250. The college thought the zoo meant
250 pounds and built an enclosure to match this. In fact the animal was a 75-year-old
Galapagos tortoise weighing 250 kilograms and it wrecked the enclosure on the first night
in its new home.
• An Air Canada flight was fuelled in Toronto using pounds instead of litres of fuel. The pilot
calculated how much he needed thinking he was getting his fuel in litres and ran out of fuel
about an hour into the flight. He successfully landed the plane safely at a semi-abandoned
airstrip.
• In 2003, the Space Mountain roller coaster at Tokyo Disneyland derailed when an axle broke
just before the end of the ride. The axle fractured because it was smaller than the design’s
requirement; the gap between the bearing and the axle was over 1mm - when it should have
been a 0.2mm gap. The error was in converting measurements to the metric system.
• Many medication errors are due to conversion and calculation errors In 1999, the Institute
for Safe Medication Practices reported an instance where a patient had received 0.5 grams
of Phenobarbital (a sedative) instead of 0.5 grains when the recommendation was misread.
A grain is a unit of measure equal to about 0.065 grams.
• In 1999 NASA lost a Mars orbiter because one team used metric units for a calculation and
the other team used imperial units. This loss cost over 125 million dollars.
6
F17XA 1 Formulae
Example 1.6
We can enter the expression:
4(32 − 1)
x= q √
1+ 2+ 2
Using x=(4*(3ˆ2-1))/(1+sqrt(2+sqrt(2)))
Be aware of order of operations, use brackets appropriately, and pair off brackets appropriately.
Be aware of log(x). Some packages use log(x) = log10 (x). When using a new package, it is
good practice to evaluate log(10) and log(e) to see which logarithm is used.
Example 1.8
We can enter the expression !
3+x
x log √
x
using x*log((3+x)/sqrt(x))
Example 1.9
We can enter the expression √
1−2x2
e1−
using exp(1-sqrt(1-2*xˆ2))
Example 1.10
We can enter the expression
πx − 4
sin2
2+x
using (sin((pi*x-4)/(2+x)))ˆ2
7
F17XA 1 Formulae
x3 − 3x2 − 2x + 1 or − x2 + 2x + 3
An Algebraic fraction is the ratio of two polynomials. If the numerator (top) has a strictly lower
power of x than the denominator, then the fraction is proper, otherwise it is improper.
Example 1.11
• Examples of proper algebraic fractions include:
1 x 3−x
, ,
x 1 − x2 x4 + 3x2 + 2x + 1
x7 x2 x3 − 3x2 − 2x + 1
, ,
1−x 1 − x2 −x2 + 2x + 3
Often we are interested in either splitting up fractions or combining them. In particular, splitting
up algebraic fractions is useful for simplifying integrals and this method is usually called the
method of partial fractions.
Example 1.12
The expression x−1
2 5
+ x+4 can be combined into a single fraction by introducing the common
denominator (x − 1)(x + 4):
2 5 2(x + 4) + 5(x − 1)
+ =
x−1 x+4 (x − 1)(x + 4)
7x + 3
=
(x − 1)(x + 4)
The method of partial fractions provides a method of performing the reverse operation. First,
consider the case when the denominator of the fraction can be expressed as a product of two
linear factors and the numerator is linear.
We will illustrate two common methods for partial fractions via the example below
Exercise 1.13
Express 7x+3
(x−1)(x+4)
in terms of partial fractions.
8
F17XA 1 Formulae
Solution: We start by splitting up the fraction based on the factorisation of the denomi-
nator with unknowns A, B. Let
7x + 3 A B
= +
(x − 1)(x + 4) x−1 x+4
We have to find the values of A and B so the above is true for all x.
Clearing denominators on the right hand side:
7x + 3 = A(x + 4) + B(x − 1)
Since this equation is true for all x, we choose values of x to make the term involving A
or the term involving B disappear.
• Set x = 1. Then 7 × 1 + 3 = A(1 + 4) + 0. Thus 10 = 5A and A = 2.
• Set x = −4 then 7 × (−4) + 3 = 0 + B(−4 − 1). Thus −25 = −5B and B = 5.
So we get
7x + 3 2 5
= +
(x − 1)(x + 4) x−1 x+4
7x + 3 = A(x + 4) + B(x − 1)
7x + 3 = Ax + 4A + Bx − B
= (A + B)x + (4A − B)
Since this is true for all x values, this means the coefficients of each power of x on both
9
F17XA 1 Formulae
[x1 ] : 7=A+B
[x0 ] : 3 = 4A − B
Exercise 1.14
Express 2x−1
(2x+1)(x−3)
in terms of partial fractions.
Solution: Let
2x − 1 A B
= +
(2x + 1)(x − 3) 2x + 1 x − 3
Combine denominators and equate numerators
2x − 1 = A(x − 3) + B(2x + 1)
To obtain A and B:
• Set x = 3 to get 6 − 1 = 0 + B(6 + 1). Hence 5 = 7B, and so B = 57 .
• Set x = − 12 to get −1 − 1 = A(− 12 − 3). Hence −2 = − 7A
2
, and so A = 74 .
So we get
2x − 1 4 5
= +
(2x + 1)(x − 3) 7(2x + 1) 7(x − 3)
Exercise 1.15
Express 10x+18
(2x+3)2
in terms of partial fractions.
Solution: Let
10x + 18 A B
= +
(2x + 3)2 2x + 3 (2x + 3)2
10
F17XA 1 Formulae
10x + 18 = A(2x + 3) + B
Example 1.16
For numbers l, m, n, p, q, r, s, t and unknowns A, B, C, we have the following partial fractions
expressions:
mx + n A B
= +
(px + q)(rx + s) px + q rx + s
mx + n A B
= +
(px + q)2 px + q (px + q)2
lx2 + mx + n A Bx + C
= + 2
(px + q)(rx2 + sx + t) px + q rx + sx + t
lx2 + mx + n A B C
= + +
(px + q)2 (rx + s) px + q (px + q)2 rx + s
Exercise 1.17
Express 9x+3
(x−1)(x2 +x+10)
in terms of partial fractions.
Solution: Let
9x + 3 A Bx + C
= + 2
(x − 1)(x + x + 10)
2 x − 1 (x + x + 10)
Combining denominators and then equating numerators gives:
Hence:
9x + 3 1 7−x
= + 2
(x − 1)(x + x + 10)
2 x − 1 (x + x + 10)
11
F17XA 1 Formulae
In general, we can express a proper1 algebraic fraction in terms of partial fractions. We illustrate
the general method via an example.
4(x2 + x + 1)
(x + 1)3 (x2 + 1)2
4(x2 + x + 1) A1 A2 A3 B1 x + C1 B2 x + C2
3 2 2
= + 2
+ 3
+ + 2
(x + 1) (x + 1) (x + 1) (x + 1) (x + 1) (x2 + 1) (x + 1)2
4(x2 + x + 1) 1 1 1 −x 1−x
3 2 2
= + 2
+ 3
+ 2 + 2
(x + 1) (x + 1) (x + 1) (x + 1) (x + 1) (x + 1) (x + 1)2
1
If the given algebraic fraction is improper, an additional step call polynomial long division is required before
we can apply partial fraction.
12
F17XA 2 Logarithms and Exponentials
a2 = a × a b3 = b × b × b c1 = c
In the case where a < 0, there are values of m, n for which am and an are not well-defined.
3
For example (−1) 2 is not well-defined.
Example 2.2
In the case with a = 2, we have:
x -2 -1 0 1 2 3 4
2x 1
4
1
2
1 2 4 8 16
y
20
15
10
x
-2 -1 1 2 3 4
13
F17XA 2 Logarithms and Exponentials
For reasons that will become obvious later, base e ≈ 2.7182818284 is of special interest. The
number e is called Euler’s number. The exponential function y = ex is sometimes written as
exp(x).
Example 2.3
Values of exponential function in base e:
x -2 -1 0 1 2 3 4
ex 0.14 0.37 1 2.72 7.39 20.1 54.6
y
60
50
40
30
20
10
x
-2 -1 1 2 3 4
The function y = ex is called the exponential function. On a calculator , the exponential function
is usually written as ex . The reader should use the ex button on a calculator to check the above
table. We can also write ex as exp(x) . The alternative notation exp(x) is useful for complicated
expressions like exp(−5x2 + 3x). One feature of ex is that it increases very rapidly with x, such
an increase has passed into common usage as an “exponential growth”.
The usual laws of indices apply to ex so we have
e0 = 1 (ex )2 = e2x ex · ey = ex+y
Exercise 2.4
Let f (t) = 3 exp(−5t2 ). Find f (0) and f (0.4).
Solution:
• For the first case, f (0) = 3e0 = 3.
• When t = 0.4, 5t2 = 0.8. Using a calculator f (0.4) = 3e−0.8 = 1.35.
14
F17XA 2 Logarithms and Exponentials
Exercise 2.5
Expand and simplify (e4x − e−4x )2 .
As we saw in Example 2.3, exp(x) is rapidly increasing, similarly exp(−x) is rapidly decreasing.
This remains true for expressions such as A exp(kx) (increasing for positive constants A and k)
and A exp(−kx) (decreasing for positive constants A and k). It is useful to be able to sketch
the behaviour of various exponential curves since they occur in so many physical problems. Let
us consider a couple of examples.
Exercise 2.6
Sketch the graph of y = 6e−2t for t ≥ 0.
2 6 exp(−2x)
x
1 2 3 4
Exercise 2.7
Sketch the graph of y = 7 − 4e−2t for t ≥ 0.
15
F17XA 2 Logarithms and Exponentials
4 7 − 4 exp(−2t)
x
1 2 3 4
Exponential functions often appear in physical problems where the rate of increase or decrease of
a quantity is depend on the value of the quantity at the time. An archetypal example of this is
radioactive decay, where the exponential decrease is with time.
A very similar calculation can be used to calculate how air pressure falls off with height.
Exercise 2.8
Atmospheric pressure, when measured in atmospheres, varies according to the height (h)
above the Earth’s surface. The formula is
P (h) = exp(−Ah)
where A = 0.000016 (i.e. 1.6 × 10−5 ), and h is measured in metres. At sea level h = 0
and P = 1 atmosphere.
Determine the air pressure at:
• The summit of Ben Nevis (h = 1344m)
• The summit of Everest (h = 8848m)
• Heriot-Watt Riccarton campus (h = 90m)
16
F17XA 2 Logarithms and Exponentials
Solution:
• The summit of Ben Nevis (h = 1344m)
Example 2.9
An electrical circuit is set up with a resistor R and capacitor C in series with a direct current
source V .
C
17
F17XA 2 Logarithms and Exponentials
Q
3
t
1 2 3 4 5 6 7 8
2.3 Logarithms
The logarithm of a particular exponent can be thought of as “undoing” the exponential.
Example 2.11
We have
• log10 (100) = 2 since 102 = 100.
• log2 (128) = 7 since 27 = 128
• loge (e) = 1 since e1 = e
Most calculators have two different logarithm functions: logarithm to base 10 and logarithm to
base e.
• If the base of the logarithm is a = 10, then we can omit the base and write log10 (b) = log(b).
On a calculator, logarithms to base 10 are usually denoted by log. The reader should use a
calculator to check that
• If the base of the logarithm is a = e, then we write loge (b) = ln(b). This is known as the
natural log. The reader should use the ln button on a calculator to check that
18
F17XA 2 Logarithms and Exponentials
y
2
ln(x)
1
log(x)
x
1 2 3 4 5 6 7 8
-1
-2
For any a > 0, we have ax > 0 for all x. That means ln A (and log A ) only make sense if A > 0.
Hence, an expression such as ln(3x − 15) will only make sense if 3x − 15 > 0. That is if x > 5.
Further, since we have a0 = 1 for any a > 0, we must have loga 1 = 0.
What does the graph of a logarithm look like? We can see that both ln(x) and log(x) look very
similar, going through x = 1 since 100 = e0 = 1.
Below we give the laws of logarithms and demonstrate how to apply them.
Theorem 2.12 Laws of logarithms
For a base a > 0 and positive numbers B and C
Further, we have:
19
F17XA 2 Logarithms and Exponentials
Exercise 2.13
Reduce the following to a single logarithmic term:
1. log(x) + log(x − 3)
2. 4 log(2) + 2 log(3) − log(12)
3. ln(x2 y) − ln(y) + 3 ln(x)
Solution:
1.
2.
4 log(2) + 2 log(3) − log(12) = log 24 + log 32 − log(12)
= log(16) + log(9) − log(12)
16 × 9
= log
12
= log (12)
3.
ln x2 y − ln(y) + 3 ln(x) = ln x2 y − ln(y) + ln x3
!
x2 y × x3
= ln
y
= ln x5
Exercise 2.14
Find the value of x which satisfies (1.76)x = 2002.
20
F17XA 2 Logarithms and Exponentials
ln ((1.76)x ) = ln(2002)
x ln (1.76) = ln(2002)
ln(2002)
x=
ln (1.76)
7.602
x= = 13.45
0.5653
Example 2.15
The last example shows the the result we obtain is independent of the basis chosen. In general,
if we want to solve y = ax , we can choose any c > 0 to get
y = ax
logc (y) = logc (ax )
logc (y) = x logc (a)
logc (y)
x=
logc (a)
logc (y)
loga (y) =
logc (a)
Exercise 2.16
Find the x value that satisfies 100e−2x = 26.2.
100e−2x = 26.2
e−2x = 0.262
−2x = ln(0.262)
−2x = −1.34
x = 0.67
21
F17XA 2 Logarithms and Exponentials
Exercise 2.17
Given the equation y = 7 − 4e−2t . Express t in terms of y
y = 7 − 4e−2t
y − 7 = −4e−2t
y−7
= e−2t
−4
7−y
= e−2t
4
7−y
ln = −2t
4
1 7−y
− ln =t
2 4
Hence
1 7−y
t = − ln
2 4
22
F17XA 2 Logarithms and Exponentials
y y
30 30
20 sinh(x) 20 cosh(x)
10 10
x x
-4 -3 -2 -1 1 2 3 4 -4 -3 -2 -1 1 2 3 4
-10 -10
-20 -20
-30 -30
tanh(x)
x
-4 -3 -2 -1 1 2 3 4
-1
23
F17XA 2 Logarithms and Exponentials
y
700
x
y = 757 − 127.7 cosh 127.7
600
500
400
300
200
100
x
-400 -300 -200 -100 100 200 300 400
Similar to trigonometric functions, the hyperbolic functions satisfy various identities. We can
derive these by considering their exponential definition.
Exercise 2.19
Express sinh(8x) in terms of e8x and e−8x .
Exercise 2.20
Express 5e4x − 7e−4x in terms of cosh(4x) and sinh(4x).
24
F17XA 2 Logarithms and Exponentials
Hence,
1 4x 1 4x
cosh(4x) + sinh(4x) = e + e−4x + e − e−4x = e4x
2 2
1 4x 1 4x
cosh(4x) − sinh(4x) = e + e−4x − e − e−4x = e−4x
2 2
Then
Exercise 2.21
Show that cosh2 (x) − sinh2 (x) = 1
Solution: Since
1 x
cosh(x) = e + e−x
2
1 x
sinh(x) = e − e−x
2
We can multiply these out, (remembering that (ex )2 = e2x ):
1 x 2 1 2x
cosh2 (x) = e + e−x = e + 2 + e−2x
4 4
1 x −x 2
1 2x
2
sinh (x) = e −e = e − 2 + e−2x
4 4
So subtracting the two just gives one, as required.
Exercise 2.22
Show that 1 + 2 sinh2 (x) = cosh(2x)
25
F17XA 2 Logarithms and Exponentials
So
1 2x
1 + 2 sinh2 (x) = 1 + e − 2 + e−2x
2
1 2x
= e + e−2x
2
= cosh(2x)
The last line is the definition of cosh(2x), which is what we have to show.
Exercise 2.23
The formula for the velocity of waves in shallow water channel is given by:
!
6.3d
V 2 = 1.8L tanh
L
where d is the depth of the water and L is the wavelength of the wave. Find V if d = 0.5m
and L = 3.0m.
26
F17XA 3 Application of Linear and Log Functions
10
4 y = 2x + 1
2
x
1 2 3 4 5
A line with slope m passing through a point (a, b) satisfies the equation
y − b = m(x − a)
One advantage of this approach is that it avoids the need to solve for the intercept c.
Example 3.2
In the above example, the line has slope 2 passing through the point (1, 3). Using the point
slope formula, we get:
y − b = m(x − a)
y − 3 = 2(x − 1)
y − 3 = 2x − 2
y = 2x + 1
3
It is easier to tell whether data forms a straight line or not. For example see Exercise 3.5
27
F17XA 3 Application of Linear and Log Functions
Given two points on the line, we can obtain the slope m by rearranging the point-slope formula:
y−b ∆y
slope = m = =
x−a ∆x
The notation ∆y = y − b (and ∆x = x − a) is sometimes used to denote the change in y
coordinate (change in x coordinate respectively). The ratio of the two tells us that if x changes
by ∆x, then y changes by ∆y. In this case, moving along 3 in x results in the line going up 6 in
y. The slope is 63 = 2. If the line slopes downwards, it has a negative slope.
Example 3.3
If we know the points (1, 3) and (4, 9) are on the line, then we have
9−3 6
slope = m = = = 2.
4−1 3
We can then use one of the points, say (1, 3), to recover the equation of the line.
y − b = m(x − a)
y − 3 = 2(x − 1)
y − 3 = 2x − 2
y = 2x + 1
For completeness, it is possible to use any point that we know is on the line. For example, if
we used (4, 9),
y − b = m(x − a)
y − 9 = 2(x − 4)
y − 9 = 2x − 8
y = 2x + 1
R = mT + c (3.1)
we could test the validity of Equation 3.1 by attempting to find m and c using experimental data.
28
F17XA 3 Application of Linear and Log Functions
Exercise 3.4
The following data was collected as a result of an experiment.
T (◦ C) 10 30 50 70 90
R(Ω) 4.9 5.2 5.8 6.2 6.8
Determine a linear relationship between temperature and resistance.
4 R = mT + c
T
20 40 60 80 100
R − b = m(T − a)
R − 5.25 = 0.025(T − 30)
R = 0.025T + 4.5
In cases where the variables are not linearly related, we may have to manipulate the data by
plotting some other combination of the variables.
29
F17XA 3 Application of Linear and Log Functions
Exercise 3.5
The following data is collected in an experiment
t 0 1 2 3 4
y 3.1 4.9 11.2 20.8 34.9
Validate if the variables t and y can be related by
y = mt2 + c (3.2)
Solution:
Based on this data, it would be difficult to see from a plot of this data if Equation 3.2
holds. Further, it is not clear how to determine the values m and c based on these points.
y
40
30
20
10
t
1 2 3 4
t 0 1 2 3 4
x = t2 0 1 4 9 16
y 3.1 4.9 11.2 20.8 34.9
y = mx + c (3.3)
30
F17XA 3 Application of Linear and Log Functions
y
40
30
20
y = 2x + 3
10
x
2 4 6 8 10 12 14 16
y − b = m(x − a)
y − 3 = 2(x − 0)
y = 2x + 3
The process of manipulating variables so that they are related by a straight line is sometimes
called linearisation.
Exercise 3.6
For the following cases, reduce the proposed law connecting t and y to straight line form.
1. y = mt4 + c
2. y = mt2 + ct
3. y = mt + tc2
Solution:
1. Let x = t4 . This gives y = mx + c. So plotting y against t4 gives a straight line.
y
2. For t 6= 0, let w = yt . Then w = mt + c. So plotting w = against t gives a
t
straight line.
3. Rearranging y = mt + tc2 gives t2 y = mt3 + c. So substituting w = t2 y and x = t3
we get w = mx + c. So plotting w = t2 y against t3 gives a straight line.
31
F17XA 3 Application of Linear and Log Functions
y = atb
ln(y) = ln(atb )
ln(y) = ln(a) + ln(tb )
ln(y) = ln(a) + b ln(t)
Let w = ln(y) and x = ln(t). Then w = ln(a) + bx would be a straight line. So we should
plot ln(y) against ln(t). The resulting graph is known as a log-log graph since we are plotting
logarithms on both horizontal and vertical axes.
Example 3.7
Suppose that the variables t and y are related by y = atb . We can linearise by setting w = ln(y)
and x = ln(t) to obtain the straight line w = ln(1.7) + (−2.3)x from the graph. Back
substituting gives a = 1.6 and b = −2.3.
y
w = ln(y) 15
2
ln(1.7) x = ln(t)
y = 1.7t−2.3
1 2 3 4
-2 10
-4
-6 w = ln(1.7) − 2.3x 5
-8
-10 t
1 2 3 4
Figure 3.5: The log-log graph.
Figure 3.6: Example of power law.
The second case we will deal with is the exponential law. Suppose two variables t and y are
related via the relation y = Abt . We can linearise the relation by taking logarithms on both sides
4
Examples include Kepler’s third law, inverse square laws for gravity and electric forces, or even the number of
friends a person has.
5
Examples include radioactive decay or simple population growth.
32
F17XA 3 Application of Linear and Log Functions
Example 3.8
Suppose that the variables t and y are related by y = Abt . We can linearise by setting w = ln(y)
to obtain the straight line w = ln(3) + ln(0.5)x from the graph. Back substituting gives A = 3
and b = 0.5.
y
w = ln(y) 14
2
12 y = 3 · (0.5)x
10
ln(3)
1 w = ln(3) + ln(0.5)x
8
6
t
4
-1 1 2 3
2
t
-1 -2 -1 1 2 3
Figure 3.7: The log-linear graph. Figure 3.8: Example of exponential law.
Exercise 3.9
The velocity of sound in air varies with temperature. The values at different temperatures
are measured to be:
Air Temp o C 0 10 20 30
Velocity ms −1
331.3 337.3 343.2 349.0
1. Interpolate the velocity of sound in air at 16o C
2. Extrapolate the velocity of sound in air at −30o C
33
F17XA 3 Application of Linear and Log Functions
Solution:
Based on the following plot, we can see that a suitable linear relationship exists between
velocity and temperature. So we will use a linear fit to estimate values.
Velocity
350
340
330
320
310
Temp
-30 -20 -10 10 20 30
Let us finish this section with a example of using graphical methods to determine a relationship
between two variables.
Example 3.10
In the experiment, we consider how changing the length L of a simple pendulum affects the
period of oscillation T .
L 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 1.1 1.2 1.3 1.4 1.5
T 0.63 0.89 1.10 1.27 1.42 1.55 1.68 1.79 1.90 2.01 2.10 2.20 2.37 2.46 2.54
34
F17XA 3 Application of Linear and Log Functions
2.5
2.0
1.5
1.0
0.5
L
0.3 0.6 0.9 1.2 1.5
T
7
L
0.3 0.6 0.9 1.2 1.5
35
F17XA 3 Application of Linear and Log Functions
by: !
2 4π 2
T = L
g
where g = 9.81ms−2 is the acceleration due to gravity near the Earth’s surface.
a
Here, meaningful could range from using other contextual knowledge or theory about the problem, to
guessing the function based on the shape of the plot.
36
F17XA 4 Introduction to Differentiation
4 Introduction to Differentiation
4.1 Introduction
Recall we can obtain the gradient of the straight line joining points (a, b) and (x, y) by rearranging
the point-slope formula (Definition 3.1)
y−b
gradient = m =
x−a
For instance, the straight line connecting the points (1, 5) and (3, 11) has slope
11 − 5 6
gradient = m = = =3
3−1 2
37
F17XA 4 Introduction to Differentiation
y = f (x) y = f (x)
8 f (x) = x2 8 f (x) = x2
6 6
Q
4 4
2 2
P P
x x
1 2 3 1 2 3
-2 -2
• When x = 1.1:
f (1.1) − f (1) 1.12 − 12 1.21 − 1
slopeP Q = = = = 2.1
1.1 − 1 1.1 − 1 0.1
• When x = 1.01:
f (1.01) − f (1) 1.012 − 12 1.0201 − 1
slopeP Q = = = = 2.01
1.01 − 1 1.01 − 1 0.01
Based on this, it appears that the estimates get closer to 2 as Q gets close to P .
Point Q cannot be at x = 1 since this will result in the zero denominator! Instead, we rely on
a mathematical concept known as limits to help us avoid this issue.
38
F17XA 4 Introduction to Differentiation
f (a + h) − f (a)
f 0 (a) = lim
h→0 h
if it exists.
In general, we use the notation lim g(h) = L to mean the function g(h) is really close to L when
h→0
h is really close to 06 .
Exercise 4.2
Use the limit definition of the derivative to find the derivative of f (x) = x2 for:
1. x = 1
2. A general x = a
Solution:
1. For x = 1
f (1 + h) − f (1)
f 0 (1) = lim
h→0 h
(1 + h)2 − 12
= lim
h→0 h
(1 + 2h + h2 ) − 1
= lim
h→0 h
2h + h2
= lim
h→0 h
h(2 + h)
= lim
h→0 h
= lim 2 + h
h→0
=2
6
There are rigorous mathematical definitions on what really close means. But our intuitive notion is sufficient
for this subject.
39
F17XA 4 Introduction to Differentiation
2. For x = a
f (a + h) − f (a)
f 0 (a) = lim
h→0 h
(a + h)2 − a2
= lim
h→0 h
(a + 2ah + h2 ) − a2
= lim
h→0 h
2ah + h2
= lim
h→0 h
h(2a + h)
= lim
h→0 h
= lim 2a + h
h→0
= 2a
The last case shows that the process can be achieved for an arbitrary a. That is, the derivative
process takes the function f (x) = x2 and input a and returns 2a. This is the exact definition of
what a function is. Hence, we can extend the notion of a derivative from a point to the whole
function.
In this case, we write f 0 (x) (pronounced f dash x) to denote the derivative function. The process
of finding f 0 (x) from f (x) is called differentiation or finding the derivative.
Exercise 4.3
√
Use the limit definition of the derivative to find the derivative of f (x) = x.
40
F17XA 4 Introduction to Differentiation
As this example shows, we want to avoid differentiating from first principles every time. Instead,
we will develop a set of rules to make our task easier.
Before we proceed, we will introduce some notation to make our lives easier.
Definition 4.4
Given a function y = f (x), the process of differentiation (with respect to x) is denoted
d d
(f (x))0 or f (x) or y
dx dx
The resulting function is called the derivative and can be denoted
df dy
f 0 (x) or or
dx dx
Example 4.5
Given the function f (x) = x2 , we could write
d 2
(x ) = 2x
dx
Example 4.6
Other variables can be used in place of x. For example:
d√ 1 d 2
u= √ u = 2u
du 2 u du
41
F17XA 4 Introduction to Differentiation
4.2 Derivatives
4.2.1 Powers
The most common function we will encounter is the following7 .
Theorem 4.7 Power Rule
The derivative for the function f (x) = xn for some real number n is given by:
f 0 (x) = nxn−1
Example 4.8
As we’ve seen before:
1. If f (x) = x2 , then f 0 (x) = 2x2−1 = 2x.
√ 1 1 1
2. If f (x) = x = x 2 , then f 0 (x) = 12 x 2 −1 = 21 x− 2 = 1
√
2 x
.
And a few that are new:
1. If f (x) = x3 , then f 0 (x) = 3x3−1 = 3x2 .
3 3 1
√
2. If f (x) = x 2 , then f 0 (x) = 32 x 2 −1 = 32 x 2 = 3 x
2
.
3. If f (x) = xπ , then f 0 (x) = πxπ−1 .
Combined with the following rules for addition and subtraction of derivatives, we can differentiate
all polynomials.
Theorem 4.9 Linearity rules
Given differentiable functions f (x), g(x) and a constant c, we have
d
(f (x) + g(x)) = f 0 (x) + g 0 (x) (4.1)
dx
d
(f (x) − g(x)) = f 0 (x) − g 0 (x) (4.2)
dx
d
c · f (x) = c · f 0 (x) (4.3)
dx
In other words, derivatives work nicely with addition, subtraction, and constant multiplication.
Let’s see some examples.
Exercise 4.10
Find the derivative of f (x) = x3 + x.
Calculations such as Exercise 4.2 only prove the result for specific values of n. For a general value of n, more
7
42
F17XA 4 Introduction to Differentiation
Solution: We can apply the power rule with the addition rule
d 3 d 3 d
x +x = x + x Equation 4.1
dx dx dx
2
= 3x + 1 Theorem 4.7
Exercise 4.11
Differentiate the following functions:
1. f (x) = −2x5
2. g(x) = 4x(x − 7)
Solution:
1. We can apply a similar method:
d
f 0 (x) = −2x5
dx
d
= −2 x5 Equation 4.3
dx
= −2 · 5x4 Theorem 4.7
= −10x4
d
g 0 (x) = (4x(x − 7))
dx
d 2
= 4x − 28x Expand
dx
d 2 d
= 4x − 28x Equation 4.2
dx dx
d 2 d
= 4 x − 28 x Equation 4.3
dx dx
= 4 · (2x) − 28 · (1) Theorem 4.7
= 8x − 28
These solutions are written with all details included. Once you become familiar with the process,
you may decide to combine multiple steps.
Exercise 4.12
Find the slope of the function f (u) = u3 − 7u2 + 4u − 9 at the point u = 2.
43
F17XA 4 Introduction to Differentiation
Solution: Using the above rules, we can check that f 0 (u) = 3u2 − 14u + 4. The slope
of the function at u = 2 is f 0 (2) = 3(2)2 − 14(2) + 4 = −12.
Exercise 4.13
Find the equation of the tangent of the function f (x) = x4 − 4x2 + x about the point
x = 1.
Solution: We can find the derivative f 0 (x) = 4x3 − 8x + 1. Hence the slope of the
function at x = 1 is f 0 (1) = 4(1)3 − 8(1) + 1 = −3. We can then use the point-slope
formula with the point (1, f (1)) = (1, −2) to get the tangent line.
y − b = m(x − a)
y − f (1) = f 0 (1)(x − 1)
y − (−2) = (−3)(x − 1)
y = −3x + 1
sin(ax + b) a cos(ax + b)
cos(ax + b) −a sin(ax + b)
eax aeax
a
ln(ax + b)
ax + b
sinh(ax + b) a cosh(ax + b)
cosh(ax + b) a sinh(ax + b)
Note: All measurements for trigonometric functions must be measured in radians.
44
F17XA 4 Introduction to Differentiation
Exercise 4.15
Find the derivative of the following functions:
1. f (x) = 5 sin(x)
2. g(x) = 3 cos(2x − 1)
Solution:
1. For f (x): Using the table with a = 1, b = 0, and Equation 4.3 gives f 0 (x) =
5 cos(x).
2. For g(x): Using the table with a = 2 and b = −1,
Exercise 4.16
Given f (x) = 5 sin(4x) − 8 cos(4x). Evaluate f 0 π
2
.
Exercise 4.17
An alternating voltage v at time t (in seconds) is given by v = v(t) = 80 sin(10t) volts.
Find the rate of change of voltage when t = 0.2.
Exercise 4.18
Find f 0 (x) given f (x) = 2 ln(x) − x2 .
45
F17XA 4 Introduction to Differentiation
Exercise 4.19
The following circuit contains a resistor R, a capacitor C and a constant voltage source V .
The current I(t) which flows at time t is given by:
V −t
I(t) = exp
R RC
R
Show that
dI I
=−
dt RC
46
F17XA 4 Introduction to Differentiation
d
[f (x) · g(x)] = f (x) · g 0 (x) + f 0 (x) · g(x)
dx
This is sometimes written as y = u · v with u = f (x) and v = g(x) to give
y 0 = u · v 0 + u0 · v
Exercise 4.21
Find the derivative of h(x) = x2 e−4x .
Solution: Since this is a product of two functions, we can apply the product rule. Let
f (x) = x2 f 0 (x) = 2x
g(x) = e−4x g 0 (x) = −4e−4x
Exercise 4.22
Find the derivative of h(x) = x2 cos(x).
Solution: Let
f (x) = x2 f 0 (x) = 2x
g(x) = cos(x) g 0 (x) = − sin(x)
47
F17XA 4 Introduction to Differentiation
Exercise 4.23
Find the derivative of y = x3 ln(x).
Solution: Let
u = x3 u0 = 3x2
1
v = ln(x) v0 =
x
By the product rule,
y 0 (x) = u · v 0 + u0 · v
1
3
= (x ) · + (3x2 ) · (ln(x))
x
= x2 + 3x2 ln(x)
0 v · u0 − v 0 · u
y =
v2
Beware of the minus sign! Unlike the product rule, the order matters here.
48
F17XA 4 Introduction to Differentiation
Exercise 4.25
5x + 1
Find the derivative of h(x) = .
3x + 2
Solution: Since this is a quotient of two functions, we can apply the quotient rule. Let
f (x) = 5x + 1 f 0 (x) = 5
g(x) = 3x + 2 g 0 (x) = 3
Exercise 4.26
x
Find the derivative of y = .
x2 + 4
Solution: Let
u=x u0 = 1
v = x2 + 4 v 0 = 2x
49
F17XA 4 Introduction to Differentiation
Exercise 4.27
Find the derivative of y = tan(x).
u = sin(x) u0 = cos(x)
v = cos(x) v 0 = − sin(x)
v · u0 − v 0 · u
y0 =
v2
cos(x) · cos(x) − (− sin(x)) · cos(x)
=
(cos(x))2
cos(x)2 + sin(x)2
=
(cos(x))2
1
= Trig identity
cos2 (x)
Since sec(x) = 1
cos(x)
, this is sometimes written as sec2 (x).
Example 4.28
Suppose we have functions f (x) = x2 , g(x) = sin(x), and h(x) = ex . Some compositions of
these functions are:
Example 4.29
The following functions can be decomposed into simpler functions:
5
• The function y = (x2 − 3x + 8) can be decomposed as y = u5 with u = x2 − 3x + 8.
50
F17XA 4 Introduction to Differentiation
• The function f (x) = e−3x can be decomposed as f (u) = e−u with u(x) = −3x2 .
2
Exercise 4.31
Find the derivative of y = (x2 − 3x + 8)5 .
Since the function is originally given in terms of x, we have to always back-substitute to give
the final answer in terms of x.
As with the product and quotient rule, you will be able to use the chain rule without introducing
much notation once you become familiar with it.
Exercise 4.32
Find the derivative of f (x) = e−3x .
2
51
F17XA 4 Introduction to Differentiation
Solution: The function f (x) = e−3x can be decomposed with f (u) = eu and u(x) =
2
−3x2 . Hence,
f (u) = eu f 0 (u) = eu
u(x) = −3x2 u0 (x) = −6x
Exercise 4.33
Find the derivative of y = ecos(ln(x)) .
dy
y = eu = eu
du
du
u = cos(v) = − sin(v)
dv
dv 1
v = ln(x) =
dx x
By the chain rule (twice):
dy dy du
= · Chain rule
dx du dx !
dy du dv
= · · Chain rule again
du dv dx
dy du dv
= · · (4.4)
du dv dx
1
= eu · (− sin(v)) ·
x
1
= ecos(v) · (− sin(v)) · substitute u = cos(v)
x
1
= ecos(ln(x)) · (− sin(ln(x))) · substitute v = ln(x)
x
sin(ln(x))ecos(ln(x))
=−
x
52
F17XA 4 Introduction to Differentiation
Equation 4.4 shows what happens when a function is created by embedded compositions, and
also how the chain rule got its name.
d2 y
• (read as ‘dee two y by dee x squared’) is given by:
dx2
! !
d2 y d dy d d
2
= = y
dx dx dx dx dx
Exercise 4.34
Find the second derivative of y = x4 − 5x.
Exercise 4.35
Find y 00 if y = 6 ln(t) − 4t.
8
Some texts will refer to this as the concavity
53
F17XA 4 Introduction to Differentiation
Exercise 4.36
Find the second derivative of the following functions:
1. f (x) = 9 sin(3x)
2. f (x) = 3e−5x
Solution:
1. We have:
f (x) = 9 sin(3x)
f 0 (x) = 27 cos(3x)
f 00 (x) = −81 sin(3x)
2. We have:
f (x) = 3e−5x
f 0 (x) = −15e−5x
f 00 (x) = 75e−5x
Exercise 4.37
d2 y
Given y = 5 cos(3x), find a constant A such that y satisfies the equation + Ay = 0.
dx2
54
F17XA 4 Introduction to Differentiation
y = 5 cos(3x)
dy
= −15 sin(3x)
dx
d2 y
= −45 cos(3x)
dx2
Then we have:
d2 y
0 = 2 + Ay
dx
0 = −45 cos(3x) + A · (5 cos(3x))
0 = (5A − 45) cos(3x)
Finding higher order derivatives can involve the rules we’ve covered previously.
Exercise 4.38
Find the second derivative of f (x) = xe−3x .
Solution: Since this is a product of functions, we should use the product rule:
u=x u0 = 1
v = e−3x v 0 = −3e−3x
u = (1 − 3x) u0 = −3
v = e−3x v 0 = −3e−3x
55
F17XA 4 Introduction to Differentiation
Simplifying after derivatives will sometimes make taking higher order derivatives easier. Alterna-
tively, we can use the linearity rules after the first derivative apply the product rule twice.
Example 4.39
Suppose y is a function of x and evaluate
d
y(x)2
dx
dw dw dy dy
= × = 2y
dx dy dx dx
Similarly, we have
d d dy dy
y(x)2 = y 2 × = 2y
dx dy dx dx
The key point is remembering that to differentiate a function of y with respect to x, we first
differentiate with respect to y then multiply by dx
dy
. We write this mathematically as
d d dy
(f (y)) = (f (y)) ×
dx dy dx
Exercise 4.40
dy
Consider the equation y 3 + 2xy − 5x2 = 0 and find
dx
Solution: In theory, we could try to solve for y before differentiating. But in this case,
56
F17XA 4 Introduction to Differentiation
we will use implicit differentiation and differentiate both sides with respect to x.
d 3 d
y + 2xy − 5x2 = 0
dx dx
d 3 d d d
y + 2xy − 5x2 = 0 Linearity
dx dx dx dx
Differentiating term by term
• By the chain rule dxd 3 dy
y = 3y 2 dx
• By the chain and product rule dx d dy
2xy = 2y + 2x dx
• We also have dx d
5x2 = 10x and dx d
0=0
Combining, we get:
!
dy dy
3y 2 + 2y + 2x − 10x = 0
dx dx
dy
Solving for gives:
dx
dy 10x − 2y
= 2
dx 3y + 2x
Exercise 4.41
dy
Consider the equation ln(y) − sin(y) + x2 y = 0 and find
dx
Solving for dy
dx
gives:
dy −2xy
= 1
dx y
− cos(y) + x2
57
F17XA 4 Introduction to Differentiation
Exercise 4.42
e x x2 dy
Consider the equation y = √ 2 and find
x +1 dx
This is given as an explicit equation. To differentiate this, one would require a product rule,
quotient rule, and a chain rule. However, we can simplify the computation using implicit differ-
entiation.
Solution: Recall that for a differentiable function f (x), we have the following via the
chain rule:
d f 0 (x)
ln(f (x)) =
dx f (x)
So we can manipulate the original equation to get:
e x x2
y=√ 2
x +1
e x x2
!
ln(y) = ln √ 2
x +1
√
ln(y) = ln ex x2 − ln x2 + 1
1
ln(y) = ln (ex ) + ln x2 − ln (x2 + 1) 2
1
ln(y) = x ln (e) + 2 ln (x) − ln x2 + 1
2
1 2
ln(y) = x + 2 ln (x) − ln x + 1
2
We can now differentiate term by term to get:
d d 1
ln(y) = x + 2 ln (x) − ln x2 + 1
dx dx 2
1 dy 2 x
=1+ − 2
y dx x x +1
Solving for dy
dx
gives:
dy 2 x
=y 1+ − 2
dx x x +1
x 2
e x 2 x
=√ 2 1+ − 2
x +1 x x +1
By manipulating the equation first, we could make our lives slightly less difficult.
58
F17XA 4 Introduction to Differentiation
To write the equation back in terms of x, we make use of the trigonometric identity
So we get
q
cos(y) = 1 − sin2 (y)
√
= 1 − x2
d 1
Hence we have arcsin(x) = √
dx 1 − x2
Suppose that y(t) satisfies the equation y 00 (t) + ky(t) = 0 for a constant k. Define g(t) to
be
2
g(t) = (y 0 (t)) + k (y(t))2
and find g 0 (t).
59
F17XA 4 Introduction to Differentiation
From the original assumption of y(t), the bracketed term is exactly zero. Hence we have
g 0 (t) = 0.
In the above example, g(t) is related to the energy of the system modelled by the differential
equation y 00 (t) + ky(t) = 0. Since g 0 (t) = 0, we have that g(t) is constant which means that
energy is conserved.
Example 4.45
The equation of a circle of radius r is given by
x2 + y 2 = r 2
It is important to realise that there could multiple parametrisations of the same equation.
Suppose that a bicycle is moving in a straight line at constant speed. If we consider the path
60
F17XA 4 Introduction to Differentiation
traced out by a specifiic point on a wheel. After t seconds, the position of the point is
dy dy
In this section we want to find for a curve defined by parametric equations. Recall that
dx dx
is the slope of the curve at a given point. In the example shown in Figure 4.4, at any time t, the
dy
gradient is the slope at curve at the point (x(t), y(t)).
dx
Theorem 4.46 Parametric differentiation
Suppose that a curve is given by the parametric equations (x(t), y(t)). The derivative is
given by:
dy dy 1 y 0 (t)
= × dx = 0
dx dt dt
x (t)
Exercise 4.47
dy
Consider the parametric curve defined by x(t) = t2 + 3, y(t) = 4t3 . Find .
dx
Solution: Since dx
dt
= 2t and dy
dt
= 12t2 , we have:
dy dy 1
= × dx
dx dt dt
12t2
=
2t
= 6t
Exercise 4.48
dy
Find for the cycloid defined by x(t) = t − sin(t) and y(t) = 1 − cos(t).
dx
61
F17XA 4 Introduction to Differentiation
dy y 0 (t)
= 0
dx x (t)
sin(t)
=
1 − cos(t)
The second derivative of a parametric curve defined by the equations (x(t), y(t)) is given by
!
d2 y d dy 1
2
= × dx
dx dt dx dt
Exercise 4.49
d2 y
Find 2 for the parametric equation defined by x(t) = 2t2 and y(t) = t4 − 8t3 .
dx
Solution: Since dx
dt
= 4t and dy
dt
= 4t3 − 24t2 ,
dy dy 1
= × dx
dx dt dt
4t3 − 24t2
=
4t
2
= t − 6t
Hence,
!
d2 y d dy 1
2
= × dx
dx dt dx dt
1
= (2t − 6) ×
4t
t−3
=
2t
62
F17XA 4 Introduction to Differentiation
Example 4.51
Consider the following function:
a y
1
b x
-1 1
-1
c
On the shown domain, the point a is both a local maximum and a global maximum. The point
c is both a local minimum and global minimum. However, if the function was extended beyond
the given domain, then the points may no longer be global maximum and minimum.
Example 4.53
The function in Example 4.51 has three stationary points in the given domain located at x = a,
x = b, and x = c.
The definition of stationary points coincides with the location of local extrema. If a point (c, f (c))
is a local extremum, then it must also be a stationary point.
The order of the previous statement is critical. Whilst local extrema happens at stationary
points, not all stationary points are local extrema.
63
F17XA 4 Introduction to Differentiation
The problem of classifying stationary points is to determine which category each stationary point
is in. To answer this question, we will need to define when a function is increasing or decreasing.
In other words, a function is increasing if the output increases as the input increases. Conversely, a
function is decreasing if the output decreases as the input decreases. The following diagram shows
an example of a function that is increasing, decreasing, and neither increasing nor decreasing over
the intervals 0 < x < 2.
y y y
8 8 8
6 6 6
4 4 4
2 2 2
x x x
1 2 1 2 1 2
Increasing function Decreasing function Neither
Example 4.55
Looking at Example 4.51. We see that the intervals of increase and decrease are exactly
partitioned by the stationary points.
• The function is increasing when x < a and x > c
• The function is decreasing when a < x < b and b < x < c.
Since the stationary points can be found by using derivatives, we expect a similar connection
between derivatives and intervals of increase or decrease
Theorem 4.56
Consider a differentiable function f on the interval a < x < b.
• If f 0 (x) > 0 on the interval, then f (x) is an increasing function on that interval.
• If f 0 (x) < 0 on the interval, then f (x) is a decreasing function on that interval.
The definition of increasing/decreasing functions does not depend on the existence of the
derivative. But this theorem tells us that knowing the derivative will make our lives easier.
64
F17XA 4 Introduction to Differentiation
Example 4.57
• The function f (x) = ex has derivative f 0 (x) = ex > 0 for all x. Hence f (x) is increasing
over the entire number line.
• The function f (x) = −x3 − x has derivative f 0 (x) = −3x2 − 1 < 0 for all x. Hence
f (x) is decreasing over the entire number line.
• The function f (x) = −x2 has derivative f 0 (x) = −2x.
– The derivative is positive for x < 0 and hence f (x) is increasing for x < 0
– The derivative is negative for x > 0 and hence f (x) is decreasing for x > 0
Exercise 4.58
Consider the function f (x) = 5 − x2 . Determine the intervals where f (x) is increasing or
decreasing.
Solution: First find the derivative: f 0 (x) = −2x. There is a stationary point is at x = 0.
So we look at the derivative on both sides.
• When x < 0, f 0 (x) > 0. So f is increasing when x < 0
• When x > 0, f 0 (x) < 0. So f is decreasing when x > 0
Exercise 4.59
Consider the function f (x) = 2x3 − 3x2 . Determine the intervals where f (x) is increasing
or decreasing.
Solution: First find the derivative: f 0 (x) = 6x2 − 6x = 6x(x − 1). There are stationary
points at x = 0 and x = 1. So we look at the derivative after partitioning the domain
• When x < 0, f 0 (x) > 0. So f is increasing when x < 0
• When 0 < x < 1, f 0 (x) < 0. So f is decreasing when 0 < x < 1
• When 1 < x, f 0 (x) > 0. So f is increasing when 1 < x
Hence, f is decreasing when 0 < x < 1 and f is increasing when x < 0 or 1 < x.
Exercise 4.60
A resistor of 3 ohms connected in parallel with a variable resistor of x ohms has a combined
resistance R(x) given by
3x
R(x) =
x+3
Show that R(x) is an increasing function for x > 0.
65
F17XA 4 Introduction to Differentiation
Solution: One way to show that R is increasing is the show that the derivative R0 (x) > 0
for x > 0. Using the quotient rule
3(x + 3) − 3x 9
R0 (x) = 2
=
(x + 3) (x + 3)2
Since numerator and denominator are both positive for x > 0, we have R0 (x) > 0 and
hence R is an increasing function.
We can validate this with a plot of R(x)
R
1.5
3x
1 R(x) = x+3
0.5
x
1 2 3
Exercise 4.61
Consider the function f (x) = x + 1
x
on the domain x > 0. Determine the intervals where
f (x) is increasing or decreasing.
x2 − 1
Solution: First find the derivative: f 0 (x) = 1 − x12 = . There is a stationary point
x2
at x = 1. So we look at the derivative after partitioning the domain
• When 0 < x < 1, f 0 (x) < 0. So f is decreasing when 0 < x < 1
• When 1 < x, f 0 (x) > 0. So f is increasing when 1 < x
Hence, f is decreasing when 0 < x < 1 and f is increasing when 1 < x. We can plot the
function to validate our answer
y
6
x
1 2 3
66
F17XA 4 Introduction to Differentiation
This result tells us we should first find the stationary points and then use the sign of the derivative
on either side of the stationary point to determine the nature of the stationary point.
Exercise 4.63
Find and classify the stationary point of f (x) = x2 − 6x + 3.
Exercise 4.64
Find and classify the stationary point of f (x) = x + 4
x
on the domain x > 0
x2 −4
Solution: To find stationary points, we solve for f 0 (x) = 0. Since f 0 (x) = 1− x42 = x2
,
we have x = 2 as the only stationary point.
To classify the stationary point, we consider the derivative on either side:
• When x < 2, f 0 (x) < 0. So f is decreasing when x < 0
• When x > 2, f 0 (x) > 0. So f is increasing when x > 0
By the first derivative test, we have a local minimum at x = 2.
Exercise 4.65
Find and classify the stationary point of f (x) = 2x3 − 6x. Determine the intervals where
f is increasing or decreasing.
Solution: To find stationary points, we solve for f 0 (x) = 0. Since f 0 (x) = 6x2 − 6 =
6(x2 − 1), we have x = −1, 1 as stationary points.
To classify the stationary point, we partition the domain by the stationary points and
determine the sign of the derivative in each interval.
67
F17XA 4 Introduction to Differentiation
If you are familiar with the effect of the second derivative on the shape of the function, The
following test is an alternative method for classifying stationary points.
Theorem 4.66 Second derivative test
Let f be a twice differentiable function and x = c be a stationary point of f .
• If f 00 (c) > 0 then x = c is a local minimum.
• If f 00 (c) < 0 then x = c is a local maximum.
• If f 00 (c) = 0 then this test is inconclusive.
Example 4.67
Consider the function f (x) = e3x − 3ex . We have
f 0 (x) = 3e3x − 3ex = 3ex (ex )2 − 1
f 00 (x) = 9e3x − 3ex
Exercise 4.68
Sketch f (x) = x3 − x2 − x + 1
Solution: We will try to extract all the information we can from f and then f 0 .
• Since f (x) = (x + 1)(x − 1)2 , we have:
– y-intercept at (0, 1)
– Roots at x = −1, 1. Partition the domain by roots.
– f (x) > 0 when −1 < x < 1 and 1 < x
– f (x) < 0 when x < −1
• Looking at the first derivative f 0 (x) = 3x2 − 2x − 1 = (3x + 1)(x − 1)
– Partitioning the domain by stationary points at x = − 31 and x = 1.
– f 0 (x) > 0 when x < −1, so f in increasing on the interval
68
F17XA 4 Introduction to Differentiation
x
-2 -1 1 2
Example 4.69
Suppose two quantities x, y are related via a function y(x). If we are given how fast x changes
dx
with time t, ie . We can determine dydt
using the chain rule:
dt
dy dy dx
= ×
dt dx dt
Exercise 4.70
dx dy
Let y = x2 . If = 5, find when x = 3.
dt dt
Exercise 4.71
The length of the sides of a square are changing. The area of the square is increasing at a
rate 16cm2 per hour. At what rate is the length of each side increasing, when the length
is 4cm.
69
F17XA 4 Introduction to Differentiation
Solution: Define x to be the length of a side of the square and y to be the area. Both
x and y are changing as time t changes. The area of the square is given by y = x2 .
We are given that dy
dt
= 16 and want to find dx
dt
when x = 4.
Using dx
dy
= 2x and the chain rule
dy dy dx
= ×
dt dx dt
dx
16 = 2x ×
dt
dx
16 = 8 when x = 4
dt
Hence dx
dt
= 2. That is, the length is increasing at 2cm per hour when x = 4cm.
Exercise 4.72
Water is dropping into a circular puddle at a uniform rate of 1cm3 per second . The depth
of the puddle remains constant at 0.1cm . When the radius of the puddle is 5cm, find
1. the rate of increase of the radius,
2. the rate of increase of the area,
3. the rate of increase of the circumference.
Solution: Define r be the radius of the puddle, V the volume, A the area, and C the
circumference. We have the follow connect between the variables:
πr2
V = A = πr2 C = 2πr
10
1. We are given that dV
dt
= 1 . We have to find dr
dt
when r = 5.
Using the chain rule
dV dV dr
= ×
dt dr dt
2πr dr
1= ×
10 dt
10 dr
=
2πr dt
10 dr
= when r = 5
2π(5) dt
Hence we have dr
dt
= π1 when r = 5.
2. To find dt we use A = πr2 and the chain rule to obtain
dA
70
F17XA 4 Introduction to Differentiation
dA dA dr
= ×
dt dr dt
dA dr
= 2πr ×
dt dt
dA 1
= 2πr ×
dt π
dA 1
= 2π(5) × when r = 5
dt π
dA
= 10
dt
So dA
dt
= 10 when r = 5.
3. To find dC
dt
, we use C = 2πr and the chain rule to obtain
dC dC dr
= ×
dt dr dt
dC dr
= 2π ×
dt dt
dC 1
= 2π ×
dt π
dC
=2
dt
So dC
dt
= 2 when r = 5.
71
F17XA 5 Introduction to Integration
5 Introduction to Integration
5.1 Antidifferentiation (Indefinite Integration)
Now that we have differentiation, it is natural to ask whether we can reverse the operation.
Starting with a function f (x), an antiderivative F (x) is a function such that dx
d
F (x) = f (x).
Example 5.1
• Since the derivative d 2
dx
x = 2x, the function x2 is an antiderivative of the function 2x.
• Since the derivative d 3
dx
x = 3x2 , the function x3 is an antiderivative of the function 3x2 .
• Since the derivative d
dx
sin(x) = cos(x), the function sin(x) is an antiderivative of the
function cos(x).
• Since the derivative d x
dx
e = ex , the function ex is an antiderivative of the function ex .
Example 5.2
Since the derivative of the functions x2 , x2 + 1, x2 − 3, x2 + eπ are all 2x, they are all antideriva-
tives of function 2x.
The above example shows that any function of the form “x2 +some constant” are all antiderivatives
of 2x. In other words, the antiderivative of 2x is the entire class of functions of the form x2 + C
for any constant C.
To further Rcomplicate issues, there are functions for which there are no easy antiderivatives.
For example, e−x dx has no easy antiderivative.
2
For reasons that will become apparent in the next section, the term antiderivative is also commonly
known as the indefinite integral.
Example 5.4
• Since the derivative d 2
dx
x = 2x, we have
Z
2xdx = x2 + C
72
F17XA 5 Introduction to Integration
Because additional and constant multiplication works nicely with derivatives, we expect these
operations to work nicely with antiderivatives too.
Theorem 5.5 Linearity Rules
Given two functions f (x) and g(x) for which antiderivatives exists and a constant k, we
have
Z Z Z
f (x) + g(x)dx = f (x)dx + g(x)dx
Z Z Z
f (x) − g(x)dx = f (x)dx − g(x)dx
Z Z
kf (x)dx = k f (x)dx
Example 5.6
Since we know
d n+1
x = (n + 1)xn
dx
we can apply the above result to get
Z
xn+1
xn dx =
n+1
73
F17XA 5 Introduction to Integration
f (x) f 0 (x)
xn nxn−1
sin(ax + b) a cos(ax + b)
cos(ax + b) −a sin(ax + b)
eax aeax
a
ln(ax + b)
ax + b
we can read the table backwards and account for constants to get the following.
Theorem 5.7 Standard Antiderivatives
We have the following standard antiderivatives
Z
f (x) f (x)dx
(ax + b)n+1
(ax + b)n + C for n 6= −1
a(n + 1)
1
sin(ax + b) − cos(ax + b) + C
a
1
cos(ax + b) sin(ax + b) + C
a
1 ax
eax e +C
a
1 1
ln(ax + b) + C for ax + b > 0
ax + b a
Exercise 5.8
Find the antiderivative of
1
+ 2x
x
with respect to x for x > 0.
74
F17XA 5 Introduction to Integration
Example 5.9
Some further examples of antidifferentiation along these lines are:
Z
8x2
(3 + 8x) dx = 3x + +C
2
= 3x + 4x2 + C
Z
18
Z
2
5 − x + 4 dx = 5 − x2 + 18x−4 dx
x
x3 18x−3
= 5x − − +C
3 3
x3
= 5x − − 6x−3 + C
3
Z sin(7x) 2e−5x
cos(7x) + 2e−5x dx = − +C
7 5
Z
6
− 4 sin 2t dt = 2 ln(3t − 5) + 2 cos 2t + C
3t − 5
One application of antidifferentiation in physical context is to calculate the amount of work done
by a variable force. For a constant force, we have W ork = F orce × Distance. But to compute
the work done over variable force, we need something more complicated.
75
F17XA 5 Introduction to Integration
Exercise 5.10
Two electric charges with charge q1 and q2 respectively are separated by a distance r in a
vacuum and f (r) is the force exerted. Coulomb’s inverse square law states
q1 q2
f (r) =
4π0 r2
where 0 is a constant. Find the work done w = f (r)dr.
R
Solution:
Z Z
q1 q2
w= f (r)dr = dr
4π0 r2
q1 q2 Z 1
= dr
4π0 r2
−q1 q2 1
= +C
4π0 r
Exercise 5.11
Find the area under the function f (x) = 2x between x = 0 and x = 5.
x
(0, 0)
Since the shape of the area is a triangle, we can obtain the area using the geometric
formula. We know f (5) = 10. Hence the area under this function is A = 5×10
2
= 25.
For the purposes of this subject, we will work with the following definition of integration
76
F17XA 5 Introduction to Integration
The area under the graph of the function f (x) between a ≤ x ≤ b is given the by the
definite integral Z b
Area = f (x)dx
a
The right-hand side of the equation is read as “The definite integral between a and b of f (x) dx”.
Before we proceed, we note a few things about the definite integral:
• a is the lower limit of integration, or lower terminal,
• b is the upper limit of integration, or upper terminal,
• The function f (x) is the integrand.
If the curve is below the x-axis, then there is negative area under the curve.
Example 5.13
We will illustrate the rigourous definition of integration in this example. In cases where the
function is not a simple geometric shape, then we approximate the area using rectangles with
one corner touching the function. The more rectangles we use, the better the approximation.
This approximation becomes exact as the number of rectangles go to infinity.
y
The process of finding area under the curve is a technically challenging problem. However, the
following result gives us a more convenient approach to finding areas.
Theorem 5.14 Fundamental Theorem of Calculus
If f is a continuous function on the interval [a, b], then
Z b
f (x)dx = F (b) − F (a) = [F (x)]ba
a
Since F can be any antiderivative, we can pick the one with C = 0 so that it effectively
disappears from our computations.
77
F17XA 5 Introduction to Integration
This result connects two fundamental ideas in calculus. Finding areas and undoing the slope are
seemingly different ideas. Yet this result tells us finding area is equivalent to undoing the slope
at two different points.
As a result of this connection, the antiderivative also known as the indefinite integral. In
particular, the results we have for antiderivatives apply for indefinite integrals.
For example, Theorem 5.7 contains the indefinite integrals for a range of common functions.
Exercise 5.15
Evaluate the definite integral Z 6
x2 dx
3
Since integration is the process of finding areas, we have the following results to help partition
complicated functions.
Theorem 5.16
Let a < b < c be real numbers, then we have the following
Z c Z b Z c
f (x)dx = f (x)dx + f (x)dx
Z aa a b
f (x)dx = 0
a
Z a Z b
f (x)dx = − f (x)dx
b a
78
F17XA 5 Introduction to Integration
Exercise 5.17
Consider the function given by
ex −2 ≤ x < 0
f (x) = 3
x + 1 0 ≤ x ≤ 2
and evaluate Z 2
f (x)dx
−2
Solution: Using the previous result, we can partition the interval at x = 0 to get
Z 2 Z 0 Z 2
f (x)dx = f (x)dx + f (x)dx
−2 −2 0
Z 0 Z 2
= ex dx + x3 + 1dx
−2 0
" #2
x4
= [ex ]0−2 + +x
4 0
" ! !#
4
h i 2 04
= e0 − e−2 + +2 − +0
4 4
= 1 − e−2 + 6
= 7 − e−2
Exercise 5.18
Let F (x) = e−3x . Find F 0 (x) and use the result to evaluate
2
Z 1
2
xe−3x dx
0
79
F17XA 5 Introduction to Integration
Z 1
−3x2 1Z 1 2
xe dx = − −6xe−3x dx
0 6 0
1Z 1 2
=− −6xe−3x dx
6 0
1 h −3x2 i1
= − −e
6 0
1 h −3 i1
= − −e + e0
6 0
−3
1 e
= −
6 6
In cases where one (or both) of the terminals are infinite, we define
Z ∞ Z b
f (x)dx = lim f (x)dx
a b→∞ a
and Z b Z b
f (x)dx = lim f (x)dx
−∞ a→−∞ a
Exercise 5.20
Evaluate Z ∞
1
dx
1 x2
80
F17XA 5 Introduction to Integration
1
As b grows, 1
b
gets smaller and smaller, so we have lim = 0. Thus b tends to infinity
b→∞ b
and 1 − 1
b
tends to 1. Thus
Z ∞
1 Z b
1
2
dx = lim dx
1 x b→∞ 1 x2
1
= lim 1 − = 1.
b→∞ b
An infinite limit may seem counter-intuitive when computing physical areas. However, it can be
extremely useful when applied to mechanics.
Example 5.21
The force of attraction F due to gravity between a space shuttle and Earth is given by
GM m
F =
r2
where G is the gravitation constant, M is the mass of the earth, m is the mass of the shuttle,
and r is the distance between the two objects.
The work (energy) required to elevate the shuttle from height a to height b is given by
Z b
GM m
W = dr
a r2
Since the gravitation force has infinite range, the total work required by the shuttle to escape
the Earth’s orbit can be computed when the upper terminal is infinite. That is
Z ∞ Z b
GM m GM m
W = 2
dr = lim dr
a r b→∞ a r2
Based on the previous exercise, we have
b
GM m GM m
W = lim − =
b→∞ r a a
So the energy required by the shuttle to escape the Earth’s orbit starting at distance a from
the Earth’s core is GMa m .
Exercise 5.22
Z ∞
Evaluate the integral 2xdx, if it exists.
0
81
F17XA 5 Introduction to Integration
So as b tends to infinity, the term b does not approach a finite number. Hence the integral
2
82
F17XA 6 Statistics and Probability
6.1 Statistics
Statistics is concerned with the collection and analysis of data. Data consists of observations of
one or more variables. The first consideration of data collection is the nature of the observations
made. Observations generally fall into several categories: Categorical, Ordinal, and Cardinal. In
this subject, we will be focused on cardinal data. This type of data can be divided into discrete or
continuous. Discrete data are exact and require no rounding. For example, the number of people
living in a house or a person’s shoe size. Continuous data require rounding of some sort and can
take on any value within a given range. For example, the temperature of an electronic component
or the time required to complete a race.
The mean is the sum of all observations divided by the number of observations. The median is
the “middle” observation if n is odd and the midpoint of the middle two observation when n is
9
We use this word loosely as there are different interpretation of expectation, even amongst experts
83
F17XA 6 Statistics and Probability
even.
Exercise 6.2
Find the mean and median of the following data set.
5 7 −4 1 9 12
Solution: First, we count the number of observations to get n = 6. Then we can apply
the formula to get:
1 30
x= (5 + 7 − 4 + 1 + 9 + 12) = =5
6 6
To obtain the median, we first arrange the observations in ascending order:
−4, 1, 5, 7, 9, 12. Since n is even, we take the midpoint of the middle two observations.
So the median is 5+7 2
= 6.
There is no “right” answer for the correct choice of average. The sensible choice will depend on
the nature of the data sets. The mean is more sensitive to changes in individual observations,
and outliers in particular. On the other hand, the median is more robust to outliers and changes
to individual observations.
Example 6.3
Consider the following scenarios:
• Suppose we are computing the average weight of a component off a mechanical production
line. The mean will be a sensible measure of average since almost all components will be
of similar weight with very few outliers. Large changes in the mean over a short period
of time might signal a malfunction. On the other hand, the median may not be robust
enough to detect such anomalies
• Suppose we want to compute the average household income for a population. The mean
will not be descriptive of a typical household since it will be heavily skewed towards the
small number of extremely wealthy households leaving the majority of the population with
“below average” income. In this case, the median is typically quoted as being a better
representation of the population.
84
F17XA 6 Statistics and Probability
then there are two modes. In general, the data has N modes if there are N observations that
share the highest frequency.
Exercise 6.5
Find the standard deviation of the following dataset:
5 7 −4 1 9 12
Thus " #
1
2 302 166
σ = 316 − = = 33.2
5 6 5
as before, so we have σ = 5.76.
85
F17XA 6 Statistics and Probability
Exercise 6.6
Tyre tread depth was measured during roadside checks by the transport police. The results
for 100 cars are tabulated below:
Tyre tread depth (in mm) xi Frequency fi
1.6 12
1.8 22
2.0 26
2.2 40
1. Calculate the mean tyre tread depth
2. Calculate the standard deviation of the tyre tread depth
3. State the mode of tyre tread depth measurements
Solution:
1. We calculate the mean as: P
f i xi
x = Pi
i fi
which is given by
12 × 1.6 + 22 × 1.8 + 26 × 2.0 + 40 × 2.2 198.8
x= = = 1.988mm
12 + 22 + 26 + 40 100
2. We calculate the standard deviation as:
fi (xi − x)2
P
2 i
σ =
( i fi ) − 1
P
86
F17XA 6 Statistics and Probability
3. The mode is the most common measurement, looking at the table gives 2.2mm as
the one with the higest frequency.
The first property tells us that the probability of any event is between 0 (impossible) and 1
(certain). The second property tells that the probability of any outcome happening is certain.
The third property tells us that we can add probabilities nicely provided that two events don’t
overlap.
One assumption we commonly make is the equiprobable assumption. That is, every outcome
in the sample space is equally likely to occur with probability |Ω|
1
, where |Ω| is the number of
outcomes in the sample space.
Example 6.8
• Rolling a fair six-sided die:
outcome 1 2 3 4 5 6
probability 1
6
1
6
1
6
1
6
1
6
1
6
87
F17XA 6 Statistics and Probability
An event E is a set of desirable outcomes. The probability P (E) of the event E is the sum of
the probabilities of these outcomes.
Exercise 6.9
Let A be the event that a die shows at most 4 and let B be the event that the die shows
an even number. Find P (A) and P (B).
Solution: First, we have the events A = {1, 2, 3, 4} and B = {2, 4, 6}. Hence
1 1 1 1 2
P (A) = + + + =
6 6 6 6 3
1 1 1 1
P (B) = + + =
6 6 6 2
Exercise 6.10
In a series of repeated measurements, the number of radioactive particles emitted over a
fixed time interval is recorded:
outcome: 0 1 2 3 4 5 ≥6
probability: 0.368 0.368 0.184 0.061 0.015 0.003 0.001
Let D be the event that at most 2 particles are emitted. What is P (D)?
88
F17XA 6 Statistics and Probability
In particular complement of the entire sample space Ωc = ∅ = {} is the event with no desirable
outcomes. Hence P (∅) = 0.
Example 6.12
A fair six-sided dice is rolled Ω = {1, 2, 3, 4, 5, 6} and two events are defined:
2
A = {1, 2, 3, 4} P (A) =
3
1
B = {2, 4, 6} P (B) =
2
Then we can compute the following events and probability:
1
Ac = {5, 6} P (Ac ) =
3
1
B c = {1, 3, 5} P (B c ) =
2
1
A ∩ B = {2, 4} P (A ∩ B) =
3
5
A ∪ B = {1, 2, 3, 4, 6} P (A ∪ B) =
6
1
(A ∪ B)c = {5} P ((A ∪ B)c ) =
6
Ω = {1, 2, 3, 4, 5, 6} P (Ω) = 1
∅ = Ωc = {} P (∅) = 0
Based on the example, we can see that for any event A, we have
P (Ac ) = 1 − P (A)
However, probabilities involving intersections and unions are more complicated and require more
machinery.
89
F17XA 6 Statistics and Probability
A∩B =∅
Theorem 6.14
Given two events A and B, we have
P (A ∪ B) = P (A) + P (B) − P (A ∩ B)
P (A ∪ B) = P (A) + P (B)
The subtraction is to ensure that we do not count the same outcomes twice.
Example 6.15
A fair six-sided dice is rolled and the following events are defined:
P (A ∪ C) = P (A) + P (C)
4 1 5
= + =
6 6 6
The events A and B are not mutually exclusive. We have A ∩ B = {2, 4} and hence
P (A ∪ B) = P (A) + P (B) − P (A ∩ B)
4 3 2
= + −
6 6 6
5
=
6
We can also compute P (A ∪ B) directly since A ∪ B = {1, 2, 3, 4, 6}.
Exercise 6.16
Outside work on an oil platform cannot proceed if it is either too wet or too windy. The
probability of the event A that work is cancelled because it is too wet is 0.3, and the
probability of the event B that work is cancelled because it is too windy is 0.2. The
probability of the event A ∩ B that it is simultaneously both too wet and too windy is 0.1.
Find the probability that work can proceed.
90
F17XA 6 Statistics and Probability
Solution: The event that it is either too wet or too windy is A ∪ B and we have
The event C that work can proceed is the same as the event A ∪ B not occurring. Hence,
the probability that work can proceed is given by
P (A ∩ B) = P (A) × P (B)
In other words, independent events do not affect each other. Knowing whether event A happened
does not affect the probability of B happening.
Example 6.18
• The outcomes of flipping two coins are independent event since knowing the outcome of
one does not impact on the outcome of the other.
• The outcome of drawing two cards from a deck (without replacement) are dependent
events since knowledge of the first card affects the second card.
Exercise 6.19
A computer is defective with probability 14 and a second computer is defective with prob-
ability 10
1
. Assuming that these events are independent, find the probability that at least
one computer is working.
Solution: Let A be the event that the first computer is defective, B the event that the
second computer is defective. So we have
1
P (neither computer working) = P (A ∩ B) = P (A)P (B) =
40
Hence
39
P (at least one working) = 1 − P (neither working) =
40
91
F17XA 6 Statistics and Probability
Exercise 6.20
A plant has three synthesis reactors: A,B, and C. All three produce urea. Urea is produced
in discrete batches, each of 100 kg. Reactor A produces 60% of the total volume, Reactors
B and C 25% and 15% respectively. The failure rate of the three Reactors also varies:
5% of the production by Reactor A has failed levels of contamination. The failure rate
for Reactors B and C are 2% and 6%. A batch passes if it does not fail. Calculate the
probability that a batch picked at random is
1. from Reactor B
2. from Reactor A and passes
3. from Reactor C and fails
4. passes
Solution: First, convert the percentages to probabilities and construct the tree diagram:
O
0.6 0.15
0.25
A B C
0.05 0.95 0.02 0.98 0.06 0.94
92
F17XA 6 Statistics and Probability
Example 6.22
Example of Bernoulli trials include:
• Flipping a coin with success to be heads has p = 21 .
• Rolling a fair six-sided die with success to be rolling a 6 has p = 16 .
• An item off a manufacturing line with success to be passing the quality assessment. Here,
p might be obtained experimentally.
A single Bernoulli trial is not very complicated. We are mostly interest in repeating Bernoulli
trials. We will illustrate some important ideas via an example.
Exercise 6.23
A reactor produces batches of urea of which 85% passes quality assessment. Three batches
are selected. Calculate the probability that the number of batches that pass is:
1. zero
2. one
3. two
4. three?
Solution: Create a Bernoulli trial by defining S to be the event that a given batch passes
and define F as the batch is fails. Then P (S) = 0.85 and P (F ) = 0.15. We can construct
the tree diagram
93
F17XA 6 Statistics and Probability
O
0.85 0.15
3. If two batches passes then the outcomes are SSF or SF S or F SS. As with the
previous part these events are mutually exclusive
While this approach is suitable for a small number of repeated trials, we may struggle to create
the full tree diagram as the number of trials increases. To simplify the process, we observe that
in each of the computations, the probability of each mutually exclusive outcome were identical.
Hence we only have to compute the number of ways we can get the right number of successes
and fails.
94
F17XA 6 Statistics and Probability
Example 6.25
For small values of n, we have
• 3! = 3 × 2 × 1 = 6
• 4! = 4 × 3 × 2 × 1 = 24
• 5! = 5 × 4 × 3 × 2 × 1 = 120
When computing binomial coefficients, we should avoid expanding the full factorial.
!
2 2! 2! 2!
= = = =1
0 (2 − 0)!0! 2!0! 2!
!
4 4! 4! 4 × 3 × 2! 4×3
= = = = =6
2 (4 − 2)!2! 2!2! 2!2! 2!
!
8 8! 8! 8 × 7 × 6 × 5! 8×7×6
= = = = = 56
3 (8 − 3)!3! 5!3! 5!3! 3!
95
F17XA 6 Statistics and Probability
Exercise 6.27
A reactor produces batches of urea, 0.87 of which passes quality assessment. 10 batches
are selected. Calculate the probability that the number of acceptable batches from these
10 are:
1. zero
2. nine
3. ten
Solution: Let S and F be defined as S “the batch passes” and F “the batch fails” with
P (S) = 0.87 and P (F ) = 0.13. We then have
1.
!
10
P (0 passes) = (0.87)0 (0.13)10
0
= 1 × 1 × 1.37 × 10−9
2.
!
10
P (9 passes) = (0.87)9 (0.13)1
9
= 10 × 0.2855 × 0.13 = 0.371
3.
!
10
P (10 passes) = (0.87)10 (0.13)0
9
= 1 × 0.248 × 1 = 0.248
P (outcome is exactly a) = 0
Example 6.28
There is zero probability that:
• A race is completed in exactly 10 minutes
• The height of a person is exactly 1.80 metres
96
F17XA 6 Statistics and Probability
Instead, continuous probabilities are defined on an interval. So while there is zero probability of
a competitor finishing a 100m race in exactly 15 seconds, there is a good chance that they finish
the race sometime between 14 and 16 seconds.
The probability density function P (X) is the function that describes the probability of an
event with continuous outcome over an interval. Further the probability density function
has the following properties.
• the function is never negative. That is, P (X) ≥ 0
• the total area under the graph of the function is equal to 1
The probability that the variable X lies between two given values, say a and b, is given by
the area under the curve between X = a and X = b. That is
Z b
P (a ≤ X ≤ b) = P (X)dX.
a
In the event that one of the inequalities involve ±∞, we can omit the inequality. Hence
P (−∞ ≤ X ≤ b) = P (X ≤ b) and P (a ≤ X ≤ ∞) = P (a ≤ X).
It is equivalent to denote a probability over an interval as P (a < X < b) instead of P (a ≤
X ≤ b) since
Exercise 6.30
Consider the function
e−X X≥0
P (X) =
0 X<0
1. Show that P (X) is a probability density function
2. Find the probability that the outcome is between 0 and 1.
Solution:
1. We check the two conditions:
• Since e−X > 0, have P (X) ≥ 0 for all X
97
F17XA 6 Statistics and Probability
• we have to evaluate
Z ∞ Z 0 Z ∞
P (X)dX = P (X)dX + P (X)dX
−∞ −∞ 0
Z 0 Z ∞
= 0dX + e−X dX
−∞ 0
−X ∞
h i
= 0 + −e
0
h i
= 0 + lim 1 − e−b
b→∞
=1
The standard normal distribution Z has mean 0 and standard deviation 1. This is denoted Z = 0
and σ = 1.
98
F17XA 6 Statistics and Probability
0.3
0.2
0.1
−4 −3 −2 −1 0 1 2 3 4
= −
a b b a
2. P (Z ≤ b) = 1 − P (Z ≥ b)
= 1−
b b
3. P (Z ≥ b) = P (Z ≤ −b)
99
F17XA 6 Statistics and Probability
b −b
Based on these relations, we can reduce the computation of probabilities down to computing
P (Z ≤ b) for different values of b. Unfortunately, this is where things get challenging. There are
no easy methods of finding an antiderivatives of P (Z). So instead of computing these probabilities
analytically, we use the table for the Standard Normal distribution (Appendix B). The table is
indexed by b in the rows and columns (also known as z-scores) and the corresponding entry in
the table is P (Z ≤ b).
Exercise 6.32
Let Z be the standard normal distribution. Compute the following probabilities.
1. P (Z ≤ 1.83), P (Z ≤ 0.57)
2. P (Z ≥ 0.57)
3. P (Z ≤ −0.57)
4. P (−0.57 ≤ Z ≤ 1.83)
Solution:
1. We look up the z-scores in the table by selecting the row corresponding to the first
decimal place and selecting the right column based on the second decimal place.
Doing this, we get:
P (Z ≥ 0.57) = 1 − P (Z ≤ 0.57)
= 1 − 0.7157
= 0.2843
P (Z ≤ −0.57) = P (Z ≥ 0.57)
= 0.2843
100
F17XA 6 Statistics and Probability
For mathematical reasons beyond the scope of this subject, we use the symbol ∼ to denote
that two outcomes that have the same probability distribution.
Comparing Definition 6.31 with Definition 6.33 shows that any normal distribution can be con-
verted back to the standard normal distribution via a series of graph transformations.
Theorem 6.34
We can convert a normal distribution X ∼ N (µ, σ) to the standard normal distribution
Z ∼ N (0, 1) via the transformation
X −µ
Z=
σ
In particular, if two numbers x, z satisfy the relation
x−µ
z=
σ
then P (X ≤ x) = P (Z ≤ z).
The number x is usually given in the context of the question and we have to compute z (hence,
z-score). From there, we can find the probability using the stardard normal table Definition 6.33.
Let’s see this in an example:
Exercise 6.35
The thickness of an aluminium sheet is normally distributed with mean 52µm and standard
deviation 5µm. What proportion of the aluminium sheets have thickness:
1. less than 60.3µm;
2. less than 40µm;
10
In fact the Central Limit Theorem says that data sets involving continuous measurement will tend towards
normal distribution as the sample size increases.
101
F17XA 6 Statistics and Probability
Solution: We denote the distribution of the aluminium sheet thickness (in µm) by X ∼
N (52, 5).
1. To find P (X ≤ 60.3), we first compute z = 60.3−52
5
= 1.66. Hence
P (X ≤ 60.3) = P (Z ≤ 1.66)
= 0.9515
P (X ≤ 40) = P (Z ≤ −2.4)
= P (Z ≥ 2.4)
= 1 − P (Z ≤ 2.4)
= 1 − 0.9918
= 0.0082
102
F17XA 7 Vectors
7 Vectors
7.1 Basic Operations
A scalar is a quantity which has magnitude only, e.g., temperature, mass, pressure.
Many other quantities are not sufficiently defined by their magnitude alone, e.g., displacement,
velocity, magnetic field. These quantities involve both a magnitude and direction. Such quantities
are called vectors.
Let P and Q be two points in space. The vector P~Q is the directed line segment starting
at P and finishes at Q.
• The vector can be labelled using an underlined lower case letter a = P~Q.
• The point P is the tail of the vector
• The point Q is the head of the vector
• The magnitude of the vector is the positive length of the line, denoted |a| or |P~Q|.
• The direction is in the direction from P towards Q.
Q
P~Q = a
P
In this subject, we will restrict our discussion of vectors to 2 or 3 dimensions. However, the same
logic can extended to higher dimensions or more abstract spaces.
Before we proceed, some notation. In this section, we shall
• Use upper case letters P, Q, R, S, . . . to denote points in space. The upper case O denotes
the origin.
• Use overhead arrow P~Q, underlined lower case a, or bold font a to denote vectors.
Two vectors a and b are equal, denoted a = b, if they have the same magnitude in the same
direction. An immediate consequence of this definition is that the absolute position of the tail
and head of a vector is not essential in defining a vector, only their position relative to each other.
Example 7.2
Consider the parallelogram with corners at P QRS
Q S
P~Q
~
RS
P R
103
F17XA 7 Vectors
Q S
T~U
P~Q
T
~
RS
P R
Given two vectors a and b, we can add the two vectors a + b by joining the tail of one to the head
of the other. This is called the triangle rule for vector addition.
Example 7.3
Define the vectors a and b as follows:
a b
a
a+b
a+b
a
a
The null vector or zero vector, denoted 0, is the vector of magnitude 0. The direction is undefined
for this vector. From this, we can define the negative vector of a as the vector with the same
magnitude but in the opposite direction. This vector is denoted −a. Hence we have the immediate
result
a + (−a) = a − a = 0
104
F17XA 7 Vectors
Example 7.4
Define the vectors a = P~Q and b = QR,
~ then we have −b = QS
~ and a − b = P~S.
−b b
S Q R
a−b a
Example 7.5
Define the vector a as follows:
a
For n ≥ 0, we can generalise na as a vector in the direction of a with length scaled by a factor of
n. In the case where the scalar is negative, we have −na = n(−a). That is, the negative vector
scaled by n.
−2a
2a
a
1
2
a
Since the length of vectors can be scaled, we can introduce the notion of an unit vector. For any
vector a 6= 0, we can create a unit vector u = |a|
1
a which has magnitude 1 in the direction of a.
105
F17XA 7 Vectors
7.2 Components
Let P be a point in 2 dimensions (the xy-plane) with Cartesian coordinates (x, y). And let O
denote the origin with coordinates (0, 0). We say that P has a position vector p = OP
~ . Using
the Pythagorean theorem, we get that
q
|p| = x2 + y 2
y-axis
y P
~
OP
x-axis
O x
P~Q = OQ
~ − OP
~ =q−p
P P~Q
~ =p
OP
Q
~ =q
OQ
O
At this point, we define three key unit vectors that forms the standard basis.
It follows from the triangle rule that we can write the position vector of the point P with Cartesian
coordinates (x, y) as p = xi + yj. That is we have
106
F17XA 7 Vectors
p
yj
O xi
Exercise 7.8
Find the position vector p of the point P with Cartesian coordinates (−1, 3)
In the physical world, we can think of the point P with Cartesian coordinates (x, y, z) as a physical
point in space is being occupied and the vectors i, j, k indicating a direction from point P . The
two need not be related.
We can express any vector P~Q in terms of the unit vectors i and j in this way. Consider the
following example.
Example 7.9
Consider the points P and Q defined by (2, 1) and (6, 4) respectively.
Q = (6, 4)
P~Q
bj
P = (2, 1) ai
We can write any vector v as v = ai + bj, for some suitable choice of a, b. In this case, we say
the vector v is expressed or resolved in terms of i, j. The numbers a and b are called the i and
j components of v.
For students who have encountered matrices, we can express the vector v = ai + bj as a column
matrix !
a
v=
b
or a row matrix v = (a, b).
We adopt the convention of the comma separated row matrix notation (a, b) for vectors.
107
F17XA 7 Vectors
Be aware to not confuse this notation with the notation for Cartesian coordinates.
Exercise 7.10
If a = (a1 , a2 ), find |a|.
Solution:
a = (a1 , a2 ) = a1 i + a2 j
q
|a| = a21 + a22
The same ideas will hold in 3 dimensions. The position vector of a point P with Cartesian
coordinates (x, y, z) is
~ = xi + yj + zk
p = OP
More generally, we can resolve any 3 dimensional vector v as
v = ai + bj + ck
Exercise 7.11
Express v = (2, 3, −1) in the form v = ai + bj + ck and find |v|.
q √
Solution: We have v = 2i + 3j − k and |v| = 22 + 32 + (−1)2 = 14.
Exercise 7.13
If a = (−3, 9, 1) and b = (−6, 9, −12), find a + 13 b.
108
F17XA 7 Vectors
Solution:
1 1
a + b = (−3, 9, 1) + (−6, 9, −12)
3 3
= (−3, 9, 1) + (−2, 3, −4)
= (−5, 12, −3)
Exercise 7.14
Points P and Q have Cartesian co-ordinates (−1, 3, 4) and (−3, 2, 5). Find the distance
between P and Q, then determine the Cartesian coordinates of the midpoint M of P~Q.
Solution: For this type of geometrical problem, the first step is always to draw the the
relevant vectors (you can draw it in 2D and you don’t need to worry about the detailed
position of points). We have
Q
M
~
OM P~M
O ~
P
OP
~ = OP
OM ~ + P~M
~ + 1 P~Q
= OP
2
1
= (−1, 3, 4) + (−2, −1, 1)
2
1 1
= (−1, 3, 4) + −1, − ,
2 2
5 9
= −2, ,
2 2
Thus the Cartesian coordinates of M are −2, 52 , 92 .
109
F17XA 7 Vectors
a · b = |a||b| cos(θ)
b
b θ
θ
a a
Since |a| ≥ 0 and |b| ≥ 0, the dot product is entirely determined by the angle between the vectors.
Given non-zero vectors a and b,
• if they are in the same direction, then θ = 0, cos(θ) = 1 and
a · b = |a||b|
Exercise 7.16
Find the scalar product of the vectors (1, 1, 0) and (2, 2, 0).
110
F17XA 7 Vectors
Solution: These vectors are in the same direction since (2, 2, 0) = 2(1, 1, 0). Hence,
√ √ √
(1, 1, 0) · (2, 2, 0) = 2 8 = 16 = 4
Scalar products of the standard basis vectors (i, j, k) are useful to know.
Theorem 7.17
Let i, j and k be the unit vectors of the standard basis. Since they are all mutually
perpendicular, we have:
This means we can compute the scalar product of vectors expressed in component form.
Example 7.18
Consider the vectors a = (a1 , a2 , a3 ) = a1 i + a2 j + a3 k and b = (b1 , b2 , b3 ) = b1 i + b2 j + b3 k,
then we have:
Hence, we have:
a · b = a1 b 1 + a2 b 2 + a3 b 3
a · b = a1 b 1 + a2 b 2 + a3 b 3
111
F17XA 7 Vectors
In particular:
• For any vector a: a · a = a21 + a22 + a23 = |a|2
• For 2 dimensional vectors a = (a1 , a2 ) and b = (b1 , b2 ) we have a · b = a1 b1 + a2 b2 .
Exercise 7.20
Find a · b when a = (1, −4, 3) and b = (6, −2, −1).
Exercise 7.21
Find the angle between the vectors e = i − 4j and f = 3i + 2j.
e · f = |e||f | cos(θ)
e · f = 1 × 3 + (−4) × 2 = −5
q √
|e| = 12 + (−4)2 = 17
√
|f | = 13
Hence,
e·f −5
cos(θ) = =√ √
|e||f | 17 13
θ = 1.913 radians (3DP)
Exercise 7.22
Find the angle at A in the triangle with vertices A, B, C with Cartesian coordinates
(1, 0, −1), (2, 1, 3), (3, 2, 1) respectively.
112
F17XA 7 Vectors
~
AB
B
O
We then use Theorem 7.6 to get
~ = OB
AB ~ − OA
~ = (2, 1, 3) − (1, 0, −1) = (1, 1, 4)
~ = OC
AC ~ − OA
~ = (3, 2, 1) − (1, 0, −1) = (2, 2, 2).
Thus we have
~ · AC
AB ~ = 1 × 2 + 1 × 2 + 4 × 2 = 12
√ √
~ = 1 + 1 + 16 = 18,
|AB|
√ √
~ = 4 + 4 + 4 = 12,
|AC|
Hence
~ · AC
AB ~
cos(θ) =
~ AC|
|AB|| ~
s
2
=
3
Given non-zero vectors a and b in 3 dimensions, the vector product (or cross product) is
defined by
a × b = |a||b| sin(θ)n
where 0 ≤ θ ≤ π is the angle between the positive directions of a and b and n is the unit
normal vector described by the right-hand rule.
113
F17XA 7 Vectors
Example 7.25
Consider the following examples:
b c
d
a
The right-hand rule has the unit normal of a × b pointing out of the plane towards the reader,
whereas the unit normal of c × d points into the page away from the reader
As for the scalar product, we can simplify the complexity of these calculations by understanding
how vector products work on the standard basis.
• The angle between each vector and itself is θ = 0. So a × a = 0. Hence
i×i=j×j =k×k =0
• Since i and j are perpendicular, we have sin(θ) = 1. With the right-hand rule, we get:
i×j =k
j × i = −k
11
no pun intended
114
F17XA 7 Vectors
Theorem 7.26
Let i, j and k be the unit vectors of the standard basis. We have the cross products:
i j k
b × a = −a × b
a×a=0
a × (b + c) = a × b + a × c
(a + b) × c = a × c + b × c
Example 7.28
Consider the vectors a = (a1 , a2 , a3 ) = a1 i + a2 j + a3 k and b = (b1 , b2 , b3 ) = b1 i + b2 j + b3 k,
then we have:
115
F17XA 7 Vectors
Hence, we have:
Exercise 7.29
If a = (2, −1, 3) and b = (1, 2, 1), find a × b.
Solution:
a × b = (2i − j + 3k) × (i + 2j + k)
= −7i + j + 5k
= (−7, 1, 5)
We have a geometrical interpretation to the magnitude of the vector product if we consider the
parallelogram below
b
h
θ
a
a · (b × c)
Example 7.31
Combining the results of Example 7.18 and Example 7.28, we can show that for:
a = a1 i + a2 j + a3 k
b = b1 i + b 2 j + b3 k
c = c1 i + c2 j + c3 k
116
F17XA 7 Vectors
we have:
and hence
Exercise 7.32
Find the scalar triple product a · (b × c) when a = (1, 2, 3), b = (2, 1, 3), c = (4, 0, 1)
Solution: We have
A geometrical interpretation may also be given to the scalar triple product. Consider the following
parallelepiped:
φ a
c
The volume V of this object is the base area B times the height h = |a| cos(φ). We also know
B = |b × c| from above, and that b × c points up from the right-hand-rule. Hence, we have
V = Bh
= |b × c||a| cos(φ)
= |a||b × c| cos(φ)
= a · (b × c)
Thus we have that the volume of such a parallelepiped is given by the scalar triple product a·(b×c).
This application of the formula gives the identity
a · (b × c) = b · (c × a) = c · (a × b)
since the volume does not depend upon the cyclic order we choose for a, b and c.
117
F17XA 7 Vectors
If any two of the vectors are the same, then the resulting parallelepiped gives zero volume.
a · (a × b) = a · (b × a) = 0
In arriving at V = a · (b × c), we have been rather careful about choosing the orientation of a, b
and c. More generally, we have V = |a · (b × c)|.
d
P ~ + td
OP
~ +d
OP
we see that the position vector of one point on the line L is given by OP
~ + d. In fact, any point
~ + td for some value of the parameter
R on the line will have a position vector of the form r = OP
t.
Let P = (px , py , pz ) be a point on the line and d = (dx , dy , dz ) be a vector parallel to the
line. The set of points R = (x, y, z) on the line is given by:
• The parametric equation of the line L
r = p + td
for a parameter t.
• The set of parametric equations
x =px + tdx
y =py + tdy
z =pz + tdz
118
F17XA 7 Vectors
Exercise 7.34
Find the parametric equation of the line through the point (1, 2, 1) in the direction of the
vector (−1, 1, 3). Hence, find the Cartesian form of the line.
x=1−t
y =2+t
z = 1 + 3t
If we are given two points on the line P and Q, then we can produce a vector parallel to the line
using Theorem 7.6 and d = OQ ~ − OP ~ . Then we recover the parametric equation r = p + td.
Exercise 7.35
Find the parametric equation of the line passing through the two points P = (1, 1, 0) and
Q = (2, 1, 3).
119
F17XA 7 Vectors
Exercise 7.36
The Cartesian equation of a line is given by
x−1 y+2
= = z − 2.
3 2
Write down the parametric version of the equation and hence find a vector parallel to the
line, and a point on the line.
Hence, (3, 2, 1) is a vector parallel to the line, and (1, −2, 2) is a point on the line (set
t = 0).
Exercise 7.37
Find the Cartesian equation of the line through the point (1, −3, 2) in the direction of the
vector (−1, 0, 3).
x=1−t
y = −3 + 0t = −3
z = 2 + 3t
The challenge comes from 0 entries in d. In fact, every 0 entry in d will increase the number of
conditions in the resulting line.
120
F17XA 7 Vectors
P~R · n = 0
n
R
P~R
P
p r
Any vector from P perpendicular to n gives a point R in the plane. Another way to interpret this
definition is that all points R in the plane have the same normal vector. Using Theorem 7.6, we
have
P~R · n = 0
(r − p) · n = 0
r·n−p·n=0
r·n=p·n
Exercise 7.39
Find the equation of the plane that contains the point P with position vector p = (1, 2, 4),
and has the normal vector n = (1, 1, −1).
In cases where we are not given the normal directly, we will have to compute it. For example,
if we are given the position vectors a, b and c of 3 points that lie in the plane. In this case, we
121
F17XA 7 Vectors
can construct a normal n by taking the vector product of any two vectors that are parallel to the
plane. For example, we can use Theorem 7.6 to get AB ~ = b − a and AC ~ = c − a, and thus we
can take
n = (b − a) × (c − a)
before applying the equation of the plane.
Exercise 7.40
Find a plane through the points A = (1, 1, −1), B = (2, 0, 2), and C = (0, −2, 1).
Their vector product is perpendicular to both vectors and therefore to the plane. This will
give a normal n to the plane.
~ × AC
AB ~ = (i − j + 3k) × (−i − 3j + 2k)
= 7i − 5j − 4k
Thus a normal to the plane is n = (7, −5, −4)a . The point A with position vector
a = (1, 1, −1) is on the plane so the equation of a plane is given by
r·n=a·n
Subsituting gives:
122
A Formula Sheet
Hyperbolic Functions Standard Integrals
Z
x −x
e −e f (x) f (x)dx
sinh(x) =
2
ex + e−x (ax + b)n+1
cosh(x) = (ax + b) n
+ C for n 6= −1
2 a(n + 1)
1 = cosh2 (x) − sinh2 (x) 1
sin(ax + b) − cos(ax + b) + C
sinh(2x) = 2 sinh(x) cosh(x) a
cosh(2x) = 2 sinh2 (x) + 1 = 2 cosh2 (x) − 1 1
cos(ax + b) sin(ax + b) + C
a
1 ax
eax e +C
Standard Derivatives a
1 1
f (x) f 0 (x) ln(ax + b) + C for x > 0
ax + b a
Vectors
xn nxn−1
Scalar product: a · b = |a||b| cos(θ)
sin(ax + b) a cos(ax + b) For a = (a1 , a2 , a3 ), b = (b1 , b2 , b3 )
a · b = a1 b 1 + a2 b 2 + a3 b 3
cos(ax + b) −a sin(ax + b)
For standard basis:
eax aeax
i·i=1 j·j =1 k·k =1
ln(ax + b) a
ax+b
i·j =0 j·k =0 k·i=0
z 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09
0.0 0.5000 0.5040 0.5080 0.5120 0.5160 0.5199 0.5239 0.5279 0.5319 0.5359
0.1 0.5398 0.5438 0.5478 0.5517 0.5557 0.5596 0.5636 0.5675 0.5714 0.5753
0.2 0.5793 0.5832 0.5871 0.5910 0.5948 0.5987 0.6026 0.6064 0.6103 0.6141
0.3 0.6179 0.6217 0.6255 0.6293 0.6331 0.6368 0.6406 0.6443 0.6480 0.6517
0.4 0.6554 0.6591 0.6628 0.6664 0.6700 0.6736 0.6772 0.6808 0.6844 0.6879
0.5 0.6915 0.6950 0.6985 0.7019 0.7054 0.7088 0.7123 0.7157 0.7190 0.7224
0.6 0.7257 0.7291 0.7324 0.7357 0.7389 0.7422 0.7454 0.7486 0.7517 0.7549
0.7 0.7580 0.7611 0.7642 0.7673 0.7704 0.7734 0.7764 0.7794 0.7823 0.7852
0.8 0.7881 0.7910 0.7939 0.7967 0.7995 0.8023 0.8051 0.8078 0.8106 0.8133
0.9 0.8159 0.8186 0.8212 0.8238 0.8264 0.8289 0.8315 0.8340 0.8365 0.8389
1.0 0.8413 0.8438 0.8461 0.8485 0.8508 0.8531 0.8554 0.8577 0.8599 0.8621
1.1 0.8643 0.8665 0.8686 0.8708 0.8729 0.8749 0.8770 0.8790 0.8810 0.8830
1.2 0.8849 0.8869 0.8888 0.8907 0.8925 0.8944 0.8962 0.8980 0.8997 0.9015
1.3 0.9032 0.9049 0.9066 0.9082 0.9099 0.9115 0.9131 0.9147 0.9162 0.9177
1.4 0.9192 0.9207 0.9222 0.9236 0.9251 0.9265 0.9279 0.9292 0.9306 0.9319
1.5 0.9332 0.9345 0.9357 0.9370 0.9382 0.9394 0.9406 0.9418 0.9429 0.9441
1.6 0.9452 0.9463 0.9474 0.9484 0.9495 0.9505 0.9515 0.9525 0.9535 0.9545
1.7 0.9554 0.9564 0.9573 0.9582 0.9591 0.9599 0.9608 0.9616 0.9625 0.9633
1.8 0.9641 0.9649 0.9656 0.9664 0.9671 0.9678 0.9686 0.9693 0.9699 0.9706
1.9 0.9713 0.9719 0.9726 0.9732 0.9738 0.9744 0.9750 0.9756 0.9761 0.9767
2.0 0.9772 0.9778 0.9783 0.9788 0.9793 0.9798 0.9803 0.9808 0.9812 0.9817
2.1 0.9821 0.9826 0.9830 0.9834 0.9838 0.9842 0.9846 0.9850 0.9854 0.9857
2.2 0.9861 0.9864 0.9868 0.9871 0.9875 0.9878 0.9881 0.9884 0.9887 0.9890
2.3 0.9893 0.9896 0.9898 0.9901 0.9904 0.9906 0.9909 0.9911 0.9913 0.9916
2.4 0.9918 0.9920 0.9922 0.9925 0.9927 0.9929 0.9931 0.9932 0.9934 0.9936
2.5 0.9938 0.9940 0.9941 0.9943 0.9945 0.9946 0.9948 0.9949 0.9951 0.9952
2.6 0.9953 0.9955 0.9956 0.9957 0.9959 0.9960 0.9961 0.9962 0.9963 0.9964
2.7 0.9965 0.9966 0.9967 0.9968 0.9969 0.9970 0.9971 0.9972 0.9973 0.9974
2.8 0.9974 0.9975 0.9976 0.9977 0.9977 0.9978 0.9979 0.9979 0.9980 0.9981
2.9 0.9981 0.9982 0.9982 0.9983 0.9984 0.9984 0.9985 0.9985 0.9986 0.9986
3.0 0.9987 0.9987 0.9987 0.9988 0.9988 0.9989 0.9989 0.9989 0.9990 0.9990
3.1 0.9990 0.9991 0.9991 0.9991 0.9992 0.9992 0.9992 0.9992 0.9993 0.9993
3.2 0.9993 0.9993 0.9994 0.9994 0.9994 0.9994 0.9994 0.9995 0.9995 0.9995
3.3 0.9995 0.9995 0.9995 0.9996 0.9996 0.9996 0.9996 0.9996 0.9996 0.9997
3.4 0.9997 0.9997 0.9997 0.9997 0.9997 0.9997 0.9997 0.9997 0.9997 0.9998
3.5 0.9998 0.9998 0.9998 0.9998 0.9998 0.9998 0.9998 0.9998 0.9998 0.9998
3.6 0.9998 0.9998 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999
3.7 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999
3.8 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999
3.9 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000