Introduction To Abstract Algebra
Introduction To Abstract Algebra
e o sag
fre mes
1% al
10 min
bli
su
MA136
Samir Siksek
Mathematics Institute
University of Warwick
Contents
Chapter I. Prologue
I.1. Who Am I?
I.2. A Jolly Good Read!
I.3. Proofs
I.4. Acknowledgements and Corrections
1
1
1
2
2
5
5
6
7
7
8
9
9
11
12
15
15
16
18
19
21
22
28
Chapter V. Groups
V.1. The Definition of a Group
V.2. First Examples (and Non-Examples)
V.3. Abelian Groups
V.4. Symmetries of a Square
29
29
29
31
32
37
38
40
41
ii
CONTENTS
41
42
45
45
48
49
49
49
57
58
59
60
61
63
66
67
69
71
72
74
76
76
77
78
81
81
83
84
85
86
86
89
89
90
93
93
96
97
101
102
CONTENTS
iii
109
109
110
111
114
117
119
121
121
122
123
124
Appendices
124
125
127
127
128
CHAPTER I
Prologue
I.1. Who Am I?
I Samir Siksek have the immense pleasure of introducing you to three
heroes of abstract algebra: groups, rings and fields. I am not an algebraist, but I have nothing but love, admiration and enthusiasm for the
subject. Some of my best friends are algebraists.
three tips
Abstract algebra is incredibly useful, but to get any benefit from it you
need to develop three essential habits:
(i) Study as many different examples as you can. The examples are
as important as the theorems and definitions. There is absolutely no use in knowing the definition of a group if youre not
familiar with the standard examples.
(2) Do calculations. Use calculations with matrices, permutations,
symmetries, etc. to test your ideas. Calculations will lead you to
counterexamples that can correct any erroneous ideas that you
have. But also with practice, you will find that calculations often
contain the germ of the proof youre looking for.
(c) Think geometrically and draw pictures. The true meaning of
most mathematical concepts is geometric. If you spend all your
time manipulating symbols (i.e. doing algebra) without understanding the relation to the geometric meaning, then you will
have very little in terms of mathematical insight.
The three habits will not only help you learn the subject and apply it,
you will develop great mathematical taste. Here is my favourite quotation
about algebra:
1
I. PROLOGUE
Algebra is the offer made by the devil to the mathematician. The devil says: I will give you this powerful machine, it will answer any question you like. All you need
to do is give me your soul: give up geometry and you
will have this marvellous machine.
Michael Atiyah
(Fields Medalist and Abel Prize winner)
I.3. Proofs
When I was a student I found it very hard to follow proofs in books and
lectures. So when I read a theorem, I would put down the book and try out
a few examples. After that I would try to prove the theorem myself. After I
finished (or if I failed) I would look at the proof in the book and compare.
I heartily recommend this strategy. Youll gain a great understanding of
the subject. Youll also get really good practice for the exam, where you
may asked to prove statements that you havent seen before.
I.4. Acknowledgements and Corrections
I offer my most enthusiastic congratulations the creaters of Vim, LATEX,
and TikZ on their . . . ermm . . . creations. I thank Alex Best, George Christofi,
Jenny Cooley, Dave McCormick, Joseph Miller, Ghaleo Tsoi Kwok-Wing
and James Soffe for suggesting corrections to previous versions of these
notes. I offer my profuse apologies to Vandita Ditz Patel for not deleting the smileys on page 20. Her reasoning nevertheless deserves to be
quoted:
Warwick students arent retards anymore; you cant have stuff
like that in your notes!
Please email me your comments, misprints and corrections. My address is samir.siksek@gmail.com.
profound entreaty
Are previous exam papers available? The module was only offered once before, and so only one exam paper is available. It is
an appendix to these notes. However, the best preparation for the
exam is to work hard at the homework.
Can we have the answers to last years exam? Not yet! I want you
to have a go yourself. But Ill post the answers online roughly one
month before your final exam. Feel free to remind me.
Are we required to know the proofs taken during the lectures or
found in the lecture notes? Yes, theorems, definitions, proofs and
homework questions. I love bookwork.
Can questions of a similar type to part C appear in the exam?
Yes they can! The exam will NOT be the academic equivalent of a
chainsaw massacre, but certainly some interesting questions must
appear, otherwise your maths degree would be trivial.
The exam is tomorrow/next week/within six months. Im running around like a headless chicken and stressing all my friends
because I cant do a homework question. Can I knock on your
door and ask you about it? Dont worry, Ive already branched out
into agony-aunting. Yes come and ask; I promise not to set the
dogs on you.
After Warwick I plan to devote my life to drunkenness and antisocial behaviour. Whilst Im here, I want to enjoy mathematics to
the full. Pleeeeeease set us obscene amounts of
homework? We must be careful. If you do too much homework, youll suffer severe withdrawal symptoms once the term is
over, and theres no telling what you might do to yourself. I simply
cant have that on my conscience. Ill therefore limit the homework
to one sheet per week. It cuts me deep to be so hard on you, but
sometimes you have to be tough to be kind.
I cant get hold of a pitchfork, and Im worried that a torch would
set off the fire alarm. How can I make constructive criticisms?
Constructive criticisms are welcome, face-to-face or by email.
CHAPTER II
FAQ
Why is this FAQ upside down? This is to improve the chances that
you will notice it and read it.
Your lectures are excruciatingly boring. Besides, 12 noon is a
perversely early time to schedule a lecture and no self-respecting
student can be expected to be awake yet. Do I really have to attend your lectures, or can I make do with these lecture notes? Ill
take that as an endorsement of the greatness of my writing skills
rather than a criticism of my lecturing skills. I love improvisation
during lectures. So the material we cover in the lectures will not be
identical to that in these printed notes. You need to come to the
lectures and make your own notes. The exam will be based on the
contents of the lectures as well as these printed lecture notes.
I was ill and missed a homework 1 deadline. These matters are
handled by the Undergraduate Office.
How is this course assessed? 15% for four homework assignments
and 85% for a one hour exam in term 3.
My copy of homework assignment x is lost/stolen. Where can I
get another copy? You can get all the assignments from my homepage (just put my name into google).
The questions on the homework sheets are divided into parts A,
B, C. Are the questions in part C optional? You are asked to handin both parts A and B but not part C. Part C questions should be
attempted by students who hope to obtain a First or a II:1. Students who hope to get at II:2 or a Third should avoid attempting
part C questions at all cost.
Do you subscribe to the illustrious Warwick tradition of setting
the same exam every year? Youre paying 9000 a year. You hardly
expect me to rip you off with a second-hand exam? Come on, how
low do you think I am?
1homework /"h@Umw3:k/ Noun. A mandatory regular course of mental
stimulation designed to vanquish intellectual impotence.
4
II. FAQ
CHAPTER III
Algebraic Reorientation
III.1. Sets
Sets are a basic notation for most of modern pure mathematics, but
life is too short to spend too much time on them. A set is simply a collection of objects. We use curly brackets to denote sets. For example, if I
write
A = {2, 5, 13},
then Im saying that the set A consists of the elements 2, 5, 13. This is
one way of specifying a set; we simply list all its elements between curly
brackets. The notation x S means x is a member of the set S and the
notation x S means x is not a member of the set S. For the set A above,
we know 13 A but 11 A.
We can also specify some infinite sets in this fashion; for example, the
set of all integers
Z = {. . . , 3, 2, 1, 0, 1, 2, 3, . . . }.
This is absolutely standard notation: when you see Z, youre expected to
know that its the set of integers. The set of natural numbers is
N = {0, 1, 2, 3, 4 . . . }.
Again this is standard notation (but not all mathematicians include 0 in
the natural numbers).
Here is an example of another way of specifying a set:
B = {x Z : x 2 = 16}.
This is saying that B is the set of all integers x satisfying the equation
x 2 = 16. Of course, another way of specifying the same set would be to
write B = {4, 4}. If we write
C = {x N : x 2 = 16},
then C = {4}.
If we write
D = {u Z : u 3 = 2},
then D is the set of integers u satisfying u 3 = 2. There are no integers
satisfying this equation, so D is the empty set. We denote the empty set
by ;, so we can write D = ;. Here are a couple more examples of empty
sets:
{w N : w 1} = ;,
{v Z : 301 v 399} = ;.
5
To get more practice with this notation, observe that another way of
specifying the natural numbers is to write
N = {x Z : x 0}.
Yet another correctalthough admittedly stupid wayis to write
N = {x Z : x 05}.
Here are some other sets that youre meant to know:
(1) Q is the set of rational numbers. We can write this as
na
o
Q=
: a, b Z, b 6= 0 .
b
Examples of elements of Q are 0, 5, 7/11, 3/2, 6/4 (the last two
being
pthe same element). From Foundations you should know
that
2 is irrational. You can write this statement in set notation:
p
2 Q. Other examples of irrational numbers are e and .
(2) R is the set of real numbers. It isnt possible to write R in straightforward way as for the sets above, but you can think of the elements of R as points
p on the real line. Examples of elements of R
are 7, 3/5, 385, 7, ( + 1)/2, sin 5.
(3) C is the set of complex numbers. You have seen complex numbers in your Further Mathematics A-Level. Recall that i is a symbol that satisfies i 2 = 1. We can write the set of complex numbers as
C = { a + bi : a, b R }.
While were on the subject of notation, compare the following two state- Tip
ments:
Some positive real numbers are irrational.
x R s.t. x > 0 x Q.
The two statements say exactly the same thing. A professional mathematician prefers the first, and an amateur prefers the second. Use only as
much mathematical notation as needed to make your ideas transparent
and precise, but no more 1.
III.2. Binary Operations
Let S be a set. A binary operation on S is a rule which for every two
elements of S gives another element of S. For example, addition is a binary operation on R, because given any two real numbers, their sum is
a real number. One way mathematicians like to say this is, R is closed
under addition. All that means is that the sum of two real numbers is a
real number.
Addition is also a binary operation on C, Q, Z and N. Likewise, multiplication is a binary operation on N, Z, Q, R, C.
1Coming soon to a supervision area near you:
B, C, and F .
g : S2 S3.
f (x) = x 2 5
g : R R,
g (x) = 3x + 2.
and
Then
( f g )(x) = f (g (x)) = f (3x + 2) = (3x + 2)2 5 = 9x 2 + 12x 1,
(g f )(x) = g ( f (x)) = g (x 2 5) = 3(x 2 5) + 2 = 3x 2 13.
The order matters here: f g is the result of substituting g into f , and g f
is the result of substituting f into g .
Note that in the example we started with functions R R and composed to obtain functions R R. Likewise, in the above definition, if
S 1 = S 2 = S 3 = S say, so that f and g are functions S S then g f is a
function S S. In this case (i.e. when the domains and codomains are
equal) is a binary operation. It is not a binary operation on S, because it
doesnt take two elements of S and give us another element. It is a binary
operation on the set of functions from S to itself.
The following lemma might look silly and useless, but it one of the
most important results we shall meet in this module, and we shall use it
again and again.
Lemma III.2. Let S 1 , S 2 , S 3 , S 4 be sets and let f , g , h be functions
h : S1 S2,
g : S2 S3,
f : S3 S4.
Then f (g h) = ( f g ) h.
P ROOF. To stop ourself from getting muddled, let k = g h and ` = f g .
Note that k(x) = g (h(x)) and `(x) = f (g (x)). So
( f (g h))(x) = ( f k)(x) = f (k(x)) = f (g (h(x))).
Also
(( f g ) h)(x) = (` h)(x) = `(h(x)) = f (g (h(x))).
So f (g h) = ( f g ) h.
b a = a,
c b = b,
a a = b, . . . .
You might think that this example is somewhat contrived, and youre absolutely right. But later on well meet more natural composition tables
that arise from studying groups, permutations, etc.
e ((c b) (d a)) = a b c d e.
on any set.
10
Example III.6. Let S = {a, b, c} and let be the binary operation given by
the composition table in Example III.3. Then is not commutative; for
example
a b = c,
b a = a.
It is also not associative; for example
(a b) c = c c = c,
a (b c) = a a = b.
Example III.7. Composition of functions from a set A to itself is associative but not commutative. We know that it is associative from Lemma III.2.
We know that it isnt commutative by Example III.1. When a binary operation is associative bracketing doesnt matter. For example,
(a b) ((c d ) e) = (a (b c)) (d e).
As long as we keep a, b, c, d , e in the same order from left to right, then
the order in which we do the compositions does not matter. Thus there
would be no ambiguity in writing
(a b) ((c d ) e) = a b c d e.
Example III.8. Are there binary operations that are commutative but not
associative? Yes but it isnt easy to come up with natural examples. However it is easy to invent a finite set and a composition table that is commutative but not associative. Let S = {a, b, c}. Let be the binary operation
on S with the following composition table:
a b c
a b c a
b c c a
c a a c
Note that is commutative; you can see this by noting that the table
is symmetric about the diagonal from the top left corner to the bottom
right corner. But it isnt associative. For example,
(b c) a = a a = b,
b (c a) = b a = c.
11
a + b = b + a (addition is commutative)
(a + b) + c = a + (b + c) (addition is associative)
a + 0 = a (0 is the additive identity element)
there is a real number a (the additive inverse of a) such that
a + (a) = 0.
ab = ba (multiplication is commutative)
(ab)c = a(bc) (multiplication is associative)
a(b + c) = ab + ac (multiplication distributes over addition)
a 1 = a (1 is the multiplicative identity element)
if a 6= 0, there is a real number denoted by a 1 (the multiplicative
inverse of a) such that a a 1 = 1.
= (c a d b) + (d a + cb)i .
12
i j = j i = k,
j k = k j = i ,
ki = i k = j.
13
CHAPTER IV
a 11 a 12 a 13 . . . a 1n
a 21 a 22 a 23 . . . a 2n
..
..
..
.. .
.
.
.
.
a m1 a m2 a m3 . . . a mn
Example IV.1. Let
A=
1 2 0
,
1 7 14
3 2
B = 1 8 ,
2
5
3
1
5
C = 6 8 12 .
2
5
0
A, B , C are matrices. The matrix A has size 23 because it has 2 rows and
3 columns. Likewise B has size 3 2 and C has size 3 3.
a 11 a 12 a 13 . . . a 1n
a 21 a 22 a 23 . . . a 2n
A = ..
..
..
.. .
.
.
.
.
a m1 a m2 a m3 . . . a mn
wastes a lot of space. It is convenient to abbreviate this matrix by the
notation A = (a i j )mn . This means that A is a matrix of size m n (i.e. m
rows and n columns) and that we shall refer to the element that lies at the
intersection of the i -th row and j -th column by a i j .
15
16
a 11 a 12 a 13
A=
.
a 21 a 22 a 23
Notice that A has 2 rows and 3 columns. The element a 12 belongs to the
1st row and the 2nd column.
M 22 (R) =
a b
: a, b, c, d R .
c d
A=
2 5
,
2 8
4 3
B = 1 0 ,
1 2
4 2
C = 0 6 .
9 1
4 3
4 2
0 5
B +C = 1 0 + 0 6 = 1 6 .
1 2
9 1
8 3
Likewise A B and A C are undefined, but B C is:
4 3
4 2
8
1
6 .
B C = 1 0 0 6 = 1
1 2
9 1
10 1
Scalar multiplication is always defined. Thus, for example
8 6
6
3
2 5
9 .
A =
,
2 B = 2 0 ,
1.5C = 0
2 8
2 4
13.5 1.5
17
1 2
A=
,
1 3
5 3
B=
.
0 2
15+20
1 3 + 2 2
5 7
AB =
=
.
1 5 + 3 0 1 3 + 3 2
5 3
Likewise
BA=
5 1 + 3 1 5 2 3 3
8 1
=
.
0 1 2 1 0 2 + 2 3
2 6
2 1 3
C=
.
3 4 0
Then
8 7
3
.
AC =
7 13 3
However, C A is not defined because the number of columns of C is not
equal to the number of rows of A.
Remark. If m 6= n, then we cant multiply two matrices in M mn (R). However, matrix multiplication is defined on M nn (R) and the result is again
in M nn (R). In other words, multiplication is a binary operation on M nn (R).
Exercise IV.7. CommutativityWhat can go wrong?
Give a pair of matrices A, B , such that AB is defined but B A isnt.
Give a pair of matrices A, B , such that both AB and B A are defined but they have different sizes.
18
x 0 = ax + b y
y 0 = c x + d y.
x 00 = x 0 + y 0
y 00 = x 0 + y 0 ,
x 00 = (a + c)x + (b + d )y
y 00 = (a + c)x + (b + d )y.
x
a b x
(IV.7)
=
.
y 00
c d y
19
a + c b + d
a b
.
=
a + c b + d
c d
A quick check using the definition of matrix multiplication shows that
this is indeed the case.
IV.4. How to think about matrices?
Let A M 22 (R). In words, A is a 2 2 matrix with real entries. Write
a b
.
A=
c d
For now, think of the elements of R2 as column vectors: any u R2 can be
written as
x
u=
y
with x, y real numbers. Thus were thinking of the elements of R2 as 21matrices. Note that in equation
(IV.2), the matrix A converts the vector
0
x
u to another vector u0 = 0 .
y
Some mathematicians would think that the word converts is not very
mathematical. Instead they would think of the matrix A as defining a
function
T A : R2 R2 ,
T A (u) = Au.
Other (less pedantic) mathematicians would not distinguish between the
matrix and the function it defines. One of the points of the previous section is that if C = B A then TC = TB T A , so that matrix multiplication is
really an instance of composition of functions.
Let us look at some examples of these functions T A .
Example IV.8. Let
0 1
A=
,
1 0
2 0
B=
,
0 1
0 0
C=
.
0 1
20
0
x
. Thus geometrically, TC represents projection onto
=
Also TC
y
y
the y-axis.
Again, if we choose not to distinguish between the matrix and the
function it defines we would say that A represents reflection in the line
y = x, B represents stretching by a factor of 2 in the x-direction, and C
represent projection onto the y-axis.
Now is a good time to revisit the non-commutativity of matrices. Let non-commutativity
us see a geometric example of why matrix multiplication is not commu- seen geometrically
tative. Consider the matrices AB and B A where A, B are the above matrices. Notice (AB )u = A(B u). This means stretch u by a factor of 2 in the
x-direction, then reflect it in the line y = x. And (B A)u = B (Au), which
means reflect u in the line y = x and then stretch by a factor of 2 in the
x-direction. The two are not the same as you can see from Figure IV.1.
Therefore AB 6= B A.
F IGURE IV.1. Non-commutativity of matrix multiplication. The matrix A represents reflection in the line y = x
and the matrix B represents stretching by a factor of 2 in
the x-direction. On the top row we apply B first then A;
the combined effect is represented by AB . On the bottom
we apply A first then B ; the combined effect is represented
by B A. It is obvious from comparing the last picture on the
top row and the last one on the bottom row that AB 6= B A.
21
x
x
2
2
.
=
S :R R ,
S
y +1
y
Geometrically, S translates a vector by 1 unit in the y-direction. Can we
get S from a matrix A? Suppose we can, so S = T A for some matrix A.
What this means is that Su = T A u for all u R2 . But T A u = Au. So Su =
Au. Now let u = 0. We see that
0
,
Au = 0
Su =
1
which contradicts Su = Au. So we cant get S from a matrix, and the reason as youll see in Term 2 is that S is not a linear transformation.
IV.4.1. Why is matrix multiplication associative? At the end of Example IV.8 there was a slight-of-hand that you might have noticed 1. We
assumed that matrix multiplication is associative when we wrote (AB )u =
A(B u); here were thinking of u as a 2 1-matrix. In fact matrix multiplication is associative whenever it is defined.
Theorem IV.9. Let A be an m n matrix, B be an n p matrix and C a
p q matrix, then
(AB )C = A(BC ).
(IV.8)
Dont worry too much if this proof makes you uncomfortable! When
you do Linear Algebra in Term 2 you will see a much more computational
proof, but in my opinion the proof above is the most enlightening one.
For now, you should be pleased if you have digested Example IV.8.
IV.5. Why Column Vectors?
You will have noticed that early on in these notes we were thinking
of the elements of Rn as row vectors. But when we started talking about
matrices as functions, we have taken a preference for column vectors as
opposed to row vectors. Let us see how things are different if we stuck
with row vectors. So for the moment think of elements of Rn , Rm as row
vectors. Let A be an m n matrix.
1well-done if you did notice, and learn to read more critically if you havent
2Here T , T , T respectively are functions Rn Rm , Rp Rn , Rq Rp .
A
B
C
22
maticians. I find most of these bewildering. However, they do have a very good point
in the way they write functions. We said that B A means do A first and then B , because
(B A)u = B (Au). However if you use row vectors then u(B A) = (uB )A, so B A means do B
first then A, which seems entirely natural.
23
a b
,
I2 =
c d
x
a b x
.
=
y
c d y
We want this to be true for all values of x, y, because we want the matrix
I 2 to mean do nothing to all vectors. Multiplying the two matrices on the
right and equating the entries we obtain
ax + b y = x,
c x + d y = y.
1 0
I2 =
0 1
has the effect of do nothing. Lets check algebraically that I 2 is a multiplicative identity for 2 2 matrices. What we want to check is that
(IV.9)
AI 2 = I 2 A = A
A=
.
Now multiplying we find
1 0
1+0 0+1
AI 2 =
=
=
= A.
0 1
1+0 0+1
In exactly the same way, you can do the calculation to show that I 2 A = A,
so weve established (IV.9).
Before moving on to inverses, it is appropriate to ask in which world
does the identity (IV.9) hold? What do I mean by that? Of course A has to
be a 22 matrix, but are its entries real, complex, rational, integral? If you
read the above again, you will notice that weve used properties common
to the number systems R, C, Q, Z. So (IV.9) holds for all matrices A in
M 22 (R), M 22 (C), M 22 (Q), M 22 (Z).
Now what about inverses? Let A be a 2 2 matrix, and let A 1 be its
inverse whatever that means. If A represents a certain geometric operation then A 1 should represent the opposite geometric operation. The
matrix A 1 should undo the effect of A. The product A 1 A, which is the
result of doing A first then A 1 , should now mean do nothing. In other
words, we want A 1 A = I 2 whenever A 1 is the inverse of A. Another way
of saying the same thing is that if v = Au then u = A 1 v.
24
0 1
,
A=
1 0
2 0
,
B=
0 1
0 0
.
C=
0 1
Recall that A represents reflection in the line y = x. If we repeat a reflection then we end up where we started. So we expect that A A = I 2 (or
more economically A 2 = I 2 ). Check this by multiplying. So A is its own
inverse.
The matrix B represents stretching by a factor of 2 in the x-direction.
So its inverse B 1 has to represent stretching by a factor of 1/2 (or shrinking by a factor of 2) in the x-direction. We can write
1/2 0
1
B =
.
0 1
Check for yourself that B 1 B = I 2 . Also note that
x/2
1 x
B
=
y
y
which does what we want: B 1 really is the inverse of B .
Finally recall that C represents projection onto the y-axis. Is there
such as thing as unprojecting from the y-axis? Note that
1
0
2
0
3
0
4
0
C
=
,
C
=
,
C
=
,
C
=
,....
0
0
0
0
0
0
0
0
Lets assume that C has an inverse and call it C 1 . One of the things we
want is for v = C u to imply u = C 1 v. In other words, C 1 is the opposite
of C . If there was such an inverse C 1 then
1
2
3
4
1 0
1 0
1 0
1 0
C
=
,
C
=
,
C
=
,
C
=
,....
0
0
0
0
0
0
0
0
This is clearly absurd! Therefore, C is not invertible 1. For a more graphic
illustration of this fact, see Figure IV.2.
The matrix C is non-zero, but it still doesnt have an inverse. This
might come as a shock if you havent seen matrix inverses before. So lets
check it in a different way. Write
a b
1
C =
.
c d
1In Foundations, one of the things youll learn (or have already done) is that a func-
25
L
S1
S2
S3
x
a b
C=
c d
1 0
a 0
=
.
0 0
c 0
a b
A=
.
c d
Suppose that A is invertible and write
x y
1
A =
.
z w
We want to express A 1 in terms of A, or more precisely, x, y, z, w in
terms of a, b, c, d . We want
x y a b
1 0
=
.
z w c d
0 1
Multiplying and equating entries we arrive at four equations:
(IV.10)
ax + c y = 1
(IV.11)
bx + d y = 0
(IV.12)
az + c w = 0
(IV.13)
bz + d w = 1.
26
d
,
ad bc
y=
b
,
ad bc
z=
c
,
ad bc
w=
a
.
ad bc
Thus
A
1
d b
=
.
ad bc c a
a b
Theorem IV.11. A matrix A =
is invertible if and only if ad bc 6=
c d
0. If so, the inverse is
1
d b
1
A =
.
ad bc c a
Definition. Let A be a 2 2 matrix and write
a b
A=
.
c d
We define the determinant of A, written det(A) to be
det(A) = ad bc.
Another common notation for the determinant of the matrix A is the following
a b
c d = ad bc.
From Theorem IV.11 we know that a 2 2 matrix A is invertible if and
only if det(A) 6= 0.
Theorem IV.12. (Properties of Determinants) Let A, B be 2 2 matrices.
(a) det(I 2 ) = 1.
(b) det(AB ) = det(A) det(B ).
(c) If A is invertible then det(A) 6= 0 and det(A 1 ) =
1
.
det(A)
27
P ROOF. The proof is mostly left as an exercise for the reader. Parts (a), (b)
follow from the definition and effortless calculations (make sure you do
them). For (c) note that
det(A 1 A) = det(I 2 ) = 1.
Now applying (ii) we have 1 = det(A 1 ) det(A). We see that det(A) 6= 0 and
det(A 1 ) = 1/ det(A).
Exercise IV.13. (The Geometric Meaning of Determinant) You might be
wondering (in fact should be wondering) about the geometric meaning
of the determinant. This exercise answers your question. Let A be a 2 2
matrix and write
a b
A=
.
c d
a
b
Let u =
and v =
; in other words, u and v are the columns of A.
c
d
Show that |det(A)| is the area of the parallelogram with adjacent sides u
and v (See Figure IV.3). This tells you the meaning of |det(A)|, but what
about the sign of det(A)? What does it mean geometrically? Write down
and sketch a few examples and see if you can make a guess. Can you
prove your guess?
a b
let A be the matrix with columns u and v; i.e. A =
. Show (alc d
gebraically) that det(A) = 0 if and only if u, v are parallel. Explain this
geometrically.
28
IV.7. Rotations
We saw above some examples of transformations in the plane: reflection, stretching, projection. In this
section we take a closer look at rotax
be a point in R2 . Suppose that this
tions about the origin. Let P =
y
point is rotated anticlockwise about the origin
through an angle of . We
0
x
want to write down the new point P 0 = 0 in terms of x, y and . The
y
easiest way to do this to use polar coordinates. Let the distance of P from
the origin O be r and let the angle OP makes with the positive x-axis be
; in other words the polar coordinates for P are (r, ). Thus
x = r cos ,
y = r sin .
y 0 = r sin( + ).
y 0 = x sin + y cos ,
cos sin
R =
.
sin cos
Exercise IV.15. You know that R represents anticlockwise rotation about
the origin through angle . Describe in words the transformation associated to R . (Warning: dont be rash!)
cos 4
sin 4
sin 4
cos 4
x
=
y
CHAPTER V
Groups
V.1. The Definition of a Group
A group is a pair (G, ) where G is a set and is a binary operation on
G, such that the following four properties hold 1
(i) (closure) for all a, b G, a b G;
(ii) (associativity) for all a, b, c G,
a (b c) = (a b) c;
(iii) (existence of the identity element) there is an element e G such
that for all a G,
a e = e a = a;
(iv) (existence of inverses) for every a G, there is an element b G
(called the inverse of a) such that
a b = b a = e.
Remark. If is a binary operation then (i) automatically holds. So why
did I list (i) in the definition? Ive put it in for good measure! When you
suspect an operation gives you a group the first thing you should check is
that the operation is really a binary operation.
V.2. First Examples (and Non-Examples)
Example V.1. (R, +) is a group. We know already that addition is a binary
operation on R, so closure holds. We know addition of real numbers
is associative. What is the identity element? We want an element e R
so that a + e = e + a = a for all a R. It is clear that e = 0 works and
is the only possible choice. Moreover, the (additive) inverse of a is a:
a + (a) = (a) + a = 0.
if you wish. I call them the defining properties of a group since I think that the word
axiom should be reserved for statements of universal truth. An example of an axiom is:
a + b = b + a for any two integers a, b.
29
30
V. GROUPS
Example V.5. (R2 , +) is a group. Lets prove this. Were allowed to assume
the usual properties of the real numbers (see Section III.8). The elements
of R2 are pairs (a 1 , a 2 ) where a 1 , a 2 are real numbers. Addition is defined
by
(a 1 , a 2 ) + (b 1 , b 2 ) = (a 1 + b 1 , a 2 + b 2 ).
Note that the entries a 1 + b 1 and a 2 + b 2 are real numbers, and so (a 1 +
b 1 , a 2 + b 2 ) is a pair of real numbers. Hence (a 1 + b 2 , a 2 + b 2 ) is in R2 . In
other words, R2 is closed under addition, which shows that (R2 , +) satisfies condition (i). Next we want to prove associativity of addition. Consider a, b, c in R2 . We can write
a = (a 1 , a 2 ),
b = (b 1 , b 2 ),
c = (c 1 , c 2 ).
(a 2 + b 2 ) + c 2 = a 2 + (b 2 + c 2 ).
Hence
(a + b) + c = a + (b + c).
This shows that (R , +) satisfies (ii).
2
31
Example V.9. All the groups we have met so far are infinite. Here is an
example of a finite group. Let A = {+1, 1}. Then (A, ) is a group (where
of course is multiplication).
p
Example V.10. Let B = {1, i , 1, i }, where i = 1. Then (B, ) is another
example of a finite group.
Example V.11. Let C = {1, i }. Then (C , ) is not a group since it isnt closed;
for example i i = 1 C .
Are there any non-abelian groups? There are many, but perhaps not
ones that youre used to thinking about. In the next section we give an
example of a non-abelian group.
32
V. GROUPS
33
2
2
3
O
0
1
2
3
0
1
2
3
0
0
1
2
3
0
1
2
3
1
1
2
3
0
3
0
1
2
2
2
3
0
1
2
3
0
1
3
3
0
1
2
1
2
3
0
0
0
1
2
3
0
1
2
3
1
1
2
3
0
3
0
1
2
2
2
3
0
1
2
3
0
1
3
3
0
1
2
1
2
3
0
34
V. GROUPS
It is not worth your while to check every entry in the table, but make
sure you check four or five entries at random to get an idea of how to
compose symmetries, and let me know if Ive made any mistakes!
I want to convince you that (D 4 , ) is a group. The first thing we should
ask about is closure. This is clear from the table; when you compose two
elements of D 4 you get an element of D 4 . Lets skip associativity for now
and talk about the existence of the identity element. For this we dont
need the table. It is clear that 0 (=do nothing) is an identity element.
It is also (geometrically) clear that every element has an inverse which
does belong to D 4 . If you reflect twice in the same line you end up where
you started, so i i = 0 ; in other words, i is its own inverse for i =
0, 1, 2, 3. The inverse of an anticlockwise rotation around O by 90 is an
anticlockwise rotation around O by 270 . We find that the inverses of 0 ,
1 , 2 and 3 respectively are 0 , 3 , 2 and 1 .
Whats left is to prove associativity. So we have to prove that ( f g )h =
f (g h) for all f , g , h D 4 . But there are 512 triples f , g , h, so thats a lot
of checking. Is there a clever way? Yes there is, and it relies on thinking
about the elements of D 4 as functions 1. Remember that we labelled the
vertices of the square with 1, 2, 3, 4 as in Figure V.1. Now 1 takes vertex
1 to where vertex 2 was and vertex 2 to where vertex 3 was and vertex 3 to
where vertex 4 was and vertex 4 to where vertex 1 was. We can think of 1
as a function from {1, 2, 3, 4} to itself as in Figure V.2.
1
F IGURE V.2. 1 (left) and 0 (right) considered as functions from {1, 2, 3, 4} to itself.
In fact, this way every element of D 4 can be thought of as a function a key trick
from {1, 2, 3, 4} to itself. If we think like this, simply becomes composition of functions from D 4 to itself. We know that composition of functions is associative by Lemma III.2. Thus the binary operation on D 4 is
1Youve seen this idea before: in the proof of Theorem IV.9 we showed that matrix
multiplication is associative by thinking of matrices as functions and matrix multiplication as composition of functions.
35
associative. We have now checked all the conditions (i)(iv) for a group,
so (D 4 , ) is a group!
Remarks.
Moral
non-abelian group
1 0 = 1 .
36
V. GROUPS
closed under composition. So (S, ) is not a group. Are there any other
subgroups inside (D 4 , ) besides (R, )? Yes. See Figure V.3 for a complete
list.
(D 4 , )
({ 0 , 2 , 0 , 2 }, )
({ 0 , 0 }, )
({ 0 , 1 , 2 , 3 }, )
({ 0 , 2 }, )
({ 0 , 2 }, )
({ 0 , 2 , 1 , 3 }, )
({ 0 , 1 }, )
({ 0 , 3 }, )
({ 0 }, )
F IGURE V.3. The figure shows the subgroups of (D 4 , ) and
how they fit inside each other.
Again, check that a couple of these are subgroups. Dont waste time
checking there arent other subgroups of (D 4 , ); when you know a lot
more about groups and subgroups you can come back to this question,
but even then it will still be a little tedious!
Exercise V.13. In this exercise you will write out the composition table for
the group D 3 which is the group of symmetries of an equilateral triangle.
Sketch an equilateral triangle and label the vertices 1, 2, 3 in anticlockwise order. Label the centre of the triangle with O. Let 0 , 1 , 2 denote
anticlockwise rotations about O through angles 0, 2/3 and 4/3. Let 1 ,
2 , 3 denote reflections about the lines respectively joining vertices 1, 2,
3 to O. Let
D 3 = { 0 , 1 , 2 , 1 , 2 , 3 }.
Write down a composition table for D 3 and explain why it is a group 1. Is
it abelian? It has six subgroups; write them down.
Exercise V.14. Write down the symmetries of a triangle that is isoceles
but not equilateral and a composition table for them. Do they form a
group?
sides. These are called the dihedral groups. Some mathematicians denote D n by D 2n
because it has 2n elements. Mysteriously, they dont denote S n by S n! .
CHAPTER VI
First Theorems
Our first two theorems deal with subconscious assumptions. One of
the defining properties of a group is the existence of the identity element
(property (iii)). The word the contains a hidden assumption; how do we
know there is only one identity element? Shouldnt we be talking about
the existence of an identity element?
Theorem VI.1. Let (G, ) be a group. Then (G, ) has a unique identity
element.
P ROOF. Suppose that e and e 0 are identity elements. Thus, for all a G
we have
(VI.15)
a e = e a = a,
and
(VI.16)
a e 0 = e 0 a = a.
a c = c a = e,
37
38
= b (a c)
= (b a) c
= e c
=c
Thus b = c. Since any two inverses of a must be equal, we see that the
inverse of a is unique.
VI.1. Getting Relaxed about Notation
It is quite tedious to keep writing for the group operation. If (G, ) is
a group and a, b G, we shall write ab for a b, unless there is reason to
fear confusion. For example if (G, ) = (R, +) then it is stupid to write ab
for a + b because the usual meaning for ab is a b. But it is OK most of
the time, and when it is OK we will do it. Moreover, we shall often say let
G be a group, without giving an explicit name to the binary operation.
When we talk of the groups R, R2 , R[x], R , etc. we shall mean the groups
(R, +), (R2 , +), (R[x], +), (R , ), etc.
If G is a group, and were writing ab for a b, then it makes sense to
use 1 to denote the identity element instead of e. We write a 1 for the
(unique) inverse of a. Now
aa 1 = a 1 a = 1,
which looks familiar. Moreover, if n is a positive integer we shall write
a n = |aa{z
a} .
n times
n 1
a (a a) = c.
Lets get back to groups. Its the associativity which makes it OK for us
to write a n , and you can convince yourself quickly that
(a m )n = a mn ,
a m a n = a m+n .
You should realize that all this is happening inside the group G that contains the element a. In particular, a n G for all n Z. How do we know
this? For a start, G is closed under composition, so because a G, so
is a 2 = a a. Now that we know that a and a 2 are in G, we know that
39
an algebraic booby
What about (ab)n = a n b n ? Does this identity hold too? Let us think
trap about this with n = 2. Now in the old notation 1
(ab)2 = a b a b
and
a 2 b 2 = a a b b.
Do these have to be the same? No, because the order of the middle two is
different and since were not assuming that our group is abelian we have
no right to assume that b a = a b.
Example VI.4. In D 4 you can check that
21 20 = 2 ,
( 1 0 )2 = 0 ,
and so 21 20 6= ( 1 0 )2 .
a m a n = a m+n .
by associativity
1b
= 1,
1Perhaps you think that I should write (ab)2 = (a b) (a b), but because of asso-
ciativity the bracketing does not matter. Whichever bracketing you apply to a b a b
you will get the same result.
40
Never write a/b unless the group is abelian. This notation is ambigu- pitfall
ous; does a/b mean b 1 a or ab 1 ? The two arent the same in the nonabelian world.
Exercise VI.7. Use D 4 to give counterexamples to the following:
b 1 a = ab 1 ,
(ab)1 = a 1 b 1 ,
a 1 ba = b.
Exercise VI.8. Let G be a group satisfying a 2 = 1 for all a in G. Show that
G is abelian.
VI.2. Additive Notation
For some groups the binary operation is addition (whatever that means).
These include (R, +), (Z, +), (R[x], +), (R2 , +) etc. An important convention is that additive notation is only ever used for abelian groups. A multiplicative group can be abelian, such as (R , ), and can be non-abelian,
such as (D 4 , ).
You need to rephrase statements appropriately when using additive
notation. For example, instead of speaking of
a n = |aa{z
a},
n times
ma + na = (m + n)a.
CHAPTER VII
a b
GL2 (R) =
: a, b, c, d R and ad bc 6= 0 .
c d
Recall that ad bc is the determinant of the 2 2-matrix ac db , and the
matrix is invertible if and only if this determinant is non-zero (Theorem IV.11). So GL2 (R) contains all the invertible 2 2 matrices (with real
entries) and none of the non-invertible ones.
Theorem VII.1. GL2 (R) is group under multiplication of matrices.
We call GL2 (R) the general linear group.
P ROOF. The first thing to check is that GL2 (R) is closed under multiplication. If A and B are in GL2 (R) then AB is a 2 2 matrix with real entries.
Also, we know that det(AB ) = det(A) det(B ) (by Theorem IV.12). Because
A and B have non-zero determinants, so does AB . So AB is in GL2 (R).
41
42
a b
: a, b, c, d Z and ad bc 6= 0
c d
is not a group with respect to multiplication.
It turns out that there is a natural definition for a group GL2 (Z). Well
return to this in Example XV.24.
VII.2. Congruence Classes
Let m 2 be an integer. By Z/mZ we mean the set of congruence
classes modulo m. In Foundations this is denoted by Z/m and in most
algebra textbooks by Zm . Our notation is the least economical, but also Zm vs. Z/mZ
the least arbitrary. I have an excellent reason for writing Z/mZ instead of
Z/m and Zm . I want you to get used to the notation of quotient groups
which well cover in Chapter XIII.
If a is an integer, we shall write a for the congruence class of a modulo
m. Thus
a = {. . . , a 3m, a 2m, a m, a, a + m, a + 2m, a + 3m, . . . }.
In otherwords, a consists of all integers congruent to a modulo m. From
Foundations you know that
Z/mZ = {0, 1, 2, . . . , m 1}
and that the classes 0, 1, . . . , m 1 are distinct, so Z/mZ consists of exactly m classes. You know how addition and multiplication is defined on
Z/mZ:
a + b = a + b,
a b = ab.
Example VII.3. The addition and multiplication tables for Z/6Z are in
Table VII.1.
Exercise VII.4. Write down the addition and multiplication tables for Z/4Z
and Z/5Z.
0 1 2 3 4 5
0 1 2 3 4 5
0 1 2 3 4 5
0 0 0 0 0 0
1 2 3 4 5 0
0 1 2 3 4 5
2 3 4 5 0 1
0 2 4 0 2 4
3 4 5 0 1 2
0 3 0 3 0 3
4 5 0 1 2 3
0 4 2 0 4 2
5 0 1 2 3 4
0 5 4 3 2 1
43
addition in Z is associative
= a +b +c
= a + (b + c).
Thus addition in Z/mZ is associative. Obviously 0 is the additive identity.
What about the additive inverse? Note that a + a = 0 so every class has
an additive inverse 1.
Thus (Z/mZ, +) is a group. We leave the proof that it is abelian as an
easy exercise.
1Perhaps you prefer the inverse of a where 0 0 < m to be of the form b where b
also satisfies 0 b < m. In this case, if 0 < a < m, the observe that 0 < m a < m, and
a + m a = 0, since a + (m a) 0 (mod m). Moreover 0 0 (mod m), thus 0 = 0.
CHAPTER VIII
Example VIII.3. When you saw the equation a n = 1 in the above example, Im sure you immediately remembered the n-th roots of unity! The
n-th roots of unity dont all live in R; they live in C. In fact, they live in C .
For concreteness we take n = 3. You will know from Foundations that
there are three cube roots of unity. These are 1, , 2 , where = e 2i /3 . See
Figure VIII.1. Let us think of these inside the group C . Then and 2
45
46
(2 )2 = 4 = 3 = ,
(2 )3 = (3 )2 = 12 = 1.
Exercise VIII.4. Write down and sketch the sixth roots of unity. What are
their orders? Repeat with the eighth roots of unity.
Exercise VIII.5. C has plenty of elements of infinite order. Write down a
few.
Exercise VIII.6. Let G = GL2 (R). Show that
0 1
1 1
A=
,
B=
1 0
0 1
belong to G. Determine their orders.
Whilst reading the above examples and working out your own, the following observations will have dawned on you (given here in multiplicative notation).
Lemma VIII.7. Let G be a group and g be an element of G.
(i) g has order 1 if and only if g is the identity element.
(ii) Let m be a non-zero integer. Then g m = 1 if and only if g has
finite order d with d | m.
47
q, r Z and 0 r < d .
Youre probably wondering if in every additive group, the identity element 0 is the only one of finite order. The following example shows that
this isnt true.
Example VIII.10. Observe that in (Z/mZ, +), every element a has finite
order. Indeed, ma 0 (mod m) and so ma = 0. This does not mean
that every element has order m, since the order of a is defined to be
the least positive integer n such that na = 0. However, we do know by
Lemma VIII.7 that the order n is a divisor of m.
Let us look at the elements of (Z/6Z, +) and determine their orders.
We quickly find that 0 has order 1 (as usual); 1 and 5 have order 6; 2 and 4
have order 3; and 3 has order 2.
Exercise VIII.11. Find the orders of the elements of (Z/4Z, +) and (Z/5Z, +).
48
CHAPTER IX
Subgroups
Excruciating pain
It will be a long time before you come to appreciate and enjoy groups.
precedes orgasmic Abstract algebra goes from being mind-numbingly boring to being an acpleasure! quired taste and then an exhilarating experience and finallyif youre not
carefula hopeless addiction. Were still in the mind-numbingly boring
part of the journey; you should see this part as an initiation rite that cant
be skipped.
IX.1. What Were They Again?
We met subgroups in the last chapter when we discussed the group
D 4 . Let us write down the formal definition.
Definition. Let (G, ) be a group. Let H be a subset of G and suppose that
(H , ) is also a group. Then we say that H is a subgroup of G (or more
formally (H , ) is a subgroup of (G, )).
For H to be a subgroup of G, we want H to a group with respect to the
same binary operation that makes G a group.
a trivial but Example IX.1. R is a subset of R and both are groups. But R is not a
important example subgroup of R, since the operation that makes R a group is multiplication and the operation that makes R a group is addition.
Example IX.3. R is a subgroup of R[x] since any real number can be viewed
as a polynomial of degree 0.
Example IX.4. (;, +) is not a subgroup of (R, +), simply because (;, +)
is not a group; a group has to be non-empty since it has to contain an
identity element.
50
IX. SUBGROUPS
Example IX.6. Lets take G = R and H the subset of positive real numbers:
H = {a R : a > 0}.
Lets show that H is a subgroup of G. First, 1 is positive, so 1 H . Hence
condition (a) is satisfied.
To check (b), suppose that a, b are in H . Thus a and b are positive,
and so their product ab is also positive. Hence ab H and we know that
(b) is satisfied.
Finally, we want to check condition (b). Suppose a is an element of
H . Then a is positive, and so a 1 is positive. Hence a 1 is also an element
of H . It follows that condition (c) is satisfied.
By Theorem IX.5, H is a subgroup of R .
51
y
V
V 0 = {(a, a) : a R}.
union of subgroups You know from Example IX.9 that V is a subgroup of R2 (and is the line
not (always) y = x). It is just as easy to show that V 0 (which happens to be the line y =
subgroup x) is also a subgroup of R2 . What about their union U = V V 0 ? You can
check that U satisfies conditions (a) and (c) of Theorem IX.5. However,
52
IX. SUBGROUPS
(1, 1)
x
(1, 1)
F IGURE IX.2. The ray W = {(a, a) : a R, a 0} is not a
subgroup of R2 . It contains the identity element (0, 0) and
is closed under addition. The problem is with the existence of additive inverses; e.g. (1, 1) is in W but its inverse
(1, 1) isnt in W .
(1, 1) and (1, 1) are in U but their sum (2, 0) is not in U . So U does not
satisfy (b), and is therefore not a subgroup of R2 . See Figure IX.3.
On the other hand, the intersection V V 0 = {(0, 0)} is a subgroup of
R2 .
y
V
(0, 2)
(1, 1)
V
(1, 1)
x
53
y
C
(a, a3 )
(a, a3 )
Choose and fix a point Q on the plane and let u = OQ be the position
: n (x u) = 0.
Here n is any (non-zero) vector normal to the plane, and u is the position
vector of any point on the plane.
The plane in (IX.17) defines a set
V = {x R3 : n (x u) = 0}.
This is the set of points on the plane. It is a subset of the group R3 . Is
V a subgroup? Of course to be a subgroup it has to contain the identity
element of R3 which is 0. So we can choose u = 0. This doesnt mean that
our original Q was the origin. Were free to choose Q anywhere we like on
, and if goes through the origin then we choose it to be the origin, and
so take u = 0. With this choice, we can simplify V to obtain
V = {x R3 : n x = 0}.
54
IX. SUBGROUPS
xu
O
F IGURE IX.5. The point-normal equation of a plane. Here
n is normal to the plane , Q is a fixed point on and u
is its position vector. If P is any point on with position
vector x, then xu is parallel to the plane, and so n(xu) =
0.
Lets check that this is indeed a subgroup of V . If x1 , x2 V then nxi = 0
so
n (x1 + x2 ) = n x1 + n x2 = 0 + 0 = 0.
Thus x1 + x2 V . Also
n (x1 ) = n x1 = 0 = 0.
Thus x1 V . Hence V is a subgroup of R3 .
Conclusion: a plane defines a subgroup of R3 if and only if it passes
through the origin.
Exercise IX.16. Which lines in R2 define a subgroup? Justify your answer. an important
Example IX.17. Recall that
C = { C : 6= 0}.
Geometrically, C is the whole complex plane minus the origin. We have
observed before that C is a group (where the binary operation is multiplication of complex numbers). Let
S = { C : || = 1}.
The set S is the set of all points in the complex plane with distance 1 from
the origin. Of course this is just the unit circle (the circle centred at the
origin with radius 1) as in Figure IX.6. Let us check that S is a subgroup of
C ; it is clearly a subset. Of course the unit element of C is 1 and |1| = 1
so 1 S, which proves (a). Suppose , S. Then || = 1 and || = 1.
geometric exercise
55
i
F IGURE IX.6. On the left, the group S which is just the unit
circle. On the right, the subgroup of the fourth roots of
unity.
From the properties of the absolute value 1 we have
|| = |||| = 1.
Thus S. This proves (b).
To check (c), suppose S, so that || = 1. Then, again from the
properties of the absolute value,
|1 | =
1
= 1,
||
56
IX. SUBGROUPS
(i)
(ii)
(iii)
(iv)
(v)
(vi)
(vii)
(viii)
(ix)
(x)
G = R, H = R .
G = R , H = {1, 1}.
G = C, H = 2Z.
G = C, H = {a + ai : a R}.
G = C , H = { C : 3 = 1}.
G = Z, H = Z/2Z.
G = R[x], H = Z[x].
G = R[x], H = { f R[x] : f (0) = 0}.
G = R[x], H = { f R[x] : f (0) = 1}.
G = Z/10Z, H = {0, 5}.
Exercise IX.19. Show that 1 the subgroups of Z/4Z are {0}, {0, 2} and Z/4Z.
Exercise IX.20. Show that the only subgroups of Z/3Z are {0} and Z/3Z.
Exercise IX.21. Let
D = { C : || 1}.
Sketch D. Show that D is not a subgroup of C .
Exercise IX.22. Let r be a positive real number. Let
Sr = { C : || = r }.
What does Sr represent geometrically? For what values of r will Sr be a
subgroup of C ?
P ROOF OF T HEOREM IX.5. The theorem has an if and only if statement.
It usual when proving an if and only if statement to break it up into if and only if
an if part, and an only if part, and prove each part separately. This
is what we will do here. The if part says: if H is a subset of G that
satisfies (a),(b),(c) then it is a subgroup of G. The only if part says: if
H is a subgroup of G then H satisfies (a), (b), (c).
Let us do the if part of the proof first. We have a group G and a
subset H of G. All we have been told is that H satisfies conditions (a), (b),
(c) in the statement of the theorem. We want to show that H is a group,
where the binary operation on H is the same as the binary operation on
G. This means that we have to show that H satisfies properties (i), (ii),
(iii), (iv) in the definition of a group.
Property (i) is closure: we want that if a, b H then ab H . But this
is what (b) is saying. So (i) is satisfied.
Property (ii) is associativity. We want to show that for all a, b, c H ,
we have (ab)c = a(bc). But if a, b, c are elements of H then they are also
elements of G. We know that associativity holds in G: (ab)c = a(bc). So
(ii) holds 2.
1When answering a maths question, you should always be careful about what is
being asked. Here youre being asked to show two things. The first is that the three listed
sets are indeed subgroups. The second is that there arent any other subgroups.
2There is a subtle point here that is camouflaged by our notation, and that is that
the binary operation were using on H is precisely the same one as the binary operation
57
58
IX. SUBGROUPS
cos sin
R =
sin cos
represents anticlockwise rotation about the origin through an angle . It
is geometrically clear that if compose two rotations about the origin we
obtain a rotation about the origin. So it is natural to expect that rotations
form a subgroup of GL2 (R), and indeed this is the case. We define
SO2 (R) = {R : R}.
This is called the special orthogonal group.
Theorem IX.27. SO2 (R) is a subgroup of GL2 (R).
P ROOF. First we have to check that SO2 (R) is a subset of GL2 (R). In other
words, we want to check that every matrix R has non-zero determinant.
Note det(R ) = cos2 + sin2 = 1. Hence SO2 (R) is contained in GL2 (R).
Also 4 I 2 = R 0 , so SO2 (R) contains the identity element of GL2 (R).
Next we have to show that SO2 (R) is closed under multiplication. Consider two elements of SO2 (R) and call them R and R . Now R and R
represent anticlockwise rotation about the origin through angles and
1Recall, the easiest way to show that a set is a group is to show that it is a subgroup
59
cos( + ) sin( + )
=
sin( + ) cos( + )
= R + .
Thus SO2 (R) is closed under multiplication.
Finally we must check that the inverse of every matrix in SO2 (R) is
again in SO2 (R). Geometrically, its easy to see that the inverse of R is
R ; Ill leave it to you to check this algebraically. This completes the
proof.
Remark. It is clear (at least geometrically) that R R = R R . Thus SO2 (R)
is an abelian subgroup of the non-abelian group GL2 (R). We saw this
phenomenon before: the group D 4 is non-abelian, but its subgroup of
rotations is abelian.
IX.5. Differential Equations
Let C be the set of infinitely differentiable real functions. This probably sounds scary, but to reassure you Ill just point out that C contains all
polynomials, as well as sin t , cos t , e t , e t . It is a fact that C is an additive
group. Dont worry about the proof; it depends on properties of differentiability that youll see eventually in analysis. Addition in C is done in a
common sense way. For example, if f (t ) = t 2 + sin(t ) and g (t ) = 2t 2 e t
then f (t ) + g (t ) = 3t 2 + sin(t ) e t . The identity element is 0.
Lets dive straight into an example. We define the following subset
dx
H = x(t ) C : t
2x = 0 .
dt
In other words, H is the set of infinitely differentiable functions x(t ) that
satisfy the differential equation
t
(IX.18)
dx
2x = 0.
dt
d x1
2x 1 = 0,
dt
d x2
2x 2 = 0.
dt
60
IX. SUBGROUPS
d x1 d x2
dx
t
2x = t
+
2(x 1 + x 2 )
dt
dt
dt
d x2
d x1
2x 1 + t
2x 2
=t
dt
dt
= 0.
Therefore x(t ) H . Similarly, using the properties of differentiation, you
can show that if x 1 (t ) H then x 1 (t ) H (easy exercise). So H is a subgroup of C .
Note that we didnt have to solve the differential equation to know
that its set of solutions is a group; we merely used the properties of differentiation. But in fact it is easy to solve this particular equation using
separation of variables. If you do that (try it) youll find that
H = {at 2 : a R}.
Now check again that H forms an additive group.
Exercise IX.28. Which of the following differential equations define subgroups of C ?
(i) t
(ii)
dx
2x = t 3 .
dt
d2 x
2
dx
+ 6x = 0.
dt
dt
dx
(iii)
x 2 = 0.
dt
61
Example IX.31. Consider the group U4 which is the group of fourth roots
of unity. Thus U4 = {1, i , 1, i }; of course the binary operation is multiplication. The trivial subgroup is {1}. We note that U2 = {1, 1} is a nontrivial proper subgroup. Are there any others? Suppose H is another nontrivial proper subgroup of U4 . Then 1 H , as subgroups always contain
the identity element. Since H is non-trivial, and H 6= {1, 1}, it must contain either i or i . Suppose H contains i . Then H contains i 2 = 1 and
i 3 = i . Therefore H = U4 , which contradicts the assumption that H is
proper. Similarly if H contains i then H = U4 (check). Therefore the
only non-trivial proper subgroup of U4 is U2 = {1, 1}.
Exercise IX.32. For what values of m does Z/mZ have non-trivial proper
subgroups? Try out a few examples and see if you can make a conjecture.
Can you prove your conjecture?
IX.7. Lagranges TheoremVersion 2
Here is another version of Lagranges Theorem. The relation between
this version and the earlier one (Theorem VIII.12) will be explained once
we have studied cyclic groups.
Theorem IX.33. (Lagranges TheoremVersion 2) Let G be a finite group,
and H a subgroup of G. Then the order of H divides the order of G.
Example IX.34. We saw in Example IX.31 that U4 , the group of 4-th roots
of unity, contains U2 , the group of square-roots of unity. Now U2 has order 2, U4 has order 4. Lagranges Theorem tells that the order of U2 must
divide the order of U4 which is correct.
Example IX.35. Recall that D 4 has order 8. In Figure V.3 we listed the ten
subgroups of D 4 . These have orders 1, 2, 4 and 8. This is consistent with
Lagranges Theorem.
CHAPTER X
63
64
{1}
{1, , 2 , 3 , 4 , 5 }
2 {1, 2 , 4 }
3 {1, 3 }
4 {1, 2 , 4 }
5 {1, , 2 , 3 , 4 , 5 }
TABLE X.1. The six elements of U6 and the cyclic subgroups they generate.
Example X.3. For each element of the group Z/mZ, we write down the
cyclic group it generates. Note that since Z/mZ is an additive group, the
subgroup generated by g is g = {ng : n Z}. That is, it is the set of
multiples of g rather than the set of powers of g . See Table X.2.
a a
0 {0}
1 {0, 1, 2, 3, 4, 5}
2 {0, 2, 4}
3 {0, 3}
4 {0, 2, 4}
5 {0, 1, 2, 3, 4, 5}
TABLE X.2. The six elements of Z/6Z and the cyclic subgroups that they generate.
65
ideas in algebra. Here we simply divided m by n; the integers q, r are respectively the quotient and the remainder. Thus
h = g m = g qn+r = (g n )q g r .
However, g n = 1 since g has order n. So h = g r . Since 0 r < n, we
see that r is one of 0, 1, . . . , n 1. Therefore h is in {1, g , . . . , g n1 }. By our
principle, we see that g = {1, g , . . . , g n1 }.
Exercise X.7. In each of the following groups G, write down the cyclic
subgroup generated by g .
(a) G = S, g = exp(2i /7).
(b) G = Z/12Z, g = 8.
0 1
(c) G = GL2 (R), g = 1
0 .
66
Exercise X.8. Which of the following groups G are cyclic? Justify your
answer for each, and if G is cyclic then write down a generator.
(a) G = kZ (where k is a non-zero integer).
(b) G = Z/mZ (where m is a positive integer).
(c) D 3 .
Exercise X.9. In this exercise, you will show using contradiction that R
is not cyclic. Suppose that it is cyclic and let g R be a generator. Then
R = g . In particular, |g |1/2 R and so |g |1/2 = g m for some integer
m. Show that the only solutions to this equation are g = 1. Wheres the
contradiction?
Exercise X.10. In this exercise youll show that Q is not cyclic. Let a, b be
integers with b 6= 0. Let p be a prime that does not divide b. Show that
1/p cannot be written in the form na/b with n an integer. Deduce that Q
is not cyclic.
Exercise X.11. Show that S is not cyclic.
X.2. SUBGROUPS OF Z
67
X.2. Subgroups of Z
Trust me, Im a
Im feeling particularly inarticulate at the moment, so I cant explain
doctor. why subgroups of Z are important. But nevertheless they are important
and so well do them.
The first thing to note about Z is that it is cyclic. Does that mean
that all elements of Z are powers of some element. No, because it is an
additive group. If G is an additive group, and g is an element of G then
g = {ng : n Z} = {. . . , 2g , g , 0, g , 2g , 3g , . . . }.
Thus Z = 1 and so it is cyclic. In fact, it is infinite and cyclic, unlike for
example, Un .
Lemma X.16. Let k be an integer. Write
kZ = {ka : a Z}.
Then kZ is a subgroup of Z.
P ROOF. You can prove this in a similar way to Example IX.7. However, it
is quicker to note that kZ = k, and so is a subgroup by Theorem X.1.
Note that 0Z = {0} has only the identity element. Also
(k)Z = {. . . , 2(k), (k), 0, k, 2(k), . . . }
= {. . . , 2k, k, 0, k, 2k, . . . }
= {. . . , 2k, k, 0, k, 2k}
= kZ
because the order of elements in a set does not matter. In other words,
Z = Z,
2Z = 2Z,
3Z = 3Z, . . . .
Z,
2Z,
3Z,
4Z, . . .
and we want to know if theyre all the subgroups of Z. The following theorem tells us that they are.
Theorem X.17. Any subgroup of Z has the form kZ for some non-negative 1
integer k. In particular, all subgroups of Z are cyclic.
P ROOF. Let H be a subgroup of Z. We want to show that there is a nonnegative integer k such that H = kZ. We divide the proof into two cases.
The first case is when H is the subgroup {0}. Then H = 0Z and weve done
what we wanted.
So lets look at the second case where H has non-zero elements. If a is
a non-zero element of H then because H is a(n additive) group, a is also
a non-zero element of H but it has a different sign. So we know for sure
that H has some positive elements. Let k be the smallest positive element
of H . We will prove that H = kZ.
1The non-negative integers are 0, 1, 2, 3, . . . .
68
Whenever you have two sets, A and B , and you want to prove that Is this familiar?
theyre equal, one way to do it is to show that every element of A belongs
to B and every element of B belongs to A.
As k is in H , we know by Theorem VI.9 that all the multiples of k belong to H . Thus every element of kZ belongs to H . We must show the
converse: every element of H is a multiple of k.
Let a be an element of H . By the division algorithm which you have
met in Foundations, we can write
a = qk + r,
CHAPTER XI
Isomorphisms
You mustve noticed that theres a lot in common between the group
of m-th roots of unity Um , and the group Z/mZ. If not, take another
look Tables X.1 and X.2. In fact the groups Um and Z/mZ are isomorphic.
What does this mean?
Definition. Let (G, ) and (H , ) be groups. We say that the function :
G H is an isomorphism if it is a bijection and it satisfies
(g 1 g 2 ) = (g 1 ) (g 2 )
for all g 1 , g 2 in G. In this case we say that (G, ) and (H , ) are isomorphic.
Isomorphic groups may look different, but in essence are the same.
An isomorphism is a way of relabeling the elements of one group to obtain another group, as the following examples will make clear.
Example XI.1. Define : Z/mZ Um by the simple rule
(a) = a ,
a = 0, 1, . . . , m 1.
cos sin
R =
sin cos
represents anticlockwise rotation around the origin through an angle .
The identity
R + = R R .
turns addition into multiplication, and so it should remind you of the
identity e + = e e . In fact, a more accurate analogy is identity
e i (+) = e i e i .
The reason is because multiplying a complex number by e i rotates it
about the origin anticlockwise through the angle (prove this using the
exponential form for complex numbers).
Now that you know that R and e i are analogues, you will expect that
the groups SO2 (R) and S are isomorphic. Recall that SO2 (R) is the special
69
70
XI. ISOMORPHISMS
(R ) = e i
Indignant?
Paranoid? Seething
with self-righteous
rage?
CHAPTER XII
Cosets
Cosets are what we get when we shift a subgroup by the elements of
the group.
Definition. Let G be a group and H a subgroup. Let g be an element of
G. We call the set
g H = {g h : h H }
a left coset of H in G and the set
H g = {hg : h H }
a right coset of H in G.
Example XII.1. Lets take G to be D 4 and R the subgroup made up of
rotations:
R = {1, 1 , 2 , 3 }.
Revisit Section V.4 to remind yourself of the notation. Lets compute 1 R.
By definition,
1 R = {1 1, 1 1 , 1 2 , 1 3 }
= {1 , 0 , 3 , 2 }
= {0 , 1 , 2 , 3 }.
Lets try another coset.
2 R = { 2 1, 2 1 , 2 2 , 2 3 }
= { 2 , 3 , 1, 1 }
= {1, 1 , 2 , 3 }.
We see that 2 R is equal to R, and 1 R isnt equal to R. In fact, 1 R isnt
even a subgroup of D 4 ; why? You can carry on computing all eight left
cosets, and youll find
1 R = 1 R = 2 R = 3 R = {1, 1 , 2 , 3 }
and
0 R = 1 R = 2 R = 3 R = {0 , 1 , 2 , 3 }.
72
XII. COSETS
73
1.5S
S
0.5S
1
Example XII.7. In Exercise IX.16 I asked you the following question: which
lines in R2 define a subgroup? Lets go back to this question and answer
it again, and this time for lines that define a subgroup we want to determine the cosets too.
One convenient way of a specifying a line L in R2 is as follows. Let Q
be a point on L, with position vector w. Let v be a vector parallel to L.
Then L has the parametric form
L : x = w + t v.
This is a (slightly clumsy) school way of saying things. What it means
is that the points with position vector w + t v are on the line, where t is
any scalar (i.e. real number). A much better way is to just write L in set
notation:
L = {w + t v : t R}.
74
XII. COSETS
v
x
75
Example XII.8. If you did matrices at school, then you will probably know
that a system of m linear equations in n variables can be written as a single matrix equation
Ax = b
(XII.19)
Ax0 = b.
Subtracting we find
A(x x0 ) = 0.
So the difference x x0 belongs to the subgroup K . Thus x belongs to the
coset x0 + K . In fact, the set of solutions to (XII.19) is precisely the coset
x0 + K .
d 2x
dx
+ c = f (t ),
+b
2
dt
dt
d 2x
dx
+ c = 0.
+b
2
dt
dt
76
XII. COSETS
and
{0 , 1 , 2 , 3 }.
So the index [D 4 : R] = 2.
Example XII.12. In Example XII.6, we found that the cosets of the circle
group S in C are the circles centred at the origin. So the index [C : S] =
.
Example XII.13. Now lets look at the index of the trivial group {0} as a
subgroup of Z. Note that
a + {0} = {a}.
So the cosets of {0} in Z are
. . . , {2}, {1}, {0}, {1}, {2}, . . .
Clearly [Z : {0}] = .
Exercise XII.14. Let G be a finite group. Let {1} be the trivial subgroup
consisting only of the identity element. Explain why [G : {1}] = |G|.
XII.4. The First Innermost Secret of Cosets
Apart from the definition, you need to know two facts about cosets.
The first is that a coset of a subgroup has the same size as the subgroup.
Lemma XII.15. Let G be a group and H a finite subgroup. If g G then
g H and H g have the same number of elements as H .
P ROOF. Well just prove the lemma for left cosets. The proof for right
cosets is nearly the same. Let g be an element of G. We want to show
that H and g H has the same number of elements. The sets H and g H are
77
A priceless tip! finite. The best way to show that two finite sets have the same number
of elements is to set up a bijection between them. Let
: H g H,
h 7 g h.
n
w
a
y
,
n
w
a
y
Example XII.16. Now is a good time to revisit the examples at the beginning of the chapter and make sure that Lemma XII.15 holds for them.
XII.5. The Second Innermost Secret of Cosets
Lemma XII.17. Let G be a group and H be a subgroup. Let g 1 , g 2 be elements of G. Then the cosets g 1 H , g 2 H are either equal or disjoint 1.
1Two sets A, B are disjoint if they have no members in common. Another way of
saying the same thing is: two sets A, B are disjoint if A B = ;. Im now confusedhave
I said it in another way, or in the same way but with more notation?
78
XII. COSETS
Example XII.18. Look again at Example XII.6 and in particular Figure XII.1.
There we looked the cosets of the circle subgroup S inside C . We found Geometric Epiphany
that the cosets are the circles centred at the origin of positive radius. It is I
obvious that two such circles are either equal or disjoint.
Example XII.19. In Example XII.7, we saw that a line L in R2 passing Geometric Epiphany
through the origin defines a subgroup. The cosets of L are the lines par- II
allel to it. Again it is clear that two lines parallel to L are either equal or
disjoint.
g 1 = g 2 h 2 h 11 .
by (XII.22)
= g 2 (h 2 h 11 h).
However, h 2 h 11 h is a product of elements of the subgroup H and therefore an element of H . Hence weve written g 1 h in the form g 2 h 0 where
h 0 = h 2 h 11 h is an element of H . Thus every element of g 1 H is again an
element of g 2 H . Similarly, every element of g 2 H is an element of g 1 H .
Hence g 1 H = g 2 H .
XII.6. Lagrange Super-Strength
Ive stated Lagranges Theorem a very long time ago, and kept you
waiting for the proof ever since. Surely you consider this delay a deliberate act of unspeakable cruelty. It was indeed deliberate; I thought the
prolonged wait would heighten the anticipation and make you appreciate and enjoy the proof even more. Alas, through an act of infinite selflessness, Ive sacrificed my popularity to intensify your infatuation with
the subject.
We now state an even stronger version of Lagranges Theorem.
Theorem XII.20. (Lagranges TheoremVersion 3) Let G be a finite group
and H a subgroup. Then
|G| = [G : H ] |H |.
79
CHAPTER XIII
Quotient Groups
Taking quotients is one of the most powerful concepts in mathematics. It should also be one of the least painful to assimilate. Instead of
revelling in quotients, most Warwickers go through three or four miserallow me to end your able years of being terrorised by them. The difficulties with quotients are
suffering purely psychological. To overcome them, you just need to study and visualize a good number of examples. What are we waiting for?
love quotie
ve
q
I lo
ve
uotients! I lo
tie
nts
eq
tients! I lov
ot
! I love qu
ie
n
ts!
I
ts
!
qu
o
uo
tie
nt
s!
Il
ov
ts
ien
ot
qu
I lo
ve
qu
o
s!
nt
e
ti
o
qu
e
v
I lo
82
(mod mZ) if and only if a b (mod m). The concept of congruence modulo subgroups is a generalization of the earlier concept of congruence modulo integers.
Example XIII.2. Z is a subgroup of R. Two real numbers a, b are congruent modulo Z if and only if a b Z. This means that their difference
is an integer. So, for example 1437.14 0.14 (mod Z). It may seem that
congruence modulo Z is a stupid idea. After all, were concentrating on
the small fractional part of number and ignoring the bigger integer part.
However in some situations, the fractional part is the important one. Lets
see one of those situations. In Example IX.17 we defined the circle group
S = { C : || = 1}.
Let
f : R S,
f () = e 2i .
Example XIII.3. Let X = {(a, 0) : a R}. Its easy to show that X is a subgroup of R2 , which is simply the x-axis. What does it mean for two points
to be congruent modulo X ? Suppose (a 1 , b 1 ) and (a 2 , b 2 ) are in R2 . Then
(a 1 , b 1 ) (a 2 , b 2 ) (mod X ) if and only if (a 1 a 2 , b 1 b 2 ) belongs to X . This
happens if and only if b 1 b 2 = 0. So two points are congruent modulo X
if and only if they have the same y-coordinate.
h = b0 + b1 x + + bn x n ,
looks wrong as there is no reason to suppose that g and h have the same degree. But
looks can be misleading. Here were in fact writing g and h as polynomials of degree at
most n. For example, if g = 2 + 7x and h = 4 3x + 2x 3 then we can take n = 3 and let
a 0 = 2, a 1 = 7, a 2 = a 3 = 0, and b 0 = 4, b 1 = 3, b 2 = 0, b 3 = 2
83
Exercise XIII.5. Let (G, +) be an abelian group. We know that {0} and G
are subgroups of G. What does it mean for a and b to congruent modulo
{0}? What does it mean for a and b to be congruent modulo G?
(mod H )}.
a + b = a + b.
84
XIII.3. R/Z
In Example XIII.2 we looked at congruences in R modulo Z. We now
want to understand the group R/Z. Note that every real number is congruent modulo Z to a unique number in the half-open interval
[0, 1) = {x R : 0 x < 1}.
Therefore
R/Z = {a : a [0, 1)}.
So when we add a + b, we take the result of a + b, and and simplify by
subtracting an integer if necessary to obtain c [0, 1), and then letting
a + b = c. For example,
0.7 + 0.2 = 0.9,
f () = e 2i ,
f = e 2i .
f + = f f .
In essence what this is saying is that the two groups (R/Z, +) and (S, ) are
essentially the same. Indeed, theyre isomorphic. Now is a good time to
look again at Chapter XI.
Here is how you should think about R/Z. We identified it with the
interval [0, 1). Think of starting at 0.95 and moving up in small steps of
0.01:
0.95, 0.96, 0.97, 0.98, 0.99, 0.00, 0.01, 0.02, 0.03, . . .
So we should really think of R/Z as the interval [0, 1) with one end joined
to the other. If you take a string and join one end to the other you obtain a loop, or a circle. This is what f is doing. It is showing that R/Z is
isomorphic to the unit circle S. Indeed, the 2 in the formula for f is a
stretching factor, since the interval [0, 1) of length 1 has to be stretched
around the unit circle of perimeter 2.
Exercise XIII.8. R/Z has four elements of order 5. Find them.
Exercise XIII.9. Let [0, 1). Show that has finite order in R/Z if and
only if is rational.
Exercise XIII.10. Show that f takes rational numbers to roots of unity.
XIII.4. R2 /Z2
85
XIII.4. R2 /Z2
After R/Z you shouldnt have any trouble imagining R2 /Z2 . Youre allowed to shift any point in R2 by an integer multiple of i and an integer
multiple of j. So you end up in the unit square:
{(x, y) : 0 x < 1, 0 y < 1}.
(XIII.24)
But you should think of this square as having the top side glued to the
bottom side, and the right side glued to the left side! See Figure XIII.2
(1, 1)
Peb
(0, 1)
bles
Bub
(0, 0)
(1, 0)
F IGURE XIII.2. R2 /Z2 is really just the unit square with the
top side glued to the bottom side, and the right side glued
to the left side.
2y = . . . , 1, 0, 1, 2, 3, . . . .
Therefore,
1
1
3
1
1
3
x = , , 0, , 1, , ,
y = , , 0, , 1, , .
2
2
2
2
2
2
As x and y belong to the interval [0, 1), we see that x = 0 or 1/2 and y = 0
or 1/2. Hence
(x, y) = (0, 0),
(1/2, 0),
(0, 1/2),
(1/2, 1/2).
However, the first of these has order 1. So the elements of order 2 in R2 /Z2
are
(x, y) = (1/2, 0), (0, 1/2), (1/2, 1/2).
bles
86
Exercise XIII.12. Find all elements of orders 3 in R2 /Z2 (there are eight
of them).
Exercise XIII.13. In Z2 we let i = (1, 0) and j = (0, 1) as usual. Write 2Z2 =
{(2a, 2b) : a, b Z}. Convince yourself that 2Z2 is a subgroup of Z2 of
index 4 and that
Z2 /2Z2 = {0, i, j, i + j}.
Write down an addition table for Z2 /2Z2 .
Exercise XIII.14. How would you describe C/Z[i ]? Is it really different
from R2 /Z2 ?
Exercise XIII.15. How would you describe C/Z? Find all elements of order 2.
XIII.5. R/Q
In this section, we shall briefly think about R/Q. In R/Z, we treat the
integers as zero. In R/Q, we treat the rationals as zero. This is a much
trickier quotient group. The trickiness does not come from the definition; there is no difficulty there. We can add in R/Q using the definition (XIII.23). The problem is with simplifying the result. Lets try some
p
numerical examples
so that you see what I mean. If we take a = 1 + 2
p
and b = 2/3 2, then
a + b = a + b = 5/3 = 0,
because 5/3 0 = 5/3 Q. However, if we take a = and b = e (where
these have their usual values) then
a + b = + e.
Can we simplify this? For example, is this equal to 0? It is if and only if
+ e is a rational number. No one knows if the number + e is rational
or not (but we know that both and e are irrational). So we dont know if
the result of the above calculation is equal to 0 or not.
XIII.6. Well-Defined and Proofs
We know that in Z/mZ, not only does addition make sense, but also
multiplication makes sense. In Z/mZ we defined multiplication by
(XIII.25)
a b = ab.
Now, you might ask why we dont define multiplication on R/Z in the
same way? OK, lets try using the same definition for multiplication on
R/Z and see what happens:
0.5 0.5 = 0.25,
There is a problem: in R/Z, the classes 1.5 and 0.5 are equal, but the
classes 0.75 and 0.25 arent. Multiplication doesnt make sense on R/Z.
87
b = b0,
in Z/mZ. Then
ab = a 0 b 0 .
We say that multiplication is well-defined on Z/mZ. This means that
the result of a product does not depend on the choice of representatitives,
even though it defined in terms of those representatives.
P ROOF. As a = a 0 and b = b 0 we know that
a 0 = a + km,
b 0 = b + `m,
(mod mZ),
which means
ab = a 0 b 0 .
Are we sure addition that addition is well-defined in R/Z and the other
quotient groups that weve been working with? The following lemma
checks that.
Lemma XIII.17. Let (G, +) be an additive abelian group and H a subgroup. Let a, a 0 , b, b 0 be elements of G such that in G/H we have
a = a0,
b = b0,
then
a + b = a0 + b0.
P ROOF. Suppose a = a 0 and b = b 0 in G/H . Then aa 0 = h 1 and bb 0 = h 2
where h 1 , h 2 H . Thus
(a + b) (a 0 + b 0 ) = (a a 0 ) + (b b 0 ) = h 1 + h 2 .
As H is a subgroup containing h 1 and h 2 , we know that the sum h 1 + h 2
belongs to H . Thus the classes a + b and a 0 + b 0 are equal.
88
To wrap up this chapter, we need to check one thing: that G/H is indeed a group.
Theorem XIII.18. Let (G, +) be an additive abelian group and H a subgroup. Then (G/H , +) is an abelian group.
P ROOF. All we have to do is check the defining properties for abelian
groups. Ill just check that addition is commutative and leave the rest
to you. Suppose a, b G. Then
b+a =b+a
(XIII.26)
= a +b
b + a = a + b as G is abelian
= a +b
Weve only looked at quotients of abelian additive groups. For general
groups, things are more tricky. At the heart of the trickiness is that in the
non-abelian setting the binary operation on cosets might not be welldefined. For now, if youve got to grips with Z/mZ, R/Z and R2 /Z2 then
youve made an excellent start with quotients.
CHAPTER XIV
Symmetric Groups
The coolest groups have elements that are functions. Matrix groups
are examples, and symmetric groups are other examples. It turns out that
every finite group is a subgroup of one of the symmetric groups. So if we
understand symmetric groups completely, then well understand finite
groups completely. Have I given you hope that youll have a complete
understanding of finite groups by the end of the chapter? Sorry, I was
saying if . . .
XIV.1. Motivation
Let A be a set, and let f , g be functions from A to itself. We know that
we can compose f , g to obtain f g which is also a function from A to
itself. We shall write Map(A) for the set of functions from A to itself. Then
is a binary operation on Map(A). And its natural to ask if this makes
Map(A) into a group. After all, we know by Lemma III.2 that composition
of functions is associative. The following example will help clarify these
ideas.
Example XIV.1. Let A = {1, 2}. You will quickly convince yourself that
there are only four functions from A to itself, which are given in Figure XIV.1.
1
2
f1
f3
f2
f4
90
Thus Map(A) = { f 1 , f 2 , f 3 , f 4 }. Is Map(A) a group with respect to composition of functions? Here is the composition table for Map(A):
f1
f2
f3
f4
f1
f1
f2
f3
f4
f2
f2
f1
f3
f4
f3
f3
f4
f3
f4
f4
f4
f3
f3
f4
Make sure you understand the table. The entry for f i f j is at the intersection of the i -th row and j -th column. As always, f i f j means apply
f j first then f i . We know that composition of functions is associative by
Lemma III.2. Moreover, it is clear from the table that f 1 is the identity element. But f 3 and f 4 dont have inverses; we cant combine either of them
with any of the four functions to obtain the identity f 1 .
But if you look carefully at the table, you will see a group with respect
to composition. It is the subset: { f 1 , f 2 }. If youve been paying attention
in Foundations you will know why f 1 , f 2 have inverses (which in this case
happen to be f 1 and f 2 respectively), and f 3 , f 4 dont: the functions f 1 , f 2
are bijections and f 3 and f 4 are not.
Now is a good time for you to revise Example IV.10. There you saw a
that non-invertible matrix (which represented projection onto the y-axis)
was also a non-bijective function R2 R2 .
b
2
f1
b
c
f3
91
b
2
f2
f4
Remarks.
(i) Instead of saying that a function is injective, mathematicians sometimes say that it is one-to-one (also written 1 1).
(ii) The definition of injective is often given in the contrapositive
form: if f (a 1 ) = f (a 2 ) then a 1 = a 2 . The way weve phrased the
definition is more helpful for this chapter, but you should get
used to both forms.
(iii) Instead of saying that a function is surjective, mathematicians
sometimes say that it is onto. I found this jarring when I first saw
it. But I quickly got used to it.
(iv) A bijection is also called a one-to-one correspondence.
Here is a theorem you have seen in Foundations where it was probably called the pigeon-hole principle.
Theorem XIV.4. Let A be a finite set and let f be a function from A to itself.
Then f is injective if and only if f is surjective.
92
Example XIV.5. Look back at the functions in Figure XIV.1 for a very basic
illustration of Theorem XIV.4.
Example XIV.6. Theorem XIV.4 is true for finite sets only. For infinite sets
it might or might not hold. Let f 1 : N N be given by f 1 (x) = x + 1. Then
f 1 is not surjective since 0 is not in the image. However, f 1 is injective. See
Figure XIV.3. By contrast, let f 2 : Z Z be also given by x 7 x + 1. Then
N
N
0
Z
..
.
Z
..
.
0
1
..
.
..
.
..
.
..
.
f1
f2
The following theorems collect together key results regarding bijections. Again you know all of this from Foundations.
Theorem XIV.7. Let f : A B and g : B C be bijections. Then g f is a
bijection A C .
Definition. Let A be a set. The identity map on A is the map id A : A A
satisfying id A (x) = x for all x A.
Let f : A A be a function on A. We say that f is invertible if there
exists a function g : A A such that f g = g f = id A . If such a g exists
then we call it the inverse of f and denote it by f 1 .
Of course you know from Foundations that if f has an inverse then it
is unique.
Theorem XIV.8. Let f : A A be a function. Then f is invertible if and
only if f is a bijection. If f is invertible then f 1 is also a bijection.
XIV.4. S n
93
Thus f id A = id A f = f holds.
Finally we want every element of Sym(A) to have an inverse in Sym(A).
This is true by Theorem XIV.8.
Example XIV.11. Let f 1 , f 2 be as in Example XIV.6. Note that f 1 Sym(N)
since it is not a bijection. However f 2 Sym(Z). What is f 21 ? It is simply
the function Z Z given by x 7 x 1.
Lets calculate g = f 23 = f 2 f 2 f 2 . Then
g (x) = f 2 ( f 2 ( f 2 (x))) = f 2 ( f 2 (x + 1)) = f 2 (x + 2) = x + 3.
It will be easy for you to show, for any integer n, that f 2n is the function
Z Z satisfying x 7 x + n. In particular, f 2n 6= id A for n 6= 0. Thus f 2 is an
element of infinite order in the group Sym(Z).
94
The elements of S n are called permutations. One way of representing permutations is to use diagrams such as those for f 1 , f 2 S 2 in Figure XIV.1. The following is a more economical way. Let a 1 , a 2 , . . . , a n be
the numbers 1, 2, . . . , n in some order. Then
1 2 n
a1 a2 an
represents the unique permutation in S n that sends 1 to a 1 , 2 to a 2 , . . . ,
and n to a n .
Example XIV.15. S 2 has two elements:
1 2
1 2
,
.
1 2
2 1
These are respectively the same as f 1 , f 2 in Figure XIV.1. The first of these
is the identity element. We noted in Example XIV.9 that S 2 = Sym({1, 2}) is
abelian.
Example XIV.16.
These are
1
1
1
1
2 3
,
2 3
2 3
,
3 2
1
2
1
2
2 3
,
3 1
2 3
,
1 3
1 2 3
3 1 2
1 2 3
.
3 2 1
Again, the first of these is the identity element. It is important that you
know what the notation means and how to multiply two permutations
written in this notation, so lets have some practice. Let
1 2 3
1 2 3
=
,
=
.
3 1 2
1 3 2
Never forget that these are bijections from {1, 2, 3} to itself. To find out
what does, look at the columns. is the function that sends 1 to 3, 2 to
1 and 3 to 2. Thus
(XIV.27)
(1) = 3,
(2) = 1,
(3) = 2.
XIV.4. S n
95
Likewise,
(1) = 1,
(2) = 3,
(3) = 2.
1 2 3
.
=
3 2 1
Similarly,
1 2 3
=
.
2 1 3
Note that 6= , so S 3 is non-abelian. How do we compute 1 ? From
(XIV.27) we find
1 = 1 (3),
2 = 1 (1),
3 = 1 (2).
1 (2) = 3,
1 (3) = 1.
We rearrange this:
1 (1) = 2,
Hence
1 2 3
=
.
2 3 1
1 2 3 4 5
=
,
2 3 5 1 4
1 2 3 4 5
=
.
3 1 2 5 4
Compute 1 , , 2 .
Exercise XIV.19. Show that S n is non-abelian for n 3.
Exercise XIV.20. Recall (page 34) that we interpreted elements of D 4 as
functions from {1, 2, 3, 4} to itself. Go back and check that these are bijections. Thus D 4 is a subgroup of S 4 .
96
XIV.4.1. Whats Special About S n ? We started the chapter by looking at symmetry groups of arbitrary sets A. Then we restricted ourself to
S n = Sym({1, 2, . . . , n}). This is not as big a restriction as it looks. Suppose
the set A is finite, and let n = |A|, the number of elements of A. Then
Sym(A) is isomorphic to S n . One way of seeing this is convince ourselves
that every permutation of {1, 2, . . . , n} gives us a permutation of A. For
example, suppose A = {a 1 , a 2 , a 3 }. Then the permutation {1, 2, 3} given by
1 2 3
2 3 1
corresponds to the permutation of A given by
a1 a2 a3
.
a2 a3 a1
Understanding Sym(A) with |A| = n is the same as understanding S n .
XIV.5. A Nice Application of Lagranges Theorem
Let n, m be integers with n 2 and 0 m n. You met before the
binomial coefficient
!
n
n!
=
.
m
m!(n m)!
It is not all obvious from this formula that the binomial coefficient is an
integer. As an application of Lagranges Theorem, we show that it is. Recall that S n has order n!. If we can find a subgroup H of S n of order
m!(n m)!, then
!
n
|S n |
=
.
m
|H |
We know by Lagranges Theorem that the right-hand side is simply [S n :
H ], the number of cosets of H in S n , and is therefore an integer. All
we have to do now is give the subgroup H of order m!(n m)!. Let H
be the subset of S n consisting of permutations such that permutes
{1, 2, . . . , m} and permutes {m + 1, m + 2, . . . , n}. What do we mean by this?
Write
A = {1, 2, . . . , m},
B = {m + 1, m + 2, . . . , n}.
Note that
{1, 2, . . . , n} = A B.
The elements of S n are the bijections from the set {1, 2, . . . , n} to itself. The
elements of H are those elements of S n that satisfy (a) A for all a
A, and (b) B for all b B . Its an easy exercise to show that H is a
subgroup of S n . We want to check that its order is really m!(n m)!. We
count the elements of H in a similar way to the argument in the proof
of Theorem XIV.14. Let be an element of H . Then (1) can be any of
1, 2, . . . , m. Once weve chosen (1), we know (2) can be any of 1, 2, . . . , m
except for (1). Thus there are m choices for (1), m 1 choices for (2)
97
and so on. Until we reach (m + 1). This can be any element of B , and so
there are n m choices for (m + 1). etc. You see that the order of H is
m(m 1) 1 (n m)(n m 1) 1 = m!(n m)!
Dont you just love maths?
Exercise XIV.21. To make sure youve understood the argument above,
let m = 2 and n = 4, so that
A = {1, 2},
B = {3, 4}.
Now write down all permutations in S 4 that satisfy (a) A for all a A
and (b) B for all b B , and convince yourself that these form a group.
XIV.6. Cycle Notation
Let a 1 , a 2 , . . . , a m be distinct elements of the set {1, 2, . . . , n}. By the notation
(a 1 , a 2 , . . . , a m )
(XIV.28)
1 2 3 4 5
(1, 4, 5) =
.
4 2 3 5 1
Notice that (1, 4, 5) = (4, 5, 1) = (5, 1, 4). However, (1, 4, 5) 6= (1, 5, 4).
The transposition (1, 5) S 5 is given in Figure XIV.5.
In our old notation, the transposition (1, 5) is written as follows:
1 2 3 4 5
(1, 5) =
.
5 2 3 4 1
Note that (1, 5) = (5, 1).
98
1
2
I hope that the above example has convinced you that cycle notation
is simultaneously more concise and and more transparent than the old
notation. If so, the following theorem will come as a pleasant surprize.
Theorem XIV.23. Every permutation can be written as a product of disjoint cycles.
What does disjoint mean? Two cycles (a 1 , a 2 , . . . , a n ) and (b 1 , b 2 , . . . , b m )
are said to be disjoint if a i 6= b j for all i , j . What does product mean? The
product of two permutation is of course their composition as functions.
Before we prove the theorem, lets see and example where we write down
a permutation as a product of cycles.
Example XIV.24. Let
1 2 3 4 5 6 7 8
=
.
5 7 1 4 8 2 6 3
Write as a product of disjoint cycles.
Answer: We start with 1 are repeatedly apply to it:
1 7 5 7 8 7 3 7 1.
Therefore contains the cycle (1, 5, 8, 3). Now we start with an element of
the set {1, 2, . . . , 8} that is not contained in the cycle (1, 5, 8, 3). For example
start with 2 and repeatedly apply to it:
2 7 7 7 6 7 2.
So also contains the cycle (2, 7, 6). Note that the cycles (1, 5, 8, 3) and
(2, 7, 6) are disjoint, and contains the product (or composition) (1, 5, 8, 3)(2, 7, 6).
There still remains one element of the set {1, 2, . . . , 8} that does not appear
as either of the two cycles (1, 5, 8, 3) and (2, 7, 6) and this is 4. Applying
to 4 we find:
4 7 4.
So
= (1, 5, 8, 3)(2, 7, 6)(4)
99
1 7 3 7 10 7 9 7 1,
2 7 5 7 6 7 2;
1 [ 3 [ 10 [ 9 [ 1,
2 [ 5 [ 6 [ 2.
Therefore 1 = (1, 9, 10, 3)(2, 6, 5). Check for yourself that 1 is indeed
the identity permutation.
100
= (b 1 , b 2 , . . . , b ` ).
1 = 1, 1, 2 1, . . . , u1 1 .
It is clear that 1 has the same effect as on the elements 1, 1, . . . , u1 1.
Now let a be the first element of the set {1, 2, . . . , n} not appearing in
the list 1, 1, . . . , u1 1. Repeat the above argument with the sequence
a, a, 2 a, 3 a, . . . .
We deduce the existence of a cycle
2 = a, a, . . . , v1 a
as promised
101
Exercise XIV.30. We will shortly meet the Alternating Groups, one of which
is
A 3 = {id, (1, 2, 3), (1, 3, 2)}.
Verify that A 3 is a subgroup of S 3 , and write down its left cosets.
Exercise XIV.31. Verify that H = {id, (1, 2)} is a subgroup of S 3 , and write
down its left cosets.
Exercise XIV.32.
(i) Use Lagranges Theorem to show that S 4 does
not have an element of order 5.
(ii) Let = (a 1 , a 2 , . . . , a m ) be a cycle of length m in S n . Explain why
has order m.
(iii) Now let = 1 2 . . . k where the i are disjoint cycles of lengths
m i in S n . Explain carefully why has order lcm(m 1 , m 2 , . . . , m k ).
(iv) Show that S 4 does not have elements of order 6. Could you have
shown this using Lagranges Theorem?
XIV.7. Permutations and Transpositions
Lemma XIV.33. Every permutation can be written a product of transpositions.
Note the absence of the word disjoint.
P ROOF. We know that every permutation can be written a product of cycles. So it is enough to show that a cycle can be written as a product of
transpositions. Check for yourself that
(XIV.29)
(a 1 , a 2 , . . . , a m ) = (a 1 , a m ) (a 1 , a 3 )(a 1 , a 2 ).
102
Example XIV.34. Equation (XIV.29) gives a recipe for writing any cycle as
a product of transpositions. For example,
(1, 5, 9) = (1, 9)(1, 5).
Note that these transpositions are not disjoint and so they dont have to
commute. Check that
(1, 9)(1, 5) 6= (1, 5)(1, 9).
One thing to be careful about is that decomposition of a permutation as
a product of transpositions is not in any way unique. For example, using
(XIV.29) we have
(1, 2, 3, 4) = (1, 4)(1, 3)(1, 2).
However, you can also check that
(1, 2, 3, 4) = (2, 3)(1, 3)(3, 5)(3, 4)(4, 5).
So we can write (1, 2, 3, 4) as a product of 3 transpositions and as a product of 5 transpositions. Can we write it as a product of 4 transpositions?
Spend no more and no less than five minutes thinking about this.
P 3 = (x 1 x 2 )(x 1 x 3 )(x 2 x 3 ),
(x i x j ).
1i < j n
103
x1 x5 ,
x3 x5 .
x1 x2 ,
x1 x4 ,
(II)
x2 x3 ,
x3 x4 ,
(III)
x2 x5 ,
x4 x5 ,
(IV)
x2 x4 .
(x 1 x 4 ) = x 1 x 2 .
Thus swaps the factors in group (I) whilst keeping their signs the same.
But
(x 2 x 3 ) = x 4 x 3 = (x 3 x 4 ),
(x 3 x 4 ) = x 3 x 2 = (x 2 x 3 ).
Thus swaps the factors in group (II) and changes the sign of each. Moreover,
(x 2 x 5 ) = x 4 x 5 ,
(x 4 x 5 ) = x 2 x 5 .
So swaps the factors in group (III) whilst keeping their signs the same.
Finally,
(x 2 x 4 ) = x 2 x 4 = x 4 x 2 = (x 2 x 4 ).
1The word groups here is used in its English language sense, not in its mathemat-
ical sense.
104
So the one factor in group (IV) simply changes sign. We see that (P 5 ) has
the same factors as P 5 with three sign changes: (P 5 ) = (1)3 P 5 = P 5 .
P ROOF OF L EMMA XIV.36. Let = (`, m). The transposition (`, m) swaps
` and m, and keeps everything else fixed. In particular (`, m) = (m, `).
So we can suppose that ` < m. Any factor x i x j where neither i nor j
is equal to ` nor m, is unaffected by . We pair off the other factors as
follows:
x1 x` ,
x1 xm ,
x 2 x ` ,
x2 xm ,
(I)
.
.
..
..
x `1 x ` , x `1 x m ,
x ` x `+1 , x `+1 x m ,
x ` x `+2 , x `+2 x m ,
(II)
..
..
.
.
x ` x m1 , x m1 x m ,
x ` x m+1 , x m x m+1 ,
x ` x m+2 , x m x m+2 ,
(III)
..
..
.
.
x` xn ,
xm xn ,
n
(IV)
x` xm .
Now swaps each pair in (I), keeping the signs the same; it swaps each
pair in (II) and changes the sign of each; it swaps each pair in (III), keeping the signs the same; it changes the sign of x ` x m . So (P n ) has exactly
the same factors as P n , up to a certain number of sign changes. How
many sign changes? The number of sign changes is:
2(m ` 1) + 1.
The 1 is for changing the sign of x ` x m . There are 2 sign changes coming
from each pair in (II). The number of such pairs is m ` 1. Since the
number of sign changes is odd, we see that (P n ) = P n .
Lemma XIV.38. If S n then (P n ) = P n . More precisely, if is a product of an even number of transpositions then (P n ) = P n and if is a product of an odd number of transpositions then (P n ) = P n .
P ROOF. Recall, by Lemma XIV.33, that we can write every permutation as
a product of transpositions. Every transposition changes the sign of P n .
The lemma follows.
Example XIV.39. We have noted in Example XIV.34 that the way we express a permutation as a product of transpositions is not unique. Indeed
105
we saw that
(1, 2, 3, 4) = (1, 4)(1, 3)(1, 2),
So we can write (1, 2, 3, 4) as a product of 3 transpositions and as a product of 5 transpositions. We asked the question of whether (1, 2, 3, 4) can
be written as a product of 4 transpositions? Write = (1, 2, 3, 4). From the
above lemma, we see that (P n ) = P n . If were able to write as a product of an even number of transpositions then (P n ) = P n . We would then
have P n = P n which is a contradiction. Therefore we cannot write as
a product of 4 transposition.
106
Therefore the product can be written as an even number of transpositions (even+even=even). Hence A n .
Finally we must show that the inverse of an even permutation is even.
Suppose is even. We can write
= 1 2 . . . m
where the i are transpositions, and m is even. Now
1 = (1 2 m )1
1
1
= 1
m m1 1
= m m1 1 .
Here you should convince yourself that 1 = for any transposition .
Since m is even, we find that 1 is even and so 1 A n .
Hence A n is a subgroup of S n .
Example XIV.43. Recall that S 2 = {id, (1, 2)}. We see that A 2 = {id} is the
trivial subgroup.
In the above examples we saw that A n has half the number of elements of S n for n = 2, 3. In fact, this pattern continues.
Theorem XIV.45. Let n 2. Then A n has order
n!
1
|A n | = |S n | = .
2
2
P ROOF. We know by Lagranges Theorem that
|S n | = [S n : A n ]|A n |.
To prove the theorem it is sufficient to show that the index [S n : A n ] = 2.
Fix a transposition (e.g. = (1, 2)). We shall show that the distinct cosets
of A n in S n are A n and A n . It will then follow that the index [S n : A n ] = 2,
completing the proof.
We know that A n is the subset (indeed subgroup) of S n consisting of
all the even permutations. Thus A n consists only of odd permutations.
Does A n contain all the odd permutations? Suppose is odd. Then
is even and is hence in A n . Therefore () is in the coset A n . But
() = 2 = ,
since transpositions have order 2, and so A n .
We have now shown that A n is the set of all odd permutations, and
we know that A n is the set of all even permutations. Are there any other
cosets? If there were any they would have to overlap with either A n or
107
A n , and we know that cosets are either disjoint or equal (Lemma XII.17).
So there arent any other cosets and the proof is complete.
Exercise XIV.46. Let and be as given in Exercise XIV.18. Write and
as products of transpositions and state if theyre even or odd.
Exercise XIV.47. Write down the elements of A 3 and check that it is cyclic
(and hence abelian). Show that A n is non-abelian for n 4.
Exercise XIV.48. Let f be a polynomial in variables x 1 , x 2 , . . . , x n . Let be
a permutation in S n . We define ( f ) to be the polynomial f (x 1 , x 2 , . . . , x n ).
For example, if f = x 1 + x 22 + x 3 x 4 and = (1, 4)(2, 3) then swaps x 1 and
x 4 , and swaps x 2 and x 3 ; thus ( f ) = x 4 + x 32 + x 2 x 1 . Compute ( f ) for the
following pairs f , :
(i) f = x 12 x 2 x 3 , = (1, 2, 3).
(ii) f = x 1 x 2 + x 3 x 4 , = (1, 3)(2, 4).
10
11
12
13
15
14
You can slide any tile adjacent to the blank into the position of the
blank. So starting from the initial arrangement there are two possible
moves:
108
10
11
10
11
12
13
15
14
13
15
12
14
10
11
12
13
14
15
Show that this is impossible! You might want to follow these hints and a mathematical scam
tips:
(i) Think of the blank as a tile numbered 16. This way every rearrangement is a permutation on 1, . . . , 16 and so an element of
S 16 .
(ii) Observe that every move is a transposition involving 16.
(iii) Observe that to go from the initial arrangement to the desired
final arrangement, tile 16 must make the same number moves
down as up, and the same number of moves right as left.
Can you use your knowledge of maths to think of other ways of ripping applied maths!
people off? This of course is a purely intellectual exercise. As citizens of
Warwick plc you are fine upright human beings who would not dream
of putting such ideas into practice. But DONT share these ideas with
friends from less scrupulous universities. I dont have a particular two
universities in mind.
Riveted?
CHAPTER XV
Rings
If groups took your breath away, wait till you meet rings.
XV.1. Definition
A ring is a triple (R, +, ), where R is a set and +, are binary operations
on R such that the following properties hold
(i) (closure) for all a, b R, a + b R and a b R;
(ii) (associativity of addition) for all a, b, c R
(a + b) + c = a + (b + c);
(iii) (existence of an additive identity element) there is an element
0 R such that for all a R,
a + 0 = 0 + a = a.
(iv) (existence of additive inverses) for all a R, there an element,
denoted by a, such that
a + (a) = (a) + a = 0;
(v) (commutativity of addition) for all a, b R,
a + b = b + a;
(vi) (associativity of multiplication) for all a, b, c R,
a (b c) = (a b) c;
(vii) (distributivity) for all a, b, c R,
a (b + c) = a b + a c;
(b + c) a = b a + c a;
110
XV. RINGS
Note that the word commutative in the phrase commutative ring refers
to multiplication. Commutativity of addition is part of the definition of
ring. Some textbooks omit property (viii) from the definition of a ring.
Those textbooks call a ring satisfying (viii) a ring with unity. We shall
always assume that our rings satisfy (viii).
Observe, from properties (i)(v), if (R, +, ) is a ring, then (R, +) is an
abelian group.
XV.2. Examples
Example XV.1. You know lots of examples of rings: Z, Q, R, C, R[x], etc.
All these examples are commutative rings.
Example XV.2. Let
M 22 (R) =
a b
: a, b, c, d R .
c d
This is the set of 2 2 matrices with real entries. From the properties of
matrices it is easy to see that M 22 (R) is a ring with the usual addition
and multiplication of matrices. The additive identity is the zero matrix,
and the multiplicative identity is I 2 . The ring M 22 (R) is an example of a
non-commutative ring, as matrix multiplication is non-commutative.
Similarly we define M 22 (C), M 22 (Z), M 22 (Q). These are all noncommutative rings.
a b = b a.
XV.3. SUBRINGS
111
Example XV.5. Consider (R[x], +, ), where is composition of polynomials. Is this a ring? No. It is easy to see that all the required properties
hold except for distributivity (the multiplicative identity is the polynomial f (x) = x). Let us give a counterexample to show that distributivity
fails. Let
f (x) = x 2 ,
g (x) = x,
h(x) = x.
Then
f (g + h) = f (2x) = 4x 2 ;
f g + f h = x 2 + x 2 = 2x 2 .
Example XV.6. Lets step back a little and think about R2 . We know that
(R2 , +) is an abelian group. Is there a way of defining multiplication on R2
so that we obtain a ring? There are actually two ways. The first is rather
obvious: we define
(a 1 , a 2 ) (b 1 , b 2 ) = (a 1 b 1 , a 2 b 2 ).
With this definition, you can check that (R2 , +, ) is a ring, where the multiplicative identity is 1 = (1, 1).
The other way is more subtle: we define
(XV.31)
(a 1 , a 2 ) (b 1 , b 2 ) = (a 1 b 1 a 2 b 2 , a 1 b 2 + a 2 b 1 ).
Where does this definition come from? Recall that R2 is represented geometrically by the plane, and C is represented geometrically by the plane.
If were thinking of points in the plane as elements of R2 then we write
them as ordered pairs of real numbers: (a, b). If were thinking of points
in the plane as elements of C then we write them in the form a +i b where
again a, b are real numbers. We multiply in C using the rule
(XV.32)
(a 1 + i a 2 ) (b 1 + i b 2 ) = (a 1 b 1 a 2 b 2 ) + i (a 1 b 2 + a 2 b 1 ).
Notice that definitions (XV.31), (XV.32) are exactly the same at the level of
points on the plane. Weve used the multiplicative structure of C to define
multiplication on R2 . With this definition, (R2 , +, ) is a ring. What is the
multiplicative identity? Its not (1, 1). For example (1, 1) (1, 1) = (0, 2).
Think about the multiplicative identity in C. This is simply 1 = 1 + 0i . So
the multiplicative identity in (R2 , +, ) (with multiplication defined as in
(XV.31)) is (1, 0). Check for yourself that
(a 1 , a 2 ) (1, 0) = (1, 0) (a 1 , a 2 ) = (a 1 , a 2 ).
This example is NOT IMPORTANT. Dont lose any sleep over it.
XV.3. Subrings
Just as we have subgroups, so we have subrings. Im sure youve guessed
the definition.
112
XV. RINGS
XV.3. SUBRINGS
113
Answer. We can try checking the eight defining properties of a ring. However, we note that Z[i ] is contained in C. Indeed, it is the set of complex
numbers where the real and imaginary parts are integers. So lets prove
that Z[i ] is a subring of C.
Now 0 = 0+0i , 1 = 1+0i are clearly in Z[i ]. Suppose , Z[i ]. Write
= a1 + a2 i ,
= b1 + b2 i ,
= a 1 + (b 1 )i ,
and
= (a 1 a 2 b 1 b 2 ) + (a 1 b 2 + a 2 b 1 )i .
Since we want to show that +, and are in Z[i ], we want to show
that their real and imaginary parts are integers. Now as a 1 , a 2 , b 1 , b 2 are
integers, so are
a1 + b1 ,
a2 + b2 ,
a 1 ,
b 1 ,
a1 a2 b1 b2 ,
a1 b2 + a2 b1 .
na
o
:
a,
r
Z,
r
0
.
2r
a + 2r s b
.
2r
114
XV. RINGS
a b
: a, b, c R .
(i)
0 c
a b
: a, b R .
(ii)
0 0
a b
(iii)
: a, b R .
0 1
a 0
(iv)
: a R, b Z .
0 b
a b
(v)
: a, b R .
b a
(vi) {A M 22 (R) : det(A) = 1}.
XV.4. The Unit Group of a Ring
Recall that we defined R , Q , C be removing from R, Q, C the zero
element; e.g.
R = {a R : a 6= 0}.
We found that R is group with respect to multiplication. In Example V.4
we tried to do the same with Z and failed to obtain a group. Note that R,
Q, C are rings and so is Z. Given a ring, is there a naturally defined subset
that is a group with respect to multiplication? It turns out that the answer
is yes, and that for R, Q and C we obtain R , Q , C as wed expect. To
define this subset, we need the concept of a unit.
Definition. Let R be a ring. An element u is called a unit if there is some
element v in R such that uv = vu = 1. In other words, an element u of R
is a unit if it has a multiplicative inverse that belongs to R.
Example XV.17. In any ring, 0 is a non-unit.
115
Example XV.20. Recall that R[x] is the ring of polynomials with real coefficients. Then x is not a unit, since 1/x is not a polynomial. However, 2 is
a unit, since 1/2 is a polynomial in R[x] with real coefficients:
1 1
= + 0x.
2 2
(XV.33)
Just because weve called R the unit group of R doesnt get us out of
checking that it is really a group.
Lemma XV.21. Let (R, +, ) be a ring and let R be the subset defined in (XV.33).
Then (R , ) is a group.
P ROOF. We must first show that R is closed under multiplication. Suppose u 1 , u 2 R . Thus u 1 , u 2 are units of R, and so there are v 1 , v 2 R
such that
(XV.34)
u 1 v 1 = v 1 u 1 = 1,
u 2 v 2 = v 2 u 2 = 1.
associativity of multiplication
since u 2 v 2 = 1
since u 1 v 1 = 1.
offence intended
(u 1 u 2 )(v 1 v 2 ) = (u 1 v 1 )(u 2 v 2 ) = 1 1 = 1.
116
XV. RINGS
Example XV.22. Note that R , C , Q have exactly the same meaning as
before.
Example XV.23. We showed that the units of Z are 1. Therefore the unit
group of Z is
Z = {1, 1}.
Example XV.24. Recall that M 22 (R) is the ring of 2 2 matrices with real
entries. It is clear from the definition of a unit, that the units of M 22 (R)
are the invertible matrices. In other words, they are the ones having nonzero determinant. Thus
(M 22 (R)) = GL2 (R).
Similarly,
(M 22 (Q)) = GL2 (Q),
117
You will easily see that this is consistent with the earlier definitions of
GL2 (R), GL2 (C), GL2 (Q) and GL2 (Z), and that moreover, (M 22 (R)) = GL2 (R).
Example XV.25. Let
S=
a b
: a, b, c Z .
0 c
Show that S is a ring under the usual addition and multiplication of matrices. Compute S .
Answer: To show that S is a ring it is enough to show that it is a subring
of M 22 (Z). We leave that as an easy exercise.
Let us compute the unit group. Suppose A = a0 bc is in S. To be unit
it is not enough for this matrix to be invertible, we also want the inverse
to belong to S. So we require the determinant ac to be non-zero and we
want
1 c b
1/a b/ac
1
A =
=
0
1/c
ac 0 a
to belong to S. Thus we want the integers a, b, c to satisfy
1 1
b
, ,
Z.
a c ac
This happens precisely when a = 1 and c = 1. Thus
1 b
S =
:bZ .
0 1
ac 6= 0,
a, b Z.
118
XV. RINGS
P ROOF. and are complex numbers, and you can see that N () = ||2 .
From the properties of the absolute value you know that || = || ||.
The lemma follows.
Theorem XV.28. The unit group of Z[i ] is {1, 1, i , i }.
In other words, (Z[i ]) = U4 , the group of fourth-roots of unity.
P ROOF. We want the units of Z[i ]. Let be a unit. Then there is some
Z[i ] such that 1 = 1. Applying the norm map, and recalling that it
is multiplicative, we see that
N ()N () = N () = N (1) = 1.
Now N () and N () are in Z (go back to the definition of the norm map
to see this), and they multiply to give 1. So
N () = N () = 1,
or
N () = N () = 1.
Exercise XV.29. In Exercise XV.15 you met the ring Z[2i ]. Find its unit
group. (Hint: Show first that any unit in Z[2i ] is a unit in Z[i ].)
p
p
p
Exercise XV.30. Let
p Z[ 2] = {a + b 2 : a, b Z}. Show that Z[ 2] is a
ring
p and that 1 + 2 is unit. What is its order as an element of the group
Z[ 2] ?
Exercise XV.31. Let = e 2i /3 (this is a cube root of unity). Check that
= 2 . Let Z[] = {a + b : a, b Z}.
(i) Show that 2 Z[] (Hint: the sum of the cube roots of unity is
. . . ).
(ii) Show that Z[] is a ring.
(iii) Show that 1, and 2 are units in Z[].
(iv) (Harder) Show that Z[] = {1, , 2 }.
(v) Show that this group is cyclic.
= 1 is enough.
Remark. Compare the above proof to our determination of the unit group
of M 22 (Z) in Example XV.24. I hope you agree that the similarities are
striking!
CHAPTER XVI
Fields
A field (F, +, ) is a commutative ring such that every non-zero element
is a unit. Thus a commutative ring F is a field if and only if its unit group
is
F = {a F : a 6= 0}.
Example XVI.1. R, C, Q are fields.
Example XVI.3. R[x] is not a field, since for example x R[x] is non-zero
but not a unit.
= c + di
where a, b, c, d Q. Then
+ = (a + c) + (b + d )i .
Since Q is closed under addition, a + c and b + d Q. So + Q[i ].
Similarly, check for yourself that and are in Q[i ]. Thus Q[i ] is a
subring of C and so a ring 1.
Finally we have to show that every non-zero element of Q[i ] is a unit.
Suppose is a non-zero element of Q[i ]. We can write = a +bi where a,
b Q, and not both zero. We want to show that existence of some Q[i ]
such that = = 1. In other words, we want to show that 1/ is in
Q[i ]. But we know how to compute 1/. Recall that to divide complex
1We couldve made the proof more tedious by writing
r u
k m
+ i,
= + i,
s v
` n
where r , s, u, v, k, `, m, n are integers and s, v, `, n are non-zero. This wouldve worked,
but why do it? Get used to thinking of rational numbers as numbers in their own right!
=
119
120
XVI. FIELDS
a + bi a bi
a bi
= 2
a + b2
a
b
= 2
2
i.
2
a +b
a + b2
As a, b are rationals, so are a/(a 2 + b 2 ) and b/(a 2 + b 2 ). So 1/ is in Q[i ].
Therefore Q[i ] is a field.
p
p
p
Exercise XVI.5. Let Q[ 2] = {a + b 2 : a, b Q}. Show that Q[ 2] is a
field.
Exercise XVI.6. Let
a b
F = b
a : a, b R .
(a) Show that F is a field (under the usual addition and multiplication of matrices). (Hint: Begin by showing that F is a subring of
M 22 (R). You need to also show that F is commutative and that
every non-zero element has
in F .)
aanbinverse
=
a
+bi
. Show that is a bijec(b) Let : F C be given by b
a
tion that satisfies (A + B ) = (A) + (B ) and (AB ) = (A)(B ).
(c) Show that
a b
F 0 = b
a : a, b C
is not a field.
CHAPTER XVII
Congruences Revisited
We saw that there are two binary operations defined on Z/mZ, addition and multiplication. These make (Z/mZ, +, ) a commutative ring,
and (Z/mZ, +) a cyclic group of order m. We want to know about the unit
group of Z/mZ.
XVII.1. Units in Z/mZ
Example XVII.1. Find the unit groups of Z/mZ for m = 2, 3, 4, 5, 6.
Answer: You dont have be very clever here! Just look at the multiplication
table for Z/6Z in Example VII.3 and youll see that
(Z/6Z) = 1, 5 .
In the same way youll find that
(Z/2Z) = 1 ,
(Z/4Z) = 1, 3 ,
(Z/3Z) = 1, 2 ,
(Z/5Z) = 1, 2, 3, 4 .
In particular, Z/2Z, Z/3Z and Z/5Z are fields and Z/4Z, Z/6Z are not
fields. Can you make a general guess as to which Z/mZ are fields and
which arent? Can you prove your guess?
122
to write 1 as a linear
19 = 2 9 + 1.
Thus
1 = 19 2 9 = 19 2 (256 13 19) = (1 2 13) 19 2 256,
so
1 = 27 19 2 256.
Hence 27 19 1 (mod 256), so 27 is the inverse of 19 in Z/256Z.
a p1 1 (mod p).
p1
p1
= 1.
1It is easy to get muddled in the substitutions involved in Euclids Algorithm. One
way to reduce the muddle is to somehow distinguish the numbers you started with, here
256 and 19, and the remainders from the quotients. I did the distinguishing by writing
the numbers we started with and the remainders in boldtype. In your calculations, you
can underline them.
123
m = p 11 p kk
where p 1 , . . . , p k are distinct primes and r 1 , . . . , r k are positive integers. Then
r
r 1
(m) = (p 11 p 11
r 1
) (p kk p kk
).
124
Vale Dicere
With tear-filled eyes I say goodbye, and begin to suffer the heart-rending
pangs of separation . . .
XVII.4.
Exercise XVII.17. Write a 5000 word essay on how abstract algebra has
changed your outlook on life, detailing the insights you have gained into
the great challenges facing/menacing humanity 1.
Exercise XVII.18. Veneration of abstract algebra is at the root of contemporary mathematical decline. Discuss.
1Heres a good way to start your essay: Unbeknownst to me, I misspent the first 18
years of my life wallowing in a cesspool of intellectual stagnation, until week six of term
1 when the Abstract Algebra lectures started . . . .
a sincere outpouring
of grief
APPENDIX A
This is a 1 hour paper. Answer two questions only. If you have answered more than two questions, you will only receive credit for your best
two answers.
Question 1
(1) Let and be the following permutations:
= (1, 2, 5)(3, 4),
a b
(4) You may assume that GL2 (R) =
: a, b, c, d R, ad bc 6= 0
c d
is a group with respect to matrix multiplication. Let
1 r
H=
: r R, s R .
0 s
(a) Show that H is a subgroup of GL2 (R).
(b) Show that H is non-abelian by giving a non-commuting pair
of elements.
Question 2
(1) Let (R, +, ) be a ring.
(a) What does it mean for a R to be a unit?
(b) Define R .
(c) Show that (R , ) is a group.
125
126
(2) Let
p
p
Z[ 2] = {a + b 2 : a, b Z}.
p
(a) Show that Z[ p
2] is a ring.
2 is a unit, and that it has infinite order in
(b) Show
that
1
+
p
Z[ 2] .
p
(c) Show that Z[ 2] is not a field.
Question 3
In this question, you may assume that every permutation is either
odd or even, but cannot be both. You may not use Lagranges Theorem.
(1) Write down the distinct left cosets of the subgroup 8Z in the
group 2Z (no explanation necessary). What is the index [2Z :
8Z]?
(2) Let G be a finite group and H a subgroup. Show that |g H | = |H |
for any g in G.
(3) Let n 2 be an integer. Show that |A n | = 21 |S n |.
(4) Give a counterexample to show that the following statement is
false: if , S n have orders r , s respectively, then has order
dividing lcm(r, s).
APPENDIX B
x n
x
e = lim 1 +
n
n
and wants to deduce the well-known power series expansion for e x . So
he introduces infinite numbers, and reasons that if i is an infinite number
then obviously
i
= 1,
i
i (i 1)
= 1,
i2
i (i 1)(i 2)
= 1, . . . .
i3
Thus
x n
e x = lim 1 +
n
n
x i
= 1+
i infinite
i
x i (i 1) x 2
= 1+i +
+
using the binomial theorem
i
2! i 2
x2
= 1+x +
+
2!
Next Euler wants to derive the power series expansion for log(1 + t ). To
do this he defines the infinitesimal = 1/i . This infinitesimal is so small
that
(B.36)
2 = 3 = = 0.
127
128
Now to get the power series expansion for log(1 + t ), write x = 1 + t , and
y = log x. Thus
y i
x = ey = 1+
.
i
Hence
y
x = x 1/i = 1 + = 1 + y.
i
Rearranging we get
1
1
(B.37)
log(1 + t ) = log x = y = (x 1) = 1 + (1 + t ) .
= 1 + t t 2 + t 3 t 4 +
2
3
4
Substituting into (B.37) we obtain
log(1 + t ) = t t 2 + t 3 t 4 +
2
3
4
t2 t3 t4
= t + +
2
3
4
B.2. Nothing to see heremove along please
R
Here
is
an
beautiful
example
I
saw
on
mathoverflow.net
.
Let
=
Rx
0 . We want to solve the integral equation
Z
f f = 1.
Factoring out the f we have
Z
1
f = 1.
Hence
Z 1
f = 1
1
Z
= 1+ +
+
+ 1
Z x
Z xZ x
Z xZ
= 1+
1+
1+
0
= 1+x +
= ex.
x2 x3
+
+
2! 3!
geometric series
xZ x
0
1+