Hirst
Hirst
Hirst
2015 Edition1
1
2002
c by Jeffry L. Hirst and Holly P. Hirst. All rights reserved.
ii
Contents
Introduction v
1 Propositional Calculus 1
1.1 Building Blocks . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Tautologies and Contradictions . . . . . . . . . . . . . . . . . . . 9
1.3 Logical Equivalence . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.4 Contrapositives and Converses . . . . . . . . . . . . . . . . . . . 11
1.5 Analysis of Arguments . . . . . . . . . . . . . . . . . . . . . . . . 13
1.6 A Proof System . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
1.7 The Deduction Theorem . . . . . . . . . . . . . . . . . . . . . . . 19
1.8 Generalizing L . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
1.9 Soundness and Completeness of L . . . . . . . . . . . . . . . . . . 23
1.10 Modifying L . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
1.11 Assessing Propositional Calculus . . . . . . . . . . . . . . . . . . 27
2 Predicate Calculus 29
2.1 Building Blocks . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
2.2 Translations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
2.3 A brief interlude: Truth . . . . . . . . . . . . . . . . . . . . . . . 35
2.4 Free variables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
2.5 Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
2.6 Truth and Sentences . . . . . . . . . . . . . . . . . . . . . . . . . 40
2.7 Truth and free variables . . . . . . . . . . . . . . . . . . . . . . . 42
2.8 Logical validity . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
2.9 Formulas that aren’t logically valid . . . . . . . . . . . . . . . . . 45
2.10 Some logically valid formulas . . . . . . . . . . . . . . . . . . . . 46
2.11 Free for... . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
2.12 A proof system for predicate calculus . . . . . . . . . . . . . . . . 51
2.13 Dealing with ∀ . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
2.14 Rule T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
2.15 The Deduction Theorem . . . . . . . . . . . . . . . . . . . . . . . 55
2.16 Adding ∃x . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
2.17 Removing ∃x . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
2.18 Proof strategies in predicate calculus . . . . . . . . . . . . . . . . 60
iii
iv CONTENTS
v
vi INTRODUCTION
Propositional Calculus
The big idea in propositional calculus is to study the structure of simple state-
ments. We will discover connections between the structure and the truth val-
ues of these statements, and devise fast methods for determining truth values.
Eventually, we will write some formal proofs of statements.
Propositions
A proposition is a statement, containing a subject and a verb like a sentence in
English. We will eventually work with mathematical statements almost exclu-
sively, but for now any statements can be used.
2 is prime.
4 + 6 = 10.
Today it is raining.
Today is Thursday and tomorrow will be sunny implies that yesterday was
rainy.
1
2 CHAPTER 1. PROPOSITIONAL CALCULUS
Propositions can be true or false. If we know what truth value to assign one
we can utilize this information. Otherwise, we check what happens when the
proposition is assumed to be true and then false by using a truth table. The
following truth tables reveal the meaning of the various connectives.
Connectives
The symbols ¬, ∧, ∨, →, and ↔ are called propositional connectives. Their
properties are best shown via truth tables.
Negation
Symbol: ¬
Interpretation: ¬a means “not a”
a ¬a
T F
F T
Notice that the proposition a has a column containing all possible truth
values, in this case simply T and F. Then the second column contains the truth
value for “not a” for each possible value of a. We can read this table as follows:
If a is true then not a is false. If a is false, then not a is true. Not too bad.
Other books use the symbols ∼ a or !a to denote ¬a. Let’s look at the other
connectives.
Conjunction
Symbol: ∧
Interpretation: a ∧ b means “a and b”
Vocabulary: a and b are the conjuncts in the compound proposition a ∧ b.
a b a∧b
T T T
T F F
F T F
F F F
1.1. BUILDING BLOCKS 3
Notice here that all possible combinations of truth values for a and b are
listed, along with the corresponding value for the connective. The quick story on
the and connective is that both propositions need to be true for the conjunction
to be true.
Disjunction
Symbol: ∨
Interpretation: a ∨ b means “a or b”
Vocabulary: a and b are the disjuncts in the compound proposition a ∨ b.
a b a∨b
T T T
T F T
F T T
F F F
Summarizing, a disjunction is true whenever at least one of the propositions
is true. This connective is sometimes called inclusive or to differentiate it from
exclusive or (which is often denoted by +). The formula a + b is interpreted as
“a or b, but not both.”
Implication
Symbol: →
Interpretation: a → b means “if a then b” (in the mathematical sense.)
Vocabulary: In the formula a → b, the proposition a is referred to as the
hypothesis (or sometimes as the premise). The proposition b is referred to as
the conclusion.
a b a→b
T T T
T F F
F T T
F F T
The truth values for implication seem pretty peculiar at first. Some people
might argue that the interpretation is distinctly different from typical English
usage. They’re probably right. However the truth values do correspond exactly
to the way that mathematicians use this symbol. The only time an implication
is false is when the hypothesis is true and the conclusion is false. False may
imply false and false may imply true, but true cannot imply false.
Mathematical texts use all of the following phrases to represent a → b:
if a then b,
a implies b,
Biconditional
Symbol: ↔
Interpretation: a ↔ b means “a if and only if b”
a b a↔b
T T T
T F F
F T F
F F T
The biconditional is true exactly when the propositions have the same truth
value. In some texts, the phase “a is a necessary and sufficient condition for b”
is used for a ↔ b.
¬ (p → q)
F T T T
T T F F
F F T T
F F T F
(a ∨ b) ∧ c a ∨ (b ∧ c)
T T T T T T T T T T
T T T F F T T T F F
T T F T T T T F F T
T T F F F T T F F F
F T T T T F T T T T
F T T F F F F T F F
F F F F T F F F F T
F F F F F F F F F F
Note that in the last example, the two formulas have the same proposition
letters and connectives in the same order. Only the location of the parentheses
is different. However, the truth values for the main connectives do not match
in the second and fourth rows. There is a moral here. Parentheses make a
difference! You can leave out parentheses when the meaning of the statement
is clear. However, if you have any doubt, retain the parentheses.
Sometimes books will leave out more parentheses than you might like. In a
pinch, you can assume that connectives are evaluated in the following order ¬,
∧, ∨, →, ↔, with like connectives evaluated from left to right. For example, the
formula b∧a → b → c∨¬d should be parenthesized as ((b∧a) → b) → (c∨(¬d)).
Translations
Given a key listing the meanings of the propositional symbols, we can translate
symbolized statements into English sentences.
Example. Given the interpretations below, translate each of the following sen-
tences into English.
(a) Translate: (a ∨ b) → c
Solution: If Fritz likes trout or Waldo is tiny, then violins don’t melt.
(b) Translate: a ↔ c
Solution: Fritz likes trout if and only if violins don’t melt.
Alternate solution: Fritz liking trout is a necessary and sufficient
condition for the non-melting of violins.
(c) Translate: ¬(b ∧ c)
Solution: It is not the case that both Waldo is tiny and violins don’t
melt.
Alternate solution: Waldo isn’t tiny or violins melt.
1.1. BUILDING BLOCKS 7
Example. Translate: If Fritz is the king of France, then Bert eats trout.
Solution: Let k denote “Fritz is the king of France.” Let b denote “Bert
eats trout.” The sentence “If Fritz is the king of France, then Bert eats trout”
translates as
k → b.
Example. Given the information below, and using common sense where appli-
cable, try to assign truth values to the following compound statements.
c means “Waldo is the milkman’s pet trout.” (We won’t assume any truth
value here.)
(a) a ∧ b
Solution: F ∧ T is F, so a ∧ b is false.
(b) a ∨ b
Solution: F ∨ T is T, so a ∨ b is true.
(c) b ∨ c
8 CHAPTER 1. PROPOSITIONAL CALCULUS
b ∨ c
Solution: T T T So b ∨ c is true.
T T F
(d) a ↔ c
a ↔ c
Solution: F T F So a ↔ c depends on truth value of c.
F F T
Exercises.
1. Build the truth table for p ∧ q.
2. Build the truth table for p → (q ∨ r)
3. Build the truth table for ¬(p ∨ ¬p)
4. Build the truth table for p ∧ ¬p
5. Build the truth table for (p ∧ q) → p
6. Build the truth table for p → (p ∨ q)
7. Given the interpretations below, translate each of the following sentences
into English. p means “2 + 2 = 5.” q means “3 is prime.”
(a) Translate: p → q
(b) Translate: (¬p) ∨ q
(c) Translate: ¬(p ∨ q)
8. Given the interpretations below, translate each of the following sentences
into English. d means “Waldo wears a hat.” m means “All milkmen like
trout.” w means “4 + 8 = 32.” z means “2 6= 5.”
(a) Translate: d → (m ∨ w)
(b) Translate: z ↔ w
(c) Translate: z ∧ (w → z)
(d) Translate: ¬(¬d ∧ z)
9. Translate into formal symbols: Either money is green or the sky is blue.
10. Translate into formal symbols: If either 2 6= 5 or 4 + 5 = 9, then 52 6= 25.
11. Translate into formal symbols: Fritz likes chocolate bunnies and Waldo
likes umbrellas.
12. Translate into formal symbols: If 5 is not an odd integer, then 8 is prime.
13. Given the information below, and using common sense where applicable,
try to assign truth values to the following compound statements.
1.2. TAUTOLOGIES AND CONTRADICTIONS 9
p means “2 6= 5.”
q means “7 is an integer multiple of 2.”
r means “Fritz is a tap-dancing investment banker.”
(a) p ∨ q
(b) p ∧ r
(c) q → r
(d) r → (q → r)
Solution: We’ll show that the main connective column in the truth table for
p → q contains at least one T and at least one F.
p → q
T T T
T F F
F T T
F T F
The first and second rows will fill the bill. We didn’t even need to write the
last two rows.
Exercises.
1. Show that a → (b → a) is a tautology.
2. Show that (a → (b → c)) → ((a → b) → (a → c)) is a tautology.
3. Show that (¬b → ¬a) → ((¬b → a) → b) is a tautology.
4. Show that p ↔ (p → ¬p) is a contradiction.
5. Show that p ∧ p is a contingency.
6. Classify each of the following formulas as a tautology, a contradiction, or a
contingency. Provide enough of the truth table for the formula to justify
your answer. (For tautologies and contradictions, you need the whole
table. For contingencies you can get by with just two cleverly selected
rows.)
(a) (p ∧ q) → p
(b) p → (p ∨ q)
(c) (p ∨ q) → (p ∧ q)
(d) p ↔ ¬p
(e) p → (¬p → (q ∧ ¬q))
(f) (p → q) ∨ (q → p)
Solution: We’ll verify that the main connective column in the truth table
for the biconditional constructed from these two formulas contains only Ts.
(p → q) ↔ ((¬ q) → (¬ p))
T T T T F T T F T
T F F T T F F F T
F T T T F T T T F
F T F T T F T T F
Exercises.
1. Show that ¬p ∧ ¬q is logically equivalent to ¬(p ∨ q).
Exercises.
1. Write the contrapositives of the following.
(a) p → q
(b) (p ∨ r) → q
(c) (a ∧ b) → (c ∨ d)
(d) If Waldo likes trout, then Elmer is a sailor.
(e) If 0 6= 1, then 4 is a prime.
(f) If tap-dancing is foolish, then I want to be a fool.
1.5. ANALYSIS OF ARGUMENTS 13
(a) p → q
(b) (p ∨ r) → q
(c) (a ∧ b) → (c ∨ d)
(d) If Waldo likes trout, then Elmer is a sailor.
(e) If 0 6= 1, then 4 is a prime.
(f) If tap-dancing is foolish, then I want to be a fool.
5. Find a formula that is logically equivalent to its converse and one that
is not. You could be imaginative and pick examples other than those in
exercises 3 and 4.
P1
P2
P3
...
Pn
Pn+1
is an argument.
We say that an argument (like the one above) is logically valid if and only if
(P1 ∧ P2 ∧ P3 ∧ ... ∧ Pn ) → Pn+1 is a tautology. Note that the conclusion of
a logically valid argument is not necessarily true. Logical validity ensures only
that if all the premises are true, then the conclusion is true.
Exercises.
1. Here are some other common argument forms. Show that they are all
logically valid.
a→b
Modus Tollens: ¬b
¬a
p→r
q→r
Constructive Dilemma:
p∨q
r
p→q
Hypothetical Syllogism: q→r
p→r
s∨t
Disjunctive Syllogism: ¬s
t
We’re in trouble.
It never rains.
5. Is the following argument logically valid?
If today is Sunday, then tomorrow is Monday.
Today is not Sunday.
The proofs we’ll be doing will be fun, provided you keep your cool. Some-
times people get stuck and become frustrated. Ask any practicing mathemati-
cian, and he or she can tell you about being seriously stuck on a proof. It’s
O.K., and (almost) everybody survives. Remember, working on a proof is a
worthy endeavor. Sit back and enjoy the process.
The system L
A proof is a sequence of formulas with justifications. Each line in a proof in the
system L must be one of the following:
• an axiom of L,
• a lemma.
Axioms
There are three axioms in L:
Axiom 1: A → (B → A)
We can use any instance of an axiom in a proof. That is, A, B, and C can be
uniformly replaced by any formula we like. We’ll use A := p to denote replacing
A by the formula p. Here are three instances of Axiom 1 and the substitutions
used to create them.
Modus Ponens
The rule of inference Modus Ponens says that if A and A → B are lines in
a proof, we can write B as a (later) line. Here A and B can represent any
formulas.
Rather than mess with a concocted example, let’s do a proof!
1. A → ((A → A) → A) Axiom 1
A := A and B := (A → A)
A := A, B := (A → A) and C := A
Lines 1 and 2
4. A → (A → A)) Axiom 1
A := A and B := A
5. A → A Modus Ponens
Lines 3 and 4
Lemmas
We’ve proved `L A → A. Now we can use any instance of A → A in future
proofs. In this case, we say that we’re using this theorem as a lemma. For
example,
Theorem L 2. `L (¬B → B) → B
1. ¬B → ¬B Theorem L1
A := ¬B
A := B and B := B
18 CHAPTER 1. PROPOSITIONAL CALCULUS
Lines 1 and 2
The use of lemmas is actually just a convenient shortcut. Rather than writing
the instance of Theorem L1 as the first line in the preceding proof, we could
write the five lines of the proof of Theorem L1, replacing all the uses of A in
that proof with ¬B. The justification for these lines would be unchanged from
our original proof of Theorem L1. In the resulting proof of L2, every single line
would be an axiom or a use of modus ponens. However, since we have already
written the proof of Theorem L1 once, it seems silly to recopy it. Consequently,
we use lemmas.
Theorem L 3. A → (B → C), A → B `L A → C
1. A → (B → C) Given
2. A → B Given
Theorem L 4. A → ((B → A) → C) `L A → C
1. A → ((B → A) → C) Given
2. A → (B → A) Axiom 1
3. A → C Theorem L3
A := A, B := B → A, and C := C
1.7. THE DEDUCTION THEOREM 19
Exercises.
Prove the following theorems. Remember, any theorem with a lower number
may be used in the proof. (Using earlier theorems can save a lot of work.)
Theorem L 5. B `L A → B
Theorem L 6. A → (B → C), B `L A → C
Theorem L 7. A → (B → C) `L B → (A → C)
Theorem L 8. A → B, B → C `L A → C
Theorem L 9. P → R `L P → (Q → R)
Theorem L 10. `L (¬B → ¬A) → (A → B)
Theorem L 11. `L ¬¬B → B
Theorem L 12. `L B → ¬¬B
1. A → (B → C) Given
2. B Given
3. A → C Theorem L6, lines 1 and 2
Since every proof in L can be rewritten using just hypotheses, axioms and
modus ponens, the techniques used in the preceding theorem can be adapted
to any proof. This explains why the deduction theorem always works. For
a detailed proof of the deduction theorem, we should really use some form of
induction on proof length. For more on this see [8] or [1].
Exercises.
Prove the following theorems. You will enjoy using the deduction theorem.
Theorem L 15. `L ¬A → (A → B)
1.8 Generalizing L
Our proof system, L, is pretty powerful. It works fine on formulas involving
only ¬ and → as connectives. Unfortunately, it doesn’t handle ↔, ∧, and ∨.
We can get around this problem by using the following abbreviations.
Theorem L 18. A, B `L A ∧ B
1. A Given
2. B Given
5. B → ¬¬B L12
Exercises.
1. Use a truth table to show that A∧B is logically equivalent to ¬(A → ¬B).
Theorem L 19. A ∧ B `L A
Theorem L 20. A ∧ B `L B
Theorem L 21. A `L A ∨ B
Theorem L 22. B `L A ∨ B
Theorem L 23. A ↔ B `L A → B
Theorem L 24. A ↔ B `L B → A
Theorem L 25. A → B, B → A `L A ↔ B
Why does the Soundness Theorem always work? Here is a rough sketch of
the proof. Suppose that we have a proof in L of A. If we want, we can eliminate
any uses of the deduction theorem and any uses of lemmas from our proof, so
that the proof consists only of axioms and uses of modus ponens. Start at the
beginning of the proof. The first two lines have to be axioms, and all the axioms
of L are tautologies. (See exercises 1, 2, and 3 on page 10.) If the next line is
an axiom, then it is a tautology, too. The next line could be an application of
modus ponens to previous lines of the form P and P → Q, yielding Q. Since
the previous lines, namely P and P → Q, are both tautologies, Q must be a
tautology also. Proceeding in this fashion, we can prove that every line in the
proof is a tautology, including the last line, which is A. Thus, if L proves A,
then A must be a tautology.
Our next theorem is actually the converse of the Soundness theorem.
We can use the Completeness Theorem for L to show that a formula can be
proved in L, without actually producing the proof! Actually, this happens all
the time in mathematics. Lots of theorems assert the existence of a set, or a
function, or a solution to a problem, or even a proof, without actually provid-
ing the desired object. People call these theorems “non-constructive existence
theorems.” This sort of theorem is surprisingly useful. It’s sort of like saying,
“I know there’s a vacuum cleaner in that closet, and I could find it if I really
needed it.”
Example. Use the Completeness Theorem for L to prove that L can prove
A → A (Theorem L1).
Exercises.
1. Use the Completeness Theorem for L to show that L can prove Theorem
L7.
1.10. MODIFYING L 25
2. Use the Completeness Theorem for L to show that L can prove Theorem
L13.
3. Use the Completeness Theorem for L to show that L can prove Theorem
L15.
4. Is there a proof in L of (A ∨ B) → B?
5. Is there a proof of B `L A ∨ B?
1.10 Modifying L
As noted in the previous section, L has some nice properties. How can we
modify L, retaining soundness and completeness? There are three reasonable
approaches: adding an axiom, discarding an axiom, and starting from scratch.
Adding an axiom
Suppose we add a tautology to L as a new axiom. Let’s call the new axiom
N EW . Suppose also that there is some formula A that we can prove in L,
using the new axiom. Then for some instance of N EW (call it N EW ∗ ) we have
N EW ∗ `L A. By the Deduction Theorem, `L N EW ∗ → A. Since N EW ∗
is a tautology, by the Completeness Theorem for L, `L N EW ∗ . We can put
together a short proof of `L A.
1. N EW ∗ Axiom NEW
So far, anything provable with the new axiom is provable in the original
axiom system. Also, anything provable in L must be provable in the new axiom
system. Thus, the new axiom system has exactly the same theorems as L.
What does this do for us? The new axiom system satisfies the completeness
and soundness theorems. It’s another reasonable axiom system. In reality, it’s
just L with a lemma disguised as an axiom. Big deal.
Suppose now that we add a formula that isn’t a tautology as a new axiom.
Remember that since we want to treat this new formula just like any other
axiom, we have to allow any instance of the new formula to appear as a line in a
proof. This always results in an inconsistent theory. (And inconsistent theories
don’t satisfy the Soundness Theorem.) Let’s look at a particular example.
Consider adding A → B to L as a new axiom. We can show that the resulting
theory is inconsistent: Notice that a particular instance of this new axiom is
26 CHAPTER 1. PROPOSITIONAL CALCULUS
Discarding an axiom
Axiom 3 is particularly ugly. Can we throw it away? Unfortunately, the re-
sulting theory doesn’t satisfy the Completeness Theorem. Here’s another way
to descibe the situation. Axiom 3 can’t be proved from Axiom 1 and Axiom 2.
Also, since L is consistent, Axiom 1 and Axiom 2 can’t prove the negation of
Axiom 3. Logicians would say, “Axiom 3 is independent of Axiom 1 and Axiom
2.” Independence statements of this sort are often very challenging to prove.
Overall, discarding axioms yields systems that don’t satisfy the Complete-
ness Theorem. This isn’t such a hot way to modify L.
The axioms:
A → (B → A)
(A → (B → C)) → ((A → B) → (A → C))
(A ∧ B) → A
(A ∧ B) → B
A → (B → (A ∧ B))
A → (A ∨ B)
B → (A ∨ B)
(A → C) → ((B → C) → ((A ∨ B) → C))
(A → B) → ((A → ¬B) → ¬A)
¬¬A → A
Use the above axioms with Modus Ponens. The theorems of this system are
exactly the tautologies. In other words, completeness and soundness theorems
hold for this system. By using this system, we could avoid using abbreviations
to deal with conjunction and disjunction. If we wanted, we could add more
axioms to deal with biconditional or any other connectives we might like to
append.
Use the above axiom with Modus Ponens. The theorems of this system are
exactly the tautologies. Meredith’s system is very elegant with its single axiom
and single rule of inference. Unfortunately, it is not so easy to prove theorems
in this system or even to recognize instances of the axiom. For an exceptionally
challenging exercise, try proving L1 in Meredith’s system.
Exercises.
1. Show that Meredith’s axiom is a tautology
2. Write down two instances of Meredith’s axiom.
3. Prove the following using Kleene’s axiom system:
(a) A → A
(b) A → B, B → C ` A → C
(c) A → B ` ¬B → ¬A
Propositional calculus is pretty nice. We’ve managed to talk about a lot of logic
without being terribly technical. Our proof system, L, is slick. It’s consistent,
and has a deduction theorem, a soundness theorem, and a completeness theorem.
Propositional calculus is useful for analyzing lots of different sorts of ar-
guments. In particular, we can use it to understand the structure of lots of
mathematical proofs. For example, we know that mathematicians use the De-
duction Theorem every day to prove implications. Also, mathematicians follow
the format of L25 to prove biconditional statements.
The big disadvantage of propositional calculus is that it glosses over any fine
distinctions. It’s just not very expressive. For example, suppose we want to use
propositional calculus to formalize the statement “if n > 0 then n + 1 > 0.” If
we let P denote n > 0 and Q denote n+1 > 0, then our formalization is P → Q.
This certainly shows us that the statement is an implication, but it hides the
fact that the hypothesis and the conclusion are both talking about n. In order
to overcome this limitation, we need a logical system that includes variables.
Exercises.
1. Use propositional calculus to formalize the following argument.
Socrates is a man.
All men have ugly feet.
Socrates has ugly feet.
28 CHAPTER 1. PROPOSITIONAL CALCULUS
Predicate Calculus
Predicates
We use capital letters, (A, B, C, etc.) to represent predicates. A predicate
letter will usually be associated with a list of at least one variable. For example,
29
30 CHAPTER 2. PREDICATE CALCULUS
might represent the statement “x < y” or the statement “x and y are kinds of
fish.”
Sometimes, we’ll use special predicate symbols like =, ≤, or >. Rather than
writing the symbol in front of the variables, we’ll put it between the variables.
Thus, we would write x = y rather than = (x, y). Writing the predicate in front
is called prefix notation. Writing it in the middle is called infix notation. No
matter which notation we use or which symbols we use, we should specify any
intended meaning of the predicate symbols.
Terms
The list after a predicate symbol can include more than just variables. Any
term can be used in the list. Terms are either variables, constants, or functions
applied to terms.
• Functions are small letters (like f , g, and h); functions take as input a list
of terms and have a unique output.
Despite the fact that functions and variables are both denoted by small
letters, it is easy to keep them straight. Functions have lists associated with
them, just like predicates. For example, f (x, y) is a function f applied to
the variables x and y. Functions can act on other terms, too. For example,
g(x, a, h(z)) is the function g applied to the variable x, the constant a, and the
function h, where h is a function applied to z.
Functions and predicates differ in one very important respect. The value
of a function is an object, while the value of a predicate is a truth value. For
example, if we want to represent “the father of x,” it makes sense to use a
function like f (x). On the other hand, if we want to say “x is a father,” we
would use a predicate symbol, like P (x). If f (Chelsea) is Bill, then P (Bill) is
true.
What sort of objects these terms represents depend upon the context, i.e.,
the universe of objects that we are examining. If we’re talking about numbers,
x would represent a number. If we’re talking about milkmen, x would represent
a milkman. A variable represents a non-specific object, like some milkman.
A constant represents a particular object, like “Waldo the milkman who lives
up the street.” A function represents an object that is somehow related to the
objects in its variable list. For example, if x represents a milkman in the universe
of milkmen, then f (x) could represent the milkman who took over x’s old route.
If w represents Waldo the milkman, then f (w) represents the milkman who took
over Waldo’s old route.
2.2. TRANSLATIONS 31
Connectives
The connectives are ∧, ∨, →, ↔, and ¬. These are the same connectives we
used in propositional calculus, and they mean exactly the same thing.
Quantifiers
We’ll use ∀ and ∃ as our quantifier symbols. Quantifier symbols must be followed
by a single variable (never a constant or a function).
∀x is read as “for all x.”
∃x is read as “there exists an x.”
We’ll build all our formulas from the sorts of symbols listed above, inserting
parentheses where we need them. If we need to represent an entire formula with
a single letter, we’ll just use a capital letter (A, B, C, etc.). If there is any
possibility confusing a whole formula with a predicate, we’ll be sure to include
extra explanation. Usually, we will use A(x, y, z) to represent a formula that
includes the variables x, y, and z.
Rather than being really technical about what constitutes a properly con-
structed formula, let’s look at some examples.
2.2 Translations
We can translate English statements into predicate calculus, and vice versa. In
either case, we must be careful to specify what the symbols represent.
(a) ∀x(x · 0 = 0)
For all real numbers x, x times 0 equals 0.
(c) ∀x∃y(x · y = 1)
For all real numbers x, there is a real number y such that x · y = 1.
Most of the translations we have done to this point involved one quantifier.
Before proceeding let’s consider the following example, which will clarify the
convention on how to interpret multiple quantifiers:
Example. Let the universe be the set of all people, and the predicate L(x, y)
stand for x loves y (and equivalently, y is loved by x). Here are all of the possible
versions with x and y quantified. Notice the careful treatment of the translation
in each case. “Loves” is not assumed to be reflexive here.
Both variables quantified with ∀:
∀x∀yL(x, y): Everyone loves everyone.
∀y∀xL(x, y): Everyone is loved by everyone.
These sentences mean the same thing, and this will be true for all interpreta-
tions, as we will prove more formally later.
Both variables quantified with ∃:
∃x∃yL(x, y): Someone loves someone.
∃y∃xL(x, y): Someone is loved by someone.
Again, these sentences mean the same thing, and this will be true for all inter-
pretations, as we will prove more formally later.
First variable quantified with ∀:
∀x∃yL(x, y): Everyone loves someone.
∀y∃xL(x, y): Everyone is loved by someone.
These sentences do not mean the same thing. In the first case a more formal
translation would be “Every person has someone that they love.” In the second
case, “Every person has someone who loves them.” The order of the variables
is important here.
First variable quantified with ∃:
∃x∀yL(x, y): Someone loves everyone.
∃y∀xL(x, y): Someone is loved by everyone.
Again, these sentences do not mean the same thing. In the first case we have
“there is someone who loves all people.” In the second case, “there is someone
who is loved by everyone.” Notice also that none of the four alternations of
quantifiers means the same thing. Order is extremely important!
natural numbers. Let s(x) be the function x2 and P (x, y) be the predicate for
x ≤ y. We can formalize the statement as:
∀x P (x, s(x)).
On the other hand, we can use the ≤ and square symbols with their usual
meanings directly and use n as our variable name:
∀n(n ≤ n2 ).
What if we needed the universe to be all real numbers rather than all natural
numbers? How could we adapt the first version to handle this? We would need
to have a new predicate for “x is a natural number.” Let N (x) stand for this:
Example. Now let’s try a more complex example: Socrates is a man. All men
have ugly feet. Socrates has ugly feet. Again, we can proceed in several ways.
Let the universe be the set of all people. Let U (x) be the predicate x has ugly
feet. Let M (x) be the predicate x is a man. Let s be the constant in the universe
representing the man Socrates. Then the three statements above translate as
follows.
M (s) ∀x(M (x) → U (x)) U (s)
Example. Let’s look at a statement that uses a function in a non-mathematical
universe. Translate: Each man has a father. There are many ways to proceed.
Let the universe be all men. Let f (x) be the function “ father of x.” Let S(x, y)
mean is that x same person as y. Then we can formalize the statement as
∀x∃yS(y, f (x)).
Exercises.
1. Assuming that the universe consists of all people, f (x) means “father of
x,” and B(x) means “x is the chancellor,” translate the following into
English.
(a) ∀x¬B(f (x))
(b) ∃xB(x)
(c) ∃x¬B(x)
2. Assuming that the universe consists of all people, f (x) means “father of x,”
D(x) means “x is tiny,” and w represents Waldo, translate the following
into English.
(a) ∀x(D(x))
(b) D(w)
(c) ∀x(D(f (x)) → D(x))
34 CHAPTER 2. PREDICATE CALCULUS
(d) ¬∃x(D(x))
3. Assuming that the universe is the set of natural numbers, E(x) means “x
is even,” O(x) means “x is odd,” and S(x) means “x is a multiple of 3,”
translate the following into English.
(a) ∀x(E(x) ∨ O(x))
(b) ∀x(S(x) → O(x))
(c) ∃x(S(x) ∧ ¬E(x))
(d) ∀x(O(x)) → ∀x(E(x))
4. Assume the universe is all real numbers and L(x, y) means x is less than
y. Match each formula in the first list with a translation in the second
list.
(a) ∀x∀yL(x, y)
(b) ∀y∀xL(x, y)
(c) ∃x∃yL(x, y)
(d) ∃y∃xL(x, y)
(e) ∀x∃yL(x, y)
(f) ∀y∃xL(x, y)
(g) ∃x∀yL(x, y)
(h) ∃y∀xL(x, y)
i. There is a real number that is greater than any real number.
ii. There is a real number that is less than any real number.
iii. Given any real number, we can find a greater real number.
iv. Given any real number, we can find a lesser real number.
v. If x and y are reals, then x is less than y.
vi. There are reals x and y such that x is less than y.
5. Formalize: No elbow is an ankle.
6. Assume that the universe is the set of all fish. Using the information
below, formalize each of the given statements.
• T (x) means “x is a trout”
• S(x) means “x is shiny”
• J(x) means “x jumps”
8. Formalize the four statements below, assuming that the universe is all
men.
So now we have two tasks. First, let’s look at how quantifiers act on for-
mulas. Then we’ll specify exactly what information we need to generate good
translations of formulas. That’s what is in the next two sections. Once we have
the tools we need, we’ll talk about truth.
The predicate M (x, y, f (x, z)) is included in the scope of quantifiers on the
variables x and y. The predicate G(x, y, z) is not in the scope of any quantifier.
Underlining the free variables gives us:
Exercises.
1. Underline the free occurrences of variables in the following formulas.
(a) ∀x(P (x, y) → ∃z(P (x, z)))
2.5 Models
Back in section 2, we were given information that we used to translate formulas
into English. A list of information used in translations is called a model. Our
models must include:
• a universe,
• interpretations of all predicate symbols,
• interpretations of all function symbols, and
• interpretations of all constant symbols.
There are a few rules. The interpretations of the predicate symbols must make
sense for everything in the universe. The interpretations of the function symbols
must be functions that are defined for everything in the universe and take values
in the universe. Finally, the constants must be specific elements of the universe.
There is a lot of freedom in defining a model. We can make the predicates,
functions, and constants mean pretty much whatever we like. Notice that we
don’t get to redefine the quantifiers or connectives, though. ∀x always means
“for all x”, and ∨ always means “or”. Some things never change.
Example. Construct three different models where the formula
1. Let the universe be all real numbers, let P (x, y) represent x is greater than
y, and let B(x) represent x is rational. A translation in this model is:
For all real numbers x, there is a corresponding real number y where either
x is greater than y or x is rational.
2. Let the universe be all people, let P (x, y) represent x is y’s father, and let
B(x) represent x is deceased. A translation in this model is:
For all people x, there is a corresponding person y where either x is y’s
father or x is deceased.
3. Let the universe be cans of soup, let P (x, y) represent x was canned after
y, and let B(x) represent x is too old to eat. A translation in this model
is:
For all cans of soup x, there is a corresponding can y where either x was
canned after y or x is too old to eat.
1. Let the universe be real numbers, f (x, y) denote x·y (usual multiplication),
0 denote 0, and L(x, y) denote x > y (usual inequality). A translation in
this model is:
For every real number x, we can find a real number y such that x · 0 > y.
(Note that this statement is true in this model.)
3. Let the universe be all people and suppose f (x, y) denotes the youngest
person in the set {y, the father of x}. Note that for any pair of people
chosen, the function yields a person. Let 0 denote Zeno, and let L(x, y)
denote “x was born before y was born.” A translation in this model is:
For every person x, we can find a person y such that the younger of x’s
father and Zeno was born before y was born. (This statement is true in
this model, since if x’s father is younger than Zeno we can set y to be x,
and otherwise we can let y be Einstein.)
Exercises.
∀x∃y(x = y ∨ x + 1 > y)
∀x∃y(x = y ∧ f (x) = y)
1. ∀xC(x)
“All natural numbers are multiples of 10” is false, since 9 is not a multiple
of 10.
2.6. TRUTH AND SENTENCES 41
2. ∀xL(x)
“All natural numbers are even” is false, since 3 is not even.
3. ∃x(C(x) ∧ L(x))
“There is a natural number that is both even and a multiple of 10” is true.
For example, 20 is such a number.
4. ∃x(L(x) ∧ ¬C(x))
“There is a natural number that is both even and not a multiple of 10” is
true. For example, 4 is such a number.
5. ∀x(L(x) → C(x))
“For all natural numbers, being even implies being a multiple of 10” is
false, since 4 is even but not a multiple of 10.
Exercises.
1. Let K be the model where the universe is the natural numbers, G(x, y)
means x is greater than y and h(x) represents the function x + 1. Decide if
the following sentences are true in K or false in K. Justify your answers.
(a) ∀x∀yG(x, y)
(b) ∀xG(h(x), x)
(c) ∀x∃yG(x, y)
(d) ∀y∃xG(x, y)
(e) ∀x∀y(G(x, y) → ∃z(G(x, z) ∧ G(z, y)))
2. Let K be the model where the universe is the real numbers, G(x, y) means
x is greater than y and h(x) represents the function x + 1. Decide if the
following sentences are true in K or false in K. Justify your answers.
(a) ∀x∀yG(x, y)
(b) ∀xG(h(x), x)
(c) ∀x∃yG(x, y)
(d) ∀y∃xG(x, y)
(e) ∀x∀y(G(x, y) → ∃z(G(x, z) ∧ G(z, y)))
3. Find a model M where the sentence ∀x∃yA(x, y) is true, and the sentence
∃y∀xA(x, y) is false.
4. Can you find a model M where the sentence ∀x∃yA(x, y) is false, and the
sentence ∃y∀xA(x, y) is true? Explain.
42 CHAPTER 2. PREDICATE CALCULUS
Example. Let M be the model where the universe is the real numbers, G(x, y)
means x > y, and h(x) represents the function x + 1. Classify the following
formulas as true in M , false in M , or satisfiable in M .
1. G(x, x)
This is false in the model, because there does not exist a real number that
is less than itself.
2. G(x, h(x))
This is false in the model, because there does not exist a real number that
is greater than one plus itself.
3. G(0, x)
This is satisfiable in the model, because 0 is greater than negative two
plus one, so there is a number for which the formula is true. Also, not all
real numbers work, so the formula is satisfiable but not true.
4. G(h(x), 0)
This is satisfiable in the model, because one plus one is greater than zero,
so there is a number for which the formula is true. Also, not all real
numbers work, so this formula is satisfiable but not true.
5. G(x, y)
This is satisfiable in the model, because we can find numbers x and y
where x < y. Also, not all real numbers would work, so the formula is
satisfiable but not true.
2.7. TRUTH AND FREE VARIABLES 43
Example. Let K be the model where the universe is the natural numbers
N = {0, 1, 2, ...}, G(x, y) means x > y, and h(x) represents the function x + 1.
Classify the following formulas as true in K, false in K, or satisfiable in K.
This is the same set of formulas as in the previous example; note how the truth
values are different.
1. G(x, x)
False – same as part 1 above.
2. G(x, h(x))
False – same as part 2 above.
3. G(0, x)
False – not the same as part 3 above because we are now looking at the
natural numbers, which do not include the negative numbers.
4. G(h(x), 0)
True – not the same as part 4 above because we are now looking at the
natural numbers, which do not include the negative numbers.
5. G(x, y)
Satisfiable but not true– same as part 5 above.
Exercises.
1. Consider the formula P (x, 0). Find a model where this formula is true, a
model where it is satisfiable but not true, and a model where it is false.
2. Consider the formula ∀xS(x, y). Find a model where this formula is true,
a model where it is satisfiable but not true, and a model where it is false.
3. Consider the formula ∃yR(x, y). Find a model where this formula is true,
a model where it is satisfiable but not true, and a model where it is false.
Exercises.
1. Give an example of a logically valid formula.
Example. Show that ∀x∃yC(x, y) does not logically imply ∃y∀xC(x, y).
We will need to construct a model where ∀x∃yA(x, y) is true, but ∃y∀xA(x, y)
is false. Here is an entertaining graphical technique for building finite models.
Let the universe be {0, 1}. Draw an arrow from from 0 to 1 and a second arrow
from 1 to 0. Let A(x, y) mean that there is an arrow from x to y. For every choice
of x, there is an arrow that starts at x, so ∀x∃yA(x, y) is true in this model.
On the other hand, A(0, 0) and A(1, 1) are both false, so ∃y∀xA(x, y) is false.
46 CHAPTER 2. PREDICATE CALCULUS
Exercises.
1. Show that ∀x(A(x) ∧ B(x)) is not logically valid.
2. Show that ∀x∃yC(x, y) is not logically valid.
3. Show that ∃x(A(x) → B(x)) does not logically imply (∃xA(x)) → (∃xB(x)).
4. Show that ∀x∃yC(x, y) does not logically imply ∃zC(z, z).
5. Show that (∀xA(x)) ↔ (∀xB(x)) is not logically equivalent to ∀x(A(x) ↔
B(x)).
6. Show that the formula
∃x∀yQ(x, y) → ∃x∀yQ(x, y)
A(x) → A(x)
Similarly, A → (B → A) is a tautology, so
is an instance of a tautology.
Here is the fact that makes instances of tautologies interesting. Every in-
stance of a tautology is logically valid. Since we can easily construct instances
of tautologies, we can easily list lots of logically valid formulas. We’ve already
seen three:
∃x∀yQ(x, y) → ∃x∀yQ(x, y)
2.10. SOME LOGICALLY VALID FORMULAS 47
A(x) → A(x)
∀x∃yC(x, y) → (∀xA(x) → ∀x∃yC(x, y))
Note that the formula A(x) → A(x) has a free variable. Since we know it
is logically valid, we know it is true in every model. Now A(x) → A(x) is true
in a model M exactly when ∀x(A(x) → A(x)) is true in M . Formulas that are
true in every model are logically valid, so ∀x(A(x) → A(x)) is logically valid.
The reasoning of the preceding paragraph works for any formula and any
variable. If we know that P is a logically valid formula, then so are the formulas
∀xP , ∀yP , ∀x∀yP , and so on. We can use this rule to build logically valid
formulas that are not instances of tautologies. For example, A(x) → A(x) is an
instance of a tautology, ∀x(A(x) → A(x)) is not an instance of a tautology, but
both are logically valid.
Here’s another way to build more logically valid formulas. If a formula P
is true in a model M , then it is satisfiable in M . Since P is satisfiable in M ,
the formula ∃xP is true in M . Consequently, if P is logically valid, then so is
∃xP . As with adding universal quantifiers, this works for any formula P and
any variable x. We can combine this with our previous work to build more
complicated logically valid formulas. For example, ∀x∃y(C(x, y) → ¬¬C(x, y))
is logically valid, but not an instance of a tautology.
Summarizing, any instance of a tautology is logically valid. Any formula got-
ten by stringing quantifiers in front of a logically valid formula is logically valid.
Not every logically valid formula is an instance of a tautology. Indeed there are
logically valid formulas that simply cannot be built using the techniques of this
section.
Exercises.
1. Each of the following formulas is logically valid. Mark those that are
instances of tautologies.
(a) A(x) → (∀yB(y) → A(x))
(b) ∀x(A(x) → (∀yB(y) → A(x)))
(c) A(x) → (¬B(y) ∨ B(y))
(d) ∃x(A(x)) → (¬B(y) ∨ B(y))
(e) ∃x(¬¬A(x) → (¬B(y) ∨ B(y)))
(f) ∃y∃x(¬¬A(x) → (¬B(y) ∨ B(y)))
2. Each of the following formulas is logically valid. Mark those that are
instances of tautologies.
(a) C(x, y) → C(x, y)
(b) ∀x∃y(C(x, y) → C(x, y))
(c) ∀x∃yC(x, y) → ∀x∃y¬¬C(x, y)
(d) ∀x∃yC(x, y) → ¬¬∀x∃yC(x, y)
48 CHAPTER 2. PREDICATE CALCULUS
∀xP (x, h(x)) – As with the last substitution, this one ”binds” a variable
that was previously free.
When working with formulas in predicate calculus, we will not allow substi-
tutions like the last two. The first two are fine. We can nail this concept down
with a single definition. It’s a little technical, but that’s what makes it precise
and useful.
We can approximate this definition as follows. It’s fine to plug in a term for
a free variable if none of the variables in the term are accidentally captured by
quantifiers.
Example. Consider the terms x, y, f (x, y), and 3. Determine which of these
terms are free for x in each formula below.
1. ∀y(A(x, y) ∨ B(z))
Terms:
(a) x is free for x in A(x) ∨ ∀z(C(z, z) ∧ B(z, y)); A(x) is not in the scope
of any quantifiers and x is always free for x in any formula anyway.
(b) y is free for x in A(x) ∨ ∀z(C(z, z) ∧ B(z, y)); A(x) is not in the scope
of any quantifiers.
(c) f (x, y) is free for x in A(x) ∨ ∀z(C(z, z) ∧ B(z, y)); A(x) is not in the
scope of any quantifiers.
(d) 3 is free for x in A(x) ∨ ∀z(C(z, z) ∧ B(z, y)); A(x) is not in the scope
of any quantifiers, and constants can never be captured by quantifiers
anyway.
Terms:
(a) x is free for x in the formula B(y) → ∀y(A(x, z) ∧ ∃xC(x, y)); the
only free occurrence of x is in the A(x, z) predicate, and x is always
free for x.
(b) y is not free for x in the formula B(y) → ∀y(A(x, z) ∧ ∃xC(x, y)); x
occurs free in the A(x, z) predicate, and y will be captured by the ∀y
quantifier.
(c) f (x, y) is not free for x in the formula B(y) → ∀y(A(x, z)∧∃xC(x, y));
x occurs free in the A(x, z) predicate, and the y in f (x, y) will be
captured by the ∀y quantifier.
(d) 3 is free for x in the formula B(y) → ∀y(A(x, z) ∧ ∃xC(x, y)); con-
stants can never be captured
4. d) ∀x∃yD(x, y, z)
Terms: There are no free occurrences of x to plug in for in this formula.
Consequently, every term is free for x in ∀x∃yD(x, y, z). This is a weird
case, but the idea is not too hard. If there is no place to plug in, then you
can plug in anything safely.
Here’s a summary of all the work done above. In the table below, an OK appears
if the term is free for x in the formula. If not, then an X appears.
x y f(x,y) 3
∀y(A(x, y) ∨ B(z)) OK X X OK
A(x) ∨ ∀z(C(z, z) ∧ A(z, y)) OK OK OK OK
B(y) → ∀y(A(x, z) ∧ ∃xC(x, y)) OK X X OK
∀x∃yD(x, y, z) OK OK OK OK
Let’s summarize some shortcuts. We can plug x in for x in any formula,
and not worry. We can plug a constant symbol in for x in any formula, and not
worry. Note that we only ever plug terms into free occurrences of variables. We
never plug terms of any sort into bounded occurrences of variables.
Exercises.
1. Use the following lists of formulas and terms to solve the exercises below.
Formulas: Terms
1. ∀y(A(x, y) ∨ B(z)) (a) x
2. A(x) ∨ ∀z(C(z, z) ∧ A(z, y)) (b) y
3. B(y) → ∀z(A(x, z) ∧ ∃xC(x, y)) (c) f (x, y)
4. ∀x∃yD(x, y, z) (d) 3
2.12. A PROOF SYSTEM FOR PREDICATE CALCULUS 51
(a) Determine which of the terms are free for y in each formula.
(b) Determine which of the terms are free for z in each formula.
2. Use the following lists of formulas and terms to solve the exercises below.
Formulas: Terms
1. ∀xA(x, y, z) (a) x
2. ∀yB(x, y) ∨ ∀zC(z, y) (b) y
3. ∀y(B(x, y) ∨ ∀zC(z, y)) (c) h(z, 3)
4. ∀y∀z(B(x, y) ∨ C(z, y)) (d) g(x, y, z)
(a) Determine which of the terms are free for x in each formula.
(b) Determine which of the terms are free for y in each formula.
(c) Determine which of the terms are free for z in each formula.
Axioms
Any formulas of predicate calculus may be substituted for A, B, and C in the
following schemes. Also, other variables may be substituted for the use of x in
Axiom 4 and Axiom 5.
Axiom 1: A → (B → A)
Rules of inference
Modus Ponens (MP): From A and A → B, deduce B.
Be careful with Axioms 4 and 5. It’s vital to pay attention to the extra
conditions on free occurrences of x. Here are two examples of common mis-
takes. First, the formula ∀x∃yB(x, y, z) → ∃yB(y, y, z) is not a correct instance
of Axiom 4 because the variable y is not free for x in ∃yB(x, y, z). Second,
the formula ∀x(∃yA(x, y) → B(x)) → (∃yA(x, y) → ∀xB(x)) is not a correct
instance of Axiom 5 because x occurs free in ∃yA(x, y).
Proofs in K look a lot like proofs in L. Here is an example of a short proof.
Exercises.
1. Build two instances for each of the axioms, showing the substitution made
in each case.
2. Are the following instances of one of the axioms? If so, show the substi-
tution made.
(a) ∀x∀yA(x, y) → (∃xB(x) → ∀yA(x, y)).
(b) ∀x∀yA(x, y) → ∀yA(y, y).
(c) ∀x∀yA(x, y) → ∀yA(x, y).
(d) ∃xB(x) → ((∀x∃yC(x, y) → A(x)) → ∃xB(x)).
In order to make efficient use of our new proof system, K, we could use some
shortcuts. When we learned to write proofs in L, progress was painful until
54 CHAPTER 2. PREDICATE CALCULUS
we learned the Deduction Theorem for L. One of the shortcuts we will learn
is a Deduction Theorem for Predicate Calculus. The next four sections consist
entirely of shortcuts and proof strategies for our new proof system.
1. ∀x∀yA(x, y) Given
2.14 Rule T
Since Axiom 1, Axiom 2, Axiom 3 and MP are all included in K, every proof
in L is also a proof in K. The following rule lets us use all the results we could
have proved in L, even if we didn’t actually do the proof.
The only use of GEN in this proof is on the variable x, which does not occur
free in the hypothesis. Thus, we can apply the deduction theorem to obtain
` ∀x(A(x) ∧ B(x)) → ∀xA(x), as desired.
In the preceding proof, we actually used the following restricted version of
the deduction theorem:
Corollary. If A ` B and A has no free variables, then ` A → B.
The restriction on applications of generalization to variables not occurring
free in the hypothesis is a vital part of the statement of the deduction theorem.
Without it, our proofs would not be sound. For example, consider the following
correct proof of x = y ` x = 0.
1. x = y Given
2. ∀y(x = y) GEN, line 1
3. ∀y(x = y) → x = 0 Axiom 4
4. x = 0 MP, lines 2 and 3
Note that the application of GEN in line 2 to the variable x, which occurs free
in x = y, precludes us from applying the deduction theorem. This is a good
thing, because if we incorrectly applied the deduction theorem, we could deduce
x = y → x = 0, which could be generalized to ∀x∀y(x = y → x = 0). This
formula is not logically valid, since in the model with the natural numbers as
its universe and the usual interpretation of = and 0, the implication 1 = 1 →
1 = 0 is false. Just one misapplication of the deduction theorem destroys the
soundness of our proof system. Always check for uses of GEN before applying
the deduction theorem.
Can every proof done using the deduction theorem be done without using
the deduction theorem? Sure! That’s essentially what the deduction theorem
says. At this point, we have enough tools to do a lot of proofs without too much
effort.
Exercises.
Prove the following in K. You may use any theorem with a lower number in
your proof.
2.16 Adding ∃x
If we have ∀xA(x), we ought to be able to prove ∃xA(x). In order to do this,
we need to be able to add ∃x to a formula. Here’s an example of how to do this
using the shortcuts we have so far.
Theorem K 11. A(x) ` ∃xA(x)
1. A(x) Given
2. ∀x¬A(x) → ¬A(x) Axiom 4
3. (∀x¬A(x) → ¬A(x)) → (A(x) → ¬∀x¬A(x)) Rule T
4. A(x) → ¬∀x¬A(x) MP, lines 2 and 3
5. ¬∀x¬A(x) MP, lines 1 and 4
6. ∃xA(x) Abbreviation of line 5
We can extract the content of the proof of Theorem K11, and create an
Add-∃x Rule that handles even more cases. We need to be careful that the
hidden use of Axiom 4 is acceptable. In the following, the clauses requiring that
A(t) is the result of replacing every free occurrence of x in A(x) with t and t is
free for x in A(x) insure that ∀x¬A(x) → ¬A(t) is a correct instance of Axiom
4. This allows us to modify the proof of K11 to obtain a proof of ∃xA(x) from
A(t). Stating the rule will save us from having to mess with contrapositives
every time we want to tack on an existential quantifier.
Add ∃x Rule: If A(t) is the result of replacing every free occurrence of x in A(x)
with t, and t is free for x in A(x), then from A(t) we may deduce ∃xA(x).
In short, if ∀xA(x) → A(t) is a correct instance of Axiom 4, then from
A(t) we may deduce ∃xA(x).
Here is an application of the Add ∃x Rule that makes use of the ability to
substitute for a term. In the following, A(y, y) is the result of replacing every
free occurrence of x in A(x, y) with y, and y is free for x in A(x, y). Thus the
Add ∃x Rule allows us to deduce ∃xA(x, y) from A(y, y).
Theorem K 12. A(y, y) ` ∀y∃xA(x, y)
58 CHAPTER 2. PREDICATE CALCULUS
1. A(y, y) Given
Exercises.
1. Prove:
(a) Prove K14 using the deduction theorem, Axiom 4 and K13.
(b) Prove K14 using the Add ∃x Rule.
3. Prove:
4. Prove the following theorems. Like the proof of Theorem K11, these re-
sults use the fact that ∃xA(x) is an abbreviation for ¬∀x¬A(x). Math-
ematicians use results like these whenever they “push negations past a
quantifier.”
(a)
Theorem K 16. ¬∃xA(x) ` ∀x¬A(x)
(b)
Theorem K 17. ∀x¬A(x) ` ¬∃xA(x)
(c)
Theorem K 18. ∃x¬A(x) ` ¬∀xA(x)
(d)
Theorem K 19. ¬∀xA(x) ` ∃x¬A(x)
2.17 Removing ∃x
So far, we have strategies for adding and removing ∀x, and a rule for adding
∃x. To complete our survey of techniques for manipulating quantifiers, we need
a rule for removing ∃x.
Informally, if we have ∃xA(x), we should be able to find some element to
plug in for x. If we give that element a temporary name, we could proceed
with our proof. The best thing to use as a name is a constant symbol. If we
2.17. REMOVING ∃X 59
use a constant symbol that already appears in the proof (or appears in some
weird axioms that we plan to use later), we will be implicitly making additional
assumptions about the element. Consequently, we want to use a new constant
symbol. Here’s the rule, presented more formally.
1. ∀y(y = y) Given
2. ∀y(y = y) → y = y Axiom 4
3. y = y MP, lines 1 and 2
Rule again in line 7. It is legal there, too. The only illegal step is in line 6,
where we apply GEN to a variable that appears free in line 4, the formula to
which we applied Rule C. That violates the second condition of Rule C, and it
is a very bad idea. There is a model where ∀y(y = y) is true, but ∃x∀y(x = y)
is false, so ∀y(y = y) does not logically imply ∃x∀y(x = y). The illegal use of
GEN with Rule C has destroyed the soundness of our proof system.
We’ve seen the bad effects of violating the conditions of Rule C. However,
if we can prove A using Rule C correctly, then we can prove A without using
Rule C. Consequently, correct uses of the shortcut Rule C do not mess up the
completeness and soundness theorems for our proof system. One way to prove
this is to construct an algorithm that converts proofs that use the shortcut
to proofs that do not. This sort of argument closely resembles a proof of the
deduction theorem, and is used in [8].
Exercises.
Use Rule C to prove the following:
Of course, we also can use techniques from propositional calculus like apply-
ing the deduction theorem or proving the contrapositive as a lemma. Here are
some problems that use a variety of methods.
Exercises.
Transition to Informal
Proofs
We just spent the whole last chapter talking about predicate calculus. Two of
the important theorems we discussed were the Completeness Theorem and the
Soundness Theorem. These two theorems can be summarized as follows: The
formulas that are provable in predicate calculus are exactly the logically valid
formulas.
These results indicate both the main weakness and the main strength of
predicate calculus. On the one hand, the only formulas we can prove are the
logically valid formulas, which will always be true in every model. However, most
of the interesting formulas in mathematics state properties that are peculiar to
the integers, or the reals, or to some other entertaining specific model. For
example,
∀x(P (x) ∨ ¬P (x))
is true in absolutely every model, but
P(x) := x is prime
O(x) := x is odd
In this model (a particular model), the second statement above translates to:
All prime natural numbers bigger than 2 are odd. This is a true statement, but
since it is not logically valid, we need new axioms in order to prove it.
63
64 CHAPTER 3. TRANSITION TO INFORMAL PROOFS
Exercises.
Prove the following in E.
Theorem E 2. `E x = y → (y = z → x = z) (This is often called the transitive
law of equality.)
Theorem E 3. `E (x = y ∧ x = z) → y = z (This is an axiom of Euclid:
things equal to the same thing are equal to each other.)
Theorem E 4. `E x = y → ∀z(f (z, x) = f (z, y))
Note that Theorem E4 holds regardless of the choice of f (x, z). For example,
we could replace f (x, z) by x + z, x − z, x · z, xz , z x , or any other two place
function.
66 CHAPTER 3. TRANSITION TO INFORMAL PROOFS
Axiom 8: ∀x∀y(x = y → x0 = y 0 )
Axiom 9: ∀x(0 6= x0 )
Axiom 10: ∀x∀y(x0 = y 0 → x = y)
Axiom 11: ∀x(x + 0 = x)
Axiom 12: ∀x∀y(x + (y 0 ) = (x + y)0 )
Axiom 13: ∀x(x · 0 = 0)
Axiom 14: ∀x∀y(x · (y 0 ) = (x · y) + x)
Axiom 15: If A(x) is a formula of PA, then
We can easily paraphrase what these axioms say. Axioms 8, 9 and 10 say
that equality acts the way we expect equality to act related to numbers and
successors. Axiom 9 says that 0 is the least counting number. Axiom 10 says
that 0 is the additive identity. Axioms 11 and 12 outline the behavior of addition,
and Axioms 13 and 14 do the same for multiplication. Axiom 15 says that we
can use induction to prove facts about the counting numbers. All in all, this
seems like a very reasonable list of properties of the natural numbers.
The language of P A is very expressive. That is, lots of properties of the
natural numbers can be written as formulas of P A. Here are some examples.
Example. Each of the following mathematical concepts is presented with its
formalization in PA. Note that these are just formulas representing properties
of natural numbers, not provable statements.
1. x is even: ∃k(x = 2 · k)
(2 is an abbreviation for 000 .)
3.2. FORMAL NUMBER THEORY 67
2. x is odd: ∃k(x = 2 · k + 1)
(1 is an abbreviation for 00 .)
3. y|x (y divides x evenly): ∃k(x = y · k) )
4. x ≤ y (x is less than or equal to y): ∃k(x + k = y)
5. x < y (x is strictly less than y): ∃k(x + k = y ∧ k 6= 0)
(k 6= 0 is an abbreviation for ¬(k = 0.)
6. x is a prime number: 1 < x ∧ ∀y(y|x → (y = 1 ∨ y = x))
(y|x is an abbreviation for ∃k(x = y · k), as shown in part 3.)
Besides being able to express a multitude of number theoretical concepts,
PA can actually prove gobs of facts about the natural numbers. Here are some
additional statements and their formalizations, each of which can be proved in
PA.
Example. Each of the following mathematical statements is presented with its
formalization in PA. Each of these properties of natural numbers can be proved
from the axioms of PA. Since GEN is included in PA, we can also prove closed
versions of these statements with universal quantifiers in the front.
1. Addition is commutative: x + y = y + x
2. Addition is associative: x + (y + z) = (x + y) + z
3. Multiplication is distributive over addition: x · (y + z) = x · y + x · z
4. Strict inequality is transitive: x < y → (y < z → x < z)
5. Inequality is preserved by addition: x ≤ y → x + z ≤ y + z
6. Strict inequality is preserved by addition: x < y → x + z < y + z
7. Inequality is preserved by multiplication: x ≤ y → x · z ≤ y · z
8. Strict inequality is preserved by nonzero multiplication:
z 6= 0 → (x < y → x · z < y · z)
9. Inequalities can be added: (w ≤ x ∧ y ≤ z) → w + y ≤ x + z
10. Inequalities can be multiplied: (w ≤ x ∧ y ≤ z) → w · y ≤ x · z
We keep asserting that statements can be proved in PA without providing the
proof. There is a reason for this. Generally speaking, the proofs are somewhat
long and complicated, though the next two examples aren’t too bad.These first
two results give a formal proof that 0 · n = 0. From Axiom 13, we know that
n · 0 = 0, but since commutativity of multiplication is not one of our axioms, we
have the prove the new statement. Indeed, this can be used as an initial step
in a proof that multiplication is commutative. (See the exercises.) Remember
that in our proofs in PA, we can use axioms of PA, and results from L, K and
E.
68 CHAPTER 3. TRANSITION TO INFORMAL PROOFS
Exercises.
1. Translate the following statements into formulas of P A. (Note that each of
these could actually be proven in P A, but that is not part of this exercise.)
3. Prove:
Theorem PA 3. ∀n(n = 0 + n)
4. Prove:
Theorem PA 4. ∀x∀y(x + y = y + x)
5. Prove:
Theorem PA 5. ∀x∀y(x · y = y · x)
0 · (n0 ) = 0 · n + 0 Axiom 14
=0·n Axiom 11
=0 Induction hypothesis
Note that Theorem PA6 has a formal proof in PA, though we will not bother
to write it up. Also, in the statement of the theorem, we use k ≤ n as an
abbreviation for ∃x(k + x = n), applying our work from part 4 of the example
on page 67.
We can use Theorem PA6 to create informal proofs in exactly the same way
that we used Axiom 15 before. We will call these proofs by induction with a
shifted starting point. If we want to prove that A(n) holds for all n ≥ k, we can
use the following three steps.
1. Base case: (n = k) Prove that A(k) holds.
3.3. MORE ABOUT INDUCTION 71
The starting point of the summation can be a value other than 0. For example,
5
X 1 1 1 1 1
= + + + .
i=2
i 2 3 4 5
Shifting the starting points and endpoints allow us to rewrite sums in various
convenient formats. For example,
n
X n
X n
X n−2
X
i=1+ i=1+2+ i = (n − 1) + n + i.
i=1 i=2 i=3 i=1
Note that for finite summations, any rearrangement of the terms yields the same
final sum.
Using our new notation, we can state a nice theorem which we will then
prove by an informal induction argument with a shifted starting point. Since
this is the first result that we will prove using strictly informal methods, we will
call it Theorem 1.
n
X n(n + 1)
Theorem 1. For all n ≥ 1, i= .
i=1
2
Before writing the proof, we should do some scratch work. We will need
to use induction with the starting point kP = 1. In the induction step, our
n
induction hypothesis will be that n ≥ 1 and i=1 i = n(n + 1)/2. We will want
Pn0 n+1
to prove that i=1 i = n0 (n0 + 1)/2, that is, i=1 i = (n + 1)(n + 2)/2. Now
P
Pn+1 Pn
i=1 i = ( i=1 i) + (n + 1), so after applying the induction hypothesis we just
need to show that n(n + 1)/2 + (n + 1) = (n + 1)(n + 2)/2. We can prove this
by rewriting the second (n + 1) on the left side of the equation as 2(n + 1)/2
to achieve a common denominator, and then adding fractions and simplifying.
Now we have all the elements of the proof. A little reorganization and we will
have a nice informal proof. We are done with the scratch work; here we go with
the proof.
Proof. We will use induction with a shifted starting point.
1
X 1(1 + 1)
Base case: Note that i=1= , so the theorem holds for n = 1.
i=1
2
72 CHAPTER 3. TRANSITION TO INFORMAL PROOFS
n
X n(n + 1)
Induction step: Suppose that 1 ≤ n, and i= . Then,
i=1
2
n+1 n
!
X X
i= i +n+1
i=1 i=1
n(n + 1)
= +n+1 (by the induction hypothesis)
2
n(n + 1) 2(n + 1)
= +
2 2
n(n + 1) + 2(n + 1)
=
2
(n + 1)(n + 2)
= .
2
n
X n(n + 1)
Conclusion: By induction, we have shown i= for all n ≥ 1.
i=1
2
Exercises.
1. Prove the following using informal induction arguments. You may use any
previously proved theorems of PA, the algebraic properties in the example
on page 67, and basic high school algebra facts.
n
2k = 2n+1 − 1
P
Theorem 2.
k=0
n
2k = n2 + n
P
Theorem 3.
k=1
Theorem 4. k ≥ 1 → 8|(9k − 1)
Theorem 5. n ≥ 3 → n2 ≤ 5n!
3. Find a formula in the language of PA with the free variables n and x that
formalizes the statement “x is the sum of the natural numbers less than
or equal to n.”
k
k2 +k+2
P
Non-Proof. Assume the statement is true for n = k, so i= 2 , and
i=1
consider
k+1 k
X X k2 + k + 2
i = (k + 1) + i = (k + 1) + .
i=1 i=1
2
Expanding out the binomials and combining terms over the common denomi-
nator gives:
k+1
X 2k + 2 + k 2 + k + 2 (k + 1)2 + (k + 1) + 2
i= =
i=1
2 2
A = S 6 {x}
B = S 6 {y}
A and B are sets of k horses, and thus by the induction hypothesis each contains
horses of one color. Now choose any z contained in A ∩ B. z and y are both
in A and are therefore the same color. z and x are both in B and are therefore
the same color as well. Thus x is the same color as y.
What is wrong here? We know that the statement is false! Somehow induc-
tion was not properly done. The error is a subtle one. We chose horses x 6= y
out of our set of horses S – that’s two horses – and then we chose another horse
z from S. That makes three horses in the smallest S possible for the induction
step to make sense. So the base case should have been two horses, which is
false! The moral of this example is: Think very carefully about the base step.
Now that we have mastered induction and seen some errors to avoid, it
is very tempting to use our new hammer on every nail we see. Induction is
not always the most direct approach to proving a theorem. In an induction
proof, if the induction hypothesis is not used in the proof of the induction step,
then induction can be avoided. The direct proof will resemble the proof of
the induction step with the base case omitted. The next exercise illustrates
this situation. Our erroneous proof of Non-theorem 2 looked like an induction
step with the base case omitted, but in that argument we used the induction
hypothesis. In the exercise, we can shorten the proof while avoiding the error.
Exercise.
1. Consider the theorem:
Theorem 10. n is odd → n2 − 1 is divisible by 4.
(a) Prove this theorem by induction. (Your induction step will not re-
quire the use of the induction hypothesis.)
(b) Prove this theorem without using induction.
1. ¬A → (B ∧ ¬B) Lemma
1. ¬(x 6= 00 ) Given
2. ¬(x 6= 00 ) → x = 00 Rule T
5. ∀x(x = 00 ) → 0 = 00 Axiom 4
7. ∀x(0 6= x0 ) Axiom 9 of PA
8. ∀x(0 6= x0 ) → 0 6= 00 Axiom 4
Proof. We will assume the negation of the theorem, giving us n2 is even and
n is not even. We expect to find a contradiction. Since n is not even, it is not
a multiple of 2, so there is a k such that n = 2 · k + 1. Thus,
n2 = n · n = (2k + 1) · (2k + 1)
= 4k 2 + 4k + 1
= 2(2k 2 + 2k) + 1
Theorem 12. At a party of more than two people, there are at least two people
who have the same number of friends at the party.
Proof. We will prove this by contradiction. Assume that there are n people at
the party and that no two people have the same number of friends at the party.
This means that we can match each person to their number of friends and list
them in order:
Since person n has n − 1 friends present, every person at the party is a friend
of person n. In particular, person 1 is a friend of person n. However, person 1
has 0 friends at the party, so person 1 is no friend of person n. Contradiction!
So there are at least two people with the same number of friends present.
78 CHAPTER 3. TRANSITION TO INFORMAL PROOFS
Besides doing party tricks, proof by contradiction can be used to prove the
next interesting theorem about prime numbers. Recall from item 6 in the ex-
ample on page 67 that n is prime if and only if both n > 1 and if k divides n
then k = 1 or k = n.
Theorem 13. There are infinitely many primes.
Proof. We will prove this theorem by contradiction. Assume that there are
finitely many primes, say n of them. We can list them in order:
Choose any prime from the prime factorization on the right, say pi1 . Our number
pi1 divides the right side of this equation, so it must also divide the left side.
Because pi1 is a prime, it appears in the list p1 , p2 , . . . pn , and so pi1 divides
p1 · p2 · · · pn . Since pi1 divides the first term of the left side, and it divides the
entire left side, it must divide the second term on the left, so pi1 must divide
1. Thus pi1 = 1, contradicting the assumption that pi1 is a prime! Thus there
must not be a largest prime, and so there are infinitely many.
We used two facts in this proof that deserve further discussion:
If a + b = c, p|a and p|c then p|b.
q is not prime, then it has a prime factorization.
These are both worthy of proofs themselves. They are certainly not obvious
from our basic axioms or from the definitions of divisibility and prime. A math-
ematician would prove these results first, perhaps as lemmas, and then cite them
in the main proof. Let’s prove these lemmas.
Lemma 1. If a + b = c, p|a and p|c then p|b.
Proof. Let a + b = c. Then subtracting a from both sides of this equation gives
b = c − a. Since p|a and p|c, p|(c − a). Because c − a is b, this implies that
p|b.
Lemma 2. Every natural number greater than 1 is either prime or has a prime
factorization.
3.5. PROOFS BY CONTRADICTION 79
Proof. We’ll assume that 2n − 1 is prime and that n is not prime, and look
for a contradiction. Since n is not prime, there are factors x and y such that
n = x · y, and 1 < x, y < n.
So
2n − 1 = 2xy − 1 = (2x )y − (1)y
= (2x − 1)((2x )y−1 + (2x )y−2 + ... + 1).
Wow! This proof uses a very weighty fact of algebra, which we will leave as
an exercise:
Theorem 16. If n > 2 then there is no m such that n|m and n + m = nm.
Exercises.
1. Prove Theorem 15.
2. Prove the following theorems. You may wish to use proofs by contradic-
tion.
Theorem 17. If n2 is odd, then n is odd.
Theorem 18. The sum of the cubes of two consecutive natural numbers
cannot be equal to the cube of the next largest number.
Theorem 19. If m and n are odd then x2 − 2mx + 2n = 0 has no natural
number solution, x.
√
Theorem 20. If for all m, 1 < m ≤ p implies m does not divide p, then
p is prime.
Theorem 21. There are at least two people in the world with exactly
the same number of hairs on their heads.
Existence: To show something exists, we must either cite it, (i.e., give an
example) or show that it has to occur without actually producing it. This
latter situation is called an indirect proof, and is sometimes accomplished
through a proof by contradiction: Assume that it doesn’t exist and derive
a contradiction.
Uniqueness: To show that something is unique, it is almost always easiest
to assume that there are two with the same properties and then prove
that the two are equal.
If and Only If Statements (iff or ↔ ): From propositional logic, we know
that to accomplish the proof of A ↔ B, we can prove both A → B and
B → A. The two directions might use very different approaches. It is
unusual for such a statement to be proven without dividing it into the two
cases.
Showing that two Expressions are Equal: We have seen direct proofs of
A = B. Sometimes, it is not obvious how to accomplish this. There are
other, equivalent forms of equality that may be easier to prove:
A−B =0
(A ≥ B) ∧ (B ≥ A)
A
B = 1, B 6= 0
3.6. OTHER STRATEGIES 81
Exercises.
1. Give informal proofs of the following theorems.
Theorem 22. There are natural numbers a, b, and c such that a2 + b2 =
c2 .
Theorem 23. There is a natural number n such that 2n + 7n is prime.
Theorem 24. Every natural number has a unique prime factorization.
Theorem 25. Every pair of natural numbers has a common multiple.
Theorem 26. There is a unique natural number n satisfying the equation
2n2 − 3n − 2 = 0.
Theorem 27. If n is a multiple of 3, then either n is odd or n is a multiple
of 6.
Theorem 28. If n 6= 0 then either n is a multiple of 2 or n = 2k + 1 for
some k.
Theorem 29. No natural number is both even and odd.
Theorem 30. If b is a multiple of 2 and a multiple of 5 then b is a multiple
of 10.
Theorem 31. If a|b, b|c, and c|a, then a = b = c.
Theorem 32. m2 = n2 iff m = n.
(m+n)2
Theorem 33. If m · n = 2 then n = m = 0.
82 CHAPTER 3. TRANSITION TO INFORMAL PROOFS
Theorem 34. 3 must divide the sum of any three consecutive numbers.
Theorem 35. If 3|n then 3 divides the sum of the digits of n.
Theorem 36. For any number with at least two trailing zeros, if 4 di-
vides the number obtained by deleting the trailing zeros, the 4 divides the
original number.
Theorem 37. If X is a set with n ≥ 2 elements, then X has 12 n(n − 1)
subsets with exactly 2 elements.
1 1 1 n
Theorem 38. For n ≥ 1, the equation 1·2 + 2·3 + ··· + n·(n+1) = n+1
holds.
Theorem 39. Every set with n elements has exactly 2n subsets.
Alternation of Quantifiers –
Sequences
In this section we will study sequences and sequence convergence. The state-
ments in this chapter will involve the alternation of quantifiers, so let’s review
some of what we did formally.
Remember that ∀x∃y has a different meaning from ∃y∀x. In the first, we
are required to produce for each x, a corresponding y, and in the second we are
required to produce a single y that works for every x. Let’s review this idea
with an example:
Let the universe be the set of people Tom, Dick and Harry. Consider the
predicate P (x, y) = x likes y. Then all possible alternating quantifiers are:
Having this under our belts, notice the occurances of alternating quantifiers
in the basic definitions about sequences below.
83
84 CHAPTER 4. ALTERNATION OF QUANTIFIERS – SEQUENCES
han i
where n takes on every value in N, unless otherwise stated.
hni: 0, 1, 2, 3, 4, . . . , n, . . .
h2−n i: 1, 21 , 14 , 18 , . . . , 21n , . . .
h2−n i∞
1 :
1 1 1 1
2 , 4 , 8 , ..., 2n , . . .
h(−1)n i∞
0 : 1, −1, 1, −1, 1, −1, 1, . . .
hni∞
3 : 3, 4, 5, 6, . . . , n, . . .
1
<
2(2m + 1)
Solving this for gives
1
< 2m + 1
2
So it looks like
1
2 −1
m>
2
would give us what we need.
How do we write this up in a proper proof?
1
2 −1
Proof. Fix > 0, and suppose N is an integer with N ≥ 2 . Let m > N .
Then
m 1
|am − L| = −
2m + 1 2
−1
=
2(2m + 1)
1
=
2(2m + 1)
1
< h 1 −1 i
2 2 2
2 + 1
1
= 1 = .
86 CHAPTER 4. ALTERNATION OF QUANTIFIERS – SEQUENCES
Thus, for any given > 0, we have shown how to produce an N such that if
m > N then |am − L| < .
n
Theorem 2. h 2n+1 i is bounded.
(
2−n if n is even
Theorem 3. If an = then han i converges to 0.
3−n if n is odd
−N
2 − 0 <
2−N <
One way to solve for N would be to take the log2 of both sides:
−N < log2
N > − log2
4.1. SEQUENCES, BOUNDS AND CONVERGENCE 87
Ok; so recapping, the proof should work as follows: Fix , and let N ≥
− log2 . Algebra should get us
−N
2 − 0 <
We have
−m
− 0 = 2−m .
2
The proof for convergence of h3−n i should work the same way using an N
of at least − log3 . How do we combine these to prove Theorem 2? We need to
find an N that works regardless of whether the later term is 2−m or 3−m .
So what we really need to do is to choose the larger N given , i.e., choose
the larger of − log2 and − log3 . This will guarantee that we are far enough
out on the sequence so that the terms are within of 0. Which if these two logs
is larger? That depends on , and since we don’t want to make any assumptions
regarding , we will simply let N ≥ max{− log2 , − log3 }.
Here is the proof of Theorem 3:
2
Theorem 5. h n2n
3 +1 i converges to 0.
2N 2
How should we proceed? We could try to solve N 3 +1 < for N in terms of
. This would be very messy:
2N 2
<
N3 + 1
2N 2 < · (N 3 + 1)
2N 2 − · (N 3 + 1) < 0
Yuk! It’s cubic in N .
A better approach: Note that
2N 2 2N 2 2
3
< 3
=
N +1 N N
because we have made the denominator smaller by subtracting 1, and hence the
fraction is larger.
This implies that if we find an N that works for the series with general
2
term N2 then it will also work for the series with general term N2N
3 +1 . Why?
2
The inequality tells us that no matter what index we are interested in, N2N
3 +1 is
below N2 , so if we find the place on N2 where all subsequent terms are within
2
of L, all subsequent terms of N2N3 +1 will also be within of L.
So we have a new question: What N works for h N2 i? Let’s try the algebraic
way of getting N in terms of :
2
−0<
N
2
<N
How do we put this all together in our proof?
Proof. Choose an arbitrary > 0, and let N be an integer with N ≥ 2 . For
2
the sequence h n2n
3 +1 i and m > N , we need to show that
2m2
m3 + 1 − 0 < .
We have
2m2 2
= 2m < 2 .
m3 + 1 − 0 m3 + 1 m
Since m > N and N ≥ 2 , we have that m > 2 , so:
2m2
2 2
m3 + 1 − 0 < m < 2 = .
4.2. MORE ON CONVERGENCE AND BOUNDEDNESS 89
Exercise 4.1. Do the following sequences converge? If so, to what? Are they
bounded? If so, by what? Provide proofs for those that converge and/or are
bounded.
1
a. an = 1 + n
1+(−1)n
b. an = 2
1 1 1
c. an = 1 + 2 + 3 + ... + n
2n
d. an = 3n+1
3n+7
e. an = n
n2
f. an = n+1
n2
g. an = 2n2 +1
h. an = (−1)n 2−n
sin(n)
i. an = n
n!
j. an = 2n
5n
k. an = 4−n (where n ≥ 5)
How shall we start this one? Let the limit be A and note that eventually
(for m’s greater than some N ) |am − A| < 1, or
A − 1 < am < A + 1.
Why? We are using the definition for the specific case of = 1. Thus a lower
bound can be found by choosing the smallest number from the set {a1 , a2 , ..., an , A−
1} and an upper bound can be found by choosing the largest number from the
set {a1 , a2 , ..., an , A + 1}.
Proof. Suppose |an | converges to A. Choose = 1; then there is an N such
that m > N implies
A − 1 < am < A + 1.
Let L = min{a1 , a2 , ..., an , A − 1} and U = max{a1 , a2 , ..., an , A + 1}. Then for
all n, L < an < U .
Theorem 7. The limit of a convergent sequence is unique.
So how will we prove this? In the standard way: Assume there are 2 limits
and show that they must be equal. This is not so easy as it sounds. Here is a
proof. Notice that we are using a specific again.
Proof. We will prove this by contradiction. Assume han i converges to A and
also to B, and assume that A < B. Let = 21 (B − A). Since han i converges to
A, we know that there is an N1 such that for all m > N1 we have |am − A| <
or
A − < am < A + .
Similarly, since han i converges to B, we know that there is an N2 such that
for all m > N2 we have |am − B| < or
B − < am < B + .
As long as we are beyond both N1 and N2 , we have:
1 1
A + = A + (B − A) = B − (B − A) = B −
2 2
This is a contradiction, since B − < am < A + .
This is not the only way to prove this theorem. Can you develop another
proof?
Theorem 8. If han i converges to A and hbn i converges to B then han + bn i
converges to A + B.
So how do we approach this? We need
|(aN + bN ) − (A + B)| <
or
|(aN − A) + (bn − B)| < .
Can we say that |aN − A| < 2 and |bN − B| < 2 ? Sure! The definition says
that we have the result for all , so we also have it for any 2 . Here is the proof:
4.3. A NOTE ON DIVERGENT SEQUENCES 91
Most upper level mathematics courses use at least some notation from set theory
and many apply results and techniques from set theory. The logical expertise
that you have gained in the preceding chapters is directly applicable to the study
of sets.
Naı̈ve set theory and axiomatic set theory are the Scylla and Charybdis of
this topic. On the one hand, naı̈ve set theory concentrates on the portion of
set theory that is most frequently applied by mathematicians. Unfortunately,
it glosses over some problems with existence of sets that can lead to paradoxes.
Axiomatic set theory is engineered specifically to address these existence prob-
lems. As appealing as a purely axiomatic approach is, the usual formulation of
the axioms treats many of the central constructs of set theory as abbreviations,
sidestepping the material which is most useful in everyday practice.
We will try to thread a course between the two extremes, mentioning axioms
when they are pertinent and introducing many of the concepts you will need for
your future courses. With any luck, we won’t be drawn down or chewed up.
Integers: Z = {· · · − 1, −2, 0, 1, 2 . . . }
Real numbers: R
√
A few real numbers: 0, −6, 13 , 3, π
93
94 CHAPTER 5. INTRODUCTION TO SET THEORY
In our definition of the rationals, we used the following set builder notation:
Q = { ab | a ∈ Z ∧ b ∈ N ∧ b 6= 0}. Informally, we would read this as Q is
the set of all fractions of the form ab , where a is an integer and b is a postive
natural number. In general, we write X = {x | ϕ(x)} for “X is the set of all
objects x that satisfy the formula ϕ(x).” The notation is handy, but can lead
to difficulties. For more on this, see section 5.6.
The fancy script used for these set symbols is called “blackboard bold.” The
idea is that it is easier to draw two close lines with a piece of chalk than it
is to draw one wide line. The particular letters seem to make some sense. N
for natural numbers and R for real numbers are obvious choices. Since every
rational number can be written as a quotient, Q seems reasonable. Finally, the
german word for number is Zahl, so Z fur die Zahlen steht.
Various authors may or may not include 0 in the set of natural numbers.
Since we would like N to be a model of the Peano axioms, and those axioms
include a constant symbol for 0, we should definitely include 0. People who
don’t care about the Peano axioms will sometimes leave it out. It is always
safest to double check what the particular book says.
The formula x ∈ y means “x is an element of y.” We will also write x ∈ /y
as an abbreviation for ¬(x ∈ y). Combined with our letters for common sets,
this notation provides a very convenient shorthand. Rather than saying “2 is a
natural number and π is not a natural number,” we can write 2 ∈ N and π ∈ / N.
The formula A ⊂ B is read “A is a subset of B.” A ⊂ B means that
every element of A is also an element of B. More formally, A ⊂ B means
∀t(t ∈ A → t ∈ B). Note that this tells us that to prove that A ⊂ B, we should
start with an arbitrary element t and prove the implication t ∈ A → t ∈ B.
Depending on the situation, we might prove the implication directly, prove the
contrapositive, or do a proof by contradiction.
Notation for subsets varies from book to book. Some books use A ⊆ B to
denote A ⊂ B and then use A ⊂ B for A ⊆ B ∧ A 6= B. We will use A ( B
to denote A ⊆ B ∧ A 6= B. The formula A ( B is read “A is a proper subset
of B.” All these symbols can be reversed in the same fashion as < and >, so
A ⊂ B means exactly the same thing as B ⊃ A. We can also write A 6⊂ B as an
abbreviation of ¬(A ⊂ B), and make similar negated forms of all the preceding
notation.
Although both ∈ and ⊂ denote forms of containment, they denote different
sorts of containment. Consider the set A = {1, {2}, {1, 2}}. The set A has
exactly three elements. We can see that 1 ∈ A and {1, 2} ∈ A, since 1 is the
first element in our listing of A and {1, 2} is the third. While {2} ∈ A (look
for it in the list), 2 ∈
/ A (it’s not in the list). Here 2 and {2} are very different
objects. We can also say that {1, {2}} ⊂ A, because each element of {1, {2}}
(namely 1 and {2}) is also an element of A. On the other hand, {1, 2} 6⊂ A,
because 2 ∈ / A.
The formula A = B is read “A equals B.” We are not monkeying with
the meaning of equality here. That is, equality is assumed to be the familiar
relationship satisfying the axioms of E. In particular, substitutivity of equality
holds, so A = B implies ∀t(t ∈ A ↔ t ∈ B). Furthermore, we will adopt the
5.1. FAMILIAR SETS AND SYMBOLS 95
We have one last bit of notation to introduce in this section. The set con-
taining no elements is called the empty set, and is denoted by ∅. We have an
axiom that says exactly this.
Empty Set Axiom: The empty set contains no elements. Formally, ∀t(t ∈
/ ∅).
Here are three little brain teasers involving the empty set.
Theorem 49. ∅ =
6 {∅}.
Proof. Note that ∅ ∈ {∅}, but by the Empty Set Axiom, ∅ ∈ / ∅. Thus ∅ and
{∅} don’t have the same elements, so by the contrapositive of substitutivity of
equality, they are not equal.
Proof. If ∀t(t ∈
/ A), then ∀t(t ∈ A ↔ t ∈ ∅). By the Axiom of Extensionality,
A = ∅.
96 CHAPTER 5. INTRODUCTION TO SET THEORY
Exercises.
1. Prove: For all A and B, if A ( B then B 6= ∅. (Hint: A ( B means that
A ⊂ B and A 6= B. In light of the Axiom of Extensionality, what does
A 6= B imply?)
3. The subset relation is almost, but not quite, an equivalence relation. The
three parts of this exercise fill in the details.
(a) Prove that ⊂ is a reflexive relation. (That is, show that for all A,
A ⊂ A.)
(b) Prove that ⊂ is a transitive relation. (That is, show that if A ⊂ B
and B ⊂ C, then A ⊂ C, for all A, B, and C.)
(c) Prove that ⊂ is not a symmetric relation. (That is, show that it is
not the case that for all sets A and B, if A ⊂ B then B ⊂ A. Your
proof should consist of two concrete sets that violate symmetry.)
Definition. The union of two sets consists of all the elements that are in either
set (or both). More formally, A ∪ B = {x | x ∈ A ∨ x ∈ B}. This is the union
operator that you probably saw the first time you learned any set theory.
Example. The binary union operator is the one that you probably saw the
first time you learned any set theory. If A = {0, 2} and B = {2, 3, 4}, then
A ∪ B = {0, 2, 3, 4}.
Definition. The union of a collection of sets consists of all the elements that
are in at least one of the sets in the collection. More formally, if I is a set of
sets, then ∪I = {x | ∃y(y ∈ I ∧ x ∈ y)}.
Example. We can rewrite the previous example using the unary union oper-
ator. If A = {0, 2} and B = {2, 3, 4} and I = {A, B}, then ∪I = A ∪ B =
{0, 2, 3, 4}.
The unary union operator can be used to find the union of one set. The
set ∪{A} consists of all those those x such that there is a y ∈ {A} such that
x ∈ y. Since the only y satisfying y ∈ {A} is A itself, ∪{A} consists of all those
x such that x ∈ A. That is, ∪{A} = A. One slick way to think about this is to
5.2. OPERATORS ON SETS 97
Theorem. ∪B = Z.
Proof. To prove the equality, we’ll prove containment in each direction. First,
suppose j ∈ ∪B. Then for some n ∈ N, j ∈ Zn . Since Zn ⊂ Z, we must have
j ∈ Z. Summarizing, we have shown that ∪B ⊂ Z. To complete the proof,
suppose k ∈ Z. Since −|k| ≤ k ≤ |k|, we know that k ∈ Z|k| . Since Z|k| ∈ B, we
must have k ∈ ∪B. Consequently Z ⊂ ∪S. By Theorem 47, the claim holds.
The intersection symbol can also be used as either the traditional binary
operator or as a unary operator. The following definitions and examples closely
parallel those for unions
Definition. The intersection of two sets consists of all the elements that are in
both sets. More formally, A ∩ B = {x | x ∈ A ∧ x ∈ B}.
Example. With this intersection symbol, we are back to something you prob-
ably saw a long time ago. If A = {0, 2} and B = {2, 3, 4}, then A ∩ B = {2}.
Example. We can rewrite the preceding example using the unary notation. If
A = {0, 2} and B = {2, 3, 4} and I = {A, B}, then ∩I = A ∩ B = {2}.
98 CHAPTER 5. INTRODUCTION TO SET THEORY
We can use the unary notation to take the intersection of one set. The
intersection ∩{A} consists of all those elements x that are elements of every
y ∈ {A}. Since the only y in {A} is A itself, this simplifies to the collection of
those elements x that are in A. Thus ∩{A} = A. As we did with union, we
could write ∩{A} = A ∩ A = A.
The definition of the unary intersection specifies that the collection of sets
that we are intersecting is nonempty. Consequently, we can’t take the intersec-
tion of zero sets. This turns out to be a good thing. Suppose we think of V as
being the collection of all those elements x such that ∀y(y ∈ ∅ → x ∈ y). Since
y ∈ ∅ is false for every y, the implication y ∈ ∅ → x ∈ y is true, regardless of the
choice of x and y. Consequently, V would contain every set x and be a so-called
“universal set.” Axiomatic set theory denies the existence of a universal set in
order to avoid a paradox. We should follow suit, avoid universal sets, and avoid
trying to find the intersection of zero sets. For more about axiomatic set theory,
see section 5.6.
As with the unary union operator, the unary intersection operator really
shines when we want to take intersections of large collections of sets. As an
example, we will return to our consideration of balanced intervals.
Example. Let B be the set of balanced intervals in Z used in the example for
unions.
Theorem. ∩B = {0}.
Proof. To prove the equality, we’ll prove containment in each direction. First
suppose that t ∈ ∩B. Then for every Zn ∈ B, t must be an element of Zn .
In particular, t ∈ Z0 = {0}. Thus t = 0, and so t is an element of the set on
the right side of the equality. To prove the reverse containment, suppose t is an
element of the set on the right side of the equality. Clearly, t = 0. For every
n ∈ N, we know −n ≤ 0 ≤ n, so 0 ∈ Zn . Since 0 ∈ Zn for every Zn ∈ B, we
have 0 ∈ ∩B. Since t = 0, we also have t ∈ ∩B.
There are some very nice connections between union and intersection. For
example, the binary operators differ only in their logical operator. Compare
these equations:
A ∪ B = {x | x ∈ A ∨ x ∈ B}
A ∩ B = {x | x ∈ A ∧ x ∈ B}
The unary operators have a similar connection. This is especially notable if
we switch to bounded quantifier notation.
Definition. The power set of a set X is denoted by P(X) and is the set of all
subsets of X. Formally, P(X) = {y | y ⊂ X}.
Example. If A = {0, 2}, then P(A) = {∅, {0}, {2}, {0, 2}}. It is not too hard
to show that if X has n elements, then P(X) has 2n elements. We are lucky
that A was a pretty small set.
Example. Suppose B is the set of balanced intervals from Z as described earlier.
Theorem. B ( P(Z).
Proof. For each n, Zn ∈ P(Z), so B ⊂ P(Z). Additionally, {0, 3, 5} ⊂ Z and
{0, 3, 5} ∈ 6 P(Z).
/ B, so B =
Exercises.
1. Prove: For all A, B, and C, A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C).
2. Prove: For all sets of sets A, ∀t(t ∈ A → t ⊂ ∪A).
3. Suppose that for all n ∈ N, Xn = {n, n + 1, n + 2, . . . }. Let A = {Xn |
n ∈ N}.
(a) Find ∪A.
(b) Find ∩A.
4. Prove: For all A, B, and C, A ∪ (B ∩ C) = (A ∪ B) ∩ (A ∪ C).
5. Prove: For all nonempty sets of sets A, ∀t(t ∈ A → ∩A ⊂ t).
6. Prove: ∀X(∪P(X) = X).
7. (a) Prove if A is a set of sets and ∅ ∈ A then ∩A = ∅.
(b) Prove ∀X(∩P(X) = ∅).
8. Prove: ∀X(X ⊂ P(∪X)).
9. Find a set X such that X = P(∪X).
10. Find P(∅).
11. Prove that ∪∅ = ∅.
12. Find the following sets: P(∅), P(P(∅)), P(P(P(∅)))
13. A set S is called transitive if ∀x∀y((x ∈ y ∧ y ∈ S) → x ∈ S). (So S is
transitive if ∈ is a transitive relation on the elements of S.)
(a) Prove that S is transitive if and only if ∪S ⊂ S.
(b) Prove that ∅ is transitive. (Hint: You could use the definition or part
13a.)
(c) Prove that {∅, {∅}, {{∅}}, . . . } is transitive.
(d) Prove that {∅, {{∅}}} is not transitive.
100 CHAPTER 5. INTRODUCTION TO SET THEORY
A function is a set of ordered pairs that is single valued. Rather than writing
(x, y) ∈ f , we often use the convenient abbreviation f (x) = y.
Requiring f to be single-valued insures that the equality in the abbreviation
is transitive, as we would expect. In particular, if f (x) = y1 and f (x) = y2 , then
y1 = y2 holds. For any function f from R to R, the fact that f is single-valued
is equivalent to saying that every vertical line in the plane hits the graph of f
in at most one point. So for functions on the reals, the preceding definition says
“functions must satisfy the vertical line test.” On the other hand, for functions
of other sorts, the vertical line test makes less sense. For functions that are
(for example) linear transformations on R3 , or complex valued holomorphisms,
or automorphisms on finite groups, the idea of a “vertical line” doesn’t make
a lick of sense. However, we can certainly talk about all these functions being
single-valued.
Example. Consider the set F = {(0, 1), (1, 2), (2, 1)}. Since each first coordi-
nate appears in only one ordered pair, F is a function. Because (0, 1) ∈ F , we
can write F (0) = 1. Similarly we can write F (1) = 2 and F (2) = 1. If you plot
the three pairs in F as points in the real plane, you will see that they satisfy
the vertical line test.
Example. Consider X = {(1, 0), (2, 1), (1, 2)}. X is not a function, because
(1, 0) and (1, 2) witness that X is not single-valued. These pairs indicate that
X(1) = 0 and X(1) = 2, and that would cause problems. As in the previous
example, you could draw a picture here to illustrate the situation.
Example. Consider Y = {(a, b), (b, c), (a, c)}, where none of the letters are
equal. Since this is practically identical to the preceding example, Y is not a
5.3. CARTESIAN PRODUCTS AND FUNCTIONS 101
function. We can demonstrate this by just pointing out that Y (a) = b and
Y (a) = c, so Y is not single-valued. Since we’re not dealing with numbers here,
drawing a picture would require making some assumptions about a, b, and c.
Unless we identify the letters with numbers, the “vertical line test” doesn’t make
much sense.
Example. Suppose S = {(x, x2 ) | x ∈ R}. S is the familiar squaring function
on the real numbers, often described using the equation S(x) = x2 .
Example. Consider T = {(x2 , x) | x ∈ R}. Since (1, −1) and (1, 1) are both
elements of T , T is not single-valued. Because we are working in the reals, we
could say that T fails the vertical line test.
When we think of functions as sets of ordered pairs, the concepts of domain
and range become completely transparent. Consider the following definition.
Definition. If f is a function, the domain of f is defined by the equation
dom(f ) = {x | ∃y(x, y) ∈ f }.
ran(f ) = {y | ∃x(x, y) ∈ f }.
Any one to one function from a set X into a set Y matches each element of
X with a unique partner in Y . When we match students with student identifier
numbers, or match books with call numbers, or give everyone an individual
bag of chips for their lunch, we are implementing one to one functions. In
applying the definition, generally it is easiest to prove the contrapositive of the
implication. In practice, to prove that f is one to one, we usually prove
We have slightly modified the formula for one to one here, but this version is
equivalent to the one above. Note that the definition of one to one has two
xs and one y, while the definition of single-valued has one x and two ys. A
relation (from R to R) that is not single-valued fails the vertical line test, while
a function that is not one to one fails a horizontal line test.
Exercises
The following problems use these sets of ordered pairs:
f = {(n, n2 ) | n ∈ N}
g = {(x, x2 ) | x ∈ Z}
h = {(x2 , x) | x ∈ Z}
r = {(n, n + 3) | n ∈ N}
s = {(x, x + 3) | x ∈ Z}
1. (a) Prove that f is single-valued.
(b) Prove that h is not single-valued. (Use a concrete example.)
2. (a) Prove that f is one to one.
(b) Prove that g is not one to one. (Use a concrete example.)
3. Both r and s are functions.
(a) Find the domain and range of r.
(b) Is r one to one and onto N? Justify your answer.
(c) Find the domain and range of s.
(d) Is s one to one and onto Z? Justify your answer.
4. In axiomatic set theory, (see section 5.6) we encode ordered pairs by sets.
This amounts to asserting that the ordered pair notation is just a short-
hand for a particular set. An encoding suggested by Kuratowski [7] is
(a, b) = {{a, b}, a}.
(a) Using Kuratowski’s encoding and the Axiom of Extensionality, prove
that (a, b) = (c, d) if and only if a = c and b = d. (That is, show
that the set encoding (a, b) has exactly the same elements as the set
encoding (c, d) if and only if a = c and b = d.)
(b) Using Kuratowski’s encoding, prove that X × Y ⊂ P(X ∪ P(X ∪ Y )).
5. The Axiom of Regularity (see section 5.6) implies that ∀x(x ∈ / x). Show
that the Axiom of Regularity implies that no set with exactly one element
can represent an ordered pair in the Kuratowski encoding.
6. Find an example of sets a, b, c, and d such that {{a}, b} = {{c}, d} but it
is not the case that a = b and c = d. (This shows that (a, b) = {{a}, b}
doesn’t work as an encoding of ordered pairs.)
104 CHAPTER 5. INTRODUCTION TO SET THEORY
Before the last theorem, we said that the flipped set should be the inverse
function, provided that it is a function at all. We still need to prove that.
Notation alert: The pre-image of a set under f is often defined even when
the inverse function f −1 does not exist. Consequently, f −1 (V ) (for a set V )
may denote a perfectly well defined set even when f −1 (y) (for an element y of
ran(f )) may be complete gibberish.
Example: Define the function f : {0, 1, 2, 3} → {a, b, c} by the set of ordered
pairs f = {(0, a), (1, c), (2, c), (3, b)}. So f (0) = a, f (1) = c, and so on. Then
and
Exercises
1. Prove that if an inverse exists for a function, it must be unique.
x 1 2 5
g(x) t a r
Note that the function f is one to one and maps the set {1, 2, 5, 7} onto
the set {t, a, r, e}. So f matches each element of {1, 2, 5, 7} with an element of
{t, a, r, e}. As a consequence of this, it is reasonable to conclude that the set
{1, 2, 5, 7} is the same size as the set {t, a, r, e}.
Why bother with f ? Each of these sets contains four elements, so of course
they are the same size. Note that in counting the elements, we actually match
each element with one of the numbers 1, 2, 3, and 4. Consequently, as we count
the elements in each set, we are actually defining a one to one and onto map
between the set and {1, 2, 3, 4}. When we count the elements in the two sets, we
are actually constructing two functions. If we compose one of those functions
with the inverse of the other, we get a one to one function mapping one of the
sets onto the other. In some sense, counting the elements is just a process for
constructing a function like f .
Now g witnesses that the sets {1, 2, 5} and {t, a, r} are the same size. Since
g can also be viewed as a one to one map from {1, 2, 5} into {t, a, r, e}, this also
shows us that the set {t, a, r, e} is at least as big as the set {1, 2, 5}. Note that
in the previous sentence, when we expand the codomain of g, we get a function
into {t, a, r, e}, not onto {t, a, r, e}.
The next definition sets up some nice shorthand notation for this notion of
size of sets.
Definition. Let A and B be sets. If there is a one to one function from A into
B, we say that B is at least as big as A, and write A 4 B (or B < A). If there
is a one to one function from A onto B, then we say that A and B are the same
size, and write A ∼ B.
The really beautiful thing about this definition is that it allows us to compare
the sizes of infinite sets. Here are two examples.
5.5. SIZES OF SETS 107
Theorem 53. N 4 Z, that is, the integers are at least as big as the natural
numbers.
Theorem 54. N ∼ Z, that is, the integers and the natural numbers are the
same size.
Verification that this function is one to one and onto is left as an exercise.
Sometimes cooking up these one to one and onto functions is quite challeng-
ing. The next theorem provides perhaps one of the most useful shortcuts in the
set theory literature. Unfortunately, the proof of this theorem is beyond the
scope of this book, but you can find a proof in [3], if you are interested.
So far, all the infinite sets that we have examined were the same size. This
is a complete fluke. In the case of finite sets, the power set of a set is always
larger than the original set. Cantor proved that this is true for infinite sets, too!
If we write X ≺ Y for (X 4 Y ∧ X 6∼ Y ), we can state the next theorem very
elegantly.
Theorem 56. ∀X(X ≺ P(X)). That is, for every set X, the power set of X is
strictly larger than X.
108 CHAPTER 5. INTRODUCTION TO SET THEORY
If we plug the natural numbers in for X in the preceding theorem and then
iterate this process, we get a nice chain of infinite sets of strictly increasing sizes.
Exercises
1. Give a proof that the function defined in the proof of Theorem 53 is one
to one.
2. Give a detailed proof that the function defined in the first proof of Theorem
54 is indeed one to one and onto.
3. Give a detailed proof that the function h defined in the alternate proof of
Theorem 54 (following the statement of the Cantor-Berstein theorem) is
one to one.
4. Prove that the rationals are countable. (Hint: It is easiest to use the
Cantor-Berstein Theorem to prove that Q ∼ N.)
As a general rule of thumb, as long as you stick to subsets of sets that you
know already exist (e.g. sets of natural numbers or sets of continuous functions
on the reals), you will (hopefully) never run into a Russell-style paradox. If you
think you have, we would all like to hear about it.
Exercises
1. Prove that for all x, P(x) 6⊂ x. (Use a proof by contradiction. Suppose
P(x) ⊂ x and define the set U = {t ∈ x | x ∈ / t}. Then U is a perfectly
good subset of x, whose existence could be proved in ZF by means of
the separation axiom. Consequently, either x ∈ U or x ∈ / U . Show that
neither possibility can hold.)
BIBLIOGRAPHY 111
Bibliography
[1] Herbert B. Enderton, A mathematical introduction to logic, 2nd ed., Harcourt/Academic
Press, Burlington, MA, 2001. MR1801397 (2001h:03001)
[2] Kurt Gödel, Die Vollständigkeit der Axiome des logischen Funktionenkalküls, Monatsh.
Math. Phys. 37 (1930), no. 1, 349–360 (German). MR1549799
[3] John M. Harris, Jeffry L. Hirst, and Michael J. Mossinghoff, Combinatorics and graph the-
ory, Undergraduate Texts in Mathematics, Springer-Verlag, New York, 2000. MR1770510
[4] Leon Henkin, The completeness of the first-order functional calculus, J. Symbolic Logic
14 (1949), 159–166. MR0033781 (11,487d)
[5] László Kalmár, Zurückführung des Entscheidungsproblems auf den Fall von Formeln
mit einer einzigen, binären, Funktionsvariablen, Compositio Math. 4 (1937), 137–144
(German). MR1556967
[6] Stephen Cole Kleene, Introduction to metamathematics, D. Van Nostrand Co., Inc., New
York, N. Y., 1952. MR0051790 (14,525m)
[7] K. Kuratowski, Sur la notion del l’ordre dans la théorie des ensembles, Fund. Math. 2
(1921), 161-171.
[8] Elliott Mendelson, Introduction to mathematical logic, 4th ed., Chapman & Hall, London,
1997. MR1630547 (99b:03002)
[9] Carew A. Meredith, Single axioms for the systems (C, N ), (C, O) and (A, N ) of the
two-valued propositional calculus, J. Computing Systems 1 (1953), 155–164. MR0055953
(15,1b)
[10] Joseph R. Shoenfield, Recursion theory, Lecture Notes in Logic, vol. 1, Association
for Symbolic Logic, Urbana, IL, 2001. Reprint of the 1993 original. MR1813760
(2001m:03078)
[11] Alfred Tarski, Logic, semantics, metamathematics. Papers from 1923 to 1938, Oxford at
the Clarendon Press, 1956. Translated by J. H. Woodger. (For truth in models, especially
see Der Wahrheitsbegriff in den formalisierten Sprachen.) MR0078296 (17,1171a)
Index
112
INDEX 113
PA, 66
parentheses, 6
power set P, 99
predicate, 29
predicate calculus, 29
axioms of K, 51
proposition, 1
propositional calculus
axioms of L, 16
Kleene’s axioms, 26
Meredith’s axioms, 26
proof system L, 16
quantifiers, 31
Rule C, 59
Rule T, 54
Russell’s paradox, 109
satisfiable, 42
sequences, 83
bounded, 85
convergence, 84
divergence, 91
set theory, 93
sets
size of, 106
single-valued, 100
soundness
predicate calculus, 52
propositional calculus, 23
subset, 94
substitutivity of equality, 64
tautology, 9
instance of, 46
term, 30
truth in a model