Internal Set Theory
Internal Set Theory
Internal Set Theory
x.
A formula of ordinary mathematicsone that does not involve the new
predicate standard, even indirectlyis called internal ; otherwise, a
formula is called external. The eleven denitions given above are all
external formulas.
Notice that the adjectives internal and external are metamath-
ematical conceptsthey are properties of formulas and do not apply to
sets.
One of the basic principles of ordinary mathematics, or internal
mathematics as I shall call it from now on, is the subset axiom. This
asserts that if X is a set then there is a set S, denoted by x X : A,
such that for all x we have x S x X & A. Here A is a formula
of internal mathematics. Usually it will have x as a free variable, but it
may have other free variables as well. When we want to emphasize its
dependence on x, we write it as A(x). Nothing in internal mathematics
has been changed by introducing the new predicate, so the subset axiom
continues to hold. But nothing in internal mathematics refers to the new
predicate, so nothing entitles us to apply the subset axiom to external
formulas. For example, we cannot prove that there exists a set I such
that x I x R & x is innitesimal. The notation x R : x 0
is not allowed; it is an example of illegal set formation. Only internal
1.2 THE TRANSFER PRINCIPLE 3
formulas can be used to dene subsets. (Nevertheless, in Chapter 4 we
shall nd a way to introduce so-called external sets.)
Certain set formations that might at rst sight appear to be illegal
are perfectly legitimate. For example, suppose that we have an innites-
imal x > 0. Then we can form the closed interval [x, x] consisting of
all y such that x y x. This is simply because we already know
that for any x > 0 we can form the set [x, x].
Exercises for Section 1.1
1. Let n be a nonstandard natural number. Can one form the set of all
natural numbers k such that k n? Is this set nite?
2. Can one form the set of all limited real numbers?
3. Can one prove that every standard positive real number is limited?
4. Can one form the set of all standard real numbers x such that x
2
1?
5. Assume that 1 is standard. Can one form the set of all limited real
numbers x such that x
2
1?
6. Can one prove that the sum of two innitesimals is innitesimal?
1.2 The transfer principle
We cannot yet prove anything of interest involving standard be-
cause we have made no assumptions about it. Our rst axiom is the
transfer principle (T).
The notation
st
means for all standard, and
st
means there
exists a standard. Let A be an internal formula whose only free variables
are x, t
1
, . . . , t
n
. Then the transfer principle is
st
t
1
st
t
n
[
st
xA xA]. (1.1)
We may think of the t
1
, . . . , t
n
as parameters; we are mainly inter-
ested in x. Then the transfer principle asserts that if we have an internal
formula A, and all the parameters have standard values, and if we know
that A holds for all standard x, then it holds for all x. (The converse
direction is trivial, and we could have stated (T) with just instead
of .)
The intuition behind (T) is that if something is true for a xed, but
arbitrary, x then it is true for all x.
Notice that two formulas are equivalent if and only if their negations
are. But we have xA xA, so if we apply (T) to A we obtain
the dual form of the transfer principle:
st
t
1
st
t
n
[
st
xA xA]. (1.2)
4 CHAPTER 1. INTERNAL SET THEORY
Let us write A B (A is weakly equivalent to B) to mean that
for all standard values of the free variables in the formulas, we have
A B. Then we can rewrite the two forms of (T) as
st
xA xA and
st
xA xA whenever A is an internal formula. Applying these rules
repeatedly, we see that any internal formula A is weakly equivalent to
the formula A
st
obtained by replacing each by
st
and by
st
. Then
t
1
, . . . , t
n
are standard A
st
,
where t
1
, . . . , t
n
are the free variables in A, is called the relativization
of A to the standard sets.
Consider an object, such as the empty set , the natural numbers N,
or the real numbers R, that can be described uniquely within internal
mathematics. That is, suppose that there is an internal formula A(x)
whose only free variable is x such that we can prove existence xA(x)
and uniqueness A(x
1
) & A(x
2
) x
1
= x
2
. By the dual form of transfer,
st
xA(x); so by uniqueness, the x such that A(x) holds is standard. For
example, let A(x) be y[y / x]. There is a unique set, the empty set ,
that satises A(x). Therefore is standard. The formulas describing N
and R are longer, but by the same reasoning, N and R are standard.
Any object that can be uniquely described within internal mathematics
is standard: the real numbers 0, 1, and , the Hilbert space L
2
(R, dx)
where dx is Lebesgue measure, the rst uncountable ordinal, and the
loop space of the fteen dimensional sphere are all standard. The real
number 10
100
is standard, so if x is innitesimal then [x[ 10
100
.
The same reasoning applies to internal formulas A(x) containing
parametersprovided the parameters have standard values. For exam-
ple, if t is a standard real number then so is sin t (let A(x, t) be x = sin t)
and if X is a standard Banach space so is its dual.
As an example of transfer, we know that for all real x > 0 there is
a natural number n such that nx 1; therefore, for all standard x > 0
there is a standard n such that nx 1. But suppose that x > 0 is
innitesimal. Do we know that there is a natural number n such that
nx 1? Of course; we already know this for all x > 0. But if we try
to argue as followsthere is an n such that nx 1; therefore, by the
dual form of transfer, there is a standard n such that nx 1then we
have made an error: transfer is only valid for the standard values of the
parameters (in this case x) in the formula. This is an example of illegal
transfer. It is the most common error in learning nonstandard analysis.
Before applying transfer, one must make sure that any parameters in
the formulaeven those that may be implicit in the discussionhave
standard values.
1.3 THE IDEALIZATION PRINCIPLE 5
Another form of illegal transfer is the attempt to apply it to an
external formula. For example, consider for all standard natural num-
bers n, the number n is limited; by transfer, all natural numbers are
limited. This is incorrect. Before applying transfer, one must check
two things: that the formula is internal and that all parameters in it
have standard values.
Exercises for Section 1.2
1. Can one prove that the sum of two innitesimals is innitesimal?
2. If r and s are limited, so are r + s and rs.
3. If x 0 and [r[ , then xr 0.
4. If x ,= 0, then x is innitesimal if and only if 1/x is unlimited.
5. Is it true that the innitesimals are a maximal ideal in the integral
domain of limited real numbers? What does the quotient eld look like?
6. Consider the function f: x x
2
on a closed interval. Then f is
bounded, and since it is standard it has a standard bound.Is this reasoning
correct?
7. Let x and y be standard with x <
y. Prove that x y.
1.3 The idealization principle
So far, we have no way to prove that any nonstandard objects exist.
Our next assumption is the idealization principle (I).
The notation
stn
means for all standard nite sets, and
stn
means there exists a standard nite set such that. Also, xX means
for all x in X, and xX means there exists x in X such that. Let A
be an internal formula. Then the idealization principle is
stn
x
yxx
A y
st
xA. (1.3)
There are no particular pitfalls connected with this assumption: we must
just be sure that A is internal. It can contain free variables in addition
to x and y (except for x
).
The intuition behind (I) is that we can only x a nite number of
objects at a time. To say that there is a y such that for all xed x we
have A is the same as saying that for any xed nite set of xs there is
a y such that A holds for all of them.
As a rst application of the idealization principle, let A be the for-
mula y ,= x. Then for every nite set x
we have y ,= x.
Therefore, there exists a nonstandard y. The same argument works when
x and y are restricted to range over any innite set. In other words, every
6 CHAPTER 1. INTERNAL SET THEORY
innite set contains a nonstandard element. In particular, there exists a
nonstandard natural number.
By transfer, we know that 0 is standard, and we know that if n
is standard then so is n + 1. Do we have a contradiction here? Why
cant we say that by induction, all natural numbers are standard? The
induction theorem says this: if S is a subset of N such that 0 is in S
and such that whenever n is in S then n + 1 is in S, then S = N. So
to apply induction to prove that every natural number is standard, we
would need a set S such that n is in S if and only if n is standard, and
this we dont have.
As a rough rule of thumb, until one feels at ease with nonstandard
analysis, it is best to apply the familiar rules of internal mathematics
freely to elements, but to be careful when working with sets of elements.
(From a foundational point of view, everything in mathematics is a set.
For example, a real number is an equivalence class of Cauchy sequences
of rational numbers. Even a natural number is a set: the number 0
is the empty set, the number 1 is the set whose only element is 0, the
number 2 is the set whose only elements are 0 and 1, etc. When I
refer to elements or objects rather than to sets, only a psychological
distinction is intended.)
We can prove that certain subsets of R and N do not exist.
Theorem 1. There does not exist S
1
, S
2
, S
3
, S
4
, or S
5
such that,
for all n in N and x in R, we have n S
1
n is standard, n S
2
n
is nonstandard, x S
3
x is limited, x S
4
x is unlimited, or
x S
5
x is innitesimal.
Proof. As we have seen, the existence of S
1
would violate the in-
duction theorem. If S
2
existed we could take S
1
= N S
2
, if S
3
existed
we could take S
1
= NS
3
, if S
4
existed we could take S
3
= R S
4
, and
if S
5
existed we could take S
4
= x R : 1/x S
5
.
This may seem like a negative result, but it is frequently used in
proofs. Suppose that we have shown that a certain internal property
A(x) holds for every innitesimal x; then we automatically know that
A(x) holds for some non-innitesimal x, for otherwise we could let S
5
be the set x R : A(x) . This is called overspill.
By Theorem 1 there is a non-zero innitesimal, for otherwise we
could let S
5
= 0. We can also see this directly from (I). We denote by
R
+
the set of all strictly positive real numbers. For every nite subset
x
of R
+
there is a y in R
+
such that y x for all x in x
. By (I), there
is a y in R
+
such that y x for all standard x in R
+
. That is, there
exists an innitesimal y > 0.
1.3 THE IDEALIZATION PRINCIPLE 7
Notice that in this example, to say that y is smaller than every
element of x
, so the nite set really plays no essential role. This situation occurs
so frequently that it is worth discussing it in a general context in which
the idealization principle takes a simpler form. Recall that a directed set
is a set D together with a transitive binary relation such that every
pair of elements in D has an upper bound. For example, R
+
is a directed
set with respect to . (In this example, the minimum of two elements is
an upper bound for them.) Let A be a formula with the free variable x
restricted to range over the directed set D. We say that A lters in x
(with respect to D and ) in case one can show that whenever x z and
A(z), then A(x). (Here A(z) is the formula obtained by substituting z
for each free occurrence of x in A(x), with the understanding that z is
not a bound variable of A.) Suppose that the internal formula A lters
in x. Then the idealization principle takes the simpler form
st
xyA y
st
xA. (1.4)
That is, we can simply interchange the two quantiers.
The idealization principle also has a dual form. If A is internal, then
stn
x
yxx
A y
st
xA. (1.5)
We say that A colters in x in case we can show that whenever x z
and A(x), then A(z). Then (1.5) takes the simpler form:
st
xyA y
st
xA. (1.6)
In practice, there is no need to make the distinction between ltering
and coltering, and I may say lters when colters is correct.
As an example of the dual form, suppose that f: R R is such that
every value of f is limited. Then f is bounded, and even has a standard
bound. To see this, use (1.6), where the ltering relation is on the
values of the function. We know that for all y there exists a standard x
such that [f(y)[ x. Hence there is a standard x such that [f(y)[ x
for all y.
8 CHAPTER 1. INTERNAL SET THEORY
Theorem 2. Let S be a set. Then S is a standard nite set if and
only if every element of S is standard.
Proof. Let S be a standard nite set, and suppose that it contains
a nonstandard y. Then there exists a y in S such that for all standard x
we have y ,= x, so by (I), for all standard nite sets x
, and in particular
for S, there is a y in S such that for all x in S we have y ,= x. This is a
contradiction, so every element of S is standard.
Conversely, suppose that every element of S is standard. We al-
ready know that S must be nite, because every innite set contains a
nonstandard element. Let x and y range over S and apply the dual form
of idealization to the formula x = y. We know that for all y there is a
standard x with x = y, so there is a standard nite set x
with x = y. Then x
S; that is, S
x
where
is standard, so by (T)
x
is standard.
Also, x
is nite, so
x
is standard, so S is standard.
2
This has the following corollary.
Theorem 3. Let n and k be natural numbers, with n standard and
k n. Then k is standard.
Proof. Let S = k N : k n. By (T), S is standard. It is
nite, so all of its elements are standard.
Therefore we can picture the natural numbers as lying on a tape,
with the standard numbers to the left and the nonstandard numbers to
the right. The demarcation between the two portions is strange: the left
portion is not a set, and neither is the right. I want to emphasize that
we did not start with the left portion and invent a new right portion to
be tacked on to itwe started with the whole tape, the familiar set N
of all natural numbers, and invented a new way of looking at it.
Theorem 4. There is a nite set that contains every standard ob-
ject.
Proof. This is easy. Just apply (I) to the formula x y & y is
nite.
If we think of standard semanticallywith the world of mathe-
matical objects spread out before us, some bearing the label standard
and others notthen these results violate our intuition about nite sets.
But recall what it means to say that X is a nite set. This is an internal
2
I am grateful to Will Schneeberger for pointing out a gap in my rst proof of
this theorem.
1.3 THE IDEALIZATION PRINCIPLE 9
notion, so it means what it has always meant in mathematics. What
has it always meant? There are two equivalent characterizations. For n
in N, we let I
n
= k N : k < n. Then a set X is nite if and only if
there is a bijection of X with I
n
for some n. There is also the Dedekind
characterization: X is nite if and only if there is no bijection of X with
a proper subset of itself. Consider the set I
n
where n is a nonstandard
natural number. This certainly satises the rst property. Nothing was
said about n being standard; this cannot even be formulated within in-
ternal mathematics. But does it satisfy the Dedekind property? Suppose
that we send each nonstandard element of I
n
to its predecessor and leave
the standard elements alone; isnt this a bijection of I
n
with I
n1
? No,
its denition as a function would involve illegal set formation.Perhaps
it is fair to say that nite does not mean what we have always thought
it to mean. What have we always thought it to mean? I used to think
that I knew what I had always thought it to mean, but I no longer think
so. In any case, intuition changes with experience. I nd it intuitive to
think that very, very large natural numbers and very, very small strictly
positive real numbers were there all along, and now we have a suitable
language for discussing them.
Exercises for Section 1.3
1. A natural number is standard if and only if it is limited.
2. Does there exist an unlimited prime?
3. What does the decimal expansion of an innitesimal look like?
4. Let H be a Hilbert space. Does there exist a nite dimensional subspace
containing all of its standard elements? Is it closed?
5. Suppose that the S
n
, for n in N, are a sequence of disjoint sets with
union S, and suppose that every element x in S is in S
n
for some standard n.
Can one show that all but a nite number of them are empty? Can one show
that all but a standard nite number of them are empty?
6. Prove Robinsons lemma: Let n a
n
be a sequence such that a
n
0
for all standard n. Then there is an unlimited N such that a
n
0 for all
n N.
7. Let n a
n
be a sequence such that a
n
for all standard n. Is
there an unlimited N such that a
n
for all n N?
8. We have been saying let x range over X to mean that each xA
should be replaced by x[x X A] and each xA should be replaced by
x[x X & A]. Let
n
X be the set of all non-empty nite subsets of X. Show
that if x ranges over X, then (I) can be written as
st
x
yxx
A y
st
xA
where x
ranges over
n
X.
10 CHAPTER 1. INTERNAL SET THEORY
1.4 The standardization principle
Our nal assumption about the new predicate standard is the
standardization principle (S). It states that
st
X
st
Y
st
z[z Y z X & A]. (1.7)
Here A can be any formula (not containing Y ), external or internal. It
may contain parameters (free variables in addition to z and X).
The intuition behind (S) is that if we have a xed set, then we can
specify a xed subset of it by giving a criterion for judging whether each
xed element is a member of it or not.
Two sets are equal if they have the same elements. By (T), two stan-
dard sets are equal if they have the same standard elements. Therefore,
the standard set Y given by (S) is unique. It is denoted by
S
z X : A,
which may be read as the standard set whose standard elements are
those standard elements of X such that A holds. Unfortunately, any
shortening of this cumbersome phrase is apt to be misleading. For stan-
dard elements z, we have a direct criterion for z to be an element of
S
z X : A, namely that A(z) hold. But for nonstandard elements z,
this is not so. It may happen that z is in
S
z X : A but A(z) does not
hold, and conversely A(z) may hold without z being in
S
z X : A.
For example, let X =
S
z R : z 0 . Then X = 0 since 0 is the
only standard innitesimal. Thus we can have z 0 without z being
in X. Let Y =
S
z R : [z[ . Then Y = R since every standard
number is limited. Thus we can have z Y without z being limited.
The standardization principle is useful in making denitions. Let
x range over the standard set X. When we make a denition of the
form: for x standard, x is something-or-other in case a certain property
holds, this is understood to mean the same as x is something-or-other
in case x
S
x X : a certain property holds . For example, let
f: R R and let us say that for f standard, f is nice in case every
value of f is limited. Thus f is nice if and only if f N, where we
let N =
S
f R
R
: every value of f is limited . (The notation X
Y
signies the set of all functions from Y to X.) I claim that f is nice if
and only if f is bounded. To prove this, we may, by (T), assume that
f is standard. We already saw, in the previous section, that if every
value of f is limited, then f is bounded. Conversely, let f be bounded.
Then by (T) it has a standard bound, and so is nice. This proves the
claim. But, one might object, the rst transfer is illegal because nice
was dened externally. The point is this: being nice is the same as being
1.4 THE STANDARDIZATION PRINCIPLE 11
in the standard set N, so the transfer is legal. In complete detail, (T)
tells us that for all standard N we have
st
f[f N f is bounded] f[f N f is bounded].
In other words, this kind of denition by means of (S) is a way of dening,
somewhat implicitly, an internal property.
Let x and y range over a set V , let y range over V
V
, and let A be
internal. Then
xyA(x, y) yxA
_
x, y(x)
_
.
The forward direction is the axiom of choice, and the backward direction
is trivial. If we apply transfer to this, assuming that V is standard, we
obtain
st
x
st
yA(x, y)
st
y
st
xA
_
x, y(x)
_
.
But we can do much better than this. Let A be any formula, external
or internal. Then
st
x
st
yA(x, y)
st
y
st
xA
_
x, y(x)
_
. (1.8)
The backward direction is trivial, so we need only consider the forward
direction. Suppose rst that for all standard x there is a unique stan-
dard y such that A(x, y). Then we can let y =
S
x, y) : A(x, y) . In
the general case, let
Y =
S
x, Y ) : Y =
S
y : A(x, y) .
Then
Y is a standard set-valued function whose values are non-empty
sets, so by the axiom of choice relativized to the standard sets, it has a
standard cross-section y of the desired form. We call (1.8) the functional
form of standardization (
st
x
st
yA(x, y)
st
y
st
xA
_
x, y(x)
_
. (1.9)
The requirements in (1.8) and (1.9) are that x and y range over some
standard set V and that y range over V
V
. This includes the case that
x and y range over dierent standard sets X and Y , with y ranging
over Y
X
, because we can always let V = XY and include the conditions
x X and y Y in the formula A.
If X is a set, contained in some standard set V , we let
S
X =
S
x V : x X.
12 CHAPTER 1. INTERNAL SET THEORY
This clearly does not depend on the choice of V , and in practice the
requirement that X be contained in some standard set is not restrictive.
Then
S
X is the unique standard set having the same standard elements
as X. It is easy to see that if f is a function then so is
S
f.
The theory obtained by adjoining (I), (S), and (T) to internal math-
ematics is Internal Set Theory (IST).
So far, we have not proved any theorems of internal mathematics
by these new methods. Here is a rst example of that, a theorem of
de Brujn and Erdos
3
on the coloring of innite graphs.
By a graph I mean a set G together with a subset R of GG. We
dene a k-coloring of G, where k is a natural number, to be a function
g: G 1, . . . , k such that g(x) ,= g(y) whenever x, y) R. An
example is G = R
2
and R = x, y) : [x y[ = 1 where [x y[ is the
Euclidean metric on R
2
. (This graph is known to have a 7-coloring but
no 3-coloring.) The theorem asserts that if every nite subgraph of G
has a k-coloring, so does G. This is not trivial, because if we color a
nite subgraph, we may be forced to go back and change its coloring
to color a larger nite subgraph containing it. To prove the theorem,
we assume, by (T), that G and k are standard. By (I), there is a nite
subgraph F of G containing all its standard elements, and by hypothesis
F has a k-coloring f. Let g =
S
f. Since f takes only standard values,
every standard element of G is in the domain of g. By (T), every element
of G is in the domain of g. To verify that g is a k-coloring, it suces,
by (T), to examine the standard elements, where it agrees with f. This
concludes the proof.
Exercises for Section 1.4
1. Show that if f is a function, so is
S
f. What can one say about its
domain and range?
2. Deduce (S) from (
S).
3. Dene f by f(x) = t/(t
2
+ x
2
) where t > 0 is innitesimal. What is
_
n=1
a
n
2
n
where each a
n
is
0 or 1, and conversely each number of this form is in [0, 1]. This binary
expansion is determined by a function a: N
+
0, 1. Let b =
S
a. Then
b
n
= a
n
for all standard n, so if we let x
0
=
n=1
b
n
2
n
then x
0
is
standard, x
0
x, and x
0
is in [0, 1]. Therefore [0, 1] is compact. This
has the following corollary.
Theorem 5. Each limited real number is innitely close to a unique
standard real number.
Proof. Let x be limited. Then [x] is a limited, and therefore stan-
dard, integer. Since x [x] is in the standard compact set [0, 1], there
is a standard y
0
innitely close to it, so if we let x
0
= y
0
+ [x] then x
0
is standard and innitely close to x. The uniqueness is clear, since 0 is
the only standard innitesimal.
14 CHAPTER 1. INTERNAL SET THEORY
If x is limited, the standard number that is innitely close to it is
called the standard part of x, and is denoted by st x.
Now let us prove that a continuous function f on a compact set E
is bounded. By (T), we assume them to be standard. Let K , and
let x be in E. It suces to show that f(x) K. There is a standard
x
0
in E with x x
0
, and since f(x
0
) is standard we have f(x
0
) K.
But by the continuity of the standard function f we have f(x) f(x
0
),
so f(x) K, which concludes the proof. (It may seem like cheating
to produce an unlimited bound. By (T), if a standard function f is
bounded, then it has a standard bound. But this is a distinction that
can be made only in nonstandard analysis.)
Somewhat more ambitiously, let us show that a continuous func-
tion f on a compact set E achieves its maximum. Again, we assume
them to be standard. By (I), there is a nite subset F of E that con-
tains all the standard points, and the restriction of f to the nite set F
certainly achieves its maximum on F at some point x. By compactness
and continuity, there is a standard x
0
in E with x
0
x and f(x
0
) f(x).
Therefore f(x
0
) >
E =
S
x X : y x for some y in E . (1.10)
Theorem 6. Let E be a subset of the standard topological space X.
Then the shadow of E is closed.
Proof. Let z be a standard point of X in the closure of
E. Then
every open neighborhood of z contains a point of
E, so by (T) every
standard open neighborhood U of z contains a standard point x of
E.
But for a standard point x of
E, there is a y in E with y x, so y is
in U. That is to say,
st
Uy[y E U]. The open neighborhoods of
z are a directed set under inclusion and this formula lters in U, so by
the simplied version (1.4) of (I) we have y
st
U[y E U]; that is,
yE[y z]. Thus z is in
E. We have shown that every standard point
z in the closure of
E is in
E, so by (T) every point z in the closure of
E is in
E. Thus
E is closed.
There is a beautiful nonstandard proof of the Tychonov theorem.
Let T be a set, let X
t
for each t in T be a compact topological space, and
let be the Cartesian product =
tT
X
t
with the product topology.
We want to show that is compact. By (T), we assume that t X
t
is
standard, so that is also standard. Let be in . For all standard t
there is a standard point y in X
t
such that y (t), so by (
S) there is
a standard in such that for all standard t we have (t) (t). By
the denition of the product topology, , so is compact.
16 CHAPTER 1. INTERNAL SET THEORY
Exercises for Section 1.5
1. Precisely how is (
st
to the left of the internal quantiers and , and then using (T) to
get rid of them entirely. In this way we can show that the denitions
made using (S) in the previous section are equivalent to the usual ones.
More interestingly, we can reduce the rather weird external theorems
that we have proved to equivalent internal form. The proviso, which has
only nuisance value, is that whenever we use (
S) to introduce a standard
function y(x), then x and y must be restricted to range over a standard
set, to give the function y a domain. (Actually, it suces to make this
restriction on x alone.) I will not make this explicit all of the time.
To reduce a formula, rst eliminate all external predicates, replacing
them by their denitions until only standard is left. Even this should
be eliminated, replacing x is standard by
st
y[y = x].
Second, look for an internal quantier that has some external quan-
tiers in its scope. (If this never happens, we are ready to apply (T) to
obtain a weakly equivalent internal formula.) If the internal quantier
is , use (
S) to pull the
st
s to the left of the
st
s (if necessary, thereby
introducing standard functions), where they can then be pulled to the
left of . If the internal quantier is , proceed by duality.
Third, whenever y
st
xA (or its dual) occurs with A internal, use
(I), taking advantage of its simplication (1.4) whenever A lters in x.
One thing to remember in using this reduction algorithm is that the
implication
st
xA(x) B is equivalent to
st
x[A(x) B], and dually
st
xA(x) B is equivalent to
st
x[A(x) B], provided in both cases
that x does not occur free in B. (This has nothing to do with the super-
script st; it is a general fact about quantiers in the hypothesis of an
1.6 REDUCTION OF EXTERNAL FORMULAS 17
implication. For example, let A(s) be s is an odd perfect number and
let B be the Riemann hypothesis. Then sA(s) B and s[A(s) B]
say the same thing.) If B begins with an external quantier, it comes out
of the implication unchanged, but we have our choice of which quantier
to take out rst. The idea is to do this in such a way as to introduce as
few functions as possible. If we have an equivalence, we must rewrite it
as the conjunction of two implications and rename the bound variables,
since they come out of the implications in dierent ways. For example,
the formula
st
xA(x)
st
yB(y) is equivalent to
[
st
xA(x)
st
yB(y)] & [
st
uB(u)
st
vA(v)],
which in turn is equivalent to
st
x
st
u
st
y
st
v
_
[A(x) B(y)] & [B(u) A(v))
and also to
st
y
st
v
st
x
st
u
_
[A(x) B(y)] & [B(u) A(v))
.
If all of this is preceded by t, where t is a free variable in A or B, the
second form is advantageous; if it is preceded by t, one should use the
rst form.
Let us illustrate the reduction algorithm with the denitions by (S)
in the previous section of continuous at x, continuous, and uni-
formly continuous. The formulas in question are respectively
y
_
y x f(y) f(x)
st
xy
_
y x f(y) f(x)
,
xy
_
y x f(y) f(x)
.
We must eliminate . With and ranging over R
+
, we obtain
y
_
st
[[y x[ ]
st
[[f(y) f(x)[ ]
, (1.11)
st
xy
_
st
[[y x[ ]
st
[[f(y) f(x)[ ]
, (1.12)
xy
_
st
[[y x[ ]
st
[[f(y) f(x)[ ]
. (1.13)
In all of these formulas, f is a standard parameter, and x is a standard
parameter in (1.11). First bring
st
out; it goes all the way to the left.
18 CHAPTER 1. INTERNAL SET THEORY
Then bring
st
out; it changes to
st
, and since the formula lters in ,
it goes to the left of y, and also of x in (1.13). This yields
st
st
y
_
[y x[ [f(y) f(x)[
st
st
x
st
y
_
[y x[ [f(y) f(x)[
st
st
xy
_
[y x[ [f(y) f(x)[
.
Now apply transfer; this simply removes the superscripts st and shows
that our denitions are equivalent to the usual ones.
Now consider x
st
x
0
[x x
0
]; that is,
x
st
x
0
st
_
[x x
0
[
. (1.14)
It is understood that x and x
0
range over the standard set E; this was
our denition by (S) of E being compact. First we move
st
to the
left, by (
st
x
st
x
0
_
[x x
0
[ (x
0
)
. (1.15)
Now use (I). There is no ltering in (1.15), so it becomes
st
stn
x
0
xx
0
x
0
_
[x x
0
[ (x
0
)
.
Now apply (T). We obtain
n
x
0
xx
0
x
0
_
[x x
0
[ (x
0
)
,
where
n
means there exists a nite set such that (and similarly
n
means for all nite sets). But this is mathematically equivalent to the
usual denition of a set being compact.
Denitions by (S) are an external way of characterizing internal
notions, but they simultaneously suggest new external notions. These
are often indicated by the prex S-. Thus we say that f is S-continuous
at x in case whenever y x we have f(y) f(x). If both f and
x are standard, this is the same as saying that f is continuous at x.
But let t > 0 be innitesimal, and let f(x) = t/(t
2
+ x
2
). Then f is
continuous at 0 (this internal property is true for any t > 0), but it is
not S-continuous at 0. Let g(x) = t for x ,= 0 and g(0) = 0. Then g is
discontinuous at 0 but is S-continuous at 0. Let h(x) = x
2
. Then h is
continuous at 1/t but is not S-continuous at 1/t. Similarly, we say that f
is S-uniformly continuous in case whenever y x we have f(y) f(x).
1.6 REDUCTION OF EXTERNAL FORMULAS 19
I am not very fond of the S-notation. It seems to imply that to every
internal notion there is a unique corresponding external S-notion, but
this is not so. There may be distinct external notions that are equivalent
for standard values of the parameters.
Now let us apply the reduction algorithm to some of our external
theorems. Let us split Theorem 2 into three dierent statements.
(i) If every element of a set is standard, then the set is nite. That
is,
S
_
yS
st
x[y = x] S is nite
,
SyS
st
x
_
y = x S is nite
stn
x
SySxx
_
y = x S is nite
, (1.16)
n
x
SySxx
_
y = x S is nite
, (1.17)
n
x
S
_
S x
S is nite
.
In other words, (i) is equivalent to (i
. (1.18)
Apply (I) to the hypothesis:
S
_
stn
x
ySxx
[y = x]
st
T[S = T]
;
simplify:
S
_
stn
x
[S x
]
st
T[S = T]
;
pull out the quantiers and apply (I):
stn
x
stn
T
STT
[S x
S = T];
simplify and apply (T):
n
x
n
T
S[S x
S T
].
This is true for T
=
x
, where
X denotes the power set of X (the set
of all subsets of X). In other words, (ii) is equivalent to (ii
): the power
set of a nite set is nite. When reducing an external formula, it is a
20 CHAPTER 1. INTERNAL SET THEORY
good idea to attack subformulas rst. For example, had we pulled the
quantiers out of (1.18) directly, we would have obtained
S
st
TyS
st
x[y = x S = T],
which after reduction and simplication gives:
n
T
S
_
S
TT
(T) S T
_
. (1.19)
Every function must have a domain, so the use of (
S) to produce the
nite-set valued function
x
=
_
(T
0
)
_
T
0
.
If S
TT
(T
0
) and so S T
. Thus
(1.19) is an obscure way of saying that the power set of a nite set is
nite.
(iii) Every element of a standard nite set is standard. That is,
st
S
_
S is nite yS
st
x[y = x]
.
This reduces easily to
S
n
x
[S is nite S x
].
In other words, (iii) is equivalent to (iii
,
which reduces easily to
n
n
j
k[k n k j
].
Theorem 4, that there is a nite set containing every standard object,
reduces immediately to the following triviality: for all nite sets x
there
is a nite set F such that every element of x
is an element of F.
1.6 REDUCTION OF EXTERNAL FORMULAS 21
Theorem 4 is shocking, but the informal statement there is a nite
set containing any xed element appears quite reasonable. Similarly,
with the variables ranging over R
+
, one is accustomed to saying there
is an x less than any xed . Nonstandard analysis takes the further
step of saying call such an x an innitesimal. I want to emphasize
again that the predicate standard has no semantic content in IST; it
is a kind of syntactical place-holder signifying that the object in question
is to be held xed. With many objects in play at once, some depending
on others, the syntax of being held xed becomes complicated, and the
rules for handling the idea correctly are (I), (S), and (T). What Abraham
Robinson invented is nothing less than a new logic. He was explicit about
this in the last paragraph of his epoch-making book:
4
Returning now to the theory of this book, we observe that it is
presented, naturally, within the framework of contemporary Mathe-
matics, and thus appears to arm the existence of all sorts of inni-
tary entities. However, from a formalist point of view we may look at
our theory syntactically and may consider that what we have done is
to introduce new deductive procedures rather than new mathematical
entities.
One technical point is worth commenting on. It is essential to the success
of the reduction algorithm that the idealization principle hold for internal
formulas with free variables; this feature was not present in Robinsons
notion of enlargement [Ro, 2.9].
Exercises for Section 1.6
1. Find a function f that is S-uniformly continuous without being uni-
formly continuous, and vice versa.
2. What is the reduction of the formula x 0?
3. What is the reduction of Theorem 5?
4. Show that the denitions by (S) of open, closed, and bounded in the
previous section are equivalent to the usual ones.
5. Show that the denition by (S) of a complete metric space in the
previous section is equivalent to the usual one.
6. Say that a sequence a: N R is S-Cauchy in case for all unlimited n
and m we have a
n
a
m
. Show that a standard sequence is S-Cauchy if and
only if it is Cauchy. Say that a is of limited uctuation in case for all standard
> 0 and all k, if n
1
< < n
k
and [a
n
1
a
n
2
[ , . . . , [a
n
k1
a
n
k
[ ,
then k is limited. Show that a standard sequence is of limited uctuation if
and only if it is Cauchy. Can a sequence be S-Cauchy without being of limited
uctuation, or vice versa?
7. Is the unit ball of R
n
, where n is unlimited, compact?
4
[Ro] Abraham Robinson, Non-Standard Analysis, Revised Edition, American
Elsevier, New York, 1974.
22 CHAPTER 1. INTERNAL SET THEORY
8? Can the reduction of Theorem 6 be made intelligible? (The ? means
that I dont know the answer.)
1.7 Answers to the exercises
Section 1.1
1. Yes and yes; this set can be formed for any natural number, and it is
a nite set.
2. No.
3. Let x be standard and positive. Then x x, so x is limited.
4. No.
5. At rst sight this looks like illegal set formation, but let S be the set
of all real numbers x such that x
2
1. Assuming that 1 is standard, we see
that every x in S is limited, so S is the set in question.
6. Not yet; so far, nothing guarantees that if is standard then so is /2.
Section 1.2
1. Yes. Let x and y be innitesimal and let > 0 be standard. By transfer,
/2 is standard, so that [x[ /2 and [y[ /2. Then [x + y[ , and since
was an arbitrary strictly positive standard number, x + y is innitesimal.
2. There are standard numbers R and S such that [r[ R and [s[ S.
By (T), R + S and RS are standard.
3. Let > 0 be standard. There is a standard R, which we may take to
be non-zero, such that [r[ R. By (T), /R is standard, so [x[ /R. Hence
[xr[ .
4. For > 0 and r = 1/, is standard if and only if r is standard, by
the transfer principle.
5. We cannot speak of the innitesimals as an ideal, or of the integral
domain of limited real numbers, without illegal set formation. If we avoid
these illegal set formations, the preceding exercises essentially give an arma-
tive answer to the rst question. But to talk about the quotient eld we
really need sets. The discussion must be postponed to Chapter 4, where we
shall have external sets at our disposal.
6. No. If the closed interval is [0, b] where b is unlimited, the function has
no standard bound. The transfer was illegal because the closed interval was a
parameter.
7. We have x y + where is innitesimal and x and y are standard.
Then x y + for all standard > 0, so by (T) we have x y + for all
> 0. Hence x y.
1.7 ANSWERS TO THE EXERCISES 23
Section 1.3
1. This is just a restatement of Theorem 3.
2. Yes. Euclid showed that for any natural number n there is a prime
greater than n.
3. Let x > 0 be innitesimal. Then 0 < x < 1, so x is of the form
x =
n=1
a
n
10
n
where each a
n
is one of 0, . . . , 9. Since 10
n
is standard if
n is, by (T), each a
n
with n standard is 0. But x > 0, so not all of its decimal
digits are 0. As we already know, any x > 0 has a rst non-zero decimal digit.
What about the number whose decimal digits are 0 for all standard n and 7
for all nonstandard n? The question makes so sense. The decimal digits form
a sequence; a sequence is a set; and I committed illegal set formation in posing
the question.
4. Yes; by (I), there is a nite subset of H containing all of its standard
points, so take its span. (This is by no means unique, and we cannot form
the smallest such space without illegal set formation.) Any nite dimensional
subspace of a Hilbert space is closed.
5. We have x
st
n[x S
n
], where x ranges over S. By (I), we have that
stn
n
xnn
[x S
n
], and since the sets are disjoint, all but a standard
nite number of them are empty.
6. The set of all N such that [a
n
[ 1/n for all n N contains all
standard N, so by overspill it contains some unlimited N.
7. Not necessarily; consider the identity function.
8. We can assume that X is non-empty, since otherwise both sides are
vacuously true. The backward direction holds by (I). In the forward direction,
by (I) we have some nite standard x
S) we have
st
y
st
z[ y(z) = 1 A].
Then we let Y = | z X : y(z) = 1 .
3. By elementary calculus, this integral is equal to 1 for any t > 0. There
are no standard pairs x, f(x)), so
S
f is the empty set.
4. Let S =
S
| n N : A(n) . Then 0 is in S. By assumption, for all
standard n, if n is in S then n + 1 is in S, so by (T) this is true for all n. By
induction, S = N. In particular, every standard n is in S, so A(n) holds for
every standard n.
24 CHAPTER 1. INTERNAL SET THEORY
5. Let
S =
S
| : A() .
Then every standard element of S is an ordinal, so the same is true by (T) for
all elements of S. Since S is non-empty (it contains ), it contains a least ,
which is standard by (T). Then . Consider a standard such that A().
If < , then certainly ; if then S, so again . This
concludes the proof.
This is often used to prove
st
B(). Argue indirectly. If not, there is a
least standard such that A() where A is B. If we can show that cannot
be 0, a successor, or a limit ordinal, then the proof will be complete.
6. To establish (I), we need only show (without using the backward di-
rection of the idealization principle) that every element of a standard nite
set x
tT
X
t
such that (t) X
t
for all t in T. So if we know that for all standard t there is a y in X
t
with
y (t), then (
st
[[x x
0
[ ]],
st
rx
st
x
0
st
[[x[ r [x x
0
[ ],
st
r
st
stn
x
0
xxx
0
[[x[ r [x x
0
[ (x
0
)];
then remove the superscripts st. This says that every interval [r, r] is
compact.
4. For open we have
st
x
0
E zR[
st
[[z x
0
[ ] z E].
Bring
st
out as
st
, which lters past zR. Then use (T) to obtain
x
0
zR[[z x
0
[ z E].
Similarly, the formula for closed reduces to
x
0
RxE[[x x
0
[ x
0
E],
which is perhaps more readable if we push xE inside the parentheses as
xE. For bounded, use the ltering form of (I) and (T) to obtain the
usual formulation rxE[[x[ r].
5. Let X, d) be a standard metric space. To avoid confusion, let us
temporarily call it nice in case for all x, if for all standard > 0 there is a
standard y with d(x, y) , then there is a standard x
0
with x x
0
. Suppose
that X is incomplete. By (T), there is a standard Cauchy sequence x
n
with
no limit. Let and let > 0 be standard. Consider the set S of all n
such that d(x
n
, x
x
0
, for then it would be the limit of the x
n
. Thus an incomplete
standard metric space is not nice. Conversely, let X be complete and let x
be a point in X such that for all standard > 0 there is a standard y with
d(x, y) . Then for all standard n there is a standard y with d(x, y) 1/n,
so by (
st
y[d(x, y) ]
st
x
0
st
[d(x, x
0
) ]].
Use (
st
st
yx
st
x
0
st
[d(x, y()) d(x, x
0
)
(x
0
)],
which by (I) and (T) is equivalent to
y
n
n
x
0
xx
0
x
0
[
was pushed back inside the implication. Now we have the internal
problem of seeing that this is equivalent to completeness of the metric space.
Only the Cauchy y are relevant (to obtain a more customary notation, we
could let y
n
= y(1/n)), for otherwise we can nd a two-point set
that
violates the hypothesis of the implication, by the triangle inequality. If the
space X is complete, just let x
0
be the singleton consisting of the limit as
0 of y(). To construct a counterexample when X is not complete, let y
be Cauchy without a limit, and let
(x
0
) be the distance from x
0
to the limit
(in the completion) of y().
6. The S-Cauchy condition is
nm[
st
r[n r & m r]
st
[[a
n
a
m
[ ]].
The r lters out to give the usual denition of a Cauchy sequence. The limited
uctuation condition is
st
k[A(k, , a)
st
r[k r]],
where A(k, , a) is an abbreviation for the assertion that the sequence a con-
tains k -uctuations. Again the r lters out, and a standard sequence is of
limited uctuation if and only if for all > 0 there is a bound r on the number
of -uctuations, which is the same as being Cauchy. Now let a be an S-Cauchy
sequence, not necessarily standard, and let > 0 be standard. Consider the
set of all n such that for all m > n we have [a
n
a
m
[ ; this set contains
all unlimited n, so by overspill it contains some limited n. Consequently, an
S-Cauchy sequence is of limited uctuation. But let and let a
n
= 0
for n < and a
n
= 1 for n . This sequence is of limited uctuation but
is not S-Cauchy. Thus we have two distinct external concepts that agree on
standard objects.
7. The unit ball of any Euclidean space is compact.