Oersted Medal Lecture 2002 Reforming The Mathemati
Oersted Medal Lecture 2002 Reforming The Mathemati
Oersted Medal Lecture 2002 Reforming The Mathemati
net/publication/243492634
CITATIONS READS
264 2,946
1 author:
David Hestenes
Arizona State University
118 PUBLICATIONS 15,832 CITATIONS
SEE PROFILE
All content following this page was uploaded by David Hestenes on 03 April 2015.
David Hestenes
Department of Physics and Astronomy
Arizona State University, Tempe, Arizona 85287-1504
The connection between physics teaching and research at its deepest level
can be illuminated by Physics Education Research (PER). For students and
scientists alike, what they know and learn about physics is profoundly shaped
by the conceptual tools at their command. Physicists employ a miscellaneous
assortment of mathematical tools in ways that contribute to a fragmenta-
tion of knowledge. We can do better! Research on the design and use of
mathematical systems provides a guide for designing a unified mathemati-
cal language for the whole of physics that facilitates learning and enhances
physical insight. This has produced a comprehensive language called Geo-
metric Algebra, which I introduce with emphasis on how it simplifies and
integrates classical and quantum physics. Introducing research-based re-
form into a conservative physics curriculum is a challenge for the emerging
PER community. Join the fun!
I. Introduction
The relation between teaching and research has been a perennial theme in
academia as well as the Oersted Lectures, with no apparent progress on re-
solving the issues. Physics Education Research (PER) puts the whole matter
into new light, for PER makes teaching itself a subject of research. This shifts
attention to the relation of education research to scientific research as the central
issue.
To many, the research domain of PER is exclusively pedagogical. Course
content is taken as given, so the research problem is how to teach it most effec-
tively. This approach to PER has produced valuable insights and useful results.
However, it ignores the possibility of improving pedagogy by reconstructing
course content. Obviously, a deep knowledge of physics is needed to pull off
anything more than cosmetic reconstruction. It is here, I contend, in addressing
the nature and structure of scientific subject matter, that PER and scientific
research overlap and enrich one another with complementary perspectives.
The main concern of my own PER has been to develop and validate a sci-
entific Theory of Instruction to serve as a reliable guide to improving physics
teaching. To say the least, many physicists are dubious about the possibility.
Even the late Arnold Arons, patron saint of PER, addressed a recent AAPT
session with a stern warning against any claims of educational theory. Against
this backdrop of skepticism, I will outline for you a system of general principles
that have guided my efforts in PER. With sufficient elaboration (much of which
1
already exists in the published literature), I believe that these principles provide
quite an adequate Theory of Instruction.
Like any other scientific theory, a Theory of Instruction must be validated by
testing its consequences. This has embroiled me more than I like in developing
suitable instruments to assess student learning. With the help of these instru-
ments, the Modeling Instruction Program has amassed a large body of empirical
evidence that I believe supports my instructional theory. We cannot review that
evidence here, but I hope to convince you with theoretical arguments.
A brief account of my Theory of Instruction sets the stage for the main
subject of my lecture: a constructive critique of the mathematical language used
in physics with an introduction to a unified language that has been developed
over the last forty years to replace it. The generic name for that language
is Geometric Algebra (GA). My purpose here is to explain how GA simplifies
and clarifies the structure of physics, and thereby convince you of its immense
implications for physics instruction at all grade levels. I expound it here in
sufficient detail to be useful in instruction and research and to provide an entreé
to the published literature.
After explaining the utter simplicity of the GA grammar in Section V, I
explicate the following unique features of the mathematical language:
(1) GA seamlessly integrates the properties of vectors and complex numbers
to enable a completely coordinate-free treatment of 2D physics.
(2) GA articulates seamlessly with standard vector algebra to enable easy
contact with standard literature and mathematical methods.
(3) GA Reduces “grad, div, curl and all that” to a single vector derivative
that, among other things, combines the standard set of four Maxwell equations
into a single equation and provides new methods to solve it.
(4) The GA formulation of spinors facilitates the treatment of rotations and
rotational dynamics in both classical and quantum mechanics without coordi-
nates or matrices.
(5) GA provides fresh insights into the geometric structure of quantum me-
chanics with implications for its physical interpretation.
All of this generalizes smoothly to a completely coordinate-free language for
spacetime physics and general relativity to be introduced in subsequent papers.
The development of GA has been a central theme of my own research in
theoretical physics and mathematics. I confess that it has profoundly influenced
my thinking about PER all along, though this is the first time that I have made
it public. I have refrained from mentioning it before, because I feared that my
ideas were too radical to be assimilated by most physicists. Today I am coming
out of the closet, so to speak, because I feel that the PER community has reached
a new level of maturity. My suggestions for reform are offered as a challenge
to the physics community at large and to the PER community in particular.
The challenge is to seriously consider the design and use of mathematics as an
important subject for PER. No doubt many of you are wondering why, if GA is
so wonderful – why have you not heard of it before? I address that question in
the penultimate Section by discussing the reception of GA and similar reforms
by the physics community and their bearing on prospects for incorporating GA
2
into the physics curriculum. This opens up deep issues about the assimilation
of new ideas – issues that are ordinarily studied by historians, but, I maintain,
are worthy subjects for PER as well.
3
5. Expert learning requires deliberate practice with critical feed-
back.
There is substantial evidence that practice does not significantly improve
intellectual performance unless it is guided by critical feedback and deliberate
attempts to improve.4 Students waste an enormous amount of time in rote
study that does not satisfy this principle.
I believe that all five principles are essential to effective learning and instruc-
tional design, though they are seldom invoked explicitly, and many efforts at
educational reform founder because of insufficient attention to one or more of
them. The terms “concept” and “conceptual learning” are often tossed about
quite cavalierly in courses with names like “Conceptual Physics.” In my expe-
rience, such courses fall far short of satisfying the above learning principles, so
I am skeptical of claims that they are successful in teaching physics concepts
without mathematics. The degree to which physics concepts are essentially
mathematical is a deep problem for PER.
I should attach a warning to the First (Constructivist) Learning Principle.
There are many brands of constructivism, differing in the theoretical context
afforded to the constructivist principle. An extreme brand called “radical con-
structivism” asserts that constructed knowledge is peculiar to an individual’s
experience, so it denies the possibility of objective knowledge. This has radi-
calized the constructivist revolution in many circles and drawn severe criticism
from scientists.5 I see the crux of the issue in the fact that the constructivist
principle does not specify how knowledge is constructed. When this gap is
closed with the other learning principles and scientific standards for evidence
and inference, we have a brand that I call scientific constructivism.
I see the five Learning Principles as equally applicable to the conduct of
research and to the design of instruction. They support the popular goal of
“teaching the student to think like a scientist.” However, they are still too
vague for detailed instructional design. For that we need to know what counts
as a scientific concept, a subject addressed in the next Section.
4
Standards and the AAAS Project 2061. The Modeling Instruction Project has
led the way in incorporating it into K-12 curriculum and instruction.3
Though I first introduced the term “Modeling Theory” in connection with
my Theory of Instruction,8 I have always conceived of it as equally applicable
to scientific research. In fact, Modeling Theory has been the main mechanism
for transferring what I know about research into designs for instruction.
In so far as Modeling Theory constitutes an adequate epistemology of sci-
ence, it provides a reliable framework for critique of the physics curriculum and
a guide for revising it. From this perspective, I see the standard curriculum as
seriously deficient at all levels from grade school to graduate school. In partic-
ular, the models inherent in the subject matter are seldom clearly delineated.
Textbooks (and students) regularly fail to distinguish between models and their
implications.2 This results in a cascade of student learning difficulties. However,
we cannot dwell on that important problem here.
I have discussed Modeling Theory and its instructional implications at some
length elsewhere,2, 8–10 although there is still more to say. The brief account
above suffices to set the stage for application to the main subject of this lecture.
Modeling Theory tells us that the primary conceptual tools mentioned in the 4th
Learning Principle are modeling tools. Accordingly, I have devoted considerable
PER effort to classification, design, and use of modeling tools for instruction.
Heretofore, emphasis has been on the various kinds of graphs and diagrams used
in physics, including analysis of the information they encode and comparison
with mathematical representations.9, 10 All of this was motivated and informed
by my research experience with mathematical modeling.
In the balance of this lecture, I draw my mathematics research into the PER
domain as an example of how PER can and should be concerned with basic
physics research.
5
“cannot be extracted from experience, but must be freely invented.”11 Note also
that Einstein’s assertion amounts to a form of scientific constructivism in accord
with the Learning Principles in Section II.
The point I wish to make by citing these two examples is that without
essential mathematical concepts the two theories would have been literally in-
conceivable. The mathematical modeling tools we employ at once extend and
limit our ability to conceive the world. Limitations of mathematics are evident
in the fact that the analytic geometry that provides the foundation for classical
mechanics is insufficient for General Relativity. This should alert one to the
possibility of other conceptual limits in the mathematics used by physicists.
Since Newton’s day a variety of different symbolic systems have been in-
vented to address problems in different contexts. Figure 1 lists nine such systems
in use by physicists today. Few physicists are proficient with all of them, but
each system has advantages over the others in some application domain. For ex-
ample, for applications to rotations, quaternions are demonstrably more efficient
than the vectorial and matrix methods taught in standard physics courses. The
difference hardly matters in the world of academic exercises, but in the aerospace
industry, for instance, where rotations are bread and butter, engineers opt for
quaternions.
Fig. 1. Multiple mathematical systems contribute to the fragmentation
of knowledge, though they have a common geometric nexus.
6
sented in several different systems, but one of them is invariably better suited
than the others for a given application. For example, Goldstein’s textbook on
mechanics12 gives three different ways to represent rotations: coordinate matri-
ces, vectors and Pauli spin matrices. The costs in time and effort for translation
between these representations are considerable.
3. Deficient integration. The collection of systems in Fig. 1 is not an
integrated mathematical structure. This is especially awkward in problems that
call for the special features of two or more systems. For, example, vector algebra
and matrices are often awkwardly combined in rigid body mechanics, while Pauli
matrices are used to express equivalent relations in quantum mechanics.
4. Hidden structure. Relations among physical concepts represented in
different symbolic systems are difficult to recognize and exploit.
5. Reduced information density. The density of information about
physics is reduced by distributing it over several different symbolic systems.
Evidently elimination of these defects will make physics easier to learn and
apply. A clue as to how that might be done lies in recognizing that the various
symbolic systems derive geometric interpretations from a common coherent core
of geometric concepts. This suggests that one can create a unified mathematical
language for physics by designing it to provide an optimal representation of
geometric concepts. In fact, Hermann Grassmann recognized this possibility
and took it a long way more than 150 years ago.13 However, his program to
unify mathematics was forgotten and his mathematical ideas were dispersed,
though many of them reappeared in the several systems of Fig. 1. A century
later the program was reborn, with the harvest of a century of mathematics
and physics to enrich it. This has been the central focus of my own scientific
research.
Creating a unified mathematical language for physics is a problem in the
design of mathematical systems. Here are some general criteria that I have
applied to the design of Geometric Algebra as a solution to that problem:
1. Optimal algebraic encoding of the basic geometric concepts: magni-
tude, direction, sense (or orientation) and dimension.
2. Coordinate-free methods to formulate and solve basic equations of
physics.
3. Optimal uniformity of method across classical, quantum and relativis-
tic theories to make their common structures as explicit as possible.
4. Smooth articulation with widely used alternative systems (Fig. 1) to
facilitate access and transfer of information.
5. Optimal computational efficiency. The unified system must be at
least as efficient as any alternative system in every application.
Obviously, these design criteria ensure built-in benefits of the unified lan-
guage. In implementing the criteria I deliberately sought out the best available
mathematical ideas and conventions. I found that it was frequently necessary
to modify the mathematics to simplify and clarify the physics.
This led me to coin the dictum: Mathematics is too important to be
left to the mathematicians! I use it to flag the following guiding principle
for Modeling Theory: In the development of any scientific theory, a
7
major task for theorists is to construct a mathematical language that
optimizes expression of the key ideas and consequences of the theory.
Although existing mathematics should be consulted in this endeavor, it should
not be incorporated without critically evaluating its suitability. I might add that
the process also works in reverse. Modification of mathematics for the purposes
of science serves as a stimulus for further development of mathematics. There
are many examples of this effect in the history of physics.
Perhaps the most convincing evidence for validity of a new scientific theory
is successful prediction of a surprising new phenomenon. Similarly, the most
impressive benefits of Geometric Algebra arise from surprising new insights into
the structure of physics.
The following Sections survey the elements of Geometric Algebra and its
application to core components of the physics curriculum. Many details and
derivations are omitted, as they are available elsewhere. The emphasis is on
highlighting the unique advantages of Geometric Algebra as a unified mathe-
matical language for physics.
V. Understanding Vectors
A recent study on the use of vectors by introductory physics students summa-
rized the conclusions in two words: “vector avoidance!” 14 This state of mind
tends to propagate through the physics curriculum. In some 25 years of graduate
physics teaching, I have noted that perhaps a third of the students seem inca-
pable of reasoning with vectors as abstract elements of a linear space. Rather,
they insist on conceiving a vector as a list of numbers or coordinates. I have
come to regard this concept of vector as a kind of conceptual virus, because it
impedes development of a more general and powerful concept of vector. I call
it the coordinate virus! 15
Once the coordinate virus has been identified, it becomes evident that the
entire physics curriculum, including most of the textbooks, is infected with the
virus. From my direct experience, I estimate that two thirds of the graduate
students have serious infections, and half of those are so damaged by the virus
that they will never recover. What can be done to control this scourge? I suggest
that universal inoculation with Geometric Algebra could eventually eliminate
the coordinate virus altogether.
I maintain that the origin of the problem lies not so much in pedagogy
as in the mathematics. The fundamental geometric concept of a vector as
a directed magnitude is not adequately represented in standard mathematics.
The basic definitions of vector addition and scalar multiplication are essential
to the vector concept but not sufficient. To complete the vector concept we
need multiplication rules that enable us to compare directions and magnitudes
of different vectors.
8
A. The Geometric Product
I take the standard concept of a real vector space for granted and define the
geometric product ab for vectors a, b, c by the following rules:
9
b
a a b = = b a
a
b
From the geometric interpretations of the inner and outer products, we can
infer an interpretation of the geometric product from extreme cases. For or-
thogonal vectors, we have from (5)
a·b = 0 ⇐⇒ ab = −ba. (8)
On the other hand, collinear vectors determine a parallelogram with vanishing
area (Fig. 2), so from (6) we have
i = σ 1 σ 2 = σ 1 ∧ σ 2 = −σ 2 σ 1 (11)
The suggestive symbol i has been chosen because by squaring (11) we find that
i2 = −1 (12)
√
Thus, i is a truly geometric −1. We shall see that there are others.
From (11) we also find that
σ 2 = σ 1 i = −i σ 1 and σ 1 = iσ 2 . (13)
10
In words, multiplication by i rotates the vectors through a right angle. It follows
that i rotates every vector in the plane in the same way. More generally, it
follows that every unit bivector i satisfies (12) and determines a unique plane in
Euclidean space. Each i has two complementary geometric interpretations: It
represents a unique oriented area for the plane, and, as an operator, it represents
an oriented right angle rotation in the plane.
b = aU and a = Ub . (15)
where θ is the angle from a to b. Accordingly, with the angle dependence made
explicit, the decomposition (7) enables us to write (14) in the form
b θ
7
. a
.
Fig. 3. A pair of unit vectors a, b determine a directed arc on the
unit circle that represents their product U = ab. The length of the arc
is (radian measure of) the angle θ between the vectors.
11
7 a a
7 .
a
7
Fig. 4. All directed arcs with equivalent angles are represented by a
single rotor Uθ , just as line segments with the same length and direction
are represented by a single vector a.
7θ 7ϕ 7θ + ϕ
| z |2 = zz † = λ2 = baab = a2 b2 = | a |2 | b |2 (21)
12
z z
.
0
U
1 λ
.
0
7
1 λ
Anyone who has worked with complex numbers in applications knows that
it is usually best to avoid decomposing them into real and imaginary parts.
Likewise, in GA applications it is usually best practice to work directly with
the geometric product instead of separating it into inner and outer products.
GA gives complex numbers new powers to operate directly on vectors. For
example, from (19) and (20) we get
Thus, z rotates and rescales a to get b. This makes it possible to construct and
manipulate vectorial transformations and functions without introducing a basis
or matrices.
This is a good point to pause and note some instructive implications of
what we have established so far. Every physicist knows that complex numbers,
especially equations (17) and (18), are ideal for dealing with plane trigonometry
and 2D rotations. However, students in introductory physics are denied access to
this powerful tool, evidently because it has a reputation for being conceptually
difficult, and class time would be lost by introducing it. GA removes these
barriers to use of complex numbers by linking them to vectors and giving them
a clear geometric meaning.
GA also makes it possible to formulate and solve 2D physics problems in
terms of vectors without introducing coordinates. Conventional vector algebra
cannot do this, in part because the vector cross product is defined only in 3D.
That is the main reason why coordinate methods dominate introductory physics.
The available math tools are too weak to do otherwise. GA changes all that!
For example, most of the mechanics problems in introductory physics are 2D
problems. Coordinate-free GA solutions for the standard problems are worked
out in my mechanics book.16 Although the treatment there is for a more ad-
vanced course, it can easily be adapted to the introductory level. The essential
GA concepts for that level have already been presented in this section.
13
Will comprehensive use of GA significantly enhance student learning in in-
troductory physics? We have noted theoretical reasons for believing that it
will. To check this out in practice is a job for PER. However, mathematical
reform at the introductory level makes little sense unless it is extended to the
whole physics curriculum. The following sections provide strong justification
for doing just that. We shall see how simplifications at the introductory level
get amplified to greater simplifications and surprising insights at the advanced
level.
σ 1 σ 2 = iσ 3 , σ 2 σ 3 = iσ 1 , σ 3 σ 1 = iσ 2 . (25)
Geometric interpretations for the pseudoscalar and bivector basis elements are
depicted in Figs. 7 and 8.
σ3 σ3
σ3
σ 1σ 2 σ 3
σ2
σ 1σ 2
σ1 σ1 σ2 σ1 σ2
14
σ3
σ2 σ 3
σ3 σ 1
σ2
σ1 σ 2
σ1
Fig. 8. Unit bivectors representing a basis of directed areas in planes
with orthogonal intersections
ab = a · b + i a × b . (29)
This is the definitive relation among vector products that we need for smooth ar-
ticulation between geometric algebra and standard vector algebra, as is demon-
strated with many examples in my mechanics book.16
The elements in any geometric algebra are called multivectors. The special
properties of i enable us to write any multivector M in G3 in the expanded form
M = α + a + ib + iβ, (30)
where α and β are scalars and a and b are vectors. The main value of this form
is that it reduces multiplication of multivectors in G3 to multiplication of vec-
tors given by (29). Note that the four terms in (30) are linearly independent, so
scalar, vector, bivector and pseudoscalar parts combine separately under multi-
vector addition, though they are mixed by multiplication. Thus, the Geometric
Algebra G3 is a linear space of dimension 1 + 3 + 3 + 1 = 23 = 8.
The expansion (30) has the formal algebraic structure of a “complex scalar”
α + iβ added to a “complex vector” a + ib, but any physical interpretation
15
attributed to this structure hinges on the geometric meaning of i. The most
important example is the expression of the electromagnetic field F in terms of
an electric vector field E and a magnetic vector field B:
F = E + iB . (31)
i† = −i. (33)
16
At least it is a design flaw in standard vector algebra that has been almost
universally overlooked. As we have just seen, elimination of the flaw enables us
to combine electric and magnetic fields into a single electromagnetic field. And
we shall see below how it enables us to construct spinors from vectors (contrary
to the received wisdom that spinors are more basic than vectors)!
U x = ±U x U † , (37)
where U is a unimodular multivector called a versor, and the sign is the parity
of U , positive for a rotation or negative for a reflection. The condition
U † U = 1. (38)
To show how this function works, the vector x has been decomposed on the
right into a parallel component x = (x · a)a that commutes with a and an
orthogonal component x⊥ = (x ∧ a)a that anticommutes with a. As can be
seen below, it is seldom necessary or even advisable to make this decomposition
in applications. The essential point is that the normal vector defining the direc-
tion of a plane also represents a reflection in the plane when interpreted as a
versor. A simpler representation for reflections is inconceivable, so it must be
the optimal representation for reflections in every application, as shown in some
important applications below. Incidentally, the term versor was coined in the
19th century for an operator that can re-verse a direction. Likewise, the term is
used here to indicate a geometric operational interpretation for a multivector.
The reflection (39) is not only the simplest example of an orthogonal trans-
formation, but all orthogonal transformations can be generated by reflections of
this kind. The main result is expressed by the following theorem: The product
of two reflections is a rotation through twice the angle between the normals of
17
a
x
x
x
-x
x'
the reflecting planes. This important theorem seldom appears in standard text-
books, primarily, I presume, because its expression in conventional formalism is
so awkward as to render it impractical. However, it is an easy consequence of
a second reflection applied to (39). Thus, for a plane with unit normal b, we
have
x = −b x b = baxab = U x U † , (40)
where a new symbol has been introduced for the versor product U = ba. The
theorem is obvious from the geometric construction in Fig. 10. For an algebraic
proof that the result does not depend on the reflecting planes, we use (17) to
write
1
U = ba = cos 12 θ + i sin 12 θ = e 2 i θ , (41)
where, anticipating the result from Fig. 9, we denote the angle between a and b
by 12 θ and the unit bivector for the b ∧ a-plane by i. Next, we decompose x into
a component x⊥ orthogonal to the i-plane and a component x in the plane.
Note that, respectively, the two components commute (anticommute) with i, so
x⊥ U † = U † x⊥ , x U † = U x . (42)
Inserting this into (40) with x = x + x⊥ , we obtain
x = U x U † = x⊥ + U 2 x . (43)
These equations show how the two-sided multiplication by the versor U picks
out the component of x to be rotated, so we see that one-sided multiplication
works only in 2D. As we learned from our discussion of 2D rotations, the versor
U 2 = ei θ rotates x⊥ through angle θ, in agreement with the half-angle choice
in (41).
The great advantage of the canonical form (37) for an orthogonal trans-
formation is that it reduces the composition of orthogonal transformations to
versor multiplication. Thus, composition expressed by the operator equation
U2 U1 = U3 (44)
is reduced to the product of corresponding versors
U2 U1 = U3 . (45)
18
x''
1
2 θ a
b θ
x
x'
U2
b c
U1
U3
a
19
In 3D a rotor is depicted as a directed arc confined to a great circle on the
unit sphere. The product of rotors U1 and U2 is depicted in Fig. 11 by connecting
the corresponding arcs at a point c where the two great circles intersect. This
determines points a = cU1 and b = U2 c, so the rotors can be expressed as
products with a common factor,
with the corresponding arc for U3 depicted in Fig. 11. It should not be for-
gotten that the arcs in Fig. 11 depict half-angles of the rotations. The non-
commutativity of rotations is illustrated in Fig. 12, which depicts the construc-
tion of arcs for both U1 U2 and U2 U1 .
U1 U1 U2
U2 U2
U1
U2U1
Those of you who are familiar with quaternions will have recognized that
they are algebraically equivalent to rotors, so we might as well regard the two
as one and the same. Advantages of the quaternion theory of rotations have
been known for the better part of two centuries, but to this day only a small
number of specialists have been able to exploit them. Geometric algebra makes
them available to everyone by embedding quaternions in a more comprehen-
sive mathematical system. More than that, GA makes a number of significant
improvements in quaternion theory — the most important being the integration
of reflections with rotations as described above. To make this point more ex-
plicit and emphatic, I describe two important practical applications where the
generation of rotations by reflections is essential.
Multiple reflections. Consider a light wave (or ray) initially propagating
with direction k and reflecting off a sequence of plane surfaces with unit normals
20
a1 , a2 , . . . , an (Fig. 12). By multiple applications of (39) we find that it emerges
with direction
an a2
k a1
'
k
c
x4 x3
x2
x5 π/
π /3 6
b
x6 x1= a
21
Fig. 15. Symmetry vectors for the methane molecule.
b' 2
c' c 2
a' 3
π/
2
π/
3
π/4
a b 2
Fig. 16. Symmetry vectors for crystals with cubic symmetry. Vectors
a, b, c generate reflection symmetries. Vectors a , b , c specify axes of
rotation symmetries.
22
Remarkably, every point symmetry group is generated multiplicatively by
some combination of three unit vectors a, b, c satisfying the versor conditions
(ab)p = (bc)q = (ca)r = −1, (49)
where {p, q, r} is a set of three integers that characterize the symmetry group.
These conditions describe p-fold, q-fold and r-fold rotation symmetries. For
example, the set is {6, 2, 2} for the planar benzene molecule (Fig. 14), and
(ab)6 = −1 represents a sixfold rotation that brings all atoms back to their
original positions.
The methane molecule (Fig. 15) has tetrahedral symmetry characterized by
{3, 3, 3}, which specifies the 3-fold symmetry of the tetrahedron faces. This
particular symmetry cannot be extended to a space-filling crystal, for which it
can be shown that at least one of the symmetries must be 2-fold.
There are precisely 32 crystallographic point groups distinguished by a small
set of allowed values of p and q in {p, q, 2}. For example, generating vectors
for the case {4, 3, 2} of crystals with cubic symmetry are shown in Fig. 16. A
complete analysis of all the point groups is given elsewhere,17 along with an
extension of GA techniques to handle the 230 space groups.
The point symmetry groups of molecules and crystals are increasing in im-
portance as we enter the age of nanoscience and molecular biology. Yet the
topic remains relegated to specialized courses, no doubt because the standard
treatment is so specialized. However, we have just seen that the GA approach
to reflections and rotations brings with it an easy treatment of the point groups
at no extra cost.
This is a good place to summarize with a list of the advantages of the GA
approach to rotations, including some to be explained in subsequent Sections:
1. Coordinate-free formulation and computation.
2. Simple algebraic composition.
3. Geometric depiction of rotors as directed arcs.
4. Rotor products depicted as addition of directed arcs.
5. Integration of rotations and reflections in a single method.
6. Efficient parameterizations (see ref.16 for details).
7. Smooth articulation with matrix methods.
8. Rotational kinematics without matrices.
Moreover, the approach generalizes directly to Lorentz transformations, as will
be demonstrated in a subsequent paper.
23
These two sets of equations can be solved for the matrix elements as a function
of U , with the result
αkj = ek · σ j = U σ k U † σ j , (52)
where ... means scalar part. Alternatively, they can be solved for the rotor as
a function of the frames or the matrix.16 One simply forms the quaternion
ψ = 1 + ek σ k = 1 + αkj σ j σ k (53)
This makes it easy to move back and forth between matrix and rotor representa-
tions of a rotation. We have already seen that the rotor is much to be preferred
for both algebraic computation and geometric interpretation.
Let the frame {ek } represent a set of directions fixed in a rigid body, perhaps
aligned with the principal axes of the inertia tensor. For a moving body the ek =
ek (t) are functions of time, and (50) reduces the description of the rotational
motion to a time dependent rotor U = U (t). By differentiating the constraint
U U † = 1, it is easy to show that the derivative of U can be put in the form
dU
= 12 ΩU, (55)
dt
where
Ω = −i ω (56)
dek
= ω × ek (57)
dt
employed in the standard vectorial treatment of rigid body kinematics.
The point of all this is that GA reduces the set of three vectorial equations
(57) to the single rotor equation (55), which is easier to solve and analyze for
given Ω = Ω(t). Specific solutions for problems in rigid body mechanics are
discussed elsewhere.16 However, the main reason for introducing the classical
rotor equation of motion in this lecture is to show its equivalence to equations
in quantum mechanics given below.
24
D. Maxwell’s Equation
We have seen how electric and magnetic field vectors can be combined into a
single multivector field
F (x, t) = E(x, t) + iB(x, t) (58)
representing the complete electromagnetic field. Standard vector algebra forces
one to consider electric and magnetic parts separately, and it requires four field
equations to describe their coordinated action. GA enables us to put Humpty
Dumpty together and describe the complete electromagnetic field by a single
equation. But first we need to learn how to differentiate with respect to the
position vector x.
We can define the derivative ∇ = ∂x with respect to the vector x most
quickly by appealing to your familiarity with the standard concepts of divergence
and curl. Then, since ∇ must be a vector operator, we can use (29) to define
the vector derivative by
∇E = ∇ · E + i ∇ × E = ∇ · E + ∇ ∧ E . (59)
This shows the divergence and curl as components of a single vector derivative.
Both components are needed to determine the field. For example, for the field
due to a static charge density ρ = ρ(x), the field equation is
∇E = ρ. (60)
The advantage of this form over the usual separate equations for divergence and
curl is that ∇ can be inverted to solve for
E = ∇−1 ρ. (61)
25
An electromagnetic field F = F (x, t) with charge density ρ = ρ(x, t) and
charge current J = J(x, t) as sources is determined by Maxwell’s Equation
1 1
∂t + ∇ F = ρ − J . (63)
c c
To show that this is equivalent to the standard set of four equations, we employ
(58), (59) and (30) to separate, respectively, its scalar, vector, bivector, and
pseudoscalar parts:
∇ · E = ρ, (64)
1 1
∂t E − ∇ × B = − J, (65)
c c
1
i ∂t B + ∇ × E = 0, (66)
c
i∇ · B = 0 (67)
Here we see the standard set of Maxwell’s equations as four geometrically dis-
tinct parts of one equation. Note that this separation into several parts is similar
to separately equating real and imaginary parts in an equation for complex vari-
ables.
Of course, it is preferable to solve and analyze Maxwell’s Equation without
decomposing it into parts. This is especially true for propagating wave solutions.
For example, monochromatic plane waves in a vacuum have solutions with the
familiar form
F2 = f2 = 0 (69)
is required here, the i is the unit pseudoscalar and the two signs in (68) represent
the two states of circular polarization. A more detailed GA treatment of plane
waves was published in this journal three decades ago.20
26
function has stimulated a veritable orgy of philosophical speculation about the
nature of matter and our knowledge of it. Curiously, virtually all philosophizing
about the interpretation of quantum mechanics has been based on Schroedinger
theory, despite the fact that electrons, like all other fermions, are known to
have intrinsic spin. We shall see that that is a serious mistake, for it is only in
a theory with electron spin that one can see why the wave function is complex.
You may wonder why this fact is not common knowledge.
The reason is that the geometric meaning of the wave function lies buried
in the standard matrix version of the Pauli theory. We shall exhume it by
translating the matrix √ wave function Ψ into a real spinor ψ in GA where, as
we
√ have seen, every −1 has a geometric meaning. We discover then that the
−1 in Schroedinger theory emerges in the real Pauli version as a bivector that
is related to spin in an essential way. In other words, we see that geometry
dictates that spin is not a mere add-on in quantum mechanics, but an essential
feature of fermion wave functions.
By reformulating quantum mechanics in terms of real spinors, we establish
GA as a common mathematical language for quantum mechanics and classical
mechanics as formulated in preceding Sections. This simplifies and clarifies the
relation of classical theory to quantum theory. In particular, we find that the
spinor wave function operates as a rotor in essentially the same way as rotors in
classical mechanics. This suggests that the bilinear dependence of observables
on the wave function is not unique to quantum mechanics — it is equally natural
in classical mechanics for geometrical reasons.
Though the relation of spin to the unit imaginary was first discovered in the
Dirac theory,21, 22 it is easiest to see in the Pauli theory. The resulting real spinor
wave equation leads to the surprising conclusion that spin was inadvertently
incorporated
√ into the original Schroedinger equation in the guise of the distinctive
factor −1h̄. Extension to the Dirac theory will be covered in a sequel to this
paper.
27
is explained below. For the moment, we note the potential for this change in
perspective to bring classical mechanics and quantum mechanics closer together.
Texts on quantum mechanics describe the three σ i as a vector σ with ma-
trices for components. They combine σ with an ordinary vector a with scalar
components ai by writing
σ · a = ai σ i . (71)
σ 3 Ψ± = ±Ψ± . (73)
28
coefficient can be derived by mimicking Dirac’s argument in the nonrelativistic
domain, so spin is not a “relativistic phenomenon” after all. The trick is to
define a momentum operator
e
σ · p = σ · (−i h̄∇ − A) (76)
c
and assume a hamiltonian of the form
1
HP = (σ · p )2 + V (x) . (77)
2m
When this is expanded with the help of the identity (72), it gives the result
in (75). I do not know who originated this argument, but I learned it from
Feynman, who gave me the impression that he had devised it himself. As we
see below, the definition (76) of the momentum operator is less mysterious in
GA, where it is apparent that the appearance of σ in it has nothing to do with
spin. Indeed, the “trick” (76) is justified by GA, where σ i appear naturally as
basis for the vector A and the vector derivative ∇.
Then(74) gives us
σ 1 σ 2 u = i u. (79)
Ψ = ψu, (80)
29
Furthermore, the polynomial can be taken as an even multivector, for if any
term is odd, then (78) allows us to make it even by multiplying it on the right
by σ 3 . Therefore, we can assume with complete generality that ψ in (80) is a
real even multivector. Now we can reinterpret the σ k in ψ as vectors in GA
instead of matrices. Thus, we have established a one-to-one correspondence
between Pauli spinors Ψ and even multivectors ψ in GA.
Since the ungeometrical imaginary i has been thereby eliminated, I refer to
ψ as a real spinor (or quaternion). Note that ψ has four degrees of freedom, one
for its scalar part and three for its bivector part, as does Ψ in its two complex
matrix components. The big gain in replacing Ψ by ψ is that the latter has a
more transparent geometric interpretation, as shown below.
Now, it is easy to extract a real wave equation for the real spinor wave
function from the matrix equation (75). Thus we arrive at the real Pauli-
Schroedinger (PS) equation
eh̄
∂t ψ iσ 3 h̄ = HS ψ − B ψσ 3 , (81)
2mc
where i is now the unit pseudoscalar so iσ 3 = σ 1 σ 2 , and σ · B is replaced by
B for reasons explained above. I have inserted Schroedinger’s name here to
emphasize the fact (explained below) that in this real theory the Schroedinger
wave function is the same real spinor, and (81) reduces to the Schroedinger
equation by dropping the last term. In the same way, equations (76) and (77)
can be re-expressed in the real theory, but we will not have need of them below.
The equivalence of (81) to the matrix equation (75) is easily proved by
interpreting (81) as a matrix equation, multiplying on the right by the basis
spinor u and using (78), (79) and (80). Note that the σ 3 on the right side of
(81) is essential to make the last term have even parity, for the vector B is odd
while ψ is an even multivector. Thus all terms in (81) are even.
It should be noted that the condition imposed in the Pauli theory by as-
suming that the Pauli spin matrices are hermitian is equivalent to the definition
(34) of reversion in GA. Thus we see that the standard association of “spin
observables” with hermitian operators amounts to declaring that they represent
vectors. Evidently, this is an assumption about geometry rather than spin —
a critical geometric assumption that has been inadvertently incorporated into
quantum mechanics. Spin must get into the theory some other way. This raises
serious questions about the physical significance of the common association of
physical observables with hermitian operators.
The explicit σ 3 in equation (81) may make it look less general than (75),
but that is an illusion, because σ 3 is an arbitrarily chosen constant vector. It
can be related to any other choice σ 3 by a constant rotation
σ 3 = Cσ k C † , (82)
(ψσ 3 )C † = ψC † Cσ 3 C † = ψ σ 3 , (83)
30
where ψ = ψC † is the wave function relative to the alternative quantization
axis σ 3 . The matrix analog of this transformation is a change in matrix repre-
sentation for the column spinor Ψ.
ψσ k ψ † = ρ ek , (85)
ek = U σ k U † . (86)
These observables are invariant under the change in choice of standard frame
ψσ k ψ † = ψ σ k ψ † defined by (82). Thus, the wave function determines an
invariant time-dependent field of frames {ek (x, t)} attached to each position x.
We show below that
s = 12 h̄ e3 = 12 h̄ U σ 3 U † (87)
i ≡ iσ 3 = σ 1 σ 2 (89)
√
plays the role of −1 in the real PS equation (81).
The hidden relation of spin to the imaginary i in the matrix theory can be
made more manifest by regarding S = Sij σ j σ i as a spin operator on the wave
function. Multiplying (88) on the right by (84) we get
h̄
S ψ = ψi . (90)
2
Regarding this as a matrix equation, we use (80) and (79) to get
S Ψ = 12 i h̄ Ψ. (91)
31
Thus, 12 i h̄ is the eigenvalue of the “spin operator” S. Otherwise said, the factor
i h̄ in the Pauli matrix equation (75) is a representation of the spin bivector by
its eigenvalue. The eigenvalue is imaginary because the spin tensor Sij = −Sji
is skewsymmetric. We can conclude, therefore, that spin was originally intro-
duced into quantum mechanics with the factor i h̄ in the original Schroedinger
equation.
Spin components sk = s · σ 3 = sσ 3 are related to conventional matrix
elements by
h̄ h̄ h̄
ρsk = σ k ψσ 3 ψ † = Tr (σ k ΨΨ† ) = Ψ† σ k Ψ, (92)
2 2 2
where the scalar part M of a real multivector M corresponds to the Trace
Tr (M ) of its matrix representation.
In conventional Pauli theory,31 the σ k are regarded as “spin operators” with
eigenvalues corresponding to results of spin measurements in orthogonal direc-
tions. However, (92) shows that in Real PS Theory the σ k are operators only
in the trivial sense of basis vectors that pick out components of the spin vector
determined by the wave function. Conventional theory also interprets the non-
vanishing commutator [σ 1 , σ 2 ] = σ 1 σ 2 − σ 2 σ 1 as a measure of incompatibility
in spin measurements. Whereas, according to (11) and (25), the real theory
interprets
[σ 1 , σ 2 ] = 2σ 1 ∧ σ 2 = 2iσ 3 (93)
B s = B · s + i(B × s) (95)
32
where ES is the Schroedinger energy. For a stationary solution with B × s = 0,
s must be parallel or antiparallel to B and we have
h̄
B · s = ± | B |. (98)
2
This is the basis for declaring that spin is “two-valued.” However, when B is
variable the vectorial nature of s becomes apparent.
We can generalize (96) to arbitrary states by interpreting
as energy density. Here the energy E = E(x, t) can be a variable function, and
we see that an energy eigenstate is defined by the assumption that E is uniform.
Inserting (84) into (99), we discover that the density ρ drops out to give us
∂t U = − 12 iωU, (101)
∂t ek = ω × ek , (102)
in exact correspondence with the classical equations (55) and (57). In particular,
(102) gives us the kinematic equation for spin precession
∂t s = ω × s. (103)
33
at some length before,24 and I will return to it in a subsequent paper within the
context of the Dirac theory.
When the magnetic field B = B(t) is a function of time alone, we can define
a rotor D by assuming that it satisfies the equation
dD e
= 12 iB D. (105)
dt mc
and factoring the wave function into
Substituting this into the PS-equation (81) separates the factors so that D
satisfies (105) and ψS satisfies Schroedinger’s equation
∂t ψS ih̄ = HS ψS . (107)
The rotor equation (105) exhibits a magnetic torque (e/m)B in perfect agree-
ment with the classical model of magnetic resonance discussed in my book.16
This exact analogy with classical physics is a great help in interpreting magnetic
resonance experiments, and it raises more questions about the interpretation of
electron spin.
Note that the factor 12 on the right side of (105) is the same factor that,
following Pauli, was attributed to spin in (87) and (98). However, (105) and
(101) suggest that the 12 is more correctly associated with the rotational velocity
in a rotor equation.
Although our real version of the Pauli theory gives new insight into the
geometric properties of the wave function, spin and interaction with the mag-
netic field, it is mathematically isomorphic with the standard matrix version,
so no new physical consequences are to be expected, and it is straightforward
to translate results from one version to the other. More details about the real
PS theory have been published in this journal before.24
Standard techniques for analyzing angular momentum and solving the Pauli
equation31 can be converted to GA,25 where they yield some surprises and
simplifications that deserve further study.
Without delving into the complexities of angular momentum analysis, it is
obvious that the standard Schroedinger wave function is a solution the Schroe-
dinger equation (107), but with the bivector i = iσ 3 as unit imaginary, so there
is no way to eliminate spin from the theory without eliminating complex num-
bers. It must be concluded, therefore, that standard Schroedinger theory does
not describe electrons without spin, but rather electrons with constant spin (or,
equivalently, electrons in a spin eigenstate).
The difference between Pauli and Schroedinger solutions of (107) is in the
class of eigenfunctions allowed. The simple Schroedinger hamiltonian HS does
not discriminate between them. A distinction is forced when the hamiltonian is
generalized to include spin-orbit interactions,31 but this is not the place to go
into details.
34
This discussion has been limited to single particle Pauli theory. But in
closing, it should be mentioned that GA methods for spin representations of
many particle systems is currently an area of active research,32, 38 where GA
gives fresh insights into entangled states, quantum computing and the like.
To summarize, let me highlight the most provocative conclusions from this
section:
√
• The explicit −1 in fermion wave functions represents a bivector specifying
the direction of spin.
• ih̄ = iσ 3 h̄ represents spin in Schroedinger’s equation. This implies that
spin is not a simple add-on in quantum mechanics but an essential ingredient
of the theory. That is likely to be true for all fermions and bosons that are
composites of fermions.
• Pauli matrices represent vectors, not spin operators in quantum mechanics.
• Bilinear observables are geometric consequences of rotational kinematics,
so they are as natural in classical mechanics as in quantum mechanics.
• The real spinor wave function is easier to interpret and solve than the
matrix version.
35
A. History and reception of geometric algebra
Geometric algebra originated in the work of Hermann Grassmann (1809–1877)
and William Kingdon Clifford (1845–1879). Clifford provided the axiomatic
formulation for the geometric product in Section VA, so the system is called
Clifford algebra in most of the mathematics literature. However, Clifford him-
self called it geometric algebra, and he acknowledged the primary contribution
of Grassmann. He recognized its unique significance as an algebraic represen-
tation of geometric concepts as conceived by Grassmann. Unfortunately, his
deep insight into the geometric foundations of algebra did not survive his early
death. Mathematicians abstracted Clifford algebra from its geometric origins,
and, for the better part of a century, it languished as a minor subdiscipline of
mathematics — one more algebra among so many others.
Grassmann’s program to develop a universal geometric algebra for mathe-
matics and physics was revived in 1966 with my book Space Time Algebra.21
Actually, it was many years later before I realized how much of my own research
program for developing a universal mathematical language had been anticipated
by Grassmann.13 Of course, I had an additional century of mathematics and
physics to draw on, so geometric algebra today is vastly more developed with a
scope that was inconceivable in Grassmann’s day.
My 1966 book introduced the critical innovations that made the unified
language in this paper possible. It developed a coordinate-free formulation of
spacetime physics that will be discussed in the sequel to this paper. After
employing it exclusively in my research and teaching over the next few years,
my own perspective on GA in physics was consolidated around the essential
form and content in this lecture. I found GA so empowering that ever since I
have not been satisfied with my understanding of any subject in physics until I
have formulated it in terms of GA. Invariably I have found that GA simplifies
and illuminates every subject.
My 1966 book had a serious limitation. Though it simplified the funda-
mental equations of physics, applications required access to the great repertoire
of available mathematical methods, and translation into standard formulations
often destroyed the advantages of geometric algebra. This motivated me to
extend geometric algebra to a unified language for mathematics that incorpo-
rated all the methods needed in physics. Besides, I must confess to an abiding
interest in pure mathematics for its own sake. I consider myself equal parts
mathematician and physicist. This ambitious program to redesign mathematics
was greatly stimulated by an amazing early success: the realization that GA
enables a natural generalization and extension to arbitrary dimension of the
central result of complex variable theory, Cauchy’s integral formula.18 This re-
sult has been independently discovered by several others, and mathematicians
have developed it in recent years into a vigorous new branch of mathematics
called Clifford analysis. Unfortunately, most practitioners in this field pay insuf-
ficient attention to the geometric meaning of their results that is needed for easy
application to physics. Consequently, the evolution of geometric algebra into a
comprehensive Geometric Calculus to serve as a unified language for both math-
36
ematics and physics has so far proceeded independently of Clifford analysis.29
The development of Geometric Calculus culminated in the 1986 publication of
a comprehensive mathematics monograph19 and has continued to this day. To
my knowledge, no other mathematical system integrates such a broad range of
mathematics, just about all the mathematics needed for physics.
For the first 20 years of my professional life my research on developing and
applying geometric algebra proceeded in almost complete isolation, except for
interaction with a few students. During that period I published regularly, but
there were only a handful of citations to my papers and no positive feedback from
colleagues. I learned to be very careful about when, where and how I presented
my work to other physicists, because the reaction was invariably dismissive as
soon as they detected deviation from standard practice or beliefs.27
I did not even try to publish in Physical Review. Instead, I conducted
a sociological experiment by submitting proposals to the NSF for support of
my work on relativistic electron theory. After the first couple of tries I was
convinced that it would never get funded, but I continued to submit 12 times in
all as an experiment on how the system works. I found that there was always
a split opinion on my proposal that typically fell into three groups. About
one third dismissed me outright as a crank. About one third was intrigued
and sometimes gave my proposal an Excellent rating. The other third was
noncommittal, mainly because they were not sure they understood what I was
talking about. The fate of the proposal was always decided by averaging the
scores from the reviewers, despite the fact that the justifications for different
scores were blatantly contradictory. Such is the logic of the funding process.
I thought that my last proposal was particularly strong. But one reviewer
torpedoed it with an exceptionally long and impressively documented negative
opinion that overwhelmed the positive reviews. He was factually and logically
wrong, however, and I could prove it. So I decided to appeal the decision. I
wanted to challenge the NSF to resolve serious contradictions among reviews
before making a funding decision. I learned that the first two stages of the
appeal process were entirely procedural without considering merits of the case.
In the third stage, the appeal had to be submitted through the Office of Research
at my university. The functionaries in the Office worried that the appeal would
irritate the NSF, so it was delayed until the deadline passed. Such is the politics
of grant proposals.
The prospects for dissemination of GA were suddenly changed about 1986
by two new developments. The first was an “International Conference on Clif-
ford Algebras and their Applications in Mathematical Physics” conceived and
organized by Roy Chisholm.30 I was invited as keynote speaker on the strength
of my STA book. There I learned that my book had impressed a lot of people,
though they did not see how to use it in their own work. I was fortunate to
have it published in a series with many distinguished authors that was affordably
priced and widely marketed. Chisholm’s conference precipitated the formation
of an international community of researchers in mathematics and physics, and
the sixth in the series of international conferences that he started was held in
May 2002.
37
The second new development to enhance dissemination of GA was publi-
cation of my mechanics and mathematics books.16, 19 Early in my career, I
naively thought that if you give a good idea to competent mathematicians or
physicists, they will work out its implications for themselves. I have learned
since that most of them need the implications spelled out in utter detail. For
example, I published the ideas in Section V in a 1971 AJP article20 with the
expectation that others would find them as compelling as I do. But there was
no noticeable reaction until the implications for mechanics were fully worked
out in my book.16 To be fair, the implications turned out to be a lot richer than
I had anticipated. I learned that only when I taught the graduate mechanics
course where I was forced to work out the details.
I measure the impact of my work, not by words of praise or approval, but
by evidence of influence in the published work of others, not including papers
where GA has been incompetently used. I regard the impact as strong when GA
is employed in essentially the way that I would do it, and exceptional when GA
is used in a way that surprises me. By those criteria, my work had little impact
before the late 1980s, but its impact has accelerated during the 1990s as more
and more able researchers adopt GA as their mathematical language of choice.
During the last decade GA has started spreading to engineering and computer
science with applications to robotics, computer vision, space flight navigation
and control, collision detection, optical design and much more. In the last year
alone, several volumes of applications have been published.33–35 This lively use
of GA in the research domain has convinced me at last that it is time to press
for GA in the physics curriculum.
38
Many of the theoretically inclined students in my courses wanted to pursue
doctoral research with me. Unfortunately, it takes something like three years to
develop sufficient proficiency with GA to use it in research (unless one has an
exceptionally strong background in mathematical physics). Since GA was not
used in any courses besides mine, the students had to get the rest by independent
study. This meant an exceptionally long induction time before starting research.
Besides, I was acutely aware of the reception they were likely to get for a GA
based dissertation. Consequently, I took on only a few of the most dedicated
students. Then when I took my turn teaching introductory physics the supply
of doctoral students dried up completely.
I have not ventured to teach GA in undergraduate courses. However, one
young professor at a prestigious liberal arts college adopted my mechanics book
for an upper division mechanics course and published two pedagogical papers on
GA in the AJP36 that editor Bob Romer put on the list of his favorite papers
in 1993. His reward was denial of tenure. This is not an unfamiliar fate for
untenured faculty who attempt teaching innovation of any kind.
There is really no point in introducing GA into the core undergraduate
curriculum unless it is employed in a sufficient number of courses to develop
proficiency in the language. That requires collaboration of several faculty mem-
bers, at least, and the blessing of the rest of the physics department — a near
impossibility in most departments. However, we have already noted two good
reasons why departments need not fear that introducing GA in the undergrad-
uate curriculum will impede students in subsequent graduate studies where GA
is not used. First, after a working proficiency with GA has been acquired, it
is easy to translate to other formalisms. Second, such translation is a powerful
stimulant to understanding.
The usual mechanism for curriculum reform through textbooks is as slow as
it is perilous for the author. Any GA-based textbook is unlikely to be taken
on by a major textbook publisher, no matter how superb it might be, because
the prevailing demand is near zero. For that reason, I was forced to publish my
mechanics textbook in a series for advanced monographs in physics. Since few
such monographs are purchased by anyone but libraries, prices are necessarily
high for the publisher to break even on most books, too high for the competitive
textbook market. Moreover, the advertising does not reach the textbook market.
Though my book has been a continual best seller in the series for well over a
decade, it is still unknown to most teachers of mechanics in the U.S. To be
suitable for the series, I had to design it as a multipurpose book, including a
general introduction to GA and material of interest to researchers, as well as
problem sets for students. It is not what I would have written to be a mechanics
textbook alone. Most students need judicious guidance by the instructor to get
through it.
Despite the perils, there are other authors writing GA based textbooks.37, 38
But history tells us that incremental change is the most we can expect from
textbooks, while revolutionary change will be needed to give GA a powerful
presence in the curriculum within, say, ten years. That will require a new
mechanism for curriculum change. I suggest that PER can play that role with a
39
strong program of scholarly research and consensus building. I suggest further
that GA is only one among many topics that need to be addressed. The physics
curriculum has many other deficiencies that should be subjected to scholarly
scrutiny.
X. Challenge
Let me close with a challenge to PER and the physics community to critically
examine the following claims supported by the argument in this paper:
• GA provides a unified language for the whole of physics that is conceptu-
ally and computationally superior to alternative mathematical systems in every
application domain.
• GA can enhance student understanding and accelerate student learning of
physics.
40
• GA is ready to incorporate into the physics curriculum.
• GA provides new insight into the structure and interpretation of quantum
mechanics as well as its relation to quantum mechanics.
• Research on the design and use of mathematical tools is equally important
for instruction and for theoretical physics.
• Reforming the mathematical language of physics is the single most essen-
tial step toward simplifying physics education at all levels from high school to
graduate school.
Note. Most of my papers listed in the references are available on line.
PER papers can be accessed from <http://modeling.asu.edu>. GA papers can
be accessed from <http://modelingnts.la.asu.edu>. Many fine papers on GA
applications in physics and engineering are available at the Cambridge website
<http://www.mrao.cam.ac.uk/˜clifford/>.
References
[1] J. Piaget, To Understand Is To Invent (Grossman, New York, 1973). pp.
15–20.
[2] D. Hestenes, “Modeling Games in the Newtonian World,” Am. J. Phys. 60:
732–748 (1992).
[3] M. Wells, D. Hestenes, and G. Swackhamer, “A Modeling Method for High
School Physics Instruction,” Am. J. Phys. 63: 606-619 (1995).
[4] K. Ericsson & J. Smith (Eds.), Toward a general theory of expertise:
prospects and limits (Cambridge Univ. Press, Cambridge, 1991).
41
[11] A. Einstein, Ideas and Opinions (Three Rivers Press, New York, 1985). p.
274.
[18] D. Hestenes, “Multivector Functions,” J. Math. Anal. and Appl. 24, 467–
473 (1968).
42
[27] D. Hestenes, “Clifford Algebra and the Interpretation of Quantum Me-
chanics.” In J.S.R. Chisholm & A. K. Common (eds.), Clifford Algebras
and their Applications in Mathematical Physics, (Reidel Publ. Co., Dor-
drecht/Boston, 1986), pp. 321–346.
[28] F. Dyson, From Eros to Gaia (Pantheon books, New York, 1992). Chap.
14.
[30] In J. Chisholm & A. Common (eds.), Clifford Algebras and their Applica-
tions in Mathematical Physics (Reidel Publ. Co., Dordrecht/Boston, 1986).
[33] R. Ablamowicz & B. Fauser (Eds.), Clifford Algebras and their Applications
in Mathematical Physics, Vol. 1 & 2 (Birkhäuser, Boston, 2000).
[34] E. Bayro Corrochano & G. Sobczyk (Eds.), Geometric Algebra with Appli-
cations in Science and Engineering (Birkhäuser, Boston 2001).
43