Dorobantu - The Postulates of Quantum Mechanics - Arxiv Physics 062145

The postulates of Quantum Mechanics*
V. Dorobantu,

Physics Department, Politehnica University, Timisoara, Romania

Abstract
As a starting point in understanding Quantum Mechanics, the postulates of Quantum
Mechanics are presented, as well as a few of the main eigenvalue problems.

Introduction.
Quantum Mechanics is an axiomatic theory because it is well-grounded on a few
principles (from the Latin principium), or axioms (from the Greek axios), or postulates
(from the Latin postulatum), all of these words meaning the same thing: a truth which
doesn't need any further proof, because it is obvious by itself.
There is no consensus on how many axioms one needs to describe the machinery of
Quantum Mechanics, but I think that five is an appropriate number. The first four
postulates, as we shall see, make up the mathematical background of Quantum
Mechanics, and the fifth supplies the connection between the mathematics introduced by
the first four and the results of a measurement process.
As a general rule, the statement of each postulate will be followed by comments, so that
the significance of the words within the postulates will be explained at the right time,
namely, when they are introduced.

1. The first postulate of Quantum Mechanics

To every state of a physical system there is a function $\Psi$ ascribed to and defining the
state.

First of all: "there is a function ascribed to" doesn't mean that there is a one-to-one
correspondence function ↔ state. As we will see, a state may have more than one function
if certain conditions are fulfilled.
"To define means to limit", according to the writer Oscar Wilde, so our
definitions will be closer to the usual vocabulary in order to make the book more
accessible, but it doesn't mean that any accuracy is lost.
A physical system is (from the quantum point of view) a free particle, a particle moving
in some potential, a hydrogen atom, a hydrogen molecule, or an atom (or molecule) of
whatever kind, or many particles of the same kind or different. As we can see, by a
physical system we can understand a finite region of space having certain characteristics
which make that region different from others.

* Published in Quantum Computability, Vol.1, Quantum Mechanics, Politehnica University Press,
Timisoara, Dec.2005
If there are certain physical quantities, or parameters, which at least in principle can be
measured, and they remain constant for a finite time interval, then we can speak about the
state of the physical system. It should be noted that there is a difference between a state
from the classical point of view and the quantum point of view:
a) let us say we have a thermodynamic state of an ideal gas, meaning that we know (for
sure) the pressure, the temperature and the volume. It is understood that we have to do
with a maximum specification when we speak of state;
b) things are different from the quantum point of view when we speak of state, because
the verb to know does not have the meaning of we know for sure, but, as we shall see,
we know with certain probabilities.
To make a connection, we will speak about a state with maximum specification only
when that state is a pure state, and of less than maximum specification when we deal with a
mixed (or an impure) state. What we mean by pure and mixed states we will find
out a little bit later.
The function $\Psi$, depending on the usual space coordinates (x, y, z) and time t, is called a
wave-function. When we speak about the state of a physical system, $\Psi$ depends on the
space coordinates only, and when we speak about evolution, the time dependence of
$\Psi$ must be taken into account. Generally, the wave-function is a complex function, which
means that we also have to deal with its complex conjugate $\Psi^*(x, y, z, t)$, obtained by
changing the imaginary number i into -i.

The significance of the wave-function
Max Born has the merit of clarifying the meaning of the wave-function as being the
probability amplitude. Let us consider the simplest atom, consisting of one proton and
one electron: the hydrogen atom. Taking the proton as the centre of a reference system,
the hydrogen's electron must be somewhere around the proton at, say, the distance r. The
wave-function will be $\Psi(x, y, z) = \Psi(r, \theta, \varphi)$, if we work in spherical coordinates. The
probability of finding the electron within a distance r from the proton is:

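$$ \int_0^r |\Psi(r,\theta,\varphi)|^2 \, dV = \int_0^r \int_0^{\pi} \int_0^{2\pi} |\Psi(r)|^2 \, r^2 \sin\theta \, dr \, d\theta \, d\varphi = 4\pi \int_0^r |\Psi(r)|^2 \, r^2 \, dr \qquad (1) $$
and the probability density is:
$$ \rho(r) = 4\pi r^2 |\Psi(r)|^2 = 4\pi r^2 \, \Psi^*(r) \Psi(r) \qquad (2) $$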
If we take the integral of expression (1) from zero to infinity, we are sure that the
electron will be in this sphere and, as a consequence, the probability will be 1.
Considering $x = \{x_1, x_2, x_3, \ldots, x_n\}$, the generalized coordinates [1],[28], or
Lagrangean parameters, the probability to find the particle in the entire space must be 1,
so:
$$ \int_{\text{all space}} |\Psi(x)|^2 \, dV = 1 \qquad (3) $$
where $dV = dx_1 \, dx_2 \, dx_3 \ldots dx_n$ is the elementary volume in the configuration
space (the space of generalized coordinates).
An expression like (3) defines a special class of functions of unit norm, meaning that every
$\Psi$ has a norm of one.
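
Formulas (1) to (3) can be checked numerically. The sketch below is only an illustration: it assumes the hydrogen ground-state wave-function in atomic units, $\Psi(r) = e^{-r}/\sqrt{\pi}$, which is not introduced in the text above, and the function names are hypothetical.

```python
import math

def psi_1s(r: float) -> float:
    """Assumed example: hydrogen ground state in atomic units, exp(-r)/sqrt(pi)."""
    return math.exp(-r) / math.sqrt(math.pi)

def radial_density(r: float) -> float:
    """Probability density of formula (2): 4*pi*r^2*|psi(r)|^2."""
    return 4.0 * math.pi * r * r * psi_1s(r) ** 2

def probability_within(radius: float, steps: int = 20000) -> float:
    """Formula (1): integrate the density from 0 to radius with the trapezoid rule."""
    h = radius / steps
    total = 0.5 * (radial_density(0.0) + radial_density(radius))
    for k in range(1, steps):
        total += radial_density(k * h)
    return total * h

# Probability of finding the electron within one Bohr radius of the proton.
p_one_bohr = probability_within(1.0)
# A very large sphere stands in for "all space" in the norm condition (3).
p_all_space = probability_within(40.0)
print(p_one_bohr, p_all_space)
```

For this particular wave-function the integral (1) has the closed form $1 - e^{-2r}(1 + 2r + 2r^2)$, which the numerical value for r = 1 should reproduce, while the large-radius value should approach 1 as required by (3).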

How does one work with the probability amplitude?
Let us see Fig. 1, with electrons leaving the source EG and reaching the detector D. The
probability amplitude for this process is $\Psi_{(EG)D}$.

Fig. 1 (source EG, slits 1 and 2, detector D, x axis)

The particle can take the path through slit 1, or through slit 2, and the probability
amplitudes for these processes are $\Psi_{(EG)1D} = \Psi_{(EG)1} \Psi_{1D}$ and
$\Psi_{(EG)2D} = \Psi_{(EG)2} \Psi_{2D}$ respectively, with:
$$ \Psi_{(EG)D} = \Psi_{(EG)1} \Psi_{1D} + \Psi_{(EG)2} \Psi_{2D} \qquad (4) $$
and the probability density:
$$ |\Psi_{(EG)D}|^2 = |\Psi_{(EG)1} \Psi_{1D} + \Psi_{(EG)2} \Psi_{2D}|^2 \qquad (5) $$
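
A small numerical illustration of (4) and (5) follows. The two path amplitudes are hypothetical values chosen only for the example; the point is that amplitudes are added first and squared afterwards, which produces an interference term.

```python
import cmath

# Hypothetical amplitudes for the two paths EG -> slit -> D (illustration only).
amp_path_1 = 0.5 * cmath.exp(1j * 0.0)           # stands for Psi_(EG)1 * Psi_1D
amp_path_2 = 0.5 * cmath.exp(1j * cmath.pi / 3)  # stands for Psi_(EG)2 * Psi_2D

# Formula (4): the amplitudes of the alternative paths add.
amp_total = amp_path_1 + amp_path_2

# Formula (5): the probability density is the squared modulus of the sum.
density_quantum = abs(amp_total) ** 2

# Adding probabilities instead of amplitudes would miss the interference term.
density_no_interference = abs(amp_path_1) ** 2 + abs(amp_path_2) ** 2
interference = density_quantum - density_no_interference
print(density_quantum, density_no_interference, interference)
```

With these values the interference term equals $2 \cdot 0.5 \cdot 0.5 \cdot \cos(\pi/3)$, so the relative phase of the two paths decides whether the detector sees more or fewer particles than the sum of the two single-slit densities.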

Another way of writing the probability amplitude is to use Dirac's "bra" and "ket"
vectors.
First of all, why vectors? Because there is another name for the wave-function, or
probability amplitude, namely state vector. Let us have a vector $\vec{r}$ of modulus 1, and
$\vec{i}$, $\vec{j}$, $\vec{k}$ the unit vectors of the Ox, Oy and Oz axes. Let also $\theta_x$, $\theta_y$, $\theta_z$ be the angles made
by $\vec{r}$ with Ox, Oy and Oz, see Fig. 2.
$$ \vec{r} = x\,\vec{i} + y\,\vec{j} + z\,\vec{k}, \qquad x = \cos\theta_x, \quad y = \cos\theta_y, \quad z = \cos\theta_z \qquad (6) $$
With $r^2 = x^2 + y^2 + z^2$ and $\vec{r}$ having modulus 1 we will get:
$$ \cos^2\theta_x + \cos^2\theta_y + \cos^2\theta_z = 1 \qquad (7) $$

Fig. 2 (the unit vector $\vec{r}$ and its components x, y, z along the unit vectors $\vec{i}$, $\vec{j}$, $\vec{k}$)

A straightforward generalization of expression (6) is to consider an n-dimensional space
spanned by the base unit vectors $\Psi_1, \Psi_2, \Psi_3, \ldots, \Psi_n$. Then every other vector $\Psi$ of
modulus 1 can be written as:
$$ \Psi = \sum_{i=1}^{n} c_i \Psi_i \qquad (8) $$
with $c_k$ being complex coefficients satisfying a condition similar to (7):
$$ \sum_{k=1}^{n} |c_k|^2 = 1 \qquad (9) $$
Actually, the expression (8) is the mathematical formulation of a general physical
principle, namely the superposition principle, stating that if $\Psi_1, \Psi_2, \Psi_3, \ldots, \Psi_n$
represent physically realizable processes, then any linear
combination of the form (8) is also a physically realizable process.
A system of functions allowing the expression (8) and the closure relation (9) is called
complete and closed.

Dirac's writing.
Splitting the word bracket in two, "bra" and "ket", Dirac invented a special form of writing,
generally used today in Quantum Theories. Here is Dirac's writing: a physical process
starting from an initial state and reaching a final state is described by the probability
amplitude
$$ \langle \text{final} \,|\, \text{initial} \rangle \qquad (10) $$
The vector $\langle \text{final}|$ is the "bra" vector and $|\text{initial}\rangle$ is the "ket" vector, observing that
the bra vector $\langle \text{final}|$ is the complex conjugate of the ket vector $|\text{final}\rangle$. What we
have to the right of the vertical line is always the initial state, and what we have to the
left is the final state. For instance, $\Psi_{(EG)D}$, the probability amplitude describing the
process of a particle leaving the source EG and reaching the detector D, is now written as:
$$ \Psi_{(EG)D} = \langle D | EG \rangle \qquad (11) $$
and the equivalent of formula (4) is:
$$ \langle D | EG \rangle = \langle D | 1 \rangle \langle 1 | EG \rangle + \langle D | 2 \rangle \langle 2 | EG \rangle \qquad (4') $$
reading the expressions from the right to the left, as Feynman said [2]: the particle leaves
the source EG, passes through orifice 1 (or 2) and reaches the detector D. $\langle 1 | EG \rangle$ is the
probability amplitude for the process of leaving the source and reaching orifice 1, and
$\langle D | 1 \rangle$ the probability amplitude for the process of leaving orifice 1 and reaching the
detector D, their product giving the probability amplitude for the process EG - 1 - D.

Conclusion
1. We saw that the wave-function, the probability amplitude and the state vector
represent the same thing.
2. We were introduced to functions such as $\Psi$, which have the following properties:
a) the probability amplitudes are uniform (single-valued) functions, namely at every point of the physical
space they have a definite value;
b) the wave-functions are bounded in the entire space;
c) the wave-functions are continuous and their partial derivatives are also continuous;
d) the functions are integrable in squared modulus, namely $\int |\Psi|^2 \, dV$ is convergent.
Functions with properties a) to d) make up Hilbert's complex linear vector space of
wave-functions.
3. To every pair of probability amplitudes $\Phi$ and $\Psi$ from the Hilbert space is
associated a number called the inner product, defined as:
$$ \langle \Phi | \Psi \rangle = \int \Phi^* \Psi \, dV \qquad (12) $$
The inner product:
(i) is non-commutative, or skew-symmetric:
$$ \langle \Phi | \Psi \rangle = \langle \Psi | \Phi \rangle^* \qquad (13) $$
(ii) has the property of linearity:
$$ \langle \Phi | a \Psi_1 + b \Psi_2 \rangle = a \langle \Phi | \Psi_1 \rangle + b \langle \Phi | \Psi_2 \rangle \qquad (14) $$
with a and b two constants and $\Psi_1$, $\Psi_2$ two wave-functions;
(iii) has the property of positivity:
$$ \langle \Psi | \Psi \rangle \geq 0 \qquad (15) $$
with $\langle \Psi | \Psi \rangle = 0$ only if $\Psi = 0$.
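
Properties (13) to (15) can be tried out on a discrete analogue of the inner product (12), where the integral is replaced by a finite sum over components. The vectors and constants below are arbitrary sample values, and the helper name `inner` is hypothetical.

```python
def inner(phi, psi):
    """Discrete analogue of (12): sum over k of conj(phi_k) * psi_k."""
    return sum(p.conjugate() * q for p, q in zip(phi, psi))

# Arbitrary sample vectors and constants (illustration only).
phi = [1 + 2j, 0.5j, -1.0 + 0j]
psi1 = [2 - 1j, 1 + 1j, 3j]
psi2 = [0.5 + 0j, -2j, 1 - 1j]
a, b = 2 - 3j, 0.25 + 1j

# (13): exchanging the two vectors gives the complex conjugate.
skew_ok = abs(inner(phi, psi1) - inner(psi1, phi).conjugate()) < 1e-12

# (14): linearity in the second (ket) argument.
combo = [a * u + b * v for u, v in zip(psi1, psi2)]
linear_ok = abs(inner(phi, combo) - (a * inner(phi, psi1) + b * inner(phi, psi2))) < 1e-12

# (15): the inner product of a non-zero vector with itself is real and positive.
norm_sq = inner(psi1, psi1)
positive_ok = norm_sq.real > 0 and abs(norm_sq.imag) < 1e-12
print(skew_ok, linear_ok, positive_ok)
```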

4. Orthogonality. If the inner product of two wave-functions is null, then those two
wave-functions are orthogonal. Here, the word orthogonal is not too far from the usual
meaning, namely perpendicularity. In Fig. 2 we have examples of orthogonality: the unit
vectors $\vec{i}$, $\vec{j}$, $\vec{k}$ are mutually perpendicular. To see how it works, let's write the
complex conjugate of the expression (8):
$$ \Psi^* = \sum_k c_k^* \Psi_k^* \qquad (16) $$
Taking the inner product with $\Psi$, we have:
$$ \langle \Psi | \Psi \rangle = \sum_k c_k^* \langle \Psi_k | \Psi \rangle = \sum_k c_k^* c_k \qquad (17) $$
because wave-functions have the norm 1, see (3), and the complex coefficients $c_k$
satisfy the closure relation (9). By comparing coefficients in (17) we get:
$$ c_k = \langle \Psi_k | \Psi \rangle \qquad (18) $$
According to expression (18), the coefficients of the expansion (8) are uniquely
determined. Introducing the development (8) in (18) we have:
$$ c_k = \Big\langle \Psi_k \Big| \sum_j c_j \Psi_j \Big\rangle = \sum_j c_j \langle \Psi_k | \Psi_j \rangle \qquad (19) $$
For the expression (19) to be satisfied,
$$ \langle \Psi_k | \Psi_j \rangle = \delta_{kj}, \qquad \delta_{kj} = \begin{cases} 1, & k = j \\ 0, & k \neq j \end{cases} \qquad (20) $$
where $\delta_{kj}$ is the Kronecker symbol.
The above expression (20) represents the general relation of orthonormality: the inner
product is 1 for normalized wave-functions, and zero for orthogonal wave-functions.
Any collection of n mutually orthogonal vectors of unit length ($\Psi_1, \Psi_2, \ldots, \Psi_n$) in
an n-dimensional vector space, satisfying condition (20), makes up an orthonormal basis
for that space.
Note. The first postulate doesn't tell us how to find the wave-function and, as a matter of
fact, it is not meant to tell us that. This postulate only says that the wave-function offers
the maximum knowledge. Regarding the probability amplitude, or wave-function, or state
vector, and what they are good for, we shall learn more after the third postulate.

2. The second postulate of Quantum Mechanics

If $\mathcal{H}_1$ is the Hilbert space associated with the physical system $S_1$, and $\mathcal{H}_2$ is the
Hilbert space corresponding to the other physical system $S_2$, then the composite system
$S_1 + S_2$ will be associated with the tensor product of the two Hilbert vector spaces,
$\mathcal{H}_1 \otimes \mathcal{H}_2$.

As a summary, the second postulate of Quantum Mechanics defines the word "and", as
Roger Penrose [3] suggests. Let us consider the non-null vectors $\Psi_1, \Psi_2 \in \mathcal{H}_1$ and
$\Phi_1, \Phi_2 \in \mathcal{H}_2$. The state vector $\Psi_1 \otimes \Phi_1$ is going to tell us the following: the system $S_1$ is
in state $\Psi_1$ and, at the same time, the system $S_2$ is in state $\Phi_1$. This is similar for
$\Psi_2 \otimes \Phi_2$. We have to note that the concept of tensor product, allowing the presence of
both $S_1$ and $S_2$ at the same time, is completely different from the linear superposition
where if, say, $\Psi_1$ and $\Psi_2$ are two possible states for one particle, then a linear
combination of the form (8) will also be a possible state for the same particle, but not for two
particles. Having in mind the second postulate, the state for two particles is described by
the tensor product. Now, the vectors $\Psi_1, \Psi_2 \in \mathcal{H}_1$ are of unit length, and they are
orthogonal. This also holds for $\Phi_1, \Phi_2$. What about the new unit vector $\Psi$?
$$ \Psi = \frac{1}{\sqrt{2}} \left( \Psi_1 \otimes \Phi_1 + \Psi_2 \otimes \Phi_2 \right) \qquad (21) $$
$\Psi$ is a vector in $\mathcal{H}_1 \otimes \mathcal{H}_2$ describing a state of the composite system $S_1 + S_2$. When the
composite system $S_1 + S_2$ is in the state $\Psi$, can we say what state $S_1$ is in, and what
state $S_2$ is in? Definitely not! Neither $S_1$ nor $S_2$ is in a definite state.
The state given by (21) is an entangled state [4].
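
The difference between a product state and the entangled state (21) can be made concrete in the smallest case. The sketch below assumes that both $S_1$ and $S_2$ are two-dimensional systems, so that a composite state is a 2 x 2 table of coefficients; such a table factorizes into a single tensor product exactly when its determinant is zero. The helper names are hypothetical.

```python
import math

def tensor(u, v):
    """Coefficient table of the tensor product u (x) v: entry [i][j] = u[i] * v[j]."""
    return [[ui * vj for vj in v] for ui in u]

def scaled_sum(m1, m2, scale):
    """Entry-wise scale * (m1 + m2) for two coefficient tables."""
    return [[scale * (x + y) for x, y in zip(r1, r2)] for r1, r2 in zip(m1, m2)]

def det2(m):
    """Determinant of a 2 x 2 table; zero means the state is a simple product."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

# Assumed two-level systems: orthonormal states of S1 and of S2.
psi1, psi2 = [1.0, 0.0], [0.0, 1.0]
phi1, phi2 = [1.0, 0.0], [0.0, 1.0]

product_state = tensor(psi1, phi1)                       # S1 in psi1 AND S2 in phi1
entangled = scaled_sum(tensor(psi1, phi1), tensor(psi2, phi2), 1 / math.sqrt(2))  # (21)

det_product = det2(product_state)
det_entangled = det2(entangled)
print(det_product, det_entangled)
```

A non-zero determinant for (21) means there is no pair of single-system states whose tensor product reproduces it, which is the statement that neither $S_1$ nor $S_2$ is in a definite state.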
This second postulate outlines another strange possibility: quantum non-locality. If both
systems $S_1$ and $S_2$ are well separated in space, the entangled state of $S_1 + S_2$ is a
manifestation of non-locality, but a non-locality in a correlated manner, because if we
measure a physical quantity for the first particle, the same physical quantity for the
second seems to be already fixed by the measurement done on the first, as we shall see
later.
The entangled states and the quantum non-locality are very important for quantum
computation.

3. The third postulate of Quantum Mechanics

To every observable of a physical system is associated a self-adjoint (or Hermitian)
operator allowing a complete set of eigenfunctions.

An observable is any physical quantity which can be measured by an experimental
procedure, such as: position, momentum, angular momentum, energy, etc.
An operator is an instruction showing us how to obtain a function g(x) if we know
another function f(x), or the operator maps a function f(x) into another function g(x).
From now on the operators will be written with capital bold letters. Let O be an operator
and f(x) a certain function of coordinate x. Applying the operator O to the function f(x)
we will get the function g(x):
$$ \mathbf{O} f(x) = g(x) \qquad (22) $$
Being in the frame of Quantum Mechanics, our functions are the wave-functions $\Psi$, so
that the operators will map a function from Hilbert space into another function from
Hilbert space.
The physical equivalent of an operator is some device changing something in a particle's
state. For instance: an electron moves along a straight line. If the electron enters the
electric field of a parallel-plate capacitor, its path will be curved, so the electron is in
another state. In this case, the parallel-plate capacitor is the physical operator.
Let $\mathcal{B}$ be an observable, $\mathbf{B}$ the corresponding operator and $\Psi$, $\Phi$ two state vectors from
Hilbert space; then according to (22) we can write:
$$ \mathbf{B} \Psi = \Phi \qquad (23) $$
If $\Phi$ can be written as $b \Psi$, with b a number, then the equation:
$$ \mathbf{B} \Psi = b \Psi \qquad (24) $$
is the eigenvalue equation of the operator $\mathbf{B}$, $\Psi$ being the eigenfunctions and b the
eigenvalues of $\mathbf{B}$.
Let b and f be constants and $\Psi$, $\Phi$ two wave-functions.
$\mathbf{B}$ is a linear operator if:
$$ \mathbf{B} (b \Psi + f \Phi) = b \, \mathbf{B} \Psi + f \, \mathbf{B} \Phi \qquad (25) $$
Since we deal with linear operators only, as a general rule, we shall omit the word linear.
The theory of linear operators is the mathematical apparatus of Quantum Mechanics.

Examples
Log is an operator, $\sqrt{\ }$ is also an operator, but neither Log nor $\sqrt{\ }$ is a linear
operator, because $\mathrm{Log}(b \Psi + f \Phi) \neq b \, \mathrm{Log}\, \Psi + f \, \mathrm{Log}\, \Phi$ and, also,
$\sqrt{b \Psi + f \Phi} \neq b \sqrt{\Psi} + f \sqrt{\Phi}$, so they do not satisfy rule (25).
Derivative and integral are linear operators because they satisfy rule (25), being
distributive over addition.
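
Rule (25) can be tested numerically on sample functions. In the sketch below the derivative is approximated by a central finite difference, and the functions, constants and helper names are assumptions made only for the illustration.

```python
import math

def derivative(func, h=1e-5):
    """Central finite-difference approximation of d/dx, returned as a new function."""
    return lambda x: (func(x + h) - func(x - h)) / (2 * h)

def log_op(func):
    """The operator Log: maps func to the function x -> log(func(x))."""
    return lambda x: math.log(func(x))

# Positive sample functions, so that Log is defined everywhere we evaluate it.
psi = lambda x: x * x + 1.0
phi = lambda x: math.exp(x)
b, f = 2.0, 3.0
combo = lambda x: b * psi(x) + f * phi(x)

x0 = 0.7
# Difference between the two sides of rule (25) for each operator.
deriv_gap = abs(derivative(combo)(x0) - (b * derivative(psi)(x0) + f * derivative(phi)(x0)))
log_gap = abs(log_op(combo)(x0) - (b * log_op(psi)(x0) + f * log_op(phi)(x0)))
print(deriv_gap, log_gap)
```

The gap for the derivative stays at the level of rounding error, while the gap for Log is of order one, in line with the examples above.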

Eigenvalues spectra
The general problem of Quantum Mechanics is to solve equations of the type (24), or, in
other words, to get the solutions for eigenfunctions , and eigenvalues b.
Regarding the eigenvalues, the ensemble of the b values we have got as solutions of the
linear operator equation makes up the eigenvalues spectrum. If b can take only certain
values, we say that the eigenvalues spectrum is discrete. If b can take any value, then
the spectrum is continuous. For example, the position of a particle can take any value
along the axis of real numbers, so its eigenvalues spectrum is a continuous one.
The hydrogen's electron can be only on certain levels, so the electron's energy can take
certain values only, its eigenvalues spectrum being discrete.
If the eigenvalue spectrum is a discrete one, then the norm is of the type (3) or (20) and
the expansion using the basis vectors is of the form (8) with closure relation (9), but if the
spectrum is a continuous one, expressions of this kind are, as we shall see, of a different
type.
For some details of continuous spectra see, e.g., [5,6].

A few necessary definitions
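a) If we have two operators $\mathbf{A}$ and $\mathbf{B}$, then their sum $\mathbf{A} + \mathbf{B}$ is also an operator:
$$ (\mathbf{A} + \mathbf{B}) \Psi = \mathbf{A} \Psi + \mathbf{B} \Psi \qquad (26) $$
b) If we have two operators $\mathbf{A}$ and $\mathbf{B}$, then their product $\mathbf{A} \mathbf{B}$ is also an operator:
$$ (\mathbf{A} \mathbf{B}) \Psi = \mathbf{A} (\mathbf{B} \Psi) \qquad (27) $$
c) If we have two operators $\mathbf{A}$ and $\mathbf{B}$, then:
$$ [\mathbf{A}, \mathbf{B}] = \mathbf{A} \mathbf{B} - \mathbf{B} \mathbf{A} \qquad (28) $$
is also an operator, called the commutator of $\mathbf{A}$ and $\mathbf{B}$, and if
$$ [\mathbf{A}, \mathbf{B}] = \mathbf{A} \mathbf{B} - \mathbf{B} \mathbf{A} = 0 \qquad (29) $$
then we say that the two operators $\mathbf{A}$ and $\mathbf{B}$ commute.
d) If $\mathbf{A}$ is an operator and if there is an operator $\mathbf{A}^{-1}$ such that
$$ \mathbf{A} \mathbf{A}^{-1} = \mathbf{A}^{-1} \mathbf{A} = \mathbf{I} \qquad (30) $$
then $\mathbf{A}^{-1}$ is the inverse operator, with $\mathbf{I}$ the identity operator.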
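
Definitions (27) to (30) can be illustrated with operators represented by small matrices. The 2 x 2 matrices below are arbitrary examples chosen for the sketch, not operators discussed in the text, and the helper names are hypothetical.

```python
def matmul(a, b):
    """Product of two square matrices given as nested lists, as in (27)."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def commutator(a, b):
    """[A, B] = A B - B A, formula (28)."""
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] - ba[i][j] for j in range(n)] for i in range(n)]

# Arbitrary sample operators (illustration only).
A = [[0, 1], [1, 0]]
B = [[1, 0], [0, -1]]
I = [[1, 0], [0, 1]]

comm_AB = commutator(A, B)   # non-zero: A and B do not commute
comm_AI = commutator(A, I)   # zero: every operator commutes with the identity, (29)
A_inverse = A                # this particular A is its own inverse, see (30)
print(comm_AB, comm_AI, matmul(A, A_inverse))
```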

Self-adjoint operator
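While studying a physical process, we are interested in values: values of energy, values
of angular momentum, etc., namely in real numbers. The corresponding operators,
having real numbers as eigenvalues, are self-adjoint or Hermitian operators. Let's
see how it works!
Let $\mathbf{A}$ be an operator and $\Phi$, $\Psi$ two state vectors. If the following equality takes place:
$$ \langle \Phi | \mathbf{A} \Psi \rangle = \langle \mathbf{A}^{+} \Phi | \Psi \rangle \qquad (31) $$
then $\mathbf{A}^{+}$ is called the adjoint operator of $\mathbf{A}$.
If
$$ \mathbf{A} = \mathbf{A}^{+} \qquad (32) $$
then $\mathbf{A}$ is a self-adjoint, or Hermitian, operator.
The eigenvalues of self-adjoint operators are real numbers.
Let $\mathbf{A}$ be a Hermitian operator and $\Psi$ one of its eigenfunctions; then according to (31) and (32) we
can write:
$$ \langle \Psi | \mathbf{A} \Psi \rangle = \langle \mathbf{A} \Psi | \Psi \rangle \qquad (33) $$
With $\mathbf{A} \Psi = a \Psi$, equation (33) becomes:
$$ \langle \Psi | a \Psi \rangle = \langle a \Psi | \Psi \rangle \;\Rightarrow\; a \langle \Psi | \Psi \rangle = a^* \langle \Psi | \Psi \rangle \;\Rightarrow\; (a - a^*) \langle \Psi | \Psi \rangle = 0, \; \langle \Psi | \Psi \rangle \neq 0 \;\Rightarrow\; a = a^*, \text{ so } a \text{ is real} \qquad (34) $$
Observation. I saw the following sentence in a university book [7, p. 38]: "In quantum
mechanics, an observable is a self-adjoint operator." It seems to me the statement is
somewhat shallow. An observable is a physical quantity, and an operator is a
mathematical device acting on a vector space, as the third postulate clarifies.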

The momentum and position eigenvalue problems

Momentum and position are two observables and, according to the third postulate, two
Hermitian operators are associated with them.
The momentum eigenvalue problem
Let us, to begin with, consider that $\vec{p}$ is the particle's momentum, $\mathbf{P}_x$ the operator for
the momentum component along the x-axis, $p_x$ its eigenvalue and $\Psi(x, y, z)$ the
corresponding eigenfunction. The eigenvalue equation is:
$$ \mathbf{P}_x \Psi(x, y, z) = p_x \Psi(x, y, z) \qquad (35) $$
If we solve this equation, then we will know the eigenfunctions and eigenvalues, but
first we must know the form of the operator $\mathbf{P}_x$. The third postulate does not explain how
to find the operators corresponding to observables. So? The answer is similitude. One of
the solutions of the differential equation of electromagnetic waves (and not only) is the
plane wave. Let $\Psi(x, y, z)$ be such a monochromatic plane wave:

$$ \Psi(x, y, z) = a \, e^{i (\vec{k} \cdot \vec{r} - \omega t)} = a \, e^{i \varphi} \qquad (36) $$
with i the imaginary number, $i^2 = -1$, a = amplitude, $\vec{k}$ = the wave vector, $\vec{r}$ =
position vector, $\omega$ = frequency, t = time, $\varphi$ = phase. Looking at the phase $\varphi$, we see that
the derivative of the phase with respect to coordinates gives the wave vector $\vec{k}$. According to
de Broglie's hypothesis, a wave will be associated to each particle, and the wave vector
is connected to the momentum, so the momentum operator can be associated to
derivatives with respect to coordinates. Knowing that $E = h \nu = \hbar \omega$ and $p = h / \lambda = \hbar k$, where
h is Planck's constant ($6.62607 \times 10^{-34}$ J s), $\hbar = h / 2\pi$, $\omega = 2 \pi \nu$ is the angular frequency
(usually named frequency) and k is the wave number (modulus of the wave vector),
$k = 2\pi / \lambda$, $\Psi(x, y, z)$ can be written as:
$$ \Psi = a \, e^{\frac{i}{\hbar} (\vec{p} \cdot \vec{r} - E t)} = a \, e^{\frac{i}{\hbar} (x p_x + y p_y + z p_z - E t)} \qquad (37) $$
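Let's take the derivatives with respect to coordinates:
$$ \frac{\partial \Psi}{\partial x} = \frac{i}{\hbar} p_x \, a \, e^{\frac{i}{\hbar} (x p_x + y p_y + z p_z - E t)} = \frac{i}{\hbar} p_x \Psi $$
$$ \frac{\partial \Psi}{\partial y} = \frac{i}{\hbar} p_y \, a \, e^{\frac{i}{\hbar} (x p_x + y p_y + z p_z - E t)} = \frac{i}{\hbar} p_y \Psi \qquad (38) $$
$$ \frac{\partial \Psi}{\partial z} = \frac{i}{\hbar} p_z \, a \, e^{\frac{i}{\hbar} (x p_x + y p_y + z p_z - E t)} = \frac{i}{\hbar} p_z \Psi $$

Writing:
$$ -i \hbar \frac{\partial \Psi}{\partial x} = p_x \Psi, \qquad -i \hbar \frac{\partial \Psi}{\partial y} = p_y \Psi, \qquad -i \hbar \frac{\partial \Psi}{\partial z} = p_z \Psi $$
and comparing with
$$ \mathbf{P}_x \Psi = p_x \Psi, \qquad \mathbf{P}_y \Psi = p_y \Psi, \qquad \mathbf{P}_z \Psi = p_z \Psi $$
we get:
$$ \mathbf{P}_x = -i \hbar \frac{\partial}{\partial x}, \qquad \mathbf{P}_y = -i \hbar \frac{\partial}{\partial y}, \qquad \mathbf{P}_z = -i \hbar \frac{\partial}{\partial z} \qquad (39) $$
Or, generally,
$$ \mathbf{P} = -i \hbar \nabla \qquad (40) $$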
We have obtained the momentum operator as derivatives with respect to coordinates.
Let $\Psi_{p_x}(x)$ be the wave-function corresponding to the operator $\mathbf{P}_x$. Then, the eigenvalue
equation for $\mathbf{P}_x$ is:
$$ -i \hbar \frac{\partial \Psi_{p_x}(x)}{\partial x} = p_x \Psi_{p_x}(x) \qquad (41) $$
Admitting momentum conservation and integrating equation (41), we will get:
$$ \Psi_{p_x}(x) = N \, e^{\frac{i}{\hbar} p_x x} \qquad (42) $$
with N the norm constant.
What about eigenvalues? Momentum being the product of mass and velocity, the
eigenvalue spectrum is continuous, because the velocity can take any value between -c
and +c (c being the speed of light in vacuum), and mass can be any positive real number. Now, the
momentum eigenfunction can be written as:
$$ \Psi(x, p_x) = N \, e^{\frac{i}{\hbar} p_x x} \qquad (43) $$
because $p_x$ is a real variable like x. What about N? It will be determined very soon. The
wave-functions for $\mathbf{P}_y$ and $\mathbf{P}_z$ will be like (43), with the corresponding coordinates and
momenta.
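
The eigenvalue equation (41) can be verified numerically for the plane wave (43). The sketch assumes units in which $\hbar = 1$ and uses a central finite difference for the derivative; the numerical values of $p_x$ and x are arbitrary.

```python
import cmath

HBAR = 1.0  # assumption: units with hbar = 1, only to keep the numbers simple

def eigenfunction(x, p, norm=1.0):
    """Plane wave of formula (43): N * exp(i * p * x / hbar)."""
    return norm * cmath.exp(1j * p * x / HBAR)

def momentum_operator(func, x, h=1e-5):
    """-i * hbar * d/dx applied to func at the point x (central difference)."""
    return -1j * HBAR * (func(x + h) - func(x - h)) / (2 * h)

p_x = 2.5   # arbitrary eigenvalue
x0 = 0.3    # arbitrary point on the x axis

lhs = momentum_operator(lambda x: eigenfunction(x, p_x), x0)  # P_x applied to Psi
rhs = p_x * eigenfunction(x0, p_x)                            # p_x times Psi
residual = abs(lhs - rhs)
print(residual)
```

The residual is limited only by the finite-difference step, and the check works for any real $p_x$, which reflects the continuous character of the spectrum.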
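How does one norm eigenfunctions corresponding to continuous spectra of eigenvalues?
Let us recall the inner product (12) and consider that we are dealing with a one-dimensional
problem. It means that $\Psi$ and $\Phi$ can be written as:
$$ \Psi(x) = \langle x | \Psi \rangle, \qquad \Phi(x) = \langle x | \Phi \rangle \qquad (44) $$
and (12) becomes:
$$ \langle \Phi | \Psi \rangle = \int_{\text{all } x} \langle \Phi | x \rangle \langle x | \Psi \rangle \, dx \qquad (45) $$
Now, let us consider that $\Phi$ is just $x'$, x and $x'$ being the particle's positions on a straight
line. (45) becomes:
$$ \langle x' | \Psi \rangle = \int_{\text{all } x} \langle x' | x \rangle \langle x | \Psi \rangle \, dx \qquad (46) $$
Recalling (44), we have:
$$ \Psi(x') = \int_{\text{all } x} \langle x' | x \rangle \, \Psi(x) \, dx \qquad (47) $$
An expression like (47) is valid if and only if $\langle x' | x \rangle$ is Dirac's $\delta$ function, $\delta(x' - x)$. By
definition, Dirac's $\delta$ function is:
$$ \int_{-\infty}^{\infty} f(x) \, \delta(x - a) \, dx = f(a) \qquad (48) $$
with $f(x) \to 0$ when $x \to \pm\infty$ and a a real number.
A usual function with such a property does not exist. The integral, in the Riemann sense, of
a function identical to zero except at one point equals zero. By $\delta(x - a)$ we have to
understand a limiting process, meaning that there are functions depending on a parameter
$\alpha$, so that
$$ \lim_{\alpha \to 0} f(x, \alpha) = \delta(x) \qquad (49) $$

Important properties:
$$ \int_{-\infty}^{\infty} \delta(x) \, dx = 1 \qquad (50) $$
With a a non-zero constant,
$$ \delta(a x) = \frac{1}{|a|} \delta(x) \qquad (51) $$
Fourier expansion,
$$ \delta(x) = \frac{1}{2\pi} \int_{-\infty}^{\infty} e^{i \omega x} \, d\omega \qquad (52) $$
$$ \delta(-x) = \delta(x) \qquad (53) $$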
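So, if $\Psi(x, p_x)$ and $\Psi(x, p_x')$ are wave-functions corresponding to the momentum
continuous eigenvalue spectrum, then:
$$ \int_{-\infty}^{\infty} \Psi^*(x, p_x') \, \Psi(x, p_x) \, dx = \delta(p_x - p_x') \qquad (54) $$
Expression (54) is valid for any wave-function corresponding to a continuous
eigenvalue spectrum.
Using (43) in the above expression, we have:
$$ N^* N \int_{-\infty}^{\infty} e^{\frac{i}{\hbar} x (p_x - p_x')} \, dx = \delta(p_x - p_x') \qquad (55) $$
With properties (52) and (51), we have:
$$ 2 \pi \hbar \, |N|^2 \, \delta(p_x - p_x') = \delta(p_x - p_x') \qquad (56) $$
Taking the integral of (56) and using the property (50) we get:
$$ |N|^2 = \frac{1}{2 \pi \hbar} \qquad (57) $$
If N is real, then:
$$ N = \frac{1}{\sqrt{2 \pi \hbar}} \qquad (58) $$
The momentum wave-function will be:
$$ \Psi(x, p_x) = \frac{1}{\sqrt{2 \pi \hbar}} \, e^{\frac{i}{\hbar} p_x x} \qquad (59) $$
and similarly
$$ \Psi(y, p_y) = \frac{1}{\sqrt{2 \pi \hbar}} \, e^{\frac{i}{\hbar} p_y y} \qquad (60) $$
$$ \Psi(z, p_z) = \frac{1}{\sqrt{2 \pi \hbar}} \, e^{\frac{i}{\hbar} p_z z} \qquad (61) $$
The general solution for $\mathbf{P}$ must be of the same type, namely an exponential of $\vec{p} \cdot \vec{r}$, and
this can be obtained by multiplying (59), (60) and (61):
$$ \Psi(x, y, z, p_x, p_y, p_z) = \frac{1}{(2 \pi \hbar)^{3/2}} \, e^{\frac{i}{\hbar} \vec{p} \cdot \vec{r}} \qquad (62) $$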
The position eigenvalue problem
To find the position operator we will use the similitude again. From Classical Mechanics
[6] [32] we know the Poisson brackets. Let f and g be two functions of coordinates
($x, y, z$) and momenta ($p_x, p_y, p_z$). The Poisson bracket corresponding to these two
functions is:
$$ \{f, g\} = \sum_{j = x, y, z} \left( \frac{\partial f}{\partial p_j} \frac{\partial g}{\partial x_j} - \frac{\partial f}{\partial x_j} \frac{\partial g}{\partial p_j} \right) \qquad (63) $$
Taking $f = p_x$ and $g = x$, their Poisson bracket is:
$$ \{p_x, x\} = 1 \qquad (64) $$
If we multiply the expression (64) by a function $\Psi(x)$, we have:
$$ \{p_x, x\} \Psi = \Psi \qquad (65) $$
Now, let's take the derivative with respect to x of the product $x \Psi$:
$$ \frac{\partial}{\partial x} (x \Psi) = \Psi + x \frac{\partial \Psi}{\partial x} \qquad (66) $$
Subtracting $x \, \partial \Psi / \partial x$ from both sides of (66), we get:
$$ \frac{\partial}{\partial x} (x \Psi) - x \frac{\partial \Psi}{\partial x} = \Psi \qquad (67) $$
This expression, (67), seems to be related to (65) if we can find something which is close
to the Poisson bracket.
Now, let $\Psi(x)$ be the wave-function of the position operator $\mathbf{X}$.
The eigenvalue equation of $\mathbf{X}$ is:
$$ \mathbf{X} \Psi = x \Psi \qquad (68) $$
Then, equation (67) can be written:
$$ \frac{\partial}{\partial x} (\mathbf{X} \Psi) - \mathbf{X} \frac{\partial \Psi}{\partial x} = \Psi \quad \text{or} \quad \left( \frac{\partial}{\partial x} \mathbf{X} - \mathbf{X} \frac{\partial}{\partial x} \right) \Psi = \Psi \qquad (69) $$
The transition from (67) to (69) can be made only if the action of the position operator
consists in multiplying a function by x. In the expression (69) the bracket seems to play a
role similar to that of the Poisson bracket. Has it any significance? Yes, it is also an operator,
more precisely the commutator of the operators $\partial / \partial x$ and $\mathbf{X}$, and perhaps Dirac followed
a similar way to introduce the commutativity of operators and the corresponding properties.
The eigenvalues of the position operator make up a continuous spectrum because the
position of a particle can be any real number.
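
Relation (69) can be checked numerically by letting both orderings of the two operators act on a sample function. The Gaussian used below is an arbitrary smooth choice, the derivative is a central finite difference, and the helper names are hypothetical.

```python
import math

def d_dx(func, h=1e-5):
    """Derivative operator: maps func to its central finite-difference derivative."""
    return lambda x: (func(x + h) - func(x - h)) / (2 * h)

def X(func):
    """Position operator: multiplication of the function by x, as in (68)."""
    return lambda x: x * func(x)

psi = lambda x: math.exp(-x * x / 2)  # arbitrary smooth sample function (assumption)

x0 = 0.8
# Left-hand side of (69): (d/dx X - X d/dx) applied to psi, evaluated at x0.
commutator_value = d_dx(X(psi))(x0) - X(d_dx(psi))(x0)
residual = abs(commutator_value - psi(x0))
print(commutator_value, psi(x0), residual)
```

The two orderings differ exactly by the function itself, whatever smooth function is chosen, which is why the bracket in (69) behaves like the constant 1 of the Poisson bracket (64).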

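The angular momentum eigenvalue problem

Classical Mechanics told us that there is a physical quantity called angular momentum,
which is a vector, defined as:
$$ \vec{L} = L_x \vec{i} + L_y \vec{j} + L_z \vec{k} = \vec{r} \times \vec{p} = \begin{vmatrix} \vec{i} & \vec{j} & \vec{k} \\ x & y & z \\ p_x & p_y & p_z \end{vmatrix} \qquad (70) $$
where $\vec{r}$ is the particle's position vector with respect to a reference system whose basis
is made up of the unit vectors $\vec{i}$, $\vec{j}$, $\vec{k}$, and $\vec{p}$ is the particle's momentum. From (70) we get:
$$ L_x = y \, p_z - z \, p_y, \qquad L_y = z \, p_x - x \, p_z, \qquad L_z = x \, p_y - y \, p_x \qquad (71) $$
Knowing that to the observables position and momentum correspond the Hermitian operators
$\mathbf{X}$, $\mathbf{Y}$, $\mathbf{Z}$ and $\mathbf{P}_x$, $\mathbf{P}_y$, $\mathbf{P}_z$ respectively, we can get the angular momentum operators by
replacing the classical observables by the corresponding operators, according to the third
postulate:
$$ \mathbf{L}_x = \mathbf{Y} \mathbf{P}_z - \mathbf{Z} \mathbf{P}_y = -i \hbar \left( y \frac{\partial}{\partial z} - z \frac{\partial}{\partial y} \right) $$
$$ \mathbf{L}_y = \mathbf{Z} \mathbf{P}_x - \mathbf{X} \mathbf{P}_z = -i \hbar \left( z \frac{\partial}{\partial x} - x \frac{\partial}{\partial z} \right) \qquad (72) $$
$$ \mathbf{L}_z = \mathbf{X} \mathbf{P}_y - \mathbf{Y} \mathbf{P}_x = -i \hbar \left( x \frac{\partial}{\partial y} - y \frac{\partial}{\partial x} \right) $$
Note. In (72) i is the imaginary number, $i^2 = -1$.
Also,
$$ \mathbf{L}^2 = \mathbf{L}_x^2 + \mathbf{L}_y^2 + \mathbf{L}_z^2 \qquad (73) $$
To find the angular momentum operators it is easier to pass to the spherical system of
coordinates. Let $f(x, y, z)$ be a certain function of coordinates. In spherical coordinates
we have:
$$ x = r \sin\theta \cos\varphi, \qquad y = r \sin\theta \sin\varphi, \qquad z = r \cos\theta \qquad (73') $$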
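In (72) we have to replace the derivatives with respect to x, y, z with r, $\theta$, $\varphi$ derivatives.
The connection between $\partial / \partial x$, $\partial / \partial y$, $\partial / \partial z$ and $\partial / \partial r$, $\partial / \partial \theta$, $\partial / \partial \varphi$ will be found by solving the following system
of algebraic equations with respect to the unknowns $\partial f / \partial x$, $\partial f / \partial y$, $\partial f / \partial z$:
$$ \frac{\partial f}{\partial r} = \sin\theta \cos\varphi \, \frac{\partial f}{\partial x} + \sin\theta \sin\varphi \, \frac{\partial f}{\partial y} + \cos\theta \, \frac{\partial f}{\partial z} $$
$$ \frac{\partial f}{\partial \theta} = r \cos\theta \cos\varphi \, \frac{\partial f}{\partial x} + r \cos\theta \sin\varphi \, \frac{\partial f}{\partial y} - r \sin\theta \, \frac{\partial f}{\partial z} \qquad (74) $$
$$ \frac{\partial f}{\partial \varphi} = - r \sin\theta \sin\varphi \, \frac{\partial f}{\partial x} + r \sin\theta \cos\varphi \, \frac{\partial f}{\partial y} $$
After some calculations, we get:
$$ \mathbf{L}_x = i \hbar \left( \sin\varphi \, \frac{\partial}{\partial \theta} + \cot\theta \cos\varphi \, \frac{\partial}{\partial \varphi} \right) $$
$$ \mathbf{L}_y = i \hbar \left( -\cos\varphi \, \frac{\partial}{\partial \theta} + \cot\theta \sin\varphi \, \frac{\partial}{\partial \varphi} \right) \qquad (75) $$
$$ \mathbf{L}_z = -i \hbar \, \frac{\partial}{\partial \varphi} $$
With (75) and (73) we will also get:
$$ \mathbf{L}^2 = -\hbar^2 \left[ \frac{1}{\sin\theta} \frac{\partial}{\partial \theta} \left( \sin\theta \, \frac{\partial}{\partial \theta} \right) + \frac{1}{\sin^2\theta} \frac{\partial^2}{\partial \varphi^2} \right] \qquad (76) $$
We have got an important result: the angular momentum operators act upon the
angular coordinates $\theta$ and $\varphi$ only.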

The $\mathbf{L}_z$ eigenvalue problem
Because $\mathbf{L}_z$ acts upon the variable $\varphi$ only, let's consider the corresponding
eigenfunctions $\Phi(\varphi)$. With $L_z$ denoting the eigenvalue, the eigenvalue equation of $\mathbf{L}_z$ will be:
$$ \mathbf{L}_z \Phi(\varphi) = L_z \Phi(\varphi) \qquad (77) $$
With $\mathbf{L}_z$ given by (75) the eigenvalue equation takes the form:
$$ -i \hbar \, \frac{d \Phi(\varphi)}{d \varphi} = L_z \Phi(\varphi) \qquad (78) $$
(78) is a simple differential equation with separable variables and, recalling the angular
momentum conservation law ($L_z$ = constant), the solution is:
$$ \Phi(\varphi) = N \, e^{\frac{i}{\hbar} L_z \varphi} \qquad (79) $$
Now, we have to determine the eigenvalues and the norm constant N. The uniformity
condition of the wave-functions requires
$$ \Phi(\varphi) = \Phi(\varphi + 2\pi) \qquad (80) $$
which gives:
$$ L_z = m \hbar, \quad \text{with } m = 0, \pm 1, \pm 2, \ldots \qquad (81) $$
So, the eigenvalue spectrum of the angular momentum operator $\mathbf{L}_z$ is a discrete one: the
eigenvalue can be (in principle) any whole multiple of $\hbar$, zero included.
Because the eigenvalue spectrum is discrete, the norm condition
$$ \int_0^{2\pi} |\Phi_m(\varphi)|^2 \, d\varphi = N^* N \int_0^{2\pi} e^{-i m \varphi} e^{i m \varphi} \, d\varphi = 1 $$
gives (if N is a real constant) $N = 1 / \sqrt{2\pi}$. So, the eigenfunction of $\mathbf{L}_z$ is:
$$ \Phi_m(\varphi) = \frac{1}{\sqrt{2\pi}} \, e^{i m \varphi} \qquad (82) $$
What is the significance of m? We will find out soon.
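
The uniformity condition (80) and the norm of the eigenfunctions (82) can be checked numerically. The sketch below uses a periodic Riemann sum for the integral over $\varphi$; the helper names are hypothetical.

```python
import cmath
import math

def phi_m(m, angle):
    """Function of the form (82): exp(i * m * angle) / sqrt(2 * pi)."""
    return cmath.exp(1j * m * angle) / math.sqrt(2 * math.pi)

def overlap(m1, m2, steps=2000):
    """Inner product of Phi_m1 and Phi_m2 on [0, 2*pi], by a periodic Riemann sum."""
    h = 2 * math.pi / steps
    return sum(phi_m(m1, k * h).conjugate() * phi_m(m2, k * h) for k in range(steps)) * h

norm_same = overlap(2, 2)      # normalized eigenfunction
norm_different = overlap(1, 3) # eigenfunctions with different m are orthogonal

# Condition (80): an integer m gives the same value after a full turn,
# a non-integer value (here 0.5) does not.
angle = 0.4
jump_integer = abs(phi_m(3, angle + 2 * math.pi) - phi_m(3, angle))
jump_half = abs(phi_m(0.5, angle + 2 * math.pi) - phi_m(0.5, angle))
print(norm_same, norm_different, jump_integer, jump_half)
```

Only whole values of m pass the single-valuedness test, which is how the discrete spectrum (81) arises from (80).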

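The $\mathbf{L}^2$ eigenvalue problem
Let's write:
$$ \Lambda = \frac{1}{\sin\theta} \frac{\partial}{\partial \theta} \left( \sin\theta \, \frac{\partial}{\partial \theta} \right) + \frac{1}{\sin^2\theta} \frac{\partial^2}{\partial \varphi^2} \qquad (83) $$
Then:
$$ \mathbf{L}^2 = -\hbar^2 \Lambda \qquad (84) $$
Because $\mathbf{L}^2$ acts upon the angular variables only, let $Y(\theta, \varphi)$ be its eigenfunction. With $L^2$ denoting the eigenvalue, the
eigenvalue equation will be:
$$ \mathbf{L}^2 \, Y(\theta, \varphi) = L^2 \, Y(\theta, \varphi) \quad \text{or} \quad \Lambda Y(\theta, \varphi) + \frac{L^2}{\hbar^2} Y(\theta, \varphi) = 0 \qquad (85) $$
The equation (85) is known as the differential equation of spherical waves, and it is solved in
any book on Special Functions, see, e.g., [8] or [5]. The solutions of (85) are the
spherical functions $Y_{l m}(\theta, \varphi)$:
$$ Y_{l m}(\theta, \varphi) = P_{l m}(\cos\theta) \, \Phi_m(\varphi) \qquad (86) $$
$P_{l m}(\cos\theta)$ being the (normalized) associated Legendre polynomials of the
first kind:
$$ P_{l m}(\cos\theta) = (-1)^m \sqrt{\frac{2l + 1}{2} \, \frac{(l - m)!}{(l + m)!}} \; \frac{1}{2^l \, l!} \, (1 - \cos^2\theta)^{m/2} \, \frac{d^{\,l + m}}{(d \cos\theta)^{l + m}} (\cos^2\theta - 1)^l \qquad (87) $$
With (82) we have:
$$ Y_{l m}(\theta, \varphi) = (-1)^m \sqrt{\frac{2l + 1}{2} \, \frac{(l - m)!}{(l + m)!}} \; \frac{1}{2^l \, l!} \, (1 - \cos^2\theta)^{m/2} \, \frac{d^{\,l + m}}{(d \cos\theta)^{l + m}} (\cos^2\theta - 1)^l \; \frac{1}{\sqrt{2\pi}} \, e^{i m \varphi} \qquad (88) $$
From (88) we see that for $|m| > l$ there is no non-vanishing $Y_{l m}(\theta, \varphi)$, so there is no solution of (85).
Hence m, for fixed l, can take the values:
$$ m = -l, \, -l + 1, \, \ldots, \, -1, \, 0, \, 1, \, \ldots, \, l - 1, \, l \qquad (89) $$
The eigenvalues of $\mathbf{L}^2$ are
$$ L^2 = \hbar^2 \, l (l + 1), \quad \text{with } l = 0, 1, 2, 3, \ldots \qquad (90) $$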
What are these numbers l and m? As one can see, l and m are connected to the
eigenvalues of the angular momentum operators $\mathbf{L}^2$ and $\mathbf{L}_z$. They are called quantum
numbers: l is the orbital quantum number and m is the magnetic quantum number. Why
are these numbers called orbital and magnetic respectively? I think there are historical
reasons: the first application of Quantum Mechanics was the simplest atom, namely
hydrogen, so the electron orbiting the nucleus has an angular momentum $\vec{L}$, and $L_z$ is its
projection onto the z-axis (the direction of an external magnetic field, e.g.); hence we have got l as the orbital
quantum number and m as the magnetic quantum number, which has nothing magnetic in
itself.
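
The counting contained in (81), (89) and (90) can be written out explicitly. The sketch below assumes units in which $\hbar = 1$ by default, and the function names are hypothetical.

```python
def allowed_m(l):
    """Values of the magnetic quantum number m for a given l, formula (89)."""
    return list(range(-l, l + 1))

def l_squared_eigenvalue(l, hbar=1.0):
    """Eigenvalue of the operator L^2 for orbital quantum number l, formula (90)."""
    return hbar ** 2 * l * (l + 1)

def l_z_eigenvalue(m, hbar=1.0):
    """Eigenvalue of the operator L_z for magnetic quantum number m, formula (81)."""
    return m * hbar

for l in range(3):
    print(l, l_squared_eigenvalue(l), [l_z_eigenvalue(m) for m in allowed_m(l)])
```

For each l there are 2l + 1 values of m, so a single eigenvalue of $\mathbf{L}^2$ corresponds to 2l + 1 different eigenfunctions $Y_{l m}$.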

Dynamics
4. The fourth postulate of Quantum Mechanics

The time evolution of a quantum state is governed by a unitary transformation.
If $\Psi(t)$ is the probability amplitude of a quantum state at time t, then $\Psi(t + \Delta t)$ is
its probability amplitude at a later time $t + \Delta t$, so that
$\Psi(t + \Delta t) = \mathbf{U}(t + \Delta t, t) \, \Psi(t)$, where $\mathbf{U}$ is a unitary linear operator.

$\mathbf{U}$ is unitary if $\mathbf{U} \mathbf{U}^{+} = \mathbf{U}^{+} \mathbf{U} = 1$, where $\mathbf{U}^{+}$ is the adjoint operator of $\mathbf{U}$.
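To find out something about $\mathbf{U}$ we will proceed in a way similar to Feynman's [2], and
let us consider the basis vectors $\Psi_n$, so the expression
$\Psi(t + \Delta t) = \mathbf{U}(t + \Delta t, t) \, \Psi(t)$ can be written as:
$$ \langle \Psi_n | \Psi(t + \Delta t) \rangle = \sum_k \langle \Psi_n | \mathbf{U}(t + \Delta t, t) | \Psi_k \rangle \langle \Psi_k | \Psi(t) \rangle \qquad (91) $$
With
$$ U_{n k}(t + \Delta t, t) = \langle \Psi_n | \mathbf{U}(t + \Delta t, t) | \Psi_k \rangle \qquad (92) $$
(91) becomes:
$$ \langle \Psi_n | \Psi(t + \Delta t) \rangle = \sum_k U_{n k}(t + \Delta t, t) \, \langle \Psi_k | \Psi(t) \rangle \qquad (93) $$
What do we know about the matrix $U_{n k}(t + \Delta t, t)$?
Well, first of all, when $\Delta t \to 0$ we have to reach the initial state, which means that
$U_{n n}(t, t) \to 1$ and $U_{n k}(t, t) \to 0$ for $n \neq k$; therefore $U_{n k}$ can be written as:
$$ U_{n k} = \delta_{n k} - \frac{i}{\hbar} H_{n k} \, \Delta t \qquad (94) $$
Indeed, (94) (the constants i and $\hbar$ are introduced for convenience) satisfies the above
conditions when $\Delta t \to 0$, $\delta_{n k}$ being the Kronecker symbol and $H_{n k}$ another matrix
whose significance will be clarified soon. With (94), (93) becomes:
$$ \langle \Psi_n | \Psi(t + \Delta t) \rangle = \langle \Psi_n | \Psi(t) \rangle - \frac{i}{\hbar} \, \Delta t \sum_k H_{n k} \, \langle \Psi_k | \Psi(t) \rangle \qquad (95) $$
Or:
$$ \frac{\langle \Psi_n | \Psi(t + \Delta t) \rangle - \langle \Psi_n | \Psi(t) \rangle}{\Delta t} = -\frac{i}{\hbar} \sum_k H_{n k} \, \langle \Psi_k | \Psi(t) \rangle = -\frac{i}{\hbar} \sum_k \langle \Psi_n | \mathbf{H} | \Psi_k \rangle \langle \Psi_k | \Psi(t) \rangle $$
that is,
$$ \frac{\Psi(t + \Delta t) - \Psi(t)}{\Delta t} = -\frac{i}{\hbar} \, \mathbf{H} \Psi(t) $$
Passing to the limit $\Delta t \to 0$ in the above expression, we get:
$$ i \hbar \, \frac{d \Psi(t)}{d t} = \mathbf{H} \Psi(t) \qquad (96) $$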
In the above equation $\mathbf{H}$ is going to play a major role. What is it? Let's find out! Firstly,
if $\mathbf{H}$ doesn't depend on time, starting from (96), we will get
$\Psi(t) \propto e^{-\frac{i}{\hbar} \, (\text{something}) \, t}$.
What can that "something" be? Let's come back to classical Physics.
The expression (37) is a plane wave, and it seems to be a relative of the solution
containing "something". By comparison we can think of "something" as being energy.
Taking the time derivative of (37), we get:
$$ \frac{\partial \Psi(t)}{\partial t} = -\frac{i}{\hbar} E \, \Psi(t) \;\Rightarrow\; i \hbar \, \frac{\partial \Psi(t)}{\partial t} = E \, \Psi(t) \qquad (96') $$
From (96) and (96') we conclude that $\mathbf{H}$ must be the operator corresponding to the total
energy of the physical system.
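What about $\mathbf{U}$? Expanding $\Psi(t + \Delta t)$ in a Taylor series around t, we have:
$$ \Psi(t + \Delta t) = \Psi(t) + \Delta t \, \frac{\partial \Psi(t)}{\partial t} + \frac{1}{2!} (\Delta t)^2 \frac{\partial^2 \Psi(t)}{\partial t^2} + \ldots = \left( 1 + \Delta t \, \frac{\partial}{\partial t} + \frac{1}{2!} (\Delta t)^2 \frac{\partial^2}{\partial t^2} + \ldots \right) \Psi(t) = \sum_{k=0}^{\infty} \frac{1}{k!} \left( \Delta t \, \frac{\partial}{\partial t} \right)^k \Psi(t) = e^{\Delta t \, \frac{\partial}{\partial t}} \, \Psi(t) $$
For convenience, the above expression can be written as:
$$ \Psi(t + \Delta t) = e^{i \mathbf{S}} \, \Psi(t), \quad \text{which gives} \quad \mathbf{U} = e^{i \mathbf{S}} \qquad (97) $$
Here $\mathbf{S}$ is a Hermitian operator; comparing with (96), for a time-independent $\mathbf{H}$ one has
$\mathbf{S} = -\mathbf{H} \, \Delta t / \hbar$. From (97) one can see that the unitarity condition is
fulfilled, because $\mathbf{U}^{+} = e^{-i \mathbf{S}}$.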
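
The first-order form (94) of the evolution matrix can be compared with the unitarity condition numerically. The sketch below assumes units with $\hbar = 1$ and an arbitrary 2 x 2 Hermitian matrix in the role of $H_{nk}$; it measures how far $U^{+} U$ is from the unit matrix for two time steps. The helper names are hypothetical.

```python
def dagger(m):
    """Adjoint of a square matrix: transpose and complex-conjugate."""
    n = len(m)
    return [[m[j][i].conjugate() for j in range(n)] for i in range(n)]

def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def step_operator(H, dt, hbar=1.0):
    """Formula (94): U_nk = delta_nk - (i / hbar) * H_nk * dt."""
    n = len(H)
    return [[(1.0 if i == j else 0.0) - 1j * H[i][j] * dt / hbar for j in range(n)]
            for i in range(n)]

def unitarity_defect(U):
    """Largest entry of |U^+ U - 1|; zero for an exactly unitary matrix."""
    n = len(U)
    prod = matmul(dagger(U), U)
    return max(abs(prod[i][j] - (1.0 if i == j else 0.0)) for i in range(n) for j in range(n))

# Arbitrary Hermitian sample matrix (illustration only).
H = [[1.0 + 0j, 0.5 - 0.5j], [0.5 + 0.5j, -1.0 + 0j]]

defect_coarse = unitarity_defect(step_operator(H, 1e-2))
defect_fine = unitarity_defect(step_operator(H, 1e-3))
print(defect_coarse, defect_fine, defect_coarse / defect_fine)
```

The defect shrinks with the square of the time step, so (94) is unitary to first order in $\Delta t$, while the exponential form (97) is unitary exactly.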

The Schrdinger equation
Classical Physics tells us that the total energy of a physical system is the sum of the
kinetic energy and potential energy:

p
0
2
p
2
0
E
m 2
p
E
2
v m
E + = + = (98)
Obviously, in the above expression, the total energy is the non-relativistic energy.
Note
Here, and everywhere else (unless it is explicitly said), by m we will understand the particle's (or physical system's) mass in the sense of Einstein's Theory of Relativity, namely, the particle's mass in the reference system where the particle (or the physical system) is at rest.
In (98) we see two kinds of physical quantities, or (if measurable), two kinds of observables: momentum, whose self-adjoint operators are derivatives with respect to coordinates, and potential energy. The momentum has its own operators, but what about the potential energy? The potential energy depends on the particle's (or physical system's) coordinates, so it is not that crazy to think that the operator corresponding to the potential energy is a multiplicative one, much like the position operator. Hence, the total energy's operator will be found by replacing the classical quantities with the corresponding quantum operators.
$H = -\frac{\hbar^2}{2 m_0}\nabla^2 + U = -\frac{\hbar^2}{2 m_0}\Delta + U$ (99)
where $\Delta = \nabla^2 = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}$ is Laplace's operator, or Laplacian, and U is the multiplicative operator corresponding to potential energy. From Classical Physics [1,9], we know that the total energy of a physical system is just Hamilton's function, hence H will be called the Hamiltonian. H may or may not depend on time. With H given by (99), equation (96) becomes:

$i\hbar\,\frac{\partial\Psi(x,y,z,t)}{\partial t} = -\frac{\hbar^2}{2 m_0}\,\Delta\Psi(x,y,z,t) + U(x,y,z)\,\Psi(x,y,z,t)$ (100)
This is the fundamental equation of Quantum Mechanics and is called the Schrödinger equation.
Can we split the function $\Psi(x,y,z,t)$ into a function depending on coordinates only, and another depending on time? The plane wave (again!) and equation (96) suggest that we can write:
$\Psi(x,y,z,t) = \Psi(x,y,z)\,e^{-\frac{i}{\hbar}E t}$ (101)
Introducing (101) in (100) we have:
$-\frac{\hbar^2}{2 m_0}\,\Delta\Psi(x,y,z) + U(x,y,z)\,\Psi(x,y,z) = E\,\Psi(x,y,z)$ (102)
Equation (102) is also the Schrödinger equation, but this version of the equation is independent of time. In other words, it is the stationary Schrödinger equation. This equation (102) is the energy eigenvalue equation, which tells us that if the physical system has the energy E at the initial time, then at any subsequent time it will have the same energy. The stationary Schrödinger equation is the fundamental equation of the microscopic world with definite energies, e.g. molecules, atoms, nuclei, etc. Knowing the potential energy, or the interaction potential U(x,y,z), by integrating the stationary Schrödinger equation we can find the energy eigenvalues and eigenfunctions of the physical system.
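As an illustration of "integrating the stationary Schrödinger equation", here is a minimal sketch for a case not treated in the text: a particle in a one-dimensional box of unit length with U = 0 inside and impenetrable walls, in assumed units $\hbar = m_0 = 1$. It uses a simple shooting method (a swapped-in numerical technique, not the author's): integrate $\Psi'' = -2E\,\Psi$ from one wall and adjust E by bisection until $\Psi$ vanishes at the other wall. The function names are hypothetical.

```python
import math


def psi_at_wall(energy, n_steps=1000, length=1.0):
    """Integrate psi'' = -2*E*psi (hbar = m0 = 1, U = 0) across the box.

    Starts from psi(0) = 0 with an arbitrary unit slope and returns psi at
    x = length, using the central-difference recurrence.
    """
    h = length / n_steps
    prev, cur = 0.0, h                      # psi(0) = 0, psi(h) ~ h
    for _ in range(n_steps - 1):
        prev, cur = cur, 2.0 * cur - prev - h * h * 2.0 * energy * cur
    return cur


def ground_state_energy(lo=4.0, hi=6.0, tol=1e-10):
    """Bisect on E until psi vanishes at the far wall (bracket is assumed)."""
    f_lo = psi_at_wall(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = psi_at_wall(mid)
        if f_lo * f_mid > 0:
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


E1 = ground_state_energy()
E1_exact = math.pi ** 2 / 2.0               # known result pi^2 hbar^2 / (2 m0 L^2)
print(E1, E1_exact)
```

Only energies for which the boundary condition is met survive, which is how the discrete eigenvalues arise; the numerical value agrees with the known $\pi^2/2$ to within the discretization error.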
Observation. It is necessary to point out that the operator H in the time-dependent Schrödinger equation (100) is the Hamiltonian of the system, and H from the stationary Schrödinger equation is the energy operator. The two H are identical only if the Hamiltonian doesn't depend on time.

An important property
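If $\Psi$ and $\Phi$ are two solutions of the time dependent Schrödinger equation, then their inner product is constant. The Schrödinger equations for these functions and the inner product are:
$i\hbar\,\frac{\partial\Psi}{\partial t} = H\Psi$, $i\hbar\,\frac{\partial\Phi}{\partial t} = H\Phi$, $\langle\Psi|\Phi\rangle$
Taking the time derivative of the inner product and making use of the fact that $\Psi$ and $\Phi$ satisfy the time dependent Schrödinger equations, and also that H is Hermitian, we will have:

$\frac{d}{dt}\langle\Psi|\Phi\rangle = \left\langle\frac{d\Psi}{dt}\Big|\Phi\right\rangle + \left\langle\Psi\Big|\frac{d\Phi}{dt}\right\rangle = \left\langle\frac{1}{i\hbar}H\Psi\Big|\Phi\right\rangle + \left\langle\Psi\Big|\frac{1}{i\hbar}H\Phi\right\rangle$
$= -\frac{1}{i\hbar}\langle H\Psi|\Phi\rangle + \frac{1}{i\hbar}\langle\Psi|H\Phi\rangle = \frac{1}{i\hbar}\left(\langle\Psi|H\Phi\rangle - \langle H\Psi|\Phi\rangle\right) = \frac{1}{i\hbar}\left(\langle\Psi|H\Phi\rangle - \langle\Psi|H\Phi\rangle\right) = 0$

The result is: $\langle\Psi|\Phi\rangle = \text{constant}$ (103)
That is,
$\langle\Psi|\Phi\rangle_{\text{initial time}} = \langle\Psi|\Phi\rangle_{\text{any subsequent time}}$ (104)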

Measurements
5. The fifth postulate of Quantum Mechanics

The fifth postulate outlines the statistical nature of Quantum Mechanics and bridges the
mathematical apparatus introduced by the first four postulates and the experimental
results of a measuring process.
Let $\mathcal{F}$ be an observable and F the corresponding Hermitian operator. With $\Psi(x,y,z)$ the eigenfunctions, the eigenvalue equation can be written as:
$F\Psi = f\,\Psi$ (105)
Considering the general case of a joint (discrete + continuous) eigenvalue spectrum, the superposition principle may be written as:
$\Psi(x,y,z) = \sum_{k=1}^{n} c_k\,\Psi_k(x,y,z) + \int_{\text{all }\alpha} c(\alpha)\,\Psi(x,y,z,\alpha)\,d\alpha$ (106)
where $\Psi_k(x,y,z)$ (k = 1,2,3,...) are the eigenfunctions corresponding to the discrete spectrum, and $\Psi(x,y,z,\alpha)$ the eigenfunctions corresponding to the continuous spectrum ($\alpha$ any real number). We already know that the expansion coefficients are:
$c_k = \langle\Psi_k(x,y,z)|\Psi(x,y,z)\rangle$, $c(\alpha) = \langle\Psi(x,y,z,\alpha)|\Psi(x,y,z)\rangle$ (107)
Question. If we arrange an experimental set-up to measure the observable $\mathcal{F}$, then what are the values we will expect to get? The answer is given by the fifth postulate:
As a result of a measuring process performed upon an observable $\mathcal{F}$, we will obtain only the eigenvalues of the Hermitian operator, F, associated to the observable. The probability of getting an eigenvalue $f_k$ corresponding to the discrete spectrum is $|c_k|^2$, and the probability of getting an eigenvalue f corresponding to the continuous spectrum within an interval $d\alpha$ is $|c(\alpha)|^2\,d\alpha$.
Over the years, the measuring process in Quantum Mechanics gave rise to a lot of
discussions, not only concerning physics, but also philosophy. As we have seen before,
there are entangled states, so if we have two separate particles being (each one of them)
in definite states, then the system of both particles is in an indefinite state. Furthermore, if
we measure a physical quantity for one particle, then Quantum Mechanics predicts the
correct value of the same physical quantity for the second particle without measurements,
as we shall see soon.

The expected value of an observable
Let a be a random variable taking (along a measuring process) the values $a_1, a_2, \dots, a_n$. By definition its expected value $\langle a\rangle$ is:
$\langle a\rangle = \sum_k a_k P_k$ (108)
where $P_k$ is the probability of getting the value $a_k$.
Now, let us choose an observable $\mathcal{F}$ whose corresponding operator, according to the third postulate, is F. The fifth postulate says that during the measuring process we will get the eigenvalues f of this operator only. Let, also, $\Psi_k(x,y,z)$ be the eigenfunctions corresponding to the discrete spectrum and $\Psi(x,y,z,\alpha)$ the eigenfunctions from the continuous spectrum. The eigenvalue equation is (105). Using formula (108) and the words of the fifth postulate, the expected value of $\mathcal{F}$ will be:
$\langle\mathcal{F}\rangle = \sum_k f_k\,|c_k|^2 + \int f(\alpha)\,|c(\alpha)|^2\,d\alpha$ (109)
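The complex conjugates of (107) being $c_k^{*} = \langle\Psi|\Psi_k\rangle$ and $c^{*}(\alpha) = \langle\Psi|\Psi_\alpha\rangle$, with $\Psi_\alpha \equiv \Psi(x,y,z,\alpha)$, we will get:

$\langle\mathcal{F}\rangle = \sum_k f_k\,c_k^{*}c_k + \int f(\alpha)\,c^{*}(\alpha)\,c(\alpha)\,d\alpha$
$= \sum_k c_k\,\langle\Psi|f_k\Psi_k\rangle + \int c(\alpha)\,\langle\Psi|f(\alpha)\Psi_\alpha\rangle\,d\alpha$
$= \sum_k c_k\,\langle\Psi|F\Psi_k\rangle + \int c(\alpha)\,\langle\Psi|F\Psi_\alpha\rangle\,d\alpha$
$= \left\langle\Psi\,\Big|\,F\left(\sum_k c_k\Psi_k + \int c(\alpha)\,\Psi_\alpha\,d\alpha\right)\right\rangle = \langle\Psi|F\Psi\rangle$

So, the expected value of $\mathcal{F}$ is an inner product:
$\langle\mathcal{F}\rangle = \langle\Psi|F\Psi\rangle$ (110)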
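The equality of the two forms of the expected value, the probability-weighted sum (109) and the inner product (110), can be verified on a small discrete example. The sketch below assumes a hypothetical two-level observable with operator `F = [[0, 1], [1, 0]]` (eigenvalues +1 and -1, no continuous spectrum) and an arbitrarily chosen normalized state; none of these values come from the text.

```python
import math

# Hypothetical two-level operator and its known eigen-decomposition.
F = [[0.0, 1.0], [1.0, 0.0]]
eigenvalues = [1.0, -1.0]
s = 1.0 / math.sqrt(2.0)
eigenvectors = [[s, s], [s, -s]]            # orthonormal eigenvectors of F


def inner(u, v):
    """Inner product <u|v>, antilinear in the first argument."""
    return sum(complex(a).conjugate() * b for a, b in zip(u, v))


# Arbitrary normalized state (0.6^2 + 0.8^2 = 1).
psi = [0.6 + 0j, 0.8 * complex(math.cos(0.4), math.sin(0.4))]

# Expansion coefficients c_k = <Psi_k|Psi>, as in (107).
c = [inner(v, psi) for v in eigenvectors]
probabilities = [abs(ck) ** 2 for ck in c]

# Form (109): sum of eigenvalues weighted by probabilities.
via_probabilities = sum(f * p for f, p in zip(eigenvalues, probabilities))

# Form (110): the inner product <Psi|F Psi>.
F_psi = [sum(F[i][j] * psi[j] for j in range(2)) for i in range(2)]
via_inner_product = inner(psi, F_psi).real

print(via_probabilities, via_inner_product)
```

Both numbers coincide, and the probabilities add up to one, as the fifth postulate requires for a normalized state.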

The time derivative of the expected value
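Let's take the time derivative of the expression (110).

$\frac{d}{dt}\langle\mathcal{F}\rangle = \left\langle\frac{\partial\Psi}{\partial t}\Big|F\Psi\right\rangle + \left\langle\Psi\Big|\frac{\partial F}{\partial t}\Psi\right\rangle + \left\langle\Psi\Big|F\frac{\partial\Psi}{\partial t}\right\rangle$
Since $\Psi$ is a solution of the Schrödinger equation (100), we have:

$\frac{d}{dt}\langle\mathcal{F}\rangle = \frac{i}{\hbar}\langle H\Psi|F\Psi\rangle + \left\langle\Psi\Big|\frac{\partial F}{\partial t}\Psi\right\rangle - \frac{i}{\hbar}\langle\Psi|FH\Psi\rangle$
Or, because H is a Hermitian operator:

$\frac{d}{dt}\langle\mathcal{F}\rangle = \left\langle\Psi\Big|\frac{\partial F}{\partial t}\Psi\right\rangle + \frac{i}{\hbar}\langle\Psi|HF\Psi\rangle - \frac{i}{\hbar}\langle\Psi|FH\Psi\rangle$

$\frac{d}{dt}\langle\mathcal{F}\rangle = \left\langle\Psi\Big|\frac{\partial F}{\partial t}\Psi\right\rangle + \frac{i}{\hbar}\langle\Psi|(HF - FH)\Psi\rangle$ (111)
If the operator F doesn't explicitly depend on time, we have:

$\frac{d}{dt}\langle\mathcal{F}\rangle = \frac{i}{\hbar}\langle\Psi|(HF - FH)\Psi\rangle = \frac{i}{\hbar}\langle\Psi|[H,F]\Psi\rangle$ (112)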
A very interesting expression! If the operators H and F commute, then the time derivative of the expected value of the observable is zero, which means the observable is conserved. Here is a symmetry statement: for any physical system with the Hamiltonian H, the observables which will be conserved are those whose operators commute with H. And more: if two operators commute, they have simultaneous eigenfunctions. For details, see e.g. [5]. For example, the operators X and $P_y$ commute, which means that the observables x and $p_y$ are independent variables and can be precisely measured simultaneously.
And even more: the commutativity of operators outlines the possibility of measuring simultaneously the corresponding observables. For instance, operators X and $P_x$, and the next two pairs, do not commute,
$[X, P_x] = [Y, P_y] = [Z, P_z] = i\hbar$ (113)
and it means that the corresponding pairs of variables $(x, p_x)$, $(y, p_y)$, $(z, p_z)$ cannot be precisely measured simultaneously, as the uncertainty principle says.

Uncertainty relations

We have learned enough to deduce the general form of the uncertainty relations.
Let us consider two Hermitian operators F and G. Let C be the commutator of the two:
C = [F,G] = FG - GF
The operator iC is also Hermitian, as one can easily prove.
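Let's also consider $Q = F + i\lambda G$, with $\lambda$ a real constant. Because the adjoint operator of Q is $Q^{+} = F - i\lambda G$, Q is not Hermitian. Let's have $R = Q^{+}Q = (F - i\lambda G)(F + i\lambda G) = F^2 + i\lambda C + \lambda^2 G^2$, and let $\mathcal{R}$ be R's eigenvalue. Then, if $\Psi$ is the probability amplitude describing the quantum system, according to (110), the expected value of $\mathcal{R}$ is:
$\langle\mathcal{R}\rangle = \langle\Psi|Q^{+}Q\Psi\rangle = \langle Q\Psi|Q\Psi\rangle \geq 0$
$\langle\mathcal{R}\rangle = 0$ if and only if $Q\Psi = 0$. On the other side,
$\langle\mathcal{R}\rangle = \langle\mathcal{F}^2 + i\lambda\,\mathcal{C} + \lambda^2\mathcal{G}^2\rangle = \langle\mathcal{F}^2\rangle + \lambda\,\langle i\mathcal{C}\rangle + \lambda^2\langle\mathcal{G}^2\rangle \geq 0$
The last expression is an algebraic expression of the second degree in $\lambda$, the coefficients $\langle\mathcal{F}^2\rangle$, $\langle i\mathcal{C}\rangle$, $\langle\mathcal{G}^2\rangle$ being real numbers because the corresponding operators are Hermitian. So, in order for the inequality to be satisfied for every real $\lambda$, one has to have: $\langle i\mathcal{C}\rangle^2 - 4\,\langle\mathcal{F}^2\rangle\langle\mathcal{G}^2\rangle \leq 0$, or
$\langle\mathcal{F}^2\rangle\langle\mathcal{G}^2\rangle \geq \frac{1}{4}\langle i\mathcal{C}\rangle^2$
The equal (=) sign means the root of the above algebraic equation is double and real, and so, $Q\Psi = 0$.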
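Let $\mathcal{A}$ and $\mathcal{B}$ be two observables and A and B the corresponding Hermitian operators. We can choose $F = A - \langle\mathcal{A}\rangle$ and $G = B - \langle\mathcal{B}\rangle$ (note that then $[F,G] = [A,B]$, so C is unchanged); then $\langle\mathcal{F}^2\rangle = \langle(\Delta\mathcal{A})^2\rangle$ and $\langle\mathcal{G}^2\rangle = \langle(\Delta\mathcal{B})^2\rangle$, so we have: $\langle(\Delta\mathcal{A})^2\rangle\langle(\Delta\mathcal{B})^2\rangle \geq \frac{1}{4}\langle i\mathcal{C}\rangle^2$. If $\Delta\mathcal{A} = (\langle(\Delta\mathcal{A})^2\rangle)^{1/2}$ and $\Delta\mathcal{B} = (\langle(\Delta\mathcal{B})^2\rangle)^{1/2}$ are assimilated with standard deviations, then we will get the most general form of the uncertainty relations:
$\Delta\mathcal{A}\,\Delta\mathcal{B} \geq \frac{1}{2}\,|\langle\mathcal{C}\rangle|$ (114)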
$\mathcal{C}$ is the observable corresponding to the commutator operator C, and becomes zero only if C = 0, which means that F and G commute, and $\Delta\mathcal{A}\,\Delta\mathcal{B} \geq 0$, the equal sign (=) meaning that $\Delta\mathcal{A}$ may be zero, or $\Delta\mathcal{B}$ can be zero, or even both of them are null. Such a situation opens up the possibility of measuring exactly, and simultaneously, two physical quantities, with the corresponding operators A and B, for the same wave function, $\Psi$.
If $\mathcal{C} \neq 0$ there is no possibility of having both $\Delta\mathcal{A} = 0$ and $\Delta\mathcal{B} = 0$, which means that by no means can we have a quantum physical system that allows us to get well determined values simultaneously. If, for instance, the observable $\mathcal{A}$ is precisely determined, then $\Delta\mathcal{A} = 0$, without having any information about $\mathcal{B}$ because $\Delta\mathcal{B} \to \infty$, and reversely.
If we take F = X and G = $P_x$, then $C = [X, P_x] = i\hbar$, $\mathcal{C} = \hbar$, $\langle\mathcal{C}\rangle = \hbar$, and also $\Delta\mathcal{A} = \Delta x$, $\Delta\mathcal{B} = \Delta p_x$, according to Heisenberg's formula
$\Delta x\,\Delta p_x \geq \frac{\hbar}{2}$.
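The general relation (114) can be tested numerically on a finite-dimensional example. The sketch below is an illustration only: it assumes a hypothetical spin-1/2-like system with operators A and B given by the Pauli matrices $\sigma_x$ and $\sigma_y$ (not discussed in the text), and compares $\Delta\mathcal{A}\,\Delta\mathcal{B}$ with half the modulus of the expected value of the commutator for several arbitrarily chosen states.

```python
import cmath
import math

SIGMA_X = [[0, 1], [1, 0]]        # operator A (assumed example)
SIGMA_Y = [[0, -1j], [1j, 0]]     # operator B (assumed example)


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def matvec(m, v):
    return [sum(m[i][j] * v[j] for j in range(len(v))) for i in range(len(m))]


def inner(u, v):
    """Inner product <u|v>, antilinear in the first argument."""
    return sum(complex(a).conjugate() * b for a, b in zip(u, v))


def expect(op, psi):
    """Expected value <psi|op psi> for a normalized state."""
    return inner(psi, matvec(op, psi))


def spread(op, psi):
    """Standard deviation sqrt(<op^2> - <op>^2) of a Hermitian operator."""
    mean = expect(op, psi).real
    mean_sq = expect(matmul(op, op), psi).real
    return math.sqrt(max(mean_sq - mean ** 2, 0.0))


def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] - ba[i][j] for j in range(n)] for i in range(n)]


def both_sides(theta, phi):
    """Return (dA * dB, |<C>| / 2) for the state (cos(t/2), e^{i phi} sin(t/2))."""
    psi = [math.cos(theta / 2), cmath.exp(1j * phi) * math.sin(theta / 2)]
    lhs = spread(SIGMA_X, psi) * spread(SIGMA_Y, psi)
    rhs = 0.5 * abs(expect(commutator(SIGMA_X, SIGMA_Y), psi))
    return lhs, rhs


results = [both_sides(t, p)
           for t in (0.0, 0.5, 1.2, 2.0, 3.0)
           for p in (0.0, 0.7, 2.5)]
for lhs, rhs in results:
    print(f"{lhs:.6f} >= {rhs:.6f}")
```

For every state tried the left side is at least as large as the right side; for theta = 0 the two sides are equal, which is the limiting case of the inequality.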

A couple of things that will prove themselves to be useful one day

Let's have a physical system described by the probability amplitude $\Psi(x,y,z,t)$ and let $\rho = |\Psi(x,y,z,t)|^2$ be the probability density. $\rho$ has the dimensions of m$^{-3}$, so it can be used in finding the density of a number of particles, the charge density, the mass density, and so on.
a) If N is the total number of particles, $N\rho$ will be the density of the particles.
b) If e is the electron's charge, and Ne is the total electric charge carried, $Ne\rho$ is the charge density.
c) If $m_0$ is the mass of a particle, and N is the total number of particles, then $N m_0\,\rho$ is the mass density.
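If $\Psi(x,y,z,t)$ satisfies the Schrödinger equation, then its complex conjugate $\Psi^{*}(x,y,z,t)$ satisfies the complex-conjugated equation. Let's do some calculations. The Schrödinger equations for $\Psi(x,y,z,t)$ and $\Psi^{*}(x,y,z,t)$, multiplied by $\Psi^{*}(x,y,z,t)$ and $\Psi(x,y,z,t)$ respectively, are:
$i\hbar\,\Psi^{*}\frac{\partial\Psi}{\partial t} = -\frac{\hbar^2}{2 m_0}\,\Psi^{*}\Delta\Psi + U\,\Psi^{*}\Psi$
$-i\hbar\,\Psi\frac{\partial\Psi^{*}}{\partial t} = -\frac{\hbar^2}{2 m_0}\,\Psi\Delta\Psi^{*} + U\,\Psi\Psi^{*}$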
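Subtracting, part by part, the two equations above, and knowing that
$\vec{j} = \frac{i\hbar}{2 m_0}\left(\Psi\nabla\Psi^{*} - \Psi^{*}\nabla\Psi\right)$ (115)
we will have, after some calculations:
$\frac{\partial\rho}{\partial t} + \nabla\cdot\vec{j} = 0$ (116)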
Equation (116) is nothing else but the continuity equation, $\vec{j}$ being the probability current density. With (116) we have different conservation laws. Let's integrate this equation over some volume V enclosed by a surface $\Sigma$:
$\frac{\partial}{\partial t}\int_V \rho\,dV = -\int_V \nabla\cdot\vec{j}\,dV = -\oint_\Sigma \vec{j}\cdot\vec{n}\,dA$
Or, $\frac{\partial}{\partial t}\int_V \rho\,dV = -\oint_\Sigma \vec{j}\cdot\vec{n}\,dA$ (117)
In the above calculations we made use of the Gauss formula connecting volume and surface integrals, $\vec{n}$ being the outer normal of the elementary surface dA. Reading formula (117), we can say that the change in time of the probability equals the negative of the flux of the probability current density through the surface $\Sigma$. If we integrate over the entire space, then $\Sigma$ becomes a surface tending to infinity and, taking into account the properties of the vectors of Hilbert space (the wave-functions at infinity are zero), the right side of the expression (117) will cancel, conserving probability and other quantities constructed with $\rho$.

The classical limit of Quantum Mechanics

There is a principle due to Niels Bohr: the correspondence principle. Essentially,
according to this principle, every new physical theory must contain as a limit case the old
theory. So, the Classical Mechanics must be a limiting case of Quantum Mechanics. How
is that? Similitude again! Geometrical optics is obtained from wave optics as a limiting case when the wavelength $\lambda \to 0$. It means that the phase $\varphi$ in (54) is a very big number and, as a result, the amplitude a in $\Psi(x,y,z) = a\,e^{i\varphi}$ is a slowly varying function. Similarly, we can consider a wave-function describing a physical system as being $\Psi(x,y,z) = a\,e^{i\varphi_q}$. How can we choose the quantum phase $\varphi_q$ so that at the classical limit $\varphi_q$ will be a big number? There is a constant existing all throughout Quantum Mechanics: $\hbar$. It is a very small quantity, if we look at it from the classical point of view, because it is of the order of $10^{-34}$ J·s. Choosing $\varphi_q$ to be $S/\hbar$, the classical limit of Quantum Mechanics will be $\hbar \to 0$. Now, the wave-function for a quasi-classical physical system can be written as:
$\Psi = a\,e^{\frac{i}{\hbar}S}$ (118)
Let us suppose that equation (118) is a solution of the Schrödinger equation (100). Introducing (118) in (100) and separating the real and the imaginary parts (the amplitude a being taken as real and time-independent), we will have:
$\frac{(\nabla S)^2}{2 m_0} + U + \frac{\partial S}{\partial t} = \frac{\hbar^2}{2 m_0}\frac{\Delta a}{a} + i\hbar\left[\frac{1}{m_0}\frac{\nabla a\cdot\nabla S}{a} + \frac{\Delta S}{2 m_0}\right]$ (119)
The classical limit means $\hbar \to 0$, hence the right part of (119) is null, which involves:
$\frac{(\nabla S)^2}{2 m_0} + U + \frac{\partial S}{\partial t} = 0$ (120)
What about S? From (119) we can see that S has the dimensions of action, as we know from Classical Mechanics [1,6,9], so, if S is the action of the physical system, then equation (120) is nothing else but the Hamilton-Jacobi equation of a particle of mass m moving in a potential U. Indeed, Classical Mechanics is the limiting case of Quantum Mechanics, when $\hbar \to 0$. Also, Paul Ehrenfest [10] proved two theorems asserting: the expected values of Quantum Mechanics observables satisfy the same equations as Classical Mechanics variables. To prove Ehrenfest's theorems is a pretty easy task if we use the results of the expected values and the time derivative of the expected values discussed above.
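As a sketch of how that proof can go, the derivation below applies (112) to the position and momentum operators. It is restricted, as an assumption, to one dimension, with the Hamiltonian (99) written as $H = P_x^2/(2m_0) + U(x)$, a time-independent potential, and the commutator $[X, P_x] = i\hbar$ from (113).

```latex
\begin{align*}
[H, X] &= \frac{1}{2m_0}\,[P_x^2, X]
        = \frac{1}{2m_0}\bigl(P_x\,[P_x, X] + [P_x, X]\,P_x\bigr)
        = -\frac{i\hbar}{m_0}\,P_x , \\
\frac{d}{dt}\langle x\rangle &= \frac{i}{\hbar}\,\langle\Psi|[H, X]\Psi\rangle
        = \frac{\langle p_x\rangle}{m_0} , \\
[H, P_x] &= [U, P_x] = i\hbar\,\frac{\partial U}{\partial x} , \\
\frac{d}{dt}\langle p_x\rangle &= \frac{i}{\hbar}\,\langle\Psi|[H, P_x]\Psi\rangle
        = -\left\langle\frac{\partial U}{\partial x}\right\rangle .
\end{align*}
```

The two results have the same form as the classical relations between velocity and momentum and between force and the rate of change of momentum, with the classical variables replaced by expected values.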

Conclusion
The five postulates presented above may be considered as the general frame of conventional Quantum Mechanics, and they will help in applying Quantum Mechanics to simple physical systems.

Bibliography

[1] V. Dorobanțu, Physics between fear and respect, Vol. 1, Classical Mechanics, Ed. Politehnica, Timișoara, 2003 (in Romanian).

[2] R. P. Feynman, The Feynman Lectures on Physics, Addison-Wesley, Reading, Massachusetts.

[3] Roger Penrose, Shadows of the Mind, Oxford University Press, 1994.

[4] Abner Shimony, Conceptual foundations of Quantum Mechanics, in The New Physics, ed. by Paul Davies, Cambridge University Press, 1989.

[5] V. A. Fock, Fundamentals of Quantum Mechanics, Mir Publishers, Moscow, 1978.

[6] L. Landau, E. Lifchitz, Mécanique, Éditions Mir, Moscou, 1966.

[7] John Preskill, Lecture Notes for Physics 229: Quantum Information and Computation, California Institute of Technology, September 1998.

[8] A. Angot, Compléments de Mathématiques, Éditions de la Revue d'Optique.

[9] L. Landau, E. Lifchitz, Curso abreviado de Física Teórica, Libro 1, Mecánica y Electrodinámica, Editorial Mir, Moscú, 1979.

[10] P. Ehrenfest, Zeitschrift für Physik, 45, 455 (1927).
