Kolmogorov Complexity
Kolmogorov Complexity
Kolmogorov Complexity
Crash course
13 January 17 January 2014
12:00 to 14:00
Monday
Probability theory
Uncertainty and coding
Student presentations
27 January 31 January 2014
12:00 to 14:00
Tuesday
The weak law of large numbers
The source coding theorem
Location
ILLC, room F1.15,
Science Park 107, Amsterdam
Wednesday
Random processes
Arithmetic coding
Materials
informationtheory.weebly.com
Thursday
Divergence
Kelly Gambling
Contact
Mathias Winther Madsen
mathias.winther@gmail.com
Friday
Kolmogorov Complexity
The limits of statistics
PLAN
Some combinatorical
preliminaries
Turing machines
Kolmogorov complexity
The universality of
Kolmogorov complexity
The equivalence of
Kolmogorov complexity
and coin flipping entropy
Monkeys with typewriters
PLAN
Some combinatorical
preliminaries:
Factorials
Stirling's approximation
Binomial coefficients
10
20
30
40
50
ln(n!)
15.1
42.3
74.6
110.3
148.5
Stir(n)
13.0
40.0
72.0
107.6
145.6
Sproof:
There are
24
4 3 ==
==
== 12
2
2!
4!
1
2
1
2
1
2
2
1
== 6
*
*
*
*
*
*
n!
==
(n k)! k!
n
k
k
== n H2
n
Example: n = 40, k = 20
137846528820
log
== 25.649 27.726.
1099511627776
PLAN
Some combinatorical
preliminaries
Turing machines
Kolmogorov complexity
The universality of
Kolmogorov complexity
The equivalence of
Kolmogorov complexity
and coin flipping entropy
Monkeys with typewriters
Shortest description?
001001001001.
1100111100111111001111111100.
01011011101111011111.
0100011011000001010011100101110111.
101100010111001000010111111101.
001001001001.
1100111100111111001111111100.
01011011101111011111.
0100011011000001010011100101110111.
101100010111001000010111111101.
Theorem:
There are universal machines.
Sproof:
Consequence:
The Kolmogorov complexity of a string
on two different universal machines
differs only by the length of the
longest simulation program:
KM1(x) KM2(x) == c(M1, M2)
(And constants are sublinear.)
001001001001001001 001001001.
11001111001111110011111111 00.
01011011101111011111 11111111.
010001101100000101001110010 1.
10110001011100100001011111 01.
Theorem:
Most strings don't have any structure.
Proof:
There are 2n strings of length n, and
1 + 2 + 4 + 8 + 16 + + 2n 1
programs of length < n.
PrintString(n, k, i):
construct all string of length n
select the ones containing k 1s
print the ith of those strings.
n = 10,
k = 3,
i = 13.
1 0 1 0 , 1 1 , 1 1 0 1
11 00 11 00 01 11 11 01 11 11 00 11
n = 10,
k = 3,
i = 13.
1 0 1 0 , 1 1 , 1 1 0 1
11 00 11 00 01 11 11 01 11 11 00 11
2 + 4 + 2
n
k
k
== n H2
n
So
K(x)
k
2n H2
n
where is sublinear.
(n)
For other sequences, Kolmogorov complexity is smaller than the Shannon entropy of
the string if modeled as as a coin fl ipping
sequence.
Random
monkey
Universal
machine
Finite string
(or nothing)