0% found this document useful (0 votes)

98 views

E1 216 Computer Vision: Lecture 02: Camera Geometry

The document discusses camera geometry and the pinhole camera model. It explains that a camera can be modeled as a pinhole, which provides a powerful mathematical model. Using this model, the document describes how 3D geometry can be recovered by projecting a 3D point onto 2 image planes from 2 cameras and intersecting the lines. It notes that recovering 3D geometry from images is non-trivial but there have been important advances, with state-of-the-art methods able to handle datasets with over 10,000 images.

Uploaded by

NANDEESH KUMAR K M

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

98 views

E1 216 Computer Vision: Lecture 02: Camera Geometry

Uploaded by

NANDEESH KUMAR K M

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 83

E1 216 COMPUTER VISION

LECTURE 02 : CAMERA GEOMETRY

Venu Madhav Govindu

Department of Electrical Engineering
Indian Institute of Science, Bengaluru

2021
From previous lecture:
• UG course approval
• Linear algebra, Probability, SP, Programming
• Timing ? Extra Classes
• Discuss model-algorithm distinction later
Szeliski 2nd Edition
Projection Models

Szeliski 2nd Edition

Camera Geometry

How do we capture light ?

Science for the Curious Photographer, Steve Seitz

Camera Geometry

How do we capture light ?

Pinhole Camera

Science for the Curious Photographer, Steve Seitz

Camera Geometry

How do we capture light ?

Pinhole Camera

Why ?

Science for the Curious Photographer, Steve Seitz

What is a Camera ?

Camera = Pinhole

Science for the Curious Photographer, wikipedia

What is a Camera ?

Camera = Pinhole

Powerful Mathematical Model

Science for the Curious Photographer, wikipedia

Camera Geometry

Pinhole Camera Model

• What are the consequences of this model ?
• Imagine you project a 3D point onto the image plane
• Where did it come from ?
Camera Geometry

Pinhole Camera Model

• What are the consequences of this model ?
• Imagine you project a 3D point onto the image plane
• Where did it come from ?
Camera Geometry

Pinhole Camera Model

• What are the consequences of this model ?
• Imagine you project a 3D point onto the image plane
• Where did it come from ?
Camera Geometry

Recovering 3D Geometry
• Consider two cameras (one is never enough)
• Take pictures
• Maps to points on image planes
• Know linear constraint on 3D point from left camera
• Use right camera constraint to intersect
Camera Geometry

Many Considerations
• Do we know camera parameters ? (intrinsic calibration)
• Do we know orientations of cameras ? (extrinsic calibration)
• Match features (representation,matching,robustness)
• Do the backprojected rays intersect ? (structure estimation)
• Extend this principle to multiple images
• Non-trivial, but many important advances
• State-of-the-art can handle large datasets (> 104 images)
Camera Geometry

Camera Systems
• Camera imaging surface - typically a rectangular plane
• Human retina is closer to a spherical surface
• Vastly diﬀerent image plane geometries
• Fundamental 3D-2D imaging model is the same
• Spatial sampling is uniform for typical cameras
• Omnidirectional cameras
Camera Model : Perspective Projection

• Very simple geometry

• Suﬃciently powerful representation
• Virtual Image considered in front of focus
• Real cameras do deviate from this model
Camera Model : Perspective Projection

Y
X X
y
x
x fY/Z
C Z C
p p Z
principal axis f
camera
centre image plane

• Coordinate system with origin at camera centre

• World coordinates of point P = (X , Y , Z )
• Image projection measured in local image coordinate system
• Image coordinates p = (x, y)

By simple similarity of triangles we have

x = fZX
y = fZY
Camera Model : Perspective Projection

Changing focal length

• Keep camera ﬁxed, change focal length
• What happens to the volume imaged ?

Science for the Curious Photographer; Forstyh and Ponce 2nd Edition.
Camera Model : Perspective Projection
fX
x= Z
fY
y= Z

Implications
• Diﬀerent points are scaled diﬀerent according to depth
• Introduces non-linearities in the relationships
• Distant objects are smaller
• Cannot judge object size with a single image
Perspective projection
• Cannot judge object size with a single image
• Judgement of size can be wrong!

From twitter.com/rainmaker1973
Y

Y
X X
y
x
x fY/Z
C Z C
p p Z
principal axis f
camera
centre image plane

Two co-ordinate systems!

• Remember that we have two measurements of interest
• Measurements on the image plane
• Measurements in the 3D world
• Our interest is to relate the two
Camera Model (contd.)

Consider perspective projection model

f

x X
=
y Z Y
Now let’s translate the frame of reference (or camera), new co-ordinates
0
f

x X + tx
0 =
y Z + tz Y + ty
Camera Model (contd.)

Consider perspective projection model

f

x X
=
y Z Y
Now let’s translate the frame of reference (or camera), new co-ordinates

0 (X + tx )
x = f
(Z + tz )
0 (Y + ty )
y = f
(Z + tz )
Camera Model (contd.)

Or if we were to rotate the camera by rotation matrix R

r11 r12 r13

 

R =  r21 r22 r23 

r31 r32 r33
The new 3D coordinates would be
 0  
r11 r12 r13
 
X X
 0  
 Y  = r21 r22 r23   Y 
Z
0
r31 r32 r33 Z
Camera Model (contd.)

Therefore, the new image projections would look like

r11 X + r12 Y + r13 Z

x = f
r31 X + r32 Y + r33 Z
r21 X + r22 Y + r23 Z
y = f
r31 X + r32 Y + r33 Z

• Now if we apply an additional transformation, the two rotations

would get entangled
• End result of multiple transformations is very messy!
• Need a cleaner approach
Homogeneous Representations

To arrive at a solution, we take recourse to geometry

Geometric approaches
• “Purist” view - co-ordinate free approach to geometry
• Classical theorems due to Euclid
• Since Descartes, there’s an algebraic view of geometric constructs
• Duality : Geometry ↔ Algebra
T
• Circle : Centre + Radius ↔ (p − p 0 ) (p − p 0 ) = r 2
Homogeneous Representations

Consider a line y = mx + c
Rewrite as mx − y + c = 0
or generally as
ax+by+c = 0

Rewriting this we have

x
 

a b c  y =0

1
Homogeneous Representation of a Line

p
z }| {
x
a b c  y =0

| {z } 1
l

this results in a nice symmetric form

lT p = 0

This form has many advantages over y = mx + c form

Homogeneous Representation of a Line

Consider the intersection of two lines

To solve for the point of intersection
y = m1 x + c1
y = m2 x + c2
(y−c1 )
Solve simultaneous equations by substitution, x = m1

m2
y = (y − c1 ) + c2
m1
m2 c1 m2
(1 − )y = c2 −
m1 m1
(c2 − c1mm12 )
y = =
(1 − mm21 )

Quite a mess!!
Homogeneous Representation of a Line

In the homogeneous system of representation we have

l 1T p = 0
l 2T p = 0

Therefore, the co-ordinates of the intersection is given by

p = l1 × l2

Verify
• l 1 T (l 1 × l 2 ) = 0
• l 2 T (l 1 × l 2 ) = 0
• Much cleaner way of solving
Homogeneous Representation of a Line
Consider the line through two given points
(X1,Y1)

Line : Y=mX+c

(X2,Y2)

Usual solution is messy

Instead, using homogenous coordinates, we get the dual representation

Line : l = p 1 × p 2

Easily veriﬁed that this satisﬁes the requirements

T
• (p 1 × p 2 ) p 1 =0
T
• (p 1 × p 2 ) p 2 =0
Homogeneous Representation

The key relationship to note is that

p
z }| {
x
a b c  y =0

| {z } 1
l

results in a nice symmetric (and homogeneous) form

lT p = 0

This form has many advantages over y = mx + c form

Homogeneous Representation

In homogenous form everything upto unknown scalar

Homogeneous Inhomogeneous
Rn 7→ Rn+1 Rn 7→ Rn−1
 u 
u u
   
u w

7  v 
→  v  7→  v 
v w
1 w 1

Homogeneous Forms
• Embed in higher dimensions by appending a 1 (canonical)
• Homogeneous forms are equivalent upto scale
• Only ratios matter
• [u, v, w] = λ [u, v, w] , ∀λ 6= 0
• Notice [0, 0, 0] is not admissible
Homogeneous Representation

In homogenous form everything upto unknown scalar

Homogeneous Inhomogeneous
Rn 7→ Rn+1 Rn 7→ Rn−1
 u 
u u
   
u w

7  v 
→  v  7→  v 
v w
1 w 1

• Geometry : Topological Space + Axioms

• Different set of axioms Different Geometries
• Euclidean (Distances and Angles)
• Affine (Parallelism)
• Projective (Straight Line)
• Non-linear (Riemannian Manifolds)

Stratiﬁcation of transform space

Euclidean ⊂ Aﬃne ⊂ Projective
Euclidean Geometry

Axioms of incidence
• Familiar concepts from Euclidean geometry
• Length is a fundamental property of Euclidean Geometry
• Construction with straightedge and compass
• Axioms of Euclid

Following Hilbert state the axioms as

• For any two points A, B, a unique line passes through A, B
• Every line contains at least two points
• There exist three points not all on the same line
• Parallel axiom : For any line L and point P outside L, there is
exactly one line through P that does not meet L
Wikipedia

• Two points have a unique line through them (join)

• Two lines have a unique intersection point (meet)
• What happens when the lines are parallel ?
• What does it mean to say that they “intersect at ∞”?
• Question : Are all ∞ intersection points the same ?
• The answer lies in the geometry of projective space
• Take a detour via aﬃne geometry
• Recall homogeneous representations
Homogeneous Forms

Parallel Lines
T
• Recall line equation: l p = 0
T
• l and p upto scale factor l T p = (λl) (λ p) = 0
0