Lecture 1
Lecture 1
Lecture 1
1
Course logistics
Times: Mon, Wed, Fri 12:00pm-1250pm
E-mail: little@cs.ubc.ca
Raise your hand (in Zoom) if you want to ask a question, I will call on you (possibly
not immediately), and then you can unmute and ask it, then mute again
If you don’t have a microphone, you can ask a question in Chat, but I prefer spoken
questions.
PhD, MSc
1978 - 1985
Research Associate
1975-1978
Research Analyst
1972 - 1975
4
Course logistics
Times: Mon, Wed, Fri 12:00pm-1250pm Location: Online (Zoom)
rayat137@cs.ubc.ca whbae@cs.ubc.ca
bichengx@cs.ubc.ca
Tanzila Rahman Gabriel Huang
E-mail: little@cs.ubc.ca
Office: ICICS 117
trahman8@cs.ubc.ca Gabrie20@cs.ubc.ca
5
Course logistics
Times: Mon, Wed, Fri 12-1250pm Locations: Online (Zoom)
7
Course logistics
Times: Mon, Wed, Fri 12:00pm-1250pm Locations: Online (Zoom)
I will use the Canvas and the Course Webpage for assignment
(Canvas) and lecture slide distribution (both).
I will post slides before each lecture, so you can take notes over
them if you wish.
Lectures (Live: Zoom; Recorded: Canvas; Slides: Canvas & Web Page)
Office and TA hours (Zoom)
Assignments (Instructions: Web Page & Canvas; Handin: Canvas)
Assigned Readings (Web Page)
Schedule (Web Page)
Questions & Assignment Support (Piazza)
1
1
Course Origins
CPSC 425 was originally developed by David Lowe, then Bob Woodham, then Fred
Tung and Leon Sigal, and has evolved over the years.
Previously taught by:
— 2021-2022 Term 1 by Jim Little
— 2020-2021 Term 2 by Jim Little
— 2019-2020 Term 2 by Leon Sigal
— 2019-2020 Term 1 by Jim Little
— 2018-2019 Term 2 by Leon Sigal
— 2018-2019 Term 1&2 by Leon Sigal
— 2017-2018 Term 2 by Leon Sigal
— 2016-2017 Term 2 by Jim Little
— 2015-2016 Term 2 by Fred Tung
— 2015-2015 Term 2 by Jim Little
1
2
Course Origins
The course is a very broad, but relatively shallow introduction to a very diverse
and complex field that draws material from geometry, statistics, AI, machine
learning, computer graphics, psychology and many others.
— This means we will cover many topics and different algorithms.
— I will give you as much background and connective tissue as I can
… but, there is no “linear” way to learn the material we will cover
… I will not be able to go into depth on some of the topics
1
3
How to do Well in the Course?
1
4
How to do Well in the Course?
— Some topics we will cover are theoretic and fundamental (e.g., geometry)
— Others are algorithmic (i.e., you make certain assumptions about the world, these
assumptions may not always hold, but will be useful in building algorithms that
ultimately perform well on a prescribed task)
— Computer vision is more of an experimental science - ultimately we are looking at
performance to determine whether our algorithmic choices are successful.
1
5
Grading Criteria
Number of quizzes has not been determined and each quiz may have
different number of questions / points.
Quizzes are designed to get you to think more deeply about what we are
covering and to keep you on track with the material.
Assignments Due (tentative) dates are already posted (so you can plan ahead)
There will be 7 assignments in total (6 marked)
— Approximately 1 every 2 weeks (two are 1.5 weeks)
— You will hand these in by 11:59pm on the due date (read hand-in instructions and late
policy on course webpage)
You will use the Python, with the following libraries: Python
[ Tentatively ] on TBD
— Online, during the lecture period
— Closed book, no notes allowed
2
1
Final Exam
The Final exam is held during the regular examination period and is scheduled by the
Registrar’s Office
Similar to the midterm but longer and with more extensive short/medium answer
questions
2
2
Textbooks
The course uses the following textbook, which is recommended (but not required):
Can be freely downloaded as a PDF from SpringerLink,
Skim the reading before coming to the lecture – read in detail after.
— Reading assignments will be posted on the course webpage
— They will also be mentioned in class
2
4
Computer Vision!
How important is Vision?
2
6
How important is Vision?
2
7
What is Computer Vision?
2
8
What is Computer Vision?
Computer vision, broadly speaking, is a research field aimed to enable computers to
process and interpret visual data, as sighted humans can.
2
9
What do you see?
3
3
What is Computer Vision?
Compute vision, broadly speaking, is a research field aimed to enable computers to
process and interpret visual data, as sighted humans can.
Sensing Device
Interpreting Device
Image (or video) Interpretation
blue sky,
trees,
Image Credit: https://www.flickr.com/photos/flamephoenix1991/8376271918
fountains
UBC, …
3
4
Computer vision … the beginning …
3
7
Can computers match (or beat) human vision?
3
8
Can computers match (or beat) human vision?
3
9
Can computers match (or beat) human vision?
4
0
Can computers match (or beat) human vision?
• Yes and No (mostly NO)
• The shading example shows that human vision does not “see” what’s
in the image. Rather we see the scene and “explain” what is in the
image using prior knowledge and experience.
4
1
Computer Vision Problems
1. Computing properties of the 3D world from visual data (measurement)
Wang et al.
46
4
9
Computer Vision Problems
1. Computing properties of the 3D world from visual data (measurement)
3. Algorithms to mine, search, and interact with visual data (search and
organization)
3. Algorithms to mine, search, and interact with visual data (search and
organization)
Instagram Flickr
4
3. Search and Organization
5
Computer Vision Problems
1. Computing properties of the 3D world from visual data (measurement)
3. Algorithms to mine, search, and interact with visual data (search and
organization)
5
7
4. Visual Imagination
5
8
Computer Vision Problems
1. Computing properties of the 3D world from visual data (measurement)
3. Algorithms to mine, search, and interact with visual data (search and
organization)
7
1
Face Detection
Technology available in any digital camera now
(one of the first big commercial successes of vision algorithms)
7
3
Smile Detection
7
4
Face Recognition
Apple’s iPhoto
http://www.apple.com/ilife/iphoto/
7
6
Vision for Biometrics
“How the Afghan Girl was Identified by Her Iris Patterns” Read the story wikipedia
new laptops,
other devices
Image Credit: James Hays (GA Tech)
7
8
Object Recognition (in supermarkets)
https://www.youtube.com/watch?v=NrmMk1Myrxc
7
9
Object Recognition (in mobile devices)
https://www.youtube.com/watch?v=8SdwVCUJ0QE
https://en.wikipedia.org/wiki/Nokia_Point_&_Find
8
0
3D Urban Modeling and Virtual Tourism
8
5
Vision for Robotics, Space Exploration
NASA'S Mars Exploration Rover Spirit captured this westward view from
atop a low plateau where Spirit spent the closing months of 2007.
Vision systems (JPL) used for several tasks
•Panorama stitching
•3D terrain modeling
•Obstacle detection, position tracking
•For more, read “Computer Vision8 on Mars” by Matthies et al.
6
Vision for Medical Imaging
3D Image guided
imaging surgery
MRI, CT Grimson et al.,
Slide Credit: James Hays (GA Tech)
8 MIT
7
Captioning and Visual Question Answering
Demo: http://vqa.cloudcv.org
Demo: http://demo.visualdialog.org [ Seo et al., NIPS 2017 ]
8
8
Related Disciplines
Artificial Intelligence (AI)
Robotics
Computer Vision
Machine Human Computer
Learning Scope of CPSC 425 Interaction
Graphics Image
Medical Imaging
Processing
Computational
Photography Geometric Neuroscience
Optics
Reasoning
Slide Credit: James Hays (GA Tech)
Recognition
8
9
Prepare for the Next Lecture
Readings:
— Next Lecture: Forsyth & Ponce (2nd ed.) 1.1.1 — 1.1.3
(optional – Secs. 1.1 and 1.2 from Szeliski)
Reminders:
Images Model
Vision
Graphics
9
3
9
4
Wired’s 100 Most Influential People in the World
9
5
9
6
CVPR Attendance
9
7