0% found this document useful (0 votes)

17 views

Week5_Computer_Vision

Uploaded by

albertadi412

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views

Week5_Computer_Vision

Uploaded by

albertadi412

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 58

CSCI218: Foundations

of Artificial Intelligence
Human Vision System

2
Robot Vision System

3
Image Formation

4
Image Formation

5
Simple Image Feature

Image Color Histogram 6

Simple Image Feature

Edge

7
Simple Image Feature

Edge

8
Simple Image Feature

Texture (e.g., Gray-Level Co-Occurrence Matrix (GLCM))

- Characterise how often pairs of pixel with specific values and in a specified spatial relationship occur in an image

9
Simple Image Feature

Optical Flow: Whenever there is relative movement between the camera and one or more
objects in the scene, the resulting apparent motion in the image is called optical flow.

10
Simple Image Feature

Optical Flow: Whenever there is relative movement between the camera and one or more
objects in the scene, the resulting apparent motion in the image is called optical flow.

11
Simple Image Feature

Segmentation of natural images

12
Classifying Images

Important sources of appearance variation

13
Classifying Images

Why convolutional neural networks classify images well

14
Detecting Objects

Faster RCNN for object detection

15
The 3D World

Binocular stereopsis

16
Using Computer Vision

Understanding what people are doing

17
Using Computer Vision

Understanding what people are doing

18
Using Computer Vision

Automated image captioning

19
Using Computer Vision

Visual question-answering
20
Using Computer Vision

Reconstruction from many views

21
Using Computer Vision

Geometry from a single view

22
Using Computer Vision

Making pictures

23
Using Computer Vision

Image Transformation (Paired)

24
Using Computer Vision

Image Transformation (Unpaired)

25
Using Computer Vision

Image Transformation (Style transfer)

26
Using Computer Vision

Image Generation (by GAN)

27
Using Computer Vision

Controlling movement with vision

28
Using Computer Vision

Navigation

29
Image Analysis
§ Overview of Image Analysis
§ Collecting and Representing Image
§ Image Recognition
§ Bag-of-Visual-Words model
§ Deep Convolutional Neural Networks
Overview of Image Analysis
§ Image analysis
§ Refers to the representation, processing, and modelling of visual data to
derive useful insights
§ Suffers from the semantic gap
§ Visual data (image, video, …) is unstructured
§ Semantic gap
§ The gap between high-level concepts used by human and the low-level
features used by computer
Overview of Image Analysis
§ Image recognition (in a narrow sense)
§ Image classification
§ Object detection, localisation, tracking
§ Scene segmentation and reconstruction
§ Image search and retrieval
Overview of Image Analysis
§ Image classification

Face OCR recognition

recognition

Scene recognition Object recognition

Overview of Image Analysis
§ Object detection, localisation, tracking

Object detection and localization

Object tracking (https://www.youtube.com/watch?v=dKpRsdYSCLQ)

Overview of Image Analysis
§ Scene segmentation and reconstruction

[Farabet et al. PAMI 2013]

http://twd20g.blogspot.com.au/2011/12/this-work-presents-novel-system-that.html https://www.3dflow.net/elementsCV/S4.xhtml
Image Analysis Steps
§ Collection and labelling
§ Collect representative images from a given task and label the ground
truth
§ Image representation
§ Select and/or design appropriate image representations (invariant and
discriminative)
§ Image analysis techniques
§ Apply and/or design appropriate analysis techniques for the given tasks
(classification, detection, tracking, segmentation, etc.)
Representing Image
§ Why representing images is difficult?
§ Scale, rotation, illumination, occlusion, background clutter, deformation, …
§ Invariant and Discriminative representation

Cat:
Representing Image
§ Traditional representation (before year 2000)
§ Hand-crafted, global features
§ Intensity, colour, texture, shape, structure, etc.

Colour histogram in a RGB space Face recognition with raw pixel

intensities
Representing Image
§ Days of the BoVW model (2000 ~ 2012)
§ SIFT, HOG, SURF, CENTRIST, filter-based, …
§ Invariant to view angle, scale, illumination, ...

SIFT (Scale Invariant Feature

Transform)

http://www.robots.ox.ac.uk/~vgg/software
/ Image courtesy of David Lowe, IJCV04
Deep Learning Model
Convolutional Neural Networks (CNNs)
§ A special multi-stage architecture inspired by visual system
§ Higher stages compute more global, more invariant features
Deep Learning Model

https://www.datasciencecentral.com/lenet-5-a-classic-cnn-architecture/
Convolution

§ For standard 2D convolution:

Filter

§ The stride is 1.
§ The height and width are changed as:
&'( )&*'+,-.
!"#$ = + 1 = (5 − 3)⁄1 + 1 = 3.
/$0123
Convolution

We need Zero-Padding to keep image size:

The width/height will become:

!&' − !)&*$+, + 2×0122345
!"#$ = +1
678329
Convolution Layers
In convolution layers:
§ Filters are called Kernels and become 3D. The parameters of
kernels (i.e., weights) are to be learned.

Kernel 1
…
Kernel N

'( ×') ×*%&

!×#×$%& !×#×$+,-
Convolution Layers
In convolution layers:
§ Feature maps are the outputs of each layer. The number of
feature maps is the channel.

Feature map 1
…
Feature map N

!×#×$%& !×#×$'()
Convolutional Neural Networks

§ Multi-stage Architecture
Convolution
Non-linearity
Pooling
Convolutional Neural Networks
Convolution
- A set of filters convolve with the input
- Share weights across the input space (translation equivariance)

Input
Filters
Feature Map
Convolutional Neural Networks
Non-linearity

Sigmoid: f(x)=1/(1+e-x) Tanh: f(x)=(ex − e-x)/(ex +e-x) ReLu: f(x)=max(x, 0)

Convolutional Neural Networks

Spatial pooling
§ Non-overlapping / overlapping regions
§ Max or sum
§ Invariance to small transformations

Max pooling

Sum/Average
pooling
Deep Learning Model
CNNs: ImageNet Breakthrough

[Krizhevsky et al. NIPS 2012]

● Krizhevsky et al. win 2012 ImageNet classification with a much bigger ConvNet
○ deeper: 7 stages vs 3 before
○ larger: 60 million parameters vs 1 million before
○ 16.4% error (top-5) vs Next best 26.2% error

● This was made possible by:

○ fast hardware: GPU-optimized code
○ big dataset: 1.2 million images vs thousands before
○ better regularization: dropout et al. Image courtesy of Deng et al.
Deep Learning Model
Learned Features of CNNs

[Matthew D. Zeiler et al. ECCV 2014]

Deep Learning Model

Object detection (Source: Rich feature hierarchies for accurate object detection and semantic
segmentation, CVPR 2014)

Face Recognition (Source: DeepFace: Closing the Gap to Human-Level Performance in Face Verification,
CVPR 2014)
Deep Learning Model

§ Directly use pre-trained CNNs

§ Which layer to use?
§ How to pool the features in a convolutional layer?
Deep Learning Model

§ Directly use pre-trained CNNs

§ Which layer to use?
Convolutional layer
Fully connected
layer
Deep Learning Model
§ Fine-tune pre-trained CNNs
§ To incorporate extra information from the images of a
new recognition task
§ Make the pre-trained CNNs adapt to this new task
Pre-trained CNNs New recognition task
on

Fine-
tune

Image courtesy of Deng et al.

http://people.csail.mit.edu/bzhou/
Summary
§ Computer vision is a key component of AI
§ Image analysis is an important and broad area
§ Feature representation is key for image analysis
§ Deep Learning techniques are now widely used
Acknowledgement

The lecture slides are based on the materials from ai.Berkey.edu

Thank you. Questions?

Generative AI Class9 Skill Education
No ratings yet
Generative AI Class9 Skill Education
27 pages
L3 - UUCLxDeepMind DL2020
No ratings yet
L3 - UUCLxDeepMind DL2020
110 pages
Convolutional Neural Networks: Computer Vision CS 543 / ECE 549 University of Illinois Jia-Bin Huang
No ratings yet
Convolutional Neural Networks: Computer Vision CS 543 / ECE 549 University of Illinois Jia-Bin Huang
76 pages
Oct2022 CSC649 SupervisedDL - CNN
No ratings yet
Oct2022 CSC649 SupervisedDL - CNN
79 pages
4a Convolutional Neural Networks
No ratings yet
4a Convolutional Neural Networks
56 pages
Final Review
No ratings yet
Final Review
24 pages
Introduction To Deep Convolutional Neural Networks: March 2016
No ratings yet
Introduction To Deep Convolutional Neural Networks: March 2016
51 pages
Lecture Sematic-Segmentation
No ratings yet
Lecture Sematic-Segmentation
23 pages
The Three R's of Computer Vision:: Jitendra Malik UC Berkeley
No ratings yet
The Three R's of Computer Vision:: Jitendra Malik UC Berkeley
54 pages
Deep Learning: Alberto Ezpondaburu
No ratings yet
Deep Learning: Alberto Ezpondaburu
58 pages
Create Your Own CamScanner Using Python and OpenCV
No ratings yet
Create Your Own CamScanner Using Python and OpenCV
20 pages
Computer_vision_part1
No ratings yet
Computer_vision_part1
96 pages
Deep Convolutional Neural Networks For Image Classification: Many Slides From Rob Fergus (NYU and Facebook)
No ratings yet
Deep Convolutional Neural Networks For Image Classification: Many Slides From Rob Fergus (NYU and Facebook)
55 pages
[Slide] Module 42
No ratings yet
[Slide] Module 42
95 pages
Chap 1 Digital Image Fundamentals DD
No ratings yet
Chap 1 Digital Image Fundamentals DD
63 pages
The Three R's of Computer Vision:: Jitendra Malik UC Berkeley
No ratings yet
The Three R's of Computer Vision:: Jitendra Malik UC Berkeley
54 pages
Lecture 16 Hao
No ratings yet
Lecture 16 Hao
56 pages
INFO AI Ch4
No ratings yet
INFO AI Ch4
90 pages
Introduction To Convolutional Neural Networks (CNNS)
No ratings yet
Introduction To Convolutional Neural Networks (CNNS)
28 pages
visualProcessing
No ratings yet
visualProcessing
25 pages
Intro
No ratings yet
Intro
23 pages
01 - Introduction To Deep Learning
No ratings yet
01 - Introduction To Deep Learning
56 pages
Convolution Neural Network (CNN) Unit 2: Dr. Kavita R Singh
No ratings yet
Convolution Neural Network (CNN) Unit 2: Dr. Kavita R Singh
65 pages
Neural Image Compression and Explanation: Submitted By: Sampad Mohanty 2002070059
No ratings yet
Neural Image Compression and Explanation: Submitted By: Sampad Mohanty 2002070059
19 pages
1.neural Networks and Convolutional Processing
No ratings yet
1.neural Networks and Convolutional Processing
94 pages
CVlecture 6
No ratings yet
CVlecture 6
33 pages
Context Encoders: Feature Learning by Inpainting
No ratings yet
Context Encoders: Feature Learning by Inpainting
12 pages
Xu DisCoScene Spatially Disentangled Generative Radiance Fields For Controllable 3D-Aware Scene CVPR 2023 Paper
No ratings yet
Xu DisCoScene Spatially Disentangled Generative Radiance Fields For Controllable 3D-Aware Scene CVPR 2023 Paper
11 pages
W11 Lecture ITS69204 Image Recognition (1)
No ratings yet
W11 Lecture ITS69204 Image Recognition (1)
44 pages
paper3
No ratings yet
paper3
11 pages
Context Encoders Feature Learning by Inpainting
No ratings yet
Context Encoders Feature Learning by Inpainting
9 pages
Steps in The Process
No ratings yet
Steps in The Process
15 pages
DRAW: A Recurrent Neural Network For Image Generation
No ratings yet
DRAW: A Recurrent Neural Network For Image Generation
10 pages
Image Processing With Python
No ratings yet
Image Processing With Python
21 pages
Object Detection and Identification
67% (3)
Object Detection and Identification
20 pages
CV - Lec01 - Introduction
No ratings yet
CV - Lec01 - Introduction
50 pages
Cancer Detection and Segmentation Project PPT Compressed
No ratings yet
Cancer Detection and Segmentation Project PPT Compressed
12 pages
Thesis PPT Hritu Raj-1
No ratings yet
Thesis PPT Hritu Raj-1
26 pages
Roy Slides Part 1 3D Reconstruction With Deep Neural Networks
No ratings yet
Roy Slides Part 1 3D Reconstruction With Deep Neural Networks
74 pages
L11 Learning III Neural Network Architectures
No ratings yet
L11 Learning III Neural Network Architectures
35 pages
02 Semantic Segmentation 2024
No ratings yet
02 Semantic Segmentation 2024
53 pages
Keras DL Framework
No ratings yet
Keras DL Framework
29 pages
CNN2
No ratings yet
CNN2
70 pages
Seminar
No ratings yet
Seminar
23 pages
J. Sil 1
No ratings yet
J. Sil 1
6 pages
1.1. Introduction To DIP
No ratings yet
1.1. Introduction To DIP
61 pages
UNIT2-CNN
No ratings yet
UNIT2-CNN
34 pages
Skin Melanoma Stage Detection - CNN
No ratings yet
Skin Melanoma Stage Detection - CNN
55 pages
CV#7 SIFT Scale Invariant Feature Transform
No ratings yet
CV#7 SIFT Scale Invariant Feature Transform
70 pages
Kim Arbitrary-Scale Image Generation and Upsampling Using Latent Diffusion Model and CVPR 2024 Paper
No ratings yet
Kim Arbitrary-Scale Image Generation and Upsampling Using Latent Diffusion Model and CVPR 2024 Paper
10 pages
Computer
No ratings yet
Computer
22 pages
Reviewer - Convolutional Neural Networks (CNNs) - Muqaddas Bin Tahir
No ratings yet
Reviewer - Convolutional Neural Networks (CNNs) - Muqaddas Bin Tahir
8 pages
Medical Image Computing (Cap 5937) : Pre-Processing Medical Images (I)
No ratings yet
Medical Image Computing (Cap 5937) : Pre-Processing Medical Images (I)
79 pages
Chapter 1 [CV & IP]
No ratings yet
Chapter 1 [CV & IP]
41 pages
Pathak Context Encoders Feature CVPR 2016 Paper
No ratings yet
Pathak Context Encoders Feature CVPR 2016 Paper
9 pages
Intro Imaging
No ratings yet
Intro Imaging
41 pages
CO2_CNN_3
No ratings yet
CO2_CNN_3
31 pages
Objectdetection
No ratings yet
Objectdetection
7 pages
Image and Video Super-Resolution
No ratings yet
Image and Video Super-Resolution
62 pages
Pyramid Image Processing: Exploring the Depths of Visual Analysis
From Everand
Pyramid Image Processing: Exploring the Depths of Visual Analysis
Fouad Sabry
No ratings yet
Hidden Surface Determination: Unveiling the Secrets of Computer Vision
From Everand
Hidden Surface Determination: Unveiling the Secrets of Computer Vision
Fouad Sabry
No ratings yet
Week1_Lecture1
No ratings yet
Week1_Lecture1
40 pages
Week4_LearningII
No ratings yet
Week4_LearningII
39 pages
Week2_Lecture
No ratings yet
Week2_Lecture
39 pages
Week3_LearningI
No ratings yet
Week3_LearningI
48 pages
Unit 4 Notes
No ratings yet
Unit 4 Notes
17 pages
Week-3 Module-2 Neural Network
No ratings yet
Week-3 Module-2 Neural Network
58 pages
KNN - Asg 1
No ratings yet
KNN - Asg 1
9 pages
Artificial Neural Network
No ratings yet
Artificial Neural Network
15 pages
ML Lecture # 02 Linear Regression
No ratings yet
ML Lecture # 02 Linear Regression
28 pages
Skin Cancer Detection
No ratings yet
Skin Cancer Detection
16 pages
1000 Machine Learning MCQ (Multiple Choice Questions) - Sanfoundry
No ratings yet
1000 Machine Learning MCQ (Multiple Choice Questions) - Sanfoundry
16 pages
Human Detection
No ratings yet
Human Detection
8 pages
Anomaly Detection Using The Numenta Anomaly Benchmark
No ratings yet
Anomaly Detection Using The Numenta Anomaly Benchmark
8 pages
(2023-Arxiv) VisionLLM Large Language Model Is Also An Open-Ended Decoder For Vision-Centric Tasks
No ratings yet
(2023-Arxiv) VisionLLM Large Language Model Is Also An Open-Ended Decoder For Vision-Centric Tasks
15 pages
Inauguration Program - (10th July)
No ratings yet
Inauguration Program - (10th July)
6 pages
Seminar
No ratings yet
Seminar
9 pages
Time Series Forecast of Electrical Load Based On XGBoost
No ratings yet
Time Series Forecast of Electrical Load Based On XGBoost
10 pages
Chapter One
No ratings yet
Chapter One
9 pages
ML Unit 3
No ratings yet
ML Unit 3
10 pages
CS 229 - Deep Learning Cheatsheet
No ratings yet
CS 229 - Deep Learning Cheatsheet
6 pages
Must Know Questions Deep Learning
No ratings yet
Must Know Questions Deep Learning
22 pages
Harmful Insects Detection Using Convolutional Neural Networks (Faster R-CNN)
No ratings yet
Harmful Insects Detection Using Convolutional Neural Networks (Faster R-CNN)
8 pages
Neural Networks and Deep Learning Practical
No ratings yet
Neural Networks and Deep Learning Practical
15 pages
Micro-Report-format 5 (1)
No ratings yet
Micro-Report-format 5 (1)
13 pages
Data Science Course
100% (1)
Data Science Course
51 pages
A Feature-Wise Attention Module Based On The Difference With Surrounding Features For Convolutional Neural Networks
No ratings yet
A Feature-Wise Attention Module Based On The Difference With Surrounding Features For Convolutional Neural Networks
10 pages
CV - Deep Convolutional Neural Networks
No ratings yet
CV - Deep Convolutional Neural Networks
55 pages
AWS AI and ML Scholarship Skills Guide 2024
No ratings yet
AWS AI and ML Scholarship Skills Guide 2024
9 pages
AI & ML Question Bank
No ratings yet
AI & ML Question Bank
10 pages
2018 12 Abbeel - AI PDF
No ratings yet
2018 12 Abbeel - AI PDF
105 pages
Introduction to Deep Learning Using R: A Step-by-Step Guide to Learning and Implementing Deep Learning Models Using R Taweh Beysolow Ii - The latest ebook is available for instant download now
No ratings yet
Introduction to Deep Learning Using R: A Step-by-Step Guide to Learning and Implementing Deep Learning Models Using R Taweh Beysolow Ii - The latest ebook is available for instant download now
68 pages
Artificial Intelligence:, John Mccarthy
No ratings yet
Artificial Intelligence:, John Mccarthy
29 pages
Artificial Intelligence (Professional Elective - I) Ech. III Year II Sem. LTP C Course Code: CS613PE 3 0 0 3
No ratings yet
Artificial Intelligence (Professional Elective - I) Ech. III Year II Sem. LTP C Course Code: CS613PE 3 0 0 3
2 pages