0% found this document useful (0 votes)

98 views

Coursework Assignment Summer

Uploaded by

Denis Mcdenoh

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

98 views

Coursework Assignment Summer

Uploaded by

Denis Mcdenoh

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Ivor Simpson: Computer Vision @ University of Sussex – Summer 2021

Coursework Assignment

1 Assignment Overview
This assignment will involve you designing, building, testing and critiquing a system for per-
forming face alignment, aka. locating facial landmarks in images. There is also a small extension
task detailed below.
It is worth 100% of the grade for this module. It’s designed to ensure you can demonstrate
achieving the learning outcomes for this module, which are:

• Write and document a computer program to extract useful information from image data.

• Propose designs for simple computer vision systems.

• Determine the applicability of a variety of computer vision techniques to practical prob-

lems.

• Describe and recognise the effects of a variety of image processing operations.

1.1 Secondary Tasks

You will design and implement a simple system for face pose-based image retrieval. This system
should return a small ranked set of images with the most similar pose to a given query (input) face
image. The pose/shape of the face is described by the relative positions of the face landmarks.
This task will require developing a simple approach to measure the distance between face
poses, given the landmarks, and an algorithm to retrieve images that have the most similar pose
to the query face (25%).

2 What to hand in?

1. A report that comprises a maximum of 8 pages and 1500 words, including captions but
excluding references. I’m expecting several pictures, diagrams, flowcharts and charts to be
included.

• A summary and justification for all the steps in your face alignment system, including
preprocessing, choice of image features and prediction model. Explaining diagram-
matically is very welcome.
• Results of your experiments: This should include some discussion of qualitative (ex-
ample based) and quantitative (number based) comparisons between different ap-
proaches that you have experimented with.
Ivor Simpson: Computer Vision @ University of Sussex – Summer 2021

• Qualitative examples of your face alignment approach running on the small set of
provided example images, found in the compressed numpy file (examples.npz) here.
• Examples of failure cases in the face alignment system and a critical analysis of these,
identifying potential causes and solutions.
• Results and a description of your pose-based image retrieval system. This should
include details of how the distance between face poses is calculated and the closest
examples are retrieved.

2. A .csv file that contains the face landmark positions on the test set of images, found in the
compressed numpy file (“test images”.npz) here. You must use the provided “save as csv”
function in the colab worksheet to process an array of shape (number test image, num-
ber points, 2) to a csv file. Please make sure you run this on the right data and submit in
the correct format to avoid losing marks.

3. Either .ipynb files or .py files containing annotated code for all data preprocessing, model
training and testing.

3 How will this be graded?

The breakdown of marks for this assignment are given below:

20 Marks Accuracy and robustness of face alignment

These marks are allocated based on the performance of the face alignment method. This
will be evaluated on the held out test set, which includes some difficult cases. The test
images, without annotations are provided in the compressed numpy file (test images.npz)
here and the error on the predicted points will be calculated after submission. Marks will
be awarded for average accuracy and robustness (% of images with error below a certain
threshold).

30 Marks Outline of methods employed

Justifying and explaining design decisions for the landmark finding. This does not have to
be in depth, and I do not expect you to regurgitate the contents of the lecture notes/papers.
You should state clearly:

• what methods you have used, with what training parameters and why.
• what image features you have used, briefly describe how they were calculated, and
why you chose them.
• any image pre-processing steps you have used, and why.

For top marks, you should clearly demonstrate a creative and methodical approach for
designing your system, drawing ideas from different sources. Explaining using diagrams
and/or flowcharts is very welcome.

20 Marks Analysing results and failure cases

Critically evaluate the results produced by your system on test/validation data. You should
include quantitative (number based) and qualitative (example based) comparisons between
different approaches that you have tried. Quantitative measures including measuring the
cumulative error distribution (see lecture slides) or using boxplots or other plots to compare
Ivor Simpson: Computer Vision @ University of Sussex – Summer 2021

methods. Please note that I am interested in your final prediction results, rather than how
the cost function changes during training. A detailed qualitative analysis would investigate
and identify systematic failure cases, providing visual examples, and propose potential
solutions.

25 Marks Pose-based image retrieval Describe and justify your pose-based image retrieval algo-
rithm, ideally using a diagram, flowchart or psuedocode. Provide a mathematical definition
for your distance function between face poses. The absolute coordinates of the face in the
image are not the best description of pose as they incorporates the position in the image
as well as shape, so you should compensate for this in your approach.
The results should be presented for up to 5 images of different poses taken from either
the training/testing set for use as query images, and you should retrieve the 3 images that
have the closest facial pose. Marks will be allocated for the quality of the description,
appropriateness of the method and presentation of the results.

5 Marks Code annotation is for annotating sections of the training/testing code with what they
do. To get maximum marks, explain each algorithmic step (not necessarily each line) in
your notebook/.py files.

General Points on the report

• Provide references, if you read anything useful. You can take figures from other works as
long as you reference them appropriately.

• Diagrams, flowcharts and pictures are very welcome, make sure you label them properly
and refer to them from the text.

• All plots should have labelled axis.

4 What resources are provided for me?

The training images are provided for you in a compressed Python array. They have already
been preprocessed to be the same size with the faces roughly in the middle of the image and the
eye corners in the same position. The training data can be downloaded as a compressed numpy
file (training images.npz) here.
The data can be read by:
import numpy as np

# Load the data using np . load

data = np . load ( ’ training_images . npz ’ , allow_pickle = True )

# Extract the images : shape = (2811 , 242 , 242 , 3)

images = data [ ’ images ’]
# and the data points : shape = (2811 , 32 , 2)
pts = data [ ’ points ’]
In this very basic colab worksheet I provide code for:

• Loading the data

Ivor Simpson: Computer Vision @ University of Sussex – Summer 2021

• Estimating the error between predictions and ground truth.

• Visualising points on an image

• Saving the results to a .csv file, which contains some checks to make sure you’re predicting
on the correct dataset.

A set of test images, without landmarks is provided in the compressed numpy array (test images.npz)
here. This data is loaded the same way as before, but there are no points stored in the file.
I also include 6 images to use for qualitative comparisons found in the compressed numpy
array (examples.npz) here. These images should be included in your report to demonstrate face
alignment performance across different genders, ethnicities and poses.

4.1 Notes on using Colab

Either you can complete this project using Google colab, which gives you a few hours of com-
puting time completely free of charge, or you can use your personal/lab machine. If you are
using Google colab, try and familiarise yourself with some of it’s useful features. To keep your
saved models, preprocessed data etc. you can save it to Google drive following the instructions
here. You can also directly download a file you make in colab using the code below:
from google . colab import files
files . download ( filename )
If you’re refactor code into extra .py files, these should be stored in your google drive as well,
or on Box such that they are easy to load into your Colab worksheet.

4.2 Most important links

Contents filetype links
Training images and points compressed numpy array link or link
(training images.npz)
Test images compressed numpy file link or link
(test images.npz)
Examples images for qualitative comparisons compressed numpy file (ex- link or link
amples.npz)
Colab worksheet with some useful functions colab worksheet link

4.3 What library functionality can I use?

You’re free to use fundamental components and functions from libraries such as OpenCV, numpy,
scipy, scikitlearn to solve this assignment, although you don’t have to. Here, fundamental compo-
nents refers to things like regression/classification models and pre-processing/feature extraction
steps and other basic functionality. What you are not allowed to use are library functions that
have been written to directly solve the tasks you have been given, i.e. face alignment. You
cannot use the dlib face alignment tool. Also, face detection is not required on this data.
In terms of tools and frameworks, it’s absolutely fine to use convolutional neural networks
(CNNs) if you want to, which are introduced in fundamentals of machine learning. The best
packages would be either TensorFlow (probably with Keras) or PyTorch. If you use such an
approach you should be sure to document how you chose the architecture and loss functions.
Ivor Simpson: Computer Vision @ University of Sussex – Summer 2021

A well justified and high performing CNN approach will receive equivalently high marks as if
you’d built it any other way.
In terms of sourcing additional labelled data, this is not allowed for this assignment. This
is because in real-world commercial projects you will typically have a finite dataset, and even if
there are possibly useful public datasets available, their license normally prohibits commercial
use. On the other hand data augmentation, which effectively synthesises additional training
examples from the labelled data that you have, is highly encouraged. If you use this, please try
and add some text or a flow-chart of this process in your report.

5 Where do I start
5.1 Face Alignment
Face alignment is covered in lecture 14, so that’s a good place to look for information. I briefly
discussed the assignment at the end of the lecture, which you can listen to on Canvas. I’ve also
included some references below.
I have included a very basic colab worksheet illustrating how to load the data and visualise
the points on the face.
The simplest approach would be to treat this as either a regular or a cascaded regression
problem. To follow this approach you will need to consider what image features are helpful to
predict the landmarks and whether some pre-processing is required on the data. One simple way
to proceed is to choose a set of locations, either evenly spaced across the image, or in some more
useful pattern (think about where in the image you might want to calculate more informations).
Using a feature descriptor, such as SIFT, you could predict the landmark locations using linear
regression (see FML lecture 13) and the associated lab.
You’re not restricted to taking this approach, and for higher marks creativity is very much
encouraged. Face alignment has seen a lot of interesting and varied ideas, and if you find some
good ideas while reading around the topic that would be great.

5.2 Pose Based Image Retrieval

The pose of the face is described by the location of the face landmarks. You will need to think
about how to calculate how close two facial poses are to each other, and then retrieve the images
in the dataset with the closest pose. The absolute coordinates of the face in the image are not
the best description of pose as they incorporates the position in the image as well as shape, so
you should compensate for this in your approach.
If possible you should provide results using the test set, but if the predictions given by your
alignment model aren’t accurate enough, you can just use the training set for this section.

6 Top Tips for Success

• Remember Occam’s razor, complexity should not be added unnecessarily. The more com-
plicated your system the more things to explain/justify etc.

• Start with a simple achievable goal and use that as a baseline to test against. Keep track
of early models/results to use as points of comparison.
Ivor Simpson: Computer Vision @ University of Sussex – Summer 2021

Figure 1: Illustration of the 0-indexed (counting from 0 as you would in Python) locations of
the points on the face. For example, if we wanted to find the tip of the nose, that’s index 14
so we would look up points[14,:], which would give you the x and y coordinate of the tip of the
nose.

• Remember that even if it doesn’t work well, having a go at the extension tasks is worth a
few marks. We’re only looking for simple solutions.

• You don’t need to work at very high resolution to get accurate results. Particularly when
doing initial tests, resize your images to a lower resolution images. Make sure you also
transform your training points so they are in the same geometry as the image. For your
predicted points, make sure these are all at the same resolution as the original images.

• Think about things that you’ve learned about in FML as well as Computer Vision. Di-
mensionality reduction could be helpful. Overfitting and outliers may be an issue, and you
should consider using methods to minimise this.

7 Further reading
Face alignment is a reasonably well researched field, and a wide variety of methods have been
proposed. Some relatively recent approaches are documented below. [1] is probably a good one
to look at, [2] contains a survey of methods, which might give you some ideas and [3] describes
the results of a competition. The other references are very much optional reading on other
popular recent methods.

References
[1] Xiong X, De la Torre F. Supervised descent method and its applications to face alignment.
InProceedings of the IEEE conference on computer vision and pattern recognition 2013
(pp. 532-539). Paper link.
Ivor Simpson: Computer Vision @ University of Sussex – Summer 2021

[2] Learned-Miller E, Huang GB, RoyChowdhury A, Li H, Hua G. Labeled faces in the wild: A
survey. InAdvances in face detection and facial image analysis 2016 (pp. 189-248). Springer,
Cham. Paper link.

[3] Sagonas C, Antonakos E, Tzimiropoulos G, Zafeiriou S, Pantic M. 300 faces in-the-wild

challenge: Database and results. Image and vision computing. 2016 Mar 1;47:3-18. Paper
link.

[4] Cao X, Wei Y, Wen F, Sun J. Face alignment by explicit shape regression. International
Journal of Computer Vision. 2014 Apr 1;107(2):177-90. Paper link.

[5] Burgos-Artizzu XP, Perona P, Dollár P. Robust face landmark estimation under occlusion.
InProceedings of the IEEE international conference on computer vision 2013 (pp. 1513-
1520). Paper link

[6] Zhu S, Li C, Change Loy C, Tang X. Face alignment by coarse-to-fine shape searching.
InProceedings of the IEEE conference on computer vision and pattern recognition 2015
(pp. 4998-5006). Paper link

Cis6004 CW1
No ratings yet
Cis6004 CW1
9 pages
Web Image Reranking Project Report
100% (1)
Web Image Reranking Project Report
28 pages
BIT Project Report On Gym Management System
No ratings yet
BIT Project Report On Gym Management System
44 pages
Project Report E-Campus
50% (4)
Project Report E-Campus
21 pages
Installing and Maintaining-Assignment 1-Investigation
No ratings yet
Installing and Maintaining-Assignment 1-Investigation
10 pages
Online Student's Academic Registration System
100% (1)
Online Student's Academic Registration System
22 pages
15 - Conceptual Database Design
100% (1)
15 - Conceptual Database Design
31 pages
DMDW Auto Final
No ratings yet
DMDW Auto Final
12 pages
IASP 550 Final Project: Network Intrusion Detection System
No ratings yet
IASP 550 Final Project: Network Intrusion Detection System
2 pages
Capstone
No ratings yet
Capstone
8 pages
Final Assignment - For Nepal
No ratings yet
Final Assignment - For Nepal
7 pages
Student Franchisee Management System
No ratings yet
Student Franchisee Management System
21 pages
SPSD Assignment
No ratings yet
SPSD Assignment
21 pages
Developing Examination Management System Senior Capstone Project A Case Study
No ratings yet
Developing Examination Management System Senior Capstone Project A Case Study
7 pages
CT071-3-M-OOSSE-Question Paper
No ratings yet
CT071-3-M-OOSSE-Question Paper
4 pages
CIS6003 Advanced Programming: Student Details (Student Should Fill The Content)
No ratings yet
CIS6003 Advanced Programming: Student Details (Student Should Fill The Content)
9 pages
Seham Ali 2021: Chapter-1
No ratings yet
Seham Ali 2021: Chapter-1
33 pages
Project Report Vehicle Management System
No ratings yet
Project Report Vehicle Management System
132 pages
Analytical Survey On Bug Tracking System
No ratings yet
Analytical Survey On Bug Tracking System
10 pages
Report School Management System
No ratings yet
Report School Management System
54 pages
SDS - Software Design Specifications
No ratings yet
SDS - Software Design Specifications
14 pages
Dcs 106 Resit
100% (1)
Dcs 106 Resit
6 pages
Week 1 Module 1 Graded Quiz
No ratings yet
Week 1 Module 1 Graded Quiz
7 pages
Atm Doc
29% (7)
Atm Doc
60 pages
Snapdeal Casestudy by Nishant
100% (1)
Snapdeal Casestudy by Nishant
15 pages
SRS - Software Requirement Specification
No ratings yet
SRS - Software Requirement Specification
53 pages
SPM Project Planning Proposal
No ratings yet
SPM Project Planning Proposal
32 pages
INF09801 Cwk2 - Assess Brief 2021-22 ND
No ratings yet
INF09801 Cwk2 - Assess Brief 2021-22 ND
7 pages
Chapter 4 System Analysis and Design (SAD) Note
No ratings yet
Chapter 4 System Analysis and Design (SAD) Note
6 pages
Ppmob Notes
No ratings yet
Ppmob Notes
148 pages
Corn Leaf Disease Detection (The Crop Master)
No ratings yet
Corn Leaf Disease Detection (The Crop Master)
7 pages
School Management System Project Report.
100% (2)
School Management System Project Report.
44 pages
School Management Proposal 2.2
No ratings yet
School Management Proposal 2.2
17 pages
Smarter Work Management System
No ratings yet
Smarter Work Management System
3 pages
Project 2
0% (1)
Project 2
81 pages
Internal Mark Assessment System: Purpose of The Project
No ratings yet
Internal Mark Assessment System: Purpose of The Project
3 pages
Final Project-I
No ratings yet
Final Project-I
59 pages
DBMS
No ratings yet
DBMS
32 pages
Capstone Project
No ratings yet
Capstone Project
10 pages
CIS6006-Cyber Security WRIT1
No ratings yet
CIS6006-Cyber Security WRIT1
13 pages
Table of Content Chapter 1 Introduction: School Management System
No ratings yet
Table of Content Chapter 1 Introduction: School Management System
34 pages
Synopsis LPU UMS
No ratings yet
Synopsis LPU UMS
7 pages
Online Help Desk
No ratings yet
Online Help Desk
36 pages
Systems Planning and Selection
100% (1)
Systems Planning and Selection
11 pages
Facial Emotion Detection: 1) Background/ Problem Statement
No ratings yet
Facial Emotion Detection: 1) Background/ Problem Statement
6 pages
Daniel
No ratings yet
Daniel
67 pages
Petrol Pump Management
0% (1)
Petrol Pump Management
10 pages
Unit-1 STQA
No ratings yet
Unit-1 STQA
127 pages
Book Shop Management System Documentation
100% (1)
Book Shop Management System Documentation
53 pages
E-Care Help Desk System Java Project
38% (8)
E-Care Help Desk System Java Project
72 pages
Uml Diagram Questions and Answers
No ratings yet
Uml Diagram Questions and Answers
2 pages
MC0088 Data Warehousing & Data Mining
No ratings yet
MC0088 Data Warehousing & Data Mining
10 pages
COCOMO
No ratings yet
COCOMO
45 pages
Placement Office Automation: Bachelor of Technology
No ratings yet
Placement Office Automation: Bachelor of Technology
57 pages
Intern Report Progress
No ratings yet
Intern Report Progress
59 pages
Online Student Result Management System
No ratings yet
Online Student Result Management System
24 pages
Entrepreneurship EXAM Aug
No ratings yet
Entrepreneurship EXAM Aug
3 pages
Pulkit Hospital Management Report
No ratings yet
Pulkit Hospital Management Report
53 pages
Electronic test equipment Third Edition
From Everand
Electronic test equipment Third Edition
Gerardus Blokdyk
No ratings yet
Equity of Cybersecurity in the Education System: High Schools, Undergraduate, Graduate and Post-Graduate Studies.
From Everand
Equity of Cybersecurity in the Education System: High Schools, Undergraduate, Graduate and Post-Graduate Studies.
Joseph O. Esin
No ratings yet
Small business software Standard Requirements
From Everand
Small business software Standard Requirements
Gerardus Blokdyk
No ratings yet
Project 7
No ratings yet
Project 7
7 pages
Implement The Topology With Fully Functional Routers (I.E. Network Connecting The Two Vyos Routers)
No ratings yet
Implement The Topology With Fully Functional Routers (I.E. Network Connecting The Two Vyos Routers)
22 pages
AC413 Fall 2015 CASE STUDY
No ratings yet
AC413 Fall 2015 CASE STUDY
4 pages
Project 4: Message Display Terminal: 1 Objective
No ratings yet
Project 4: Message Display Terminal: 1 Objective
4 pages
AI Cheatsheet Withlinks Compressed
No ratings yet
AI Cheatsheet Withlinks Compressed
15 pages
10 1108 - LHT 07 2021 0242
No ratings yet
10 1108 - LHT 07 2021 0242
24 pages
Automated Vehicle Parking Occupancy Detection in Real-Time
No ratings yet
Automated Vehicle Parking Occupancy Detection in Real-Time
6 pages
Prospectus
No ratings yet
Prospectus
148 pages
Color Vision Test
No ratings yet
Color Vision Test
4 pages
Vedant%20Kumar%20Resume
No ratings yet
Vedant%20Kumar%20Resume
4 pages
Handbook of MRI Pulse Sequence S: Matta
No ratings yet
Handbook of MRI Pulse Sequence S: Matta
7 pages
1 Lecture AI Module1 Intro
No ratings yet
1 Lecture AI Module1 Intro
53 pages
Artificial Intelligence For Robotics and Autonomous Systems Applications
No ratings yet
Artificial Intelligence For Robotics and Autonomous Systems Applications
488 pages
Arteriovenous Malformation (Avm) Di Instalasi Radiologi Rsup DR
No ratings yet
Arteriovenous Malformation (Avm) Di Instalasi Radiologi Rsup DR
5 pages
AI-900 slides
No ratings yet
AI-900 slides
91 pages
Project Report
No ratings yet
Project Report
16 pages
It6005-Digital Image Processing-737663277-It6005 Dip
No ratings yet
It6005-Digital Image Processing-737663277-It6005 Dip
13 pages
I3D-Shufflenet Based Human Action Recognition
No ratings yet
I3D-Shufflenet Based Human Action Recognition
14 pages
CADCAM Question Bank MID-2-1
No ratings yet
CADCAM Question Bank MID-2-1
7 pages
Artifical Intelligence Notes Part 1
No ratings yet
Artifical Intelligence Notes Part 1
22 pages
Professional Certificate Course in AI - Machine Learning E - ICT Academy, IIT Kanpur
No ratings yet
Professional Certificate Course in AI - Machine Learning E - ICT Academy, IIT Kanpur
33 pages
1 Chapter01
No ratings yet
1 Chapter01
21 pages
Render Mental Ray
No ratings yet
Render Mental Ray
57 pages
2022 Slit Lamp Imaging Competition Participation Terms
No ratings yet
2022 Slit Lamp Imaging Competition Participation Terms
3 pages
Computer Vision Notes
No ratings yet
Computer Vision Notes
72 pages
DIP2
No ratings yet
DIP2
22 pages
Dip Lab
No ratings yet
Dip Lab
5 pages
وظائف نظم المعلومات الجغرافية
No ratings yet
وظائف نظم المعلومات الجغرافية
30 pages
Lecture 1 2 Pose in 2d and 3d
No ratings yet
Lecture 1 2 Pose in 2d and 3d
48 pages
Photoshop Notes
No ratings yet
Photoshop Notes
23 pages
Jurnal MRI
No ratings yet
Jurnal MRI
6 pages
Learning Deep Architectures For AI - Yoshua Bengio
No ratings yet
Learning Deep Architectures For AI - Yoshua Bengio
130 pages
04 Morphological Image Processing (Chapter 09)
No ratings yet
04 Morphological Image Processing (Chapter 09)
40 pages
BTP Presentation
No ratings yet
BTP Presentation
21 pages