CNN RNN Assignment Set 4

The document provides instructions for building an image captioning model using a convolutional neural network (CNN) encoder and recurrent neural network (RNN) decoder. The tasks include: 1. Importing libraries and checking GPU availability. 2. Reading the dataset, visualizing sample images and captions, and preprocessing the train and test splits. 3. Building a model using a pretrained ResNet-50 CNN for image features and a 5-layer GRU RNN for caption generation with regularization. 4. Compiling the model with an appropriate loss and optimizer. 5. Training the model for epochs and evaluating loss/accuracy on validation data. 6. Generating captions for new images.

Uploaded by

Surendra Tanwar

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (1 vote)

488 views

CNN RNN Assignment Set 4

Uploaded by

Surendra Tanwar

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Prepare

python notebook (recommended- use Google Colab) to build, train and evaluate model (tensorflow or tensorflow.keras
library recommended). Read the instructions carefully.

Question: Image Captioning : Image Captioning is the process of generating textual description of an image. It uses
both Natural Language Processing and Computer Vision to generate the captions. The dataset will be in the form [image
→ captions]. The dataset consists of input images and their corresponding output captions.

Encoder
The Convolutional Neural Network(CNN) can be thought of as an encoder. The input image is given to CNN to extract
the features. The last hidden state of the CNN is connected to the Decoder.

Decoder
The Decoder is a Recurrent Neural Network(RNN) which does language modelling up to the word level. The first time
step receives the encoded output from the encoder and also the <START> vector.

(13 marks)
1. Import Libraries/Dataset (0 mark)
a. Import the required libraries
b. Check the GPU available (recommended- use free GPU provided by Google Colab).

2. Data Visualization and augmentation (3 mark)
1. Read the pickle file (https://drive.google.com/file/d/1-JvZrIgH3xVBV--yjiACQXaC-6-vQag8/view?usp=sharing) and
convert the data into the correct format which could be used for ML model.
Pickle file contains the image id and the text associated with the image.
Eg: '319847657_2c40e14113.jpg#0\tA girl in a purple shirt hold a pillow .
Each image can have multiple captions.
319847657_2c40e14113.jpg -> image name
#0 -> Caption ID
\t -> separator between Image name and Image Caption
A girl in a purple shirt hold a pillow . -> Image Caption
Corresponding image wrt image name can be found in the image dataset folder.

Image dataset Folder : https://drive.google.com/file/d/1-mPKMpphaKqtT26ZzbR5hCHGedkNyAf1/view?usp=sharing

a. Plot at least two samples and their captions (use matplotlib/seaborn/any other library).
b. Bring the train and test data in the required format.

3. Model Building (7 mark)
a. Use Pretrained Resnet-50 model trained on ImageNet dataset (available publicly on google) for image feature extraction.
b. Create 5 layered GRU layer model and other relevant layers for image caption generation.
c. Add L2 regularization to all the GRU layers.
d. Add one layer of dropout at the appropriate position and give reasons.
e. Choose the appropriate activation function for all the layers.
f. Print the model summary.

4. Model Compilation (1 mark)
a. Compile the model with the appropriate loss function.
b. Use an appropriate optimizer. Give reasons for the choice of learning rate and its value.
5. Model Training (1 mark)
a. Train the model for an appropriate number of epochs. Print the train and validation loss for each epoch. Use the appropriate
batch size.
b. Plot the loss and accuracy history graphs for both train and validation set. Print the total time taken for training.
6. Model Evaluation (1 mark)
a. Take a random image from google and generate caption for that image.

Fullstack Development Test
No ratings yet
Fullstack Development Test
10 pages
Bootstrap Powerpoint
100% (1)
Bootstrap Powerpoint
20 pages
Normalization Example: Project Management Report
No ratings yet
Normalization Example: Project Management Report
3 pages
Motion Detection
No ratings yet
Motion Detection
33 pages
Information Storage and Retrieval: Chapter One - Introduction
No ratings yet
Information Storage and Retrieval: Chapter One - Introduction
50 pages
Introduction of Machine Learning
No ratings yet
Introduction of Machine Learning
58 pages
Object Detection and Tracking Algorithms For Vehicle Counting: A Comparative Analysis
No ratings yet
Object Detection and Tracking Algorithms For Vehicle Counting: A Comparative Analysis
11 pages
Lab Manual
No ratings yet
Lab Manual
28 pages
Chapter 8 Code Optimization and Code Generation
No ratings yet
Chapter 8 Code Optimization and Code Generation
58 pages
ML Unit-Iv
No ratings yet
ML Unit-Iv
19 pages
Deep Learning Approach For Ethiopian Banknote Denomination Classification and Fake Detection System
No ratings yet
Deep Learning Approach For Ethiopian Banknote Denomination Classification and Fake Detection System
8 pages
Practice Final sp22
No ratings yet
Practice Final sp22
10 pages
Introduction To Object Detection
No ratings yet
Introduction To Object Detection
24 pages
Neural Networks
No ratings yet
Neural Networks
29 pages
Answers For End-Sem Exam Part - 2 (Deep Learning)
No ratings yet
Answers For End-Sem Exam Part - 2 (Deep Learning)
20 pages
18AI61
No ratings yet
18AI61
3 pages
Data Literacy Questions All Types
No ratings yet
Data Literacy Questions All Types
2 pages
Loss Functions
No ratings yet
Loss Functions
37 pages
ML First Unit
No ratings yet
ML First Unit
70 pages
DATA MINING Chapter 1 and 2 Lect Slide
No ratings yet
DATA MINING Chapter 1 and 2 Lect Slide
47 pages
DSF - UNIT III Notes
No ratings yet
DSF - UNIT III Notes
17 pages
Question Bank Module-1: Department of Computer Applications 18mca53 - Machine Learning
No ratings yet
Question Bank Module-1: Department of Computer Applications 18mca53 - Machine Learning
7 pages
1 Introduction
No ratings yet
1 Introduction
45 pages
Soft Computing Assignment
100% (1)
Soft Computing Assignment
13 pages
Deep Learning Unit1
No ratings yet
Deep Learning Unit1
63 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
6 pages
Object Detection Tutorial
No ratings yet
Object Detection Tutorial
9 pages
Evaluation Metrics For Regression: Dr. Jasmeet Singh Assistant Professor, Csed Tiet, Patiala
No ratings yet
Evaluation Metrics For Regression: Dr. Jasmeet Singh Assistant Professor, Csed Tiet, Patiala
13 pages
Tools Machine Learning
No ratings yet
Tools Machine Learning
9 pages
Data Science
No ratings yet
Data Science
39 pages
Clustering & Association Algorithms 4
No ratings yet
Clustering & Association Algorithms 4
17 pages
SOC Lab Manual
No ratings yet
SOC Lab Manual
11 pages
Single Layer Perceptron Classifier
No ratings yet
Single Layer Perceptron Classifier
62 pages
Matlab Lab Manual
0% (1)
Matlab Lab Manual
22 pages
Convolution Neural Networks U2
No ratings yet
Convolution Neural Networks U2
24 pages
ML Unit 1
No ratings yet
ML Unit 1
44 pages
Chandigarh Group of Colleges College of Engineering Landran, Mohali
No ratings yet
Chandigarh Group of Colleges College of Engineering Landran, Mohali
47 pages
Deep Learning With Tensorflow
No ratings yet
Deep Learning With Tensorflow
15 pages
Chapter - 4 - Association Rule Mining
No ratings yet
Chapter - 4 - Association Rule Mining
86 pages
02 ML Supervised Learning
No ratings yet
02 ML Supervised Learning
32 pages
Instructions For Physics Practical Exam
No ratings yet
Instructions For Physics Practical Exam
2 pages
Module 4
No ratings yet
Module 4
18 pages
Oop Assignment 1 Fa19-Bee-012
0% (1)
Oop Assignment 1 Fa19-Bee-012
11 pages
Deep Learning Exp
No ratings yet
Deep Learning Exp
25 pages
Object Detection
No ratings yet
Object Detection
4 pages
ML MCQ Unit 1
No ratings yet
ML MCQ Unit 1
8 pages
Lab#10 Ai
No ratings yet
Lab#10 Ai
3 pages
A-Simple-Neural-Network-From-Scratch - Jupyter Notebook
No ratings yet
A-Simple-Neural-Network-From-Scratch - Jupyter Notebook
9 pages
PDS Question Bank
No ratings yet
PDS Question Bank
1 page
Unit -3-NNDL- Notes
No ratings yet
Unit -3-NNDL- Notes
17 pages
Linear Regression - Numpy and Sklearn
No ratings yet
Linear Regression - Numpy and Sklearn
7 pages
KNN Algorithm
No ratings yet
KNN Algorithm
3 pages
CS2055 - Software Quality Assurance
No ratings yet
CS2055 - Software Quality Assurance
15 pages
Thyroid Disease Classification Using Machine Learning Project
No ratings yet
Thyroid Disease Classification Using Machine Learning Project
34 pages
Packages in Python
No ratings yet
Packages in Python
54 pages
Chapter 8 - Arrays - PPT Slides
No ratings yet
Chapter 8 - Arrays - PPT Slides
96 pages
Brochure of ATAL Sponsored Workshop On Robotics 3rd-7th Feb 2020
No ratings yet
Brochure of ATAL Sponsored Workshop On Robotics 3rd-7th Feb 2020
2 pages
AI Project Report
No ratings yet
AI Project Report
3 pages
Enabling Technologies and Federated Cloud
100% (1)
Enabling Technologies and Federated Cloud
38 pages
Unit I R Data Structures
No ratings yet
Unit I R Data Structures
30 pages
MODULE 5
No ratings yet
MODULE 5
31 pages
The Today and Future of WSN, AI, and IoT: A Compass and Torchbearer for the Technocrats
From Everand
The Today and Future of WSN, AI, and IoT: A Compass and Torchbearer for the Technocrats
Dr.Chandrakant
No ratings yet
Practice Set 5
No ratings yet
Practice Set 5
1 page
Full Stack Web Developer Learning Path - codedamn
No ratings yet
Full Stack Web Developer Learning Path - codedamn
7 pages
Using The Virtual Table Server
No ratings yet
Using The Virtual Table Server
34 pages
ARM Instruction Set Quick Reference Card
No ratings yet
ARM Instruction Set Quick Reference Card
6 pages
NetSol CV-Format (Sample) - 1 - 1
No ratings yet
NetSol CV-Format (Sample) - 1 - 1
3 pages
C++ Lecture Notes 3
No ratings yet
C++ Lecture Notes 3
18 pages
Ge8151 PSPP ND 2019 Rejinpaul
No ratings yet
Ge8151 PSPP ND 2019 Rejinpaul
2 pages
Practical File Veer
No ratings yet
Practical File Veer
35 pages
Cruz CoE702 Review Paper
No ratings yet
Cruz CoE702 Review Paper
1 page
Unit 5 - Compiler Design - WWW - Rgpvnotes.in
No ratings yet
Unit 5 - Compiler Design - WWW - Rgpvnotes.in
14 pages
ICT Training Needs Analysis: Visual Content 0 1 2 3
No ratings yet
ICT Training Needs Analysis: Visual Content 0 1 2 3
5 pages
MENU 1 - Insert 2 - Delete 3 - List 4 - Close Please Enter The Option
No ratings yet
MENU 1 - Insert 2 - Delete 3 - List 4 - Close Please Enter The Option
10 pages
Using The Amicus18 Compiler With MPLAB IDE
No ratings yet
Using The Amicus18 Compiler With MPLAB IDE
13 pages
rocm-docs-amd-com-rccl-en-develop
No ratings yet
rocm-docs-amd-com-rccl-en-develop
70 pages
Google Interview Questions
67% (6)
Google Interview Questions
2 pages
Development I in Microsoft Dynamics AX 2012
No ratings yet
Development I in Microsoft Dynamics AX 2012
4 pages
Gokaraju Rangaraju Institute of Engineering and Technology
No ratings yet
Gokaraju Rangaraju Institute of Engineering and Technology
17 pages
SPSE Slides - Module1
No ratings yet
SPSE Slides - Module1
67 pages
Chapter 8
No ratings yet
Chapter 8
17 pages
Howto Argparse
No ratings yet
Howto Argparse
12 pages
Distributed Mutual Exclusion
No ratings yet
Distributed Mutual Exclusion
41 pages
I.interfaces and Packages
No ratings yet
I.interfaces and Packages
19 pages
J Carousel
No ratings yet
J Carousel
37 pages
EXPT-0-Introduction To Keil
100% (1)
EXPT-0-Introduction To Keil
4 pages
SAP System Directories On UNIX
0% (1)
SAP System Directories On UNIX
8 pages
Linux Case Study
No ratings yet
Linux Case Study
11 pages
DCPUM
No ratings yet
DCPUM
370 pages

CNN RNN Assignment Set 4

Uploaded by

CNN RNN Assignment Set 4

Uploaded by

Prepare

You might also like