Image Caption Generator

This document summarizes an image caption generator project built by students. It uses a CNN-RNN model with an Xception CNN to extract image features and an LSTM to generate captions. The model is trained on datasets like Flickr8k and MSCOCO. Requirements include Python, Keras, and NLP libraries. Applications include image search tools, self-driving cars, Google Photos, and medical imaging analysis.

Uploaded by

Samrat Singh

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

192 views

Image Caption Generator

Uploaded by

Samrat Singh

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 13

Image Caption Generator

Department of Computer Science and

Engineering
Rajkiya Engineering College Kannauj

By:-
Somya Yadav (1783910053) Under Supervision of:-
Nitesh Singh (1783910038) Naveen Tiwari
Kaustubh Rajput (1783910027)
Vedbrat Dwivedi (1783910057)
Overview
 About
 CNN(Convolutional Neural Network)
 LSTM(Long Short-Term Memory)
 Model
 Data Set
 Requirements
 Application
What is Image Caption
Generator?
Image caption generator is a task that involves computer vision
and natural language processing concepts to recognize the
context of and image and describe them in a natural language
like English.
Example:-

There is a very colorful bus coming

on the street.

A Dog is running on a beach.

About The Project:-
The Objective of our project is to learn the concept of a CNN
and LSTM model and build a working model of Image caption
generator.

In this project , we will be implementing the caption generator using CNN and
LSTM .The image feature will be extracted from Xception which is CNN model
trained on the imagenet dataset and then we feed the features into the LSTM model
which will be responsible for generating the image caption.
What is CNN?
Convolutional Neural Network are specialized deep neural
networks which can process the data that has input shape like a
2D matrix . Images are easily represented as a 2D matrix and
CNN is very useful in working with images.
CNN is basically used for image classification and identifying if an image
is a bird , a plane , etc.

It scans image from left to right and top to bottom to pull out important
features from the image and combines the features to classify images .
It can handle the images that have been translated , rotated, scaled and
changes in perspective.
What is LSTM?
LSTM stands for Long short term memory , they are a type of RNN
which is well suited for sequence prediction problems . Based on
the previous text , we can predict what the next word will be. It has
proven itself effective from the traditional RNN by overcoming the
limitations of RNN which had short term memory . LSTM can carry
out relevant information throughout the processing of inputs and
with a forget gate, it discards non relevant information.
Image Caption Generator
Model:-
So, to make our image caption generator model, we will be
merging these architectures. It is also called a CNN-RNN
model.
• CNN is used for extracting features from the image . We will use the
pre-trained model Xception.

• LSTM will use the information from CNN to help generate a description
of the image.
Model- Image Caption
Generator :-
Data Sets
Flickr8k
8000 images, each annotated with 5 sentences via AMT
1000 for validation, testing
Flickr 30k
30k images
1000 validation, 1000 testing
MSCOCO
123,000 images
5000 for validation, testing
Requirements:-
1. Deep Learning.
2. Python.
3. Jupyter notebook.
4. Keras library.
5. Numpy.
6. Natural Language Processing.
Application:-
1. Image Searching Tool.
2. Self driving car .
3. Google Photos.
4. Skin Vision.
Thank You

2 DNN-CNN-RNN
100% (1)
2 DNN-CNN-RNN
87 pages
Image Caption Generator Using Deep Learning: Guided by Dr. Ch. Bindu Madhuri, M Tech, PH.D
No ratings yet
Image Caption Generator Using Deep Learning: Guided by Dr. Ch. Bindu Madhuri, M Tech, PH.D
9 pages
Image Caption Generator
100% (1)
Image Caption Generator
20 pages
Chapter 9
No ratings yet
Chapter 9
73 pages
Synopsis P
100% (1)
Synopsis P
6 pages
Image Caption Generator
No ratings yet
Image Caption Generator
69 pages
Chapter 9
No ratings yet
Chapter 9
73 pages
A Novel Adoption of LSTM in Customer Touchpoint Prediction Problems Presentation 1
No ratings yet
A Novel Adoption of LSTM in Customer Touchpoint Prediction Problems Presentation 1
73 pages
Image Captioning Using CNN & LSTM: Digital Signal Processing Laboratory (EEE - 316)
No ratings yet
Image Captioning Using CNN & LSTM: Digital Signal Processing Laboratory (EEE - 316)
24 pages
Practical 3 ANN
No ratings yet
Practical 3 ANN
3 pages
Approaches To AI
0% (1)
Approaches To AI
7 pages
320 Cohort 9 Report Final
No ratings yet
320 Cohort 9 Report Final
46 pages
Aktu ECE 4th Yr Syllabus
No ratings yet
Aktu ECE 4th Yr Syllabus
17 pages
Unit 3 Full Notes
No ratings yet
Unit 3 Full Notes
30 pages
DL Unit-4
No ratings yet
DL Unit-4
26 pages
Unit Iv Web Retrieval and Web Crawling 9
No ratings yet
Unit Iv Web Retrieval and Web Crawling 9
1 page
Cognitive Computing (Course Code: 18CS3272) : CO1 - Session4 Session Topic: The Elements of A Cognitive System
No ratings yet
Cognitive Computing (Course Code: 18CS3272) : CO1 - Session4 Session Topic: The Elements of A Cognitive System
9 pages
Image Enhancement
No ratings yet
Image Enhancement
144 pages
AI Important Questions
No ratings yet
AI Important Questions
196 pages
Notes On COMPUTER VISION
No ratings yet
Notes On COMPUTER VISION
10 pages
Deep Learning Handout
100% (1)
Deep Learning Handout
6 pages
RNN Neural Network
No ratings yet
RNN Neural Network
23 pages
Age and Gender Detection Using Deep Learning: HYDERABAD - 501 510
No ratings yet
Age and Gender Detection Using Deep Learning: HYDERABAD - 501 510
11 pages
CS485 Ch5 Transformers
No ratings yet
CS485 Ch5 Transformers
50 pages
CSE Dept. PPT 176 173
No ratings yet
CSE Dept. PPT 176 173
17 pages
18AI61
No ratings yet
18AI61
3 pages
Twitter Sentiment Analysis Using Python
No ratings yet
Twitter Sentiment Analysis Using Python
21 pages
Well Posed Learning Problems and Applications of ML
No ratings yet
Well Posed Learning Problems and Applications of ML
17 pages
CO1 CC PPT Session 6
No ratings yet
CO1 CC PPT Session 6
22 pages
MRI Brain Image Classification Using Various Deep Learning
No ratings yet
MRI Brain Image Classification Using Various Deep Learning
18 pages
CO1-CC-PPT Session-2
100% (1)
CO1-CC-PPT Session-2
14 pages
Machine Learning/ Artificial Intelligence (MLAI) Internship
No ratings yet
Machine Learning/ Artificial Intelligence (MLAI) Internship
4 pages
Web Security Unit 4
No ratings yet
Web Security Unit 4
14 pages
Word2Vec Tutorial - The Skip-Gram Model Chris McCormick
No ratings yet
Word2Vec Tutorial - The Skip-Gram Model Chris McCormick
18 pages
NLP Synopsis
No ratings yet
NLP Synopsis
9 pages
AIML 4th and 5th Module Notes
No ratings yet
AIML 4th and 5th Module Notes
77 pages
DLunit 2
No ratings yet
DLunit 2
8 pages
2marks For Pondicherry University
No ratings yet
2marks For Pondicherry University
45 pages
CNN 1
No ratings yet
CNN 1
23 pages
AI QuestionBank PartA&PartB Rejinpaul
No ratings yet
AI QuestionBank PartA&PartB Rejinpaul
111 pages
Unit -3-NNDL- Notes
No ratings yet
Unit -3-NNDL- Notes
17 pages
Unit 3 CC
No ratings yet
Unit 3 CC
8 pages
AI - (Deep Learning/NLP) : 5 Days
No ratings yet
AI - (Deep Learning/NLP) : 5 Days
4 pages
CS 3 - Problem Solving Agent
No ratings yet
CS 3 - Problem Solving Agent
80 pages
Cie QP 2 - 21ai71
No ratings yet
Cie QP 2 - 21ai71
2 pages
Deep Learning in Healthcare
No ratings yet
Deep Learning in Healthcare
23 pages
AIML Notes
No ratings yet
AIML Notes
187 pages
(New) (New) ML KNN Introduction Handwritten Notes
No ratings yet
(New) (New) ML KNN Introduction Handwritten Notes
6 pages
Introduction To Ai
No ratings yet
Introduction To Ai
13 pages
Data Science
No ratings yet
Data Science
34 pages
Text To Image Generator
No ratings yet
Text To Image Generator
12 pages
"Introduction To Computer Vision": Submitted by
No ratings yet
"Introduction To Computer Vision": Submitted by
45 pages
Python-IEEE-Project-Titles-2024-2025-Machine-Learning-Projects-Artificial-Intelligence-Projects-Deep-Learning-Projects
No ratings yet
Python-IEEE-Project-Titles-2024-2025-Machine-Learning-Projects-Artificial-Intelligence-Projects-Deep-Learning-Projects
10 pages
Traffic Prediction For Intelligent Transportation Systems Using Machine Learning
No ratings yet
Traffic Prediction For Intelligent Transportation Systems Using Machine Learning
28 pages
Generative AI
No ratings yet
Generative AI
2 pages
Automatic Image Caption Generation System
No ratings yet
Automatic Image Caption Generation System
4 pages
GenerativeAdversialNetwork
No ratings yet
GenerativeAdversialNetwork
21 pages
2.building Blocks of Neural Networks
100% (1)
2.building Blocks of Neural Networks
2 pages
ML MCQ Unit 1
No ratings yet
ML MCQ Unit 1
8 pages
Synopsis Main
No ratings yet
Synopsis Main
11 pages
EDU406 Quiz 3 File by Tanveer Online Academy
No ratings yet
EDU406 Quiz 3 File by Tanveer Online Academy
10 pages
Games_based_learning_in_mathematics_education_A_sy
No ratings yet
Games_based_learning_in_mathematics_education_A_sy
10 pages
Project Proposal
No ratings yet
Project Proposal
9 pages
Avid Reflection
No ratings yet
Avid Reflection
4 pages
Ramos Maria Jasmine C-Esp-6 Q1 W1
No ratings yet
Ramos Maria Jasmine C-Esp-6 Q1 W1
2 pages
Feb 23 - Values Ed - Joy g12
No ratings yet
Feb 23 - Values Ed - Joy g12
2 pages
Narrative Report in Elln
100% (5)
Narrative Report in Elln
6 pages
CREAM Strategy
No ratings yet
CREAM Strategy
8 pages
My Portfolio
100% (1)
My Portfolio
12 pages
Daily Lesson Plan Grade 7 Math Sample
100% (2)
Daily Lesson Plan Grade 7 Math Sample
2 pages
February 2010 - Issue 6 - Vol
No ratings yet
February 2010 - Issue 6 - Vol
16 pages
Faculty of Civil Engineering and Built Environment Universiti Tun Hussein Onn Malaysia
No ratings yet
Faculty of Civil Engineering and Built Environment Universiti Tun Hussein Onn Malaysia
3 pages
Indonesian Journal of English Language Teaching and Applied Linguistics
No ratings yet
Indonesian Journal of English Language Teaching and Applied Linguistics
21 pages
Training Design Phil Iri Training
83% (6)
Training Design Phil Iri Training
4 pages
Rancangan Pelajaran Harian: English Daily Lesson Plan
No ratings yet
Rancangan Pelajaran Harian: English Daily Lesson Plan
3 pages
IXL Real-Time Diagnostic: Have Your Students Sign in To IXL at and Walk Them Through The Following Steps!
No ratings yet
IXL Real-Time Diagnostic: Have Your Students Sign in To IXL at and Walk Them Through The Following Steps!
8 pages
Ninth Grade Art of Persuasion Unit Plan
No ratings yet
Ninth Grade Art of Persuasion Unit Plan
38 pages
A Development of Brake Shoe and Brake Pads Trainer Mockup For Sedan Cars System: Evaluation of Learners Psychomotor Skills Remarks and Performance Rating
No ratings yet
A Development of Brake Shoe and Brake Pads Trainer Mockup For Sedan Cars System: Evaluation of Learners Psychomotor Skills Remarks and Performance Rating
13 pages
(eBook PDF) Teaching Students with Special Needs in General Education Classrooms 9th Edition all chapter instant download
100% (1)
(eBook PDF) Teaching Students with Special Needs in General Education Classrooms 9th Edition all chapter instant download
53 pages
What Physical and Health Impairment
No ratings yet
What Physical and Health Impairment
1 page
Exam Schedule UGRC150 Main - 2024
No ratings yet
Exam Schedule UGRC150 Main - 2024
4 pages
Minutes of The Meeting (Discussing Feedback)
100% (3)
Minutes of The Meeting (Discussing Feedback)
2 pages
Department of Engineering Fee Structure: Btech (Batch 2019-23) Applicable For Btech-Computer Science & Engineering (Cse)
No ratings yet
Department of Engineering Fee Structure: Btech (Batch 2019-23) Applicable For Btech-Computer Science & Engineering (Cse)
1 page
Gifted Procedures Manual 2018-2019
No ratings yet
Gifted Procedures Manual 2018-2019
35 pages
Lev Vygotsky Sociocultural Theory
100% (1)
Lev Vygotsky Sociocultural Theory
7 pages
Inset Day 3 - Narrative Reflection
No ratings yet
Inset Day 3 - Narrative Reflection
2 pages
Storytelling History
No ratings yet
Storytelling History
12 pages
Introducation To Applied Behavior Analysis
No ratings yet
Introducation To Applied Behavior Analysis
8 pages
Curriculum Map Health&Pe2quarter2
No ratings yet
Curriculum Map Health&Pe2quarter2
2 pages
Reciprocal Teaching Lesson Plan
No ratings yet
Reciprocal Teaching Lesson Plan
4 pages