neuro-captioner

This project takes an image and generates a suitable caption. It uses one of the two CNN to extract features of the image then feeds it into an LSTM. The LSTM then generates a sentence, word by word. This project is built on Python 3.7.4 using TensorFlow 1.14. Inception_v3 and VGG16 are the two pre-trianed CNN used in this project. It uses the Flickr30k dataset to train and test. Each image is resized to 299 x 299 pixels for Inception_v3 and 224 x 224 pixels for VGG16.

Here is an example:

Caption: a street corner with a light in front of it

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
11365.png		11365.png
README.md		README.md
feature_extractor.py		feature_extractor.py
image_loader.py		image_loader.py
index_to_words.py		index_to_words.py
neuro_captioner_LSTM.py		neuro_captioner_LSTM.py
not_found.txt		not_found.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

11365.png

11365.png

README.md

README.md

feature_extractor.py

feature_extractor.py

image_loader.py

image_loader.py

index_to_words.py

index_to_words.py

neuro_captioner_LSTM.py

neuro_captioner_LSTM.py

not_found.txt

not_found.txt

Repository files navigation

neuro-captioner

About

Releases

Packages

Languages

hammadab/neuro-captioner

Folders and files

Latest commit

History

Repository files navigation

neuro-captioner

About

Topics

Resources

Stars

Watchers

Forks

Languages