Skip to content

Latest commit

 

History

History
15 lines (12 loc) · 793 Bytes

README.md

File metadata and controls

15 lines (12 loc) · 793 Bytes

Image-Captioning-Model

Unique Features of the Model are:


-> The model is capable on running on low-end GPUs such as the GTX 1050 with 4GB GPU Memory and 8GB RAM
-> The learning of the model is Steep
-> The Model was based on Images and captions scrapped from the E-Commerce Websites

Description of the Convolution Neural Network

image

Description of the Recurrent Neural Network model

image
The Picture above summarises the RNN model that was used.

The Features of each image was extracted through the VGG16 model with the last softmax layer popped out and connected to a RNN layer.