Giter Site home page Giter Site logo

hlamba28 / automatic-image-captioning Goto Github PK

View Code? Open in Web Editor NEW
115.0 6.0 79.0 258 KB

Generating Captions for images using Deep Learning

Home Page: https://towardsdatascience.com/image-captioning-with-keras-teaching-computers-to-describe-pictures-c88a46a311b8

Jupyter Notebook 100.00%
convolutional-neural-networks lstm-neural-networks deep-learning keras-tensorflow image-captioning

automatic-image-captioning's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

automatic-image-captioning's Issues

changing learning rate after 20 epochs error

code uses a direct method to change the learning rate of the model to 0.0001 which raises an error for reason unknown. A quick fix would be to use keras backend functionality to change the learning rate.
Instead of model.optimizer.lr = 0.0001 use K.set_value(model.optimizer.lr, 0.0001) after importing keras.backend as K

Train model

It takes so much time to train the model. Can someone please provide me 'model.h5' file(pre-trained model)

randomization/shuffling in data generator

Hi, I read your blog post, which I found really helpful for learning about the image captioning problem. But I was curious about the lack of randomization or shuffling of the data in the generator. Do you see this as a potential shortcoming, or is there a reason that it isn't necessary in this problem?

import error

pickle is not imported. It causes error while trying to use pickle.dump function. A simple import pickle would suffice.

data_generator error prevents training

There is an error in the data_generator function that, when trying to train the model, it will throw the following error:

could not broadcast input array from shape (168,2048) into shape (168)

The project was run from the cloned repo, dataset was copied and all paths are checked. All cell outputs until the training cell are equal to the original notebook.

Undefined Variable

Hello,
I went ahead and tried to make your tutorial work. It has a variable called z that idk what it does and it's not defined before the line it's used, tried to assign integers but I get errors during greedy search (naturally).
Can you help me out?

z+=1
pic = list(encoding_test.keys())[z]
image = encoding_test[pic].reshape((1,2048))
x=plt.imread(images+pic)
plt.imshow(x)
plt.show()
print("Greedy:",greedySearch(image))

Loss value for the model

Can you please tell me what was the best loss value (on validation set) you were getting and after which epoch?

Thanks in advance.

Less accuracy

I have used same code and same dataset but getting very much less accuracy. Can you please help with issue.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.