Giter Site home page Giter Site logo

maluuba / geneva Goto Github PK

View Code? Open in Web Editor NEW
85.0 85.0 22.0 2.12 MB

Code to train and evaluate the GeNeVA-GAN model for the GeNeVA task proposed in our ICCV 2019 paper "Tell, Draw, and Repeat: Generating and modifying images based on continual linguistic instruction"

Home Page: https://www.microsoft.com/en-us/research/project/generative-neural-visual-artist-geneva/

License: Other

Python 100.00%
chatpainter codraw dialogue gan generative-adversarial-networks generative-neural-visual-artist geneva geneva-gan i-clevr iccv iccv-2019 iccv2019 interactive-generative-art interactive-image-generation keep-drawing-it tell-draw-repeat

geneva's People

Contributors

kracwarlock avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

geneva's Issues

How to reproduce the best result?

I follow the instructions in README.md, but I can't get the best results as the paper said.
Here is what I haved done:

  1. Follow the instructions from https://github.com/Maluuba/GeNeVA_datasets/, and generate the data file.
  2. Download the pretrained object detector model from https://www.microsoft.com/en-us/research/project/generative-neural-visual-artist-geneva/
  3. Use the default parameters and run python geneva/inference/train.py @example_args/iclevr-d-subtract.args

Here is my results in iclevr dataset, I trained this model with 2 NVIDIA TITAN Xp about 4 days:

precision
recall
f1
relsim
It seems it's far from the performance in the origin paper.
image

How to change the specified text condition?

Excuse me,sir. I'm a student interested in this course. Sorry to bother you, I want to know how to use the specified text to generate the corresponding image? If I want to add a block diagram by pyqt, where should I add it? Wait for the good sound.

What is your pytorch version?

Hi I wonder which pytorch version you use? I ran into some weird warning issue. The one that bothers a lot is this warning message:
/opt/conda/conda-bld/pytorch_1573049304260/work/aten/src/ATen/native/cudnn/RNN.cpp:1268: UserWarning: RNN module weights are not part of single contiguous chunk of memory. This means they need to be compacted at every call, possibly greatly increasing memory usage. To compact weights again call flatten_parameters().

I think it is because you are applying DataParallel on the RNN, but I am not quite sure how to resolve it.
Thank you for taking a look at this issue.

pre-trained models

Do you have a pre-trained model? Where can I find it? Is that possible that we can just run one script to see your examples mentioned in your paper Fig. 5?

AttributeError: 'NoneType' object has no attribute 'seek'.

Hello! Thank you very much for your wonderful work!
When I reproduced the code, some problems occurred:
First of all, I can't seem to generate the corresponding h5 file on the CoDraw data set. Do you have a classified data connection?
Then, when I tested the model, some problems appeared:
During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "geneva/inference/test.py", line 64, in
tester = Tester(cfg, test_eval=True)
File "geneva/inference/test.py", line 31, in init
self.model.load(model_path, iteration)
File "/home/zhaoliuqing/GeNeVA/geneva/models/inference_models/recurrent_gan.py", line 111, in load
snapshot = _read_weights(pre_trained_path, iteration)
File "/home/zhaoliuqing/GeNeVA/geneva/models/inference_models/recurrent_gan.py", line 148, in _read_weights
snapshot = torch.load(pre_trained_path)
File "/home/zhaoliuqing/anaconda3/envs/geneva/lib/python3.6/site-packages/torch/serialization.py", line 358, in load
return _load(f, map_location, pickle_module)
File "/home/zhaoliuqing/anaconda3/envs/geneva/lib/python3.6/site-packages/torch/serialization.py", line 520, in _load
_check_seekable(f)
File "/home/zhaoliuqing/anaconda3/envs/geneva/lib/python3.6/site-packages/torch/serialization.py", line 179, in _check_seekable
raise_err_msg(["seek", "tell"], e)
File "/home/zhaoliuqing/anaconda3/envs/geneva/lib/python3.6/site-packages/torch/serialization.py", line 172, in raise_err_msg
raise type(e)(msg)
AttributeError: 'NoneType' object has no attribute 'seek'. You can only torch.load from a file that is seekable. Please pre-load the data into a buffer like io.BytesIO and try to load from it instead.
Have you ever encountered a similar problem? Thank you!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.