
Comments (7)

eecrazy commented on July 30, 2024

I think the predefined parameters in target_params.pkl are just a random initialization, saved so that every run starts from the same initial parameters.
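To illustrate the idea of saving a random initialization once and reloading it on every run, here is a minimal sketch. It is not the repo's code: the file name toy_params.pkl, the helper random_params, and the flat list of floats are all assumptions made for the example.

```python
import os
import pickle
import random
import tempfile


def random_params(n):
    """Draw n random values (a hypothetical stand-in for real model weights)."""
    return [random.gauss(0.0, 1.0) for _ in range(n)]


# Hypothetical file name, written to a temporary directory for the example.
path = os.path.join(tempfile.mkdtemp(), "toy_params.pkl")

# Done once: draw random parameters and save them to disk.
with open(path, "wb") as f:
    pickle.dump(random_params(8), f)

# Done at the start of every run: load the saved values instead of re-drawing.
with open(path, "rb") as f:
    run_a = pickle.load(f)
with open(path, "rb") as f:
    run_b = pickle.load(f)

# Both "runs" start from exactly the same parameters.
print(run_a == run_b)
```

Re-drawing with random_params on each run would instead give different starting values every time.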

If you want to use your own parameters, just import pg_bleu/target_lstm.py, which initializes its parameters randomly on every run. That way, if you have different global settings such as EMB_DIM, just pass different arguments to the TARGET_LSTM() class.

====UPDATE:
There is a mistake in the suggestions above. TARGET_LSTM is initialized from the random parameters stored in target_params.pkl, but it is treated as the oracle and its parameters are never updated. The sequences generated by TARGET_LSTM are treated as real data, and the generator (model.lstm class) is trained to learn the parameters of TARGET_LSTM.
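The oracle setup described here can be sketched roughly as follows. This is a toy illustration, not the repo's implementation: a small random transition table stands in for TARGET_LSTM, and the names make_oracle and sample_sequence, the seed, and the sizes are all assumptions.

```python
import copy
import random


def make_oracle(vocab_size, seed):
    """Build a fixed random transition table (toy stand-in for TARGET_LSTM)."""
    rng = random.Random(seed)  # the fixed seed plays the role of the saved parameters
    table = []
    for _ in range(vocab_size):
        weights = [rng.random() for _ in range(vocab_size)]
        total = sum(weights)
        table.append([w / total for w in weights])
    return table


def sample_sequence(table, length, rng, start_token=0):
    """Sample one token sequence from the oracle without modifying the table."""
    tokens = []
    prev = start_token
    for _ in range(length):
        prev = rng.choices(range(len(table)), weights=table[prev])[0]
        tokens.append(prev)
    return tokens


oracle = make_oracle(vocab_size=10, seed=88)
snapshot = copy.deepcopy(oracle)

# Sequences sampled from the oracle are what the generator treats as real data.
sampler = random.Random(0)
real_data = [sample_sequence(oracle, length=20, rng=sampler) for _ in range(5)]

# The oracle is only sampled from, never trained, so its parameters are unchanged.
print(oracle == snapshot)
```

A generator would then be trained on real_data, and its quality could be judged by how likely the frozen oracle finds the generator's own samples.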

from seqgan.

GenTxt commented on July 30, 2024

Hello:

I've imported "target_lstm.py" from the pg_bleu folder (after renaming the original?). Should a new .pkl be created that refers to the new text file save/real_data.txt?

The error below is generated.

Traceback (most recent call last):
  File "sequence_gan.py", line 257, in <module>
    main()
  File "sequence_gan.py", line 130, in main
    target_lstm = TARGET_LSTM(vocab_size, 64, 32, 32, 20, 0, target_params)
TypeError: __init__() takes exactly 7 arguments (8 given)

Your paper mentions the generation of poetry. Could you please explain the steps required to repeat this process?

Cheers


eecrazy commented on July 30, 2024

@GenTxt I suggest you read the code yourself; this is an easy question.

After you import "target_lstm.py" from the pg_bleu folder, there is no need to pass target_params to the class.
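The argument mismatch behind the TypeError can be reproduced with a small stand-in class. This is only a sketch: TargetLSTMSketch and its parameter names are hypothetical, chosen to match the count in the error (six arguments plus self), and the values of vocab_size and target_params are placeholders. The exact wording of the message also differs under Python 3.

```python
class TargetLSTMSketch:
    """Hypothetical stand-in whose constructor takes six arguments plus self."""

    def __init__(self, num_emb, batch_size, emb_dim, hidden_dim,
                 sequence_length, start_token):
        self.num_emb = num_emb
        self.batch_size = batch_size
        self.emb_dim = emb_dim
        self.hidden_dim = hidden_dim
        self.sequence_length = sequence_length
        self.start_token = start_token


vocab_size = 5000        # placeholder value for the example
target_params = [0.0]    # placeholder for the contents of the .pkl file

# Passing target_params as a seventh argument fails, as in the traceback.
try:
    TargetLSTMSketch(vocab_size, 64, 32, 32, 20, 0, target_params)
    raised = False
except TypeError as err:
    raised = True
    print("TypeError:", err)

# Dropping target_params matches the constructor and succeeds.
target_lstm = TargetLSTMSketch(vocab_size, 64, 32, 32, 20, 0)
```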


xiaopyyy commented on July 30, 2024

Thanks for the updated information.
Could you explain how to get the parameters in TARGET_LSTM for a new training dataset?


GenTxt commented on July 30, 2024

Hello eecrazy:

Hmm ... maybe not such an easy question after all? Your own reply from 21 days ago contains an UPDATE which follows your mistaken suggestion.

What I was hoping to see posted was simple, single-line terminal instructions for training and generation, which are common to most deep learning repos posted on GitHub.

The classic is, as you know, karpathy's which goes into wonderful detail concerning the setting up of a training environment that would be new to many programmers (torch/lua). His detailed instructions worked perfectly and opened the door to this platform. His lua code, and the improved versions that followed, are EASY to edit and test. The same can be said for most tensorflow/python versions.

I wasn't looking for the same level of detail but a few one liners would go a long way to help test the output quality of this code compared to the above lstm versions.


eecrazy commented on July 30, 2024

@xiaopyyy Actually, there is no TARGET_LSTM for a new training dataset.
Please refer to this issue: #3


eecrazy commented on July 30, 2024

@GenTxt Unlike karpathy's char-rnn, this repo requires you to fully understand the code and edit it yourself to fit your own dataset, because it was not written cleanly enough to serve as a general-purpose tool.

