Comments (7)
I think the predefined parameters in target_params.pkl is just random initialization, to ensure that every time use the same initial parameters.
if you want to use your own parameters, just import pg_bleu/target_lstm.py, which is written by random initialization every time. In this way, if you have different global parameters such as EMB_DIM, just give different parameters to TARGET_LSTM() class.
====UPDATE:
There is mistake in above suggestions. Actually, TARGET_LSTM is random initialized from target_params.pkl, but it is regarded as the oracle and its parameters will not be updated. The sequence generated by TARGET_LSTM is thought real data. And the generator (model.lstm class)
is trained to learn the params in TARGET_LSTM.
from seqgan.
Hello:
I've imported "target_lstm.py" from pg_bleu folder (after renaming original?) Should a new .pkl be created that refers to the new text file save/real_data.txt ?
The error below is generated.
Traceback (most recent call last):
File "sequence_gan.py", line 257, in
main()
File "sequence_gan.py", line 130, in main
target_lstm = TARGET_LSTM(vocab_size, 64, 32, 32, 20, 0, target_params)
TypeError: init() takes exactly 7 arguments (8 given)
Your paper mentions the generation of poetry. Could you please explain the steps required to repeat this process?
Cheers
from seqgan.
@GenTxt I suggest you read the code by yourself, this is a easy question.
After you imported "target_lstm.py" from pg_bleu folder, there is no need to give the class target_params.
from seqgan.
Thanks for the updated information.
Could you explain how to get the parameters in TARGET_LSTM for new training dataset?
from seqgan.
Hello eecrazy:
Hmm ... maybe not such an easy question after all? Your own reply from 21 days ago contains an UPDATE which follows your mistaken suggestion.
What I was hoping to see posted was simple, single line terminal instructions for training and generation which are common to most deep learning repos posted on github.
The classic is, as you know, karpathy's which goes into wonderful detail concerning the setting up of a training environment that would be new to many programmers (torch/lua). His detailed instructions worked perfectly and opened the door to this platform. His lua code, and the improved versions that followed, are EASY to edit and test. The same can be said for most tensorflow/python versions.
I wasn't looking for the same level of detail but a few one liners would go a long way to help test the output quality of this code compared to the above lstm versions.
from seqgan.
@xiaopppy Actually, there is no TARGET_LSTM for new training dataset.
please refer to this issue: #3
from seqgan.
@GenTxt In this code repos, not like karpathy's char-rnn, you must totally understand the code and edit it by yourself to fit to your own dataset. Because it is not written so clean to be a tool-code-base.
from seqgan.
Related Issues (20)
- About the sequences to process
- 模型问题
- the input data?
- Training on custom dataset HOT 1
- 关于采样的问题
- What is the difference between SeqGAN and LM for text generation? HOT 1
- Nothing. Ignore it.
- About dataset. HOT 1
- About oracle model
- If the positive sample and negative sample of each training are corresponding, will it affect the training result
- gradient decent implementation
- About generator in adversarial training HOT 2
- How should I understand the RL loss function
- data format
- About the accuracy of discriminator during training?
- NLG
- dataset HOT 1
- How can i use my own training data ? HOT 2
- How to resume training in Colab?
- Questions about the recurring results of the code HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from seqgan.