Giter Site home page Giter Site logo

zlinao / variational-transformer Goto Github PK

View Code? Open in Web Editor NEW
82.0 82.0 11.0 69.74 MB

Variational Transformers for Diverse Response Generation

License: MIT License

Python 98.34% Perl 1.66%
dialog dialogue-systems machine-learning transformer variational-transformer

variational-transformer's People

Contributors

evelynfan avatar zlinao avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

variational-transformer's Issues

print()s not printing

Hey,

For some reason I can't seem to run print() functions through the code. I've isolated the problem to this being the case when save_path is included as an argument. For example:

python3 main.py \
--model trs --emb_dim 300 --hidden_dim 300 \
--hop 4 --heads 4 --cuda --batch_size 128 \
--lr 0.001 --pretrain_emb --kl_ceiling 0.48 \
--aux_ceiling 1 --full_kl_step 20000 \
--save_path save/trs_new_bow_batch/ > save/trs_new_bow_batch/out.txt \

Won't print() anything in the code, whereas:

python3 main.py \
--model trs --emb_dim 300 --hidden_dim 300 \
--hop 4 --heads 4 --cuda --batch_size 128 \
--lr 0.001 --pretrain_emb --kl_ceiling 0.48 \
--aux_ceiling 1 --full_kl_step 20000 \
#--save_path save/trs_new_bow_batch/ > save/trs_new_bow_batch/out.txt \

Will run print() correctly.

Has anyone else run into this issue? Is there any stdout/buffer flushing that could be playing this up? I'm on Ubuntu running PyTorch 1.7.1

Is PRETRAIN necessary?

Hi,
I notice that SVT is trained after loading the parameters of a pretrained model (including encoder & decoder).
I am curious about if pretraining is necessary. Had you tried train SVT from scratch?
What's the difference between these two training schemes?


如果您能看懂中文,希望可以加微信高效沟通,我的微信号 He_2262,感激不尽!

maybe i find a bug

when adding the data in data/persona/*_self_original.txt,the context is not fit to the responce.

There are many utterences on each line. This code connects the first few dialogues on line n with the first few dialogues on line n + 1, but the former is not the context of the latter, ignoring so many information(or context) on line n.

How to test with a specific epoch

Thank you for releasing your code!
In utils/config.py, there is a parameter "--test", but it cannot be used in main.py to calculate the metrics in Line 36&37.
How to test the model with a specific given epoch?
Thx

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.