Comments (12)
from mlconvgec2018.
I downloaded two training datasets (nucle and lang8v2) and ran prepare_data.sh
and preprocess.sh
They ran without error.
from mlconvgec2018.
Can you share your train log?
from mlconvgec2018.
Can you share your train log?
I have one in the priginal post.
I"ll upload another after another training.
from mlconvgec2018.
Sorry, I missed the log file.
You seem to be using a newer version of Pytorch than what we used for this project. We used an old fork of Fairseq (https://github.com/shamilcm/fairseq-py) which required Pytorch 0.2.0 compiled from source.
If you want to use a later version of Fairseq (v 0.5), you can use the scripts in this fairseq0.5
branch of our repository (https://github.com/nusnlp/mlconvgec2018/tree/fairseq0.5). This has been tested to work with Pytorch 0.4.1 (no need for compilation from source, can be installed via conda)
from mlconvgec2018.
Sorry, I missed the log file.
You seem to be using a newer version of Pytorch than what we used for this project. We used an old fork of Fairseq (https://github.com/shamilcm/fairseq-py) which required Pytorch 0.2.0 compiled from source.
If you want to use a later version of Fairseq (v 0.5), you can use the scripts in this
fairseq0.5
branch of our repository (https://github.com/nusnlp/mlconvgec2018/tree/fairseq0.5). This has been tested to work with Pytorch 0.4.1 (no need for compilation from source, can be installed via conda)
Thank you for your help!
I have had trouble running the original branch.
(I had to test multiple pytorch and fairseq versions and modify some codes)
I'll test the new version and post the result.
from mlconvgec2018.
Hahaha I found the problem and it was a trivial mistake.
I should have run training/run_trained_model.sh
, but I ran run.sh
instead.
Sorry for bugging you for my mistake.
(anyway, the fairseq0.5
branch worked well)
from mlconvgec2018.
@theincluder @shamilcm @gurunath-p
i am having trouble getting the m2 score. i ran the
./run_trained_model.sh
Got the output.bpe.nbest.txt, output.bpe.txt and output.tok.txt.
But i could not get the m2 score.
Note: i did not train the reranker. I did it without it. Can you tell me what i am missing? Any help would be appreciated.
from mlconvgec2018.
If you have decoded the CoNLL-2014 test set, you need to get the reference M2 file from https://www.comp.nus.edu.sg/~nlp/conll14st.html. Download the annotated test data. The reference M2 file for the competition is the official-2014.combined.m2
file in the no-alt/
directory. Download the official M2 scorer from the same page. Run m2 scorer using ./m2scorer output.tok.txt /path/to/official-2014.combined.m2
from mlconvgec2018.
@shamilcm
thanks for the link. i ran the m2 scorer, but the problem was difference in output.tok.txt and conll14-test.m2.
So, i followed this another issue
#2
so i followed as u told in the issue by :
- Using interactive.py instead of generate.py with a --interactive.
- I tried to preprocess again with --testpref as the conll14-test.tok.src.Now the error is that i dont have conll14-test.tok.trgt target file.
I would be so thankful if you could help me here. Thanks in advance
from mlconvgec2018.
@shamilcm
I have another doubt regarding the accuracy of model.
I got the results one model.
Can you describe more about some other wiki corpora described in the paper to bolster the F0.5 score?
Could you also share how you created the ensemble with different initializations? I would like to know that too.
Thanks a lot in advance.
from mlconvgec2018.
from mlconvgec2018.
Related Issues (20)
- Using interactive instead of generate to evaluate HOT 1
- Python error: <stdin> is a directory, cannot continue HOT 2
- the size of training dataset? HOT 2
- How to use language model (94Bcclm.trie)? HOT 2
- TypeError: iter() returned non-iterator of type 'NBestList' HOT 9
- An error about Compile and install PyTorch(0.2.0) HOT 5
- Fairseq-py Installation Issue HOT 5
- About training data
- reranker error HOT 6
- pytorch version? HOT 3
- pre-trainined embeddings. HOT 1
- ImportError: cannot import name 'libbleu' HOT 4
- 'Levenshtein greater than source size' when training re-ranker HOT 1
- Output directory error while running run.sh in google colab HOT 1
- ImportError: cannot import name 'libbleu' from 'fairseq' HOT 3
- Requesting NUCLE dataset 3.2
- Error loading state_size in FConvModel : size mismatch in encoder-decoder weights
- General question: GEC data preprocessing
- Can this model be used for any language?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mlconvgec2018.