Comments (6)
Hi Sid,
That's my bad, I broke it in one of the previous PRs. I have created a new PR #17 and have tested that the text NER training and evaluation works fine. Kindly refer to the PR comments for more details.
We appreciate the updates! Thank you for your patience.
-Ankita
from slue-toolkit.
Hi @ankitapasad ,
Thanks I can verify now that the code runs without any issues. However there is something strange happening, which I am not sure of -
- I start with training the model and I get the final value evaluation for the model to be -
bash baselines/ner/nlp_scripts/ft-deberta.sh deberta-base combined
eval_overall_accuracy = 0.987
eval_overall_f1 = 0.8642
eval_overall_precision = 0.8379
eval_overall_recall = 0.8922
eval_runtime = 0:00:05.90
eval_samples = 1753
eval_samples_per_second = 296.799
eval_steps_per_second = 4.741
- However, when I run the evaluation script I seem to be getting a really low F1 score.
bash baselines/ner/nlp_scripts/eval-deberta.sh deberta-base dev combined
[micro-averaged F1] Precision: 0.00, recall: 0.01, fscore = 0.00
[micro-averaged F1] Precision: 0.02, recall: 0.49, fscore = 0.04
Is it possible that it's not loading the correct model or something?
Thanks
Sid
from slue-toolkit.
I think this might be label conversion problem and this line might cause this issue.
train_label="combined" in your case.
@ankitapasad can you check this?
from slue-toolkit.
Yes, that's right Suwon, the value of train_label
must be changed to the appropriate option. Additionally, the evaluation code wasn't handling this combination correctly (training and evaluating on combined label tag):
I added PR #18 to resolve this.
-Ankita
from slue-toolkit.
Thanks @ankitapasad ! It's working now.
[0] → bash baselines/ner/nlp_scripts/eval-deberta.sh deberta-base dev combined combined
[micro-averaged F1] Precision: 0.85, recall: 0.89, fscore = 0.87
[micro-averaged F1] Precision: 0.89, recall: 0.92, fscore = 0.90
Just a minor suggestion. Maybe you can add a label standard
or label
next to the two scores?
Thanks
Sid
from slue-toolkit.
@siddalmia Thanks for suggestion. Updated!
from slue-toolkit.
Related Issues (19)
- Problems to run ASR baselines HOT 1
- Voxceleb evaluation HOT 1
- Issues with baseline scripts HOT 2
- Plans to release ASR finetuned-models HOT 1
- About submission HOT 1
- Command to run E2E model of NER directly/only by speech model? HOT 1
- Save N-Best Hypotheses from Finetuned ASR for Sentiment Classification
- Inconsistent use of tabs and spaces HOT 1
- wav2letter installation error! HOT 2
- sentiment classes mismatch in fine-tune and dev tsvs HOT 1
- Questions about SLUE challenge 2023 HOT 2
- preprocessed manifest files not on "assumed" location HOT 2
- problem with prepare_voxpopuli.py HOT 5
- Key 'label_dir' not in 'AudioClassificationConfig' HOT 4
- Dataset Indexing Issue
- Reproduce e2e NER experiments HOT 2
- Text NER Evaluation Pipeline doesn't seem to work HOT 2
- Sentiment Analysis baseline HOT 7
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from slue-toolkit.