Comments (11)
I understand. I'll try to get the statistics soon.
from docee.
Thanks a lot!
from docee.
The number of gold arguments in PTPCG is the same as other baselines that use ChFinAnn.
You can download the original data from here and get the statistics.
from docee.
Hi there, does my response answer your questions? I'd like to close this issue if there's no further discussion.
from docee.
Hi @Spico197, after training the model on ChFinAnn, the test set evaluation gives TP+FN = 28,545 arguments, but when I count the arguments in the original test data, I get 29,345.
I traced the missing arguments and found that they are dropped during the truncation of sentences and documents. Can you confirm?
Thanks.
from docee.
Yes. By default, the maximum number of sentences in a document is 64 and the max sequence length is 128, so some documents are truncated. Doc2EDAG, GIT, and PTPCG all use the same setting. Comparisons may be unfair if you use other settings.
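For readers who want to check how many arguments such truncation removes, here is a minimal sketch rather than the repository's actual preprocessing. It assumes a hypothetical, simplified representation in which each event-table argument carries a list of mentions given as (sentence index, start, end), and it assumes an argument is lost only when every one of its mentions falls outside the first 64 sentences or past position 128 in its sentence. The names `count_arguments_after_truncation`, `max_sent_num`, and `max_sent_len` are illustrative, and the real pipeline may reserve positions for special tokens.

```python
from typing import List, Tuple

# Hypothetical mention position: (sentence_index, start, end), end exclusive.
Mention = Tuple[int, int, int]
# Hypothetical argument: (role name, all mentions of the argument text).
Argument = Tuple[str, List[Mention]]


def mention_survives(mention: Mention, max_sent_num: int, max_sent_len: int) -> bool:
    """A mention survives if its sentence is kept and it ends within the length limit."""
    sent_idx, _start, end = mention
    return sent_idx < max_sent_num and end <= max_sent_len


def count_arguments_after_truncation(
    arguments: List[Argument],
    max_sent_num: int = 64,   # default reported in the thread
    max_sent_len: int = 128,  # default reported in the thread
) -> Tuple[int, int]:
    """Return (kept, dropped) argument counts under the assumed truncation rule."""
    kept = 0
    for _role, mentions in arguments:
        # Assumption: one surviving mention is enough to keep the argument.
        if any(mention_survives(m, max_sent_num, max_sent_len) for m in mentions):
            kept += 1
    return kept, len(arguments) - kept
```

Summing the `dropped` values over all test documents gives a number that can be compared with the gap between the raw argument count and TP+FN.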
from docee.
I didn't check the exact numbers yet, but do you mean arguments instead of mentions or entities?
from docee.
Yes, I mean the arguments in the event tables.
from docee.
@Spico197 Hi! Would you be able to share the model predictions for ChFinAnn and DuEE-fin dev? I really appreciate your valuable time.
from docee.
Hi there, sorry for the late response. Things have been busy these days.
The attachment below contains:
- PTPCG test evaluation results on ChFinAnn at Epoch=57 (you can calculate the number of arguments from the TP, FP and FN counts in overall/overall) and intermediate prediction outputs.
- PTPCG dev evaluation results on DuEE-Fin at Epoch=99 and intermediate prediction outputs.
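
The argument counts mentioned above follow from the confusion counts: gold arguments seen by the evaluator are TP + FN, and predicted arguments are TP + FP. A small sketch of that arithmetic is below; the function names are illustrative and nothing here reads the attachment's actual file format.

```python
def gold_arguments(tp: int, fn: int) -> int:
    """Gold arguments that reached evaluation: true positives plus false negatives."""
    return tp + fn


def predicted_arguments(tp: int, fp: int) -> int:
    """Arguments the model predicted: true positives plus false positives."""
    return tp + fp


def truncation_gap(raw_gold: int, evaluated_gold: int) -> int:
    """Gold arguments present in the raw data but absent from evaluation."""
    return raw_gold - evaluated_gold
```

With the numbers reported earlier in this thread, `truncation_gap(29_345, 28_545)` gives 800 arguments that never reached the evaluator.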
from docee.
In case of any inconvenience for your analysis, I updated the PTPCG task dump trained on DuEE-Fin.
You can find it here: https://github.com/Spico197/DocEE/releases/tag/tasks-ptpcg-dueefin
from docee.