sapienzanlp / mcl-wic
Semeval-2021 Multilingual and Cross-lingual Word-in-Context Task
Hi!
It seems to me that some of the target token indices are wrong.
I found the problem in the following instances:
ru-ru: 0, 4
en-en: 2, 3
en-ru: 13
The other language pairs may have the same problem.
Examples:
trial.en-en.2, resolution, NOUN, en, 17, en, 8, 'Ability to detect vertical movements , variations and dislocations of ground areas to a resolution of a few centimetres ;',
'Although Iraq was fully implementing all United Nations resolutions , it continued to suffer the devastating effects of the sanctions .'
trial.ru-ru.4, год, NOUN, ru, 3, ru, 12, 'В мае 1987 года на вершине горы Окубо в 4,1 км к западу от пусковой площадки был построен новый телеметрический центр.',
'Так , более 100 000 пуэрториканцев приняли участие в демонстрации 14 июля прошлого года в знак протеста против присоединения .'
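A minimal sketch for checking the examples above. It assumes the indices are 0-based positions over whitespace-separated tokens, which is an assumption and not a documented property of the data. The helper names `token_at` and `find_lemma` are hypothetical, and the prefix match is only a crude heuristic for inflected forms.

```python
def token_at(sentence: str, index: int) -> str:
    """Return the whitespace-separated token at a 0-based index (assumed convention)."""
    return sentence.split()[index]


def find_lemma(sentence: str, lemma: str) -> list[int]:
    """Return 0-based positions of tokens that start with the lemma.

    Prefix matching is a rough heuristic: it catches 'resolutions' for
    'resolution' and 'года' for 'год', but not irregular forms.
    """
    lemma = lemma.lower()
    return [i for i, tok in enumerate(sentence.split())
            if tok.lower().startswith(lemma)]


en_first = ("Ability to detect vertical movements , variations and dislocations "
            "of ground areas to a resolution of a few centimetres ;")
ru_second = ("Так , более 100 000 пуэрториканцев приняли участие в демонстрации "
             "14 июля прошлого года в знак протеста против присоединения .")

# Token found at the annotated index vs. where the lemma actually occurs
print(token_at(en_first, 17), find_lemma(en_first, "resolution"))
print(token_at(ru_second, 12), find_lemma(ru_second, "год"))
```

Under that assumption, the annotated index in the first sentence of trial.en-en.2 and in the second sentence of trial.ru-ru.4 does not point at the target word, while the other index in each pair does.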
We have a few queries about the task and would like to know what the main channel of communication is, as we are unable to find the Google group mentioned on the official SemEval page. We would also like to know when the training dataset will be released.
Hi,
It seems to me that some of the labels in the Chinese and Arabic multilingual trial data are wrong, because they include numbers in addition to the label, e.g.
trial.zh-zh.0 3F
List of all ids where this is the case:
Hi!
There are 4 instances in the training data where extra strings follow the POS tags.
These are the instances at indices 2000, 2001, 3396 and 3397 in the list of training instances.
Can one safely remove these trailing suffixes from the POS tags, or are these instances in a pending state where updates could still happen in a later release?