Giter Site home page Giter Site logo

kc's Introduction

This repository contains code, datasets, and links related to the Knowledge Computing (KC) group at Microsoft Research Asia (MSRA).

Our group is hiring both research interns and full-time employees! If you are interest, please take a look at:

News:

Recent Papers:

Related Projects:

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

kc's People

Contributors

arleneyuzhiwei avatar daisydan avatar dependabot[bot] avatar hitercs avatar iofu728 avatar microsoft-github-operations[bot] avatar microsoftopensource avatar mtt1998 avatar qianhuiwu avatar tellarin avatar yhshu avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

kc's Issues

Missing information on README file and trained models

Missing Information

It seems that in the ReTraCk README, on the "Redis dump files" and "Model checkpoints" sections its missing some informations.
I just want to know if that is a way to disponibilize those parts.

Trained Models

Complementing that models question, there is a trained model available for public use? If not, how is the correct way to train one?

404 - page not found

I am very interested in the paper Multi-Level Knowledge Distillation for Out-of-Distribution Detection in Text, but when I opened the code link provided in the paper, I found a 404 error. Could you please provide the source code, thanks.

Exemplary Logical Form Retrieval problem

Thanks for your contribution。
When I got to step two and run enumerate_candidates.py ,something made mistakes.
I have reviewed the code and data you provided and it does not seem to contain a file in.expr.json format. May I ask if this file is generated by additional operations?

In the instructions you said to generate an Exemplary Logical Form as per rng-kbqa. Does that mean I can ignore your way of generating and just do it his way?
image

I am anticipating your reply

File doesn't exist [Retrack"

Dear author:

Thank you for your work!
Am just wondering why the relevant files mentioned in the README.md doesn't exist.

python ./tests/debug/launch_schema_retriever.py

For examples on how to use ReTraCkRetriever in your code base, please check /tests/debug/launch_schema_retriever.py and

TRAIN.MD.

[TIARA] Get unexpected results

Hi
This article is very creative.
When I execute the following code,

python algorithm/grailqa_generation.py --prompt lf_schema

I get the following results:

>> root@train-grailqa-0:/data1/Projects/KC/papers/TIARA/src# python utils/statistics/grailqa_evaluate.py ../dataset/GrailQA/grailqa_v1.0_dev.json ../logs/grailqa_dev_2023_02_01_14_59_42_log.json

{'em': 0.6590270589974864, 'f1': 0.7318337303152712, 'em_iid': 0.770872567482737, 'f1_iid': 0.8097763582287697, 'em_comp': 0.5977542932628798, 'f1_comp': 0.6808620585344852, 'em_zero': 0.6356673960612691, 'f1_zero': 0.7189804767074774}

Could you please upload your dev result file? I can't sure whether my freebase is set correctly.

Thanks.

[ReTrack Issue] Unable to reproduce the evaluate results using the demo script

Hello! There seem to be two paths currently to hit the system. Use processed grail qa file with evaluate.py under the parser directory. There is a demo pipeline that can be setup using the demo section in the read me.

Current I'm getting different results for the same questions. I have set all the flags mentioned in

For the best possible results, please enable the complete checker (use_beam_check, use_virtual_forward, use_type_checking, and use_entity_anchor. in the demo overrides.

The Redis cache seems up and running. For a sample of 100 questions, nearly 2-5% loss is there in F1 score, EM .

Can you please help with this issue?
Please let me know if you need any further information

[TIARA] Unable to download TIARA_DATA.zip

Hello,

Thank you for sharing your great work!
I've been attempting to follow the instructions in the README.md to run TIARA, but I'm encountering difficulties downloading the TIARA_DATA.zip file.

On Windows, I've tried to download the file by clicking the provided hyperlink multiple times, but without success. Additionally, I attempted to download it on Linux using the following command:
wget https://kcpapers.blob.core.windows.net/tiara-emnlp2022/TIARA_DATA.zip

image

Unfortunately, I continue to face issues. Can you please provide some assistance or guidance on how to resolve this problem? Thank you in advance!

Training Steps

Hello,
Can you please update the README with instructions for training the model from scratch?

When will the code about TIARA be open sourced?

After reading your article, I feel very innovative, and there are still some details that I don't understand very well, and I want to understand it through code. When will the code be open sourced?

[Retrack] file download

When I click on the download link, the webpage displays: This XML file does not appear to have any style information associated with it. The document tree is shown below.
I try to use wget, but it also reports: ERROR 409: Public access is not permitted on this storage account.
May I ask if there is a problem with the download link?

TIARA data problems

Thanks for your work.
TIARA requires some pre-processed data. It can be downloaded from Azure Storage This download address may be not open to the public. Can you give another access to this link.
When I click the link, the result will be like this.
1690551455081

[TIARA] Entity Linker.

Hello, it is mentioned in the readme file to use the PURE project for training mention detection model.
Command: python run_entity.py
--do_train --do_eval --eval_test
--learning_rate=5e-6 --task_learning_rate=5e-6
--train_batch_size=32
--eval_batch_size=108
--context_window 0
--max_span_length 15
--task grailqa
--data_dir grailqa_data/json/
--model bert-base-uncased
--output_dir grailqa_models/checkpoint
--num_epoch 10
--seed 42

But the PURE project has no task "grailqa" as given in arguments.
Can you also mention the code changes required in PURE to make it run for grailqa.

[TIARA] Schema Retriever : Not able to reproduce results.

Thank you for the code.

I'm trying to retrain the schema retriever according to README's instructions.

  1. For class: I was able to reproduce the results (by changing batch size to 128).
  2. For relations: While training the eval loss is decreasing and eval accuracy also reaches around 95%, but prediction on the dev set results in 0 hits@k for all k. (I am not changing any hyperparameters in the code).

Am I missing something? Please help with the issue.

ReTraCk resource files

I'm having problems to download the ReTraCk resource files, it stops every time in 2%,
i already tried to use other computer and other internet provider but is always the same.

It's possible for you to upload those files in other plataform?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.