Light

fairy-of-9 / bert4coqa Goto Github PK

View Code? Open in Web Editor NEW

This project forked from adamluo1995/bert4coqa

0.0 1.0 0.0 59 KB

Bert for CoQA based on PyTorch.

Python 100.00%

bert4coqa's Introduction

Bert4CoQA

Introduction

This code is example demonstrating how to apply Bert on CoQA Challenge. It is implemented under PyTorch framework and pytorch-pretrained-BERT.

Code is basically combined from run_squad.py and SDNet.

I train model with config:

--bert_model bert-base-uncased
--num_train_epochs 3.0
--do_lower_case
--max_seq_length 450
--doc_stride 150
--max_query_length 75

on 2x RTX2080TI in 1.5 Hour and achieve 76.5 F1-score.

that can definitely be improved :)

Not tested on multi-machine training

Requirement

pytorch (tested on 1.1.0)
spacy (tested on 2.0.16)
tensorBoardX (Not necessary)
tqdm

How to run

make sure that:

Put train-coqa-v1.0.json and dev-coqa-v1.0.json on the same dict with run_coqa.py
The binary file, config file, and vocab file of bert_model in your bert_model dict name as pytorch_model.bin, config.json, vocab.txt
Enough memory on GPU according to this, you can tune --train_batch_size, --gradient_accumulation_steps, --max_seq_length and --fp16 for memeory saving.

and run

python run_coqa.py --bert_model your_bertmodel_dir --output_dir your_outputdir [optional]

for calculating F1-score, use evaluate-v1.0.py

python evaluate-v1.0.py --data-file <path_to_coqa-dev-v1.0.json> --pred-file <path_to_predictions.json>

Contact

If you have any questions, please new an issue or contact me, [email protected].

bert4coqa's People

Contributors

Watchers

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.