Giter Site home page Giter Site logo

definition_modeling_project's Introduction

Definition_Modeling_Project

Download Dataset

wget http://www.tkl.iis.u-tokyo.ac.jp/~ishiwatari/naacl_data.zip
unzip naacl_data.zip

Requirements

  • pytorch-lightning (0.9.0)
  • transformers (3.1.0)
  • pytorch (1.6.0)

Run Experiments

The dataset can be specified in script (wordent, oxford, slang, wiki).

./finetune_t5_experiment.sh

definition_modeling_project's People

Contributors

amanotaiga avatar

Stargazers

Jacklanda avatar Ziang Wu avatar Hang Jiang avatar Mario avatar Dawei Li avatar

Watchers

James Cloos avatar  avatar

definition_modeling_project's Issues

Special character in finetune_t5_experiment.sh

In finetune_t5_experiment.sh:

export CUDA_VISIBLE_DEVICES=0
python finetune.py --model_name_or_path $MODEL --data_dir $DATA_DIR --eval_batch_size $TEST_BATCH_SIZE --output_dir $T5_OUTPUT_DIR --max_source_length $MAX_LEN --max_target_length $MAX_LEN --gpus 1 --do_predict --option "forward" &

export CUDA_VISIBLE_DEVICES=2
python finetune.py --model_name_or_path $MODEL --data_dir $DATA_DIR --eval_batch_size $TEST_BATCH_SIZE --output_dir $T5_SPECIFIC_OUTPUT_DIR --max_source_length $MAX_LEN --max_target_length $MAX_LEN --do_predict --gpus 1 --option "t5_specific" &

export CUDA_VISIBLE_DEVICES=1
python finetune.py --model_name_or_path $MODEL --data_dir $DATA_DIR --eval_batch_size $TEST_BATCH_SIZE --output_dir $T5_GENERAL_OUTPUT_DIR --max_source_length $MAX_LEN --max_target_length $MAX_LEN --do_predict --gpus 1 --option "t5_general" &
wait

There is a special character between --model_name_or_path and $MODEL instead of space in the 3rd command, causing this command to not work properly.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.