fairseq-image's Introduction

Setup

  1. Install fairseq
git clone https://github.com/pytorch/fairseq.git
cd fairseq
git checkout 8a0b56efeecba6941c1c0e81a3b86f0a219e6653 # the commit for fairseq-0.10.1
export CUDA_HOME=/usr/local/cuda
python setup.py install
  2. (Optional) Install apex
# Note: apex requires the system nvcc version to match the CUDA version used to build torch,
# e.g. torch 1.7.0+cu110 ==> nvcc -V should be 11.0,
#      torch 1.6.0+cu102 ==> nvcc -V should be 10.2, etc.

git clone https://github.com/NVIDIA/apex
cd apex
pip install -v --no-cache-dir --global-option="--cpp_ext" --global-option="--cuda_ext" \
  --global-option="--deprecated_fused_adam" --global-option="--xentropy" \
  --global-option="--fast_multihead_attn" ./
  3. Install dependencies
pip install -r requirements.txt
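
The nvcc/torch version requirement noted in the apex step can be checked before building. The following is a minimal sketch, not part of this repository: the helper names `nvcc_release` and `cuda_versions_match` are hypothetical, and it assumes `nvcc -V` prints a line containing `release X.Y` and that `torch.version.cuda` reports the CUDA version torch was built with.

```shell
# Hypothetical helper: extract the "release X.Y" version from `nvcc -V` output on stdin.
nvcc_release() {
  sed -n 's/.*release \([0-9][0-9]*\.[0-9][0-9]*\).*/\1/p' | head -n 1
}

# Hypothetical helper: succeed only if both version strings are non-empty and equal.
cuda_versions_match() {
  [ -n "$1" ] && [ "$1" = "$2" ]
}

# Run the comparison only when both nvcc and torch are available.
if command -v nvcc >/dev/null 2>&1 && python -c 'import torch' >/dev/null 2>&1; then
  nvcc_cuda="$(nvcc -V | nvcc_release)"
  torch_cuda="$(python -c 'import torch; print(torch.version.cuda)')"
  if cuda_versions_match "$nvcc_cuda" "$torch_cuda"; then
    echo "OK: nvcc and torch both use CUDA ${nvcc_cuda}"
  else
    echo "Mismatch: nvcc=${nvcc_cuda:-unknown} torch=${torch_cuda:-unknown}" >&2
  fi
fi
```

If a mismatch is reported, the apex extensions are unlikely to build, so it is worth resolving before running the apex `pip install` command.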

Usage

Download the dataset for training:

mkdir -p DATA/download
cd DATA/download
wget http://lstm.seas.harvard.edu/latex/data/im2latex_validate_filter.lst
wget http://lstm.seas.harvard.edu/latex/data/im2latex_train_filter.lst
wget http://lstm.seas.harvard.edu/latex/data/im2latex_test_filter.lst
wget http://lstm.seas.harvard.edu/latex/data/formula_images_processed.tar.gz
wget http://lstm.seas.harvard.edu/latex/data/im2latex_formulas.norm.lst
tar -zxvf formula_images_processed.tar.gz
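
Because the download step fetches five separate files, a quick check that each one arrived and is non-empty can save a failed preprocessing run. This is a sketch only: `missing_downloads` is a hypothetical helper, and the file names are taken from the `wget` commands above.

```shell
# Hypothetical helper: print the name of every expected download that is
# missing or empty in directory $1. The subshell body keeps variables local.
missing_downloads() (
  dir="$1"
  for f in im2latex_validate_filter.lst im2latex_train_filter.lst \
           im2latex_test_filter.lst im2latex_formulas.norm.lst \
           formula_images_processed.tar.gz; do
    [ -s "${dir}/${f}" ] || echo "$f"
  done
)
```

Running `missing_downloads .` from inside DATA/download prints nothing when all five files are present.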

Preprocess:

cd ../../
DLDIR=./DATA/download
OUTDIR=./DATA/data-bin/im2latex
# optionally set the fairseq directory that contains scripts/spm_train.py and scripts/spm_encode.py
# export fairseq_dir=./fairseq 
bash datasets/prepare-im2latex.sh ${DLDIR} ${OUTDIR} 
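
Since the preprocessing comments mention `spm_train.py` and `spm_encode.py` under `${fairseq_dir}/scripts`, it may help to confirm both exist before running the script. The sketch below uses a hypothetical helper, `has_spm_scripts`, and assumes `./fairseq` as the fallback location (taken from the commented-out `export` line); how `prepare-im2latex.sh` behaves when `fairseq_dir` is unset is not documented here.

```shell
# Hypothetical helper: succeed only if both SentencePiece scripts exist under $1/scripts.
has_spm_scripts() {
  [ -f "$1/scripts/spm_train.py" ] && [ -f "$1/scripts/spm_encode.py" ]
}

# Assumption: fall back to ./fairseq when fairseq_dir is not set.
if ! has_spm_scripts "${fairseq_dir:-./fairseq}"; then
  echo "spm_train.py / spm_encode.py not found under ${fairseq_dir:-./fairseq}/scripts" >&2
fi
```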

Train

cd exp
# optionally tune hyperparameters in train_insertion.sh
# then begin training
bash train_insertion.sh
