Giter Site home page Giter Site logo

seq2seq-nmt's Introduction

Neural Machine Translation with seq2seq architecture


Image from Standford CS224N

Usage

usage: main.py [-h] [--learning-rate LEARNING_RATE]
               [--lang {afr,sqi,arq,ara,aze,eus,bel,ben,ber,bos,bul,mya,yue,cat,
                        ceb,dtp,cbk,cmn,chv,hrv,ces,dan,nld,est,fin,fra,glg,kat,
                        deu,ell,heb,hin,hun,isl,ilo,ind,ita,jpn,kab,kan,pam,kha,
                        khm,kor,kur,lvs,lit,nds,mkd,zsm,mal,mri,mar,nst,max,nob,
                        pes,pol,por,ron,rus,srp,slk,slv,spa,swe,tgl,tam,tat,tel,
                        tha,tur,tuk,ukr,urd,uig,vie,war,zza}]
               [--reversed] [--epochs EPOCHS] [--dropout DROPOUT]
               [--max-words MAX_WORDS] [--cv-size CV_SIZE] [--use-attention]
               [--verbose-rate VERBOSE_RATE]
               [--sets-size SETS_SIZE [SETS_SIZE ...]]
               [--teacher-forcing {beam-search,curriculum}]

optional arguments:
  -h, --help
     show this help message and exit
  --learning-rate LEARNING_RATE
     step size toward minimum of loss (default: 0.01)
  --lang {afr,sqi,arq,ara,aze,eus,bel,ben,ber,bos,bul,mya,yue,cat,
          ceb,dtp,cbk,cmn,chv,hrv,ces,dan,nld,est,fin,fra,glg,kat,
          deu,ell,heb,hin,hun,isl,ilo,ind,ita,jpn,kab,kan,pam,kha,
          khm,kor,kur,lvs,lit,nds,mkd,zsm,mal,mri,mar,nst,max,nob,
          pes,pol,por,ron,rus,srp,slk,slv,spa,swe,tgl,tam,tat,tel,
          tha,tur,tuk,ukr,urd,uig,vie,war,zza}
     lang pair to translate from/to (default: fra)
  --reversed
     if defined, translate from --lang to english, otherwise translate from
     english (default: False)
  --epochs EPOCHS
     number of epochs to train on dataset (default: 10)
  --dropout DROPOUT
     probability to apply dropout for regularization (default: 0.1)
  --max-words MAX_WORDS
     maximum number of words by sentence (default: 3)
  --cv-size CV_SIZE
     size of the context vector to represent source sequence (default: 256)
  --use-attention
     use attention mechanism in decoder (default: False)
  --verbose-rate VERBOSE_RATE
     print interval (default: 10)
  --sets-size SETS_SIZE [SETS_SIZE ...]
     percentage for train, dev and test sets (default: [0.8, 0.1, 0.1])
  --teacher-forcing {beam-search,curriculum}
     teacher forcing technique to use (default: curriculum)

Data from https://www.manythings.org/anki/, extracted from Tatoeba Project

TODO

  • Beam Search
  • Curriculum Teacher Forcing
  • Scheduled sampling
  • use LSTM
  • Attention
  • Dropout
  • Save model state at defined intervals
  • Use trained Embeddings (GloVe, Doc2Vec...)
  • Streamlit playground with trained models (or upload model)
  • Multiple language pairs support
  • train, dev, test sets
  • buckets
  • backward feeding
  • Bleu score

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.