Giter Site home page Giter Site logo

trellixvulnteam / curriculum-learning-for-vln_q4f9 Goto Github PK

View Code? Open in Web Editor NEW

This project forked from imnearth/curriculum-learning-for-vln

0.0 0.0 0.0 6.52 MB

Code for NeurIPS 2021 paper "Curriculum Learning for Vision-and-Language Navigation"

License: MIT License

Shell 0.55% Python 99.45%

curriculum-learning-for-vln_q4f9's Introduction

Curriculum Learning For LVN

License: MIT

This is the PyTorch implementation of our paper:
Curriculum Learning for Vision-and-Language Navigation [arxiv]
Jiwen Zhang, Zhongyu Wei, Jianqing Fan, Jiajie Peng
35th Conference on Neural Information Processing Systems (NeurIPS 2021)

Most Recent Events

  • 2021-12-27: We upload the tasks/R2R-judy/main.py and training instructions.
  • 2021-11-16: We have our paper arxived, now you can acess it by clicking here !
  • 2021-11-14: We update package of agents and methods. (tasks/R2R-judy/src)
  • 2021-11-08: We update the installation instructions.
  • 2021-11-06: We uploaded the CLR2R dataset mentioned in our paper. (tasks/R2R-judy/data)

Model architectures

This repository includes several SOTA navigation agents previously released. They are

and a path-instruction scorer

Installation

Setting up Environments

  1. Install Python 3.6 (Anaconda recommended: https://docs.anaconda.com/anaconda/install/index.html).

  2. Install PyTorch following the instructions on https://pytorch.org/ (in our experiments, it isPyTorch 1.5.1+cu101).

  3. Following build instructions in this github to build up a v0.1 Matterport3D simulator.

    Besides, just in case you have an error when compiling the simulator, you can try this

    mkdir build && cd build
    cmake -D CUDA_TOOLKIT_ROOT_DIR=path/to/yout/cuda
    make
    cd ../

    For more details on the Matterport3D Simulator, you can refer to README_Matterport3DSimulator.md.

Dataset Download

Luckily, this repository contains the R2R dataset and CLR2R dataset, so you ONLY have to download precomputing ResNet image features from Matterport3DSimulator.

Download and extract the tsv files into the img_features directory. You will only need the ImageNet features to replicate our results.

Clone Repo

Clone (or just download) this reposiroty and replace tasks directory in original Matterport3D simulator with the one in this reposiroty.

After following the steps above the your file directory should look like this:

Matterport3D/
    build/            # should be complied in your machine
    cmake/
    connectivity/     # store Json connecivity graphs for each scan
    img_features/     # store precomputed image features, i.e. ResNet-152 features
    include/
    pybind11/         # a dependency of Matterport3D Simulator
    ...
    tasks/R2R-judy/   # replace it with the one in this directory
    ...

Usage Instructions

To replicate the Table 3 in our paper, try the following command in shell.

CONFIG_PATH="path-to-config-file"
CL_MODE="" # "" / "NAIVE" / "SELF-PACE"

python tasks/R2R-judy/main.py \
--config-file $CONFIG_PATH \
TRAIN.DEVICE your_device_id \
TRAIN.CLMODE $CL_MODE \
...

You can refer to task/tasks/R2R-judy/runner for more details.

curriculum-learning-for-vln_q4f9's People

Contributors

imnearth avatar trellixvulnteam avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.