PReMVOS: Proposal-generation, Refinement and Merging for Video Object Segmentation

PReMVOS: Proposal-generation, Refinement and Merging for Video Object Segmentation, Jonathon Luiten, Paul Voigtlaender, Bastian Leibe, ACCV2018, CVPRW2018, ECCVW2018

PReMVOS is a video object segmentation method that won both the CVPR DAVIS challenge on video object segmentation, and the ECCV 1st Large-scale video object segmentation challenge on the YouTube-VOS dataset.

The arxiv version of PReMVOS paper (accepted at ACCV 2018) can be found here: arxiv paper

The CVPR workshop version of the paper for the DAVIS challenge can be found here: DAVIS paper

The ECCV workshop short abstract version of the paper can be found here: YouTube-VOS paper

Setup on which the code was tested

python 3.6
tensorflow 1.8
pytorch 0.2.0_3
CUDA 9.0
cudnn 7

Usage

To run a simple version of PReMVOS with pre-trained weights already finetuned on the first frame annotations of YouTube-VOS and DAVIS (this version was used to win the ECCV YouTube-VOS challenge), see simple_run.sh
To run the full version, presented in the ACCV/arxiv paper, and that won the CVPR DAVIS challenge, the process is a little more complicated. First you need to generate 'lucid data dreaming' first frame image augmentations for each video, then you need to train seperate weights for the proposal networks and refinement network for each video individually. Having done this, you can then run the rest of the PReMVOS method with these fine-tuned weights. There are however, a number of other differences betweeen this version and the 'simple_run.sh' version including different parameters, a slightly different merging algorithm, different ways to get proposals and a different optical flow method. For more details see the ECCV Workshop short abstract paper. All the code and config files to do this are present in the dataset, but it is by no means trivial to get running. Also note that this process takes an extremely long time to run, especially the first frame image augmentations, and the per video fine-tuning.
To run either the simple or the full method on a new video (not from DAVIS or YouTube-VOS), the method should be fine-tuned on the first frame annotations from that video, not doing so will result in a large drop in performance.

Contact

If you encounter any problems with the code, please contact Jonathon Luiten (luiten at vision dot rwth-aachen dot de).

Citation

If you use this code, please consider citing the following papers:

@inproceedings{luiten2018premvos,
  title={PReMVOS: Proposal-generation, Refinement and Merging for Video Object Segmentation},
  author={Luiten, Jonathon and Voigtlaender, Paul and Leibe, Bastian},
  booktitle={Asian Conference on Computer Vision},
  year={2018}
}

@article{DAVIS2018-Semi-Supervised-1st,
   author={Luiten, Jonathon and Voigtlaender, Paul and Leibe, Bastian},
   title = {PReMVOS: Proposal-generation, Refinement and Merging for the DAVIS Challenge on Video Object Segmentation 2018},
   journal = {The 2018 DAVIS Challenge on Video Object Segmentation - CVPR Workshops},
   year = {2018}
}

@article{youtubevos2018-1st,
   author={Luiten, Jonathon and Voigtlaender, Paul and Leibe, Bastian},
   title = {PReMVOS: Proposal-generation, Refinement and Merging for the YouTube-VOS Challenge on Video Object Segmentation 2018},
   journal = {The 1st Large-scale Video Object Segmentation Challenge - ECCV 2018 Workshops},
   year = {2018}
}

gunpowder78 / premvos Goto Github PK

premvos's Introduction

PReMVOS: Proposal-generation, Refinement and Merging for Video Object Segmentation

Setup on which the code was tested

Usage

Contact

Citation

premvos's People

Contributors

Stargazers

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent