
PoolNet's Introduction

❗ This is a cloned repository!

This repository is cloned from backseason/PoolNet and has been modified for research purposes.

A Simple Pooling-Based Design for Real-Time Salient Object Detection

This is a PyTorch implementation of our CVPR 2019 paper.

Development Environments

Prerequisites

cudatoolkit>=10.1.0
torch>=1.7.0
torchvision>=0.8.0

Update

  1. We released our code for joint training with edge, which also gives our best-performing model.
  2. You may refer to this repo for results evaluation: SalMetric.

Installation

  1. Clone the repository

    $ git clone https://github.com/backseason/PoolNet.git
    $ cd ./PoolNet/
  2. Create the conda environment

    $ conda env create -f ${REPO_ROOT}/dev-envs/conda-py38torch17.yml
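
After creating the environment, activate it and run a quick sanity check such as the sketch below; it simply confirms the prerequisites listed above are met (the script name is illustrative, not part of the repository).

    # check_env.py: minimal sanity check for the prerequisites listed above.
    # Run it inside the conda environment created from dev-envs/conda-py38torch17.yml.
    import torch
    import torchvision

    print("torch:", torch.__version__)              # expected >= 1.7.0
    print("torchvision:", torchvision.__version__)  # expected >= 0.8.0
    print("CUDA available:", torch.cuda.is_available())
    if torch.cuda.is_available():
        print("CUDA runtime:", torch.version.cuda)  # expected >= 10.1
        print("GPU:", torch.cuda.get_device_name(0))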

Data Preparation

Download the following datasets and unzip them into the data folder.

  • MSRA-B and HKU-IS dataset. The .lst file for training is data/msrab_hkuis/msrab_hkuis_train_no_small.lst.
  • DUTS dataset. The .lst file for training is data/DUTS/DUTS-TR/train_pair.lst.
  • BSDS-PASCAL dataset. The .lst file for training is ./data/HED-BSDS_PASCAL/bsds_pascal_train_pair_r_val_r_small.lst.
  • Datasets for testing.
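
Each training .lst file pairs an input image with its ground-truth mask. The sketch below shows one way to inspect such a list; the assumed format (one whitespace-separated "image_path gt_path" pair per line, relative to --train_root) should be verified against the downloaded files.

    # inspect_lst.py: minimal sketch for checking a training .lst file.
    # Assumption: each line holds "image_path gt_path", relative to --train_root.
    import os

    train_root = "data/msrab_hkuis"
    lst_path = os.path.join(train_root, "msrab_hkuis_train_no_small.lst")

    with open(lst_path) as f:
        pairs = [line.split() for line in f if line.strip()]

    print("number of training pairs:", len(pairs))
    for img_rel, gt_rel in pairs[:3]:
        print(os.path.join(train_root, img_rel), "->", os.path.join(train_root, gt_rel))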

Pre-trained Models

  • PoolNet-ResNet50 w/o edge model: GoogleDrive | BaiduYun (pwd: 2uln). FPS: 1.29 (CPU), 29.82 (GPU)
  • PoolNet-ResNet50 w/ edge model (best performance): GoogleDrive | BaiduYun (pwd: ksii). FPS: -
  • PoolNet-VGG16 w/ edge model (pre-computed maps): GoogleDrive | BaiduYun (pwd: 3wgc). FPS: -
  • Unspecified: GoogleDrive | BaiduYun (pwd: 27p5). FPS: -

FPS is measured including pre/post-processing with batch size 1.

FPS on CPU is measured on a Threadripper 2950X CPU, and FPS on GPU is measured on an RTX 2080 Ti GPU.
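
The numbers above come from model_inspect.py (see the FPS Benchmark section below). As a reference for the methodology, a batch-size-1 timing loop that includes pre- and post-processing looks roughly like the sketch below; the model and input image are placeholders and the pre/post-processing steps are simplified assumptions, not the exact repository code.

    # fps_sketch.py: rough illustration of the FPS measurement described above
    # (batch size 1, pre/post-processing included). Model and image are placeholders.
    import time
    import numpy as np
    import torch

    def measure_fps(model, image_hwc_uint8, runs=100, device="cuda"):
        model = model.to(device).eval()
        elapsed = 0.0
        with torch.no_grad():
            for _ in range(runs):
                start = time.time()
                # pre-processing: HWC uint8 image -> 1x3xHxW float tensor
                x = torch.from_numpy(image_hwc_uint8).permute(2, 0, 1).float()
                x = x.unsqueeze(0).to(device)
                pred = model(x)
                # post-processing: sigmoid + conversion to a uint8 saliency map
                # (the .cpu() call also synchronizes the GPU, so timing is accurate)
                sal = (torch.sigmoid(pred).squeeze().cpu().numpy() * 255).astype(np.uint8)
                elapsed += time.time() - start
        return runs / elapsed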

✋ Note

  1. Only batch_size=1 is supported.
  2. Except for the backbone, we do not use BN layers.

Demo

  • Execute the command below:
$ python ${REPO_ROOT}/model_inspect.py --runmode infer --model_path ${PTH_PATH} --input_img_path ${INPUT_IMG_PATH} --output_img_path ${OUTPUT_IMG_PATH}

FPS Benchmark

  • Execute the command below for CPU:
$ python ${REPO_ROOT}/model_inspect.py --runmode fps --model_path ${PTH_PATH} --input_img_path ${INPUT_IMG_PATH} --cpu

Training

  1. Set the --train_root and --train_list paths in train.sh correctly.

  2. We demonstrate training with ResNet-50 as the network backbone, using an initial learning rate of 5e-5 for 24 epochs; the learning rate is divided by 10 after 15 epochs (a schedule sketch follows this list).

./train.sh
  3. We demonstrate joint training with edge, using ResNet-50 as the network backbone and an initial learning rate of 5e-5 for 11 epochs; the learning rate is divided by 10 after 8 epochs. Each epoch runs for 30000 iterations.
./joint_train.sh
  4. After training, the resulting model will be stored under the results/run-* folder.
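
The learning-rate schedules in steps 2 and 3 amount to a single step decay. Below is a minimal sketch of the 24-epoch schedule; the optimizer choice (Adam) and weight decay value are assumptions for illustration, and the authoritative settings live in train.sh and the solver code.

    # lr_schedule_sketch.py: step-decay schedule described above
    # (initial lr 5e-5 for 24 epochs, divided by 10 after epoch 15).
    # Optimizer choice and weight decay are assumptions, not repository settings.
    import torch

    model = torch.nn.Conv2d(3, 1, 3)  # stand-in for the actual PoolNet model
    optimizer = torch.optim.Adam(model.parameters(), lr=5e-5, weight_decay=5e-4)
    scheduler = torch.optim.lr_scheduler.MultiStepLR(optimizer, milestones=[15], gamma=0.1)

    for epoch in range(24):
        # ... run one training epoch here ...
        scheduler.step()
        print("epoch", epoch, "lr", optimizer.param_groups[0]["lr"])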

Test

For single-dataset testing, change * accordingly; --sal_mode selects the test dataset (details can be found in main.py):

python main.py --mode='test' --model='results/run-*/models/final.pth' --test_fold='results/run-*-sal-e' --sal_mode='e'

For testing on all datasets used in our paper (the first argument, 2, selects the GPU to use):

./forward.sh 2 main.py results/run-*

For joint training, to get salient object detection results, use:

./forward.sh 2 joint_main.py results/run-*

To get edge detection results, use:

./forward_edge.sh 2 joint_main.py results/run-*

All resulting saliency maps will be stored under the results/run-*-sal-* folders in .png format.
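
For a quick MAE check on these maps, a simple per-image comparison against the ground-truth masks is enough; for the metrics reported in the paper, use SalMetric (see the Update section). In the sketch below, the directory paths are placeholders, and predictions and masks are assumed to be same-size, single-channel .png files sharing file names.

    # mae_sketch.py: quick MAE check for predicted saliency maps.
    # MAE = mean absolute difference between normalized prediction and mask.
    # Directory paths below are placeholders, not repository defaults.
    import glob
    import os
    import numpy as np
    from PIL import Image

    pred_dir = "results/run-0-sal-e"        # predicted saliency maps (.png)
    gt_dir = "path/to/ground_truth_masks"   # matching ground-truth masks (.png)

    maes = []
    for pred_path in glob.glob(os.path.join(pred_dir, "*.png")):
        gt_path = os.path.join(gt_dir, os.path.basename(pred_path))
        pred = np.asarray(Image.open(pred_path).convert("L"), dtype=np.float32) / 255.0
        gt = np.asarray(Image.open(gt_path).convert("L"), dtype=np.float32) / 255.0
        maes.append(float(np.abs(pred - gt).mean()))

    print("MAE over", len(maes), "images:", np.mean(maes))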

Contact

If you have any questions, feel free to contact me via: j04.liu(at)gmail.com.

Citation

@inproceedings{Liu2019PoolSal,
  title={A Simple Pooling-Based Design for Real-Time Salient Object Detection},
  author={Jiang-Jiang Liu and Qibin Hou and Ming-Ming Cheng and Jiashi Feng and Jianmin Jiang},
  booktitle={IEEE CVPR},
  year={2019},
}

Thanks to DSS and DSS-pytorch.

poolnet's People

Contributors

backseason, houqb, swoook

poolnet's Issues

We can't achieve the MAE stated in the paper when we train PoolNet ourselves

Issue description

  • We trained PoolNet ourselves, starting from ImageNet pre-trained weights, on the union of the MSRA-B and HKU-IS datasets.
  • However, the MAE we achieve is considerably worse than the one stated in the paper.

Code example

  • Configuration defined in launch.json for VS Code:
        {
            "name": "Python: Test system environments",
            "type": "python",
            "request": "launch",
            "program": "${workspaceRoot}/main.py",
            "console": "integratedTerminal",
            "args": [
                "--mode", "train",
                "--arch", "resnet",
                "--model", "/data/swook/models/poolnet/resnet50_caffe.pth",
                "--pretrained_model", "/data/swook/models/poolnet/resnet50_caffe.pth",
                "--train_root", "/data/swook/dataset/poolnet/msrab_hkuis",
                "--train_list", "/data/swook/dataset/poolnet/msrab_hkuis/msrab_hkuis_train_no_small.lst",
                "--test_root", "/data/swook/dataset/duts/DUTS-SOD100K/DUTS-TE",
                "--test_list", "/data/swook/dataset/duts/DUTS-SOD100K/DUTS-TE/test.lst",
                // "--cpu"
                ]
        },

System Info

PyTorch version: 1.8.1
Is debug build: False
CUDA used to build PyTorch: 10.2
ROCM used to build PyTorch: N/A

OS: Ubuntu 16.04.3 LTS (x86_64)
GCC version: (Ubuntu 5.4.0-6ubuntu1~16.04.12) 5.4.0 20160609
Clang version: Could not collect
CMake version: version 3.5.1

Python version: 3.8 (64-bit runtime)
Is CUDA available: True
CUDA runtime version: Could not collect
GPU models and configuration: 
GPU 0: GeForce RTX 2080 Ti
GPU 1: GeForce RTX 2080 Ti

Nvidia driver version: 440.33.01
cuDNN version: Could not collect
HIP runtime version: N/A
MIOpen runtime version: N/A

Versions of relevant libraries:
[pip3] numpy==1.20.2
[pip3] torch==1.8.1
[pip3] torchvision==0.9.1
[conda] blas                      1.0                         mkl  
[conda] cudatoolkit               10.2.89              hfd86e86_1  
[conda] ffmpeg                    4.3                  hf484d3e_0    pytorch
[conda] mkl                       2021.2.0           h06a4308_296  
[conda] mkl-service               2.3.0            py38h27cfd23_1  
[conda] mkl_fft                   1.3.0            py38h42c9631_2  
[conda] mkl_random                1.2.1            py38ha9443f7_2  
[conda] numpy                     1.20.2           py38h2d18471_0  
[conda] numpy-base                1.20.2           py38hfae3a4d_0  
[conda] pytorch                   1.8.1           py3.8_cuda10.2_cudnn7.6.5_0    pytorch
[conda] torchvision               0.9.1                py38_cu102    pytorch
