Giter Site home page Giter Site logo

liuguoyou / spsr Goto Github PK

View Code? Open in Web Editor NEW

This project forked from maclory/spsr

0.0 3.0 0.0 3 KB

Pytorch implementation of Structure-Preserving Super Resolution with Gradient Guidance (CVPR 2020)

Python 27.48% MATLAB 40.98% C 12.70% CSS 0.12% HTML 18.42% Shell 0.27% M 0.03%

spsr's Introduction

SPSR

PyTorch implementation of Structure-Preserving Super Resolution with Gradient Guidance (CVPR 2020) [arXiv]

If you find our work useful in your research, please consider citing:

@inproceedings{ma2020structure,
  title={Structure-Preserving Super Resolution with Gradient Guidance},
  author={Ma, Cheng and Rao, Yongming and Cheng, Yean and Chen, Ce and Lu, Jiwen and Zhou, Jie},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2020}
}

Dependencies

  • Python 3 (Recommend to use Anaconda)
  • PyTorch >= 1.0
  • NVIDIA GPU + CUDA
  • Python packages: pip install numpy opencv-python lmdb pyyaml
  • TensorBoard:
    • PyTorch >= 1.1: pip install tb-nightly future
    • PyTorch == 1.0: pip install tensorboardX

Dataset Preparation

Download Datasets

Commonly used training and testing datasets can be downloaded here.

Preprocess Datasets

We also provide code to preprocess the datasets here.

  • After downloading the original datasets, please store them to a specific GT folder.

  • You can obtain the LR, HR and Bicubic-upsampled versions of the datasets.

  • Then you can extract sub-images with unified scales for training.

  • The training sets can also be transformed into LMDB format for faster IO speed.

Training

To train an SPSR model:

python train.py -opt options/train/train_spsr.json

  • The json file will be processed by options/options.py. Please refer to this for more details.

  • Before running this code, please modify train_spsr.json to your own configurations including:

    • the proper dataroot_HR and dataroot_LR paths for the data loader (More details)
    • saving frequency for models and states
    • whether to resume training with .state files
    • other hyperparameters
    • loss function, etc.
  • You can find your training results in ./experiments.

  • During training, you can use Tesorboard to monitor the losses with tensorboard --logdir tb_logger/NAME_OF_YOUR_EXPERIMENT

  • You can choose to use a pretrained RRDB model as a parameter initialization by setting the pretrain_model_G option in options/train/train_spsr.json. Please download the pretrained model from Google Drive or Baidu Drive (extraction code muw3) and place RRDB_PSNR_x4.pth into ./experiments/pretrain_models.

Testing

To generate SR images by an SPSR model:

python test.py -opt options/test/test_spsr.json

  • Similar to training, the configurations can be modified in the test_spsr.json file.

  • You can find your results in ./results.

  • We provide our SPSR model used in our paper that can be downloaded in Google Drive or Baidu Drive (extraction code muw3). Download spsr.pth and put it into ./experiments/pretrain_models. Then modify the directory of pretrained model in test_spsr.json and run test.py.

  • You can put you LR images in a certain folder and just change the dataroot_LR setting in test_spsr.json and run test.py.

Evaluation Toolbox

We provide a easy and useful evaluation toolbox to simplify the procedure of evaluating SR results. In this toolbox, you can get the MA, NIQE, PI, PSNR, SSIM, MSE, RMSE, MAE and LPIPS values of any SR results you want to evaluate.

Results

Visual Results

Quantitative Results

From the below two tables of comparison with perceptual-driven SR methods, we can see our SPSR method is able to obtain the best PI and LPIPS performance and comparable PSNR and SSIM values simultaneously. The top 2 scores are highlighted.

PI/LPIPS comparison with perceptual-driven SR methods.

Method Set5 Set14 BSD100 General100 Urban100
Bicubic 7.3699/0.3407 7.0268/0.4393 7.0026/0.5249 7.9365/0.3528 6.9435/0.4726
SFTGAN 3.7587/0.0890 2.9063/0.1481 2.3774/0.1769 4.2878/0.1030 3.6136/0.1433
SRGAN 3.9820/0.0882 3.0851/0.1663 2.5459/0.1980 4.3757/0.1055 3.6980/0.1551
ESRGAN 3.7522/0.0748 2.9261/0.1329 2.4793/0.1614 4.3234/0.0879 3.7704/0.1229
NatSR 4.1648/0.0939 3.1094/0.1758 2.7801/0.2114 4.6262/0.1117 3.6523/0.1500
SPSR 3.2743/0.0644 2.9036/0.1318 2.3510/0.1611 4.0991/0.0863 3.5511/0.1184

PSNR/SSIM comparison with perceptual-driven SR methods.

Method Set5 Set14 BSD100 General100 Urban100
Bicubic 28.420/0.8245 26.100/0.7850 25.961/0.6675 28.018/0.8282 23.145/0.9011
SFTGAN 29.932/0.8665 26.223/0.7854 25.505/0.6549 29.026/0.8508 24.013/0.9364
SRGAN 29.168/0.8613 26.171/0.7841 25.459/0.6485 28.575/0.8541 24.397/0.9381
ESRGAN 30.454/0.8677 26.276/0.7783 25.317/0.6506 29.412/0.8546 24.360/0.9453
NatSR 30.991/0.8800 27.514/0.8140 26.445/0.6831 30.346/0.8721 25.464/0.9505
SPSR 30.400/0.8627 26.640/0.7930 25.505/0.6576 29.414/0.8537 24.799/0.9481

Acknowledgement

The code is based on BasicSR, MA, NIQE, PI and LPIPS.

Contact

If you have any questions about our work, please contact [email protected]

spsr's People

Contributors

maclory avatar

Watchers

James Cloos avatar  avatar paper2code - bot avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.