
Dilated-Convolution-with-Learnable-Spacings-PyTorch

This is an official implementation of Dilated Convolution with Learnable Spacings by Ismail Khalfaoui Hassani, Thomas Pellegrini and Timothée Masquelier.

Dilated Convolution with Learnable Spacings (abbreviated DCLS) is a novel convolution method based on gradient descent and interpolation. It can be seen as an improvement over the well-known dilated convolution, which has been widely explored in deep convolutional neural networks and which inflates the convolutional kernel by inserting spaces between the kernel elements.

In DCLS, the positions of the weights within the convolutional kernel are learned in a gradient-based manner, and the inherent problem of non-differentiability due to the integer nature of the positions in the kernel is solved by taking advantage of an interpolation method.
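
Concretely, both the kernel weights and their positions are exposed as ordinary learnable parameters of the module. The short sketch below (based on the usage examples further down; it only inspects the weight and P attributes printed in those examples) illustrates this:

import torch
from DCLS.construct.modules import Dcls2d

# A depthwise DCLS layer: 26 weights free to move inside a 17x17 dilated kernel
m = Dcls2d(96, 96, kernel_count=26, dilated_kernel_size=17, padding=8, groups=96)

# The weights and the (real-valued) positions P are both learnable parameters,
# so a standard optimizer updates the spacings by gradient descent
print(m.weight.shape)
print(m.P.shape)
print(m.P.requires_grad)  # True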

For now, the method has only been implemented in PyTorch.

The method is described in the article Dilated Convolution with Learnable Spacings. The Gaussian and triangle versions are described in the arXiv preprint Dilated Convolution with Learnable Spacings: beyond bilinear interpolation.

What's new

Jun 16, 2023:

Jun 2, 2023:

  • New DCLS version supports Gaussian and triangle interpolations in addition to the previous bilinear interpolation. To use it, please run:
pip3 install --upgrade --force-reinstall dcls

or recompile after a git update.

import torch
from DCLS.construct.modules import Dcls2d

# Dcls2d with Gaussian interpolation. Available versions: ["gauss", "max", "v1", "v0"]
m = Dcls2d(96, 96, kernel_count=26, dilated_kernel_size=17, padding=8, groups=96, version="gauss")
input = torch.randn(20, 96, 50, 100)
output = m(input)
loss = output.sum()
loss.backward()
print(output, m.weight.grad, m.P.grad, m.SIG.grad)

Apr 16, 2023:

  • Fixed an important bug in the Dcls1d version. Please reinstall the pip wheel via
pip3 install --upgrade --force-reinstall dcls

or recompile after a git update.

Jan 7, 2023:

  • Important modification to the ConstructKernel{1,2,3}d algorithm that uses less memory and enables very large kernel counts. For example:
from DCLS.construct.modules import Dcls2d

m = Dcls2d(96, 96, kernel_count=2000, dilated_kernel_size=7, padding=3, groups=96).cuda()

After installing the new version 0.0.3 of DCLS, usage remains unchanged.
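
For example, a forward and backward pass with this very large kernel count follows the same pattern as the smaller usage examples below (a minimal sketch; the input shape is illustrative and a CUDA device is assumed):

import torch
from DCLS.construct.modules import Dcls2d

# Same forward/backward pattern as with smaller kernel counts
m = Dcls2d(96, 96, kernel_count=2000, dilated_kernel_size=7, padding=3, groups=96).cuda()
input = torch.randn(20, 96, 50, 100, device="cuda")
output = m(input)
loss = output.sum()
loss.backward()
print(output, m.weight.grad, m.P.grad)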

Nov 8, 2022:

  • The previous main branch has been moved to the cuda branch; the main branch now uses fully native torch conv{1,2,3}d.

Sep 27, 2022:

Installation

DCLS is based on PyTorch and CUDA. Please make sure that you have installed all the requirements before you install DCLS.

Requirements:

  • PyTorch version torch>=1.6.0.

Preferred versions:

pip3 install torch==1.8.0+cu111 torchvision==0.9.0+cu111 -f https://download.pytorch.org/whl/torch_stable.html
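
As a quick sanity check before installing DCLS (not part of the original instructions, just a common verification step), you can confirm the installed PyTorch version and whether CUDA is available:

import torch

print(torch.__version__)          # should be >= 1.6.0
print(torch.cuda.is_available())  # True if a CUDA-capable GPU and driver are set up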

Install the latest development version from source:

From GitHub:

git clone https://github.com/K-H-Ismail/Dilated-Convolution-with-Learnable-Spacings-PyTorch.git
cd Dilated-Convolution-with-Learnable-Spacings-PyTorch
python3 -m pip install --upgrade pip
python3 -m build 
python3 -m pip install dist/dcls-0.0.5-py3-none-any.whl 

Install the latest stable version from PyPI:

pip3 install dcls
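
To verify the installation (a suggestion of mine, mirroring the imports used in the usage examples below), importing one of the modules should succeed:

from DCLS.construct.modules import Dcls2d

print(Dcls2d)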

Usage

Dcls modules can be used as drop-in substitutes for PyTorch's classical nn.Convnd convolution modules:

import torch
from DCLS.construct.modules import Dcls2d

# Square dilated kernels: 3 learnable weights inside a 7x7 dilated kernel
m = Dcls2d(16, 33, kernel_count=3, dilated_kernel_size=7)
input = torch.randn(20, 16, 50, 100)
output = m(input)
loss = output.sum()
loss.backward()
print(output, m.weight.grad, m.P.grad)

A typical use case is depthwise separable convolution:

import torch
from DCLS.construct.modules import Dcls2d

m = Dcls2d(96, 96, kernel_count=34, dilated_kernel_size=17, padding=8, groups=96)
input = torch.randn(128, 96, 56, 56)
output = m(input)
loss = output.sum()
loss.backward()
print(output, m.weight.grad, m.P.grad)
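
To complete the separable pattern, the depthwise DCLS layer above is typically followed by a pointwise 1x1 convolution. A minimal sketch (the pointwise nn.Conv2d layer and its output channel count are illustrative additions, not part of the original example):

import torch
import torch.nn as nn
from DCLS.construct.modules import Dcls2d

# Depthwise DCLS followed by a pointwise 1x1 convolution (a standard separable block)
depthwise = Dcls2d(96, 96, kernel_count=34, dilated_kernel_size=17, padding=8, groups=96)
pointwise = nn.Conv2d(96, 192, kernel_size=1)

input = torch.randn(128, 96, 56, 56)
output = pointwise(depthwise(input))
print(output.shape)  # torch.Size([128, 192, 56, 56])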

Dcls with different dimensions

import torch
from DCLS.construct.modules import Dcls1d

# Will construct 1D kernels of size 7 with 3 elements inside each kernel
m = Dcls1d(3, 16, kernel_count=3, dilated_kernel_size=7)
input = torch.rand(8, 3, 32)
output = m(input)
loss = output.sum()
loss.backward()
print(output, m.weight.grad, m.P.grad)

import torch
from DCLS.construct.modules import Dcls3d

m = Dcls3d(16, 33, kernel_count=10, dilated_kernel_size=(7,8,9))
input = torch.randn(20, 16, 50, 100, 30)
output = m(input)
loss = output.sum()
loss.backward()
print(output, m.weight.grad, m.P.grad)

DepthWiseConv2dImplicitGEMM for 2D-DCLS:

To install and enable DepthWiseConv2dImplicitGEMM for 2D-DCLS, please follow the RepLKNet instructions. Otherwise, PyTorch's native Conv2d method will be used.

Device Support

DCLS currently supports CPU and Nvidia CUDA GPU devices:

  • Nvidia GPU
  • CPU

Make sure your data and model are on the CUDA GPU.
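
For example (a minimal sketch, assuming a CUDA device is available), move both the module and its input to the GPU before the forward pass:

import torch
from DCLS.construct.modules import Dcls2d

device = torch.device("cuda")  # assumes a CUDA-capable GPU
m = Dcls2d(96, 96, kernel_count=34, dilated_kernel_size=17, padding=8, groups=96).to(device)
input = torch.randn(20, 96, 50, 100, device=device)
output = m(input)
print(output.device)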

Publications and Citation

If you use DCLS in your work, please consider citing it as follows:

@inproceedings{hassani2023dilated,
  title={Dilated convolution with learnable spacings},
  author={Ismail Khalfaoui-Hassani and Thomas Pellegrini and Timoth{\'e}e Masquelier},
  booktitle={The Eleventh International Conference on Learning Representations},
  year={2023},
  url={https://openreview.net/forum?id=Q3-1vRh3HOA}
}

If you use DCLS with Gaussian or triangle interpolations in your work, please consider citing the following as well:

@article{khalfaoui2023dilated,
  title={Dilated convolution with learnable spacings: beyond bilinear interpolation},
  author={Khalfaoui-Hassani, Ismail and Pellegrini, Thomas and Masquelier, Timoth{\'e}e},
  journal={arXiv preprint arXiv:2306.00817},
  year={2023}
}

Contribution

This project is open source, so all contributions are welcome, whether reporting issues, finding and fixing bugs, requesting new features, or sending pull requests.
