dl-vit,github

twins

Two simple and effective vision transformer designs that are on par with the Swin Transformer

u-dit

The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"

u-vit

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

uctransnet

Implementation of our AAAI'22 work: 'UCTransNet: Rethinking the Skip Connections in U-Net from a Channel-wise Perspective with Transformer'.

This repo is the official implementation of 'Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentation' which is an improved journal version of UCTransNet.

uformer

[CVPR 2022] Official repository for the paper "Uformer: A General U-Shaped Transformer for Image Restoration".

um-mae

Official code for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"

unified-normalization

Unified Normalization (ACM MM'22), by Qiming Yang, Kai Zhang, Chaoxiang Lan, Zhi Yang, Zheyang Li, Wenming Tan, Jun Xiao, and Shiliang Pu. This repository is the official implementation of "Unified Normalization for Accelerating and Stabilizing Transformers".

uniformer

[ICLR2022] official implementation of UniFormer

uniformerv2

UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

uninext

uperformer

A Multi-scale Transformer-based Decoder for Semantic Segmentation

v2x-vit

[ECCV2022] Official Implementation of paper "V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer"

vid-tldr

Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".

video-swin-transformer

This is an official implementation for "Video Swin Transformers".

vilt

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

vipformer

[ICRA 2023] ViPFormer: Efficient Vision-and-Pointcloud Transformer for Unsupervised Pointcloud Understanding. https://arxiv.org/abs/2303.14376

visformer

vision-diffmask

Official PyTorch implementation of Vision DiffMask, a post-hoc interpretation method for vision models.

vision-longformer

vision_transformer

visual_token_matching

[ICLR'23 Oral] Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching

visualrecognition-nommer

Code for CVPR 2022 paper "NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition"

vit-adapter

Vision Transformer Adapter for Dense Predictions

vit-anti-oversmoothing

[ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wenqing Zheng, Tianlong Chen, Zhangyang Wang

vit-dd

Official ViT-DD repository

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

vit-rgts

Open source implementation of "Vision Transformers Need Registers"

vitae-transformer

This is an official implementation of "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond".
