Topic: vit
Something interesting about ViT (Vision Transformer).
vit,Summary of Transformer applications for computer vision tasks.
User: aiprogrammer
vit,:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+
User: br-idl
Home Page: https://github.com/BR-IDL/PaddleViT
vit,A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
User: chinhsuanwu
Home Page: https://arxiv.org/abs/2110.02178
vit,An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
User: cmhungsteve
vit,Vision Transformer Pruning
User: cydia2018
vit,An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"
User: daniel-code
vit,📖A small curated list of Awesome SD/DiT/ViT/Diffusion Inference with Distributed/Caching/Sampling: DistriFusion, PipeFusion, AsyncDiff, DeepCache, Block Caching etc.
User: deftruth
Home Page: https://github.com/DefTruth/Awesome-SD-Distributed-Inference
vit,i. A practical application of a Transformer (ViT) to 2-D physiological signal (EEG) classification; can also be tried on EMG, EOG, ECG, etc. ii. Includes attention over the spatial dimension (channel attention) and the *temporal dimension*. iii. Common spatial pattern (CSP), an efficient feature-enhancement method, implemented in Python.
User: eeyhsong
vit,Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch
User: gcambara
vit,An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
User: gupta-abhay
Home Page: https://arxiv.org/abs/2010.11929
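The core idea of this paper is that an image can be treated as a sequence of 16x16 patch tokens. A minimal sketch of the patch-embedding step in PyTorch (class and variable names are illustrative, not taken from any of the repos above; the 224x224 image / 16x16 patch / 768-dim config follows the ViT-Base setting described in the paper):

```python
import torch
import torch.nn as nn

class PatchEmbedding(nn.Module):
    """Split an image into non-overlapping patches and project each to a token.

    A conv with kernel_size == stride == patch_size is equivalent to flattening
    each patch and applying a shared linear projection.
    """
    def __init__(self, img_size=224, patch_size=16, in_chans=3, embed_dim=768):
        super().__init__()
        self.num_patches = (img_size // patch_size) ** 2  # 14 * 14 = 196
        self.proj = nn.Conv2d(in_chans, embed_dim,
                              kernel_size=patch_size, stride=patch_size)

    def forward(self, x):
        x = self.proj(x)                     # (B, D, H/P, W/P)
        return x.flatten(2).transpose(1, 2)  # (B, N, D) token sequence

embed = PatchEmbedding()
tokens = embed(torch.randn(2, 3, 224, 224))
print(tokens.shape)  # torch.Size([2, 196, 768])
```

The resulting `(B, 196, 768)` sequence (plus a class token and positional embeddings) is what the standard Transformer encoder consumes.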
vit,[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
User: hila-chefer
vit,Training ImageNet / CIFAR models with SOTA strategies and fancy techniques such as ViT, KD, Rep, etc.
User: hunto
vit,A code repo containing reproductions of various SOTA deep learning algorithms.
User: hzcirving
vit,Reproduction of semantic segmentation using Masked Autoencoder (MAE)
User: implus
vit,An unofficial implementation of ViTPose [Y. Xu et al., 2022]
User: jaehyunnn
vit,Jittor Image Models is a library for pulling together a wide variety of SOTA deep learning models in the Jittor framework.
User: jittor-image-models
vit,Vision Transformer using TensorFlow 2.0
User: kamalkraj
Home Page: https://openreview.net/forum?id=YicbFdNTTy
vit,My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
User: kyegomez
Home Page: https://discord.gg/qUtxnK2NMf
vit,Open source implementation of "Vision Transformers Need Registers"
User: kyegomez
Home Page: https://discord.gg/qUtxnK2NMf
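The paper's proposal is simple: prepend a few extra learnable "register" tokens to the patch sequence so the model has somewhere to store global information, instead of hijacking high-norm patch tokens; the registers are discarded at the output. A minimal sketch under assumed shapes (not the official implementation; names are illustrative):

```python
import torch
import torch.nn as nn

class RegisterTokens(nn.Module):
    """Prepend learnable register tokens to a ViT token sequence."""
    def __init__(self, num_registers=4, embed_dim=768):
        super().__init__()
        # One shared set of registers, broadcast across the batch.
        self.registers = nn.Parameter(torch.zeros(1, num_registers, embed_dim))

    def forward(self, tokens):  # tokens: (B, N, D)
        regs = self.registers.expand(tokens.size(0), -1, -1)
        return torch.cat([regs, tokens], dim=1)  # (B, R + N, D)

x = torch.randn(2, 196, 768)
out = RegisterTokens(num_registers=4)(x)
print(out.shape)  # torch.Size([2, 200, 768])
```

After the encoder runs, the first `num_registers` tokens would simply be sliced off before any downstream head.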
vit,pix2tex: Using a ViT to convert images of equations into LaTeX code.
User: lukas-blecher
Home Page: https://lukas-blecher.github.io/LaTeX-OCR/
vit,Official Code of Paper "Reversible Column Networks" "RevColv2"
Organization: megvii-research
vit,Vision Transformer explanation and implementation with PyTorch
User: nerminnuraydogan
vit,Code for the paper "Revisiting Adversarial Training for ImageNet: Architectures, Training and Generalization across Threat Models"
User: nmndeep
vit,Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks
Organization: open-compass
Home Page: https://huggingface.co/spaces/opencompass/open_vlm_leaderboard
vit,PASSL includes self-supervised image algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, SimSiam, SwAV, BEiT, and MAE, as well as fundamental vision models such as Vision Transformer, DeiT, Swin Transformer, CvT, T2T-ViT, MLP-Mixer, XCiT, ConvNeXt, and PVTv2.
Organization: paddlepaddle
vit,Paddle Large Scale Classification Tools, supporting ArcFace, CosFace, PartialFC, and Data Parallel + Model Parallel. Models include ResNet, ViT, Swin, DeiT, CaiT, FaceViT, MoCo, MAE, ConvMAE, and CAE.
Organization: paddlepaddle
vit,A PyTorch implementation of CNN+Vision Transformer for hyperspectral image classification
User: purbayankar
vit,HugsVision is an easy-to-use HuggingFace wrapper for state-of-the-art computer vision
User: qanastek
Home Page: https://pypi.org/project/hugsvision/
vit,This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog post.
User: rasbt
Home Page: https://lightning.ai/pages/community/tutorial/pytorch-memory-vit-llm/
vit,A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
Organization: roboflow
Home Page: https://inference.roboflow.com
vit,From scratch, simple and easy to understand PyTorch implementation of Vision Transformer (ViT) for small datasets like MNIST, FashionMNIST, SVHN and CIFAR10 with detailed steps.
User: s-chh
vit,Implementation of transformers based architecture in PyTorch.
User: shivamrajsharma
vit,A hub for innovation through web development projects
Organization: ssitvit
Home Page: https://codecanvas.ieeessitvit.com/
vit,Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"
User: szq0214
vit,Flexible Python library providing building blocks (layers) for reproducible Transformers research (Tensorflow ✅, Pytorch 🔜, and Jax 🔜)
Organization: tensorops
vit,Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Organization: towhee-io
Home Page: https://towhee.io
vit,Code of TVT: Transferable Vision Transformer for Unsupervised Domain Adaptation, WACV 2023
Organization: uta-smile
vit,Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
User: v-iashin
Home Page: https://v-iashin.github.io/video_features
vit,FFCS course registration made hassle free for VITians. Search courses and visualize the timetable on the go!
User: vatz88
Home Page: https://ffcsonthego.vatz88.in
vit,🚀 React application framework inspired by UmiJS
Organization: vitjs
vit,[MedIA Journal] An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Organization: xmindflow
vit,A paper list of some recent Transformer-based CV works.
User: yangzhangcst
vit,Mimix: A Text Generation Tool and Pretrained Chinese Models
User: yaoxiaoyuan
vit,ICCV 2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Organization: yitu-opensource
vit,[ECCV 2024] PyTorch code for NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
User: zubair-irshad
Home Page: https://nerf-mae.github.io/
vit,A ViT based transformer applied on multi-channel time-series EEG data for motor imagery classification
User: zwcolin