Giter Site home page Giter Site logo

yingtiandt's Projects

ask-anything icon ask-anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

auditory-slow-fast icon auditory-slow-fast

Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch

avid-cma icon avid-cma

Audio Visual Instance Discrimination with Cross-Modal Agreement

brain-score icon brain-score

A framework for evaluating models on their alignment to brain and behavioral measurements (50+ benchmarks)

brainio icon brainio

Data management for quantitative comparison of brains and brain-inspired systems

cav-mae icon cav-mae

Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

contrastive2021 icon contrastive2021

Implementation for paper "Towards the Generalization of Contrastive Self-Supervised Learning" (https://arxiv.org/abs/2111.00743)

dino icon dino

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

gdt icon gdt

We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances to transformations applied to both the audio and video streams.

ijepa icon ijepa

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."

llava icon llava

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

mae_st icon mae_st

Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners"

mamba.py icon mamba.py

A simple and efficient Mamba implementation in PyTorch and MLX.

mmaction2 icon mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

model-tools icon model-tools

Helper functions to extract model activations and translate from Machine Learning to Neuroscience

openstl icon openstl

OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning

pdbpp icon pdbpp

pdb++, a drop-in replacement for pdb (the Python debugger)

result_caching icon result_caching

Store results of function calls with respect to the call parameters

scenic icon scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

segformer-pytorch icon segformer-pytorch

Implementation of Segformer, Attention + MLP neural network for segmentation, in Pytorch

selavi icon selavi

This repo covers the implementation for Labelling unlabelled videos from scratch with multi-modal self-supervision, which learns clusters from multi-modal data in a self-supervised way.

videomae icon videomae

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

videomaev2 icon videomaev2

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.