Giter Site home page Giter Site logo

Ishrat Badami's Projects

aot-benchmark icon aot-benchmark

An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch

ask-anything icon ask-anything

[VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

autoshot icon autoshot

AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection - CVPR NAS 2023

consistent_depth icon consistent_depth

We estimate dense, flicker-free, geometrically consistent depth from monocular video, for example hand-held cell phone video.

detectron2 icon detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

e2fgvi icon e2fgvi

Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)

groundingdino icon groundingdino

Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

lir icon lir

Largest Interior/Inscribed Rectangle implementation in Python.

llava icon llava

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

medium icon medium

math renders for medium articles

mmflow icon mmflow

OpenMMLab optical flow toolbox and benchmark

planerecnet icon planerecnet

This is an official implementation for "PlaneRecNet" (BMVC 2021).

sceneseg icon sceneseg

Codebase for CVPR2020 A Local-to-Global Approach to Multi-modal Movie Scene Segmentation

segment-and-track-anything icon segment-and-track-anything

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.

semantic-segment-anything icon semantic-segment-anything

Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).

transdepth icon transdepth

Code for Transformers Solve Limited Receptive Field for Monocular Depth Prediction

whisperx icon whisperx

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.