ishrat-tl
Name: Ishrat Badami
Type: User
Company: Triplelift
Bio: CTV Computer Vision and Machine Learning Engineer
Location: Ottawa, Canada
Blog: https://triplelift.com/
An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch
[VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection - CVPR NAS 2023
We estimate dense, flicker-free, geometrically consistent depth from monocular video, for example hand-held cell phone video.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)
Faster Whisper transcription with CTranslate2
Fast Segment Anything
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Largest Interior/Inscribed Rectangle implementation in Python.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Math renders for Medium articles
OpenMMLab optical flow toolbox and benchmark
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
Monocular Depth Estimation Toolbox based on MMSegmentation.
🔥 chat with over 10K frames of video!
[CVPR'19] Single-Image Piece-wise Planar 3D Reconstruction via Associative Embedding
This is an official implementation for "PlaneRecNet" (BMVC 2021).
Code for the Recognize Anything Model (RAM) and Tag2Text Model
Codebase for CVPR2020 A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
Code for Transformers Solve Limited Receptive Field for Monocular Depth Prediction
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)