Topic: video-question-answering
Something interesting about video-question-answering
video-question-answering,A simple attention-based deep learning model that answers questions about a given video by returning the most relevant video intervals.
User: amrhendy
video-question-answering,[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
User: antoyang
Home Page: https://arxiv.org/abs/2206.08155
video-question-answering,[ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos
User: antoyang
Home Page: https://arxiv.org/abs/2012.00451
video-question-answering,[CVPR 2022] A large-scale public benchmark dataset for video question-answering, especially about evidence and commonsense reasoning. The code used in our paper "From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-Answering", CVPR2022.
Organization: bcmi
video-question-answering,A new multi-shot video understanding benchmark Shot2Story20K with detailed shot-level captions and comprehensive video summaries.
Organization: bytedance
Home Page: https://mingfei.info/shot2story
video-question-answering,[NAACL 2024] Official Implementation of paper "Self-Adaptive Sampling for Efficient Video Question Answering on Image-Text Models"
Organization: declare-lab
Home Page: https://arxiv.org/pdf/2307.04192.pdf
video-question-answering,Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)
User: doc-doc
video-question-answering,Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)
User: doc-doc
video-question-answering,Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)
User: doc-doc
Home Page: https://arxiv.org/abs/2309.01327
video-question-answering,NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)
User: doc-doc
video-question-answering,Multi-Scale Progressive Attention Network for Video Question Answering
User: gzcsudo
video-question-answering,[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
User: jayleicn
Home Page: https://arxiv.org/abs/2102.06183
video-question-answering,[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
User: jayleicn
Home Page: http://tvqa.cs.unc.edu
video-question-answering,Part of my work for my Bachelor's Thesis Project on Counterfactual Reasoning for Videos.
User: jena-shreyas
video-question-answering,[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
User: jpthu17
video-question-answering,[CVPR 2023 Highlight] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
User: jpthu17
video-question-answering,DramaQA Starter Code (2021)
User: liveseongho
video-question-answering,Code for ACL SustaiNLP 2023 paper "Is a Video worth n × n Images? A Highly Efficient Approach to Transformer-based Video Question Answering"
User: lyuchenyang
video-question-answering,Code for ACL SRW 2023 paper "Semantic-aware Dynamic Retrospective-Prospective Reasoning for Event-level Video Question Answering"
User: lyuchenyang
video-question-answering,LifeQA website code
Organization: michigannlp
Home Page: https://lit.eecs.umich.edu/lifeqa
video-question-answering,WildQA website code
Organization: michigannlp
Home Page: https://lit.eecs.umich.edu/wildqa
video-question-answering,Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023)
Organization: mlvlab
Home Page: https://ikodoh.github.io/flipped_vqa_demo.html
video-question-answering,MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)
Organization: mlvlab
video-question-answering,Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 2023)
Organization: mlvlab
video-question-answering,Data and PyTorch code for the LifeQA LREC 2020 paper.
User: mmazab
Home Page: https://lit.eecs.umich.edu/lifeqa/
video-question-answering,Given a video, we are able to automatically answer questions about what is happening in the video.
User: nicolas-dufour
video-question-answering,ROCK model for Knowledge-Based VQA in Videos
User: noagarcia
video-question-answering,PyTorch code for ROLL, a knowledge-based video story question answering model.
User: noagarcia
video-question-answering,[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Organization: opengvlab
Home Page: https://vchat.opengvlab.com/
video-question-answering,Video Foundation Models & Data for Multimodal Understanding
Organization: opengvlab
video-question-answering,Video Graph Transformer for Video Question Answering (ECCV'22)
Organization: sail-sg
video-question-answering,Align and Prompt: Video-and-Language Pre-training with Entity Prompts
Organization: salesforce
video-question-answering,A PyTorch implementation of EmpiricalMVM
User: tsujuifu
video-question-answering,A PyTorch implementation of VIOLET
User: tsujuifu
video-question-answering,mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
Organization: x-plug
video-question-answering,Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
Organization: x-plug
video-question-answering,[ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer
User: xliu443
video-question-answering,This repo contains code for Invariant Grounding for Video Question Answering
User: yl3800
video-question-answering,[TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”
User: zchoi
Home Page: https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9882977