Topic: visual-grounding Goto Github
Some thing interesting about visual-grounding
Some thing interesting about visual-grounding
visual-grounding,[ICRA 2023] Differentiable parsing and visual grounding of natural language instructions for object placement
User: 1989ryan
visual-grounding,Helper tools for extracting and projecting ENet features to ScanNet pointclouds.
Organization: 3dlg-hcvc
visual-grounding,[ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects
Organization: 3dlg-hcvc
Home Page: https://3dlg-hcvc.github.io/multi3drefer/
visual-grounding,[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
User: antoyang
visual-grounding,Utilizing a transformer-based object detector for the task of 3D visual grounding.
User: bwittmann
visual-grounding,A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests welcomed.
User: charles-xie
visual-grounding,TransformerVG - 3D Visual Grounding with Transformers
User: chenbarryhu
visual-grounding,Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"
User: chenyunwu
visual-grounding,PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
User: chihyaoma
Home Page: https://arxiv.org/abs/1906.00283
visual-grounding,Code used to train probing classifiers in the attribute prediction task
Organization: compguesswhat
Home Page: https://compguesswhat.github.io
visual-grounding,Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases
User: curryyuan
visual-grounding,[CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding
User: curryyuan
Home Page: https://curryyuan.github.io/ZSVG3D/
visual-grounding,[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
User: daveredrum
Home Page: https://daveredrum.github.io/D3Net/
visual-grounding,[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
User: daveredrum
Home Page: https://daveredrum.github.io/ScanRefer/
visual-grounding,Visual Relation Grounding in Videos (ECCV'20, Spotlight)
User: doc-doc
visual-grounding,Codebase for "Learning to ground medical text in a 3D human atlas (CoNLL 2020)".
User: gorjanradevski
visual-grounding,[EMNLP 22] Extending Phrase Grounding with Pronouns in Visual Dialogues.
User: izhx
Home Page: https://arxiv.org/abs/2210.12658
visual-grounding,Referring Video Object Segmentation / Multi-Object Tracking Repo
User: jerryx1110
visual-grounding,Under review. [IROS 2024] PGA: Personalizing Grasping Agents with Single Human-Robot Interaction
User: jhkim-snu
visual-grounding,A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.
User: jianghaojun
visual-grounding,[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
User: leaplabthu
Home Page: https://arxiv.org/abs/2203.08481
visual-grounding,Dissertation for "Weakly Supervised Visual-Textual Grounding based on Concept Similarity" (MS thesis at University of Padua, Italy) - PyTorch implementation: https://github.com/lparolari/weakvtg
User: lparolari
visual-grounding,A collection of resources (work logs, state-of-the-art scores, experiment trace, scripts and proof-of-concepts) for my MS thesis "Weakly Supervised Visual-Textual Grounding based on Concept Similarity" - https://github.com/lparolari/weakvtg
User: lparolari
visual-grounding,A quasi-final short and summary report on my thesis "Weakly Supervised Visual-Textual Grounding based on Concept Similarity". (MS thesis at University of Padua, Italy). - https://github.com/lparolari/weakvtg
User: lparolari
visual-grounding,PyTorch implementation of the model described my MS thesis: "Weakly Supervised Visual-Textual Grounding based on Concept Similarity" (https://github.com/lparolari/master-thesis)
User: lparolari
visual-grounding,A list of research papers on knowledge-enhanced multimodal learning
User: marialymperaiou
visual-grounding,paper list of robotic grasping and some related works
User: rhett-chen
visual-grounding,Shortened version of the final exam for the Deep Learning course of the University of Trento in 2023.
User: rorosonoio
visual-grounding,SeqTR: A Simple yet Universal Network for Visual Grounding
User: seanzhuh
Home Page: https://arxiv.org/abs/2203.16265
visual-grounding,This is a deep learning project focused on the visual grounding task
User: tarasrashkevych99
visual-grounding,awesome grounding: A curated list of research papers in visual grounding
User: theshadow29
visual-grounding,[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
User: theshadow29
visual-grounding,Implementation of Master Thesis on "Belief State for Visually Grounded, Task-Oriented Neural Dialogue Model"
User: timbmg
visual-grounding,[CVPR 2024] Code for "Improved Visual Grounding through Self-Consistent Explanations".
Organization: uvavision
Home Page: https://catherine-r-he.github.io/SelfEQ/
visual-grounding,Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
User: yangli18
visual-grounding,[CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
User: yanmin-wu
visual-grounding,HAIS_2GNN: 3D Visual Grounding with Graph and Attention
User: yuechengithub
visual-grounding,Explore new research topics, visual grounding
User: zhenzhao
visual-grounding,[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning
Organization: zjukg
Home Page: https://arxiv.org/abs/2207.01328
visual-grounding,[ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds
User: zlccccc
visual-grounding,[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
User: zlccccc
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.