Comments (4)
Hello Eden!
First of all, thank you for your interest in our work.
Yes, you are right that positives and negatives are distinguished with the ground-truth labels.
However, as the higher-level aligned space is only needed to infer the fine-grained relationship (clip-word) that is utilized to train the attention weights, we do not use the clip-word correlation learner during the inference.
You can find this detail in our main figure (Fig. 2) that the lines linked to the clip-word correlation learner are dotted and also the contour of the box which denotes only used in training.
If have more questions, please feel free.
Thank you.
from cgdetr.
Thank you for replying so quickly.
Ah, I was careless, I get it. By the way, can I understand Clip-Word Correlation Learner as an auxiliary strategy to better learn dummy tokens?
Thank you.
from cgdetr.
Dummy tokens are mostly supervised with the saliency scores.
Although the clip-word correlation learner may provide further training guidance to learn the dummy tokens, its main objective is to calibrate the attention between each clip and word.
Thank you
from cgdetr.
OK. I see. Thank you.
Best wishes to you.
from cgdetr.
Related Issues (11)
- About the timing of releasing implementation codes HOT 2
- Customized dataset feature extraction in Charades-STA style HOT 6
- The result on moment retrieval datasets TACoS is from val datasets or test dataset? HOT 1
- 你好,感谢您的工作,根据您开源的代码程序和数据,我进行了复现,qvhighlights训练的val ,有些指标还是差异比较大的 HOT 3
- About Dummy Tokens. HOT 5
- Questions about some details? HOT 1
- Moment-adaptive Saliency Token Generator: Cross-Attention HOT 1
- Unable to reproduce the experimental results in your paper HOT 3
- Question about pre-training HOT 1
- Failed to find metrics for "QVHighlights only CLIP" model. HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cgdetr.