singhjasdeep / attention-on-attention-for-vqa
Visual Question Answering Project with state of the art single Model performance.
License: MIT License
This paper states that L2 normalization of the image features is crucial for good performance. However, you use only the pool5 data, which is average-pooled into a 2048-dimensional vector in generate_tsv.py.
Neither your repository, Attention-on-Attention-for-VQA, nor the feature extractor repository, bottom-up-attention, implements the L2 normalization. I implemented it at the very beginning of the forward procedure with v = v / torch.norm(v, 2), but the validation score decreased by 0.5.
Can anybody explain this? Thanks~
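One thing that may be worth checking: as far as I can tell, torch.norm(v, 2) called without a dim argument returns a single norm over the whole tensor, so v / torch.norm(v, 2) rescales the entire batch by one scalar instead of giving each feature vector unit length. The sketch below uses plain Python lists to show the difference between the two. The function names and the toy values are made up for illustration and are not from the repository; in PyTorch the per-vector version would presumably be torch.nn.functional.normalize(v, p=2, dim=-1). Whether this explains the 0.5 drop is only a guess.

```python
import math

def l2_normalize_global(features):
    """Divide every entry by one norm computed over all entries.

    This mirrors what v / torch.norm(v, 2) appears to do when no dim is given.
    """
    total = math.sqrt(sum(x * x for row in features for x in row))
    return [[x / total for x in row] for row in features]

def l2_normalize_rows(features, eps=1e-12):
    """Scale each feature vector (row) to unit L2 norm independently."""
    normalized = []
    for row in features:
        norm = math.sqrt(sum(x * x for x in row))
        # eps guards against division by zero for all-zero vectors
        normalized.append([x / max(norm, eps) for x in row])
    return normalized

def row_norms(features):
    """Return the L2 norm of every row."""
    return [math.sqrt(sum(x * x for x in row)) for row in features]

# Toy stand-in for two region features (hypothetical values)
features = [[3.0, 4.0], [6.0, 8.0]]

global_scaled = l2_normalize_global(features)
row_scaled = l2_normalize_rows(features)

print(row_norms(global_scaled))  # rows keep different lengths
print(row_norms(row_scaled))     # every row has length close to 1
```

With the global version the relative magnitudes of the vectors are unchanged and the overall scale depends on the batch contents, which is different from the per-feature normalization the paper seems to describe.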
Hello,
I tried to download the image features from https://storage.googleapis.com/bottom-up-attention/trainval_36.zip and https://storage.googleapis.com/bottom-up-attention/test2015_36.zip but the server denies access.
<Error>
<Code>AccessDenied</Code>
<Message>Access denied.</Message>
<Details>Anonymous caller does not have storage.objects.get access to bottom-up-attention/test2015_36.zip.</Details>
</Error>
Hello. I am launching main.py and receiving a memory error (cannot allocate memory) in the very first epoch. I am running on a PC with 64 GB of RAM, a 1080 Ti, and a Core i9. Does this model require more than 64 GB? Thanks in advance for your reply.
As titled, I am not sure whether you can provide a link to a pre-trained model.
Thanks.