ucbdrive / dla
Code for the CVPR 2018 Oral Paper "Deep Layer Aggregation"
Home Page: https://www.vis.xyz/pub/dla/
License: BSD 3-Clause "New" or "Revised" License
In your paper, Figure 6 shows the results for all fine-grained datasets. However, when I tried to reproduce the results on the bird dataset, I could only reach 53% accuracy. Did you use a pre-trained model for that training? And could you post code that would help reproduce the paper's results? Best regards.
Hi, I'm trying to use dla with my own dataset but I don't really understand the eigenvalues and eigenvectors. How are they calculated/what does "image pixel variations" mean?
Thank you!
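One common reading, which is an assumption here and not something the repository confirms, is that these values come from a PCA of the RGB pixel values over the training set, as used in AlexNet-style lighting augmentation. A minimal NumPy sketch under that assumption follows; `rgb_pca` and the random `images` array are hypothetical stand-ins.

```python
import numpy as np

def rgb_pca(images):
    """PCA of RGB pixel values.

    images: float array of shape (N, H, W, 3).
    Returns (eigval, eigvec): eigval has shape (3,), sorted in descending
    order; eigvec has shape (3, 3) with one eigenvector per column.
    """
    pixels = images.reshape(-1, 3)          # one row per pixel
    cov = np.cov(pixels, rowvar=False)      # 3x3 covariance of R, G, B
    eigval, eigvec = np.linalg.eigh(cov)    # ascending order for symmetric input
    order = np.argsort(eigval)[::-1]
    return eigval[order], eigvec[:, order]

rng = np.random.default_rng(0)
images = rng.random((8, 16, 16, 3))         # stand-in for a real dataset
eigval, eigvec = rgb_pca(images)
```

Under this reading, "image pixel variations" would be the directions in RGB space along which pixel colors vary most across the dataset.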
I've taken a look at the '.pth' file and found two .project.weight entries when the depth of a tree is 2.
This is for dla34:
According to the figure in your paper and your code, self.project comes after downsample and appears only once within a single level, so why are there two .project.weight entries in the .pth file?
I hope to hear from you soon.
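One possible explanation, sketched below, assumes that `Tree` is built recursively and that every subtree whose input and output channel counts differ creates its own projection. In that case a depth-2 tree contains a nested depth-1 tree with the same channel change. The function `project_keys`, the `level3` prefix, and the 64 to 128 channel counts are illustrative and not read from the checkpoint.

```python
def project_keys(levels, in_channels, out_channels, prefix):
    """List the projection weight names of a recursively built tree.

    Assumption: a node owns a projection whenever its input and output
    channel counts differ, and a node with levels > 1 holds two subtrees,
    tree1 (in -> out) and tree2 (out -> out).
    """
    keys = []
    if in_channels != out_channels:
        keys.append(prefix + ".project.weight")
    if levels > 1:
        keys += project_keys(levels - 1, in_channels, out_channels, prefix + ".tree1")
        keys += project_keys(levels - 1, out_channels, out_channels, prefix + ".tree2")
    return keys

keys = project_keys(2, 64, 128, "level3")
print(keys)  # ['level3.project.weight', 'level3.tree1.project.weight']
```

Under these assumptions the second entry belongs to the inner `tree1`, not to a second projection at the same level.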
Here is how to visualize the model
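from tensorboardX import SummaryWriter
import torch
from dla import dla34

model = dla34(pretrained=None)
model.eval()  # switch to inference mode before running and tracing the model
image = torch.rand(2, 3, 224, 224)  # dummy batch of two RGB images
x = model(image)
with SummaryWriter(comment='LinearInLinear') as w:
    # The third positional argument enables verbose output.
    w.add_graph(model, image, True)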
In the forward method of the Tree class in dla.py, the residual argument is always ignored (it is overridden on line 209).
Is this intended?
If it is, then building project in the constructor and calling its forward method on line 209 is useless when levels > 1. Some computation could be saved.
Also, the bottom computation is unused if levels > 1 and level_root is false.
Does the pre-trained DLA model expect input images in BGR or RGB channel order? Thanks.
The link http://dl.yf.io/dla/models/ to download pretrained models seems to be down.
Is there a bug in DLA_UP? It looks like the parameters of proj in IDAUP are never initialized.
Could you include a license for this repo? Thank you!
The paper mentions DLA, IDA and HDA.
But how are DLAUp and IDAUp in your code different? Can you provide some reference please?
Thanks.
Hello,
Is this repo maintained? I have a few questions regarding the DLA-X46C backbone.
Thanks.
Hello, where can I find the slides for this paper?
Hi,
I am not quite sure whether the following initialization of the deconv layer is a valid implementation of a bilinear filter for all input feature channels:
import math

def fill_up_weights(up):
    w = up.weight.data
    f = math.ceil(w.size(2) / 2)
    c = (2 * f - 1 - f % 2) / (2. * f)
    # Fill the first kernel with bilinear interpolation weights.
    for i in range(w.size(2)):
        for j in range(w.size(3)):
            w[0, 0, i, j] = \
                (1 - math.fabs(i / f - c)) * (1 - math.fabs(j / f - c))
    # Copy that kernel along dim 0 only; index 0 is always used for dim 1.
    for c in range(1, w.size(0)):
        w[c, 0, :, :] = w[0, 0, :, :]
It only ever writes to index 0 of dim 1 of w, so any other slices along that dimension are left uninitialized.
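Whether this matters depends on the weight shape. A transposed convolution in PyTorch stores its weight as (in_channels, out_channels / groups, kH, kW), so if `up` is built with `groups` equal to the channel count (an assumption to check against how `up` is constructed), dim 1 has size 1 and index 0 covers it entirely. The pure-Python sketch below reproduces the kernel formula from the snippet and fills every slot of a nested-list weight regardless of shape; `bilinear_kernel` and `fill_all` are hypothetical helper names.

```python
import math

def bilinear_kernel(size):
    """Return a size x size bilinear interpolation kernel as nested lists."""
    f = math.ceil(size / 2)
    center = (2 * f - 1 - f % 2) / (2.0 * f)
    return [[(1 - abs(i / f - center)) * (1 - abs(j / f - center))
             for j in range(size)]
            for i in range(size)]

def fill_all(weight_shape):
    """Build a nested-list weight with the same kernel in every (dim 0, dim 1) slot."""
    dim0, dim1, k, _ = weight_shape
    kernel = bilinear_kernel(k)
    return [[[row[:] for row in kernel] for _ in range(dim1)]
            for _ in range(dim0)]

kernel = bilinear_kernel(4)
weights = fill_all((3, 1, 4, 4))  # shape of a grouped deconv with 3 channels
print(kernel[0])  # [0.0625, 0.1875, 0.1875, 0.0625]
```

For an ungrouped layer, where dim 1 is larger than 1, filling every slot this way would mix channels, so the right fix depends on the intended layer type.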
Thanks for your great work.
Some users have reported that the pretrained model links no longer work. Could you please fix this?
Hello, I want to apply your proposed model to other fields, and I need a model pre-trained on ImageNet. My hardware cannot handle retraining the model from scratch. Could you please provide a pre-trained model? Thank you.
Hi, as of February 9th, 2024, the server for http://dl.yf.io/dla/models seems to be down again. Could you bring it back up if possible?
Hello,
Have you run any experiments to validate the effectiveness on object detection?