Comments (13)
Sorry, I made a silly mistake. it seems that you didn't use OHEM, and please modify this line: https://github.com/Orpine/py-R-FCN/blob/master/models/pascal_voc/ResNet-101/rfcn_end2end/train_agonistic.prototxt#L6929 to bottom: "res4b22"
from py-r-fcn.
@orpine
I use ResNet-50, how to fix this?
from py-r-fcn.
@brisker You need to change https://github.com/Orpine/py-R-FCN/blob/master/models/pascal_voc/ResNet-50/rfcn_end2end/test_agonistic.prototxt#L3532 to bottom: res5c
, but the real reason is that I forgot to modify https://github.com/Orpine/py-R-FCN/blob/master/models/pascal_voc/ResNet-50/rfcn_end2end/train_agonistic.prototxt#L3532 to bottom: res4f
from py-r-fcn.
@orpine
I modified following you advice, but new error occurs:
F1014 13:34:19.352131 5063 net.cpp:784] Cannot copy param 0 weights from layer 'rfcn_cls'; shape mismatch. Source param shape is 1029 1024 1 1 (1053696); target param shape is 98 1024 1 1 (100352). To learn this layer's parameters from scratch rather than copying from a saved net, rename the layer.
*** Check failure stack trace: ***
from py-r-fcn.
@brisker Please upload your prototxt (both train and test) and log to https://gist.github.com/, let me check it.
from py-r-fcn.
@orpine
here https://gist.github.com/brisker/66bb0775defb82e9b4255727b6eba887
from py-r-fcn.
Seems like those prototxts have no problem. I wonder if you use my demo model to finetune? That will raise a error since my demo model contains a rfcn_cls layer whose output channel is 1029. You should check Preparation for Training & Testing
step 7, download ResNet-50 and ResNet-101 imagenet pretrained model manually(from https://github.com/KaimingHe/deep-residual-networks).
Another solution is that you could change the layer name from rfcn_cls
to any other name, like rfcn_cls_binary
, then Caffe will reinitialize this layer rather than try to copy weights. You also need to rename rfcn_bbox
since I modified this layer's weights when I snapshot, you cannot continue training on the weights.
from py-r-fcn.
hello ,my graphics cars is GTX 1060 6GB.however when I run the code on windows ,I got the error lilke "error == cudaSuccess (2 vs. 0) out of memory".what should I do ,and the modification will result a lower accuracy.thanks
from py-r-fcn.
Hi @Simonhong111 , there maybe several reason for your situation. The first one is you must use cudnn to reduce your GPU memory cost. And you also need to exit any application that may occupy large GPU memory. On my machine R-FCN with ResNet-101 will use 5.5GB GPU memory. I think it is possible for GTX 1060 to run it. Otherwise you could try ResNet-50.
from py-r-fcn.
Should I change https://github.com/Orpine/py-R-FCN/blob/master/models/try1/ResNet-50/rfcn_end2end/class-aware/train_ohem.prototxt#L3532 as well? Getting the same error as OP.
from py-r-fcn.
@orpine I have tried the second approach where rfcn_cls
and rfcn_bbox
need to be renamed. However I am intersted also in trying out the first approach, but I can not find Preparation for Training & Testing Step 7
on the mentioned URL ?
from py-r-fcn.
@plopd Preparation for Training & Testing Step 7
is on https://github.com/Orpine/py-R-FCN/blob/master/README.md, the mentioned URL is to help you find where to download them.
from py-r-fcn.
@YuwenXiong
"Another solution is that you could change the layer name from rfcn_cls to any other name, like rfcn_cls_binary, then Caffe will reinitialize this layer rather than try to copy weights. You also need to rename rfcn_bbox since I modified this layer's weights when I snapshot, you cannot continue training on the weights."
What does this passage mean and how should it be understood?
many many thanks
from py-r-fcn.
Related Issues (20)
- how to solve the lib reimplement for minibatch size >1 error?
- error use caffe-Microsoft
- I changed it to a single category detection and the loss become NONE in train time after 1000 iter
- about lib/fast_rcnn/train.py
- about multi-scale traing/testing
- ValueError: total size of new array must be unchanged
- Is the backbone network never fine-tuned during training? HOT 1
- Why BN/Scale layers are not included in fine-tune?
- Issue with building microsoft caffe
- a implementation of light head rcnn by modifying R-FCN, low mAP
- train ResNet-50 Alternating training Erroe HOT 3
- when i train rfcn_end2end,there is an error in proposal_target_layer.py
- eltwise_layer.cpp:34 check failed: bottom[i]->shape() == bottom[0]->shape(). error
- undefined reference to symbol 'cudaSetDevice'
- About depth of RoI-wise subnetwork
- How to set iterations when training with rfcn_alt_opt_5stage_ohem
- How to deal with gt_boxes zero sample?>
- Could you give me some hints to fix this issue? Thanks a lot!
- question
- Message type "caffe.LayerParameter" has no field named "psroi_pooling_param".
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from py-r-fcn.