Giter Site home page Giter Site logo

Comments (13)

YuwenXiong avatar YuwenXiong commented on June 24, 2024

Sorry, I made a silly mistake. it seems that you didn't use OHEM, and please modify this line: https://github.com/Orpine/py-R-FCN/blob/master/models/pascal_voc/ResNet-101/rfcn_end2end/train_agonistic.prototxt#L6929 to bottom: "res4b22"

from py-r-fcn.

brisker avatar brisker commented on June 24, 2024

@orpine
I use ResNet-50, how to fix this?

from py-r-fcn.

YuwenXiong avatar YuwenXiong commented on June 24, 2024

@brisker You need to change https://github.com/Orpine/py-R-FCN/blob/master/models/pascal_voc/ResNet-50/rfcn_end2end/test_agonistic.prototxt#L3532 to bottom: res5c, but the real reason is that I forgot to modify https://github.com/Orpine/py-R-FCN/blob/master/models/pascal_voc/ResNet-50/rfcn_end2end/train_agonistic.prototxt#L3532 to bottom: res4f

from py-r-fcn.

brisker avatar brisker commented on June 24, 2024

@orpine
I modified following you advice, but new error occurs:
F1014 13:34:19.352131 5063 net.cpp:784] Cannot copy param 0 weights from layer 'rfcn_cls'; shape mismatch. Source param shape is 1029 1024 1 1 (1053696); target param shape is 98 1024 1 1 (100352). To learn this layer's parameters from scratch rather than copying from a saved net, rename the layer.
*** Check failure stack trace: ***

from py-r-fcn.

YuwenXiong avatar YuwenXiong commented on June 24, 2024

@brisker Please upload your prototxt (both train and test) and log to https://gist.github.com/, let me check it.

from py-r-fcn.

brisker avatar brisker commented on June 24, 2024

@orpine
here https://gist.github.com/brisker/66bb0775defb82e9b4255727b6eba887

from py-r-fcn.

YuwenXiong avatar YuwenXiong commented on June 24, 2024

Seems like those prototxts have no problem. I wonder if you use my demo model to finetune? That will raise a error since my demo model contains a rfcn_cls layer whose output channel is 1029. You should check Preparation for Training & Testing step 7, download ResNet-50 and ResNet-101 imagenet pretrained model manually(from https://github.com/KaimingHe/deep-residual-networks).

Another solution is that you could change the layer name from rfcn_cls to any other name, like rfcn_cls_binary, then Caffe will reinitialize this layer rather than try to copy weights. You also need to rename rfcn_bbox since I modified this layer's weights when I snapshot, you cannot continue training on the weights.

from py-r-fcn.

Simonhong111 avatar Simonhong111 commented on June 24, 2024

hello ,my graphics cars is GTX 1060 6GB.however when I run the code on windows ,I got the error lilke "error == cudaSuccess (2 vs. 0) out of memory".what should I do ,and the modification will result a lower accuracy.thanks

from py-r-fcn.

YuwenXiong avatar YuwenXiong commented on June 24, 2024

Hi @Simonhong111 , there maybe several reason for your situation. The first one is you must use cudnn to reduce your GPU memory cost. And you also need to exit any application that may occupy large GPU memory. On my machine R-FCN with ResNet-101 will use 5.5GB GPU memory. I think it is possible for GTX 1060 to run it. Otherwise you could try ResNet-50.

from py-r-fcn.

jhung0 avatar jhung0 commented on June 24, 2024

Should I change https://github.com/Orpine/py-R-FCN/blob/master/models/try1/ResNet-50/rfcn_end2end/class-aware/train_ohem.prototxt#L3532 as well? Getting the same error as OP.

from py-r-fcn.

dantp-ai avatar dantp-ai commented on June 24, 2024

@orpine I have tried the second approach where rfcn_cls and rfcn_bbox need to be renamed. However I am intersted also in trying out the first approach, but I can not find Preparation for Training & Testing Step 7 on the mentioned URL ?

from py-r-fcn.

YuwenXiong avatar YuwenXiong commented on June 24, 2024

@plopd Preparation for Training & Testing Step 7 is on https://github.com/Orpine/py-R-FCN/blob/master/README.md, the mentioned URL is to help you find where to download them.

from py-r-fcn.

foralliance avatar foralliance commented on June 24, 2024

@YuwenXiong
"Another solution is that you could change the layer name from rfcn_cls to any other name, like rfcn_cls_binary, then Caffe will reinitialize this layer rather than try to copy weights. You also need to rename rfcn_bbox since I modified this layer's weights when I snapshot, you cannot continue training on the weights."
What does this passage mean and how should it be understood?
many many thanks

from py-r-fcn.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.