Giter Site home page Giter Site logo

py-r-fcn's People

Contributors

daijifeng001 avatar eakbas avatar haozhiqi avatar liyi14 avatar timonzimm avatar yuwenxiong avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

py-r-fcn's Issues

How to train the py-R-FCN using my own data?

I am trying to train the py-R-FCN using my own data, and my data has only 2 categories. I changed the codes referring to faster-RCNN.
But what should I do with train_agonistic.prototxt and test_agonistic.prototxt? I have changed param_str: "'num_classes': 21 to param_str: "'num_classes': 3 in line 11 in train_agonistic.prototxt.
Should I change num_output: 1029 #21_(7^2) cls_num_(score_maps_size^2) in line 3736 and output_dim: 21 in line 3790?
Anything else?
Thank you!

RFCN with python layers on Windows

Static libcaffe.lib on windows can built following RFCN caffe branch. However, it reported PythonLayer cannot be initialized during importing libcaffe.lib using C++ programming on external project. Any idea? Thanks.

Segmentation fault when running demo in CPU mode

./tools/demo_rfcn.py --cpu

leads to segmentation fault, after these lines

I0922 20:40:52.680196  4690 net.cpp:771] Ignoring source layer silence
I0922 20:40:52.680202  4690 net.cpp:771] Ignoring source layer loss
I0922 20:40:52.680209  4690 net.cpp:771] Ignoring source layer accuarcy
I0922 20:40:52.680217  4690 net.cpp:771] Ignoring source layer loss_bbox

I don't have GPU :(.

About running it on Ubuntu

I followed most of your steps(caffe is copyed from py-faster-rcnn), yet it shows :

[libprotobuf ERROR google/protobuf/text_format.cc:245] Error parsing text-format caffe.NetParameter: 7106:25: Message type "caffe.LayerParameter" has no field named "psroi_pooling_param".
WARNING: Logging before InitGoogleLogging() is written to STDERR
F1106 19:37:52.764528  5270 upgrade_proto.cpp:68] Check failed: ReadProtoFromTextFile(param_file, param) Failed to parse NetParameter file: /home/yyc/Documents/py-R-FCN-master/models/pascal_voc/ResNet-101/rfcn_end2end/test_agnostic.prototxt
*** Check failure stack trace: ***
Aborted (core dumped)

Am I doing it wrong? I looked up "psroi_pooling", yet I didn't find the layer's definition anywhere, plz help~

The OHEM question

I read the codes and I'm confused in OHEM, you just put all the proposal rois (total 300 + gt number)into the next step in end2end training. When you sort the roi loss and hard mining the proposal rois examples? Thank you!

Segmentation fault in cpu mode.

I have successfully train the rfcn model and test in gpu model. But when i run the demo in cpu model the program will core dump with "Segmentation fault". Can anyone help me about this problem?

the gdb error info is:

#0 0x00007ffff6e44653 in __memcpy_ssse3_back () from /usr/lib64/libc.so.6
#1 0x00007fffe368c848 in caffe::ScaleLayer::Forward_cpu(std::vector<caffe::Blob, std::allocator<caffe::Blob> > const&, std::vector<caffe::Blob, std::allocator<caffe::Blob> > const&) ()

./lib/datasets/voc_eval.py BB = BB [sorted_ind, :] IndexError: too many indices

Hi @orpine , I want to train my own dataset with one target, which is just a binary classification problem. I modify the prototxt in /models/pascal_voc/ResNet-50/rfcn_alt_opt_5step_ohem/ and change each cls-num from 21 to 2, then I run the ./experiments/scripts/frcn_alt_opt_5stage_ohem.sh 0 ResNet-50 pascal_voc.
The training is very successful and the loss seems to decrease, while in the test_net.py, I meet the error:

./lib/datasets/voc_eval.py BB = BB [sorted_ind, :]
IndexError: too many indices

and I try to print the value of BB and sorted_ind, and both of them seems empty
I am wondering how to solve the problem, and can anyone give me some hints on how to solve the problem?

SystemError: NULL result without error in PyObject_Call

Traceback (most recent call last):
File "/home/deepinsight/py-R-FCN/tools/../lib/roi_data_layer/layer.py", line 15, in
from roi_data_layer.minibatch import get_minibatch
File "/home/deepinsight/py-R-FCN/tools/../lib/roi_data_layer/minibatch.py", line 12, in
import cv2
ImportError: /usr/local/lib/libopencv_ocl.so.2.4: undefined symbol: _ZN2cv16TLSDataContainerD2Ev
Traceback (most recent call last):
File "./tools/train_net.py", line 112, in
max_iters=args.max_iters)
File "/home/deepinsight/py-R-FCN/tools/../lib/fast_rcnn/train.py", line 202, in train_net
pretrained_model=pretrained_model)
File "/home/deepinsight/py-R-FCN/tools/../lib/fast_rcnn/train.py", line 43, in init
self.solver = caffe.SGDSolver(solver_prototxt)
SystemError: NULL result without error in PyObject_Call

Why the length of output bbox is 8 ?

Thanks for this great work first.
I have question about the network structure, this is, why the length of output bbox is 8? If we use class-agonistic bbox regression, the length of bbox vector should be 4. Are the first 4 values regressed for background, and the last 4 values regressed for objects?

availability of demo code

Good work to make python code available.
I have trained Resnet50 and like to check performance of some images.
Do you know when the demo code will be available?

Thanks

About position-sensitive score maps

How the position-sensitive score maps generate? I don not find anything in detail in paper and author just say that use a bank of specialized convolutional layers as the FCN output.

Thank you very much.

py-faster-rcnn support multiple imdb with --imdb voc_2007_trainval+voc_2012_trainval

Since py-faster-rcnn does not support multiple training datasets, we need to merge VOC 2007 data and VOC 2012 data manually. Just make a new directory named VOC0712, put all subfolders except ImageSets in VOC2007 and VOC2012 into VOC0712(you'll merge some folders). I provide a merged-version ImageSets folder for you, please put it into VOCdevkit/VOC0712/

To my knowledge, py-faster-rcnn support multiple imdb with --imdb voc_2007_trainval+voc_2012_trainval

train voc0712 error

layer {
  name: "res4c_branch2b_relu"
  type: "ReLU"
  bottom: "res4c_branch2b"
  top: "res4c
I0107 03:54:09.780325  9558 layer_factory.hpp:77] Creating layer input-data
./experiments/scripts/rfcn_end2end.sh: line 57:  9558 Segmentation fault      (core dumped) ./tools/train_net.py --gpu ${GPU_ID} --solver models/${PT_DIR}/${NET}/rfcn_end2end/solver.prototxt --weights data/imagenet_models/${NET}-model.caffemodel --imdb ${TRAIN_IMDB} --iters ${ITERS

Question about 'RPN' location!

Thanks for this great work.
A question here, as in faster rcnn work itself and its implementation in ResNet paper, 'RPN' layer is inserted right after Res4X, but in your implementation, you insert it right after Res5X, will it affect final results?

training error

Hello,

While training I am getting the following error

I0204 03:08:08.855443 21819 solver.cpp:228] Iteration 920, loss = 0.266415
I0204 03:08:08.855478 21819 solver.cpp:244] Train net output #0: accuarcy = 0.941176
I0204 03:08:08.855485 21819 solver.cpp:244] Train net output #1: loss_bbox = 0.000348185 (* 1 = 0.000348185 loss)
I0204 03:08:08.855489 21819 solver.cpp:244] Train net output #2: loss_cls = 0.202786 (* 1 = 0.202786 loss)
I0204 03:08:08.855494 21819 solver.cpp:244] Train net output #3: rpn_cls_loss = 0.0552239 (* 1 = 0.0552239 loss)
I0204 03:08:08.855497 21819 solver.cpp:244] Train net output #4: rpn_loss_bbox = 0.0136765 (* 1 = 0.0136765 loss)
I0204 03:08:08.855501 21819 sgd_solver.cpp:106] Iteration 920, lr = 0.001
Traceback (most recent call last):
File "./tools/train_net.py", line 112, in
max_iters=args.max_iters)
File "/home/sandeep/workspace/py-R-FCN/tools/../lib/fast_rcnn/train.py", line 205, in train_net
model_paths = sw.train_model(max_iters)
File "/home/sandeep/workspace/py-R-FCN/tools/../lib/fast_rcnn/train.py", line 146, in train_model
self.solver.step(1)
File "/home/sandeep/workspace/py-R-FCN/tools/../lib/rpn/proposal_target_layer.py", line 66, in forward
rois_per_image, self._num_classes)
File "/home/sandeep/workspace/py-R-FCN/tools/../lib/rpn/proposal_target_layer.py", line 185, in _sample_rois
fg_inds = npr.choice(fg_inds, size=fg_rois_per_this_image, replace=False)
File "mtrand.pyx", line 1176, in mtrand.RandomState.choice (numpy/random/mtrand/mtrand.c:18822)
TypeError: 'numpy.float64' object cannot be interpreted as an index

Please help me...

Thanks in advance

What is the benefit of using OHEM?

As author said, OHEM need all rois to select the hard examples.
What is the benefit of using OHEM specifically? And how the effect and efficiency changed when using OHEM?

Error: Message type "caffe.LayerParameter" has no field named "psroi_pooling_param".

W1221 22:46:14.682919 8972 _caffe.cpp:125] Net('/RFCN_root/py-R-FCN/models/pascal_voc/ResNet-101/rfcn_end2end/test_agnostic.prototxt', 1, weights='/RFCN_root/py-R-FCN/data/rfcn_models/resnet101_rfcn_final.caffemodel')
[libprotobuf ERROR google/protobuf/text_format.cc:245] Error parsing text-format caffe.NetParameter: 7106:25: Message type "caffe.LayerParameter" has no field named "psroi_pooling_param".
F1221 22:46:14.688835 8972 upgrade_proto.cpp:88] Check failed: ReadProtoFromTextFile(param_file, param) Failed to parse NetParameter file: /RFCN_root/py-R-FCN/models/pascal_voc/ResNet-101/rfcn_end2end/test_agnostic.prototxt
*** Check failure stack trace: ***
Aborted (core dumped)

Got the above error, any idea how to solve it ?

Does this project support trained by multiple GPUs?

I wander whether your project can be trained by multiple GPUs?
I have tried to use the command like --gpu 0,1,2 but it seems it does not support this way.
But in Dai's paper, they use 8 GPUs to train the model.

How to train on own dataset?

Hi, i'm trying to train on my own dataset with different classes, when i revised the train.prototxt, i just revised three place[new conv layer:num_output; input_data:num_classes],is that right?

What is the benefit of using OHEM?

In readme, author said that OHEM need all rois to select the hard examples.
What is the benefit of using OHEM specifically?And how the Efficiency and effectiveness will changed?
Thank you.

IOError: [Errno 2] No such file or directory

4.Since py-faster-rcnn does not support multiple training datasets, we need to merge VOC 2007 data and VOC 2012 data manually. Just make a new directory named VOC0712, put all subfolders except ImageSets in VOC2007 and VOC2012 into VOC0712(you'll merge some folders). I provide a merged-version ImageSets folder for you, please put it into VOCdevkit/VOC0712/.
when i follow this step and marged with provided Imageset than i got this error

'USE_GPU_NMS': True}
Loaded dataset voc_0712_trainval for training
Set proposal method: gt
Appending horizontally-flipped training examples...
voc_0712_trainval gt roidb loaded from /home/user01/Music/fc/py-R-FCN/data/cache/voc_0712_trainval_gt_roidb.pkl
Traceback (most recent call last):
File "./tools/train_net.py", line 104, in
imdb, roidb = combined_roidb(args.imdb_name)
File "./tools/train_net.py", line 69, in combined_roidb
roidbs = [get_roidb(s) for s in imdb_names.split('+')]
File "./tools/train_net.py", line 66, in get_roidb
roidb = get_training_roidb(imdb)
File "/home/user01/Music/fc/py-R-FCN/tools/../lib/fast_rcnn/train.py", line 142, in get_training_roidb
imdb.append_flipped_images()
File "/home/user01/Music/fc/py-R-FCN/tools/../lib/datasets/imdb.py", line 111, in append_flipped_images
assert (boxes[:, 2] >= boxes[:, 0]).all()
AssertionError

but if i use the Imageset from VOC2007 than after finish all ite i got this error,does anyone face same problem?

im_detect: 4948/4952 0.097s 0.001s
im_detect: 4949/4952 0.097s 0.001s
im_detect: 4950/4952 0.097s 0.001s
im_detect: 4951/4952 0.097s 0.001s
im_detect: 4952/4952 0.097s 0.001s
Evaluating detections
Writing aeroplane VOC results file
Traceback (most recent call last):
File "./tools/test_net.py", line 90, in
test_net(net, imdb, max_per_image=args.max_per_image, vis=args.vis)
File "/home/user01/Music/fc/py-R-FCN/tools/../lib/fast_rcnn/test.py", line 298, in test_net
imdb.evaluate_detections(all_boxes, output_dir)
File "/home/user01/Music/fc/py-R-FCN/tools/../lib/datasets/pascal_voc.py", line 321, in evaluate_detections
self._write_voc_results_file(all_boxes)
File "/home/user01/Music/fc/py-R-FCN/tools/../lib/datasets/pascal_voc.py", line 248, in _write_voc_results_file
with open(filename, 'wt') as f:
IOError: [Errno 2] No such file or directory: '/home/user01/Music/fc/py-R-FCN/data/VOCdevkit0712/results/VOC0712/Main/comp4_33830d8c-2942-4517-b6ca-6a2beeaa1c62_det_test_aeroplane.txt'

cls_num in prototxt is not same

For example, in /models/pascal_voc/ResNet-101/rfcn_end2end/class-aware/test.prototxt line 7054 and line 7079 , the value of cls_num in line 7054 is 21, while in line 7079 is 8, I don't know why, can anyone explain it ? Thank you !

question about the bottom of rpn_conv/3x3

hi,
For resnet-50, in train_agnostic.prototxt, rpn_conv/3x3 is connected to res5c, while in the ohem version rpn_conv/3x3 is connected to res4f, and in test_agnostic.prototxt, it's connected to res4f as in the ohem version. Is this a typo or something?

What's the benefit of warmup?

It's so nice of you to share the code with us,it's a wonderful work!
I know about ohem which is to select hard examples and re-train them,but what's the use of warmup?
waiting for your apply!

Error when I set IMS_PER_BATCH: 2 in the rfcn_end2end_ohem.yml

I get this error when I set IMS_PER_BATCH: 2 in the rfcn_end2end_ohem.yml
F1025 15:56:37.929327 3406 loss_layer.cpp:19] Check failed: bottom[0]->num() == bottom[1]->num() (2 vs. 1) The data and label should have the same number.

*** Check failure stack trace: ***

Is that other setting I should change when I want to train 2 images per batch?

why does roi_data_layer forward twice in one solver step?

I tried to train rfcn_end2end_ohem.

In /roi_data_layer/layer.py , I added print "forward RoiDataLayer" in forward() (line 146).
In fast-rcnn/train.py, I added
for layer_name, blob in self.solver.net.blobs.iteritems():
print layer_name+'\t'+ str(blob.data.shape)
in train_model() (line 148).

The output is
forward RoiDataLayer
forward RoiDataLayer
I1110 11:00:53.827004 25709 solver.cpp:228] Iteration 0, loss = 3.86447
I1110 11:00:53.827035 25709 solver.cpp:244] Train net output #0: accuarcy = 0
I1110 11:00:53.827045 25709 solver.cpp:244] Train net output #1: loss_bbox = 0.0267997 (* 1 = 0.0267997 loss)
I1110 11:00:53.827050 25709 solver.cpp:244] Train net output #2: loss_cls = 3.04452 (* 1 = 3.04452 loss)
I1110 11:00:53.827055 25709 solver.cpp:244] Train net output #3: rpn_cls_loss = 0.693147 (* 1 = 0.693147 loss)
I1110 11:00:53.827060 25709 solver.cpp:244] Train net output #4: rpn_loss_bbox = 0.111002 (* 1 = 0.111002 loss)
I1110 11:00:53.827067 25709 sgd_solver.cpp:106] Iteration 0, lr = 0.001
data (1, 3, 600, 943)
im_info (1, 3)
gt_boxes (1, 5, 1, 1)
data_input-data_0_split_0 (1, 3, 600, 943)
data_input-data_0_split_1 (1, 3, 600, 943)
im_info_input-data_1_split_0 (1, 3)
im_info_input-data_1_split_1 (1, 3)
gt_boxes_input-data_2_split_0 (1, 5, 1, 1)
gt_boxes_input-data_2_split_1 (1, 5, 1, 1)

It seems that it runs forward twice in one solver step, besides I'm sure IMS_PER_BATCH: 1. why is that ?

Fine Tuning

How to fine tune your model? I don't have sufficient data to retrain your model from scratch.I want to fine tune your model on my data which has only two classes ?

Small documentation issues for train&test ResNet-50 (without OHEM)

Thanks for making this code available. I am trying to train & test ResNet-50 (without OHEM). Here are the issues I have found in the docs:

  • Have to run make in $FRCN_ROOT/lib/ folder for cython_bbox compilation
  • Before running the training script, make sure your add $CAFFE_ROOT/python to your PYTHONPATH environment variable
  • The training script takes an extra argument for the dataset (pascal_voc or coco). This is not documented in the README, but it is documented in the script itself.
  • Despite what is stated in the README, test.agonistic and train.agonistic are set to True by default by the cfgs/rfcn_end2end.yml

AssertionError when running end to end training

Hi I met a problem when running end to end training with pascal dataset.

Traceback (most recent call last):
File "./tools/train_net.py", line 107, in
imdb, roidb = combined_roidb(args.imdb_name)
File "./tools/train_net.py", line 70, in combined_roidb
roidbs = [get_roidb(s) for s in imdb_names.split('+')]
File "./tools/train_net.py", line 67, in get_roidb
roidb = get_training_roidb(imdb)
File "/home/test/xianyan/py-R-FCN/tools/../lib/fast_rcnn/train.py", line 142, in get_training_roidb
imdb.append_flipped_images()
File "/home/test/xianyan/py-R-FCN/tools/../lib/datasets/imdb.py", line 111, in append_flipped_images
assert (boxes[:, 2] >= boxes[:, 0]).all()
AssertionError

training error with layer issues

@orpine
F1013 12:05:22.696523 14673 net.cpp:784] Cannot copy param 0 weights from layer 'rpn_conv/3x3'; shape mismatch. Source param shape is 512 1024 3 3 (4718592); target param shape is 512 2048 3 3 (9437184). To learn this layer's parameters from scratch rather than copying from a saved net, rename the layer.
*** Check failure stack trace: ***

py-RFCN slower on using CUDNN

The testing on GPUs (Nvidia K80) is 2-3 times slower on using CuDNN versus without CuDNN. I am using Cuda 7.5 (and 8.0), CuDNN 5.1 (and 5.0), Python 2.7, on Ubuntu 14.04 server.

I checked everything twice and this behavior is consistent. I don't understand how could this be the case?

can't reproduce the mAP reported in readme, only 28.0 mAP in coco minival

hi:

I have run the traning code without change anyting for three times, but get lower mAP than you report in the readme, here's the details:

VOC: 07+12 trainval ,07test, mine: 79.1 the reported: 79.4
COCO: COCO 2014 train, COCO 2014 minival, mine: 28.0 (with 1920000 iter) the reported: 29.0 (though I test on 2014 minival, I think it won't be much different from the 2014 val )

my test env: python 2.7 , TITAN X (Pascal) 12G,
I also reset git 1a2be8e.
so anything wrong? i wanna know what should i care to reproduce the results.

bests
jemmy li

Pretrain model extract error

Hi Orpine
I download the pretrained model in the OneDrive Link, but when I extract the file, it raise an error about "truncated gzip input". I download the model several times but this error still appears. Is is the problem of model?

About R-FCN+ resnet ensemble

I have noticed that R-FCN+ resnet ensemble have achieved the best result in VOC 2012. So what the difference between resnet and ensenmble? And is R-FCN used in VOC2012 exactly the same with this py-R-FCN?

Use on own dataset

Hi,

How would I go about using this model on my own dataset with different classes? Any pointers?

Thanks!

In demo_rfcn.py, if cfg.TEST.HAS_RPN is set to False, then how do I pass value for blob['rois'] in the _get_blobs function in /py-R-FCN/lib/fast_rcnn/test.py ?

I understand that blobs['rois'] is being set to None. However if None is sent to _get_rois_blob (in the if branch), then another function '_project_im_rois' is giving this (AttributeError: 'NoneType' object has no attribute 'astype') error.

def _get_blobs(im, rois):
"""Convert an image and RoIs within that image into network inputs."""
blobs = {'data' : None, 'rois' : None}

blobs['data'], im_scale_factors = _get_image_blob(im)
if not cfg.TEST.HAS_RPN:
    blobs['rois'] = _get_rois_blob(rois, im_scale_factors)
print blobs['rois'] , im_scale_factors,"<<<<<<<<<<<"
return blobs, im_scale_factors

Extracting features

Without a fc layer, what would be the best way to extract features (for a t-SNE plot for example)? Should I take the res5c layer output and flatten it?

what is the function of box_annotator_layer?

After reading the code inbox_annotator_layer.cu, I still have no idea what is the purpose of this layer? Is it used to select 128 ROIs to backpropagate? If so, why is the top->num not 128?

These are the output of the shapes:
gt_inds (2,)
all_rois shape: (138, 5)
gt_boxes shape: (2, 5)
rois_per_image inf
fg_rois_per_image inf
I1109 21:24:01.311867 32751 box_annotator_ohem_layer.cu:71] bottom_rois 140
I1109 21:24:01.311887 32751 box_annotator_ohem_layer.cu:72] bottom_loss 140
I1109 21:24:01.311890 32751 box_annotator_ohem_layer.cu:73] bottom_labels 140
I1109 21:24:01.311893 32751 box_annotator_ohem_layer.cu:74] num_imgs 1
I1109 21:24:01.311895 32751 box_annotator_ohem_layer.cu:75] roi_per_img_ 128
I1109 21:24:01.311898 32751 box_annotator_ohem_layer.cu:76] top_labels 140
I1109 21:24:01.311916 32751 box_annotator_ohem_layer.cu:77] top_bbox_loss_weights 140
gt_inds (2,)
all_rois shape: (300, 5)
gt_boxes shape: (2, 5)
rois_per_image inf
fg_rois_per_image inf
I1109 21:24:01.395092 32751 box_annotator_ohem_layer.cu:71] bottom_rois 302
I1109 21:24:01.395112 32751 box_annotator_ohem_layer.cu:72] bottom_loss 302
I1109 21:24:01.395129 32751 box_annotator_ohem_layer.cu:73] bottom_labels 302
I1109 21:24:01.395133 32751 box_annotator_ohem_layer.cu:74] num_imgs 1
I1109 21:24:01.395135 32751 box_annotator_ohem_layer.cu:75] roi_per_img_ 128
I1109 21:24:01.395138 32751 box_annotator_ohem_layer.cu:76] top_labels 302
I1109 21:24:01.395140 32751 box_annotator_ohem_layer.cu:77] top_bbox_loss_weights 302
gt_inds (2,)
all_rois shape: (216, 5)
gt_boxes shape: (2, 5)
rois_per_image inf
fg_rois_per_image inf
I1109 21:24:01.517530 32751 box_annotator_ohem_layer.cu:71] bottom_rois 218
I1109 21:24:01.517550 32751 box_annotator_ohem_layer.cu:72] bottom_loss 218
I1109 21:24:01.517554 32751 box_annotator_ohem_layer.cu:73] bottom_labels 218
I1109 21:24:01.517556 32751 box_annotator_ohem_layer.cu:74] num_imgs 1
I1109 21:24:01.517559 32751 box_annotator_ohem_layer.cu:75] roi_per_img_ 128
I1109 21:24:01.517561 32751 box_annotator_ohem_layer.cu:76] top_labels 218
I1109 21:24:01.517563 32751 box_annotator_ohem_layer.cu:77] top_bbox_loss_weights 218

train problem

I modifed the size of position sensitive score map from 7X7 to 3X3 and trained this net 200,000 times with ResNet50, but the mAP of the output model was ~6%. The pre-train-ResNet download from the https://github.com/daijifeng001/R-FCN, I followed your training step. Is there not enough training times?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.