Faster-RCNN bbox/image normalization

67 views
Skip to first unread message

Norbert Nyakó

unread,
Dec 27, 2016, 4:45:18 PM12/27/16
to Caffe Users
Hi,

I am playing with py-faster-rcnn on a custom dataset (about 3000 images, 7 different classes, including the background), following these tutorials:


I am using the end2end solution with VGG16 network. Everything works fine, expect my results so I have some questions:

- What kind of normalizations are needed on the images and on the bbox annotations?
- It is similar to the previous question: There are two config options: BBOX_NORMALIZE_TARGETS and BBOX_NORMALIZE_TARGETS_PRECOMPILED. Should I calculate the mean and std before the training and use these options for bbox normalization?
- I modified the num_output at the cls_score and bbox_pred layers (according to this thread: https://github.com/rbgirshick/py-faster-rcnn/issues/1), but in the end2end solution there are rpn_cls_score and rpn_bbox_pred layers too. Should I modify the num_outputs of these too? If I should then how could I calculate the number of outputs for 7 classes?

Thank you in advance!

Br,
Norbert
Reply all
Reply to author
Forward
0 new messages