Data Format for Multi-label/ Bounding Box

101 views
Skip to first unread message

Travis

unread,
Jun 19, 2015, 3:18:06 PM6/19/15
to caffe...@googlegroups.com
Hi,

I have just started using Caffe, and I am trying to solve the following problem:

My input file looks like:

Image_filename_1  x1, y1, x2, y2
Image_filename_2  x1, y1, x2, y2

where Image_filename is the name of my image file. All the images belong to the same category, eg: cats.
x1,y1,x2,y2 are the co-ordinates of the bounding box of the image normalized between 0-1.

My aim is to predict the bounding box using Caffe and then to calculate the EuclideanLoss in the output.
The goal is to output the bounding box co-ordinates for new images.

Question:
What will be the best data format to be used? lmdb or hdf5?
  • Is this possible with lmdb format? If yes, will the 4 co-ordinates be treated as 4 labels? Can we convert the above text file with convert_imageset.cpp?
  • How can I convert my above file and image folder into hdf5 format?



Reply all
Reply to author
Forward
0 new messages