Hey there guys!
I'm trying to augment the
Caffe ImageNet Tutorial to train on some custom image sets. My reasoning is this, since ImageNet takes 5 "business days" minimum to allow access to their image sets, I need get my own data or repurpose some other data.
What I've decided to do is to use the
Caltech101 images. So first of all, let me get this straightened out:
I have to (1) create a "synsets.txt" file containing the names of each class. (2) I need to create a "synset_words.txt" file containing the text labels of each number label. (3) I need to create a "train.txt" file with the filepaths of each training image, a space, and a number indicating label identity. (4) Do the same thing to create a "test.txt" but the number label needs to be 0. (5) put some number of each class's members into a folder named "test" and the rest into a folder named "train".
Is that all correct? The current ilsvrc12 auxillary data files has a file called "val.txt" and one called "det_synset_words.txt". I'm not sure what they're for. Do I need to create a version of those for my repurposing of Caltech101?
How the heck do I make an imagenet_mean.binaryproto for my new image set?
What the heck is the "imagenet.bet.pickle" file? I have never seen it mentioned in any tutorials or articles regarding Caffe.
After all that's done, I still need to resize my images, correct? I can use the shell script mentioned in the tutorial above, right?
for name in /path/to/imagenet/val/*.JPEG; do
convert -resize 256x256\! $name $name
done
Or does the "create_imagenet.sh" script do that for me when it runs?
When creating my new "imagenet_mean.binaryproto" file, do I need a second one that relates to the original data set that CaffeNet trained on or am I good picking up training of any image set with a newly made mean proto made just against new data?
Anything else I'll need to change? Anything in the prototxt files? I'm significantly reducing the class against which the original model was trained and I may even be removing or changing the names of many of them (since the image sets were made by different groups of people). What kind of classification results can I expect with this?
Thanks for any help people! I really appreciate anything you can offer!