You do not have permission to delete messages in this group
Copy link
Report message
Sign in to report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Caffe Users
I'm trying to execute create_imagenet.sh script on ilsvrc12 dataset. The ilsvrc12 training set is 139 Gb, but when I create the train_lmdb, its size blew up to more than 550Gb and I was out of storage.
Just wondering where they is a 4x+ storage requirement, and if I'm not doing anything wrong, how big of an hard drive do I need?
Thanks!
Yan
Christos Nikolaou
unread,
Aug 14, 2015, 7:24:21 PM8/14/15
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Sign in to report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Caffe Users
In my system the ilsvrc12_train_lmdb is 240 Gb and the ilsvrc12_val_lmdb is 9.4 Gb (after running du -h), whereas the space required for the JPEG images is equal to what you mentioned (139 Gb). I do not know why your train_lmdb is so much larger and I do not thinl that it makes sense at all to be that large.
Christos Nikolaou
unread,
Aug 14, 2015, 7:29:38 PM8/14/15
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Sign in to report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
You do not have permission to delete messages in this group
Copy link
Report message
Sign in to report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Caffe Users
Thanks Christos, did you use the encoding option for generating your 240Gb lmdb? What command are you using to generate it, presuming that you're using $CAFFE_HOME/example/imagenet/create_imagenet.sh?
Cheers,
Yan
Yan Ma
unread,
Aug 14, 2015, 10:43:12 PM8/14/15
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Sign in to report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Caffe Users
Also note that I'm using the NVIDIA fork of Caffe, will try the BVLC repo and get back to you.
Cheers,
Yan
Yan Ma
unread,
Aug 15, 2015, 12:17:53 AM8/15/15
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Sign in to report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Caffe Users
Found the answer. Turns out the create_imagenet.sh script does not enforce resize on default. After I set the resize parameter to true, the result corresponds to what Christos described. Thanks!
Yan
Christos Nikolaou
unread,
Aug 15, 2015, 7:51:16 AM8/15/15
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Sign in to report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Caffe Users
I have made a new PR on the Github to the project to clarify those things for future users. Hope it helps when it will be merged.