What do you mean by a step-by-step tutorial?
You can follow the existing CIFAR-10 example to get an idea of how to go about it. Everything in it is fairly self-explanatory.
You are free to do any kind of preprocessing/augmentation you'd like, but at the very least you'd want to zero-mean and normalize your data before converting it to LMDB/LevelDB.
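
To make the zero-mean/normalize step concrete, here is a rough NumPy sketch. It assumes your images are already loaded into a single array of shape (N, C, H, W); the function name `zero_mean_normalize` and the random stand-in data are just for illustration and are not part of the CIFAR-10 example. Writing the result to LMDB/LevelDB is left out.

```python
import numpy as np


def zero_mean_normalize(images, eps=1e-8):
    """Subtract the per-channel mean and divide by the per-channel std.

    images: array of shape (N, C, H, W), any numeric dtype.
    Returns the normalized float32 array plus the mean and std used.
    """
    images = images.astype(np.float32)
    # Statistics are computed over all images and all pixel positions,
    # leaving one value per channel.
    mean = images.mean(axis=(0, 2, 3), keepdims=True)
    std = images.std(axis=(0, 2, 3), keepdims=True)
    return (images - mean) / (std + eps), mean, std


# Stand-in data (assumption): 8 random 3x32x32 uint8 "images".
rng = np.random.default_rng(0)
train = rng.integers(0, 256, size=(8, 3, 32, 32), dtype=np.uint8)

train_norm, mean, std = zero_mean_normalize(train)
```

Keep the `mean` and `std` computed on the training set and reuse them for your validation/test images, rather than recomputing them per split.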