Better late than never... All the number images are packed into 21 different PNG images, sort of like a 'sprite' for those familiar with web development. Each individual 28x28 pixel image is flattened into a single row 784 pixels wide, and each PNG holds 3,000 rows of pixels, in other words 3,000 flattened images. You can download the images from <demo url>/mnist/mnist_batch_N.png, where N is 0-20 (e.g. http://cs.stanford.edu/people/karpathy/convnetjs/demo/mnist/mnist_batch_0.png).
By the way, these 21 PNG images were created from the original images using a Python script that the ConvNetJS author references in the description on the MNIST demo page.
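To make the layout concrete, here is a rough sketch of how one 784-pixel row maps back to a 28x28 image. It is not taken from ConvNetJS or from the author's script. The helper names (`flatten_image`, `unflatten_row`, `locate`) are made up for illustration, and it assumes the pixels are flattened in row-major order (top row first) and that images are numbered consecutively across the batch files. Decoding the PNG into rows of pixel values is left to whatever image library you prefer.

```python
SIDE = 28                 # each digit image is 28x28 pixels
ROW_WIDTH = SIDE * SIDE   # 784 pixels per flattened image
ROWS_PER_BATCH = 3000     # flattened images (rows) per PNG file


def flatten_image(image):
    """Turn a 28x28 grid (28 lists of 28 values) into one 784-value row.

    Assumption: row-major order, i.e. the top line of the digit comes first.
    """
    return [pixel for line in image for pixel in line]


def unflatten_row(row):
    """Turn one 784-value row back into a 28x28 grid (inverse of flatten_image)."""
    if len(row) != ROW_WIDTH:
        raise ValueError(f"expected {ROW_WIDTH} values, got {len(row)}")
    return [row[i:i + SIDE] for i in range(0, ROW_WIDTH, SIDE)]


def locate(index):
    """Map a global image index to (batch number N, row inside that PNG).

    Assumption: images are numbered consecutively across mnist_batch_0..20.
    """
    return divmod(index, ROWS_PER_BATCH)
```

For example, under these assumptions `locate(3001)` points at row 1 of `mnist_batch_1.png`, and passing that row's 784 values to `unflatten_row` gives back the 28x28 grid.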