Hi all
I am interested in using half-precision computation for GPU.
As I know, torch.CudaHalfTensor() serves fp16 computation.
I tested to make variable torch.CudaHalfTensor(5,2) in several GPUs
- GTX980
- GTX1080 Ti
- Titan x
and all of the machines works to create it.
(All of the machines have cuda version >= 7.5)
But I am not sure they perform (for speed & memory) better than torch.CudaTensor() for all GPUs.
Which GPUs properly support usage of torch.CudaHalfTensor?