Hello Johannes,
I wanted to follow-up on this issue.
I sometimes observe the same problem (ms2020.py-based setup, x_hat = model.decompress(*tensors):
Detected at node 'location_scale_indexed_entropy_model/assert_equal_3/Assert/AssertGuard/Assert' defined at (most recent call last):
Node: 'location_scale_indexed_entropy_model/assert_equal_3/Assert/AssertGuard/Assert'
assertion failed: [Sanity check failed.] [Condition x == y did not hold element-wise:] [x (location_scale_indexed_entropy_model/EntropyDecodeFinalize_1:0) = ] [0] [y (location_scale_indexed_entropy_model/assert_equal_3/y:0) = ] [1]
[[{{node location_scale_indexed_entropy_model/assert_equal_3/Assert/AssertGuard/Assert}}]] [Op:__inference_restored_function_body_27038]
I noticed that PyTorch users typically use something similar to:
torch.backends.cudnn.deterministic=True
torch.backends.cudnn.benchmark=False
From my own experience I can definitely confirm that
import os
os.environ["CUDA_VISIBLE_DEVICES"]="" # no GPU
works, at least for CPU-based evaluation.
I would be quite surprised if you or your colleagues have not encountered similar problems. I typically use Google Colab Pro+ for basic tests (same hardware setup for both training and testing, no cross-platform).
It would be great if you could provide some guidance.
Thanks,
Nikolai