Hi Kris,
I resubmit your submission id 789 and didn’t see 500 error.
The following logs shows some problems with your submission code. Please check and see if you can resolve this yourself.
Regards,
Yixu Chen
---
[2025-09-02 17:45:02] INFO Starting a new decoding job for submission 808
[2025-09-02 17:45:02] DEBUG Connecting to SQL database
[2025-09-02 17:45:02] INFO Obtaining submission
[2025-09-02 17:45:02] INFO Submission 808 status set to DECODING
[2025-09-02 17:45:02] INFO Work directory: /var/lib/containers/submissions/image_0_075_valid_clic2025
[2025-09-02 17:45:02] INFO Using buckets - Submissions: clic-submissions-clic2025, Environments: clic-environments-clic2025
[2025-09-02 17:45:02] DEBUG Copying submission files
[2025-09-02 17:45:04] DEBUG Successfully synced 2 files from s3://clic-submissions-clic2025/static/image_0_075/valid/clic2025/808/ to /var/lib/containers/submissions/image_0_075_valid_clic2025
[2025-09-02 17:45:04] INFO Found decoder zip file: decoder.zip
[2025-09-02 17:45:04] INFO Unzipping decoder
[2025-09-02 17:45:07] INFO Successfully extracted decoder zip file
[2025-09-02 17:45:07] INFO Executable path: /var/lib/containers/submissions/image_0_075_valid_clic2025/decode
[2025-09-02 17:45:07] INFO Files in work_dir directory: ['bs.zip', 'decoder.zip', 'BK-SDM', 'decode', '__MACOSX', 'requirements.txt', 'unzip.py']
[2025-09-02 17:45:07] INFO Set execute permissions on decoder
[2025-09-02 17:45:07] INFO Logging into ECR
[2025-09-02 17:45:08] INFO Successfully pulled user image
[2025-09-02 17:45:08] INFO Resource limits - Memory: 12000MB, CPU: 2, Timeout: 18000s
[2025-09-02 17:45:08] INFO Using GPU container configuration
NGC-DL-CONTAINER-LICENSE
bin
boot
dev
etc
home
lib
lib64
media
mnt
opt
proc
root
run
sbin
srv
sys
tmp
usr
var
BK-SDM
__MACOSX
bs.zip
decode
decoder.zip
requirements.txt
unzip.py
image_0_075_valid_clic2025
ubuntu
Checking extracted files...
total 752
drwxr-xr-x. 2 root root 16384 Sep 2 17:45 .
drwxr-xr-x. 6 root root 16384 Sep 2 17:45 ..
-rw-r--r--. 1 root root 1712 Sep 2 17:45 extra_information.csv
Found config file: config.json
Config - bpp: 0.075, model: ../checkpoints/bk-sdm-v2-base, guidance_scale: 0, grad_term_weight: 0.2, mode: mpgd, use_captions: False
Working directory: /home/image_0_075_valid_clic2025/temp
Captions disabled in config
/usr/local/lib/python3.10/dist-packages/wandb/sdk/launch/builder/build.py:11: UserWarning: pkg_resources is deprecated as an API. See
https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30.
Refrain from using this package or pin to Setuptools<81.
import pkg_resources
/usr/local/lib/python3.10/dist-packages/timm/models/layers/__init__.py:48: FutureWarning: Importing from timm.models.layers is deprecated, please import via timm.layers
warnings.warn(f"Importing from {__name__} is deprecated, please import via timm.layers", FutureWarning)
/usr/local/lib/python3.10/dist-packages/diffusers/utils/outputs.py:63: FutureWarning: `torch.utils._pytree._register_pytree_node` is deprecated. Please use `torch.utils._pytree.register_pytree_node` instead.
torch.utils._pytree._register_pytree_node(
╭───────────────────── Traceback (most recent call last) ──────────────────────╮
│ /home/image_0_075_valid_clic2025/BK-SDM/src/denim_for_submission.py:240 in │
│ <module> │
│ │
│ 237 │ │ transforms.ToTensor() │
│ 238 │ ]) │
│ 239 │ │
│ ❱ 240 │ pipe1 = StableDiffusionNCPipeline.from_pretrained( │
│ 241 │ │ args.model_path, │
│ 242 │ │ torch_dtype=torch.float16, │
│ 243 │ ) │
│ │
│ /usr/local/lib/python3.10/dist-packages/diffusers/pipelines/pipeline_utils.p │
│ y:1086 in from_pretrained │
│ │
│ 1083 │ │ # use snapshot download here to get it working from from_pret │
│ 1084 │ │ if not os.path.isdir(pretrained_model_name_or_path): │
│ 1085 │ │ │ if pretrained_model_name_or_path.count("/") > 1: │
│ ❱ 1086 │ │ │ │ raise ValueError( │
│ 1087 │ │ │ │ │ f'The provided pretrained_model_name_or_path "{pr │
│ 1088 │ │ │ │ │ " is neither a valid local path nor a valid repo │
│ 1089 │ │ │ │ ) │
╰──────────────────────────────────────────────────────────────────────────────╯
ValueError: The provided pretrained_model_name_or_path
"../checkpoints/bk-sdm-v2-base" is neither a valid local path nor a valid repo
id. Please check the parameter.
Error: No output images found in /home/image_0_075_valid_clic2025/temp/results/out
Final results:
ls: cannot access 'images/*.png': No such file or directory
[2025-09-02 17:45:19] ERROR The decoder failed with exit code 2
[2025-09-02 17:45:19] ERROR Check the decoder logs below for more details about the failure
[2025-09-02 17:45:19] DEBUG Full error: Command '['podman', 'run', '--network', 'none', '--memory', '12000m', '--memory-swap', '12000m', '--cpus', '2', '--device', '
nvidia.com/gpu=all', '--name', 'image_0_075_valid_clic2025', '-v', '/var/lib/containers/submissions/image_0_075_valid_clic2025:/home/image_0_075_valid_clic2025',
'-w', '/home/image_0_075_valid_clic2025', '--entrypoint', './decode', '
800102672637.dkr.ecr.us-east-1.amazonaws.com/compression:mapl_clic']' returned non-zero exit status 2.
[2025-09-02 17:45:28] DEBUG Successfully synced 55 files from /var/lib/containers/submissions/image_0_075_valid_clic2025 to s3://clic-submissions-clic2025/static/image_0_075/valid/clic2025/808/
[2025-09-02 17:45:28] INFO Successfully synced results back to S3
[2025-09-02 17:45:28] INFO Cleanup completed
[2025-09-02 17:45:53] ERROR Unable to retrieve logs
CAUTION: This email originated from outside of the organization. Do not click links or open attachments unless you can confirm the sender and know the content is safe.
|