GPU availability

30 views
Skip to first unread message

justina...@gmail.com

unread,
Jan 24, 2019, 1:46:38 PM1/24/19
to Hops
Hi, we are running experiment with 1 gpu per executor, and while it says 4 gpus available, once we start the exeriment it tries to spin up 2 executors and gpu count reaches 5 queued gpu requests and our experiment cannot run. It should only use 2 GPUs in total, but looks like 9 are being reserved? Could you please look into this?

Robin Andersson

unread,
Jan 24, 2019, 2:48:14 PM1/24/19
to justina...@gmail.com, Hops
Hi!

Can you try running it again now?

Also are you certain that you set the number of GPUs per executor to be 1? And the max number of parallel experiments to be 2?

BR,
Robin

On Thu, Jan 24, 2019 at 7:46 PM <justina...@gmail.com> wrote:
Hi, we are running experiment with 1 gpu per executor, and while it says 4 gpus available, once we start the exeriment it tries to spin up 2 executors and gpu count reaches 5 queued gpu requests and our experiment cannot run. It should only use 2 GPUs in total, but looks like 9 are being reserved? Could you please look into this?

--
You received this message because you are subscribed to the Google Groups "Hops" group.
To unsubscribe from this group and stop receiving emails from it, send an email to hopshadoop+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/hopshadoop/843ea697-d4d4-458a-8a2c-f966ab3b5d70%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

justina...@gmail.com

unread,
Jan 25, 2019, 2:15:51 PM1/25/19
to Hops
Hi, I think the problem with GPU queue is back, it worked whatever you did the last time, so maybe you could do that again when you can? Many thanks!

Robin Andersson

unread,
Jan 25, 2019, 2:31:52 PM1/25/19
to justina...@gmail.com, Hops
Hey! Looked into it and it is not actually a problem, although the UI is misleading to some extent. The GPUs are available but there is not enough memory on the machines to give you a container. I see you are running on 8 GPUs right now though. In the future please don't allocate more than 6 at a time to allow for other users to also make use of them. We are working on limiting how many GPUs users may use.

BR,
Robin

justina...@gmail.com

unread,
Jan 25, 2019, 2:37:17 PM1/25/19
to Hops
Thanks for the quick reply. We are running a quick grid search experiment, it should take around 1 hour. when I started the job, it said 13 GPUS were available that is why I wondered if it was correct. It's true, we're using many GPUs but only for a short while, promise :)
Reply all
Reply to author
Forward
0 new messages