Groups
Conversations
All groups and messages
Send feedback to Google
Help
Sign in
Groups
google-cloud-slurm-discuss
Conversations
About
google-cloud-slurm-discuss
1–30 of 184
Welcome to Google Cloud & Slurm Discuss - where we talk about
Slurm
on
Google Cloud Platform
, and
Slurm for GCP Deployment Manager
.
Mark all as read
Report abusive group
0 selected
Arthur Gilly
, …
Kevin Deitz
7
Jun 2
mounting a bucket on all nodes
Thanks a lot for sharing this. If anyone has encountered a similar problem using hpc-tookit, the
unread,
mounting a bucket on all nodes
Thanks a lot for sharing this. If anyone has encountered a similar problem using hpc-tookit, the
Jun 2
Abhilash Mathews
, …
Abhilash Mathews
11
May 23
Why does the login node connect to external networks but allocated compute node fail in Slurm-GCP?
The reason I believe it's related to Slurm deployments is because the compute nodes (which are
unread,
Why does the login node connect to external networks but allocated compute node fail in Slurm-GCP?
The reason I believe it's related to Slurm deployments is because the compute nodes (which are
May 23
Bo Langgaard Lind
, …
Joseph Schoonover
6
Mar 2
Custom usernames, uids and gids on SLURM GCP cluster
I'll add this is not a "place" in the workspace admin panel. You need to write code to
unread,
Custom usernames, uids and gids on SLURM GCP cluster
I'll add this is not a "place" in the workspace admin panel. You need to write code to
Mar 2
Bo Langgaard Lind
,
Joseph Schoonover
2
Mar 1
Speed improvements when baking data into a custom SLURM image
Hey Bo, Great work on getting this far - we've had this solution up for some time in marketplace
unread,
Speed improvements when baking data into a custom SLURM image
Hey Bo, Great work on getting this far - we've had this solution up for some time in marketplace
Mar 1
Kuba Perlin
, …
Bo Langgaard Lind
11
Mar 1
'Network is unreachable' issues when following tutorial "Slurm-GCP - V5 - Codelab guide for PDF"
For the record, I faced a similar issue which was resolved by adding a Cloud NAT to the GCP project
unread,
'Network is unreachable' issues when following tutorial "Slurm-GCP - V5 - Codelab guide for PDF"
For the record, I faced a similar issue which was resolved by adding a Cloud NAT to the GCP project
Mar 1
Aarya
Feb 6
MS Dynamics 365 Position - Remote
Hello, Aarya here from Blink Technology Partners! At the moment, I am fulfilling the requirements of
unread,
MS Dynamics 365 Position - Remote
Hello, Aarya here from Blink Technology Partners! At the moment, I am fulfilling the requirements of
Feb 6
Lazaro Calderin
12/17/22
sole-tenancy nodes with slurm for gcp?
Hi, Is slurm-gcp currently able to use sole-tenancy nodes without modifying it? If yes, please let me
unread,
sole-tenancy nodes with slurm for gcp?
Hi, Is slurm-gcp currently able to use sole-tenancy nodes without modifying it? If yes, please let me
12/17/22
Blake Fitch
,
Olivier Martin
2
11/29/22
Can slurm (slurmctld) run with zero defined partitions in slurm.conf?
Hi Blake, What you could try, to do that, would be to use the latest slurm on gcp (I believe version
unread,
Can slurm (slurmctld) run with zero defined partitions in slurm.conf?
Hi Blake, What you could try, to do that, would be to use the latest slurm on gcp (I believe version
11/29/22
Michael Martin
,
Tom Downes
3
10/24/22
Slurm on VM instance doesn't work with different MPI implementations
Sorry, the other important line is this: https://github.com/GoogleCloudPlatform/hpc-toolkit/blob/main
unread,
Slurm on VM instance doesn't work with different MPI implementations
Sorry, the other important line is this: https://github.com/GoogleCloudPlatform/hpc-toolkit/blob/main
10/24/22
tech-msp
,
Olivier Martin
4
10/17/22
How to update Slurm after applying new Terraform partition update
Not sure exactly which permissions is missing however but you need whichever user terraform is using
unread,
How to update Slurm after applying new Terraform partition update
Not sure exactly which permissions is missing however but you need whichever user terraform is using
10/17/22
Milad Alizadeh
,
Olivier Martin
3
10/3/22
Nodes stuck in DOWN*+CLOUD state, job stuck in CG
Hi, how did you deploy Slurm in the first place? What does the output of sinfo look like? And squeue?
unread,
Nodes stuck in DOWN*+CLOUD state, job stuck in CG
Hi, how did you deploy Slurm in the first place? What does the output of sinfo look like? And squeue?
10/3/22
Tomás Di Domenico
,
Joseph Schoonover
4
7/29/22
Nodes not spinning up: "Zone does not currently have sufficient capacity"
Thanks Joseph. I was just curious since I could create a machine of the same characteristics manually
unread,
Nodes not spinning up: "Zone does not currently have sufficient capacity"
Thanks Joseph. I was just curious since I could create a machine of the same characteristics manually
7/29/22
David Huggins-Daines
,
Alex Chekholko
2
7/28/22
Where is the correct documentation for /apps, I mean /opt/apps?
Hi David, I think you will want to use the latest info from the github repo and the latest debian
unread,
Where is the correct documentation for /apps, I mean /opt/apps?
Hi David, I think you will want to use the latest info from the github repo and the latest debian
7/28/22
David Huggins-Daines
3
7/28/22
Cloud Architecture Center tutorial doesn't seem to work (jobs stuck in BeginTime)
It seems that this codelab is vastly more informative, but is a bit out of date with respect to the
unread,
Cloud Architecture Center tutorial doesn't seem to work (jobs stuck in BeginTime)
It seems that this codelab is vastly more informative, but is a bit out of date with respect to the
7/28/22
Mgcini Keith Phuthi
, …
Tom Downes
4
6/27/22
Can't access/use schedmd public images at all
Keith- You might consider following the image building example in the Cloud HPC Toolkit. It is built
unread,
Can't access/use schedmd public images at all
Keith- You might consider following the image building example in the Cloud HPC Toolkit. It is built
6/27/22
Hyungsik Jo
,
Olivier Martin
9
6/21/22
Unspecified number of nodes limit
/var/log/slurmctld.log contains logs related to slum execution. The log of the operation in the log
unread,
Unspecified number of nodes limit
/var/log/slurmctld.log contains logs related to slum execution. The log of the operation in the log
6/21/22
Chathika Weerasuriya
,
Alex Chekholko
3
6/16/22
Adding new users to SLURM deployment
Hi Alex, Thanks for your reply. I'll try adding the new users as External Users, though this
unread,
Adding new users to SLURM deployment
Hi Alex, Thanks for your reply. I'll try adding the new users as External Users, though this
6/16/22
심문수MunSu Sim
,
Bo Langgaard Lind
2
6/16/22
Any way to set "nic_type" and "thread_per_core=1" for compute instance?
I believe that threads per core corresponds to image_hyperthreads = false. On Thursday, June 16, 2022
unread,
Any way to set "nic_type" and "thread_per_core=1" for compute instance?
I believe that threads per core corresponds to image_hyperthreads = false. On Thursday, June 16, 2022
6/16/22
심문수MunSu Sim
6/15/22
How to replace database of "Slurm on GCP v5" with Cloud SQL?
I'm trying to deploy Slurm on GCP with "terraform cloud full"(Version5). This full
unread,
How to replace database of "Slurm on GCP v5" with Cloud SQL?
I'm trying to deploy Slurm on GCP with "terraform cloud full"(Version5). This full
6/15/22
tech-msp
,
Olivier Martin
4
6/3/22
Error after deployment from MarketPlace
Hi Wayne, Did you enable full internal communications on the network? This will be required for
unread,
Error after deployment from MarketPlace
Hi Wayne, Did you enable full internal communications on the network? This will be required for
6/3/22
Bo Langgaard Lind
6/2/22
What happened to scripts/compute-shutdown?
We had a customization which sent a signal to the slurm processes (which propagates down to our
unread,
What happened to scripts/compute-shutdown?
We had a customization which sent a signal to the slurm processes (which propagates down to our
6/2/22
Gerhard Uwe Bartsch
,
Wyatt Gorman
5
5/9/22
SchedMD Slurm - Controller Slurm install fails
BTW, the one spin from my late reply should read: "If I then logon to each and issue the
unread,
SchedMD Slurm - Controller Slurm install fails
BTW, the one spin from my late reply should read: "If I then logon to each and issue the
5/9/22
Sidd Karamcheti
5/2/22
Spin up TPU VM with GCP-SLURM
Is there base VM / instructions to specify adding TPUs to a given VM when initializing a SLURM-GCP
unread,
Spin up TPU VM with GCP-SLURM
Is there base VM / instructions to specify adding TPUs to a given VM when initializing a SLURM-GCP
5/2/22
Ankit Maroo
, …
Ward Harold
6
5/2/22
Sudo on slurm compute node
While, per previous comments, that's a pretty obvious anti-pattern you could try removing '
unread,
Sudo on slurm compute node
While, per previous comments, that's a pretty obvious anti-pattern you could try removing '
5/2/22
Serena Lien
4/27/22
How to run as slurm user? Can't cancel jobs?
How can I run as slurm user, or what is the default password for the slurm user so I can su? I have a
unread,
How to run as slurm user? Can't cancel jobs?
How can I run as slurm user, or what is the default password for the slurm user so I can su? I have a
4/27/22
Robert Moulton
,
Ankit Maroo
2
4/26/22
srun/sbatch and sudo behavior
Hi. was this issue resolved for you? Thanks Ankit On Monday, August 12, 2019 at 11:48:43 AM UTC-7
unread,
srun/sbatch and sudo behavior
Hi. was this issue resolved for you? Thanks Ankit On Monday, August 12, 2019 at 11:48:43 AM UTC-7
4/26/22
Sidd Karamcheti
4/24/22
Special Images on Compute Nodes -> Interactive & Batch Jobs Stall
Hey folks, I've followed the SchedMD instructions to set up a 5-partition SLURM cluster on GCP.
unread,
Special Images on Compute Nodes -> Interactive & Batch Jobs Stall
Hey folks, I've followed the SchedMD instructions to set up a 5-partition SLURM cluster on GCP.
4/24/22
Martin Gordon
,
Serena Lien
2
4/21/22
Assistance Required Provisioning Slurm Cluster on shared VPC
I have also been trying to do this, and have had the same problem as you, but I got a little bit
unread,
Assistance Required Provisioning Slurm Cluster on shared VPC
I have also been trying to do this, and have had the same problem as you, but I got a little bit
4/21/22
fionn malone
4/13/22
Fixing / Adding new partition to an existing cluster
Hi, Is it possible to reconfigure a slurm cluster to either modify an existing partition or add a new
unread,
Fixing / Adding new partition to an existing cluster
Hi, Is it possible to reconfigure a slurm cluster to either modify an existing partition or add a new
4/13/22
Gohari, S M Iman
,
Wyatt Gorman
3
3/31/22
Cleaning up compute nodes with terraform destroy
Hi Wyatt and Tom, Thank you for the notes. Looking forward to the next release of slurm-gcp. Any
unread,
Cleaning up compute nodes with terraform destroy
Hi Wyatt and Tom, Thank you for the notes. Looking forward to the next release of slurm-gcp. Any
3/31/22