Hello,
I am having some problems running “salloc”. My cluster is running SLURM v23.11.0. The problems are as follows:
- First problem: up to some previous version (I can’t remember exactly which one), when I ran “salloc” I got a shell on the assigned host, where
I could execute whatever I wanted. Now, however, after running “salloc” (without requesting a specific host), SLURM always tries to assign me the last host where a job was executed. If that host is now powered off, salloc waits indefinitely and never tries to connect
to another host.
- Second problem: when the previous command does give me a host (one that is powered on), my session stays on the host where I ran “salloc” rather than moving to the assigned host. So if
I want to execute a program on the assigned host, I need to run it with “srun”. Is there any way to go directly to the assigned host?
- The third is a question: how can I (as cluster admin) limit the hosts on which “salloc” is allowed to run? For example, I would like a user’s “salloc”
to always run in a partition with “x” hosts, so that SLURM does not assign a host from the full list of hosts.
Thanks.