
SLURM: how to limit CPU job count to avoid wasting GPU resources?


We use SLURM to share CPU and GPU resources across our nodes. Sometimes GPU jobs cannot run because CPU-only jobs have filled all the CPU cores on the nodes; in that case the GPUs sit idle and are wasted.

How can I define a policy that avoids this conflict?

For example, is it possible to cap the number of CPU cores that CPU-only jobs may use on each node, so that cores remain available for GPU jobs?

(Example node: 48 CPU cores and 4 GPU cards --> limit CPU-only jobs to 44 cores so that 4 cores remain reserved for GPU jobs.)


Solution

  • A configuration that is sometimes used for this is to define two overlapping partitions: one containing all the nodes (the CPU partition) and one containing only the GPU nodes (the GPU partition).

    You then set MaxCPUsPerNode to 44 for the CPU partition and to 4 for the GPU partition (see the slurm.conf sketch below).

    Then GPU jobs must be submitted to the GPU partition and CPU-only jobs to the CPU partition (which can be the default). That can be enforced either with resource limits or with a job_submit plugin.
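
    As an illustration, here is a minimal slurm.conf sketch of that layout. The node names, the node count, and the Gres=gpu:4 line are assumptions for a hypothetical cluster of 48-core, 4-GPU nodes; adapt them to your own configuration.

        # slurm.conf excerpt (sketch) -- node names are hypothetical
        GresTypes=gpu
        NodeName=node[01-04] CPUs=48 Gres=gpu:4 State=UNKNOWN

        # CPU partition (default): CPU-only jobs may use at most 44 cores
        # per node, leaving 4 cores free for GPU jobs
        PartitionName=cpu Nodes=node[01-04] MaxCPUsPerNode=44 Default=YES

        # GPU partition: GPU jobs may use at most 4 cores per node (one per GPU)
        PartitionName=gpu Nodes=node[01-04] MaxCPUsPerNode=4

    With that in place, users would submit jobs along these lines (the partition and script names are again only illustrative):

        sbatch --partition=cpu cpu_job.sh
        sbatch --partition=gpu --gres=gpu:1 gpu_job.sh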