-
Notifications
You must be signed in to change notification settings - Fork 136
Closed
Milestone
Description
Description
Currently we have a handful of SLURM options exposed as runopt. Asking for more.
Motivation/Background
FAIR users typically set these configs: https://github.com/facebookresearch/pycls/blob/8c79a8e2adfffa7cae3a88aace28ef45e52aa7e5/pycls/core/distributed.py#L120-L130
Some of them can be set via the AppDef (especially those that have to do with resources: mem, gpu, cpu, etc). While others like "email" need to just be straight up offered as runopt or need to figure out a more dynamic way to pass them (see detailed proposal)
Detailed Proposal
Either:
- keep adding user requested sbatch options on a "need-to" basis
- support a dynamic kv pair ( "--cfg sbatch_options=k:v,k:v,k:v")
- support slurm specific options via appdef.metadata (we do this for our internal schedulers - to allow users to set thrift fields - as json - directly from the metadata).
Alternatives
(discussed in the proposal above)
Additional context/links
N/A
d4l3k and mannatsingh
Metadata
Metadata
Assignees
Labels
No labels