Thanks for your reply. I change the job name to bash, and run
srun --job-name=bash python main.py fit
however, it hangs on the initializing ddp stage.
PossibleUserWarning: The
srun
command is available on your system but is not used. HINT: If your intention is to run Lightning on SLURM, prepend your python command withsrun
Initializing distributed: GLOBAL_RANK: 0, MEMBER: 1/10