Converting DeepSpeed checkpoints to an fp32 checkpoint

If you go up one directory from the checkpoint folder, you should find a file called zero_to_fp32.py. Could you try running that? It is also possible that your checkpoint is corrupted, so check that you can manually torch.load() each file in the checkpoint folder without errors.
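
To make the corruption check less tedious, here is a rough sketch of a loop that tries to load every file and reports the ones that fail. The helper name find_unloadable_files, the "*.pt" pattern, and the example folder name are my own assumptions, not anything DeepSpeed provides, so adjust them to match what is actually in your folder. A failure here does not always mean corruption: recent PyTorch versions restrict what torch.load() will unpickle by default, so read the error message before drawing conclusions.

```python
from pathlib import Path


def find_unloadable_files(checkpoint_dir, loader=None, pattern="*.pt"):
    """Try to load every matching file; return {file name: error} for failures.

    Hypothetical helper for illustration. `pattern` assumes the checkpoint
    shards end in .pt; change it if your files are named differently.
    """
    if loader is None:
        # Imported lazily so the helper can also be exercised with a stub loader.
        import torch

        def loader(path):
            # map_location="cpu" avoids needing the GPUs the shards were saved on.
            return torch.load(path, map_location="cpu")

    failures = {}
    for path in sorted(Path(checkpoint_dir).glob(pattern)):
        try:
            loader(path)
        except Exception as exc:  # any load error is worth reporting here
            failures[path.name] = f"{type(exc).__name__}: {exc}"
    return failures


# Example usage (the folder name is a placeholder for your own checkpoint folder):
# for name, err in find_unloadable_files("path/to/checkpoint_folder").items():
#     print(name, "->", err)
```

If this returns an empty dict, every matching file loaded, and the problem is more likely in how zero_to_fp32.py is being invoked than in the files themselves.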