Compute Loss After Sharing Tensor Across GPUs

I’m currently writing a multi-GPU CLIP training script, but am hitting a wall. The loss depends on two matrices built from whole-batch statistics: the image and text embeddings of the entire (global) batch. Only once each GPU has the full-batch embeddings can it compute its sub-batch losses.

How can I first calculate and share the whole batch matrices across GPUs before computing losses?