Hello
I am trying to run inference with Llama 2, but I can't because my GPU isn't powerful enough. I have multiple computers, each with 32 GB of RAM. Is it possible to run the model in parallel across them?