After fine-tuning on multi-GPU my model is moved to CPU for testing
|
|
0
|
179
|
September 18, 2023
|
DeepSpeed stage 3 + quantization
|
|
0
|
291
|
September 15, 2023
|
How to call the validation and ModelCheckpoint callback after certain epochs?
|
|
0
|
119
|
September 14, 2023
|
PyTorch Profiling with TensorBoard: cannot get trace-level information
|
|
0
|
344
|
September 13, 2023
|
Is there any plan to support TensorFlow?
|
|
1
|
157
|
September 12, 2023
|
Dependabot suggests installing version lightning 2022.10.25
|
|
1
|
191
|
September 7, 2023
|
Why is ckpt_path='last' not working?
|
|
0
|
272
|
September 6, 2023
|
Problem in creating python package from pytorch lightning module
|
|
0
|
761
|
August 25, 2023
|
Astrophysica: a black hole and nebula travelling Python-based library
|
|
0
|
158
|
August 25, 2023
|
PyTorch Lightning ".validate()" returns empty list - [SOLVED]
|
|
0
|
343
|
August 22, 2023
|
Why is Lightning almost 3x slower than plain PyTorch?
|
|
7
|
3128
|
August 21, 2023
|
Is there a way to add metric to progress bar without calling `log`?
|
|
1
|
234
|
August 19, 2023
|
Modify dataloader at a given epoch
|
|
3
|
288
|
August 10, 2023
|
Optimizer got an empty parameter list
|
|
2
|
3830
|
August 7, 2023
|
Alternative to wandb for hyperparameter sweeps
|
|
1
|
583
|
July 27, 2023
|
Slurm - CPU time limit exceeded
|
|
0
|
304
|
July 20, 2023
|
Logger does not act as expected (logs to a new version)
|
|
2
|
314
|
July 18, 2023
|
Tuner's Batch Size Finder gets stuck on batch size 4
|
|
0
|
166
|
July 17, 2023
|
Logs to progress bar only appear in training progress bar?
|
|
1
|
222
|
July 16, 2023
|
How to log metrics and losses correctly when model returns dictionary as output
|
|
1
|
442
|
June 19, 2023
|
Code review/suggestions: Progress bar for dataset with no `len()`
|
|
0
|
523
|
June 14, 2023
|
Saving model state dict with fsdp
|
|
2
|
885
|
June 12, 2023
|
reset_real_features=True from TorchMetrics for validation in LightningModule
|
|
0
|
170
|
June 12, 2023
|
Disable CombinedLoader
|
|
0
|
168
|
June 12, 2023
|
Loading/saving state_dict as a regular pytorch net
|
|
1
|
437
|
June 8, 2023
|
Videos rendering problem
|
|
0
|
210
|
May 28, 2023
|
Crash if num_workers>0
|
|
2
|
1185
|
May 25, 2023
|
Vulnerability from Lightning requirements
|
|
3
|
358
|
May 18, 2023
|
Error for training a video classification model
|
|
2
|
429
|
May 17, 2023
|
It takes a long time before each epoch starts
|
|
0
|
153
|
May 12, 2023
|