Number of steps drifts for `val_check_interval` when gradient accumulation turned on
|
|
0
|
280
|
March 26, 2023
|
Global_step increased at new epoch regardless of gradient accumulation
|
|
2
|
696
|
March 26, 2023
|
Incorrect batch size being inferred using trainer.fit(), correct batch size in dataloader? What could be going wrong? [PyLightning]
|
|
1
|
485
|
March 26, 2023
|
Model Works on CPU but Error out while running on GPU
|
|
1
|
809
|
March 25, 2023
|
How to continue training for more epochs?
|
|
1
|
1147
|
March 25, 2023
|
Changing batch size during trainig
|
|
3
|
1922
|
March 20, 2023
|
Modifying the Trainer when calling Trainer.fit() multiple times
|
|
2
|
1332
|
February 18, 2023
|
Error while training simclr model
|
|
0
|
203
|
February 12, 2023
|
Question about auto_lr_find()
|
|
1
|
2202
|
January 31, 2023
|
How do I prevent initial validation run in Trainer 1.9.0?
|
|
1
|
312
|
January 24, 2023
|
Save_last and monitor in ModelCheckpoint
|
|
0
|
154
|
January 23, 2023
|
Why `precision=16` for me is almost useless for speeding up?
|
|
1
|
1022
|
January 16, 2023
|
Resume_from_checkpoint not work
|
|
4
|
5401
|
December 7, 2022
|
Dealing with large dataset
|
|
1
|
3340
|
December 3, 2022
|
Auto_lr_find dependence on initial learning rate
|
|
1
|
478
|
November 22, 2022
|
Gradient Accumulation with Dual (optimizer, scheduler) Training
|
|
0
|
423
|
November 10, 2022
|
Filename for last checkpoint
|
|
1
|
639
|
November 7, 2022
|
How to get the checkpoint path?
|
|
11
|
17435
|
November 2, 2022
|
Why in progress bar there is no train_acc display?
|
|
0
|
691
|
July 8, 2022
|
Issue Regarding DETR on custom data
|
|
0
|
307
|
June 7, 2022
|
Target size that is different to the input size
|
|
10
|
12009
|
May 19, 2022
|
Precision doesn't work
|
|
0
|
673
|
April 14, 2022
|
How to use `LightningCLI` to start training from a checkpoint at epoch 0?
|
|
0
|
864
|
February 19, 2022
|
How to customize trainer in order to restrict parameter range during training?
|
|
2
|
644
|
January 30, 2022
|
Modules that have backward hooks assigned cannot be compiled
|
|
1
|
713
|
January 29, 2022
|
ModelCheckpoint docs for every_n_epochs==None
|
|
1
|
692
|
January 29, 2022
|
How to deal with lr_find_temp_model_**.ckpt
|
|
2
|
607
|
January 29, 2022
|
Dose PL validate and train at the same time?
|
|
1
|
2096
|
January 29, 2022
|
Where is accelerator_connector?
|
|
1
|
1210
|
January 29, 2022
|
No `training_step()` method defined
|
|
10
|
7918
|
January 9, 2022
|