Trainer

Topic	Replies	Views	Activity
About the Trainer category	0	598	August 26, 2020
CPU multithreading	0	29	March 7, 2025
MLFlow model can't be registered	2	683	February 10, 2025
What does this _TunerExitException error mean?	8	933	December 23, 2024
Replacement for add_argparse_args()	0	136	October 22, 2024
ShardedDDP and Grad Accumulation Warning	0	11	October 15, 2024
Trainer flag request (run validation after N epochs of training)	0	21	October 3, 2024
Using synthetic training data	0	8	September 12, 2024
Best practices for double precision training	0	101	June 8, 2024
Bug in the trainer.predict()	0	80	June 6, 2024
Model training stops at the first epoch (epoch 0)	0	306	May 15, 2024
Optimizer step in Profiler	0	109	May 6, 2024
How to Load .CKPT for validation?	0	124	May 6, 2024
Update parameters marked by a mask	0	92	May 5, 2024
More input?(input1, label) and another input2(p)	0	130	April 1, 2024
In PyTorch Lightning, how can one extract embeddings from a pretrained model to assist another model during training_step?	1	289	March 25, 2024
How trainer.test/predict works when 2 devices are used?	0	141	March 24, 2024
FSDP sharded checkpointing slower than any other method	1	346	March 19, 2024
Progress Bar in Jupyter Notebooks (Visual Studio Code)	3	1521	March 17, 2024
Run multiple validation loops with different weights	1	357	March 13, 2024
RuntimeError When Integrating LoRA Layers	1	538	March 1, 2024
Confusions about torchmetrics in pytorch_lightning	6	647	March 1, 2024
Next cost too much time	0	125	February 28, 2024
Epochs Stuck at 0% Completion During Training	0	415	February 24, 2024
Creating custom LightningModule for Fine Tuning LLMs	0	264	February 18, 2024
Stuck in Sanity Checking	0	268	February 9, 2024
Can't train with a too old NVIDIA driver (even with CPU accelerator)	4	892	January 7, 2024
Training is very slow	0	272	January 4, 2024
Validate every epoch prior to check_val_every_n_epoch kicking in	0	214	December 19, 2023
Run validation loop and callback before training	3	720	December 18, 2023