About the LightningModule category
|
|
0
|
767
|
August 26, 2020
|
DLRM run failed in torchrec+lightning
|
|
0
|
22
|
December 1, 2024
|
Fine tuning using LLAMA models
|
|
0
|
74
|
November 12, 2024
|
Save torchmetrics plots after logging them in LightningModule
|
|
0
|
18
|
August 10, 2024
|
Go pass the sanity check but get CUDA OUT OF MEMORY when in validation loop
|
|
0
|
125
|
April 4, 2024
|
Understanding self.log()
|
|
2
|
4255
|
March 22, 2024
|
LightningModule.train_dataloader()
|
|
4
|
666
|
March 20, 2024
|
Skip instances during training
|
|
2
|
916
|
March 17, 2024
|
Do I need to detach when using self.logger.experiment.add_scalars?
|
|
1
|
483
|
March 12, 2024
|
How to interactively run inference with a model in jupyter notebook created with lightningcli?
|
|
0
|
160
|
March 1, 2024
|
Where should I load the model checkpoint when using configure_model?
|
|
1
|
633
|
February 23, 2024
|
Save and restore persisted DataLoader states from checkpoint
|
|
0
|
215
|
February 21, 2024
|
Confusions about load_from_checkpoint() and save_hyperparameters()
|
|
1
|
354
|
January 21, 2024
|
ERROR:root:Attempting to deserialize object on a CUDA device but torch.cuda.is_available() is False
|
|
3
|
2709
|
January 7, 2024
|
How can I remove metric parameters from model?
|
|
0
|
207
|
January 5, 2024
|
RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation
|
|
0
|
633
|
December 9, 2023
|
Mixed precision training (how to appropriately scale the manual gradient updates)
|
|
0
|
352
|
December 5, 2023
|
Size mismatch for model
|
|
1
|
986
|
October 26, 2023
|
Metrics not logged properly in PyTorch Lightning
|
|
1
|
917
|
October 22, 2023
|
ValueError: too many values to unpack (expected 3)
|
|
3
|
874
|
October 18, 2023
|
Question about recover nested model from checkpoint
|
|
0
|
608
|
October 17, 2023
|
Logging one value per epoch?
|
|
0
|
335
|
October 4, 2023
|
Load checkpoint with dynamically created model
|
|
0
|
150
|
September 12, 2023
|
Disabling autocast for certain modules
|
|
1
|
635
|
September 1, 2023
|
Save_hyperparameters and OptimizerCallable
|
|
0
|
186
|
August 15, 2023
|
Custom model definition is not included in checkpoint hyper_parameters
|
|
2
|
384
|
August 9, 2023
|
Why does training fails with "require grad and does not have a grad_fn"?
|
|
3
|
5610
|
August 8, 2023
|
[CLI] How to Pass Arguments to Initialize an Object in L.LightningModule?
|
|
6
|
1863
|
August 4, 2023
|
How to correctly initialize latent vector parameters that have size dependent on training data size using CLI?
|
|
0
|
177
|
July 24, 2023
|
Lightning Module isn't loading checkpoint from the path as per documentation
|
|
1
|
691
|
July 22, 2023
|