How to use textbooks for fine-tuning LLM | 0 | 394 | June 24, 2023
Data collate_fn makes training process super slow! | 0 | 928 | June 22, 2023
Using SequentialLR with Step, Epoch and ReduceLROnPlateau | 0 | 499 | June 2, 2023
Finetuning using lit-llama | 3 | 504 | May 24, 2023
Transfer learning | 0 | 201 | May 23, 2023
I am lost on custom batch size definition | 2 | 496 | May 17, 2023
Problem that many symbols are output in val_dataloaders | 2 | 369 | May 6, 2023
Error when predicting from checkpoint | 1 | 703 | May 6, 2023
Does not run validation step after epoch when running with all data | 5 | 1595 | May 1, 2023
Why are my training and validation losses only changing by very little? | 2 | 782 | April 28, 2023
Saving checkpoints and logging models | 1 | 201 | April 28, 2023
Different ways of logging model | 0 | 131 | April 26, 2023
How can we skip a step with NaN loss in the training_step when using Distributed Data Parallel (DDP)? | 1 | 1067 | April 24, 2023
Mac M2 MPS: failed assertion `destination kernel width and filter kernel width mismatch' | 0 | 559 | April 17, 2023
Error on trainer = L.Trainer(max_epochs=2000) | 0 | 272 | April 4, 2023
Custom training - RuntimeError due to unused parameters | 0 | 1505 | April 3, 2023
MLFlowLogger always generates the same run name | 1 | 500 | April 3, 2023
LR Scheduler monitoring multiple metrics | 2 | 640 | April 3, 2023
RAM usage increases quickly over the training step | 2 | 335 | March 30, 2023
Code structuring for text classification with hf bert-uncase | 2 | 385 | March 23, 2023
Use two datasets and distinguish during training | 0 | 139 | March 22, 2023
DeepSpeed: how to execute certain code once? | 0 | 233 | March 22, 2023
How to combine PTL arguments with ArgumentParser | 2 | 1753 | March 22, 2023
Multi GPU - Autolog with multiple runs - lightning2.0 | 2 | 655 | March 22, 2023
Loading saved checkpoint model.model | 2 | 313 | March 16, 2023
LR-Finder on ResNet 50 | 1 | 268 | March 12, 2023
How to get max epochs in pl.LightningModule? | 2 | 1811 | March 7, 2023
How to use warmup lr+CosineAnnealingLR in Lightning | 2 | 4462 | March 6, 2023
Can automatic optimization catch nested requires_grad? | 1 | 411 | March 4, 2023
RuntimeError: Trying to resize storage that is not resizable | 3 | 16616 | March 3, 2023