LR-Finder on ResNet 50

FJDorfner · March 9, 2023, 5:34pm

I am trying to train a binary classification model for x-rays based on a ResNet50 model pretrained on ImageNet. As soon as I reduce the last FC layer of the ResNet to 2 outputs (from the original 1000 for ImageNet-1k), the learning rate finder just shows a graph that looks like this:

I have rebuilt the model from scratch and tested the LR-Finder along the way, so I am fairly sure that changing the FC-Layer is what breaks it.
Has anyone ever experienced a problem like this? Thanks!

awaelchli · March 12, 2023, 7:36pm

It looks like none of the chosen learning rates leads to a decrease in loss (note that the graph doesn’t have any negative slope). I suggest first doing a sanity check: confirm that you can overfit on a small set of samples, which shows that the model is actually trainable.
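
The sanity check suggested above could look something like the sketch below in plain PyTorch. The function name `overfit_small_batch`, the Adam optimizer, and the toy model and random data in the usage part are all assumptions for illustration; in practice you would pass in your own model and a handful of real x-ray samples. Lightning's `Trainer` also has an `overfit_batches` argument intended for the same kind of check.

```python
import torch
from torch import nn


def overfit_small_batch(model, inputs, targets, lr=1e-3, steps=200):
    """Hypothetical helper: train on one fixed batch, return the loss per step."""
    criterion = nn.CrossEntropyLoss()
    # Only optimize parameters that require gradients; if every parameter is
    # frozen, this list is empty and the optimizer raises an error.
    trainable = [p for p in model.parameters() if p.requires_grad]
    optimizer = torch.optim.Adam(trainable, lr=lr)
    model.train()
    losses = []
    for _ in range(steps):
        optimizer.zero_grad()
        loss = criterion(model(inputs), targets)
        loss.backward()
        optimizer.step()
        losses.append(loss.item())
    return losses


if __name__ == "__main__":
    torch.manual_seed(0)
    # Toy stand-ins (assumptions): a tiny linear classifier and random "images".
    toy_model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 8 * 8, 2))
    x = torch.randn(8, 3, 8, 8)
    y = torch.randint(0, 2, (8,))
    losses = overfit_small_batch(toy_model, x, y, lr=1e-2, steps=100)
    print(f"first loss {losses[0]:.3f}, last loss {losses[-1]:.3f}")
```

If the loss on a few fixed samples does not drop towards zero, the problem is more likely in the model or data pipeline than in the learning rate finder itself.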
