So should we set lr=learning_rate or lr=learning_rate*N in configure_optimizers when using the DDP backend?
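To make the two options concrete, here is a minimal sketch of what each one would compute. It assumes N means the number of DDP processes (the thread does not say so explicitly), and the helper name `ddp_learning_rate` and its `scale_linearly` flag are made up for illustration, not part of any library. Since DDP averages gradients across processes, the effective batch size grows with N, and multiplying the learning rate by N is the usual linear-scaling heuristic; whether it helps for a given model is a tuning question.

```python
def ddp_learning_rate(learning_rate: float, n: int, scale_linearly: bool) -> float:
    """Return the lr to hand to the optimizer under either convention.

    n is assumed to be the number of DDP processes (hypothetical reading of N).
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    # Option 2 (lr=learning_rate*N) is the linear-scaling heuristic;
    # option 1 (lr=learning_rate) leaves the base rate untouched.
    return learning_rate * n if scale_linearly else learning_rate


# Inside a LightningModule this might be used roughly as:
#   def configure_optimizers(self):
#       lr = ddp_learning_rate(self.learning_rate, n, scale_linearly=True)
#       return torch.optim.SGD(self.parameters(), lr=lr)

if __name__ == "__main__":
    print(ddp_learning_rate(0.1, 4, scale_linearly=False))  # option 1
    print(ddp_learning_rate(0.1, 4, scale_linearly=True))   # option 2
```

Keeping the choice behind a flag makes it easy to try both and compare validation curves rather than committing to one up front.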