BERT model throws error when used in Pytorch Lightning

I think some mismatch with positional arguments is there:

try:

or maybe