Why have the Bolts DataModules not implemented setup()?

I would say this comes down to preference. Take something like STL10: it has several different splits of data (labelled, unlabelled, and a mix of both), so keeping the transforms inside the *_dataloader() methods makes it easier to track which transform is applied to which split.
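To make the pattern concrete, here is a rough, dependency-free sketch. It is not the Bolts code: `ToySTL10`, `ToyDataModule`, and the method names are made up for illustration, and plain lists stand in for real datasets and `DataLoader`s. The only point is that each `*_dataloader()` method builds its own split and picks its own transform right there, with no shared `setup()` state.

```python
# Illustrative sketch only: all names below are hypothetical, not the Bolts API.
# Plain lists stand in for torchvision datasets and torch DataLoaders.


class ToySTL10:
    """Stand-in for a dataset with several splits, as STL10 has."""

    SPLITS = {
        "train": [1, 2, 3],              # labelled samples
        "unlabeled": [10, 20, 30, 40],   # unlabelled samples
    }

    def __init__(self, split, transform=None):
        if split == "train+unlabeled":
            # The mixed split is the labelled data followed by the unlabelled data.
            self.items = self.SPLITS["train"] + self.SPLITS["unlabeled"]
        else:
            self.items = list(self.SPLITS[split])
        self.transform = transform

    def __len__(self):
        return len(self.items)

    def __getitem__(self, idx):
        item = self.items[idx]
        return self.transform(item) if self.transform else item


class ToyDataModule:
    """Each *_dataloader() chooses its split and its transform in one place."""

    def __init__(self, labeled_transform=None, unlabeled_transform=None):
        self.labeled_transform = labeled_transform
        self.unlabeled_transform = unlabeled_transform

    @staticmethod
    def default_transform(x):
        # Identity transform used when the caller supplies none.
        return x

    def _load(self, split, transform):
        dataset = ToySTL10(split, transform=transform)
        # A real module would wrap the dataset in a DataLoader here.
        return [dataset[i] for i in range(len(dataset))]

    def train_dataloader_labeled(self):
        transform = self.labeled_transform or self.default_transform
        return self._load("train", transform)

    def train_dataloader_unlabeled(self):
        transform = self.unlabeled_transform or self.default_transform
        return self._load("unlabeled", transform)

    def train_dataloader_mixed(self):
        # Assumption for this sketch: the mixed split reuses the unlabelled transform.
        transform = self.unlabeled_transform or self.default_transform
        return self._load("train+unlabeled", transform)


dm = ToyDataModule(labeled_transform=lambda x: x * 2)
labeled = dm.train_dataloader_labeled()
mixed = dm.train_dataloader_mixed()
```

With a `setup()`-based design, the same split-to-transform pairing would have to be stored on the instance up front for every split, which is workable but, to my eye, harder to follow when there are this many variants.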