Using a target size (torch.Size([100, 1])) that is different to the input size (torch.Size([100, 100])). This will likely lead to incorrect results due to broadcasting. Please ensure they have the same size

Cross-post from here with a clarifying question.
It’s currently unclear how you are interpreting the used operation as you are explicitly creating the batch size - dependent output in your model.