Train torchvision faster-rcnn on 4-channels images

Hello!
I train Faster-RCNN from torchvision on 4-channels image. I changed resnet50 backbone for 4-channels input but I get error in “torchvision/models/detection/transform.py”.

return (image - mean[:, None, None]) / std[:, None, None]
RuntimeError: The size of tensor a (4) must match the size of tensor b (3) at non-singleton dimension 0

I can somehow turn off these transforms because right now I can’t train the model with them?
Thanks for answers!

Did you fix this issue? If so, could you share how to skip this error? Thanks