SSDlite annotation format

Hello, I am fine-tuning pretrained SSDlite model from torchvision.
The training somehow goes well when I feed bounding boxes coordinates in absolute coordinates to the fed image size. However, the bb coordinates “theoretically” should be normalized.
I’d like to know that In what format does the SSDlite in torchvision expect the annotation?
In addition to it, what is the reason that the absolute coordinates make training going well while relative coordinates do not.

just in case, absolute coordinates look like this (162, 280, 185, 300). relative coordinates look like this (0.52, 0.8, 0.6, 0.9).