Hi. I’m trying to make a model, based on YOLO v1.
but I’m having struggle with the dataset.
I already made the YOLO network and the out put is 7x7x10
7x7 is grid size and 10 is for 2 * (x, y, w, h, confidence score) two bounding box parameters.
I removed the class parameter, because my work doesn’t need classification. It only needs the bounding box for localization.
my current dataset looks like this (image, target)
target = (x1, y1, w1, h1, Confidence1, x2, y2, w2, h2, Confidence2)
but I need the targets to be 7x7x10.
how can I encode the targets to fit in the right grid cell?