Hi.
I’m trying to learn about this algorithm, but any tutorial that i’ve read is like “YOLO is dividing image on SxS grid, where each cell is responsible for detecting an object…” it explains nothing to me.
How this algorithm is predicting boundry boxes which are larger than one grid cell?
Does it concatenate somehow those grid cells that are adjacent and predicting same class object?
I have read a tutorial [here fr example] to implement yolov3 from scratch, it even has a working code
In complement of reading the original papers, the idea is just to search the web for a good tutorial with working code explained step by step. There must be plenty of them available online now.