Retain Graph- Branch Convolutional Neural Network

ganLover · March 12, 2019, 6:43pm

I trying to implement B-CNN: Branch Convolutional Neural Network for Hierarchical Classification.-https://arxiv.org/pdf/1709.09890.pdf

I have a doubt-
Should I backpropagate losses from the branches individually (using retain graph)

OR

Should i simply add the losses from all the branches and then perform backpropagation.

Naruto-Sasuke · March 13, 2019, 1:41am

Both is ok, while the latter seems better.

ganLover · March 13, 2019, 3:26am

I wanted to know why 2nd is better.

Is it because- in 2nd case I need only one backprop compared to 3 backprops in 1st case?

k0pch4 · March 14, 2019, 3:51pm

By better, I think @Naruto-Sasuke means that your code would “look” better and would have less lines, therefore less surface area for software bugs. AFIK, Pytorch internally is able to see what are the variables that have been worked upon and is able to find the required grads, without us explicitly asking to calculate the grads for each of the branches.

moad_alami · May 9, 2022, 8:48pm

Hi @ganLover is it possible to share the code used to implement the B-CNN architecture?