_convolution_double_backward() parameters

tomguluson92 · May 22, 2019, 2:56am

Hi @albanD, thanks for your explanation about the definition of _convolution_double_backward(). When reading this code, I was confused about the parameter like ggI, ggW_r, ggb and gO_r and output_mask and the target of calculating ggW, gI and gW.

Under my superficial understanding of all these, ggW_r means the gradient of convolution kernel? xx_r means a requirement, ggb is the gradient of bias.

Really hope your advice on how to realize these parameters and the calculation of ggW, gI and gW.

Thank you very much!

albanD · May 29, 2019, 3:09pm

Hi,

I’m not sure about the *_r variables. Most likely some some cpp reasons.
The gg* variables contain the gradient of the gradient of something.
Since the forward is I, W, b -> O
the backward is gO, I, W -> gI, gW, gb
and the double backward ggI, ggW, ggb -> ggO, gI, gW

tomguluson92 · June 11, 2019, 6:31am

very appreciate to your answer, thank you !