What is the meaning of the gain?
What's the difference between gain=1 and gain=sqrt(2)?
![image|423x500]
Thank you. I have read the docs, but I still do not understand how it works.
Hi,
From what I remember, it is just a heuristic that depends on the nonlinearity.
It roughly represents how the nonlinearity scales its outputs. Using this information, you can initialize your weights in a better way, so that your network keeps everything at roughly the same scale.
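
To make that concrete, here is a rough sketch in plain Python (no framework needed). It assumes the inputs to the nonlinearity are zero-mean Gaussian, which is the usual assumption behind this heuristic, and the helper names (`second_moment`, `relu`) are just made up for the illustration. A ReLU zeroes the negative half of its inputs, so it cuts the mean squared activation roughly in half. Scaling the weights (and therefore the pre-activations) by sqrt(2) doubles the mean square and cancels that loss. With no nonlinearity there is nothing to compensate for, which is why the gain is 1 in that case.

```python
import math
import random


def second_moment(values):
    """Mean of the squared values, i.e. the average signal power."""
    return sum(v * v for v in values) / len(values)


def relu(v):
    """Standard ReLU: keep positive values, zero out the rest."""
    return v if v > 0.0 else 0.0


rng = random.Random(0)  # fixed seed so the run is repeatable

# Assumption: pre-activations are zero-mean, unit-variance Gaussian.
pre_activations = [rng.gauss(0.0, 1.0) for _ in range(200_000)]

power_in = second_moment(pre_activations)

# gain = 1: the ReLU discards about half of the signal power.
power_after_relu = second_moment([relu(v) for v in pre_activations])

# gain = sqrt(2): scaling by the gain multiplies the power by gain**2 = 2,
# which offsets the halving caused by the ReLU.
gain = math.sqrt(2.0)
power_with_gain = second_moment([relu(gain * v) for v in pre_activations])

print(f"power before ReLU:            {power_in:.3f}")
print(f"power after ReLU, gain=1:     {power_after_relu:.3f}")
print(f"power after ReLU, gain=sqrt2: {power_with_gain:.3f}")
```

If you stack many layers, that factor of one half per layer compounds, so the activations shrink toward zero with gain=1 and stay at roughly the same scale with gain=sqrt(2). Other nonlinearities scale their outputs differently, so they get different recommended gains.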