I am reading the docs for initializing the weights according to torch.nn.init.kaiming_normal_ and I have trouble to understand the following:
Why the default value for nonlinearity is leaky_relu while the default value for a is 0.01?
I am reading the docs for initializing the weights according to torch.nn.init.kaiming_normal_ and I have trouble to understand the following:
Why the default value for nonlinearity is leaky_relu while the default value for a is 0.01?