Hi,
About the last line: you need to initialize the bias the same way you initialized your weights. In other words, pass self.pi.bias instead of self.pi, which is the module itself.
Also, even once corrected, nn.init.constant_(self.pi.bias, 0) sets every element of the bias tensor to zero, whereas the TF counterpart initializes it with [3, -3]. To match that, you can simply assign a tensor with these values to the bias attribute of self.pi:
    with torch.no_grad():
        self.pi.bias = nn.Parameter(torch.tensor([3., -3.]))
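
For reference, here is a minimal self-contained sketch of the same idea. It assumes self.pi is an nn.Linear with two output features (so the bias has shape [2]); the class name Head, the in_features value, and the Xavier weight init are placeholders I made up, not your actual model. It uses an in-place copy_ rather than assigning a new nn.Parameter, which is an alternative, not a requirement:

```python
import torch
import torch.nn as nn


class Head(nn.Module):  # hypothetical module, for illustration only
    def __init__(self, in_features=4):  # in_features is an assumed value
        super().__init__()
        # Assumption: pi is a linear layer with 2 outputs, matching [3, -3].
        self.pi = nn.Linear(in_features, 2)

        # Placeholder weight init; note it receives the tensor, not the module.
        nn.init.xavier_uniform_(self.pi.weight)

        # Overwrite the existing bias values in place, outside of autograd.
        with torch.no_grad():
            self.pi.bias.copy_(torch.tensor([3.0, -3.0]))


head = Head()
print(head.pi.bias)
```

The in-place copy keeps the original Parameter object, which matters if an optimizer was already built from the model's parameters. If you assign a fresh nn.Parameter as above, do it before creating the optimizer.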