Translate weight initialization from tensorflow to pytroch

Nikronic · March 30, 2021, 2:27pm

Hi,

About the last line, you need to initialize it like the way you have initialized your weights. In other words, use self.pi.bias instead of self.pi which is the module.

Also, nn.init.constant_(self.pi, 0) initialized the entire bias tensor with zeroes while the TF counterpart has initialized it with [3, -3]. To do so, you can simply assign a tensor with these values to the bias attribute of self.pi:

with torch.no_grad():
    self.pi.bias = nn.Parameter(torch.tensor([3., -3.]))