The documentation for the groups argument of torch.nn.Conv2d says that it splits the channels into groups. It gives this example:
At groups=2, the operation becomes equivalent to having two conv layers side by side, each seeing half the input channels, and producing half the output channels, and both subsequently concatenated.
But what happens if the number of output channels is not a multiple of groups? Will one of the groups produce more output channels than the others?
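One way to probe this is simply to try building such a layer. Below is a minimal sketch, assuming PyTorch is installed; the helper name try_grouped_conv and the channel counts are made up for illustration and are not part of the PyTorch API. It returns the weight shape when the layer can be constructed, and the error message when construction fails.

```python
import torch.nn as nn


def try_grouped_conv(in_channels, out_channels, groups):
    """Return the layer's weight shape, or the error message if construction fails.

    Hypothetical helper for this question, not a PyTorch function.
    """
    try:
        conv = nn.Conv2d(in_channels, out_channels, kernel_size=3, groups=groups)
    except ValueError as err:
        return str(err)
    return tuple(conv.weight.shape)


# Divisible case: 4 input and 6 output channels, split into 2 groups.
# Each of the 6 filters only sees 4 / 2 = 2 input channels.
ok = try_grouped_conv(4, 6, 2)
print(ok)  # (6, 2, 3, 3)

# Non-divisible case: 5 output channels cannot be split evenly into 2 groups.
bad = try_grouped_conv(4, 5, 2)
print(bad)
```

The Conv2d documentation also states that in_channels and out_channels must both be divisible by groups, so in the versions I am aware of the second call should come back with an error message rather than an uneven split between the groups.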