Do I need to consider the batch_size when defining own module

king_wang · April 12, 2018, 1:53am

From the tutorial, I got that

we need to consider the batch_size when using the Linear layer. Then if I define own module ,do i need to consider the batch_size, e.g if the original input is a input_feature Tensor, if we consider the batch_size, then the input will be batch_size*input_feature Tensor

ptrblck · April 12, 2018, 5:37am

You don’t need to consider the batch size when initializing the Modules. The Linear layer for example takes in_features as an argument, which would be dimension 1 for x = torch.randn(10, 20).

However, when you need another view on the Tensor, e.g when you need to flatten the Tensor coming from a Conv2d, you most likely want to keep the batch size and flat all remaining dimensions.
You would do it in the forward method:

x = self.conv(x)
x = x.view(x.size(0), -1) #keep batch size
x = self.fc(x)

Does this explanation make it clearer?

Dung_Hu_nh_Anh · July 16, 2024, 9:00am

How about a model from any paper? I currently working on replicate a model from a paper. And do i need to consider the batch size?

ptrblck · July 16, 2024, 12:44pm

The implementation of models “from any paper” depends on the definition of the authors. While built-in layers (and other standard layers) do not hard-code the batch size, it’s impossible to speak for all authors.