When I was writing a model definition. I noticed that with nn.Sequential there was an increase in about 0.5s. This time could cascade over several examples and increase the training time by a large factor. Why does this happen?
This should not be the case.
Could you post the model definition inside the
nn.Sequential container and the custom