Transfer learning usage with different input size

I wouldn’t recommend to use the seeding approach, as you would have to be very careful about using exactly the same calls into the pseudorandom number generator.
An easier approach would be to load the state_dict from one model to the other and compare the outputs. Since your architectures might differ a bit, you might need to manipulate the keys of the state_dict to make them match.

1 Like