Processing Sequences with Linear layers Behaviour

Yes, this should be the case as seen here and here.

1 Like