Mistake on pytorch documentation

dato_nefaridze · December 8, 2019, 7:28pm

here is what nn.Linear do but i am confused.i think thre should be A^T*x and not vice versa

ptrblck · December 9, 2019, 4:39am

The documentation refers to the implemented method as seen here.

Your suggestion won’t work with plain matmul, as x has the batch dimension in dim0.