Where is the nn.Linear CUDA implementation?

nn.Linear uses F.linear, which calls into addmm or matmul and then dispatches to the cuBLAS method.
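To make that call chain concrete, here is a minimal sketch that checks the numerical equivalence from the Python side. It assumes a small 2-D input; the layer sizes, seed, and tolerance are arbitrary choices for illustration. It does not show which internal path (addmm or matmul) is actually taken, since that depends on the input shape and PyTorch version. It only confirms that all three formulations compute the same thing.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

# Use the GPU when one is available; otherwise the same code runs on CPU.
device = "cuda" if torch.cuda.is_available() else "cpu"

layer = nn.Linear(4, 3).to(device)      # weight: (3, 4), bias: (3,)
x = torch.randn(2, 4, device=device)    # batch of 2 inputs

# 1. The module call.
out_module = layer(x)

# 2. The functional call that the module forwards to.
out_functional = F.linear(x, layer.weight, layer.bias)

# 3. The same computation written as addmm: bias + x @ weight^T.
out_addmm = torch.addmm(layer.bias, x, layer.weight.t())

# 4. The same computation written as matmul plus a bias add.
out_matmul = torch.matmul(x, layer.weight.t()) + layer.bias

results_match = all(
    torch.allclose(out_module, other, atol=1e-4)
    for other in (out_functional, out_addmm, out_matmul)
)
print(results_match)
```

On a CUDA tensor, the matrix multiply in these calls is what ends up in the cuBLAS GEMM routines mentioned above.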
