Could not run 'quantized::conv2d.new' with arguments from the 'QuantizedCUDA' backend

you can inspect the model and identify whether the weight is stored correctly, its possible its not transfering over or something, though usually modules move over their attributes by default.