I also saw this: Different outputs when using different batch size (only on cuda) - #2 by ptrblck
Does this constitute a different workload ?
I also saw this: Different outputs when using different batch size (only on cuda) - #2 by ptrblck
Does this constitute a different workload ?