Nonuniform batch size with CUDA MAGMA

This paper KiloNeRF executes many networks in PyTorch with variable batch sizes simultaneously using a MAGMA routine. I need to perform a similar application. I can’t find any details regarding which routine specifically performs this. Can anyone point me in the right direction to using it?

Thank you

1 Like