(More) Batch matrix operations

at this time we dont have better plans for these than a naive for-loop.
What are your use-cases that these are needed? Can you elaborate?