J_Johnson
(J Johnson)
4
I’m not sure if this is what you’re looking but it seems related:
And this thread might help:
By the way, using a Toeplitz matrix may not be the most efficient approach. This post provides a more efficient algorithm with less calculations:
But still needs the operations to be parallelized.