Is there an efficient way to perform this operation in 2D?
The naive approach is to either loop through the rows and columns or construct the matrix and multiply.
Oddly, I was unable to find an example of how to do this for either Theano or TensorFlow.
You can use conv layers to compute single levels (using coefficients, e.g. from pywavelets, and setting them as the filter weights).
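A minimal sketch of that idea, assuming Haar coefficients from pywt: the band ordering, the filter flip (torch's conv is cross-correlation), and the lack of boundary padding are my choices here, so edge coefficients will differ from pywt's default symmetric mode.

```python
import pywt
import torch
import torch.nn.functional as F

w = pywt.Wavelet('haar')
# Flip the decomposition filters: torch's conv2d is cross-correlation, not convolution.
dec_lo = torch.tensor(w.dec_lo[::-1], dtype=torch.float32)
dec_hi = torch.tensor(w.dec_hi[::-1], dtype=torch.float32)

# Build the four 2D analysis filters (LL, LH, HL, HH) as outer products of the 1D filters.
filters = torch.stack([
    dec_lo.unsqueeze(0) * dec_lo.unsqueeze(1),
    dec_lo.unsqueeze(0) * dec_hi.unsqueeze(1),
    dec_hi.unsqueeze(0) * dec_lo.unsqueeze(1),
    dec_hi.unsqueeze(0) * dec_hi.unsqueeze(1),
]).unsqueeze(1)  # shape (4, 1, k, k)

def dwt2d_single_level(x):
    """One DWT level: x is (N, 1, H, W); returns (N, 4, H/2, W/2)."""
    return F.conv2d(x, filters, stride=2)

x = torch.rand(1, 1, 8, 8)
out = dwt2d_single_level(x)
# out[:, 0] is the approximation (LL) band; out[:, 1:] are the three detail bands.
```

Since Haar is orthonormal and the stride-2 windows are non-overlapping, the transform preserves energy, which is a handy sanity check.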
Also check out the scattering transform; that seems to be the most waveletty neural-network thing I have seen recently.
I think I saw Edouard around here, but I forgot his username.
I’m not sure I understand how to reduce DWT to convolutions as you propose.
Edit: Are you proposing to basically construct O(n log n) conv filters? Since the higher-frequency wavelets are just translated and mostly zero everywhere, I suppose it's possible. Do you have an example? I'm not sure how I would stitch it all up.
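One way to stitch the levels together without O(n log n) separate filters is to recurse on the approximation band: each level needs only one small fixed filter bank, applied to the LL output of the previous level. A self-contained sketch with hard-coded Haar filters (`dwt2d` is a hypothetical helper name, not from any library):

```python
import torch
import torch.nn.functional as F

# Hard-coded Haar analysis filters: outer products of [a, a] and [a, -a], a = 1/sqrt(2).
a = 0.5 ** 0.5
lo = torch.tensor([a, a])
hi = torch.tensor([a, -a])
filt = torch.stack([lo.outer(lo), lo.outer(hi),
                    hi.outer(lo), hi.outer(hi)]).unsqueeze(1)  # (4, 1, 2, 2)

def dwt2d(x, levels):
    """Multi-level 2D DWT: recurse on the approximation (LL) band only.

    Returns the final LL plus one (LH, HL, HH) detail tensor per level,
    so each level reuses the same small filter bank.
    """
    details = []
    for _ in range(levels):
        y = F.conv2d(x, filt, stride=2)  # (N, 4, H/2, W/2)
        x = y[:, :1]                     # LL band feeds the next level
        details.append(y[:, 1:])         # keep the three detail bands
    return x, details

ll, det = dwt2d(torch.rand(1, 1, 16, 16), levels=3)
```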
The scattering transform is indeed interesting, but not what I'm immediately looking for; I will check it out though, thanks. It seems to have high potential; too bad you can't backprop through the fast implementation.
As talk is cheap, I wrote up a quick notebook on using 2D wavelet transforms in PyTorch; I hope it is useful for you.
It certainly is; you have clearly gone to a fair bit of effort.
I recently implemented a wavelet filter bank in PyTorch. Although my focus here was to write a fast temporal convolution library for wavelets, this might be of interest to you: https://github.com/tomrunia/PyTorchWavelets
@tom Great work on the notebook. I already had a repo to do a dual tree DWT in pytorch (https://en.wikipedia.org/wiki/Complex_wavelet_transform), and inspired by your code, I have now added support for the DWT and Inverse DWT in 2 dimensions. You can check it out at
https://github.com/fbcotter/pytorch_wavelets (@verified.human sorry for the similar name!)
The repo has tests checking that gradients pass nicely through the dual-tree DWT, but I have yet to write tests for the gradients of the plain DWT. I would wager they work nicely, but I do need to confirm. Perfect reconstruction works, and the wavelet coefficients match pywt.
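For gradient tests of this kind, `torch.autograd.gradcheck` compares analytical gradients against finite differences. A minimal sketch on a fixed Haar filter bank (not the repo's actual test code; gradcheck wants double precision):

```python
import torch
import torch.nn.functional as F
from torch.autograd import gradcheck

# Fixed Haar filter bank in float64, since gradcheck needs double precision.
a = 0.5 ** 0.5
lo = torch.tensor([a, a], dtype=torch.float64)
hi = torch.tensor([a, -a], dtype=torch.float64)
filt = torch.stack([lo.outer(lo), lo.outer(hi),
                    hi.outer(lo), hi.outer(hi)]).unsqueeze(1)

def dwt(x):
    # Single-level 2D DWT as a stride-2 convolution with fixed filters.
    return F.conv2d(x, filt, stride=2)

x = torch.rand(1, 1, 4, 4, dtype=torch.float64, requires_grad=True)
ok = gradcheck(dwt, (x,))  # raises if numerical and analytical gradients disagree
```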
Thanks for sharing, @fbcotter!
It’s not terribly efficient, but wouldn’t autograd’s tracing handle backpropagation until you have an explicit backward?
Thanks so much for writing this up! Do you have any quick tips for how to adapt this for 1D DWT and inverse DWT?
Hi @tom @fbcotter.
One thing that appears to differ between your two implementations is the requires_grad parameter. As far as I understand, @tom’s solution does not explicitly set requires_grad = False on the convolution layer parameters, so they would get updated during backprop (as would any kernel of a conv2d or transposed conv2d layer). In contrast, in @fbcotter’s repo the parameters are explicitly marked with requires_grad = False.
For my implementation I need a 1D wavelet-based loss, so I would require the parameters to have requires_grad = False. It seems slightly easier to adapt @tom’s version, so I wanted to make sure I understand this before going ahead and using the code.
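A 1D wavelet-based loss along these lines might look like the sketch below (hard-coded Haar filters; `wavelet_loss` is a hypothetical name, not from either repo). The filter tensor never has requires_grad set, so gradients flow only into the inputs:

```python
import torch
import torch.nn.functional as F

# Fixed 1D Haar analysis filters; requires_grad defaults to False,
# so the DWT stays fixed and is never updated by the optimizer.
a = 0.5 ** 0.5
filt = torch.tensor([[[a, a]],
                     [[a, -a]]])  # (out_channels=2, in_channels=1, kernel=2)

def wavelet_loss(pred, target):
    """L1 distance between single-level DWT coefficients of two (N, 1, T) signals."""
    return F.l1_loss(F.conv1d(pred, filt, stride=2),
                     F.conv1d(target, filt, stride=2))

pred = torch.rand(2, 1, 16, requires_grad=True)
target = torch.rand(2, 1, 16)
loss = wavelet_loss(pred, target)
loss.backward()  # gradients reach pred; filt stays untouched
```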
Thanks again for these awesome contributions!
So I finally removed all the long deprecated
My notebook makes no assumption about whether you pass in weights which require grads or not - if you do, it will happily backpropagate into them.
I did use a similar approach for audio (1d).
Thanks for the update, @tom! So essentially, to make sure the DWT stays fixed, I would simply set dec_hi.requires_grad = False (and the same for the others, of course), and then everything should work.
Perhaps I’ll write this as a PyTorch module so it can be easily integrated into a network.
Not requiring grad is the default, so no need to set things.
My personal philosophy here is to keep stateless things as functions, but having it in a module is fine, too.
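If you do go the module route, one idiomatic option (a sketch with hard-coded Haar filters, not code from either repo) is to register the fixed filters as a buffer: they then follow `.to(device)` and show up in the state dict, but never appear in `parameters()`, so the optimizer cannot touch them.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class HaarDWT2d(nn.Module):
    """Single-level 2D Haar DWT as a module. The filters live in a buffer,
    so they move with .to(device) but are invisible to the optimizer."""

    def __init__(self):
        super().__init__()
        a = 0.5 ** 0.5
        lo = torch.tensor([a, a])
        hi = torch.tensor([a, -a])
        filt = torch.stack([lo.outer(lo), lo.outer(hi),
                            hi.outer(lo), hi.outer(hi)]).unsqueeze(1)
        self.register_buffer("filt", filt)  # not an nn.Parameter

    def forward(self, x):
        # x: (N, 1, H, W) -> (N, 4, H/2, W/2)
        return F.conv2d(x, self.filt, stride=2)

m = HaarDWT2d()
y = m(torch.rand(1, 1, 8, 8))
```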