I’m wondering how to fine tune a complex structure as shown in the picture. Using Modules. () Or chiled () doesn’t work. I want to change all batch norm to group norm.
Yes, you need to refit the model. Group Norm and Batch Norm don’t normalize the same distribution, so I don’t think there is even a way of doing the bridge with a mathematical formula.