Add Gaussian noise to parameters while training

@thnguyen996 In what paper did you find this formula?

I know of something related/similar under the name “variational weight noise”, but I’m not sure where that term comes from.

I also found another related post: Backpropagating through noise