Understanding load_state_dict() effect on computational graph

Suriya_Lakshmanan · April 8, 2018, 7:48pm

I am trying to replicate the algo outlined in the MAML paper:

Pseudo code:

The idea is to compute a second order gradient. While computing the first order gradient is straightforward, computing the second order gradient as mentioned in this paper is a little bit tricky. The pseudo code should give a good understanding.