Gpu memory gets accumulated during consecutive forward passes

Hi, may I know what is your solution? I have the same issue at inference!