Can I use CPU mem to save GPU mem at sequential model?

You could check the CPU offloading for your use case.