Hi team, I am using the FSDP2 to shard my model.
What is the best practice to evaluate (like calling eval_forward for the module) during training here?
I know we can register_forward hooks for eval_forward but it might call some unused hooks.
Can I make it back to non-FSDP module and shard again after I finish the evaluation?