After applying fully_shard to a model and its submodules, is there a way to see how parameters in each FSDP unit got sharded? Perhaps something like a mapping of submodules to a rank?
Thanks in advance ![]()
After applying fully_shard to a model and its submodules, is there a way to see how parameters in each FSDP unit got sharded? Perhaps something like a mapping of submodules to a rank?
Thanks in advance ![]()