I want to set different learning rates based on parameter names, but I found that the parameter names of fsdp have changed. If I want to set different learning rates based on parameter names in fsdp, how can I achieve this?
To enable this, you can use the use_orig_params=True
flag with FSDP.