Given an arbitrary fp32 nn.Module that fits on a single GPU, is there a full enumeration of the differences between
- MixedPrecision(torch.bfloat16, torch.float32)
- torch.autocast("cuda", dtype=torch.bfloat16)
in computation?
I noticed that certain modules/methods do not run in the expected precision under FSDP MixedPrecision, so there is evidently some difference between the two.
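
For concreteness, this is a minimal sketch of the two setups I am comparing (assumes a process group has already been initialized with torch.distributed.init_process_group; `build_model` and `batch` are placeholder names for an arbitrary fp32 module and input):

```python
import torch
import torch.nn as nn
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP
from torch.distributed.fsdp import MixedPrecision


def build_model():
    # Toy fp32 model standing in for an arbitrary nn.Module.
    return nn.Sequential(
        nn.Linear(1024, 1024), nn.GELU(), nn.Linear(1024, 1024)
    ).cuda()


batch = torch.randn(8, 1024, device="cuda")

# (1) FSDP-native mixed precision: gathered parameters (and hence the
#     forward/backward compute) are cast to bf16, while gradient
#     reduction is kept in fp32 via reduce_dtype.
fsdp_mp = FSDP(
    build_model(),
    mixed_precision=MixedPrecision(
        param_dtype=torch.bfloat16,
        reduce_dtype=torch.float32,
    ),
)
out_a = fsdp_mp(batch)

# (2) autocast around the forward of an fp32 FSDP model: parameters stay
#     fp32 and each op chooses its dtype from autocast's per-op casting
#     rules (e.g. matmuls in bf16, many reductions/normalizations in fp32).
fsdp_fp32 = FSDP(build_model())
with torch.autocast("cuda", dtype=torch.bfloat16):
    out_b = fsdp_fp32(batch)
```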