How cancle below ```UserWarning```?

jaffe-fly · June 8, 2021, 4:07am

when use DistributedDataParallel

scaler = GradScaler()
for epoch in range(args.epochs):
    model.train()
    if num_distrib() > 1:
        train_loader.sampler.set_epoch(epoch)
    for i, (input, target) in enumerate(train_loader):  
        with autocast():  # mixed precision
            output = model(input)
            loss = loss_fn(output, target)  # note - loss also in fp16
        model.zero_grad()
        scaler.scale(loss).backward()
        scaler.step(optimizer)
        scaler.update()
        reduced_loss = reduce_tensor(loss, args.gpus)
        losses.update(reduced_loss.item(), input.size(0))
    scheduler.step()

scheduler.step() has after scaler.step(optimizer) why appearance UserWarning

how can i do ?

ptrblck · June 8, 2021, 4:15am

The warning is raised, because the GradScaler might use an initial scaling factor, which could be too large for the first batches and will thus reduce it as well as skip the optimizer.step(). Due to this, the scheduler.step() would be executed before the first optimizer.step(), which will raise the warning.
You could ignore or disable it, or on the other hand, check if the scaler decreased the scale factor and also skip the scheduler.step().

jaffe-fly · June 8, 2021, 6:52am

i want to known Correct front-rear order for my code ,thank you