What is loss.item()

hunar · November 16, 2019, 8:20pm

what does

running_loss
in this code ? i know it calculated the loss , and we need to get the probability .
please take a look at the comment sections
for e in range(epochs):
running_loss = 0
for images, labels in trainloader:
# this loop through 938 images and labels (length of trainloader
images = images.view(images.shape[0], -1)

    optimizer.zero_grad()
    
    output = model.forward(images)
    loss = criterion(output, labels)
    loss.backward()
    optimizer.step()
    
    running_loss += loss.item() #what this line does
else:
    print(f"Training loss: {running_loss/len(trainloader)}")

thanks

crowsonkb · November 16, 2019, 8:41pm

The item() method extracts the loss’s value as a Python float.

hunar · November 16, 2019, 8:57pm

thank you , but why we need (running_loss += loss.item()) the trainloader has different numbers !? why we mixed the loss values of different numbers ?
maybe a stupid question , sorry

ptrblck · November 16, 2019, 10:26pm

You are calculating the running loss with this line of code so that you could later e.g. calculate the mean loss of that epoch.
Let me know, if I misunderstood the question.

hunar · November 17, 2019, 9:30am

we iterate through the entire images and labels , per epoch

for images, labels in trainloader:

why we mixed the losses (losses of different images )! ?

{running_loss/len(trainloader)}

and calculated all together ?!
sorry , i’m new , and i want to make my final project with pytorch

ptrblck · November 17, 2019, 9:41am

The average of the batch losses will give you an estimate of the “epoch loss” during training.
Since you are calculating the loss anyway, you could just sum it and calculate the mean after the epoch finishes.
This training loss is used to see, how well your model performs on the training dataset.

Alternatively you could also plot the batch loss values, but this is usually not necessary and will give you a lot of outputs.

hunar · November 17, 2019, 10:38am

thanks alot , one more question , do you think google colab GPU is fit to a project of facial recognition in real time

ptrblck · November 17, 2019, 7:45pm

I’ve just used Colab in the past a few times for debugging purposes. Not sure if the runtime changed, but I don’t thing Colab is a good fit for “real time” deployment, e.g. since the notebook runtime is limited.

hunar · November 18, 2019, 2:05pm

so what is your suggestion ?

ptrblck · November 18, 2019, 3:18pm

You could use some cloud deployment service like Microsoft Azure, GCP, AWS or others.

hunar · November 18, 2019, 3:20pm

thank you for your help

Sharafath_Mohammed · June 19, 2020, 1:01pm

Hi, on the topic of deployment, can you guide me to any tutorial, documentation, etc on using Azure DevOps (not Azure MLOps) for Pytorch deployment?

ptrblck · June 20, 2020, 7:15am

I’m unfortunately not experienced in deployment with Azure DevOps.

Brando_Miranda · June 30, 2020, 9:08pm

How is that different from the .data field?

ptrblck · July 1, 2020, 9:17am

The .data attribute shouldn’t be used, as it might yield unwanted side effects.
The right way to get a Python scalar is via .item().

Brando_Miranda · July 1, 2020, 2:10pm

Do you have a link discussing why (and what) there might be “unwanted side effects” more precisely?

Thanks!

ptrblck · July 1, 2020, 5:24pm

@albanD lists some aspects in this post.

saurabh-2905 · November 21, 2020, 10:31am

What if we use .detach() when working with Pytorch lightning. It would give us the data without any computational graph. Will it be correct to use .detach() instead of .item() ?

ptrblck · November 21, 2020, 11:41am

.detach() will return a tensor, which is detached from the computation graph, while .item() will return the Python scalar. I don’t know how and where this is needed in PyTorch Lightning depending on the use case detach() might also work.

saurabh-2905 · November 21, 2020, 1:43pm

Thanks for such a quick reply. Yes, I understand this concept now. So because in Pytorch docs it is mentioned that tensors can be passed for logging, I am sure I can do the following:

self.log('val_loss', loss.detach(), prog_bar=False, on_step=False, on_epoch=True, logger=True)

However, I am little unsure if the following implementation is correct in which I replaced .item() with .detach() before the loss value is returned by the model. I am not getting any syntax error though but I am little worried if it would interfere with grad_calculation and affect the performance.

# Loss : Mask outputs to ignore non-existing objects (except with conf. loss)
            loss_x = self.mse_loss(x[obj_mask], tx[obj_mask])
            loss_y = self.mse_loss(y[obj_mask], ty[obj_mask]) ....

            self.metrics = {
                "loss": to_cpu(total_loss).detach(),
                "x": to_cpu(loss_x).detach(),
                "y": to_cpu(loss_y).detach(), ..... }

            return output, total_loss

NOTE - The reason I am trying to replace .item() is because I am training the model on multiple gpus and it was taking very long to train just one epoch. So while going through the pytorch lightning docs I came across this

Don’t call .item() anywhere in your code. Use .detach() instead to remove the connected graph calls. Lightning takes a great deal of care to be optimized for this. https://pytorch-lightning.readthedocs.io/en/stable/performance.html