TypeError: conv2d(): argument 'input' (position 1) must be Tensor, not int!

Du-Minmin · October 30, 2020, 2:03am

Could anyone help me find what I did wrong?
When I run the code below, the error in the title occurs, and I can’t solve it.
Any help or suggestions would be highly appreciated.

model = torch.load('./media/model_best.pth')
model = torch.nn.DataParallel(model['model_def'](20))
model.load_state_dict(model['state_dict'])
model.cuda()
model.eval()

ptrblck · October 30, 2020, 2:37am

Your current code snippet doesn’t show the usage of the model, but based on the error message it seems you are passing an int into the model via:

data = 1
output = model(data)

Could you check the dtype of the inputs and make sure they are not overwritten by other variables with the same name?
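To illustrate the failure mode described here, the following is a minimal sketch that reproduces it in isolation. The convolution layer, its sizes, and the loop are hypothetical stand-ins and are not taken from the original script: a tensor named `data` is accidentally rebound to an integer by a loop variable with the same name, and the layer then rejects it with a `TypeError`. The exact wording of the message differs between PyTorch versions.

```python
import torch
import torch.nn as nn

# Hypothetical stand-in layer; the real model is not shown in the thread.
conv = nn.Conv2d(in_channels=3, out_channels=8, kernel_size=3, padding=1)

# A valid input: a 4D float tensor (batch, channels, height, width).
data = torch.randn(1, 3, 32, 32)
print(type(data).__name__, tuple(conv(data).shape))

# Accidental rebinding: the loop variable reuses the name `data`,
# so after the loop `data` is a plain Python int.
for data in range(2):
    pass

error = None
try:
    conv(data)
except TypeError as exc:
    error = exc  # keep a reference, `exc` is cleared after the except block

print(type(data).__name__, "->", type(error).__name__)
```

Searching the script for every assignment to the input variable's name is usually the quickest way to rule this case out.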

Du-Minmin · October 30, 2020, 3:14am

I don’t know where to view the dtype. The model is the .pth file saved at the end of training. I don’t know why an int is passed in.

ptrblck · October 30, 2020, 3:21am

I don’t think the model is loaded incorrectly; the problem is more likely the data you are passing to the model.
You can check it via:

print(type(data))

assuming data is the input to your model.
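A slightly more detailed check can also report the dtype, shape, and device when the value is a tensor. The helper below is only a sketch; the name `describe` is made up for this example and is not part of PyTorch.

```python
import numpy as np
import torch


def describe(name, value):
    """Return a one-line summary of a value that is about to be fed to a model."""
    if isinstance(value, torch.Tensor):
        return "{}: torch.Tensor dtype={} shape={} device={}".format(
            name, value.dtype, tuple(value.shape), value.device)
    if isinstance(value, np.ndarray):
        return "{}: numpy.ndarray dtype={} shape={}".format(
            name, value.dtype, value.shape)
    # Anything else (int, list, None, ...) cannot be passed to a conv layer.
    return "{}: {} (not a tensor)".format(name, type(value).__name__)


print(describe("data", torch.zeros(1, 3, 4, 4)))
print(describe("data", np.zeros((4, 4, 3), dtype=np.uint8)))
print(describe("data", 1))
```

Calling it right before `model(...)` shows whether the value reaching the model is really a tensor.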

Du-Minmin · October 30, 2020, 9:00am

Can I send the code to your email? Could you please take a look at it?

ptrblck · October 30, 2020, 9:24am

Please post code here directly by wrapping it into three backticks ```.

Du-Minmin · October 30, 2020, 1:29pm

import os
import cv2
import time
import torch
import numpy as np
from argparse import ArgumentParser

import transforms

parser = ArgumentParser(description='e-Lab Segmentation Visualizer')
_ = parser.add_argument
_('--model_path', type=str, default='/media/', help='model to load')
_('--data_path',  type=str, default='/media/', help='image folder')
_('--mode',       type=int, default=0, help='mode 0, 1, 2')
_('--fullscreen',   action='store_true', help='Show output in full screen')

args = parser.parse_args()

# Clear screen
print('\033[0;0f\033[0J')
# Color Palette
CP_R = '\033[31m'
CP_G = '\033[32m'
CP_B = '\033[34m'
CP_Y = '\033[33m'
CP_C = '\033[0m'

# Define color scheme
color_map = np.array([
    [0, 0, 0],        # Unlabeled
    [128, 64, 128],   # Road
    [244, 35, 232],   # Sidewalk
    [70, 70, 70],     # Building
    [102, 102, 156],  # Wall
    [190, 153, 153],  # Fence
    [153, 153, 153],  # Pole
    [250, 170, 30],   # Traffic light
    [220, 220, 0],    # Traffic signal
    [107, 142, 35],   # Vegetation
    [152, 251, 152],  # Terrain
    [70, 130, 180],   # Sky
    [220, 20, 60],    # Person
    [255, 0, 0],      # Rider
    [0, 0, 142],      # Car
    [0, 0, 70],       # Truck
    [0, 60, 100],     # Bus
    [0, 80, 100],     # Train
    [0, 0, 230],      # Motorcycle
    [119, 11, 32]     # Bicycle
], dtype=np.uint8)
#device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

# Load model
model = torch.load('./media/model_best.pth')
model = torch.nn.DataParallel(model['model_def'](20))
model.load_state_dict(model['state_dict'])
model.cuda()
model.eval()

root_dir = os.path.join(args.data_path, 'stuttgart_0' + str(args.mode))
first_idx = [1, 3500, 5100]
last_idx = [599, 4599, 6299]
idx = first_idx[args.mode]
fps = 'NA'
pred_map = np.zeros((256, 512, 3), dtype=np.uint8)
pred_map = np.zeros((512, 1024, 3), dtype=np.uint8)

win_title = 'Overlayed Image'
if args.fullscreen:
    cv2.namedWindow(win_title, cv2.WND_PROP_FULLSCREEN)
    cv2.setWindowProperty(win_title, cv2.WND_PROP_FULLSCREEN, cv2.WINDOW_FULLSCREEN)

while idx <= last_idx[args.mode]:
    # Load image, resize and convert into a 'batchified' cuda tensor
    start_time = time.time()
    filename = '{}/stuttgart_0{:d}_000000_{:06d}_leftImg8bit.png'.format(root_dir, args.mode, idx)

    if os.path.isfile(filename):
        x = cv2.imread(filename)
        read_time = time.time() - start_time

        resize = transforms.Resize(0.5)
        x = resize(x)
        prep_data = transforms.Compose([
            #transforms.Crop((512, 512)),
            transforms.ToTensor(),
            transforms.Normalize([[0.485, 0.456, 0.406], [0.229, 0.224, 0.225]])
            ])
        input_image = prep_data(x)
        #input_image = torch.from_numpy(cv2.cvtColor(x, cv2.COLOR_BGR2RGB).transpose(2, 0, 1))/255
        input_image = input_image.unsqueeze(0).float().cuda()
        prep_time = time.time() - start_time - read_time

        # Get neural network output
        y = model(torch.autograd.Variable(input_image))
        y = y.squeeze()
        pred = y.data.gpu().numpy()
        model_time = time.time() - start_time - read_time - prep_time

        # Calculate prediction and colorized segmented output
        prediction = np.argmax(pred, axis=0)
        num_classes = 20
        pred_map *= 0
        for i in range(num_classes):
            pred_map[prediction == i] = color_map[i]

        pred_map_BGR = cv2.cvtColor(pred_map, cv2.COLOR_RGB2BGR)
        overlay = cv2.addWeighted(x, 0.5, pred_map_BGR, 0.5, 0)
        pred_time = time.time() - start_time - read_time - prep_time - model_time

        #cv2.imshow('Original Image', x_rescaled)
        #cv2.imshow('Segmented Output', pred_map_BGR)
        cv2.imshow(win_title, overlay)
        disp_time = time.time() - start_time - read_time - prep_time - model_time - pred_time
        fps = 1/(time.time() - start_time)

        print("{}Read: {}{:4.2f} ms | {}Norm:: {}{:4.2f} ms | {}Model: {}{:4.2f} ms | {}Predict: {}{:4.2f} ms | {}Display: {}{:4.2f} ms".format(
            CP_Y, CP_C, read_time*1000, CP_G, CP_C, prep_time*1000, CP_G, CP_C, model_time*1000,
            CP_R, CP_C, pred_time*1000, CP_B, CP_C, disp_time*1000))
    else:
        print("{}Warning{}!!! {}{} image unavailable{}.".format(CP_R, CP_C, filename, CP_R, CP_C))

    idx += 1
    if cv2.waitKey(1) == 27: # ESC to stop
        break

ptrblck · October 30, 2020, 8:53pm

The code looks generally alright, besides the usage of the deprecated Variable, which should be removed.

Nevertheless, input_image should be a tensor at that point; otherwise the previous lines of code would already raise errors.

Since the code is not executable on my side, my guess is that some integers are passed to specific layers inside the model’s forward method.
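One way to test that guess without reading every line of the forward method is to attach forward pre-hooks that check what each submodule receives. The sketch below uses a deliberately buggy toy network; `BuggyNet`, `add_input_checks`, and the layer sizes are hypothetical and are not taken from the LinkNet code. It relies only on `nn.Module.named_modules` and `register_forward_pre_hook`. Some modules legitimately accept non-tensor arguments, so the check may need relaxing for other models.

```python
import torch
import torch.nn as nn


class BuggyNet(nn.Module):
    """Toy model whose forward method passes an int to a layer by mistake."""

    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(3, 4, kernel_size=3, padding=1)
        self.conv2 = nn.Conv2d(4, 4, kernel_size=3, padding=1)

    def forward(self, x):
        x = self.conv1(x)
        channels = x.size(1)         # a plain Python int
        return self.conv2(channels)  # bug: should be self.conv2(x)


def add_input_checks(model):
    """Attach a pre-hook to every submodule that rejects non-tensor inputs."""
    def make_hook(name):
        def hook(module, inputs):
            # `inputs` is the tuple of positional arguments given to the module.
            for position, arg in enumerate(inputs):
                if not isinstance(arg, torch.Tensor):
                    raise TypeError("{} ({}) received {} at position {}".format(
                        name, type(module).__name__, type(arg).__name__, position))
        return hook

    return [module.register_forward_pre_hook(make_hook(name or "<root>"))
            for name, module in model.named_modules()]


model = BuggyNet().eval()
handles = add_input_checks(model)

report = None
try:
    with torch.no_grad():
        model(torch.randn(1, 3, 8, 8))
except TypeError as exc:
    report = str(exc)
print(report)

# Remove the hooks once the offending layer has been found.
for handle in handles:
    handle.remove()
```

The reported name is the attribute path of the layer that received the wrong argument, which points to the line in the forward method that needs fixing.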

Du-Minmin · October 31, 2020, 1:37am

Okay, thank you very much for your answers. I will try to debug it myself.

ptrblck · October 31, 2020, 2:22am

Sure. If you get stuck, feel free to post more code snippets that we can look at and, ideally, execute to isolate the issue.

Du-Minmin · November 2, 2020, 12:45pm

OK, I am using the code at https://github.com/e-lab/pytorch-linknet.

ptrblck · November 2, 2020, 7:16pm

Which part of the code are you using, i.e. is the error reproducible by creating the LinkNet model and passing some tensor inputs to it? If so, which shape should these inputs have?

Du-Minmin · November 3, 2020, 3:23am

I ran main.py; this error appeared when I ran visualize.py.

ptrblck · November 3, 2020, 9:21am

Please try to write an executable code snippet using visualize.py so that we can further debug.
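An executable snippet of this kind usually replaces the image files and the checkpoint with random data, so that anyone can run it. The following is a rough sketch under stated assumptions: the single convolution is a hypothetical placeholder for the real network, and the input shape `[1, 3, 512, 1024]` is only inferred from the `(512, 1024, 3)` prediction map in the posted script.

```python
import torch
import torch.nn as nn

# Placeholder for the real network (assumption): one conv layer that maps
# 3 input channels to 20 class score maps. Swap in the model under test.
model = nn.Conv2d(3, 20, kernel_size=3, padding=1)
model.eval()

# Random input instead of an image from disk, so no dataset is needed.
x = torch.randn(1, 3, 512, 1024)

with torch.no_grad():
    y = model(x)

# Per-pixel class index, mirroring the argmax step of the posted script.
prediction = y.squeeze(0).argmax(dim=0)

print(type(x).__name__, x.dtype, tuple(x.shape))
print(tuple(y.shape), tuple(prediction.shape))
```

If the error still appears after the placeholder is replaced with the real model, the problem lies in the model code rather than in the data loading.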

Du-Minmin · November 5, 2020, 11:05am

Thank you very much for your patient answers. I have solved this problem.