LSTM feature importance

Roaldb86 · February 17, 2020, 10:41am

I have a model trained on 16 features, seq_len of 120 and in batches of 256.

I would like to test the loss on the model on a testset, with random sampling from a normal distribution for one features at a time so I can measure how important each features is (important features would give a high rise in loss when its random sampled)

Any advice on how to manipulate one feature at a time? the shape of the input is (256, 120, 16)

h = model.init_hidden(batch_size)
    with torch.no_grad():
        for inp, labels in loader:
                       
            if(train_on_gpu):
                inp, labels = inp.cuda(), labels.cuda()
            

            outputs, h = model(inp, h)
            _, predicts = torch.max(outputs, 1)

G.M · February 18, 2020, 1:19am

Try inp[ : , : , F].normal_() wherer F is the feature u want to manipulate.

anantguptadbl · February 18, 2020, 11:18am

A sample code

for curFeature in range(X.shape[1]):     
    model1Obj.zero_grad()
    print("Size of x1 is {0}".format(X.shape))
    # Alter the feature before you calculate the loss
    mu,sigma=norm.fit(X[:,curFeature])
    Y=copy.deepcopy(X)
    Y[:,curFeature]=np.random.normal(mu, sigma, 100)
    Y=Variable(torch.from_numpy(Y))
    output=model1Obj(Y)
    loss=criterion(target,output)
    print("The loss value for feature {0} is {1}".format(curFeature,loss.item()))

Hope this helps

Roaldb86 · February 18, 2020, 3:36pm

Thanks alot

I went with the following. looping over features on the outside of the function in order to log evry feature’s performance

def accuracy_test(model, loader, feature): 

    batch_size = loader.batch_size
    mu = 0
    sigma = 0.01
    
    train_on_gpu = (True if torch.cuda.is_available() else False)
    print("Training on gpu :{}".format(train_on_gpu))
    
    if(train_on_gpu):
        model.cuda()
        
    model.eval()
    losses = []

    criterion = nn.CrossEntropyLoss()
    h = model.init_hidden(batch_size)
    
    with torch.no_grad():
        for inp, labels in loader:
            perbutated_inp = copy.deepcopy(inp)
            perbutated_inp[ : , : , feature] = torch.HalfTensor(np.random.normal(mu, sigma, inp.shape[1]))
            
            inp = perbutated_inp
            
            if(train_on_gpu):
                inp, labels = inp.cuda(), labels.cuda()
            
            inp = inp.reshape(inp.shape[1], inp.shape[0], inp.shape[2])
        
            outputs, h = model(inp.float(), h)       
            loss = criterion(outputs, labels.long())
            losses.append(loss.item())
                      
            
    test_loss = np.mean(losses)
    
    return test_loss