Problem with extracting the feature

mathwseg · July 23, 2022, 2:14pm

I need to extract features for the images before classifying them… or removing the last layer for the classification model using vit-PyTorch

github.com

lucidrains/vit-pytorch/blob/main/vit_pytorch/max_vit.py

from functools import partial

import torch
from torch import nn, einsum

from einops import rearrange, repeat
from einops.layers.torch import Rearrange, Reduce

# helpers

def exists(val):
    return val is not None

def default(val, d):
    return val if exists(val) else d

def cast_tuple(val, length = 1):
    return val if isinstance(val, tuple) else ((val,) * length)

# helper classes

This file has been truncated. show original

I tried to ignore the classification layer by
self.mlp_head = nn.Identity()

then do that code

from max_vit import MaxViT
from extractor import Extractor
model = MaxViT(
    num_classes = 0,
    dim = 192,                        
    depth = (2, 6, 14, 2)
    )          
feature= model(img)

but got the shape for each image torch.Size([1, 1536, 7, 7])
and got the file size 300MB for 1000 images ! … is there anything wrong with the code, please

mathwseg · July 24, 2022, 1:11am

can someone help please

ptrblck · July 24, 2022, 1:50am

The feature dimension looks reasonable as the mlp_head would not be applied as seen here. I haven’t checked the expected shape as the einops reductions don’t show the actual values, but I would expect to see this or a similar shape.
Could you explain what the issue is or why this shape would not be expected?

mathwseg · July 24, 2022, 2:47pm

is there any help please?

ptrblck · July 24, 2022, 7:13pm

It’s unclear to me where you are currently stuck. You didn’t follow up from my previous post but are asking for help again so I guess you are hitting a different issue?

mathwseg · July 25, 2022, 2:19am

I wrote more details about the problem but after time I feel disappointed to find a solution so I deleted it … if you can help I post it again … hope to find help … Thanks Problem with extracting the feature - #4 by mathwseg

ptrblck · July 25, 2022, 7:47pm

Your model seems to be overfitting on the training set and I don’t think that your feature extraction is necessarily wrong.
Overfitting can have different reasons, e.g. the model capacity might be too large for the given data and your model is thus able to easily learn all training samples.

mathwseg · July 26, 2022, 1:44am

doe the code i wrote for extracting the feature is right, please?

model = MaxViT(
    num_classes = 0,
    dim = 128,                         
    depth = (2, 6, 14, 2) 
    )

model.mlp_head = model.mlp_head[0]

from this class vit-pytorch/max_vit.py at main · lucidrains/vit-pytorch · GitHub

ptrblck · July 26, 2022, 4:56am

It depends which features you want to use. Initially, you’ve replaced the entire mlp_head with an nn.Identity layer, now you are using the Reduce layer. Both sound reasonable as they are applied before the final linear layer which would act as the classifier.

mathwseg · July 26, 2022, 6:14am

Appreciate your reply so both ways means the features before classification or there is something i should do it else

ptrblck · July 26, 2022, 6:17am

I think both are valid approaches to extract the features from the model and you would need to check which ones would work better for your use case.