I saved the quantized weight and loaded it with the model after torch.ao.quantization.convert(). how do I print the output of each layer of the network?

TD-wzw · December 30, 2023, 6:15am

After going through torch.ao.quantization.convert(), adding printing in the original network code seems to have no effect.

HDCharles · January 5, 2024, 7:15pm

torch.ao.quantization.convert is eager mode quantization meaning it fully swaps the module to a quantized version, if you put print into the module that gets swapped, it wont do anything once its gone.

we usually do something like:

# intermediate layer analysis with forward hooks                        
                                                                        
activation = {}                                                         
def get_activation(name):                                               
    def hook(model, input, output):                                     
        activation[name] = {'input': input, 'output': output}           
    return hook                                                         
                                                                        
def add_hooks(m):                                                       
    for k, v in m.named_modules():                                      
        print(k, v)                                                     
        v.register_forward_hook(get_activation(k))                      
                                                                        
m = nn.Sequential(nn.Sequential(nn.Conv2d(1, 1, 1)), nn.Conv2d(1, 1, 1))
add_hooks(m)                                                            
m(torch.randn(1, 1, 1, 1))                                              
print(activation)

chenster_liu · January 30, 2024, 9:59am

Absolutely forward_hook is the simplest solution. I’m curious if it’s applicable on the quantized model as well. At least I’ve tried the above code snippet but after the quantization flow apply

add_hook(model_int8)

activation doesn’t hook anything. Is there any officially recommended way to extract intermediate activations from the quantized model?

HDCharles · February 13, 2024, 2:51am

when you run the above snipped it doesn’t hook anything? Because i use this regularly to capture intermediate activations for quantized models.

chenster_liu · March 21, 2024, 2:42pm

You are right, the code works.
In my last reply it was not working for me because my q_model is scripted, I think.

HDCharles · April 2, 2024, 7:02pm

yeah hooks are module oriented so scripting which removes modules would break it