I have some question that will post here.
The first question is about what two paragraphs say
For ResNet, we directly use the features of the last layer in the first three blocks, and
put these features into three corresponding FastFlow model.
…
…
For ResNet18 and Wide-ResNet50-2, we directly use the features of the last layer in the first three blocks, put these features into the 2D flow model to obtain their respective anomaly de-
tection and localization results, and finally take the average value as the final result.
Focusing on the feature level that i need to extract, do I need to extract one or three features levels?
I understood to extract one feature which implementation looks like this: