Why are 1x1 convs used in SE layers instead of linear layers?

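A Conv2d with a 1x1 kernel is mathematically equivalent to a Linear layer applied across the channel dimension at each spatial position. In an SE layer the input has already been pooled down to one value per channel, so the two produce the same output when they share the same weights. Please see here for a code demonstration: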
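
A minimal sketch of the equivalence, assuming PyTorch. The channel counts and tensor shapes are arbitrary values picked for illustration and are not taken from any particular SE implementation. It copies the weights of a 1x1 `nn.Conv2d` into an `nn.Linear` and compares the outputs, first on an SE-style pooled input and then on a full feature map:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Arbitrary sizes, assumed here only for the demonstration.
in_channels, out_channels = 8, 4

conv = nn.Conv2d(in_channels, out_channels, kernel_size=1)
linear = nn.Linear(in_channels, out_channels)

# Share parameters: conv.weight is (out, in, 1, 1), linear.weight is (out, in).
with torch.no_grad():
    linear.weight.copy_(conv.weight.view(out_channels, in_channels))
    linear.bias.copy_(conv.bias)

# SE-style input: one value per channel after global average pooling.
x = torch.randn(2, in_channels, 1, 1)
out_conv = conv(x).flatten(1)       # shape (2, out_channels)
out_linear = linear(x.flatten(1))   # shape (2, out_channels)
se_match = torch.allclose(out_conv, out_linear, atol=1e-5)

# General feature map: the 1x1 conv applies the same linear map at every pixel.
y = torch.randn(2, in_channels, 5, 7)
out_conv_map = conv(y)              # shape (2, out_channels, 5, 7)
# Linear acts on the last dim, so move channels last and back again.
out_linear_map = linear(y.permute(0, 2, 3, 1)).permute(0, 3, 1, 2)
map_match = torch.allclose(out_conv_map, out_linear_map, atol=1e-5)

print(se_match, map_match)
```

The practical difference is only in the tensor layout each layer expects: the conv takes the pooled `(N, C, 1, 1)` tensor directly, while the linear layer needs it flattened to `(N, C)` and reshaped afterwards.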