Conv1d layer for text classification

If you want to look at another example code, I have an old implementation of the architecture shown below.

The only difference is that I don’t use 1-max pooling but normal max-pooling. This implementation uses a nn.ModuleDict for a change :).

Please note that the code is rather verbose not make model very configurable, by specifying different conv_kernel_sizes as input parameter.