Given groups=1, weight of size [6, 1, 3, 3], expected input[13, 3, 100, 100] to have 1 channels, but got 3 channels instead
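
For anyone landing here with the same message: a Conv2d weight is laid out as [out_channels, in_channels / groups, kernel_h, kernel_w], so [6, 1, 3, 3] means the layer was declared for 1 input channel, while the batch [13, 3, 100, 100] carries 3. Below is a minimal sketch that reproduces the mismatch and shows two possible ways around it. The tensor is random data with the shape from the error, and the variable names and the mean-based grayscale reduction are my own assumptions, not code from this thread.

```python
import torch
import torch.nn as nn

# Hypothetical batch with the shape from the error: 13 RGB images, 100x100.
x = torch.randn(13, 3, 100, 100)

# A layer declared with in_channels=1 has weight shape [6, 1, 3, 3].
conv_gray = nn.Conv2d(in_channels=1, out_channels=6, kernel_size=3)
try:
    conv_gray(x)  # 3-channel input into a 1-channel layer
    mismatch_raised = False
except RuntimeError:
    mismatch_raised = True

# Option 1: declare the layer for 3-channel input instead.
conv_rgb = nn.Conv2d(in_channels=3, out_channels=6, kernel_size=3)
out_rgb = conv_rgb(x)

# Option 2: keep the 1-channel layer and collapse the input to one channel.
# A plain mean over the channel axis is used here as a simple stand-in
# for a proper grayscale conversion.
x_gray = x.mean(dim=1, keepdim=True)
out_gray = conv_gray(x_gray)

print(mismatch_raised, tuple(out_rgb.shape), tuple(out_gray.shape))
```

Which option fits depends on whether the colour information matters for the task; if the images really are grayscale, converting them once in the data loading step is probably cleaner than doing it in the forward pass.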

This has been very helpful, and I am truly grateful. The link you provided for understanding the different shapes was also very informative.
[Figure: graph for 10 epochs with Adam, learning rate 0.001]