How ResNets and DenseNets speedup the training

Please Help!!
I am back again with a silly question :slight_smile:

Please help me to understand—

How ResNets and DenseNets accelerate the speed of training of the deep networks?

They have fewer parameters compared to VGG model. Also, they are just better architectures meaning they can generalize faster.