Emerging 2 different models to create a multi-model

Based on this link (Combining Trained Models in PyTorch - #66 by rudascience), outA and outB are collected after FC layer. Because of this, my first question is answered.

Following with this post (Combining Trained Models in PyTorch - #68 by rudascience), it seems like I only need to define an optimizer and push to classifer.cuda() for classifer instead for both 3 models (i.e., modelA, modelB and classifier) ? Is this true or am I missing something in the post?