When using binary classification model:
- When training a deep model, at each training step, the model receives a batch (i.e batch of size 32 samples).
- Let’s assume that in each training batch there are always 16 samples with label ‘0’ and 16 samples with label ‘1’.
- Does it matter how those samples are arranged in the batch?
- Is there a difference if the 16 labels with “0” appear first and then with “1” vs mix of all 32 samples?