Many unknown tiny GPU processes are started with DDP

I am using DDP for multi-GPU training. However, as shown in the figure below, many unknown GPU processes are started whenever I use more than one GPU.
Could anyone provide me with advice on this matter?

Are you using torch.multiprocessing or any other multiprocessing library in your code?
In the standard use case, one process is used for each device.
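
For reference, here is a minimal sketch of the one-process-per-device setup, assuming the script is launched with torchrun (which sets LOCAL_RANK for every process). The helper name `local_rank_from_env` and the `Linear` placeholder model are made up for illustration and are not taken from your code. One possible cause of small extra processes, which may or may not apply to your case, is that every rank also creates a CUDA context on GPU 0, for example by making CUDA calls before its own device is selected or by loading a checkpoint without `map_location`.

```python
import os


def local_rank_from_env(env=os.environ):
    """Return the per-node rank that torchrun exports as LOCAL_RANK (0 if unset)."""
    return int(env.get("LOCAL_RANK", "0"))


def main():
    # Imported here so the helper above stays usable without PyTorch installed.
    import torch
    import torch.distributed as dist
    from torch.nn.parallel import DistributedDataParallel as DDP

    local_rank = local_rank_from_env()

    # Bind this process to its own GPU before any other CUDA call, so it
    # does not also open a CUDA context on the default device (GPU 0).
    torch.cuda.set_device(local_rank)
    dist.init_process_group(backend="nccl")

    # Placeholder model (assumption); replace with the real one.
    model = torch.nn.Linear(10, 10).cuda(local_rank)
    ddp_model = DDP(model, device_ids=[local_rank])

    # When loading a checkpoint, map tensors to this rank's GPU, e.g.
    # torch.load(path, map_location=f"cuda:{local_rank}")

    out = ddp_model(torch.randn(4, 10, device=f"cuda:{local_rank}"))
    out.sum().backward()

    dist.destroy_process_group()


# Only start training when launched by torchrun, which sets LOCAL_RANK.
if __name__ == "__main__" and "LOCAL_RANK" in os.environ:
    main()
```

Launched with something like `torchrun --nproc_per_node=2 train.py` (the script name is hypothetical), this should start two processes, each attached to a single GPU. If nvidia-smi still lists additional processes, checking their PIDs against the training processes and any DataLoader workers would help narrow down where they come from.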