I start the training using the following command:
I can start one node first and then start another, and the job scales up as expected.
However, when I kill the process on the worker node with Ctrl+C, the node hosting the master hangs instead of starting a new rendezvous.
I want to know what happens when an elastic agent is killed, and how to make the remaining agents detect this and initiate a new round of rendezvous.
(By the way, I read streaming data from Kafka, but I don't think that is where the problem lies.)