Stream node remains in "JOINING_FAILED" status
Hi All
We have a cluster in Production where one node on the Stream tab of the "Decisioning: Services" landing page remains in JOINING_FAILED status (see attached screenshot). We traced it to this snippet in the Kafka server.log file:
...snip...
...snip...
Does anyone have an idea of what the problem is here? We know next to nothing about Kafka.
Regards,
Johan
I was instructed by Pega support in response to SR-D67690 to place the following in prconfig of the problem node:
<env name="dsm/services/stream/server_properties/broker.id" value="8" />
As far as I could gather, this is supposed to force the broker id. It wasn't sufficient, though: the node kept starting up with the wrong id. I eventually traced it to a database table that still had an entry for the node with the wrong broker id. After I deleted that entry, the Kafka node started functioning correctly again, which resolved a lot of stability issues. Unfortunately, I do not recall the name of the table and I no longer have contact with the client. I don't think it was pr_sys_statusnodes, though.
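For anyone trying the same setting, here is a rough sketch of where that entry would sit in the node's prconfig.xml. It assumes the usual <pegarules> root element, and the value 8 was specific to this node, so substitute the broker id your node is expected to use. As noted above, the setting alone was not enough in this case.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!-- Sketch only: assumes the standard <pegarules> root; other entries omitted -->
<pegarules>
  <!-- Forces the Kafka broker id for the stream service on this node.
       The value 8 is the one from this thread, not a general default. -->
  <env name="dsm/services/stream/server_properties/broker.id" value="8" />
</pegarules>
```

The node has to be restarted for a prconfig.xml change to be picked up.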